To install click the Add extension button. That's it.

The source code for the WIKI 2 extension is being checked by specialists of the Mozilla Foundation, Google, and Apple. You could also do it yourself at any point in time.

4,5
Kelly Slayton
Congratulations on this excellent venture… what a great idea!
Alexander Grigorievskiy
I use WIKI 2 every day and almost forgot how the original Wikipedia looks like.
Live Statistics
English Articles
Improved in 24 Hours
Added in 24 Hours
Languages
Recent
Show all languages
What we do. Every page goes through several hundred of perfecting techniques; in live mode. Quite the same Wikipedia. Just better.
.
Leo
Newton
Brights
Milds

Analyzed Layout and Text Object

From Wikipedia, the free encyclopedia

Analyzed Layout and Text Object (ALTO) is an open XML Schema developed by the EU-funded project called METAe.[1]

The standard was initially developed for the description of text OCR and layout information of pages for digitized material. The goal was to describe the layout and text in a form to be able to reconstruct the original appearance based on the digitized information - similar to the approach of a lossless image saving operation.

ALTO is often used in combination with Metadata Encoding and Transmission Standard (METS) for the description of the whole digitized object and creation of references across the ALTO files, e.g. reading sequence description.

The standard is hosted by the Library of Congress since 2010 and maintained by the Editorial Board initialized at the same time.

In the time from the final version of the ALTO standard in June 2004 (version 1.0) ALTO was maintained by CCS CCS Content Conversion Specialists GmbH, Hamburg up to version 1.4.

Structure

An ALTO file consists of three major sections as children of the root <alto> element:[2]

  • <Description> section contains metadata about the ALTO file itself and processing information on how the file was created.
  • <Styles> section contains the text and paragraph styles with their individual descriptions:
    • <TextStyle> has font descriptions
    • <ParagraphStyle> has paragraph descriptions, e.g. alignment information
  • <Layout> section contains the content information. It is subdivided into <Page> elements.
<?xml version="1.0"?>
<alto>
  <Description>
    <MeasurementUnit/>
    <sourceImageInformation/>
    <Processing/>
  </Description>
  <Styles>
    <TextStyle/>
    <ParagraphStyle/>
  </Styles>
  <Layout>
    <Page>
      <TopMargin/>
      <LeftMargin/>
      <RightMargin/>
      <BottomMargin/>
      <PrintSpace/>
    </Page>
  </Layout>
</alto>

Software support

See also

References

  1. ^ Stehno, Birgit; Egger, Alexander; Retti, Gregor (April 2003). "METAe—Automated Encoding of Digitized Texts". Literary and Linguistic Computing. 18 (1): 77–88. doi:10.1093/llc/18.1.77.
  2. ^ Structure of ALTO Files

External links

This page was last edited on 18 March 2024, at 06:27
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.