Page Analysis and Ground Truth Elements

From Wikipedia, the free encyclopedia

Page Analysis and Ground Truth Elements (PAGE) is an XML standard for encoding digitised documents.[1] Comparable to ALTO (XML), it allows the organisation and structure of a page and its contents to be described.

PAGE XML can be used to describe:[citation needed]

  • page content (regions, lines of text, words, glyphs, reading order, text content, ...)
  • the evaluation of the layout analysis (evaluation profiles, evaluation results, ...)
  • the dewarping of the document image (dewarping grids)
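To make the page-content case concrete, the following is a minimal sketch, not a reference implementation. It reads text lines from a small PAGE XML fragment in the declared reading order, using only the Python standard library. The sample document is hand-written for illustration and has not been validated against the official schema. The namespace URI (the 2019-07-15 version), the file name `page_0001.png`, and the helper names `parse_points` and `lines_in_reading_order` are assumptions introduced here. Real files may declare a different schema version and contain many more element types.

```python
import xml.etree.ElementTree as ET

# Assumed namespace: the 2019-07-15 version of the page-content schema.
# Files produced by other tools may declare a different version.
NS = {"pc": "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15"}

# Hand-written illustrative sample (not validated against the schema).
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="page_0001.png" imageWidth="1000" imageHeight="1500">
    <ReadingOrder>
      <OrderedGroup id="ro1">
        <RegionRefIndexed index="0" regionRef="r2"/>
        <RegionRefIndexed index="1" regionRef="r1"/>
      </OrderedGroup>
    </ReadingOrder>
    <TextRegion id="r1">
      <Coords points="100,400 900,400 900,600 100,600"/>
      <TextLine id="r1l1">
        <Coords points="100,400 900,400 900,450 100,450"/>
        <TextEquiv><Unicode>Body text line</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
    <TextRegion id="r2">
      <Coords points="100,100 900,100 900,200 100,200"/>
      <TextLine id="r2l1">
        <Coords points="100,100 900,100 900,160 100,160"/>
        <TextEquiv><Unicode>Title line</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""


def parse_points(points):
    """Turn a 'x,y x,y ...' points attribute into a list of (x, y) tuples."""
    return [tuple(int(v) for v in pair.split(",")) for pair in points.split()]


def lines_in_reading_order(xml_text):
    """Return (region id, line id, text, polygon) tuples, ordered by ReadingOrder."""
    root = ET.fromstring(xml_text)
    page = root.find("pc:Page", NS)

    # Index text regions by id so reading-order references can be resolved.
    regions = {r.get("id"): r for r in page.findall("pc:TextRegion", NS)}

    # Sort the region references by their explicit index attribute.
    refs = page.findall("pc:ReadingOrder/pc:OrderedGroup/pc:RegionRefIndexed", NS)
    refs.sort(key=lambda ref: int(ref.get("index")))

    result = []
    for ref in refs:
        region = regions[ref.get("regionRef")]
        for line in region.findall("pc:TextLine", NS):
            text = line.findtext("pc:TextEquiv/pc:Unicode", default="", namespaces=NS)
            polygon = parse_points(line.find("pc:Coords", NS).get("points"))
            result.append((region.get("id"), line.get("id"), text, polygon))
    return result


if __name__ == "__main__":
    for region_id, line_id, text, polygon in lines_in_reading_order(SAMPLE):
        print(region_id, line_id, text, polygon)
```

In this sample, region `r2` appears second in the file but first in the reading order, so its line is returned before the line of `r1`. This shows why the reading order is encoded separately from the order of the region elements.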

The format is developed by the Pattern Recognition & Image Analysis Lab (PRIMA) at the University of Salford, Greater Manchester.[citation needed]

It was designed for use with automatic segmentation and transcription techniques, namely optical character recognition (OCR) and handwritten text recognition (HTR). PAGE aims to support each step in the document image analysis pipeline, from image enhancement through layout analysis to text recognition.[citation needed]

The PAGE XML schema is notably used as an export and import format by automatic transcription software such as eScriptorium[2] and Transkribus.[3] It is also an export format used by Kraken, a turnkey OCR system optimised for documents in historical and non-Latin scripts.[4]

References

  1. ^ "PAGE-XML". July 12, 2022 – via GitHub.
  2. ^ "eScripta – Digital Tools and Techniques for the Study of Ancient Writing".
  3. ^ "How To Export Documents from Transkribus". READ-COOP.
  4. ^ Kiessling, Benjamin (April 5, 2022). "The Kraken OCR system" – via GitHub.

This page was last edited on 22 January 2024, at 07:43
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.