- Ex Libris - DigiTool multi-page entity
- Index to registered METS Profiles
- Greenstone is a suite of software for building and distributing digital library collections.
- DL Consulting Blog
- Semantic Web technologies for digital preservation: the SPAR project
- OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities. This server allows you to use the system through your web browser.
- Schema for representing OCR results exported from FineReader 8.0 SDK. Copyright 2001-2006 ABBYY, Inc.
- Schema for representing OCR results exported from FineReader 6.0. Copyright 2001-2002 ABBYY, Inc.
- hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survive editing and manipulation. hOCR markup is independent of the presentation.
- The purpose of this document is to define an open standard for representing OCR results. The goal is to reuse as much existing technology as possible, and to arrive at a representation that makes it easy to reuse OCR results.
- OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.
- TCDL features a bi-annual bulletin of themed issues with articles of interest to members of the IEEE Technical Committee on Digital Libraries (TCDL).
- Digicoord is an information platform on Swiss digitization projects.
- The DjVuLibre XML Tools provide for editing the metadata, hyperlinks and hidden text associated with DjVu files. Unlike djvused(1), the DjVuLibre XML Tools rely on the XML technology and can take advantage of XML editors and verifiers.
- With optical character recognition (OCR), you can scan the contents of a document into a single file of editable text. This article, which focuses on scanning books, describes the steps you need to take to prepare pages for optimal OCR results, and compares various free OCR tools to determine which is the best at extracting the text.
- METS Tool

