The MEX tool set (MidosaEditor for XML standards) is available at SourceForge in English and German for Windows and Mac. It was developed by the two projects ‹daofind› and ‹daofind+› with support from the Andrew W. Mellon Foundation, New York. MEX
METS Navigator is a METS-based system developed by the Indiana University Digital Library Program for displaying and navigating sets of page images or other multi-part digital objects. METS, the Metadata Encoding and Transmission Standard, is an XML standard, maintained by the Library of Congress, for managing and describing digital library objects. Using the information in the METS <structMap> elements, METS Navigator builds a hierarchical menu that allows users to navigate to specific sections of a document, such as title page, specific chapters, illustrations, etc. METS Navigator also allows simple navigation to the next, previous, first, and last page image or component part of a digital object.
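The menu-building idea described above can be illustrated with a minimal Python sketch. It is not METS Navigator's actual code: the sample document, the function names `build_menu` and `menu_from_mets`, and the choice to fall back from `LABEL` to `TYPE` are all assumptions made for illustration. Only the METS namespace and the `<structMap>`/`<div>` nesting come from the standard.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"

# Hypothetical, heavily simplified METS document (illustration only).
SAMPLE_METS = """<mets xmlns="http://www.loc.gov/METS/">
  <structMap TYPE="logical">
    <div TYPE="book" LABEL="Sample Book">
      <div TYPE="titlepage" LABEL="Title Page" ORDER="1"/>
      <div TYPE="chapter" LABEL="Chapter 1" ORDER="2">
        <div TYPE="page" LABEL="Page 1" ORDER="1"/>
        <div TYPE="page" LABEL="Page 2" ORDER="2"/>
      </div>
    </div>
  </structMap>
</mets>"""


def build_menu(div, depth=0):
    """Return (depth, label) pairs for a <div> and its descendants, in document order."""
    # Assumption: use LABEL when present, otherwise fall back to TYPE.
    label = div.get("LABEL") or div.get("TYPE") or "(untitled)"
    entries = [(depth, label)]
    for child in div.findall(f"{{{METS_NS}}}div"):
        entries.extend(build_menu(child, depth + 1))
    return entries


def menu_from_mets(xml_text):
    """Flatten the first <structMap> of a METS document into menu entries."""
    root = ET.fromstring(xml_text)
    struct_map = root.find(f"{{{METS_NS}}}structMap")
    entries = []
    for top in struct_map.findall(f"{{{METS_NS}}}div"):
        entries.extend(build_menu(top))
    return entries


if __name__ == "__main__":
    # Print the menu as an indented outline.
    for depth, label in menu_from_mets(SAMPLE_METS):
        print("  " * depth + label)
```

Because the entries come out in document order, next/previous navigation in such a sketch would amount to moving one position forward or back in the flattened list.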
The <div> TYPE attribute vocabulary is a list of terms that may be used to categorise the core structural elements of an object in a METS document conforming to the Australian METS Profile. Examples of how these values may be applied are given in the Appendix – Content Models. The content models in the current version of the document represent use cases that have been tested by the Maintenance Agency, and further content models and vocabulary terms will be added as they are developed.
Large quantities of historical newspapers are being digitized and OCR'd. We describe a framework for processing the OCR'd text to identify articles and extract metadata for them. We describe the article schema and provide examples of features that facilitate automatic indexing of the articles. For this processing, we employ lexical semantics, structural models, and community content. Furthermore, we describe visualization and summarization techniques that can be used to present the extracted events.
ALTO (Analyzed Layout and Text Object) is an XML Schema that details technical metadata for describing the layout and content of physical text resources, such as pages of a book or a newspaper. It most commonly serves as an extension schema used within the administrative metadata section of the Metadata Encoding and Transmission Standard (METS). However, an ALTO instance can also exist as a standalone document used independently of METS.
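As a rough illustration of how a standalone ALTO document carries both text and layout, the following Python sketch reads the words of each `TextLine` from the `CONTENT` attribute of its `String` children. The sample fragment is hypothetical and deliberately simplified (it omits attributes a schema-valid file would require), and the helper names `local_name` and `text_lines` are assumptions. Matching on local element names is a convenience chosen here so the sketch does not depend on a particular ALTO namespace version.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified ALTO fragment (illustration only, not schema-valid).
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout>
    <Page ID="P1" WIDTH="2000" HEIGHT="3000">
      <PrintSpace>
        <TextBlock ID="TB1">
          <TextLine>
            <String CONTENT="Historical" HPOS="100" VPOS="200" WIDTH="300" HEIGHT="40"/>
            <SP/>
            <String CONTENT="newspapers" HPOS="420" VPOS="200" WIDTH="340" HEIGHT="40"/>
          </TextLine>
          <TextLine>
            <String CONTENT="digitized" HPOS="100" VPOS="260" WIDTH="280" HEIGHT="40"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def text_lines(xml_text):
    """Return one string per <TextLine>, joining its <String> CONTENT values with spaces."""
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter():
        if local_name(elem.tag) == "TextLine":
            words = [
                child.get("CONTENT", "")
                for child in elem
                if local_name(child.tag) == "String"
            ]
            lines.append(" ".join(words))
    return lines


if __name__ == "__main__":
    for line in text_lines(SAMPLE_ALTO):
        print(line)
```

The positional attributes (`HPOS`, `VPOS`, `WIDTH`, `HEIGHT`) are ignored in this sketch; a viewer that highlights words on a page image would read them alongside `CONTENT`.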
This site serves as a repository for the NYU Digital Library Team's METS implementation development projects. At present a modest handful of XSLT-based page-turner and search implementations are freely available for use on an "as is" basis. In the pipeline are a Java-based SMIL viewer, a Java-based application and a Perl-based application to extract a METS file from a database using NYU's zeroDB schema.
textMD is an XML Schema maintained by the Library of Congress that details technical metadata for text-based digital objects. It allows for detailing properties such as encoding information (quality, platform, software, agent), character information (character set and size, byte order and size, line terminators), languages, fonts, markup information, processing and textual notes, technical requirements for printing and viewing, and page ordering and sequencing.