Das SBB Zeitungen METS-Profil - Exchange beschreibt das Datenformat für den Austausch von Metadaten für digitale Objekte digitalisierter Zeitungen zwischen der Staatsbibliothek zu Berlin und Dritten, die als Auftragnehmer diese Daten erstellen.
Brown University Library's digital collections contain a mix of public domain, copyrighted (fair-use), and licensed materials. Materials that are under copyright or license agreement are available only to members of the Brown Community. Public domain materials are available to everybody.
The <div> TYPE attribute vocabulary is a list of terms that may be used to categorise the core structural elements of an object in a METS document conforming to the Australian METS Profile. Examples of how these values may be applied are given in the Appendix – Content Models. The content model in the current version of the document represent use cases that have been tested by the Maintenance Agency, and further content models and vocabulary terms will be added as they are developed.
Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for processing the OCRd text to identify articles and extract metadata for them. We describe the article schema and provide examples of features that facilitate automatic indexing of them. For this processing, we employ lexical semantics, structural models, and community content. Furthermore, we describe visualization and summarization techniques that can be used to present the extracted events.