- Greenstone is a suite of software for building and distributing digital library collections.
- Article markedup with Bibo
- talis API for bibliographic information from uk libraries
- Semantic Web technologies for digital preservation: the SPAR project
- Data shared via the Talis Platform is available for use in a wide range of contexts. Data can be accessed via a growing set of consistent and accessible We...Data shared via the Talis Platform is available for use in a wide range of contexts. Data can be accessed via a growing set of consistent and accessible Web Services, suitable both for enriching existing applications or constructing whole new user experiences that leverage the full potential of the Platform capabilities.
- LIBRIS open data
- Online Katalogisieren
- Xsearch is a LIBRIS exclusive lightweight API. With Xsearch, it is possible to retrieve records from LIBRIS in a number of different formats. The Xsearch A...Xsearch is a LIBRIS exclusive lightweight API. With Xsearch, it is possible to retrieve records from LIBRIS in a number of different formats. The Xsearch API is based on http calls that return either XML or text, depending on the chosen format. All formats are encoded in UTF-8. The basic URL for Xsearch is http://libris.kb.se/xsearch, to which the following parameters can be added.
- The Open Library Books API provides a programmatic client-side method for querying information of books using Javascript. This API is inspired by the Goog...The Open Library Books API provides a programmatic client-side method for querying information of books using Javascript. This API is inspired by the Google Books Dynamic links API and is compatible with it.
- The OL Covers API provides a programmatic method to access the book covers and author photos available in the Open Library Covers Repository. All covers an...The OL Covers API provides a programmatic method to access the book covers and author photos available in the Open Library Covers Repository. All covers and author images are contributed by the Open Library community through an easy to use interface. Images can be uploaded at any resolution. There is a discussion on launchpad regarding possibly restricting upload size in the future.
- OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natur...OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.
- GBV Verbund-Wiki
- Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for processing the OCRd text to identify articles and extra...Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for processing the OCRd text to identify articles and extract metadata for them. We describe the article schema and provide examples of features that facilitate automatic indexing of them. For this processing, we employ lexical semantics, structural models, and community content. Furthermore, we describe visualization and summarization techniques that can be used to present the extracted events.
- Generates a METS file connecting image areas, OCRed text and ground truth documents encoded in TEI xml.
- METS / ALTO technical information


user