- This stem extension for PHP provides stemming capability for a variety of languages using Dr. M.F. Porter's Snowball API.
- The Open Library Books API provides a programmatic client-side method for querying information of books using Javascript. This API is inspired by the Goog...The Open Library Books API provides a programmatic client-side method for querying information of books using Javascript. This API is inspired by the Google Books Dynamic links API and is compatible with it.
- hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this in...hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survives editing and manipulation. hOCR markup is independent of the presentation.
- With optical character recognition (OCR), you can scan the contents of a document into a single file of editable text. This article, which focuses on scann...With optical character recognition (OCR), you can scan the contents of a document into a single file of editable text. This article, which focuses on scanning books, describes the steps you need to take to prepare pages for optimal OCR results, and compares various free OCR tools to determine which is the best at extracting the text.
- Generates a METS file connecting image areas, OCRed text and ground truth documents encoded in TEI xml.
- xml2json.xslt is a XSLT 1.0 stylesheet to transform arbitrary XML to JSON. There is also a version for javascript. The workings are demonstrated with the a...xml2json.xslt is a XSLT 1.0 stylesheet to transform arbitrary XML to JSON. There is also a version for javascript. The workings are demonstrated with the accompanied xml files. The target of this library is to create javascript-like JSON (Parker convention), not XML-like JSON (see the BadgerFish convention).
- Bisher wird kein direkter Export von MODS unterstützt. Die Metadaten aus Katalogen des GBV ließen sich aber grundsätzlich nach MODS umwandeln, beispielswei...Bisher wird kein direkter Export von MODS unterstützt. Die Metadaten aus Katalogen des GBV ließen sich aber grundsätzlich nach MODS umwandeln, beispielsweise über MARC21.
- XHTML, Web 2.0, Webservices, Open Document Format, ... XML wohin man schaut.
- Beschreibung: Es geht darum eine XML Datei in einen PHP Array zu schreiben, um den im weiteren Scipt zu verwenden. Es ist aber nötig die Namen der XML Feld...Beschreibung: Es geht darum eine XML Datei in einen PHP Array zu schreiben, um den im weiteren Scipt zu verwenden. Es ist aber nötig die Namen der XML Felder zu kennen. Wenn du diese noch nicht kennt, kann man die Testfunktion ausführen (Ungetestete Fu
- Tag-Cloud (auch Wordwolke genannt) werden immmer populärer im Internet. Hier ein einfaches PHP-Script, wo wir aus einer Mysql-DB 3 Felder auslesen und eine...Tag-Cloud (auch Wordwolke genannt) werden immmer populärer im Internet. Hier ein einfaches PHP-Script, wo wir aus einer Mysql-DB 3 Felder auslesen und eine Wordwolke erstellen.
- RSS Aggregator für Code4lib-Blogs
- "Ich war auf der Suche nach einem hReview Parser für Java und habe leider nur ein XSL File gefunden. Leider deshalb, da ich für eine XSL-Transformation ein..."Ich war auf der Suche nach einem hReview Parser für Java und habe leider nur ein XSL File gefunden. Leider deshalb, da ich für eine XSL-Transformation ein valides XML-Dokument brauche, die meisten (X)HTML Seiten aber alles andere als valide sind. Hat j
- One of the most wicked defilers of beautiful markup is inline JavaScript. This makes the markup all but impossible to read, and provides lots of little dar...One of the most wicked defilers of beautiful markup is inline JavaScript. This makes the markup all but impossible to read, and provides lots of little dark corners for bugs to hide.
- // A singleton for recognizing EZProxies and converting URLs such that databases // will work from outside them. Unfortunately, this only works with the ($...// A singleton for recognizing EZProxies and converting URLs such that databases // will work from outside them. Unfortunately, this only works with the ($495) // EZProxy software. If there are open source alternatives, we should support // them too.
- Code4Lib 2007 Lightning Talks
- This tool is designed to beautify PHP code, applying most of the PEAR standard requirements to it. It can even process really scrambled scripts, e.g. all c...This tool is designed to beautify PHP code, applying most of the PEAR standard requirements to it. It can even process really scrambled scripts, e.g. all code in one line, and thus may help you to get scripts into a more readable form.
- Servervariablen vordefiniert
- This page supplements a paper that describes a method to convert thesauri to RDF and OWL.


user