The project aims to put some Project Gutenberg ebooks on GitHub so that people can fix problems in the files, using GitHub to open up the PG corpus to maintenance and use by libraries and librarians. The result will include MARC records, covers, OPDS feeds, and ebook files to facilitate library use. A version-controlled fork-and-merge workflow, combined with a change-triggered back-end build environment, will allow scalable, distributed maintenance of the greatest works of our literary heritage. 43,000 books and their metadata have been moved into the Git version control system.
The English Short Title Catalogue (ESTC) lists over 460,000 items published between 1473 and 1800, mainly but not exclusively in English, and mainly in the British Isles and North America, drawn from the collections of the British Library and over 2,000 other libraries.
Deutscher Wortschatz contains data generated from newspapers and web resources that are publicly available. The data were collected per language and encompass statistics about co-occurrences of words in randomly selected sentences.
The UK Reading Experience Database (UK RED) is an open access database and research project housed in the English Department of the Open University. It is the largest resource of its kind recording the experiences of readers. UK RED has amassed over 30,000 records of reading experiences of British subjects, both at home and abroad, and of visitors to the British Isles, between 1450 and 1945. These include both famous and anonymous readers. It is both an open access resource and open to unsolicited public contributions.
The Open Utopia is a complete edition of Thomas More’s Utopia that honors the primary precept of Utopia itself: that all property is common property. But Utopia is more than the story of a far-off land with no private property. It’s a text that instructs us how to approach texts, be they literary or political, in an open manner: open to criticism, open to participation, and open to re-creation.
Handwritten annotations in books are an important key to understanding how historical readers used their books. ABO aims to bring these books together. It is a digital library that reveals the variety of traces that readers left in their books. These examples were previously dispersed across many different libraries around the world. It is also a digital laboratory where visitors can work together: ABO has tools to enrich the early modern annotations with transcriptions and translations. ABO seeks to encourage collaboration.
X. Zhang and Y. LeCun. (2015). arXiv:1502.01710. Comment: This technical report is superseded by a paper entitled "Character-level Convolutional Networks for Text Classification", arXiv:1509.01626. It has considerably more experimental results and a rewritten introduction.
H. Seidel. Informatik 2009: Im Focus das Leben, Beiträge der 39. Jahrestagung der Gesellschaft für Informatik e.V. (GI), volume P-154 of GI Edition: Lecture Notes in Informatics (LNI) -- Proceedings, page 5--14. Bonn, Gesellschaft für Informatik, (2009)
S. Mpouli and J. Ganascia. Proceedings of the Workshop on Resources and Methods for Semantic Processing of Digital Works/Texts, 126, page 21--24. Linköping University Electronic Press, Linköpings universitet, (July 2016)