The overall price of your project is determined using our price matrix. This involves three characteristics: typeface, legibility, and condition. A text that uses a standard modern or equivalent typeface is easier to digitize than a text that uses an obscure or difficult to decipher typeface or handwriting. Likewise, a text that is clear and uses a minimal number of character sets, or a text on pages that are not marred by physical damage such as smudges, tears, or unusual textual features, will be easier to digitize than a smudged text on worn pages. Learn more about the types of documents that can be submitted.
The New Zealand Electronic Text Centre collections provide open access to significant New Zealand and Pacific Island texts and materials.
This encompasses both digitised heritage material and born-digital resources. The collections contain over 2,600 texts (around 65,000 pages) which are made available in several formats and, where possible, under a Creative Commons license.
DocumentCloud is a catalog of primary source documents and a tool for annotating, organizing and publishing them on the web. Documents are contributed by journalists, researchers and archivists. We're helping reporters get more out of documents and helping newsrooms make their online presence more engaging.
Tesla (an acronym for Text Engineering Software Laboratory), is a Java-based open-source framework for computational linguistics, developed by the department of Computational Linguistics at the University of Cologne, Germany.
XCONCUR is an experimental markup language with the major goal to provide a convenient method to express concurrent hierarchies in an XML-like fashion. XCONCUR-CL is a validation component in XCONCUR and allows cross-layer validation.
CATMA integrates three functional, interactive modules: a tagger, a query-builder and an analyzer. The analyzer module contains most of the text analytical functions known to users of TACT
christiane rösinger; n.b. comments: gaga ist böse, weil mainstream/schein/nichts dahinter (erinnert an filme, wo leute in den mittleren 80ern tiefgefroren wurden und jetzt durch ein missgeschick aufgetaut wurden)
P. Moreira, Y. Bizzoni, K. Nielbo, I. Lassen, и M. Thomsen. Proceedings of the The 5th Workshop on Narrative Understanding, стр. 25--35. Toronto, Canada, Association for Computational Linguistics, (июля 2023)
Z. Yang, D. Yang, C. Dyer, X. He, A. Smola, и E. Hovy. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, стр. 1480--1489. San Diego, California, Association for Computational Linguistics, (июня 2016)
A. Nenkova, и R. Passonneau. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004, стр. 145--152. Boston, Massachusetts, USA, Association for Computational Linguistics, (2004)
A. Blom, F. Carlsson, и E. Wihlborg. Proceedings of the 55th Hawaii International Conference on System Sciences | 2022, стр. 2563-2572. Honolulu, (2022)(Eurobarometer).
O. Decker, A. Yendell, A. Heller, и E. Brähler. Autoritäre Dynamiken in unsicheren Zeiten. Neue Herausforderungen - alte Reaktionen? / Leipziger Autoritarismus Studie 2022, Psychosozial-Verlag, Gießen, (ALLBUS).(2022)
T. Piske, и A. Steinlen. Cognition and Second Language Acquisition: Studies on pre-school, primary school and secondary school children, том 4 из Multilingualism and Language Teaching, Narr Francke Attempto Verlag, Tübingen, (Mikrozensus).(2022)
K. Guhlemann, и C. Best. Arbeit und Altern: Eine Bilanz nach 20 Jahren Forschung und Praxis, Nomos Verlagsgesellschaft, Baden-Baden, (Mikrozensus).(2021)