[accès réservé] Corpus de textes français, du XVIe au XXe s., plus de 3700 œuvres dont 80% de textes littéraires et 20% de textes techniques. Pas d’accès au texte intégral des œuvres
A corpus of Scottish texts from 1945 to the present day ..... currently contains over 1.100 written and spoken texts, totalling over 4 million words of running text. 80% of this total is made up of written texts and 20% is made up of spoken texts, which are provided in the form of an orthographic transcription, synchronised with the source audio or video
An electronic corpus of Old South Slavic prose, with added morphological annotation, word-for-word glosses, and an English translation ..... texts of five manuscripts and nine inscriptions
« The American National Corpus (ANC) project is creating a massive electronic collection of American English, including texts of all genres and transcripts of spoken data produced from 1990 onward. The ANC will provide the most comprehensive picture of American English ever created, and will serve as a resource for education, linguistic and lexicographic research, and technology development. »
La primera concreció del DCA i el seu primer estadi d'elaboració és el Diccionari de Textos Catalans Antics (DTCA) consultable en aquest web, un diccionari de forma-lema que posa a l'abast d'investigadors i estudiosos un cabal d'informació excepcional, ja que tots els textos que s'hi han introduït han estat lematitzats i se'n proporciona, per tant, la informació, no únicament per formes ocasionals, sinó també per lemes