La primera concreció del DCA i el seu primer estadi d'elaboració és el Diccionari de Textos Catalans Antics (DTCA) consultable en aquest web, un diccionari de forma-lema que posa a l'abast d'investigadors i estudiosos un cabal d'informació excepcional, ja que tots els textos que s'hi han introduït han estat lematitzats i se'n proporciona, per tant, la informació, no únicament per formes ocasionals, sinó també per lemes
« The corpus contains more than 360 million words of text, including 20 million words each year from 1990-2007, and it is equally divided among spoken, fiction, popular magazines, newspapers, and academic texts. The corpus will also be updated at least twice each year from this point on, and will therefore serve as a unique record of linguistic changes in American English. The interface allows you to search for exact words or phrases, wildcards, lemmas, part of speech, or any combinations of these. You can search for surrounding words (collocates) within a ten-word window (e.g. all nouns somewhere near chain, all adjectives near woman, or all verbs near key). »
« The American National Corpus (ANC) project is creating a massive electronic collection of American English, including texts of all genres and transcripts of spoken data produced from 1990 onward. The ANC will provide the most comprehensive picture of American English ever created, and will serve as a resource for education, linguistic and lexicographic research, and technology development. »
An electronic corpus of Old South Slavic prose, with added morphological annotation, word-for-word glosses, and an English translation ..... texts of five manuscripts and nine inscriptions
[accès réservé] Corpus de textes français, du XVIe au XXe s., plus de 3700 œuvres dont 80% de textes littéraires et 20% de textes techniques. Pas d’accès au texte intégral des œuvres