jaj > corpus | BibSonomy

Lesezeichen (verstecken)38
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Diachronic Electronic Corpus of Tyneside English
a corpus of dialect speech from the Tyneside area of North-East England. DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE) created between 2001 and 2005 (http://research.ncl.ac.uk/necte), and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades. The present website is designed for research use. DECTE also, however, includes an interactive website, The Talk of the Toon, which integrates topics and narratives of regional cultural significance in the corpus with relevant still and moving images, and which is designed primarily for use in schools and museums and by the general public.
vor 10 Jahren von @jaj
alle anzeigen
corpus
corpus
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
2Google Books: American English (155 billion words)
the Google Books corpus of American English, 155 billion words in size. limited to what you can do via the website at Brigham Young University. The easy thing to do is type in a word or phrase and see its frequency by decade, going back to the 1810s. The interface allows you to look for collocates (words that go with other words), view charts showing relative word frequency in the corpus by decade, handles parts of speech, and gives you various limits and display options. Other kinds of analysis that might be done with text corpora can’t be done through the interface.
vor 12 Jahren von @jaj
alle anzeigen
reference
corpus
tools
corpora
referencecorpustoolscorpora
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1WebBase Project
The Stanford WebBase project has been collecting topic focused snapshots of Web sites. All the resulting archives are available to the public via fast download streams. For example, we collected pages from 350 sites every day for several weeks after the Katrina hurricane disaster. We also collect pages from government Web sites on a regular basis.
vor 12 Jahren von @jaj
alle anzeigen
govdocs
web
harvest
corpus
archive
datasets
dlib
govdocswebharvestcorpusarchivedatasetsdlib
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1JSTOR Data For Research
DFR is a set of web tools for selecting and exploring data sets constructed from content in the JSTOR archive.
vor 12 Jahren von @jaj
alle anzeigen
corpus
datamining
corpusdatamining
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Web as Corpus
English-language corpora compiled from the Web in 2006 and 2007, and more
vor 12 Jahren von @jaj
alle anzeigen
corpus
concordances
corpusconcordances
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1WebAsCorpus.org - find Web Concordances
search the web for words, phrases. get results with hits marked. download all pages for further research.
vor 12 Jahren von @jaj
alle anzeigen
searchengine
linguistics
corpus
textmining
research
searchenginelinguisticscorpustextminingresearch
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1UCI Knowledge Discovery in Databases (KDD) Archive
Online repository of large data sets for researchers in knowledge discovery and data mining. includes Discrete Sequence Data, Image Data, Multivariate Data, Relational Data, Spatio-Temporal Data, Text (corpora), Time Series, Web Data (web pages and log files).
vor 12 Jahren von @jaj
alle anzeigen
data_archive
big_data
corpus
datamining
datasets
data_archivebig_datacorpusdataminingdatasets
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
4LDC - Linguistic Data Consortium
supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards. LDC's Catalog contains hundreds of corpora of language data including Santa Barbara Corpus of Spoken American
vor 12 Jahren von @jaj
alle anzeigen
corpus
corpus
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags

⟨⟨
⟨
2
3
4
⟩
⟩⟩

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Keine Treffer.

⟨⟨
⟨
⟩
⟩⟩

BibSonomy

Lesezeichen (verstecken)38
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Diachronic Electronic Corpus of Tyneside English

2Google Books: American English (155 billion words)

1WebBase Project

1JSTOR Data For Research

1Web as Corpus

1WebAsCorpus.org - find Web Concordances

1UCI Knowledge Discovery in Databases (KDD) Archive

4LDC - Linguistic Data Consortium

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Stöbern

Verwandte Tags

Konzepte

Tags

Lesezeichen (verstecken)38 Anzeigeallesnur LesezeichenLesezeichen pro Seite5102050100 sortieren nachhinzugefügt amTitel RSSBibTeXXML

Publikationen (verstecken) Anzeigeallesnur PublikationenPublikationen pro Seite5102050100 sortieren nachhinzugefügt amTitelAutorErscheinungsdatumEintragstypHilfe für erweiterte Sortierung... RSSBibTeXRDFmehr...

Stöbern

Verwandte Tags

Tags

Lesezeichen (verstecken)38
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...