jaj > corpora | BibSonomy

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1early modern digital collections
Wynken de Worde. A list of digital collections of early printed books with open-access reuse policies
8 years ago by @jaj
show all tags
corpora
etexts
open_access
corporaetextsopen_access
(0)
copydelete
- community post
- history of this post
4Leipzig Corpora Collection (LCC) - the Datahub
Deutscher Wortschatz contains data generated from newspapers and web resources that are publicly available. The data were collected per language and encompass statistics about co-occurrences of words in randomly selected sentences.
10 years ago by @jaj
show all tags
corpora
corpora
(0)
copydelete
- community post
- history of this post
5Penn Treebank Project
The Penn Treebank Project annotates naturally-occuring text for linguistic structure. Most notably, we produce skeletal parses showing rough syntactic and semantic information -- a bank of linguistic trees. We also annotate text with part-of-speech tags, and for the Switchboard corpus of telephone conversations, dysfluency annotation. We are located in the LINC Laboratory of the Computer and Information Science Department at the University of Pennsylvania. All data produced by the Treebank is released through the Linguistic Data Consortium.
12 years ago by @jaj
show all tags
computational_research
corpora
linguistics
tools
computational_researchcorporalinguisticstools
(0)
copydelete
- community post
- history of this post
2Google Books: American English (155 billion words)
the Google Books corpus of American English, 155 billion words in size. limited to what you can do via the website at Brigham Young University. The easy thing to do is type in a word or phrase and see its frequency by decade, going back to the 1810s. The interface allows you to look for collocates (words that go with other words), view charts showing relative word frequency in the corpus by decade, handles parts of speech, and gives you various limits and display options. Other kinds of analysis that might be done with text corpora can’t be done through the interface.
12 years ago by @jaj
show all tags
corpora
corpus
reference
tools
corporacorpusreferencetools
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

No matching posts.

⟨⟨
⟨
⟩
⟩⟩

BibSonomy

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1early modern digital collections

4Leipzig Corpora Collection (LCC) - the Datahub

5Penn Treebank Project

2Google Books: American English (155 billion words)

publications (hide)
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

browse

related tags

concepts

tags

BibSonomy

bookmarks (hide)4 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

1early modern digital collections

4Leipzig Corpora Collection (LCC) - the Datahub

5Penn Treebank Project

2Google Books: American English (155 billion words)

publications (hide) displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

concepts

tags

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...