dbenz > dataset | BibSonomy

bookmarks (hide)21
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

2WebBase Project
http://dbpubs.stanford.edu:8091/~testbed/doc2/WebBase/
12 years ago by @dbenz
show all tags
dataset
focussed
topic
web
webBase
datasetfocussedtopicwebwebBase
(0)
copydelete
- community post
- history of this post
1Microsoft Research - Speller Challenge Datasets
Microsoft Research Speller Challenge
14 years ago by @dbenz
show all tags
challenge
dataset
search_engine
speller_challenge
spelling
challengedatasetsearch_enginespeller_challengespelling
(0)
copydelete
- community post
- history of this post
1Longman Dictionaries - Dictionaries for Research
Pearson Longman English Language Teaching (Pearson Longman ELT) is a leading educational publisher of quality resources for all ages and abilities across the curriculum, providing solutions for teachers and students.
14 years ago by @dbenz
show all tags
dataset
dictionary
disambiguation
ldoce
datasetdictionarydisambiguationldoce
(0)
copydelete
- community post
- history of this post
6Mining of Massive Datasets
http://i.stanford.edu/~ullman/mmds.html
14 years ago by @dbenz
show all tags
data
data_mining
dataset
massive
datadata_miningdatasetmassive
(0)
copydelete
- community post
- history of this post
1Summary - Scientext
Scientext is a new, on-line French and English corpus of scientific texts. The corpus includes 4.8 million running tokens in French, 13 million words of research articles in English (medicine and biology), and an English-language sub-corpus of French undergraduate students’ texts (1,1 million words). The corpus is organized to facilitate the linguistic study of authorial position and reasoning in scientific articles through phraseology and lexico-grammatical markers linked to causality.
14 years ago by @dbenz
show all tags
dataset
english
french
science
scientext
texts
datasetenglishfrenchsciencescientexttexts
(0)
copydelete
- community post
- history of this post
1Call for Participation | Second Pascal Challenge on Large Scale Hierarchical Text classification
Following a successful first edition, we are pleased to announce the 2nd edition of the Large Scale Hierarchical Text Classification (LSHTC) Pascal Challenge. The LSHTC Challenge is a hierarchical text classification competition, using large datasets. This year’s challenge will increase the scale and the difficulty of the task, using data from Wikipedia (www.wikipedia.org), in addition to the ODP Web directory data (www.dmoz.org).
14 years ago by @dbenz
show all tags
2011
challenge
dataset
dmoz
text_classification
wikipedia
workshop
2011challengedatasetdmoztext_classificationwikipediaworkshop
(0)
copydelete
- community post
- history of this post
4The ClueWeb09 Dataset
http://boston.lti.cs.cmu.edu/Data/clueweb09/
14 years ago by @dbenz
show all tags
clueweb
dataset
research
web
cluewebdatasetresearchweb
(0)
copydelete
- community post
- history of this post
2Stack Overflow Creative Commons Data Dump - Blog – Stack Overflow
http://blog.stackoverflow.com/2009/06/stack-overflow-creative-commons-data-dump/
14 years ago by @dbenz
show all tags
data
dataset
stackoverflow
datadatasetstackoverflow
(0)
copydelete
- community post
- history of this post
1Spam dataset
http://plg.uwaterloo.ca/~gvcormac/treccorpus07/
14 years ago by @dbenz
show all tags
dataset
spam
datasetspam
(0)
copydelete
- community post
- history of this post
3Billion Triple Challenge 2010 Dataset
http://km.aifb.kit.edu/projects/btc-2010/
14 years ago by @dbenz
show all tags
billion_triple
data
dataset
semantic
semantic_web
billion_tripledatadatasetsemanticsemantic_web
(0)
copydelete
- community post
- history of this post
2Social Network Data
http://www.angela-bohn.de/data.html
14 years ago by @dbenz
show all tags
data
dataset
sna
social_network
datadatasetsnasocial_network
(0)
copydelete
- community post
- history of this post
8What is Twitter, a Social Network or a News Media? - WWW'10
http://an.kaist.ac.kr/traces/WWW2010.html
14 years ago by @dbenz
show all tags
dataset
twitter
www
www2010
datasettwitterwwwwww2010
(0)
copydelete
- community post
- history of this post
8Infochimps Data Marketplace / Commons: Download Sell or Share Databases, statistics, data sets for free
Find and download data in any format, from financial to social networking to GIS data. Or sell data in our data marketplace, at a price you set. We have large data sets, spreadsheets, and databases packed with statistics.
14 years ago by @dbenz
show all tags
data
dataset
datasets
download
search
datadatasetdatasetsdownloadsearch
(0)
copydelete
- community post
- history of this post
1Twitter data sets for download - Infochimps
http://infochimps.org/tags/twitter
14 years ago by @dbenz
show all tags
dataset
download
twitter
datasetdownloadtwitter
(0)
copydelete
- community post
- history of this post
4Extracting Text from Wikipedia
http://evanjones.ca/software/wikipedia2text.html
15 years ago by @dbenz
show all tags
data
dataset
plain_text
python
text
tool
wiki
wikipedia
datadatasetplain_textpythontexttoolwikiwikipedia
(0)
copydelete
- community post
- history of this post
1Research
http://www.p2p.tu-darmstadt.de/research/
15 years ago by @dbenz
show all tags
dataset
social_networks
socialnetwork
datasetsocial_networkssocialnetwork
(0)
copydelete
- community post
- history of this post
3Online Social Networks Research @MPI-SWS
http://socialnetworks.mpi-sws.org/
15 years ago by @dbenz
show all tags
dataset
download
misvlove
social_network
datasetdownloadmisvlovesocial_network
(0)
copydelete
- community post
- history of this post
1Download Wikipedia Category Taxonomy
http://www.eml-research.de/english/research/nlp/download/wikitaxonomy.php
15 years ago by @dbenz
show all tags
categories
category_hierarchy
dataset
download
hierarchy
ontology
taxonomy
wikipedia
categoriescategory_hierarchydatasetdownloadhierarchyontologytaxonomywikipedia
(0)
copydelete
- community post
- history of this post
1[twitter-dev] Re: Tweet Corpus creation for NLP research
http://www.mail-archive.com/twitter-development-talk@googlegroups.com/msg05715.html
15 years ago by @dbenz
show all tags
dataset
twitter
datasettwitter
(0)
copydelete
- community post
- history of this post
7Twapper Keeper - Archive Tweets
Allows you to archive and organize your tweets based upon hash tags.
15 years ago by @dbenz
show all tags
dataset
twapper
twapper_keeper
twitter
datasettwappertwapper_keepertwitter
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
⟩
⟩⟩

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1Reorganizing clouds: A study on tag clustering and evaluation
A. Garcia-Plaza, A. Zubiaga, V. Fresno, and R. Martinez. Expert Systems with Applications, 39 (10): 9483 - 9493 (2012)
13 years ago by @dbenz
show all tags
clustering
dataset
evaluation
reference
tag
clusteringdatasetevaluationreferencetag
(0)
copydeleteadd this publication to your clipboard
3Web Text Corpus for Natural Language Processing.
V. Liu, and J. Curran. EACL, The Association for Computer Linguistics, (2006)
14 years ago by @dbenz
show all tags
corpus
dataset
web
synonym_detection
nlp
corpusdatasetwebsynonym_detectionnlp
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

bookmarks (hide)21
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

2WebBase Project

1Microsoft Research - Speller Challenge Datasets

1Longman Dictionaries - Dictionaries for Research

6Mining of Massive Datasets

1Summary - Scientext

1Call for Participation | Second Pascal Challenge on Large Scale Hierarchical Text classification

4The ClueWeb09 Dataset

2Stack Overflow Creative Commons Data Dump - Blog – Stack Overflow

1Spam dataset

3Billion Triple Challenge 2010 Dataset

2Social Network Data

8What is Twitter, a Social Network or a News Media? - WWW'10

8Infochimps Data Marketplace / Commons: Download Sell or Share Databases, statistics, data sets for free

1Twitter data sets for download - Infochimps

4Extracting Text from Wikipedia

1Research

3Online Social Networks Research @MPI-SWS

1Download Wikipedia Category Taxonomy

1[twitter-dev] Re: Tweet Corpus creation for NLP research

7Twapper Keeper - Archive Tweets

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1Reorganizing clouds: A study on tag clustering and evaluation

3Web Text Corpus for Natural Language Processing.

browse

related tags

concepts

tags

bookmarks (hide)21 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)2 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

tags

bookmarks (hide)21
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...