tag :: dataset | BibSonomy

bookmarks (hide)740
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

3Martin Hepp
http://www.heppnetz.de/eclassowl/
18 years ago by @hotho
show all tags
dataset
ontology
datasetontology
copydelete
- community post
- history of this post
4AOL search data mirrors
This collection consists of ~20M web queries collected from ~650k users over three months. The data is sorted by anonymous user ID and sequentially arranged.
18 years ago by @hotho
show all tags
dataset
search
datasetsearch
copydelete
- community post
- history of this post
1Obtaining corpora and text collections for biomedical natural language processing
http://compbio.uchsc.edu/corpora/obtaining.shtml
19 years ago by @hotho
show all tags
nlp
dataset
bio
nlpdatasetbio
copydelete
- community post
- history of this post
1Trec Spam Corpus
http://plg.uwaterloo.ca/~gvcormac/treccorpus/
18 years ago by @hotho
show all tags
set
dataset
corpus
data
trec
spam
setdatasetcorpusdatatrecspam
copydelete
- community post
- history of this post
3Omega Ontology: Home
http://omega.isi.edu/
18 years ago by @hotho
show all tags
nlp
dataset
ontology
omega
nlpdatasetontologyomega
copydelete
- community post
- history of this post
2Benchmark Data Sets used in [RaeOnoMue01] and [MikRaeWesSchMue99]
http://ida.first.fraunhofer.de/projects/bench/benchmarks.htm
18 years ago by @sb3000
show all tags
dataset
datamining
datasetdatamining
copydelete
- community post
- history of this post
1Datasets
http://www.niaad.liacc.up.pt/old/statlog/datasets.html
18 years ago by @hotho
show all tags
dataset
statlog
dm
ml
datasetstatlogdmml
copydelete
- community post
- history of this post
3CLUTO - Family of Data Clustering Software Tools | Karypis Lab
http://glaros.dtc.umn.edu/gkhome/views/cluto
18 years ago by @hotho
show all tags
dataset
clustering
dm
tools
ml
datasetclusteringdmtoolsml
copydelete
- community post
- history of this post
1ACM SIGKDD: Special Issue on Learning from Inbalanced Datasets
http://www.acm.org/sigs/sigkdd/explorations/issue.php?volume=6&issue=1&year=2004&month=06
18 years ago by @hotho
show all tags
inbalanced
dataset
data
svm
learning
inbalanceddatasetdatasvmlearning
copydelete
- community post
- history of this post
3All Our N-gram are Belong to You |:| Google Research Blog
Here at Google Research we have been using word n-gram models for a variety of R&D projects, such as statistical machine translation, speech recognition, spelling correction, entity detection, information extraction, and others. While such models have usu
18 years ago by @avivamagnolia
show all tags
linguistics
models
dataset
n-gram
machine+translation
linguisticsmodelsdatasetn-grammachine+translation
copydelete
- community post
- history of this post
8PyTables - Hierarchical Datasets in Python
http://www.pytables.org/moin
18 years ago by @andreab
show all tags
library
dataset
python
hierarchical
hdf5
tagora
statistics
programming
modules
librarydatasetpythonhierarchicalhdf5tagorastatisticsprogrammingmodules
copydelete
- community post
- history of this post
3eclassOWL
http://www.heppnetz.de/eclassowl/
17 years ago by @schmitz
show all tags
owl
rdf
dataset
ontology
owlrdfdatasetontology
copydelete
- community post
- history of this post
11Enron Email Dataset
http://www.cs.cmu.edu/~enron/
17 years ago by @hotho
show all tags
KI2007WebMining
dataset
enron
email
KI2007WebMiningdatasetenronemail
copydelete
- community post
- history of this post
3LETOR: Benchmark Datasets for Learning to Rank
http://research.microsoft.com/users/tyliu/LETOR/
17 years ago by @hotho
show all tags
dataset
learning
ranking
microsoft
benchmark
datasetlearningrankingmicrosoftbenchmark
copydelete
- community post
- history of this post
1IATE - IATE : EU Terminologiedatenbank
http://iate.europa.eu/iatediff/SearchByQuery.do
17 years ago by @seb
show all tags
nlp
dataset
nlpdataset
copydelete
- community post
- history of this post
1Geoffrey Sampson: Downloadable Resources
http://www.grsampson.net/Resources.html
16 years ago by @hotho
show all tags
nlp
dataset
corpus
tm
lecture
nlpdatasetcorpustmlecture
copydelete
- community post
- history of this post
3Show Us a Better Way: What public data is already available?
http://www.showusabetterway.co.uk/call/data.html
16 years ago by @hotho
show all tags
dataset
public
data
datasetpublicdata
copydelete
- community post
- history of this post
5Trust network datasets - TrustLet
http://www.trustlet.org/wiki/Trust_network_datasets
16 years ago by @beate
show all tags
dataset
web2.0
social-networks
trust
datasetweb2.0social-networkstrust
copydelete
- community post
- history of this post
3Some code and datasets
http://www.kyb.mpg.de/bs/people/pgehler/code/index.html
16 years ago by @hotho
show all tags
matlab
code
dataset
clustering
plsa
matlabcodedatasetclusteringplsa
copydelete
- community post
- history of this post
4ICWSM 2009 - International AAAI Conference on Weblogs and Social Media
http://www.icwsm.org/2009/data/
16 years ago by @hotho
show all tags
2009
web
dataset
conference
social
data
challenge
blog
2009webdatasetconferencesocialdatachallengeblog
copydelete
- community post
- history of this post
3Yahoo datasets
http://www.stanford.edu/class/cs345a/YahooData.pdf
15 years ago by @hotho
show all tags
dataset
yahoo
datasetyahoo
copydelete
- community post
- history of this post
1Sunbelt Viszards Session 2009
https://www.kde.cs.uni-kassel.de/ws/Viszards09/
15 years ago by @beate
show all tags
dataset
sunbelt
datasetsunbelt
copydelete
- community post
- history of this post
1Social Spam Detection Benjamin Markines Ciro Cattuto Filippo Menczer
Social Spam Detection
15 years ago by @hotho
show all tags
dataset
detection
spam
bibsonomy
classification
datasetdetectionspambibsonomyclassification
copydelete
- community post
- history of this post
1NRRC Publications
http://nrrc.mitre.org/NRRC/publications.htm
15 years ago by @mgrani
show all tags
annotation
dataset
timeml
time
opinion
annotationdatasettimemltimeopinion
copydelete
- community post
- history of this post
1Delicious dataset
http://dai-labor.de/index.php?id=1726&L=1
15 years ago by @rwdai
show all tags
dataset
folksonomy
delicious
download
research
datasetfolksonomydeliciousdownloadresearch
copydelete
- community post
- history of this post
1Datasets: Software - Statistical Consulting Center - UMass Amherst
http://www.umass.edu/statdata/statdata/
15 years ago by @vivion
show all tags
dataset
data
statistics
datasetdatastatistics
copydelete
- community post
- history of this post
1R: Data Sets from Montgomery, Peck and Vining's Book
Data Sets from Montgomery, Peck and Vining's Book
15 years ago by @vivion
show all tags
dataset
data
statistics
datasetdatastatistics
copydelete
- community post
- history of this post
1last.fm Dataset
http://www.dcs.gla.ac.uk/~konstas/lastfm/lastfm_dataset.htm
15 years ago by @folke
show all tags
dataset
friends
last.fm
datasetfriendslast.fm
copydelete
- community post
- history of this post
2Massive Scrape of Twitter’s Friend Graph | blog.infochimps.org
http://blog.infochimps.org/2008/12/29/massive-scrape-of-twitters-friend-graph/
15 years ago by @folke
show all tags
dataset
friend
graph
datasetfriendgraph
copydelete
- community post
- history of this post
1How Tweet It Is!: Library Acquires Entire Twitter Archive « Library of Congress Blog
Expect to see an emphasis on the scholarly and research implications of the acquisition. I’m no Ph.D., but it boggles my mind to think what we might be able to learn about ourselves and the world around us from this wealth of data. And I’m certain we’ll learn things that none of us now can even possibly conceive.
14 years ago by @jaeschke
show all tags
library
dataset
twitter
archive
librarydatasettwitterarchive
copydelete
- community post
- history of this post
7SNAP: Stanford Network Analysis Platform
http://snap.stanford.edu/
14 years ago by @hotho
show all tags
dataset
software
stanford
analysis
tools
network
snap
datasetsoftwarestanfordanalysistoolsnetworksnap
copydelete
- community post
- history of this post
1MaxMind - GeoLite City | Free Geolocation Database
http://www.maxmind.com/app/geolitecity
14 years ago by @folke
show all tags
commercial
dataset
ip
free
geolocation
commercialdatasetipfreegeolocation
copydelete
- community post
- history of this post
2Social Network Data
http://www.angela-bohn.de/data.html
14 years ago by @hotho
show all tags
dataset
sna
datasetsna
copydelete
- community post
- history of this post
1Semantic Matching
S-Match is an open source Java framework for semantic matching. It contains semantic matching, minimal semantic matching and structure preserving semantic matching algorithm implementations.
14 years ago by @hotho
show all tags
geonames
dataset
wordnet
geonamesdatasetwordnet
copydelete
- community post
- history of this post
4The ClueWeb09 Dataset
http://boston.lti.cs.cmu.edu/Data/clueweb09/
14 years ago by @dbenz
show all tags
web
dataset
clueweb
research
webdatasetcluewebresearch
copydelete
- community post
- history of this post
1SDMX – Statistical Data and Metadata Exchange
http://sdmx.org/
14 years ago by @sirko
show all tags
dataset
xml
statistics
datasetxmlstatistics
copydelete
- community post
- history of this post
1MOA Massive Online Analysis
http://moa.cs.waikato.ac.nz/
14 years ago by @atzmueller
show all tags
venus
dataset
massive
mining
analysis
moa
venusdatasetmassivemininganalysismoa
copydelete
- community post
- history of this post
1dreamsbox.com | read other people's dreams or share your own
http://www.dreamsbox.com/php/?page=10&daysold=3
14 years ago by @sac
show all tags
journal
dataset
dream
journaldatasetdream
copydelete
- community post
- history of this post
1Eurostat Bulk Download
http://epp.eurostat.ec.europa.eu/NavTree_prod/everybody/BulkDownloadListing?dir=data&sort=2&sort=-2
14 years ago by @sirko
show all tags
dataset
eurostat
dataseteurostat
copydelete
- community post
- history of this post
1Official Google Blog: Statistics for a changing world: Google Public Data Explorer in Labs
http://googleblog.blogspot.com/2010/03/statistics-for-changing-world-google.html
14 years ago by @dolefulrabbit
show all tags
dataset
visualization
datamining
google
statistics
datasetvisualizationdatamininggooglestatistics
copydelete
- community post
- history of this post
1Dataset: Barton - SIMILE
http://simile.mit.edu/wiki/Dataset:_Barton
15 years ago by @dolefulrabbit
show all tags
library
dataset
of
barton
congress
librarydatasetofbartoncongress
copydelete
- community post
- history of this post
1http://www.rkbexplorer.com/data/
http://www.rkbexplorer.com/data/
14 years ago by @mortimer_m8
show all tags
linkeddata
bibliography
rdf
dataset
conference
eswc2010
coreference
semanticweb
publications
linkeddatabibliographyrdfdatasetconferenceeswc2010coreferencesemanticwebpublications
copydelete
- community post
- history of this post
2140kit : The Free, Open Source Twitter Analytics Platform
http://140kit.com/
14 years ago by @hotho
show all tags
dataset
toread
twitter
collection
free
open
datasettoreadtwittercollectionfreeopen
copydelete
- community post
- history of this post
2Measuring User Influence in Twitter
http://twitter.mpi-sws.org/
14 years ago by @hotho
show all tags
dataset
toread
twitter
paper
datasettoreadtwitterpaper
copydelete
- community post
- history of this post
1Summary - Scientext
Scientext is a new, on-line French and English corpus of scientific texts. The corpus includes 4.8 million running tokens in French, 13 million words of research articles in English (medicine and biology), and an English-language sub-corpus of French undergraduate students’ texts (1,1 million words). The corpus is organized to facilitate the linguistic study of authorial position and reasoning in scientific articles through phraseology and lexico-grammatical markers linked to causality.
14 years ago by @dbenz
show all tags
scientext
science
dataset
texts
english
french
scientextsciencedatasettextsenglishfrench
copydelete
- community post
- history of this post
6Mining of Massive Datasets
http://i.stanford.edu/~ullman/mmds.html
14 years ago by @dbenz
show all tags
dataset
massive
data
data_mining
datasetmassivedatadata_mining
copydelete
- community post
- history of this post
3Find Open Datasets and Machine Learning Projects | Kaggle
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. Flexible Data Ingestion.
3 years ago by @analyst
show all tags
dataset
kaggle
machine-learning
collection
datasetkagglemachine-learningcollection
copydelete
- community post
- history of this post
1Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning | Nature Biotechnology
https://www.nature.com/articles/s41587-021-01094-0
2 years ago by @becker
show all tags
dataset
technologies
large
tissue
different
single
cell
datasettechnologieslargetissuedifferentsinglecell
copydelete
- community post
- history of this post
1TLC Trip Record Data - TLC
https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
2 years ago by @jaeschke
show all tags
dataset
data
nyt
ny
taxi
datasetdatanytnytaxi
copydelete
- community post
- history of this post
1Global datasets - spatial-analyst.net
http://spatial-analyst.net/wiki/index.php?title=Global_datasets
13 years ago by @procomun
show all tags
dataset
raster
datasetraster
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
3
⟩
⟩⟩

publications (hide)402
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

4A large annotated corpus for learning natural language inference
S. Bowman, G. Angeli, C. Potts, and C. Manning. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, (2015)
8 years ago by @thoni
show all tags
dataset
snli
inference
natural
language
corpus
datasetsnliinferencenaturallanguagecorpus
copydeleteadd this publication to your clipboard
4Folksonomies and clustering in the collaborative system CiteULike
A. Capocci, and G. Caldarelli. Journal of Physics A: Mathematical and Theoretical, 41 (22): 224016 (7pp) (2008)
16 years ago by @hotho
show all tags
dataset
clustering
citeulike
folksonomy
***
network
properties
datasetclusteringciteulikefolksonomy***networkproperties
copydeleteadd this publication to your clipboard
2The distribution of amphibians and reptiles on Samos island (Greece)
J. Speybroeck, D. Bohle, E. Razzetti, M. Dimaki, M. Kirchner, and W. Beukema. Herperozoa 27 (1/2): 39 - 63, (July 2014)
7 years ago by @aandrovitsanea
show all tags
mapping
large
island
reptiles
Amphibia
distribution
herpetofauna
dataset
Hemorrhois
contemporary
record
nummifer
amphibians
Samos
new
reptilia
mappinglargeislandreptilesAmphibiadistributionherpetofaunadatasetHemorrhoiscontemporaryrecordnummiferamphibiansSamosnewreptilia
copydeleteadd this publication to your clipboard
3SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation
F. Hill, R. Reichart, and A. Korhonen. (2014)cite arxiv:1408.3456.
7 years ago by @thoni
show all tags
simlex999
dataset
evaluation
simlex999datasetevaluation
copydeleteadd this publication to your clipboard
2Harnessing Folksonomies to Produce a Social Classification of Resources
A. Zubiaga, V. Fresno, R. Martinez, and A. Garcia-Plaza. IEEE Trans. on Knowl. and Data Eng., 25 (8): 1801--1813 (August 2013)
7 years ago by @thoni
show all tags
dataset
folksonomy
delicious
classification
datasetfolksonomydeliciousclassification
copydeleteadd this publication to your clipboard
1Soil Moisture Neutron Probe Data (FIFE)
E. Kanemasu. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/111. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
5 years ago by @karinawilliams
show all tags
dataset
fife
datasetfife
copydeleteadd this publication to your clipboard
1An Efficient Content Collaborative – Based and Hybrid Approach for Movie Recommendation Engine
R. Furtado. International Journal of Trend in Scientific Research and Development, 4 (3): 894-904 (April 2020)
4 years ago by @ijtsrd
show all tags
Content-based
dataset
hybrid
prototypes
Collaborative
and
engines
Simple
filtering
recommenders
Content-baseddatasethybridprototypesCollaborativeandenginesSimplefilteringrecommenders
copydeleteadd this publication to your clipboard
2Garbage in, garbage out?: do machine learning application papers in social computing report where human-labeled training data comes from?
R. Geiger, K. Yu, Y. Yang, M. Dai, J. Qiu, R. Tang, and J. Huang. FAT*, page 325-336. ACM, (2020)
4 years ago by @mgrueter
show all tags
dataset
documentation
datasetdocumentation
copydeleteadd this publication to your clipboard
3Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification.
J. Buolamwini, and T. Gebru. FAT, volume 81 of Proceedings of Machine Learning Research, page 77-91. PMLR, (2018)
4 years ago by @mgrueter
show all tags
image
dataset
gender
bias
evaluation
imagedatasetgenderbiasevaluation
copydeleteadd this publication to your clipboard
2GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
A. Wang, A. Singh, J. Michael, F. Hill, O. Levy, and S. Bowman. ICLR (Poster), OpenReview.net, (2019)
4 years ago by @nosebrain
show all tags
nlp
dataset
glue
nlpdatasetglue
copydeleteadd this publication to your clipboard
1The ERA-Interim reanalysis: configuration and performance of the data assimilation system
D. Dee, S. Uppala, A. Simmons, P. Berrisford, P. Poli, S. Kobayashi, U. Andrae, M. Balmaseda, G. Balsamo, P. Bauer and 26 other author(s). Quarterly Journal of the Royal Meteorological Society, 137 (656): 553--597 (Apr 1, 2011)
6 years ago by @pbett
show all tags
era
dataset
observations
eradatasetobservations
copydeleteadd this publication to your clipboard
3Structure in the Enron Email Dataset
P. Keila, and D. Skillicorn. Comput. Math. Organ. Theory, 11 (3): 183--199 (October 2005)
11 years ago by @macek
show all tags
Analysis
ENRON
DataSet
AnalysisENRONDataSet
copydeleteadd this publication to your clipboard
1Accurate telemonitoring of Parkinson’s disease progression by non-invasive speech tests
A. Tsanas, M. Little, P. McSharry, and L. Ramig. (2009)
14 years ago by @andrea.zanda
show all tags
dataset
parkinsons
datasetparkinsons
copydeleteadd this publication to your clipboard
1DING! Dataset Ranking using Formal Descriptions
?. (2009)
13 years ago by @lina.wolf
show all tags
dataset
ranking
datasetranking
copydeleteadd this publication to your clipboard
8Mining the Social Web: Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites
M. Russell. O'Reilly Media, Sebastopol, Canada, 1. edition, (2011)
13 years ago by @clemensbaier
show all tags
web
dataset
development
book
datamining
Twitter
2011
KDE
KDD
socialmedia
analysis
visualisation
webdatasetdevelopmentbookdataminingTwitter2011KDEKDDsocialmediaanalysisvisualisation
copydeleteadd this publication to your clipboard
2Mining Massive Datasets
A. Rajaraman, J. Leskovec, and J. Ullman. (2014)
10 years ago by @jaeschke
show all tags
bigdata
dataset
mining
book
data
bigdatadatasetminingbookdata
copydeleteadd this publication to your clipboard
3The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation.
S. Bay, D. Kibler, M. Pazzani, and P. Smyth. SIGKDD Explorations, 2 (2): 81-85 (2000)
11 years ago by @vivion
show all tags
dataset
data-mining
multivariate
datasets
statistics
datasetdata-miningmultivariatedatasetsstatistics
copydeleteadd this publication to your clipboard
1A Comparison of ABK-Means Algorithm with Traditional Algorithms
M. Gangavane. International Journal of Trend in Scientific Research and Development, 1 (4): 614-621 (June 2017)
6 years ago by @ijtsrd
show all tags
ClusteringRule
K-Means
Dataset
Cluster
graph
Engineering
NLP
Crime
and
Area-base
Computer
Adaptive-Bisecting
Engine
base
ClusteringRuleK-MeansDatasetClustergraphEngineeringNLPCrimeandArea-baseComputerAdaptive-BisectingEnginebase
copydeleteadd this publication to your clipboard
1Using ERA-Interim reanalysis for creating datasets of energy-relevant climate variables
P. Jones, C. Harpham, A. Troccoli, B. Gschwind, T. Ranchin, L. Wald, C. Goodess, and S. Dorling. Earth System Science Data, 9 (2): 471-495 (July 2017)
6 years ago by @pbett
show all tags
colleagues
energy
dataset
ecem
renewables
climatology
colleaguesenergydatasetecemrenewablesclimatology
copydeleteadd this publication to your clipboard
4Revisiting Unreasonable Effectiveness of Data in Deep Learning Era.
C. Sun, A. Shrivastava, S. Singh, and A. Gupta. ICCV, page 843-852. IEEE Computer Society, (2017)
6 years ago by @loroch
show all tags
dataset
topology
training
deep_learning
datasettopologytrainingdeep_learning
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩