tag :: datasets | BibSonomy

bookmarks (306)

7Infochimps.org: Free Redistributable Data Sets of Every Kind
http://infochimps.org/
16 years ago by @brightbyte
show all tags
datasets
facts
semanticweb
ontology
11Enron Email Dataset
hosted by CMU
16 years ago by @mstrohm
show all tags
datasets
7Infochimps.org: Free Redistributable Data Sets of Every Kind
Infochimps.org: Free Redistributable Rich Data Sets. There are many sources to find out something about everything. Until now, there’s been no good place for you to find out everything about something. The infochimps.org community is assembling and interconnecting the world's best repository for raw data -- a sort of giant free almanac, with tables on everything you can put in a table. Built by data nerds, used by data nerds, it's a central source for the information you need to power the projects the world needs.
16 years ago by @pitman
show all tags
reuse
data
datasets
free
semanticweb
db
6getting theinfo: data sets
a collection of different datasets
16 years ago by @mstrohm
show all tags
mining
datasets
1OTMI Repository - OpenTextMining
http://www.opentextmining.org/wiki/OTMI_Repository
16 years ago by @lee_peck
show all tags
medical
OTMI
biological
datasets
repository
8Some Datasets Available on the Web » Data Wrangling Blog
The Datawrangling blog was put on the back burner last May while I focused on my startup. Now that I have some bandwidth again, I am getting back to work on several pet projects (including the Amazon EC2 Cluster).
15 years ago by @folke
show all tags
web
datasets
2Treemaps for Space-Constrained Visualization of Hierarchies
http://www.cs.umd.edu/hcil/treemap-history/
17 years ago by @avivagabriel
show all tags
information_visualization
hierarchies
visualization
data
data_visualization
information_science
infographics
datasets
taxonomies
hierarchical
dataviz
information
treemaps
1data visualization & visual design =<>= information aesthetics
http://www.google.com/url?sa=t&ct=res&cd=5&url=http%3A%2F%2Fwww.infosthetics.com%2F&ei=Y9irRvrKJZ6ooAThr8GPBg&usg=AFQjCNGMm7LwRcB2ohgYnCWKCUydc9y-Ow&sig2=5dRRvGK3HbIq_U9B_yRR3w
17 years ago by @avivagabriel
show all tags
visualization
visual
data
dataviz
datasets
graphics
infoviz
information
aesthetics
1InfoVis tests | Information Esthetics
What makes something “Information Visualization?” Is it just visual titillation? Or is it a tool that interprets, analyzes, and facilitates deeper understanding of data?
17 years ago by @avivagabriel
show all tags
esthetics
visualization
infovis
asthetics
data
infographics
datasets
infoviz
information
aesthetics
1Social Network Fragments
Analysis of social networks via email usage and habits.
17 years ago by @avivagabriel
show all tags
social
SNA
visualization
social_network_analysis
data
email
socialnetworkanalysis
datasets
networks
socialnetworks
12Data.gov
http://www.data.gov/
15 years ago by @mstrohm
show all tags
mining
datasets
1 billion web page dataset from CMU
http://anyall.org/blog/2009/04/1-billion-web-page-dataset-from-cmu/
15 years ago by @mstrohm
show all tags
mining
datasets
search
2Text REtrieval Conference (TREC) QA Data
http://trec.nist.gov/data/qa.html
15 years ago by @mkroell
show all tags
datasets
QuestionAnswering
4Multilabel Classification
http://mlkd.csd.auth.gr/multilabel.html
15 years ago by @folke
show all tags
based
instance
implementation
mulang
regression
datasets
logistic
label
combining
multi
1rwdai's dataset Bookmarks on Delicious
http://delicious.com/rwdai/dataset
15 years ago by @mkroell
show all tags
datasets
1Links to Data Sources
http://www.biostat.umn.edu/~lynn/datalinks.html
15 years ago by @vivion
show all tags
datasets
statistics
7infochimps.org — Find Any Dataset in the World
http://infochimps.org/
15 years ago by @folke
show all tags
datasets
list
repository
1Analytics in the NYT - Trends and Outliers
The burgeoning interest in R demonstrates that there’s demand for analytics to solve real, business-critical problems in a broad spectrum of companies and roles, and that some of the incumbent analytics offerings, in particular SAS and SPSS, don’t sufficiently meet the growing need for analytics in many major companies. Annotated link http://www.diigo.com/bookmark/http%3A%2F%2Fspotfire.tibco.com%2Fcommunity%2Fblogs%2Fenterpriseanalytics%2Farchive%2F2009%2F01%2F08%2Fanalytics-in-the-nyt.aspx
16 years ago by @lystrata
show all tags
R
visualization
data
datasets
statistics
2Data Catalog Vocabulary (dcat) | DERI Vocabularies
http://vocab.deri.ie/dcat
14 years ago by @acka47
show all tags
linkeddata
vocabulary
opendata
rdf
metadata
datasets
description
dcat
1Datasets « Tore Opsahl
http://toreopsahl.com/datasets/
14 years ago by @folke
show all tags
datasets
tore
opsahl
1ve2 - the voiD editor
http://ld2sd.deri.org/ve2/
14 years ago by @acka47
show all tags
editor
linkeddata
opendata
metadata
voiD
datasets
1start [Wiki]
http://barcelona.research.yahoo.net/dokuwiki/doku.php
14 years ago by @mkroell
show all tags
datasets
1Microsoft Learning to Rank Datasets - Microsoft Research
http://research.microsoft.com/en-us/projects/mslr/default.aspx
14 years ago by @beate
show all tags
Bing
MSN
learning-to-rank
datasets
1Stanford Large Network Dataset Collection
http://snap.stanford.edu/data/#socnets
13 years ago by @folke
show all tags
social
sna
public
datasets
networks
4chan & 8chan Word Embeddings – Textgain
https://www.textgain.com/portfolio/8chanembeddings/
4 years ago by @mstrohm
show all tags
bias
word-embeddings
datasets
1NHANES - Questionnaires, Datasets, and Related Documentation
http://www.cdc.gov/nchs/nhanes/nhanes_questionnaires.htm
13 years ago by @vivion
show all tags
dataset
data
nhanes
datasets
statistics
1Index of /pub/Health_statistics/NCHS/nhanes/2007-2008/
ftp://ftp.cdc.gov/pub/Health_statistics/NCHS/nhanes/2007-2008/
13 years ago by @vivion
show all tags
dataset
data
nhanes
datasets
download
statistics
ftm
44Diigo - Web Highlighter and Sticky Notes, Online Bookmarking and Annotation, Personal Learning Network.
http://www.diigo.com/
13 years ago by @beate
show all tags
bookmarking-systems
highlights
datasets
4Amazon Web Services (AWS) Hosted Public Data Sets
Various US databases provided by federal government agencies. Census, Labor Statistics, Transportation, Economics. Also: A 3D Version of the PubChem Library, Annotated Human Genome Data.
12 years ago by @jaj
show all tags
data
publicdata
datasets
3Home Page for 20 Newsgroups Data Set
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. The collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.
12 years ago by @jaj
show all tags
socialnetworking
corpus
data
datasets
1SOS: Dataset Catalog
noaa
12 years ago by @jaj
show all tags
earth
datasets
1Data Sets Available through YQL - YDN
YQL (Yahoo Query Language) works with arbitrary structured (XML or JSON) documents with repeating elements, such as a list of restaurants or search results. Different "known" collections of these items are presented as "tables" in the YQL syntax, and are notionally namespaced based on the service providing the data.
12 years ago by @jaj
show all tags
api
datasets
9UCI Machine Learning Repository
data sets as a service to the machine learning community.
12 years ago by @jaj
show all tags
reference
machine-learning
corpus
data
datamining
datasets
1 billion web page dataset from CMU
a crawl of 1 billion web pages. It’s 5 terabytes compressed — big enough so they have to send it to you by mailing hard drives.
12 years ago by @jaj
show all tags
datasets
1Pew Research Center Dataset Download
The Pew Research Center makes its data available to the public for secondary analysis.
12 years ago by @jaj
show all tags
public_opinion
datasets
1dshort.com: Data Sources
financial data
12 years ago by @jaj
show all tags
economics
datasets
3DataSets Publisher
a torrent tracker for public datasets. If you are a scientist or research developer, or are simply interested, you can find and download datasets; if you own a dataset, you can publish it (become a torrent seeder) at this site.
12 years ago by @jaj
show all tags
torrents
datasets
2UCI Machine Learning Repository: Data Sets
http://archive.ics.uci.edu/ml/datasets.html
13 years ago by @sdo
show all tags
machine
learning
datasets
uci
ml
6The Data Hub
http://ckan.net/
13 years ago by @schmidt2
show all tags
dataset
comprehensive_archive_network
data_hub
ckan
datasets
homepage
search
open_data
6Home - CKAN
http://ckan.net/
13 years ago by @zazi
show all tags
Linked_Data
Datasets
Dataset_Repository
LOD_Cloud
CKAN
Semantic_Web
1Regionaldatenbank
https://www.regionalstatistik.de/genesis/online/logon
12 years ago by @folke
show all tags
dataset
germany
data
official
datasets
statistics
1Online Free Access Datasets
http://www.researchpipeline.com/datasets.html
12 years ago by @jaj
show all tags
data_catalog
datasets
1Knowledge Network for Energy Transitions
The Knowledge Network for Energy Transitions (KNET) is a global network of scholars, educators and organizations who study the economic, political, cultural, environmental and technological aspects of changes in society's major energy systems
12 years ago by @procomun
show all tags
energy
datasets
knowledge
4KONECT - The Koblenz Network Collection
http://konect.uni-koblenz.de/
12 years ago by @folke
show all tags
dataset
datasets
network
2Search-sets with subjectivity annotations
http://www.cs.cornell.edu/home/llee/data/search-subj.html
11 years ago by @bsc
show all tags
subjectivity
datasets
analysis
2KDD Cup 2003 - Datasets
http://www.cs.cornell.edu/projects/kddcup/datasets.html
11 years ago by @clemensbaier
show all tags
paper
citation
datasets
sota
research
1Global datasets - GRASS-Wiki
Elevation data + imagery
12 years ago by @iblis
show all tags
gis
resource
data
cartography
datasets
2New tools aim to analyze the quality of online texts | iX
https://www.heise.de/ix/meldung/Neue-Tools-sollen-Qualitaet-von-Online-Texten-analysieren-3582332.html#mobile_detect_force_desktop
8 years ago by @hotho
show all tags
news
dii
argument
kallimachos
crowd
mining
datasets
fake
worker
1CrowdSignals.io: A Massive New Mobile Data Collection Campaign
CrowdSignals.io is an ethical, crowdfunded mobile data collection campaign
7 years ago by @bsc
show all tags
datasets
1WISDM Lab: Dataset
Activity Recognition Datasets
7 years ago by @bsc
show all tags
activity_recognition
datasets
4caesar0301/awesome-public-datasets: A topic-centric list of high-quality open datasets in public domains. By everyone, for everyone!
https://github.com/caesar0301/awesome-public-datasets
7 years ago by @becker
show all tags
dataset
public
data
datasets
4The Web Index | by World Wide Web Foundation
Designed and produced by the World Wide Web Foundation, the Web Index is the world’s first multi-dimensional measure of the Web’s growth, utility and impact on people and nations.
10 years ago by @lysander07
show all tags
linkeddata
data
datasets
1Ebola Synthetic Information
http://www.vbi.vt.edu/ndssl/ebola/ebola-data/
10 years ago by @asmelash
show all tags
datasets
ebola
1WEBSPAM-UK2007 | Datasets | Web Spam Detection
http://www.yr-bcn.es/webspam/datasets/uk2007/
17 years ago by @beate
show all tags
web
datasets
spam
2GiveALink Beta
By donating your bookmarks, you let GiveALink analyze your preferences along with those of many other people. We will mine the resulting collection for interesting insights and use the information to develop novel applications. We will also share bookmark data with the Web research community, hoping to foster the development of many novel Web mining techniques and applications to search, recommendation, navigation, personalization and visualization of the Web.
16 years ago by @mstrohm
show all tags
TOFOLLOW
social-search
folksonomy
datasets
tools
networks
3Home - Numbrary
http://numbrary.com/
16 years ago by @brightbyte
show all tags
data
datasets
search
6theinfo
for people with large data sets
16 years ago by @brightbyte
show all tags
free-content
hub
datamining
datasets
facts
wikidata
4ICWSM 2009 - Data Challenge
http://www.icwsm.org/2009/data/
16 years ago by @mstrohm
show all tags
web
weblogs
datasets
6(theinfo)
http://theinfo.org/
16 years ago by @gromgull
show all tags
datasets
1NYTimes Developer Network
Article Search API and many other NYT APIs
16 years ago by @mstrohm
show all tags
datasets
tools
1An Epistemic Dynamic Model for Tagging Systems
includes code and simulations
15 years ago by @mstrohm
show all tags
tagging
folksonomy
simulation
datasets
1English Gigaword Language Model Training Recipe
3-gram
15 years ago by @gromgull
show all tags
nlp
language
language-model
n-gram
datasets
1del.icio.us stats - deli.ckoma
http://deli.ckoma.net/stats#posts_monthly
15 years ago by @mstrohm
show all tags
web2.0
datasets
statistics
5A list of Social Tagging Datasets made available for research
http://kmi.tugraz.at/staff/markus/datasets/
15 years ago by @gromgull
show all tags
tagging
social
datasets
web20
data-set
1Read-me for Kilgarriff's BNC word frequency lists
http://www.kilgarriff.co.uk/bnc-readme.html
15 years ago by @mkroell
show all tags
datasets
1CANRI Spatial Data Download
http://www.canri.nsw.gov.au/download/
15 years ago by @beate
show all tags
2009
clustering
seminar
datasets
spatial
1DAI-Labor | Datasets
http://www.dai-labor.de/index.php?id=1726&L=1
15 years ago by @lee_peck
show all tags
tu
Berlin
slashdot
DAI-Labor
datasets
delicious
1Online financial data APIs and resources
http://kottke.org/09/06/online-financial-data-apis-and-resources
15 years ago by @mkroell
show all tags
datasets
finance
1INSNA - Social Network Analysis Data
http://www.insna.org/software/data.html
15 years ago by @folke
show all tags
sna
email
datasets
3DataSets Publisher
http://www.datasetpublisher.com/
15 years ago by @folke
show all tags
datasets
repository
1The Future Of Work - It’s Data, Baby - NYTimes.com
Last week, Sam explored trends in the technology jobs market, suggesting that significant opportunities only reveal themselves when examining both the available jobs and the underlying trends in demand for skills. Coincidentally, on the same day that Sam’s piece was published, The New York Times suggested that “the sexy job in the next 10 years will be statisticians.”
15 years ago by @lystrata
show all tags
daily
datasets
article
trends
statistics
2Measuring User Influence in Twitter
incl. a Twitter Dataset
14 years ago by @mstrohm
show all tags
twitter
SNA
mining
datasets
networks
7Infochimps Data Marketplace / Commons: Download Sell or Share Databases, statistics, data sets for free
Find and download data in any format, from financial to social networking to GIS data. Or sell data in our data marketplace, at a price you set. We have large data sets, spreadsheets, and databases packed with statistics.
14 years ago by @dbenz
show all tags
dataset
data
datasets
download
search
1Rada Mihalcea: Downloads
http://www.cse.unt.edu/~rada/downloads.html#semcor
14 years ago by @mkroell
show all tags
datasets
4FolkRank |:| A Social Semantic Desktop |:| NEPOMUK Consortium
Social bookmark tools are rapidly emerging on the Web. In such systems users are setting up lightweight conceptual structures called folksonomies. The reason for their immediate success is the fact that no specific skills are needed for participating. At
18 years ago by @avivamagnolia
show all tags
folkrank
collective+intelligence
algorithms
folksonomy
datamining
semweb
folksonomies
datasets
social+web
social+design
retrieval
2006
taxonomies
unstructured
chaotic
information+retrieval
social+semantics
social+web+design
5Data Documentation Initiative
XML for Social Science datasets etc.
17 years ago by @mobileink
show all tags
standard
metadata
std
ddi
datasets
4view contemporary American culture through austere lens of statistics...transformed into images
Looks at contemporary American culture through austere lens of statistics. Each image portrays a specific quantity of something: fifteen million sheets of office paper (five minutes of paper use); 106,000 aluminum cans (thirty seconds of can consumption)
17 years ago by @avivagabriel
show all tags
USA
art
visualization
infographics
datasets
consumption
infoviz
images
american
culture
society
information
consumer
statistics
1climatechange =|= conversations that matter =|= worldcafe =|= visual thinking
http://conversationsthatmatter.typepad.com/climatechange/
17 years ago by @avivagabriel
show all tags
environment
visual_thinking
visualization
climatechange
carbon
greenhouse
infographics
climate
datasets
globalwarming
infoviz
world_cafe
14Swivel =|= Datasets & Information Visualization
http://www.swivel.com/
17 years ago by @avivagabriel
show all tags
information_visualization
visualization
datagraphics
data
dataviz
infographics
datasets
analysis
swivel
infoviz
information
statistics
1Research Related to Social Tagging - NLP & IR Group @ UNED
http://nlp.uned.es/social-tagging/
14 years ago by @kasimiro
show all tags
tagging
social
datasets
research
1Vocabulary void | rdfs.org – Your Ontologies Are Here
http://rdfs.org/ns/void/html
15 years ago by @dolefulrabbit
show all tags
vocabulary
void
semweb
linking
datasets
semanticweb
1Bioinformatics Links Directory
The Bioinformatics Links Directory features curated links to molecular resources, tools and databases. The links listed in this directory are selected on the basis of recommendations from bioinformatics experts in the field. We also rely on input from our community of bioinformatics users for suggestions.
15 years ago by @sr320
show all tags
database
software
bioinformatics
datamining
genomics
datasets
links
tools
databases
1ICWSM-13 - Datasets - Datasets
http://www.icwsm.org/2013/datasets/datasets/
11 years ago by @asmelash
show all tags
icwsm
youtube
twitter
facebook
datasets
1Presentation of Bio2RDF release 2
Bio2RDF Release 2: Improved coverage, interoperability and provenance of Linked Data for the Life Sciences
11 years ago by @legaultdenis
show all tags
datasets
1Datasets – Deutsche Bahn Data Portal
a bookmark
9 years ago by @schmidt2
show all tags
opendata
deutsche_bahn
datasets
1ImageNet - An image DB organized according to WordNet
ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. Currently we have an average of over five hundred images per node. We hope ImageNet will become a useful resource for researchers, educators, students and all of you who share our passion for pictures.
13 years ago by @mstrohm
show all tags
tagging
linguistics
datasets
3Webscope from Yahoo! Labs
http://webscope.sandbox.yahoo.com/catalog.php?datatype=g
13 years ago by @folke
show all tags
messenger
yahoo
datasets
research
2Datamob: Public data put to good use
http://www.datamob.org/
13 years ago by @draganigajic
show all tags
publicrecords
visualization
datamining
datasets
directory
100+
mashup
government
database
api
publicdata
*read
*RIL
statistics
8Some Datasets Available on the Web » Data Wrangling Blog
http://www.datawrangling.com/some-datasets-available-on-the-web
12 years ago by @psinger
show all tags
datasets
free
1World Gazetteer: download
http://www.world-gazetteer.com/wg.php?x=1129163518&men=stdl&lng=en&gln=xx&dat=32&srt=npan&col=aohdq
13 years ago by @folke
show all tags
world
city
wide
datasets
names
3OpenGeoDb
http://opengeodb.org/wiki/OpenGeoDB
12 years ago by @folke
show all tags
dataset
database
amtlicher
gemeindeschluessel
germany
datasets
geolocation
2The UK National Archives Digital Archive of Datasets
The National Digital Archive of Datasets (NDAD) preserves and provides online access to archived digital datasets and documents from UK central government departments. Our collection spans 40 years of recent history, with the earliest available dataset dating back to about 1963.
12 years ago by @jaj
show all tags
UK
data_archive
datasets
2Data Sets | GroupLens Research
GroupLens is a research lab in the Department of Computer Science and Engineering at the University of Minnesota. Datasets include MovieLens, Wikilens, Book-Crossing, Jester Joke, and EachMovie.
12 years ago by @jaj
show all tags
movies
datamining
datasets
cs
1Open Economics
The Open Economics project provides open content, data and code related to Economics. This site itself provides interfaces to some (though not all) of the Open Economics datasets and models.
12 years ago by @jaj
show all tags
economics
datasets
3Network data
links to some network data sets I've compiled over the years. All of these are free for scientific use
12 years ago by @jaj
show all tags
datasets
1infochimps.org — Infochimps: Twitter Census
collection of Twitter data
12 years ago by @jaj
show all tags
twitter
datasets
1Summary of Data Sets by Data Type
http://kdd.ics.uci.edu/summary.data.type.html
12 years ago by @jaj
show all tags
datasets
4StatLib :: Data, Software and News from the Statistics Community
StatLib, a system for distributing statistical software, datasets, and information. Started in 1989; hosted by the Department of Statistics at Carnegie Mellon University.
12 years ago by @jaj
show all tags
statistical_packages
data
datasets
1MVSTATS -- Multivariate Data Analysis 6e and Great Ideas For Teaching Multivariate Statistics
http://www.mvstats.com/
12 years ago by @jaj
show all tags
textbook
datasets
statistics
1Data Library
datasets that accompany textbooks
12 years ago by @jaj
show all tags
textbook
datasets


publications (76)

2Parallel Corpora for bi-lingual English-Ethiopian Languages Statistical Machine Translation.
S. Abate, M. Woldeyohannis, M. Tachbelie, M. Meshesha, S. Atinafu, W. Mulugeta, Y. Assabie, H. Abera, B. Seyoum, T. Abebe and 4 other author(s). COLING, page 3102-3111. Association for Computational Linguistics, (2018)
3 years ago by @asmelash
show all tags
MT
HornMT
datasets
parallel-corpus
1Population structure, migration, and diversifying selection in the Netherlands
A. Abdellaoui, J. Hottenga, P. Knijff, M. Nivard, X. Xiao, P. Scheet, A. Brooks, E. Ehli, Y. Hu, G. Davies and 7 other author(s). Eur J Hum Genet, (March 2013)
11 years ago by @peter.ralph
show all tags
eurogenetics
human_genome
datasets
PCA
3Outlier detection for high dimensional data
C. Aggarwal, and P. Yu. SIGMOD Rec., (May 2001)
14 years ago by @vivion
show all tags
hi
large
outlier
datasets
outliers
JW300: A Wide-Coverage Parallel Corpus for Low-Resource Languages.
Z. Agic and I. Vulic. ACL (1), pages 3204-3210. Association for Computational Linguistics, (2019)
3 years ago by @asmelash
low-resource
MT
HornMT
datasets
Preprocessing of Low Response Data for Predictive Modeling
F. Alam. International Journal of Trend in Scientific Research and Development, 3 (3): 157-160 (April 2019)
5 years ago by @ijtsrd
Engineering
Datasets
Logistic
Reduction
component
Variable
Regression
analysis
Principal
Computers
HumAID: Human-Annotated Disaster Incidents Data from Twitter
F. Alam, U. Qazi, M. Imran, and F. Ofli. (2021). arXiv:2104.03090. Comment: Accepted in ICWSM-2021, Twitter datasets, Textual content, Natural disasters, Crisis Informatics.
3 years ago by @firojalam
Twitter
datasets
Describing Linked Datasets - On the Design and Usage of voiD, the 'Vocabulary of Interlinked Datasets'.
K. Alexander, R. Cyganiak, M. Hausenblas, and J. Zhao. WWW 2009 Workshop: Linked Data on the Web (LDOW2009), Madrid, Spain, (2009)
14 years ago by @munozjuan
datasets
linked
The Berkeley FrameNet Project
C. Baker, C. Fillmore, and J. Lowe. Proceedings of the 17th international conference on Computational linguistics, pages 86-90. Morristown, NJ, USA, Association for Computational Linguistics, (1998)
15 years ago by @mkroell
linguistic
datasets
The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation.
S. Bay, D. Kibler, M. Pazzani, and P. Smyth. SIGKDD Explorations, 2 (2): 81-85 (2000)
11 years ago by @vivion
dataset
data-mining
multivariate
datasets
statistics
Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format
J. Beckmann, A. Halverson, R. Krishnamurthy, and J. Naughton. Proceedings of the 22nd International Conference on Data Engineering, page 58--. Washington, DC, USA, IEEE Computer Society, (2006)
12 years ago by @sac
rdbms
xml
sparse
datasets
A survey of results on mobile phone datasets analysis
V. Blondel, A. Decuyper, and G. Krings. (2015). arXiv:1502.03406.
9 years ago by @hotho
phone
survey
datasets
mobile
Selecting representative data sets
T. Borovicka, M. Jirina Jr, M. Jirina, and P. Kordik. INTECH Open Access Publisher, (2012)
8 years ago by @ans
datasets
validation
On The Effect of Data Set Size on Bias And Variance in Classification Learning
D. Brain and G. Webb. Proceedings of the Fourth Australian Knowledge Acquisition Workshop (AKAW '99), pages 117-128. Sydney, The University of New South Wales, (1999)
8 years ago by @giwebb
Learning
large
from
datasets
The Need for Low Bias Algorithms in Classification Learning From Large Data Sets
D. Brain and G. Webb. Lecture Notes in Computer Science 2431: Principles of Data Mining and Knowledge Discovery: Proceedings of the Sixth European Conference (PKDD 2002), pages 62-73. Berlin/Heidelberg, Springer-Verlag, (2002)
8 years ago by @giwebb
Learning
large
from
datasets
Whole-genome sequencing of multiple Arabidopsis thaliana populations
J. Cao, K. Schneeberger, S. Ossowski, T. Günther, S. Bender, J. Fitz, D. Koenig, C. Lanz, O. Stegle, C. Lippert and 7 other author(s). Nat Genet, 43 (10): 956-963 (October 2011)
12 years ago by @peter.ralph
population_genomics
datasets
arabidopsis
A reference collection for web spam
C. Castillo, D. Donato, L. Becchetti, P. Boldi, S. Leonardi, M. Santini, and S. Vigna. SIGIR Forum, (December 2006)
13 years ago by @beate
spam-detection
datasets
web-spam
Analysing and Examining Taxonomy and Folksonomy Terms in the Hybrid Subject Device using Machine Learning Techniques
S. Chatterjee and R. Das. (2022)
3 months ago by @dsostaric1234
machine-learning
folksonomy
datasets
taxonomy
big-data
tags
Describing Textures in the Wild
M. Cimpoi, S. Maji, I. Kokkinos, S. Mohamed, and A. Vedaldi. 2014 IEEE Conference on Computer Vision and Pattern Recognition, pages 3606-3613. (2014)
6 months ago by @andolab
deep-learning
datasets
Exposing Large Datasets with Semantic Sitemaps
R. Cyganiak, R. Delbru, H. Stenzhorn, G. Tummarello, and S. Decker. Proceedings of the 5th European Semantic Web Conference, Berlin, Heidelberg, Springer Verlag, (June 2008)
16 years ago by @eswc2008
provenance
foundational-issues-storage-and-retrieval
sitemaps
datasets
crawling
search
voiD Guide - Using the Vocabulary of Interlinked Datasets
R. Cyganiak and M. Hausenblas. (January 2009). http://rdfs.org/ns/void-guide (last visited 22/4/2010).
14 years ago by @munozjuan
vocabulary
void
datasets
