The 20 Newsgroups data set
to 20 dataset newsgroups text by hotho and 1 other person on Apr 12, 2008, 3:32 PM20 Newsgroups
Abstract
This data set consists of 20000 messages taken from 20 Usenet newsgroups.
Information files:
description of the...20 Newsgroups
Abstract
This data set consists of 20000 messages taken from 20 Usenet newsgroups.
Information files:
description of the data
Data files:
20_newsgroups.tar.gz (17.3M; 61.6M uncompressed)
mini_newsgroups.tar.gz A subset composed of 100 articles from each newsgroup. (1.9M; 6.2M uncompressed)
to 20 dataset newsgroups text by hotho on Apr 12, 2008, 3:32 PMSoftware can be downloaded by using:
l: tmskriktext
p: 780387954332
to dm mining software text tm by hotho on Apr 12, 2008, 3:17 PM- to challenge nlp prize semantic text by hotho on Mar 20, 2008, 8:36 AM
Multi-Label Classification
to classification dataset extension multilabel text tools weka by hotho and 1 other person on Nov 23, 2007, 1:12 PM- to corpus dataset text by hotho on Nov 16, 2007, 5:36 PM
- to blog data dm mining ml social text tm toread by hotho and 1 other person on Oct 28, 2007, 3:47 PM
CiteXplore combines literature search with text mining tools for biology.
Search results are cross referenced to EBI a...CiteXplore combines literature search with text mining tools for biology.
Search results are cross referenced to EBI applications based on publication identifiers.
Links to full text versions are provided where available.
to Literatur citeseer database full literature search suche text by hotho on Oct 19, 2007, 10:21 PM- to Dschungelbuch noten text by hotho and 1 other person on May 27, 2007, 10:57 AM
- to 2007 ir mining text tm workshop by hotho on Mar 12, 2007, 6:40 PM