< >
- If you are interested in doing research on Prosper or using Prosper data in support of your research, please contact us.
- Tweets2011 As part of the TREC 2011 microblog track, Twitter provided identifiers for approximately 16 million tweets sampled between January 23rd and F...Tweets2011 As part of the TREC 2011 microblog track, Twitter provided identifiers for approximately 16 million tweets sampled between January 23rd and February 8th, 2011. The corpus is designed to be a reusable, representative sample of the twittersphere - i.e. both important and spam tweets are included.
- d8taplex helps you discover, visualize and explore data found on the web including time series data
- Microsoft Research Speller Challenge
- Pearson Longman English Language Teaching (Pearson Longman ELT) is a leading educational publisher of quality resources for all ages and abilities across t...Pearson Longman English Language Teaching (Pearson Longman ELT) is a leading educational publisher of quality resources for all ages and abilities across the curriculum, providing solutions for teachers and students.
- Scientext is a new, on-line French and English corpus of scientific texts. The corpus includes 4.8 million running tokens in French, 13 million words of re...Scientext is a new, on-line French and English corpus of scientific texts. The corpus includes 4.8 million running tokens in French, 13 million words of research articles in English (medicine and biology), and an English-language sub-corpus of French undergraduate students’ texts (1,1 million words). The corpus is organized to facilitate the linguistic study of authorial position and reasoning in scientific articles through phraseology and lexico-grammatical markers linked to causality.
- Following a successful first edition, we are pleased to announce the 2nd edition of the Large Scale Hierarchical Text Classification (LSHTC) Pascal Challen...Following a successful first edition, we are pleased to announce the 2nd edition of the Large Scale Hierarchical Text Classification (LSHTC) Pascal Challenge. The LSHTC Challenge is a hierarchical text classification competition, using large datasets. This year’s challenge will increase the scale and the difficulty of the task, using data from Wikipedia (www.wikipedia.org), in addition to the ODP Web directory data (www.dmoz.org).
< >
- Proceedings of the 19th international conference on World wide web, page 591--600. ACM, (2010)
- Proceedings of the fourth ACM international conference on Web search and data mining, page 177--186. ACM, (2011)
- EACL, The Association for Computer Linguistics, (2006)
- Proceedings from the 2nd International Conference on Weblogs and Social Media AAAI, (2008)
- InScit2006: International Conference on Multidisciplinary Information Sciences and Technologies, (2006)
- Journal of Physics A: Mathematical and Theoretical 41(22):224016 7pp (2008)
- CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge mining, page 93--102. New York, NY, USA, ACM, (2008)
- Behav Res Methods 37(4):547-559 (November 2005)
- (2006)
- (1998)


group