In this post I want to pull together a couple of ideas around some of the measurable user activity generated as part of CFHE12. This will mainly focus around Twitter with some data from blog posts. I conclude that there are some simple opportunities to incorporate data from twitter into other channels, for example, summary of questions and retweets.
Twitter corpus for Sentiment Analysis from a class (cs224n)at Stanford.
Class page:
https://sites.google.com/site/twittersentimenthelp/for-researchers#Where_is_the_Tweet_corpus_8553
http://www.stanford.edu/~alecmgo/cs224n
LingPipe is tool kit for processing text using computational linguistics. LingPipe is used to do tasks like:
* Find the names of people, organizations or locations in news
* Automatically classify Twitter search results into categories
* Suggest correct spellings of queries
Truthy is a research project that helps you understand how memes spread online. We collect tweets from Twitter and analyze them. With our statistics, images, movies, and interactive data, you can explore these dynamic networks.
Our first application was the study of astroturf campaigns in elections. Currently, we're extending our focus to several themes. Browse the collection on the Memes page. Check out the Movie tool to browse and create animations of meme networks.
Twitter wird sein frisch eingekauftes Echtzeit-DV-System Storm als Open Source veröffentlichen. Damit wird die Technik für die Parallelisierung von Datenbankabfragen für alle verfügbar.
Was verrät die Wortwahl bei Twitter über die Laune des Verfassers? Sehr viel, sagen US-Wissenschaftler. Sie haben Millionen Tweets ausgewertet und festgestellt, wann die Nutzer in Hochstimmung sind - und wann man sie besser nicht anspricht.
Tweets2011
As part of the TREC 2011 microblog track, Twitter provided identifiers for approximately 16 million tweets sampled between January 23rd and February 8th, 2011. The corpus is designed to be a reusable, representative sample of the twittersphere - i.e. both important and spam tweets are included.
Current (beta) version is 0.9.19 (25 JUN 2009). Download instructions After installation set the application premissions for twibble as described in this post.
D. Tang, F. Wei, N. Yang, M. Zhou, T. Liu, and B. Qin. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), page 1555--1565. Baltimore, Maryland, Association for Computational Linguistics, (June 2014)