In this post I want to pull together a couple of ideas around some of the measurable user activity generated as part of CFHE12. This will mainly focus around Twitter with some data from blog posts. I conclude that there are some simple opportunities to incorporate data from twitter into other channels, for example, summary of questions and retweets.
Twitter corpus for Sentiment Analysis from a class (cs224n)at Stanford.
Class page:
https://sites.google.com/site/twittersentimenthelp/for-researchers#Where_is_the_Tweet_corpus_8553
http://www.stanford.edu/~alecmgo/cs224n
LingPipe is tool kit for processing text using computational linguistics. LingPipe is used to do tasks like:
* Find the names of people, organizations or locations in news
* Automatically classify Twitter search results into categories
* Suggest correct spellings of queries
D. Tang, F. Wei, N. Yang, M. Zhou, T. Liu, and B. Qin. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), page 1555--1565. Baltimore, Maryland, Association for Computational Linguistics, (June 2014)
S. Vieweg, A. Hughes, K. Starbird, and L. Palen. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, page 1079--1088. ACM, (2010)