PeerSpective uses the shared interest between you and your friends to help guide Web search. When you run a Google search, PeerSpective includes extra results which may be relevant to your question alongside the results from Google.
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. The collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.
Mendeley Desktop is free academic software for managing and sharing research papers. Mendeley Web lets you access your papers online, discover research trends and connect to like-minded researchers.
Save time managing your status updates for all your social networks. Post updates by group to separate work and play. Aggregate and organize your friends' feeds.
MapTube is a website for sharing maps created with the GMapCreator software, released by CASA. The maps themselves are not stored on the server, only a link to another site on the Internet where the map is published. When maps are shared, information about what the map is and what it shows is entered by the owner, and this is stored on the server along with the link to where the map is published. The raw data is never stored on the Internet, as the maps comprise the pre-rendered tiles made by the GMapCreator, so this is a safe way of sharing a map without giving away the raw data used to create it. MapTube is a product of the work undertaken by the Geographic Virtual Urban Environments (GeoVUE) team based at University College London's Centre for Advanced Spatial Analysis (CASA). GeoVUE is a research Node of the National Centre for e-Social Science (NCeSS). NCeSS (http://www.ncess.ac.uk/) is funded by the Economic and Social Research Council (ESRC).
The Web Ecology Project is an interdisciplinary research group based in Boston, Massachusetts, focusing on using large-scale data mining to analyze the system-wide flows of culture and community online. In addition to the task of understanding culture on the web through quantitative research and rigorous experimentation, we are attempting to build a science around community management and social media. To that end, we are building tools and conducting research that enable planners to launch data-driven campaigns backed by network science. Twitter archive.
The datasets available on the website were mostly collected (scraped) for the purposes of our research. Social networks: online social networks, edges represent interactions between people. Communication networks: email communication networks with edges representing communication. Citation networks: nodes represent papers, edges represent citations. Other network types are also available.
Setting up ThinkUp takes a couple of minutes if you know how to install a web app. Once you're running, there's lots you can do! For example, archive all of your social data for your Twitter and Facebook accounts in your own database and make it easy to search, sort and filter.
Do you want to archive tweets from your conference? Maybe archive trending hashtags or keywords for historical or analysis purposes? Maybe save your own personal tweets? Twapper Keeper is here to help! How does it work? 1. Create a new Twapper Keeper archive based on a hashtag, keyword, or person. 2. Tell your friends about the archive. 3. Read, track, export, and analyze as much as you want!
Trust network datasets (social network datasets) are datasets in which there are entities (users, peers, servers, robots, ...) and social relationships, each connecting two of these entities. The goal is to collect as many datasets as possible in one single place (this wiki) and release them in some standard formats for easy use with software also collected in this wiki, and with a reasonable license on them.