This software is a translation into C++ of the excellent Webgraph library by P. Boldi and S. Vigna. The original library, written in Java, is easy to use but hampered by some requirements of the Java virtual machine. This C++ translation attempts to preserve much of the ease of use (through integration with the Boost Graph Library), but bypass requirements imposed by a virtual machine.
I am obviously a sucka for lists, there's one with 400+ links. I am doing everything humanly possible to try, review (reviews are on the way) and categorize the gazillion startups and old hags alike. I have updated the so called complete list with some cool thumbs (check it out).
This page provides a large hyperlink graph for public download. The graph has been extracted from the Common Crawl 2012 web corpus and covers 3.5 billion web pages and 128 billion hyperlinks between these pages. To the best of our knowledge, this graph is the largest hyperlink graph that is available to the public outside companies such as Google, Yahoo, and Microsoft. Below we provide instructions on how to download the graph as well as basic statistics about its topology.
The Ubiquitous Web Applications Working Group
seeks to simplify the creation of distributed Web applications
involving a wide diversity of devices, including desktop computers,
office equipment, home media appliances, mobile devices (phones),
physical sensors and effectors (including RFID and barcodes).
This will be achieved by building upon existing work on device
independent authoring and delivery contexts by the former DIWG, together with
new work on remote eventing, device coordination and intent-based
events.
Truthy is a research project that helps you understand how memes spread online. We collect tweets from Twitter and analyze them. With our statistics, images, movies, and interactive data, you can explore these dynamic networks.
Our first application was the study of astroturf campaigns in elections. Currently, we're extending our focus to several themes. Browse the collection on the Memes page. Check out the Movie tool to browse and create animations of meme networks.
This document is designed as being a simple but comprehensive introductory publication for anybody trying to get into the Semantic Web: from beginners through to long time hackers.
The Elsevier Grand Challenge: Knowledge Enhancement in the Life Sciences is a contest created to improve the way scientific information is communicated and used. The contest invites members of the scientific community to describe and prototype a tool to improve the interpretation and identification of meaning in (online) journals and text databases relating to the life sciences. Specifically we are looking for new ways to:
Now, the real breakthru of folksonomical-based systems like del.icio.us or flickr is not the lack of structure or commitee-based design in the ontological space, but is the idea that if two people use the same term, it's more probable than they meant the same thing than they meant different things.
B. Berendt, N. Glance, and A. Hotho (Eds.) Workshop at 18th Europ. Conf. on Machine Learning (ECML'08) / 11th Europ. Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD'08), (2008)
T. Joachims, D. Freitag, and T. Mitchell. Proceedings of the International Joint Conference on
Artificial Intelligence (IJCAI), page 770--777. San Francisco, CA, Morgan Kaufmann, (1997)
R. Cooley, B. Mobasher, and J. Srivastava. Proceedings of the Ninth IEEE International Conference on Tools with Artificial Intelligence (ICTAI'97), IEEE Computer Society, (November 1997)
H. Dai, and B. Mobasher. Proceedings of the Second Semantic Web Mining Workshop at PKDD 2001, km.aifb.uni-karlsruhe.de/semwebmine2002/papers/full/bamshad.pdf, (August 2002)