The Web Data Commons project extracts structured data from the Common Crawl, the largest web corpus available to the public, and provides the extracted data for public download in order to support researchers and companies in exploiting the wealth of information that is available on the Web.
RDFa is an extension to HTML5 that helps you markup things like People, Places, Events, Recipes and Reviews. Search Engines and Web Services use this markup to generate better search listings and give you better visibility on the Web, so that people can find your website more easily.
"Important is that you can use GoodRelations to create a small data package that describes your products and their features and prices, your stores and opening hours, payment options and the like. You simply paste this data package into your Web page using W3C's RDFa format."
J. Neubert. Proceedings of the Linked Data on the Web Workshop (LDOW2009), Madrid, Spain, April 20, 2009, CEUR Workshop Proceedings, 538, (2009)LDOW2009, April 20, 2009, Madrid, Spain.