What is Semantic Similarity? Definition of Semantic Similarity: A concept whereby a set of documents or terms within term lists are assigned a metric based on the likeness of their meaning/semantic content. ( Wikipedia, 2012e ).
The Web Data Commons project extracts structured data from the Common Crawl, the largest web corpus available to the public, and provides the extracted data for public download in order to support researchers and companies in exploiting the wealth of information that is available on the Web.