Misc,

Instance Matching Benchmarks for Linked Data

, , , and .
ISWC, 2014, Tutorial, (October 2014)

Abstract

With the continuously increasing number of datasets published in the Web of Data and form part of the Linked Open Data Cloud, it becomes more and more essential to identify resources that correspond to the same real world object in order to interlink web resources and set the basis for large-scale data integration. This requirement becomes apparent in a multitude of domains ranging from science (marine research, biology, astronomy, pharmacology) to semantic publishing and cultural domains. In this context, instance matching (also referred to as record linkage 16, duplicate detection 3 entity resolution 2, and object identification in the context of databases 18) is of crucial importance. It is though essential at this point to develop, along with instance and entity matching systems, benchmarks to determine the weak and strong points of those systems, as well as their overall quality in order to support users in deciding the system to use for their needs. Hence, well defined, and good quality benchmarks are important for comparing the performance of the developed instance matching systems. In this tutorial we aim at: discussing the state-of-the-art instance matching benchmarks presenting the benchmark design principles providing an analysis of the performance results of instance matching systems for the presented benchmarks presenting the research directions that should be exploited for the creation of novel benchmarks to answer the needs of the Linked Data paradigm.

Tags

Users

  • @fundul

Comments and Reviews