TAIPAN: Automatic Property Mapping for Tabular Data
I. Ermilov and A. Ngonga Ngomo. Proceedings of the 20th International Conference on Knowledge Engineering and Knowledge Management (EKAW'2016), (2016)
The Web encompasses a significant amount of knowledge hidden in entity-attribute tables. Bridging the gap between these tables and the Web of Data thus has the potential to facilitate a large number of applications, including the augmentation of knowledge bases from tables, the search for related tables and the completion of tables using knowledge bases. Computing such bridges is impeded by the poor accuracy of automatic property mapping, the lack of approaches for the discovery of subject columns and the sheer size of table corpora. In addition, most tables on the Web do not contain RDFa or any other form of semantic annotation. We propose TAIPAN, a novel approach for recovering the semantics of tables. Our approach begins by identifying subject columns using a combination of structural and semantic features. It then maps binary relations inside a table to predicates from a given knowledge base. Our solution thereby supports both table expansion and knowledge base augmentation. We evaluate our approach on a table dataset generated from real RDF data and on a manually curated version of the T2D gold standard. Our results suggest that we outperform the state of the art by up to 85% F-measure.
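The two steps named in the abstract (subject column identification, then mapping a column pair to a knowledge base predicate) can be illustrated with a toy sketch. This is not TAIPAN's actual feature set or algorithm, which the abstract does not specify: the uniqueness-plus-position score, the majority vote over matching triples, and the names `pick_subject_column` and `map_property` are all assumptions made for illustration, and the knowledge base is reduced to an in-memory set of string triples.

```python
from collections import Counter
from typing import List, Optional, Set, Tuple

Table = List[List[str]]          # rows of cell strings, header row excluded
Triple = Tuple[str, str, str]    # (subject, predicate, object) as plain strings


def subject_column_scores(table: Table) -> List[float]:
    """Assumed heuristic: score = share of distinct textual cells + small leftmost bonus."""
    n_rows = len(table)
    n_cols = len(table[0]) if table else 0
    scores = []
    for c in range(n_cols):
        cells = [row[c].strip() for row in table if row[c].strip()]
        # Ignore purely numeric cells: they rarely name an entity.
        textual = [v for v in cells if not v.replace(".", "", 1).isdigit()]
        uniqueness = len(set(textual)) / n_rows if n_rows else 0.0
        position_bonus = 0.1 / (c + 1)
        scores.append(uniqueness + position_bonus)
    return scores


def pick_subject_column(table: Table) -> int:
    """Return the index of the highest-scoring column."""
    scores = subject_column_scores(table)
    return max(range(len(scores)), key=scores.__getitem__)


def map_property(table: Table, subject_col: int, object_col: int,
                 kb: Set[Triple]) -> Optional[str]:
    """Assumed heuristic: majority vote over predicates linking each row's cell pair."""
    votes: Counter = Counter()
    for row in table:
        s, o = row[subject_col].strip(), row[object_col].strip()
        for (ks, p, ko) in kb:
            if ks == s and ko == o:
                votes[p] += 1
    return votes.most_common(1)[0][0] if votes else None
```

In a realistic setting the exact string match would be replaced by entity linking against the knowledge base, and the score would combine more features than the two used here.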