Inproceedings,

Managing and Compiling Data Dependencies for Semantic Applications using Databus Client

J. Frey, F. Götz, and M. Hofer.
15th International Conference on Metadata and Semantics Research, Springer, (2021)

Full text

Abstract

Realizing a data-driven application or workflow, that consumes bulk data files from the Web, poses a multitude of challenges ranging from sustainable dependency management supporting automatic updates, to dealing with compression, serialization format, and data model variety. In this work, we present an approach using the novel Databus Client, which is backed by the DBpedia Databus - a data asset release management platform inspired by paradigms and techniques successfully applied in software release management. The approach shifts effort from the publisher to the client while making data consumption and dependency management easier and more unified as a whole. The client leverages 4 layers (download, compression, format, and mapping) that tackle individual challenges and offers a fully automated way for extracting and compiling data assets from the DBpedia Databus, given one command and a flexible dependency configuration using SPARQL or Databus Collections. The current vertical-sliced implementation supports format conversion within as well as mapping between RDF triples, RDF quads, and CSV/TSV files. We developed an evaluation strategy for the format conversion and mapping functionality using so-called round trip tests.

BibTeX key: frey2021databus
entry type: inproceedings
booktitle: 15th International Conference on Metadata and Semantics Research
year: 2021
publisher: Springer
Document: http://svn.aksw.org/papers/2021/databus-client/public.pdf

BibSonomy

Managing and Compiling Data Dependencies for Semantic Applications using Databus Client

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on