Inproceedings,

Towards Generating ETL Processes for Incremental Loading

T. Jörg, and S. Deßloch.
Proceedings of the 2008 International Symposium on Database Engineering &\#38; Applications, page 101--110. New York, NY, USA, ACM, (2008)
DOI: 10.1145/1451940.1451956

Abstract

Extract, Transform, and Load (ETL) processes physically integrate data from multiple, heterogeneous sources in a central repository referred to as data warehouse. Physically integrated data gets stale when source data is changed, hence periodic refreshes are required. For efficiency reasons data warehouses are typically refreshed incrementally, i.e. changes are captured at the sources and propagated to the data warehouse on a regular basis. Dedicated ETL processes referred to as incremental load processes are employed to extract changes from the sources, propagate the changes, and refresh the data warehouse incrementally. Changes required in the data warehouse are inferred from changes captured at the sources during change propagation. The creation of incremental load processes is a complex task reserved to trained ETL programmers. In this paper we review existing Change Data Capture (CDC) techniques and discuss limitations of different approaches. We further review existing techniques for refreshing data warehouses. We then present an approach for generating incremental load processes from abstract schema mappings.

BibTeX key: jorg2008
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 2008 International Symposium on Database Engineering &\#38; Applications
year: 2008
pages: 101--110
publisher: ACM
series: IDEAS '08
acmid: 1451956
isbn: 978-1-60558-188-0
numpages: 10
location: Coimbra, Portugal
DOI: 10.1145/1451940.1451956
url: http://doi.acm.org/10.1145/1451940.1451956

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@inproceedings{jorg2008, abstract = {Extract, Transform, and Load (ETL) processes physically integrate data from multiple, heterogeneous sources in a central repository referred to as data warehouse. Physically integrated data gets stale when source data is changed, hence periodic refreshes are required. For efficiency reasons data warehouses are typically refreshed incrementally, i.e. changes are captured at the sources and propagated to the data warehouse on a regular basis. Dedicated ETL processes referred to as incremental load processes are employed to extract changes from the sources, propagate the changes, and refresh the data warehouse incrementally. Changes required in the data warehouse are inferred from changes captured at the sources during change propagation. The creation of incremental load processes is a complex task reserved to trained ETL programmers. In this paper we review existing Change Data Capture (CDC) techniques and discuss limitations of different approaches. We further review existing techniques for refreshing data warehouses. We then present an approach for generating incremental load processes from abstract schema mappings.}, acmid = {1451956}, added-at = {2019-10-16T20:39:39.000+0200}, address = {New York, NY, USA}, author = {J\"{o}rg, Thomas and De\ssloch, Stefan}, biburl = {https://www.bibsonomy.org/bibtex/217743ac6b3a255be7e501b37e01391c5/mialhoma}, booktitle = {Proceedings of the 2008 International Symposium on Database Engineering \&\#38; Applications}, description = {Towards generating ETL processes for incremental loading}, doi = {10.1145/1451940.1451956}, interhash = {d55245ac97061c753f1549c66bca429f}, intrahash = {17743ac6b3a255be7e501b37e01391c5}, isbn = {978-1-60558-188-0}, keywords = {dwa}, location = {Coimbra, Portugal}, numpages = {10}, pages = {101--110}, publisher = {ACM}, series = {IDEAS '08}, timestamp = {2019-10-16T20:39:39.000+0200}, title = {Towards Generating ETL Processes for Incremental Loading}, url = {http://doi.acm.org/10.1145/1451940.1451956}, year = 2008 }

BibSonomy

Towards Generating ETL Processes for Incremental Loading

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on