Discovering Data Lineage from Data Warehouse Procedures

Kalle Tomingas, Priit Järv, Tanel Tammet

2016

Abstract

We present a method to calculate component dependencies and data lineage from the database structure and a large set of associated procedures and queries, independently of actual data in the data warehouse. The method relies on the probabilistic estimation of the impact of data in queries. We present a rule system supporting the efficient calculation of the transitive closure. The dependencies are categorized, aggregated and visualized to address various planning and decision support problems. System performance is evaluated and analysed over several real-life datasets.

Download


Paper Citation


in Harvard Style

Tomingas K., Järv P. and Tammet T. (2016). Discovering Data Lineage from Data Warehouse Procedures . In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 101-110. DOI: 10.5220/0006054301010110

in Bibtex Style

@conference{kdir16,
author={Kalle Tomingas and Priit Järv and Tanel Tammet},
title={Discovering Data Lineage from Data Warehouse Procedures},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={101-110},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006054301010110},
isbn={978-989-758-203-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Discovering Data Lineage from Data Warehouse Procedures
SN - 978-989-758-203-5
AU - Tomingas K.
AU - Järv P.
AU - Tammet T.
PY - 2016
SP - 101
EP - 110
DO - 10.5220/0006054301010110