Discovering Data Lineage from Data Warehouse Procedures
Kalle Tomingas, Priit Järv, Tanel Tammet
2016
Abstract
We present a method to calculate component dependencies and data lineage from the database structure and a large set of associated procedures and queries, independently of actual data in the data warehouse. The method relies on the probabilistic estimation of the impact of data in queries. We present a rule system supporting the efficient calculation of the transitive closure. The dependencies are categorized, aggregated and visualized to address various planning and decision support problems. System performance is evaluated and analysed over several real-life datasets.
Paper Citation
in Harvard Style
Tomingas K., Järv P. and Tammet T. (2016). Discovering Data Lineage from Data Warehouse Procedures. In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 101-110. DOI: 10.5220/0006054301010110
in Bibtex Style
@conference{kdir16,
author={Kalle Tomingas and Priit Järv and Tanel Tammet},
title={Discovering Data Lineage from Data Warehouse Procedures},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={101-110},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006054301010110},
isbn={978-989-758-203-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Discovering Data Lineage from Data Warehouse Procedures
SN - 978-989-758-203-5
AU - Tomingas K.
AU - Järv P.
AU - Tammet T.
PY - 2016
SP - 101
EP - 110
DO - 10.5220/0006054301010110