Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/33408
Title: MDM: Governing Evolution in Big Data Ecosystems
Authors: Nadal, Sergi
Abelló, Alberto
Romero, Oscar
VANSUMMEREN, Stijn 
Vassiliadis, Panos
Issue Date: 2018
Publisher: OpenProceedings.org
Source: Proceedings of the 21st International Conference on Extending Database Technology, OpenProceedings.org, p. 682 -685
Abstract: On-demand integration of multiple data sources is a critical requirement in many Big Data settings. This has been coined as the data variety challenge, which refers to the complexity of dealing with an heterogeneous set of data sources to enable their integrated analysis. In Big Data settings, data sources are commonly represented by external REST APIs, which provide data in their original format and continously apply changes in their structure (i.e., schema). Thus, data analysts face the challenge to integrate such multiple sources, and then continuosly adapt their analytical processes to changes in the schema. To address this challenges, in this paper, we present the Metadata Management System, shortly MDM, a tool that supports data stewards and analysts to manage the integration and analysis of multiple heterogeneous sources under schema evolution. MDM adopts a vocabulary-based integration-oriented ontology to conceptualize the domain of interest and relies on local-as-view mappings to link it with the sources. MDM provides user-friendly mechanisms to manage the ontology and mappings. Finally, a query rewriting algorithm ensures that queries posed to the ontology are correctly resolved to the sources in the presence of multiple schema versions, a transparent process to data analysts. On-site, we will showcase using real-world examples how MDM facilitates the management of multiple evolving data sources and enables its integrated analysis.
Document URI: http://hdl.handle.net/1942/33408
Link to publication/dataset: https://openproceedings.org/2018/conf/edbt/paper-307.pdf
ISBN: 978-3-89318-078
DOI: 10.5441/002/edbt.2018.84
Category: C1
Type: Proceedings Paper
Appears in Collections:Research publications

Show full item record

Page view(s)

32
checked on Nov 7, 2023

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.