Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/33436
Title: | A software reference architecture for semantic-aware Big Data systems | Authors: | Nadal, Sergi Herrero, Victor Romero, Oscar Abelló, Alberto Franch, Xavier VANSUMMEREN, Stijn Valerio, Danilo |
Issue Date: | 2017 | Publisher: | ELSEVIER SCIENCE BV | Source: | INFORMATION AND SOFTWARE TECHNOLOGY, 90 , p. 75 -92 | Abstract: | Context: Big Data systems are a class of software systems that ingest, store, process and serve massive amounts of heterogeneous data, from multiple sources. Despite their undisputed impact in current society, their engineering is still in its infancy and companies find it difficult to adopt them due to their inherent complexity. Existing attempts to provide architectural guidelines for their engineering fail to take into account important Big Data characteristics, such as the management, evolution and quality of the data.Objective: In this paper, we follow software engineering principles to refine the lambda-architecture, a reference model for Big Data systems, and use it as seed to create Bolster, a software reference architecture (SRA) for semantic-aware Big Data systems.Method: By including a new layer into the lambda-architecture, the Semantic Layer, Bolster is capable of handling the most representative Big Data characteristics (i.e., Volume, Velocity, Variety, Variability and Veracity).Results: We present the successful implementation of Bolster in three industrial projects, involving five organizations. The validation results show high level of agreement among practitioners from all organizations with respect to standard quality factors.Conclusion: As an SRA, Bolster allows organizations to design concrete architectures tailored to their specific needs. A distinguishing feature is that it provides semantic-awareness in Big Data Systems. These are Big Data system implementations that have components to simplify data definition and exploitation. In particular, they leverage metadata (i.e., data describing data) to enable (partial) automation of data exploitation and to aid the user in their decision making processes. This simplification supports the differentiation of responsibilities into cohesive roles enhancing data governance. (C) 2017 Elsevier B.V. All rights reserved. | Keywords: | Big Data;Software reference architecture;Semantic-aware;Data management;Data analysis | Document URI: | http://hdl.handle.net/1942/33436 | ISSN: | 0950-5849 | e-ISSN: | 1873-6025 | DOI: | 10.1016/j.infsof.2017.06.001 | ISI #: | WOS:000405046400006 | Category: | A1 | Type: | Journal Contribution |
Appears in Collections: | Research publications |
Show full item record
WEB OF SCIENCETM
Citations
42
checked on Oct 19, 2024
Page view(s)
28
checked on Jul 31, 2023
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.