Please use this identifier to cite or link to this item:
Title: Towards more structured data quality assessment in the process mining field: the DaQAPO package
Authors: MARTIN, Niels 
Issue Date: 2020
Source: book of abstract for European R Users Meeting 2020,
Abstract: Process mining is a research field focusing on the extraction of insights on business processes from process execution data embedded in files called event logs. Event logs are a specific data structure originating from information systems supporting a business process such as an Enterprise Resource Planning System or a Hospital Information System. As a research field, process mining predominantly focused on the development of algorithms to retrieve process insights from an event log. However, consistent with the "garbage in-garbage out"-principle, the reliability of the algorithm's outcomes strongly depends upon the data quality of the event log. It has been widely recognized that real-life event logs typically suffer from a multitude of data quality issues, stressing the need for thorough data quality assessment. Currently, event log quality is often judged on an ad-hoc basis, entailing the risk that important issues are overlooked. Hence, the need for a more structured data quality assessment approach within the process mining field. Therefore, the DaQAPO package has been developed, which is an acronym for Data Quality Assessment of Process-Oriented data. It offers an extensive set of functions to automatically identify common data quality problems in process execution data. In this way, it is the first R-package which supports systematic data quality assessment for event data.
Document URI:
Category: C2
Type: Proceedings Paper
Appears in Collections:Research publications

Show full item record

Page view(s)

checked on Sep 7, 2022


checked on Sep 7, 2022

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.