Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/30877
Title: | Towards more structured data quality assessment in the process mining field: the DaQAPO package | Authors: | MARTIN, Niels VAN HOUDT, Greg JANSSENSWILLEN, Gert |
Issue Date: | 2020 | Source: | book of abstract for European R Users Meeting 2020, | Abstract: | Process mining is a research field focusing on the extraction of insights on business processes from process execution data embedded in files called event logs. Event logs are a specific data structure originating from information systems supporting a business process such as an Enterprise Resource Planning System or a Hospital Information System. As a research field, process mining predominantly focused on the development of algorithms to retrieve process insights from an event log. However, consistent with the "garbage in-garbage out"-principle, the reliability of the algorithm's outcomes strongly depends upon the data quality of the event log. It has been widely recognized that real-life event logs typically suffer from a multitude of data quality issues, stressing the need for thorough data quality assessment. Currently, event log quality is often judged on an ad-hoc basis, entailing the risk that important issues are overlooked. Hence, the need for a more structured data quality assessment approach within the process mining field. Therefore, the DaQAPO package has been developed, which is an acronym for Data Quality Assessment of Process-Oriented data. It offers an extensive set of functions to automatically identify common data quality problems in process execution data. In this way, it is the first R-package which supports systematic data quality assessment for event data. | Document URI: | http://hdl.handle.net/1942/30877 | Category: | C2 | Type: | Proceedings Paper |
Appears in Collections: | Research publications |
Show full item record
Page view(s)
138
checked on Sep 7, 2022
Download(s)
72
checked on Sep 7, 2022
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.