Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/30877
Title: | Towards more structured data quality assessment in the process mining field: the DaQAPO package | Authors: | MARTIN, Niels VAN HOUDT, Greg JANSSENSWILLEN, Gert |
Issue Date: | 2020 | Source: | book of abstract for European R Users Meeting 2020, | Abstract: | Process mining is a research field focusing on the extraction of insights on business processes from process execution data embedded in files called event logs. Event logs are a specific data structure originating from information systems supporting a business process such as an Enterprise Resource Planning System or a Hospital Information System. As a research field, process mining predominantly focused on the development of algorithms to retrieve process insights from an event log. However, consistent with the "garbage in-garbage out"-principle, the reliability of the algorithm's outcomes strongly depends upon the data quality of the event log. It has been widely recognized that real-life event logs typically suffer from a multitude of data quality issues, stressing the need for thorough data quality assessment. Currently, event log quality is often judged on an ad-hoc basis, entailing the risk that important issues are overlooked. Hence, the need for a more structured data quality assessment approach within the process mining field. Therefore, the DaQAPO package has been developed, which is an acronym for Data Quality Assessment of Process-Oriented data. It offers an extensive set of functions to automatically identify common data quality problems in process execution data. In this way, it is the first R-package which supports systematic data quality assessment for event data. | Document URI: | http://hdl.handle.net/1942/30877 | Category: | C2 | Type: | Proceedings Paper |
Appears in Collections: | Research publications |
Show full item record
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.