Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/44827
Title: Towards Full Population Testing in Auditing: How Many Process Deviations Should Be Labeled?
Authors: LAGHMOUCH, Manal 
DEPAIRE, Benoit 
JANS, Mieke 
Editors: Lu, X.
Pufahl, L
Song, M.
Issue Date: 2024
Publisher: IEEE
Source: 2024 6th International Conference on Process Mining, ICPM, IEEE, p. 49 -56
Abstract: Conformance checking allows auditors to detect process deviations automatically, resulting in numerous deviations, with only a few being relevant. Identifying notable items amidst this large data set is challenging. Machine learning techniques offer potential solutions, but questions about the required number of labeled deviations and the impact of label quality remain. Our study investigates these factors' effects on Decision Trees and Random Forests. Results demonstrate these models' effectiveness in identifying notable items within imbalanced deviation populations. Achieving 90% precision and recall is feasible with about 400 to 600 labeled deviations, depending on the notable items' population fraction. A higher fraction of notables reduces the required labeled deviations. Varying label quality produced similar results. Additionally, classifications identifying at least 90% notable items are linked to less complex processes.
Keywords: Auditing;Conformance Checking;Deviation Classification;Machine Learning;Notable Item;Process Deviation;Process Mining
Document URI: http://hdl.handle.net/1942/44827
ISBN: 979-8-3503-6503-0
DOI: 10.1109/ICPM63005.2024.10680672
ISI #: 001329238200008
Rights: IEEE Xplore
Category: C1
Type: Proceedings Paper
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
Towards Full Population Testing in Auditing_ How Many Process Deviations Should Be Labeled_.pdf
  Restricted Access
Published version507.64 kBAdobe PDFView/Open    Request a copy
CR_43.pdfPeer-reviewed author version489.31 kBAdobe PDFView/Open
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.