Data Quality in Object-Centric Event Data: IssuesClassification and Evaluation
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Process analysis is concerned with analyzing recorded process executions to validate, monitor, or improve the underlying processes according to business goals. In this context, the paradigm of object-centric event data (OCED) has recently emerged, which relates activity executions to multiple objects instead of a single, pre-determined case. Since OCED can integrate various process perspectives simultaneously, it represents real-life activities more accurately than traditional event logs. Being the input of Object-Centric Process Mining (OCPM), the quality of the data recorded in OCED logs directly influences the results of the process analysis.To ensure reliable outcomes, it is imperative to assess potential quality problems manifesting in the data. While frameworks for such an assessment are available for classic event data, equivalent approaches for assessing quality issues in OCED do not yet exist. This paper provides an analysis and classification of data quality issues in OCED, and compares them to the issues in traditional event data. The classification is both evaluated in terms of its applicability to real-world event logs and through semi-structured interviews with practitioners. Our findings help researchers and practitioners as a road-map for detecting or avoiding data quality problems that hinder the effectiveness of process mining initiatives.