The data set contains an extended set of event logs for evaluating multi-perspective trace clustering approaches in process mining. Event logs were randomly generated from 5 different process models of different complexity levels. The attribute "cluster" refers to the ground truth label. Clusters can only be correctly identified when considering both, the data and the control flow perspective (attributes and trace).
The extended dataset contains more noise levels as well as a different generation process regarding the case attributes.