Abstract
Modern business processes are often characterized by continuous change, which can lead to bias in the results of process mining techniques that assume a static process. This bias is caused by concept drift, which can manifest in many forms and affect various process perspectives. Current research on concept drift in process mining has focused on drift detection techniques in the control-flow perspective, with limited capabilities for comprehensive dynamic profiling of event logs. To address this gap, this paper presents the DyLoPro framework, a generic approach that facilitates the exploration of event log dynamics over time using visual analytics. The framework caters to all types of event logs and allows for the exploration of event log dynamics from various process perspectives, both individually and combined with the performance perspective. Additionally, the framework is accompanied by an efficient and user-friendly Python library, rendering it a valuable instrument for both researchers and practitioners. A case study using large real-life event logs demonstrates the effectiveness of the framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
What a good value for p constitutes, depends on the arrival frequency of cases. Each resulting sublog should be populated enough such that the derived measures are representative, but not too populated, as aggregating over a too long period and/or over too many cases could level out interesting trends and patterns.
- 2.
For our proposed set of mapping functions displayed in Table 1, \(\xi \) is either 1 or 2. However, this framework is rather meant as a starting point, and its verbosity can be extended at the discretion of the user.
- 3.
The dimensionality K, which is constant for every vector \(\boldsymbol{M_i}\) \((i \in [1, \dots , \xi ])\), is completely determined by the concept-specific configuration parameters \(\boldsymbol{\theta }^{C^o}\). We do not include this in the notation for simplicity.
- 4.
For the DFRs concept, two measures are computed. One giving the relative fraction of cases in which \( dfr _k\) occurs at least once, and another one giving the aggregated amount of occurrences per case.
- 5.
Annotated notebooks with the most interesting visualizations for each event log can be found here: https://github.com/BrechtWts/DyLoPro_CaseStudies.
- 6.
- 7.
References
van der Aalst, W., et al.: Process mining manifesto. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011. LNBIP, vol. 99, pp. 169–194. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28108-2_19
Adams, J.N., van Zelst, S.J., Quack, L., Hausmann, K., van der Aalst, W.M.P., Rose, T.: A framework for explainable concept drift detection in process mining. In: Polyvyanyy, A., Wynn, M.T., Van Looy, A., Reichert, M. (eds.) BPM 2021. LNCS, vol. 12875, pp. 400–416. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85469-0_25
Denisov, V., Fahland, D., van der Aalst, W.M.P.: Unbiased, fine-grained description of processes performance from event data. In: Weske, M., Montali, M., Weber, I., vom Brocke, J. (eds.) BPM 2018. LNCS, vol. 11080, pp. 139–157. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98648-7_9
van Dongen, B.F., de Medeiros, A.K.A., Verbeek, H.M.W., Weijters, A.J.M.M., van der Aalst, W.M.P.: The ProM framework: a new era in process mining tool support. In: Ciardo, G., Darondeau, P. (eds.) ICATPN 2005. LNCS, vol. 3536, pp. 444–454. Springer, Heidelberg (2005). https://doi.org/10.1007/11494744_25
Bose, R.P.J.C., van der Aalst, W.M.P., Žliobaitė, I., Pechenizkiy, M.: Handling concept drift in process mining. In: Mouratidis, H., Rolland, C. (eds.) CAiSE 2011. LNCS, vol. 6741, pp. 391–405. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21640-4_30
Jebb, A., Parrigon, S., Woo, S.E.: Exploratory data analysis as a foundation of inductive research. Hum. Resour. Manag. Rev. 27, 265–276 (2016). https://doi.org/10.1016/j.hrmr.2016.08.003
Sato, D.M.V., De Freitas, S.C., Barddal, J.P., Scalabrin, E.E.: A survey on concept drift in process mining. ACM Comput. Surv. 54(9), 1–38 (2021). https://doi.org/10.1145/3472752
Song, M., van der Aalst, W.: Supporting process mining by showing events at a glance. In: Proceedings of 17th Annual Workshop on Information Technologies and Systems (WITS 2007), pp. 139–145 (2007)
Teinemaa, I., Dumas, M., La Rosa, M., Maggi, F.M.: Outcome-oriented predictive process monitoring: review and benchmark. ACM Trans. Knowl. Discov. Data 13(2), 1–57 (2019). https://doi.org/10.1145/3301300
Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Boston (1977)
Yeshchenko, A., Di Ciccio, C., Mendling, J., Polyvyanyy, A.: Comprehensive process drift detection with visual analytics. In: Laender, A.H.F., Pernici, B., Lim, E.-P., de Oliveira, J.P.M. (eds.) ER 2019. LNCS, vol. 11788, pp. 119–135. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33223-5_11
Zerbato, F., Soffer, P., Weber, B.: Initial insights into exploratory process mining practices. In: Polyvyanyy, A., Wynn, M.T., Van Looy, A., Reichert, M. (eds.) BPM 2021. LNBIP, vol. 427, pp. 145–161. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85440-9_9
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Wuyts, B., Weytjens, H., vanden Broucke, S., De Weerdt, J. (2023). DyLoPro: Profiling the Dynamics of Event Logs. In: Di Francescomarino, C., Burattin, A., Janiesch, C., Sadiq, S. (eds) Business Process Management. BPM 2023. Lecture Notes in Computer Science, vol 14159. Springer, Cham. https://doi.org/10.1007/978-3-031-41620-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-41620-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-41619-4
Online ISBN: 978-3-031-41620-0
eBook Packages: Computer ScienceComputer Science (R0)