A Generic Trace Ordering Framework for Incremental Process Discovery

Schuster, Daniel; Domnitsch, Emanuel; van Zelst, Sebastiaan J.; van der Aalst, Wil M. P.

doi:10.1007/978-3-031-01333-1_21

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13205))

Included in the following conference series:

International Symposium on Intelligent Data Analysis

1321 Accesses
3 Altmetric

Abstract

Executing operational processes generates valuable event data in organizations’ information systems. Process discovery describes the learning of process models from such event data. Incremental process discovery algorithms allow learning a process model from event data gradually. In this context, process behavior recorded in event data is incrementally fed into the discovery algorithm that integrates the added behavior to a process model under construction. In this paper, we investigate the open research question of the impact of the ordering of incrementally selected process behavior on the quality, i.e., recall and precision, of the learned process models. We propose a framework for defining ordering strategies for traces, i.e., observed process behavior, for incremental process discovery. Further, we provide concrete instantiations of this framework. We evaluate different trace-ordering strategies on real-life event data. The results show that trace-ordering strategies can significantly improve the quality of the learned process models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Incremental Discovery of Process Models Using Trace Fragments

Discovery of Frequent Episodes in Event Logs

Incremental Discovery of Hierarchical Process Models

Notes

1.
Note that per trace that is incrementally added, various LCAs might be changed. However, without fully executing the incremental process discovery approach for a trace, we only can compute the first LCA that must be changed. Therefore, there is a risk that the first LCA will be rated as good based on the strategy, but that further LCAs will have to be changed, which the strategy would rate as bad.

References

Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering. IEEE Comput. Soc. Press (1995). https://doi.org/10.1109/ICDE.1995.380415
Augusto, A., et al.: Automated discovery of process models from event logs: review and benchmark. IEEE Trans. Knowl. Data Eng. 31(4), 686–705 (2019). https://doi.org/10.1109/TKDE.2018.2841877
Article Google Scholar
Carmona, J., van Dongen, B., Solti, A., Weidlich, M.: Conformance Checking. Springer, Berlin (2018). https://doi.org/10.1007/978-3-319-99414-7
Book Google Scholar
Conforti, R., La Rosa, M., ter Hofstede, A.H.: Filtering out infrequent behavior from business process event logs. IEEE Trans. Knowl. Data Eng. 29(2), 300–314 (2017). https://doi.org/10.1109/TKDE.2016.2614680
Article Google Scholar
Cornuéjols, A.: Getting order independence in incremental learning. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 196–212. Springer, Heidelberg (1993). https://doi.org/10.1007/3-540-56602-3_137
Chapter Google Scholar
Dixit, P.M., Buijs, J.C.A.M., van der Aalst, W.M.P.: Prodigy : human-in-the-loop process discovery. In: 12th International Conference on Research Challenges in Information Science (RCIS). IEEE (2018). https://doi.org/10.1109/RCIS.2018.8406657
Dumas, M., La Rosa, M., Mendling, J., Reijers, H.A.: Fundamentals of Business Process Management. Springer, Berlin Heidelberg (2018). https://doi.org/10.1007/978-3-662-56509-4
Book Google Scholar
Fahland, D., van der Aalst, W.M.: Model repair - aligning process models to reality. Inf. Syst. 47, 220–243 (2015). https://doi.org/10.1016/j.is.2013.12.007
Article Google Scholar
Felix Mannhardt: Sepsis cases - event log. https://doi.org/10.4121/uuid:915d2bfb-7e84-49ad-a286-dc35f063a460
Ferilli, S., Esposito, F.: A logic framework for incremental learning of process models. Fundam. Inf. 128, 413–443 (2013). https://doi.org/10.3233/FI-2013-951
Article MathSciNet MATH Google Scholar
Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs - a constructive approach. In: Colom, J.-M., Desel, J. (eds.) PETRI NETS 2013. LNCS, vol. 7927, pp. 311–329. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38697-8_17
Chapter Google Scholar
M. (Massimiliano) de Leoni, Felix Mannhardt: Road traffic fine management process. https://doi.org/10.4121/uuid:270fd440-1057-4fb9-89a9-b699b47990f5
MacGregor, J.N.: The effects of order on learning classifications by example: heuristics for finding the optimal order. Artif. Intell. 34(3), 361–370 (1988). https://doi.org/10.1016/0004-3702(88)90065-3
Article Google Scholar
Schuster, D., van Zelst, S.J., van der Aalst, W.M.P.: Incremental discovery of hierarchical process models. In: Dalpiaz, F., Zdravkovic, J., Loucopoulos, P. (eds.) RCIS 2020. LNBIP, vol. 385, pp. 417–433. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-50316-1_25
Chapter Google Scholar
Schuster, D., van Zelst, S.J., van der Aalst, W.M.P.: Cortado—an interactive tool for data-driven process discovery and modeling. In: Buchs, D., Carmona, J. (eds.) PETRI NETS 2021. LNCS, vol. 12734, pp. 465–475. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-76983-3_23
Chapter Google Scholar
Schuster, D., van Zelst, S.J., van der Aalst, W.M.: Utilizing domain knowledge in data-driven process discovery: a literature review. Comput. Ind. 137, 103612 (2022). https://doi.org/10.1016/j.compind.2022.103612
Article Google Scholar
van der Aalst, W.M.P.: Process Mining: Data Science in Action. Springer, Berlin Heidelberg (2016). https://doi.org/10.1007/978-3-662-49851-4
Book Google Scholar
van der Aalst, W.M.P., Adriansyah, A., van Dongen, B.: Replaying history on process models for conformance checking and performance analysis. WIREs Data Min. Knowl. Disc. 2(2), 182–192 (2012). https://doi.org/10.1002/widm.1045
Article Google Scholar
van Dongen, B.F.: BPI challenge (2020). https://doi.org/10.4121/uuid:52fb97d4-4588-43c9-9d04-3604d4613b51

Download references

Author information

Authors and Affiliations

Fraunhofer Institute for Applied Information Technology FIT, Sankt Augustin, Germany
Daniel Schuster, Sebastiaan J. van Zelst & Wil M. P. van der Aalst
RWTH Aachen University, Aachen, Germany
Daniel Schuster, Emanuel Domnitsch, Sebastiaan J. van Zelst & Wil M. P. van der Aalst

Authors

Daniel Schuster
View author publications
You can also search for this author in PubMed Google Scholar
Emanuel Domnitsch
View author publications
You can also search for this author in PubMed Google Scholar
Sebastiaan J. van Zelst
View author publications
You can also search for this author in PubMed Google Scholar
Wil M. P. van der Aalst
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel Schuster .

Editor information

Editors and Affiliations

University of Rennes, Rennes, France
Tassadit Bouadi
University of Rennes, Rennes, France
Elisa Fromont
University of Munich, LMU, Munich, Germany
Eyke Hüllermeier

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schuster, D., Domnitsch, E., van Zelst, S.J., van der Aalst, W.M.P. (2022). A Generic Trace Ordering Framework for Incremental Process Discovery. In: Bouadi, T., Fromont, E., Hüllermeier, E. (eds) Advances in Intelligent Data Analysis XX. IDA 2022. Lecture Notes in Computer Science, vol 13205. Springer, Cham. https://doi.org/10.1007/978-3-031-01333-1_21

Download citation

DOI: https://doi.org/10.1007/978-3-031-01333-1_21
Published: 07 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-01332-4
Online ISBN: 978-3-031-01333-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics