Heuristic Mining Approaches for High-Utility Local Process Models

Dalmas, Benjamin; Tax, Niek; Norre, Sylvie

doi:10.1007/978-3-662-58381-4_2

Heuristic Mining Approaches for High-Utility Local Process Models

Benjamin Dalmas¹⁶,
Niek Tax¹⁷ &
Sylvie Norre¹⁶

Chapter
First Online: 21 November 2018

425 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((TOPNOC,volume 11090))

Abstract

Local Process Models (LPMs) describe structured fragments of process behavior that occur in the context of business processes. Traditional support-based LPM discovery aims to generate a collection of process models that describe highly frequent behavior, in contrast, in High-Utility Local Process Model (HU-LPM) mining the aim is to generate a collection of process models that provide useful business insights according to a specified utility function. Mining LPMs is computationally expensive as the search space depends combinatorially on the number of activities in the business process. In support-based LPM mining, the search space is constrained by leveraging the anti-monotonic property of support (i.e., the apriori principle). We show that there is no property of monotonicity or anti-monotonicity in HU-LPM mining that allows for lossless pruning of the search space. We propose four heuristic methods to explore the search space only partially. We show on a collection of 57 event logs that these heuristics techniques can reduce the size of the search space of HU-LPM mining without much loss in the mined set of HU-LPMs. Furthermore, we analyze the effect of several properties of the event log on the performance of the heuristics through statistical analysis. Additionally, we use predictive modeling with regression trees to explore the relation between combinations of log properties and the effect of the heuristics on the size of the search space and on the quality of the HU-LPMs, where the statistical analysis focuses on the effect of log properties in isolation.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://svn.win.tue.nl/repos/prom/Packages/LocalProcessModelDiscovery/.

References

van der Aalst, W.M.P.: Process Mining: Data Science in Action. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49851-4
Book Google Scholar
van der Aalst, W.M.P., Adriansyah, A., van Dongen, B.F.: Replaying history on process models for conformance checking and performance analysis. WIREs: DMKD 2(2), 182–192 (2012)
Google Scholar
van der Aalst, W.M.P., Weijters, A.J.M.M., Maruster, L.: Workflow mining: discovering process models from event logs. IEEE TKDE 16(9), 1128–1142 (2004)
Google Scholar
Bergenthum, R., Desel, J., Lorenz, R., Mauser, S.: Process mining based on regions of languages. In: Alonso, G., Dadam, P., Rosemann, M. (eds.) BPM 2007. LNCS, vol. 4714, pp. 375–383. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-75183-0_27
Chapter Google Scholar
Buijs, J.C.A.M., van Dongen, B.F., van der Aalst, W.M.P.: A genetic algorithm for discovering process trees. In: IEEE CEC, pp. 1–8. IEEE (2012)
Google Scholar
Buijs, J.C.A.M., Reijers, H.A.: Comparing business process variants using models and event logs. In: Bider, I., Gaaloul, K., Krogstie, J., Nurcan, S., Proper, H.A., Schmidt, R., Soffer, P. (eds.) BPMDS/EMMSAD -2014. LNBIP, vol. 175, pp. 154–168. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-43745-2_11
Chapter Google Scholar
Burges, C., et al.: Learning to rank using gradient descent. In: ICML, pp. 89–96. ACM (2005)
Google Scholar
Dalmas, B., Tax, N., Norre, S.: Heuristics for high-utility local process model mining. In: ATAED. CEUR (2017)
Google Scholar
Dave, U., Patel, S.V., Shah, J., Patel, S.V.: Efficient mining of high utility sequential pattern from incremental sequential dataset. IJCA (2015)
Google Scholar
van Dongen, B.F., de Medeiros, A.K.A., Verbeek, H.M.W., Weijters, A.J.M.M., van der Aalst, W.M.P.: The ProM framework: a new era in process mining tool support. In: Ciardo, G., Darondeau, P. (eds.) ICATPN 2005. LNCS, vol. 3536, pp. 444–454. Springer, Heidelberg (2005). https://doi.org/10.1007/11494744_25
Chapter Google Scholar
Freeman, L.C.: A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977)
Article Google Scholar
Gini, C.: Concentration and dependency ratios. Riv. Di Polit. Econ. 87, 769–792 (1997)
Google Scholar
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. DMKD 15(1), 55–86 (2007)
MathSciNet Google Scholar
International Organization for Standardization: ISO/IEC 19505–1:2012 - Information technology - Object Management Group Unified Modeling Language (OMG UML) - Part 1: Infrastructure (2012)
Google Scholar
Jouck, T., Depaire, B.: PTandLogGenerator: a generator for artificial event data. In: BPM (2016)
Google Scholar
Keller, G., Scheer, A.W., Nüttgens, M.: Semantische Prozeßmodellierung auf der Grundlage" Ereignisgesteuerter Prozeßketten". Inst. für Wirtschaftsinformatik (1992)
Google Scholar
Kendall, M.G.: A new measure of rank correlation. Biometrika 30(1/2), 81–93 (1938)
Article Google Scholar
Leemans, M., van der Aalst, W.M.P.: Discovery of frequent episodes in event logs. In: Ceravolo, P., Russo, B., Accorsi, R. (eds.) SIMPDA 2014. LNBIP, vol. 237, pp. 1–31. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27243-6_1
Chapter Google Scholar
Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs containing infrequent behaviour. In: Lohmann, N., Song, M., Wohed, P. (eds.) BPM 2013. LNBIP, vol. 171, pp. 66–78. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-06257-0_6
Chapter Google Scholar
Lewis, T.G.: Network Science: Theory and Applications. Wiley, Hoboken (2011)
Google Scholar
Liesaputra, V., Yongchareon, S., Chaisiri, S.: Efficient process model discovery using maximal pattern mining. In: Motahari-Nezhad, H.R., Recker, J., Weidlich, M. (eds.) BPM 2015. LNCS, vol. 9253, pp. 441–456. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-23063-4_29
Chapter Google Scholar
Maggi, F.M., Mooij, A.J., van der Aalst, W.M.P.: User-guided discovery of declarative process models. In: IEEE CIDM, pp. 192–199. IEEE (2011)
Google Scholar
Măruşter, L., van Beest, N.R.T.P.: Redesigning business processes: a methodology based on simulation and process mining techniques. KIS 21(3), 267 (2009)
Google Scholar
Object Management Group: Notation (BPMN) version 2.0. OMG Specification (2011)
Google Scholar
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Google Scholar
Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0014140
Chapter Google Scholar
Tax, N., Sidorova, N., van der Aalst, W.M.P., Haakma, R.: Heuristic approaches for generating local process models through log projections. In: CIDM, pp. 1–8. IEEE (2016)
Google Scholar
Tax, N., Bockting, S., Hiemstra, D.: A cross-benchmark comparison of 87 learning to rank methods. IPM 51(6), 757–772 (2015)
Google Scholar
Tax, N., Dalmas, B., Sidorova, N., van der Aalst, W.M.P., Norre, S.: Interest-driven discovery of local process models. arXiv preprint arXiv:1703.07116 (2017)
Tax, N., Sidorova, N., Haakma, R., van der Aalst, W.M.P.: Mining local process models. JIDE 3(2), 183–196 (2016)
Google Scholar
Xing, W., Ghorbani, A.: Weighted PageRank algorithm. In: CNSR. pp. 305–314. IEEE (2004)
Google Scholar
Yin, J., Zheng, Z., Cao, L.: USpan: an efficient algorithm for mining high utility sequential patterns. In: SIGKDD, pp. 660–668. ACM (2012)
Google Scholar
Zida, S., Fournier-Viger, P., Wu, C.-W., Lin, J.C.-W., Tseng, V.S.: Efficient mining of high-utility sequential rules. In: Perner, P. (ed.) MLDM 2015. LNCS (LNAI), vol. 9166, pp. 157–171. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21024-7_11
Chapter Google Scholar
Zihayat, M., Wu, C.W., An, A., Tseng, V.S.: Mining high utility sequential patterns from evolving data streams. In: ASE BD&SI, p. 52. ACM (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Clermont-Auvergne University, LIMOS CNRS UMR 6158, Aubière, France
Benjamin Dalmas & Sylvie Norre
Eindhoven University of Technology, Department of Mathematics and Computer Science, P.O. Box 513, 5600 MB, Eindhoven, The Netherlands
Niek Tax

Authors

Benjamin Dalmas
View author publications
You can also search for this author in PubMed Google Scholar
Niek Tax
View author publications
You can also search for this author in PubMed Google Scholar
Sylvie Norre
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Benjamin Dalmas .

Editor information

Editors and Affiliations

Newcastle University, Newcastle upon Tyne, UK
Maciej Koutny
Western Norway University of Applied Sciences, Bergen, Norway
Lars Michael Kristensen
Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
Wojciech Penczek

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Dalmas, B., Tax, N., Norre, S. (2018). Heuristic Mining Approaches for High-Utility Local Process Models. In: Koutny, M., Kristensen, L., Penczek, W. (eds) Transactions on Petri Nets and Other Models of Concurrency XIII. Lecture Notes in Computer Science(), vol 11090. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-58381-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-662-58381-4_2
Published: 21 November 2018
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-58380-7
Online ISBN: 978-3-662-58381-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics