Abstract
Process mining is an innovative research field aimed at extracting useful information about business processes from event data. An important task herein is process discovery. The results of process discovery are mainly non-stochastic process models, which do not convey a notion of probability or uncertainty. In this paper, Bayesian inference and Markov Chain Monte Carlo is used to build a statistical model on top of a process model using event data, which is able to generate probability distributions for choices in a process’ control-flow. A generic algorithm to build such a model is presented, and it is shown how the resulting statistical model can be used to test different kinds of hypotheses. The algorithm supports the enhancement of discovered process models by exposing probabilistic dependencies, and allows to compare the quality among different models, each of which provides important advancements in the field of process discovery.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
For brevity, we directly apply the constraints by giving different prefixes the same probability parameters: there are 10 different splits, but we only define 5 probability distributions. In practice, equality constraints can be added in the final stage, yielding more flexibility in adding and removing specific constraints.
- 3.
Because of space limitations, the Bayesian model and resulting posterior distributions have not been included in this paper. Instead, we will look at two example use cases in the following section.
- 4.
Note that due to space limitations, we only show the posterior distributions of the variables of interest.
- 5.
- 6.
References
van der Aalst, W.M.P., Weijters, A.J.M.M., Maruster, L.: Workflow mining: discovering process models from event logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128–1142 (2004)
Agrawal, R., Gunopulos, D., Leymann, F.: Mining process models from workflow logs. In: Schek, H.-J., Alonso, G., Saltor, F., Ramos, I. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 467–483. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0101003
Augusto, A., Conforti, R., Dumas, M., La Rosa, M.: Split miner: discovering accurate and simple business process models from event logs (2017)
Augusto, A., et al.: Automated discovery of process models from event logs: review and benchmark. In: IEEE Transactions on Knowledge and Data Engineering (2018)
Datta, A.: Automating the discovery of as-is business process models: probabilistic and algorithmic approaches. Inf. Syst. Res. 9(3), 275–301 (1998)
Di Francescomarino, C., Ghidini, C., Maggi, F.M., Petrucci, G., Yeshchenko, A.: An eye into the future: leveraging a-priori knowledge in predictive business process monitoring. In: Carmona, J., Engels, G., Kumar, A. (eds.) BPM 2017. LNCS, vol. 10445, pp. 252–268. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-65000-5_15
Geman, S., Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Machine Intell. 6, 721–741 (1984)
Gill, J.: Bayesian Methods: A Social and Behavioral Sciences Approach. Chapman and Hall/CRC (2002)
Janssenswillen, G., Depaire, B., Swennen, M., Jans, M., Vanhoof, K.: bupaR: enabling reproducible business process analysis. Knowledge-Based Syst. 163, 927–930 (2019)
Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from incomplete event logs. In: Ciardo, G., Kindler, E. (eds.) PETRI NETS 2014. LNCS, vol. 8489, pp. 91–110. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07734-5_6
Lesaffre, E., Lawson, A.B.: Bayesian Biostatistics. Wiley, New York (2012)
Marsan, M.A.: Stochastic petri nets: an elementary introduction. In: Rozenberg, G. (ed.) APN 1988. LNCS, vol. 424, pp. 1–29. Springer, Heidelberg (1990). https://doi.org/10.1007/3-540-52494-0_23
Molloy, M.K.: Performance analysis using stochastic Petri nets. IEEE Trans. Comput. 9, 913–917 (1982)
Muñoz-Gama, J., Carmona, J.: A fresh look at precision in process conformance. In: Hull, R., Mendling, J., Tai, S. (eds.) BPM 2010. LNCS, vol. 6336, pp. 211–226. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15618-2_16
Rogge-Solti, A., van der Aalst, W.M.P., Weske, M.: Discovering stochastic petri nets with arbitrary delay distributions from event logs. In: International Conference on Business Process Management. pp. 15–27. Springer (2013)
Spiegelhalter, D.J., Best, N.G., Carlin, B.P., Van Der Linde, A.: Bayesian measures of model complexity and fit. J. Royal Stat. Soc. Ser. B (Statistical Methodology) 64(4), 583–639 (2002)
Verenich, I., Nguyen, H., La Rosa, M., Dumas, M.: White-box prediction of process performance indicators via flow analysis. In: Proceedings of the 2017 International Conference on Software and System Process, pp. 85–94 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Janssenswillen, G., Depaire, B., Faes, C. (2020). Enhancing Discovered Process Models Using Bayesian Inference and MCMC. In: Del Río Ortega, A., Leopold, H., Santoro, F.M. (eds) Business Process Management Workshops. BPM 2020. Lecture Notes in Business Information Processing, vol 397. Springer, Cham. https://doi.org/10.1007/978-3-030-66498-5_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-66498-5_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-66497-8
Online ISBN: 978-3-030-66498-5
eBook Packages: Computer ScienceComputer Science (R0)