Skip to main content

Enhancing Discovered Process Models Using Bayesian Inference and MCMC

  • Conference paper
  • First Online:
Business Process Management Workshops (BPM 2020)

Abstract

Process mining is an innovative research field aimed at extracting useful information about business processes from event data. An important task herein is process discovery. The results of process discovery are mainly non-stochastic process models, which do not convey a notion of probability or uncertainty. In this paper, Bayesian inference and Markov Chain Monte Carlo is used to build a statistical model on top of a process model using event data, which is able to generate probability distributions for choices in a process’ control-flow. A generic algorithm to build such a model is presented, and it is shown how the resulting statistical model can be used to test different kinds of hypotheses. The algorithm supports the enhancement of discovered process models by exposing probabilistic dependencies, and allows to compare the quality among different models, each of which provides important advancements in the field of process discovery.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/bupaverse/propro.

  2. 2.

    For brevity, we directly apply the constraints by giving different prefixes the same probability parameters: there are 10 different splits, but we only define 5 probability distributions. In practice, equality constraints can be added in the final stage, yielding more flexibility in adding and removing specific constraints.

  3. 3.

    Because of space limitations, the Bayesian model and resulting posterior distributions have not been included in this paper. Instead, we will look at two example use cases in the following section.

  4. 4.

    Note that due to space limitations, we only show the posterior distributions of the variables of interest.

  5. 5.

    By model, we are referring to the Bayesian models. As such, we can compare the goodness-of-fit among different Petri net models, as well as among a single Petri net model with different probability specifications, such as the different specifications in Table 3 vs Table 4a.

  6. 6.

    https://github.com/bupaverse/propro.

References

  1. van der Aalst, W.M.P., Weijters, A.J.M.M., Maruster, L.: Workflow mining: discovering process models from event logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128–1142 (2004)

    Article  Google Scholar 

  2. Agrawal, R., Gunopulos, D., Leymann, F.: Mining process models from workflow logs. In: Schek, H.-J., Alonso, G., Saltor, F., Ramos, I. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 467–483. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0101003

    Chapter  Google Scholar 

  3. Augusto, A., Conforti, R., Dumas, M., La Rosa, M.: Split miner: discovering accurate and simple business process models from event logs (2017)

    Google Scholar 

  4. Augusto, A., et al.: Automated discovery of process models from event logs: review and benchmark. In: IEEE Transactions on Knowledge and Data Engineering (2018)

    Google Scholar 

  5. Datta, A.: Automating the discovery of as-is business process models: probabilistic and algorithmic approaches. Inf. Syst. Res. 9(3), 275–301 (1998)

    Article  Google Scholar 

  6. Di Francescomarino, C., Ghidini, C., Maggi, F.M., Petrucci, G., Yeshchenko, A.: An eye into the future: leveraging a-priori knowledge in predictive business process monitoring. In: Carmona, J., Engels, G., Kumar, A. (eds.) BPM 2017. LNCS, vol. 10445, pp. 252–268. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-65000-5_15

    Chapter  Google Scholar 

  7. Geman, S., Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Machine Intell. 6, 721–741 (1984)

    Google Scholar 

  8. Gill, J.: Bayesian Methods: A Social and Behavioral Sciences Approach. Chapman and Hall/CRC (2002)

    Google Scholar 

  9. Janssenswillen, G., Depaire, B., Swennen, M., Jans, M., Vanhoof, K.: bupaR: enabling reproducible business process analysis. Knowledge-Based Syst. 163, 927–930 (2019)

    Article  Google Scholar 

  10. Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from incomplete event logs. In: Ciardo, G., Kindler, E. (eds.) PETRI NETS 2014. LNCS, vol. 8489, pp. 91–110. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07734-5_6

    Chapter  Google Scholar 

  11. Lesaffre, E., Lawson, A.B.: Bayesian Biostatistics. Wiley, New York (2012)

    Google Scholar 

  12. Marsan, M.A.: Stochastic petri nets: an elementary introduction. In: Rozenberg, G. (ed.) APN 1988. LNCS, vol. 424, pp. 1–29. Springer, Heidelberg (1990). https://doi.org/10.1007/3-540-52494-0_23

    Chapter  Google Scholar 

  13. Molloy, M.K.: Performance analysis using stochastic Petri nets. IEEE Trans. Comput. 9, 913–917 (1982)

    Google Scholar 

  14. Muñoz-Gama, J., Carmona, J.: A fresh look at precision in process conformance. In: Hull, R., Mendling, J., Tai, S. (eds.) BPM 2010. LNCS, vol. 6336, pp. 211–226. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15618-2_16

    Chapter  Google Scholar 

  15. Rogge-Solti, A., van der Aalst, W.M.P., Weske, M.: Discovering stochastic petri nets with arbitrary delay distributions from event logs. In: International Conference on Business Process Management. pp. 15–27. Springer (2013)

    Google Scholar 

  16. Spiegelhalter, D.J., Best, N.G., Carlin, B.P., Van Der Linde, A.: Bayesian measures of model complexity and fit. J. Royal Stat. Soc. Ser. B (Statistical Methodology) 64(4), 583–639 (2002)

    Article  MathSciNet  Google Scholar 

  17. Verenich, I., Nguyen, H., La Rosa, M., Dumas, M.: White-box prediction of process performance indicators via flow analysis. In: Proceedings of the 2017 International Conference on Software and System Process, pp. 85–94 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gert Janssenswillen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Janssenswillen, G., Depaire, B., Faes, C. (2020). Enhancing Discovered Process Models Using Bayesian Inference and MCMC. In: Del Río Ortega, A., Leopold, H., Santoro, F.M. (eds) Business Process Management Workshops. BPM 2020. Lecture Notes in Business Information Processing, vol 397. Springer, Cham. https://doi.org/10.1007/978-3-030-66498-5_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-66498-5_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-66497-8

  • Online ISBN: 978-3-030-66498-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics