A Markov prediction model for data-driven semi-structured business processes

  • Regular Paper
Knowledge and Information Systems

Abstract

In semi-structured, case-oriented business processes, case workers determine the sequence of process steps based on the document content associated with a case. Transitions between process execution steps are therefore case specific and depend on the independent judgment of case workers. In this paper, we propose an instance-specific probabilistic process model (PPM) whose transition probabilities are customized to the semi-structured business process instance it represents. An instance-specific PPM serves as a powerful representation for predicting the likelihood of different outcomes. We also show that certain instance-specific PPMs can be transformed into a Markov chain under some non-restrictive assumptions. For instance-specific PPMs that contain parallel execution of tasks, we provide an algorithm to map them to an extended-space Markov chain. In this way, existing Markov techniques can be leveraged to predict the likelihood of executing future tasks. Predictions provided by our technique could generate early alerts for case workers about the likelihood of important or undesired outcomes in an executing case instance. We have implemented and validated our approach on a simulated semi-structured business process for handling automobile insurance claims. Results indicate that an instance-specific PPM provides more accurate predictions than other methods such as conditional probability. We also show that the prediction accuracy of an instance-specific PPM increases as more document data become available.
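
To illustrate the kind of prediction the abstract refers to, the following minimal sketch computes the probability of eventually reaching an outcome in a small absorbing Markov chain. It is not the authors' algorithm: the task names, the transition probabilities, and the helper `reach_probability` are all hypothetical, chosen only to resemble an insurance claim case, and the sketch assumes the outcome of interest is an absorbing state.

```python
# Hypothetical transition probabilities for one case instance.
# Task names and numbers are illustrative assumptions, not taken from the paper.
P = {
    "receive_claim": {"assess_damage": 0.7, "request_documents": 0.3},
    "request_documents": {"assess_damage": 0.8, "reject_claim": 0.2},
    "assess_damage": {"approve_claim": 0.6, "investigate_fraud": 0.4},
    "investigate_fraud": {"approve_claim": 0.5, "reject_claim": 0.5},
    "approve_claim": {"approve_claim": 1.0},  # absorbing outcome
    "reject_claim": {"reject_claim": 1.0},    # absorbing outcome
}


def reach_probability(P, start, target, tol=1e-12, max_iter=10_000):
    """Probability of eventually reaching the absorbing state `target` from `start`.

    Iterates the fixed-point equations h[s] = sum_t P[s][t] * h[t] with
    h[target] = 1 and h = 0 on every other absorbing state.
    """
    h = {s: 0.0 for s in P}
    h[target] = 1.0
    for _ in range(max_iter):
        delta = 0.0
        for s, row in P.items():
            if row.get(s) == 1.0:
                continue  # absorbing states keep their boundary value
            new = sum(p * h[t] for t, p in row.items())
            delta = max(delta, abs(new - h[s]))
            h[s] = new
        if delta < tol:
            break
    return h[start]


if __name__ == "__main__":
    p = reach_probability(P, "receive_claim", "approve_claim")
    print(f"P(approve | at receive_claim) = {p:.3f}")
```

With these made-up numbers, a case currently at `receive_claim` is approved with probability of about 0.752. An instance-specific model in the sense of the abstract would replace the fixed entries of `P` with probabilities conditioned on the documents available for that case.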

Notes

  1. http://www.processmining.org/prom/start

Acknowledgments

We thank Songyun Duan and Paul T. Keyser for valuable discussions.

Author information

Corresponding author

Correspondence to Geetika T. Lakshmanan.

About this article

Cite this article

Lakshmanan, G.T., Shamsi, D., Doganata, Y.N. et al. A Markov prediction model for data-driven semi-structured business processes. Knowl Inf Syst 42, 97–126 (2015). https://doi.org/10.1007/s10115-013-0697-8

  • DOI: https://doi.org/10.1007/s10115-013-0697-8
