Abstract
In semi-structured case-oriented business processes, the sequence of process steps is determined by case workers based on available document content associated with a case. Transitions between process execution steps are therefore case specific and depend on independent judgment of case workers. In this paper, we propose an instance-specific probabilistic process model (PPM) whose transition probabilities are customized to the semi-structured business process instance it represents. An instance-specific PPM serves as a powerful representation to predict the likelihood of different outcomes. We also show that certain instance-specific PPMs can be transformed into a Markov chain under some non-restrictive assumptions. For instance-specific PPMs that contain parallel execution of tasks, we provide an algorithm to map them to an extended space Markov chain. This way existing Markov techniques can be leveraged to make predictions about the likelihood of executing future tasks. Predictions provided by our technique could generate early alerts for case workers about the likelihood of important or undesired outcomes in an executing case instance. We have implemented and validated our approach on a simulated automobile insurance claims handling semi-structured business process. Results indicate that an instance-specific PPM provides more accurate predictions than other methods such as conditional probability. We also show that as more document data become available, the prediction accuracy of an instance-specific PPM increases.
Similar content being viewed by others
References
Cook JE, Wolf AL (1998) Discovering models of software processes from event-based data. ACM Trans Softw Eng Method 7(3):215–249
Critical Capabilities for Composite Content Management Applications. Gartner, report. (2010)
Curbera F, Doganata YD, Martens A, Mukhi N, Slominski A (2008) Business provenance—a technology to increase traceability of end-to-end operations. OTM conferences, 1, pp 100–119
Datta A (1998) Automating the discovery of AS-IS business process models: probabilistic and algorithmic approaches. Inf Syst Res 9(3):275–301
Feller W (1957) An introduction to probability theory and its applications, vol 1. Wiley, New York, ISBN 0-471-25708-7
Grienstead CM, Snell L (1991) Introduction to probability. American Mathematical Society. ISBN 0-8218-0749-8
Grigori D, Casati F, Castellanos M, Dayal U, Sayal M, Shan M (2004) Business process intelligence. Comput Ind 53(3):321–343
Herbst J (2000) A machine learning approach to workflow management. ECML, pp 183–194
Herbst J, Karagiannis D (1998) Integrating machine learning and workflow management to support acquisition and adaption of workflow models. DEXA, workshop, pp 745–752
Hillier FS, Lieberman GJ (1986) Introduction to operations research, 4th edn. Holden-Day Inc., San Francisco, CA, USA
IBM Insurance Application Architecture. http://www-03.ibm.com/industries/insurance/us/detail/solution/P669447B27619A15.html?tab=3
Jensen K (1997) Coloured petri nets. Basic concepts, analysis methods and practical use. vol 3, practical use. Monographs in theoretical computer science, Springer, Berlin. ISBN:3-540-62867-3
Lakshmanan GT, Duan S, Keyser PT, Khalaf R, Curbera F (2010) A heuristic approach for making predictions for semi-structured case oriented business processes. Business process management workshops, pp 640–651
Lakshmanan GT, Khalaf R (2012) Leveraging process mining techniques to analyze semi-structured processes, IEEE IT Professional, to appear. http://doi.ieeecomputersociety.org/10.1109/MITP.2012.88
Liu S, Duffy AHB, Whitfield RI, Boyle IM (2010) Integration of decision support systems to improve decision support performance. Knowl Inf Syst 22(3):261–286
Murata T (1989) Petri nets: properties, analysis and applications. In: Proceedings of the IEEE, vol 77, no. 4
Natarajan S, Tadepalli P, Fern A (2011) A relational hierarchical model of decision-theoretic assistance. Knowl Inf Syst (KAIS):1–21
Paz JF, Bajo J, Gonzlez A, Rodrguez S, Corchado JM (2012) Combining case-based reasoning systems and support vector regression to evaluate the atmosphere-ocean interaction. Knowl Inf Syst 30(1):155–177
Pfeffer A (2005) Functional specification of probabilistic process models. AAAI, pp. 663–669
Poh KL (2000) An intelligent decision support system for investment analysis. Knowl Inf Syst, pp 340–358
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufman
Ross S (2003) Introduction to probability models, 8th edn, Chap. 4
Rozinat A, Wynn MT, van der Aalst WMP, ter Hofstede AHM, Fidge CJ (2009) Workflow simulation for operational decision support. Data Knowl Eng 68(9):834–850
Rozinat A, van der Aalst WMP (2006) Decision mining in ProM. Bus Process Manag:420–425
Rozsnyai S, Slominski A, Lakshmanan GT (2011) Discovering event correlation rules for semi-structured business processes. In: Proceedings of the 5th ACM international conference on Distributed event-based system, ACM, New York, pp 75–86
Schonenberg H, Weber B, van Dongen BF, van der Aalst WMP (2008) Supporting flexible processes through recommendations based on history. BPM:51–66
Taylor HM, Karlin S (1998) An introduction to stochastic modeling, 3rd edn, Chap. 3–4
van der Aalst WMP (2011) Process mining—discovery, conformance and enhancement of business processes. Springer, Berlin, pp I–XVI, 1–352
van der Aalst WMP, Reijers HA, Weijters AJMM, van Dongen BF, Alves de Medeiros AK, Song M et al. (2007) Business process mining: an industrial application. Inf Syst 32(5):713–732
van der Aalst WMP, Schonenberg MH, Song M (2011) Time prediction based on process mining. Inf Syst 36(2):450–475
van der Aalst WMP, van Dongen BF, Gnther CW, Rozinat A, Verbeek E, Weijters T (2009) ProM: the process mining toolkit. BPM (Demos)
van der Aalst WMP, van Dongen BF, Herbst J, Maruster L, Schimm G, Weijters AJMM et al (2003) Workflow mining: a survey of issues and approaches. Data Knowl Eng 47(2):237–267
van der Aalst WMP, Weske M, Grünbauer D (2005) Case handling: a new paradigm for business process support. KDE 53(2):129–162
van Dongen BF, Crooy RA, van der Aalst WMP (2008) Cycle time prediction: when will this case finally be finished? OTM conferences, 1, pp 319–336
Vanderfeesten ITP, Reijers HA, van der Aalst WMP (2011) Product-based workflow support. Inf Syst 36(2):517–535
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, Burlington
Zenie A (1985) Coloured stochastic petri nets. In: Proceedings of the international workshop on timed petri nets, IEEE Computer Society Press, Torino, pp 262–271
Zhu WD, Becker B, Boudreaux J, Baman S, Gomez D, Marin M, Vaughan A (2000) Advanced case management with IBM case manager, IBM redbooks. http://www.redbooks.ibm.com/abstracts/sg247929.html?Open
Acknowledgments
We thank Songyun Duan and Paul T. Keyser for valuable discussions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lakshmanan, G.T., Shamsi, D., Doganata, Y.N. et al. A markov prediction model for data-driven semi-structured business processes. Knowl Inf Syst 42, 97–126 (2015). https://doi.org/10.1007/s10115-013-0697-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-013-0697-8