Abstract
The analysis of process instance similarity offers valuable input for certain application fields including the evaluation of instance clusters, the identification of compliance abuses, and process optimization. In this paper, we discuss the topic of instance similarity in general: We show that similarity might be determined from different process perspectives such as control flow, time, and instance attributes. Each of these perspectives impose individual requirements on the similarity calculation concerning data and structure. Four metrics for process instance similarity are proposed covering different perspectives. The applicability and feasibility of the proposed metrics are evaluated based on a prototypical implementation and real-world process logs from the BPI challenges.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
If several end nodes are allowed by the respective meta model, the closest one is selected.
- 3.
- 4.
References
van der Aalst, W.M.P., van Dongen, B.F., Herbst, J., Maruster, L., Schimm, G., Weijters, A.: Workflow mining: a survey of issues and approaches. Data Knowl. Eng. 47(2), 237–267 (2003). http://dx.doi.org/10.1016/S0169-023X(03)00066–1
van der Aalst, W.M.P., et al.: Process mining manifesto. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011. LNBIP, vol. 99, pp. 169–194. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28108-2_19
van der Aalst, W.M.P., van Hee, K.M.: Workflow Management: Models, Methods, and Systems. MIT Press, Cambridge (2002)
Alonso, G., Casati, F., Kuno, H., Machiraju, V.: Web Services: Concepts. Architectures and Applications. Springer, Berlin (2004)
Anderson, K.: Web services and related technologies (2006)
Becker, M., Laue, R.: A comparative survey of business process similarity measures. Comput. Ind. 63(2), 148–167 (2012). http://dx.doi.org/10.1016/j.compind.2011.11.003
Cao, L.: Activity mining: challenges and prospects. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA 2006. LNCS (LNAI), vol. 4093, pp. 582–593. Springer, Heidelberg (2006). doi:10.1007/11811305_65
Dijkman, R., Dumas, M., van Dongen, B., Käärik, R., Mendling, J.: Similarity of business process models: metrics and evaluation. Inf. Syst. 36(2), 498–516 (2011). http://dx.doi.org/10.1016/j.is.2010.09.006
Dijkman, R.M., et al.: A short survey on process model similarity. In: Bubenko, J., et al. (eds.) Seminal Contributions to Information Systems Engineering: 25 Years of CAiSE, pp. 421–427. Springer, Heidelberg (2013). http://dx.doi.org/10.1007/978-3-642-36926-1_34
van Dongen, B.: Real-life event logs - hospital log (2011). http://dx.doi.org/10.4121/uuid:d9769f3d-0ab0-4fb8-803b-0d1120ffcf54
van Dongen, B.: Bpi challenge 2012 (2012). http://dx.doi.org/10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f
van Dongen, B.: Bpi challenge 2014 (2014). http://dx.doi.org/10.4121/uuid:c3e5d162-0cfd-4bb0-bd82-af5268819c35
van Dongen, B.: Bpi challenge 2015 (2015). http://dx.doi.org/10.4121/uuid:31a308ef-c844-48da-948c-305d167a0ec1
Guenther, C.W.: Mining activity clusters from low-level event logs. In: Eindhoven University of Technology (2006)
Günther, C.W., Rozinat, A., Aalst, W.M.P.: Activity mining by global trace segmentation. In: Rinderle-Ma, S., Sadiq, S., Leymann, F. (eds.) BPM 2009. LNBIP, vol. 43, pp. 128–139. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12186-9_13
Guenther, C.W., Verbeek, E.: Xes standard definition. Technical report. 2.0, Eindhoven University of Technology, March 2014
Hompes, B.F.A., Verbeek, H.M.W., Aalst, W.M.P.: Finding suitable activity clusters for decomposed process discovery. In: Ceravolo, P., Russo, B., Accorsi, R. (eds.) SIMPDA 2014. LNBIP, vol. 237, pp. 32–57. Springer, Heidelberg (2015). doi:10.1007/978-3-319-27243-6_2
Kaufman, L., Rousseeuw, P.J.: Clustering Large Applications (Program CLARA), pp. 126–163. John Wiley & Sons, Inc. (2008). http://dx.doi.org/10.1002/9780470316801.ch3
Keane, M.T., Smyth, B., O’Sullivan, J.: Dynamic similarity: a processing perspective on similarity. Similarity and Categorization. Oxford University Press, Oxford (2001)
Kreher, U., Reichert, M., Rinderle-Ma, S., Dadam, P.: Effiziente repraesentation von vorlagen- und instanzdaten in prozess-management-systemen. Technical report. 2009–08, Ulm University (in German) (2009)
La Rosa, M., Dumas, M., Ekanayake, C.C., García-Bañuelos, L., Recker, J., ter Hofstede, A.H.M.: Detecting approximate clones in business process model repositories. Inf. Syst. 49, 102–125 (2015). http://dx.doi.org/10.1016/j.is.2014.11.010
Lu, R., Sadiq, S.: On the discovery of preferred work practice through business process variants. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 165–180. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75563-0_13
Ly, L.T., Maggi, F.M., Montali, M., Rinderle-Ma, S., van der Aalst, W.M.P.: Compliance monitoring in business processes: functionalities, application, and tool-support. Inf. Syst. 54, 209–234 (2015)
de Medeiros, A.K.A., van der Aalst, W.M.P., Weijters, A.: Quantifying process equivalence based on observed behavior. Data Knowl. Eng. 64(1), 55–74 (2008). http://dx.doi.org/10.1016/j.datak.2007.06.010
Pflug, J., Rinderle-Ma, S.: Dynamic instance queuing in process-aware information systems. In: Proceedings of the 28th Annual ACM Symposium on Applied Computing (SAC 2013), pp. 1426–1433 (2013)
Pflug, J., Rinderle-Ma, S.: Application of dynamic instance queuing to activity sequences incooperative business process scenarios. Int. J. Coop. Inf. Syst. 25(1), 1650002 (2016)
Pufahl, L., Bazhenova, E., Weske, M.: Evaluating the performance of a batch activity in process models. In: Fournier, F., Mendling, J. (eds.) BPM 2014. LNBIP, vol. 202, pp. 277–290. Springer, Heidelberg (2015). doi:10.1007/978-3-319-15895-2_24
Rinderle, S., Reichert, M., Dadam, P.: Flexible support of team processes by adaptive workflow systems. Distrib. Parallel Databases 16(1), 91–116 (2004). http://dx.doi.org/10.1023/B:DAPD.0000026270.78463.77
Russell, N., Aalst, W.M.P., Hofstede, A.H.M., Edmond, D.: Workflow resource patterns: identification, representation and tool support. In: Pastor, O., Falcão e Cunha, J. (eds.) CAiSE 2005. LNCS, vol. 3520, pp. 216–232. Springer, Heidelberg (2005). doi:10.1007/11431855_16
Steeman, W.: Bpi challenge 2013 (2013). http://dx.doi.org/10.4121/uuid:a7ce5c55-03a7-4583-b855-98b86e1a2b07
Jin, T., Wang, J., Wu, N., Rosa, M., Hofstede, A.H.M.: Efficient and accurate retrieval of business process models through indexing. In: Meersman, R., Dillon, T., Herrero, P. (eds.) OTM 2010. LNCS, vol. 6426, pp. 402–409. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16934-2_28
Thaler, T., Ternis, S.F., Fettke, P., Loos, P.: A comparative analysis of process instance cluster techniques. In: Thomas, O., Teuteberg, F. (eds.) Wirtschaftsinformatik, pp. 423–437 (2015). http://dblp.uni-trier.de/db/conf/wirtschaftsinformatik/wi2015.html#ThalerTFL15
Verbeek, H.M.W., Aalst, W.M.P.: Decomposed process mining: the ILP case. In: Fournier, F., Mendling, J. (eds.) BPM 2014. LNBIP, vol. 202, pp. 264–276. Springer, Heidelberg (2015). doi:10.1007/978-3-319-15895-2_23
Yan, Z., Dijkman, R., Grefen, P.: Fast business process similarity search. Distrib. Parallel Databases 30(2), 105–144 (2012). http://dx.doi.org/10.1007/s10619-012-7089-z
Acknowledgments
This work has been funded by the Vienna Science and Technology Fund (WWTF) through project ICT15-072.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Pflug, J., Rinderle-Ma, S. (2016). Process Instance Similarity: Potentials, Metrics, Applications. In: Debruyne, C., et al. On the Move to Meaningful Internet Systems: OTM 2016 Conferences. OTM 2016. Lecture Notes in Computer Science(), vol 10033. Springer, Cham. https://doi.org/10.1007/978-3-319-48472-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-48472-3_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48471-6
Online ISBN: 978-3-319-48472-3
eBook Packages: Computer ScienceComputer Science (R0)