Abstract
Process model similarity measures can be utilized for searching process model collections, which is also called similarity-based search. While there are quite a lot of approaches, most of them base on an underlying alignment between the activities of the compared process models. Yet, according to the results of the process model matching contests conducted in recent years, such an alignment seems to be quite difficult to achieve. The Latent Semantic Analysis-based Similarity Search approach described in this paper circumvents the matching challenge by not requiring such a matching. Instead, it uses a Latent Semantic Analysis-based Similarity Measure to query model collections and retrieve similar models. An evaluation with a collection of 80 models resulted in very good results in terms of Precision, Recall, and F-Measure. The best F-Measure value obtained during the experiments was 0.92.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The JAVA API can be downloaded from butler.aifb.kit.edu/asc/LS3/ls3.html.
References
Lau, C.K., Fournier, A.J., Xia, Y., Recker, J., Bernhard, E.: Process model repository governance at suncorp. Technical report, Queensland University of Technology (2011)
Song, L., Wang, J., Wen, L., Wang, W., Tan, S., Kong, H.: Querying process models based on the temporal relations between tasks. In: Proceedings of the 15th IEEE International EDOCW Workshops, pp. 213–222 (2011)
Dumas, M., GarcÃa-Bañuelos, L., Dijkman, R.M.: Similarity search of business process models. IEEE Data Eng. Bull. 32(3), 23–28 (2009)
ter Hofstede, A.H.M., Ouyang, C., Rosa, M.L., Song, L., Wang, J., Polyvyanyy, A.: APQL: a process-model query language. In: Proceedings of the 1st AP-BPM Conference, pp. 23–38 (2013)
Becker, M., Laue, R.: A comparative survey of business process similarity measures. Comput. Ind. 63(2), 148–167 (2012)
Antunes, G., Bakhshandeh, M., Borbinha, J., Cardoso, J., Dadashnia, S., Francescomarino, C.D., Dragoni, M., Fettke, P., Gal, A., Ghidini, C., Hake, P., Khiat, A., Klinkmüller, C., Kuss, E., Leopold, H., Loos, P., Meilicke, C., Niesen, T., Pesquita, C., Péus, T., Schoknecht, A., Sheetrit, E., Sonntag, A., Stuckenschmidt, H., Thaler, T., Weber, I., Weidlich, M.: The process model matching contest 2015. In: Proceedings of the 6th EMISA Workshop, pp. 127–155 (2015)
Reisig, W.: Understanding Petri Nets - Modeling Techniques, Analysis Methods, Case Studies. Springer, Heidelberg (2013)
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)
Dumais, S.T., Furnas, G.W., Landauer, T.K., Deerwester, S., Harshman, R.: Using latent semantic analysis to improve access to textual information. In: SIGCHI Conference on Human Factors in Computing Systems, pp. 281–285 (1988)
Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Process. 25(2–3), 259–284 (1998)
Dumais, S.T.: LSA and information retrieval: getting back to basics. In: Handbook of Latent Semantic Analysis, pp. 293–321. Lawrence Erlbaum Associates (2007)
Awad, A.: BPMN-Q: a language to query business processes. In: Proceedings of the 2nd EMISA Workshop, pp. 115–128 (2007)
Beeri, C., Eyal, A., Kamenkovich, S., Milo, T.: Querying business processes with BP-QL. Inf. Syst. 33(6), 477–507 (2008)
Dijkman, R., Dumas, M., GarcÃa-Bañuelos, L.: Graph matching algorithms for business process model similarity search. In: Dayal, U., Eder, J., Koehler, J., Reijers, H.A. (eds.) BPM 2009. LNCS, vol. 5701, pp. 48–63. Springer, Heidelberg (2009). doi:10.1007/978-3-642-03848-8_5
Kunze, M., Weidlich, M., Weske, M.: Querying process models by behavior inclusion. Softw. Syst. Model. 14(3), 1105–1125 (2015)
Gater, A., Grigori, D., Bouzeghoub, M.: Indexing process model flow dependencies for similarity search. In: Meersman, R., Panetto, H., Dillon, T., Rinderle-Ma, S., Dadam, P., Zhou, X., Pearson, S., Ferscha, A., Bergamaschi, S., Cruz, I.F. (eds.) OTM 2012. LNCS, vol. 7565, pp. 128–145. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33606-5_9
Yan, Z., Dijkman, R., Grefen, P.: Fast business process similarity search. Distrib. Parallel Databases 30(2), 105–144 (2012)
Kastner, M., Wagdy Saleh, M., Wagner, S., Affenzeller, M., Jacak, W.: Heuristic methods for searching and clustering hierarchical workflows. In: Moreno-DÃaz, R., Pichler, F., Quesada-Arencibia, A. (eds.) EUROCAST 2009. LNCS, vol. 5717, pp. 737–744. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04772-5_95
Awad, A., Polyvyanyy, A., Weske, M.: Semantic querying of business process models. In: Proceedings of the 12th EDOC Conference, pp. 85–94 (2008)
Qiao, M., Akkiraju, R., Rembert, A.J.: Towards efficient business process clustering and retrieval: combining language modeling and structure matching. In: Rinderle-Ma, S., Toumani, F., Wolf, K. (eds.) BPM 2011. LNCS, vol. 6896, pp. 199–214. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23059-2_17
Li, S., Cao, J.: A new similarity search approach on process models. In: Cao, J., Wen, L., Liu, X. (eds.) PAS 2014. CCIS, vol. 495, pp. 11–20. Springer, Heidelberg (2015). doi:10.1007/978-3-662-46170-9_2
Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Malinova, M., Dijkman, R., Mendling, J.: Automatic extraction of process categories from process model collections. In: Lohmann, N., Song, M., Wohed, P. (eds.) BPM 2013. LNBIP, vol. 171, pp. 430–441. Springer, Cham (2014). doi:10.1007/978-3-319-06257-0_34
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Zaman, A.N.K., Brown, C.G.: Latent semantic indexing and large dataset: study of term-weighting schemes. In: Proceedings of the 5th ICDIM Conference, pp. 1–4 (2010)
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworth-Heinemann, Newton (1979)
Turney, P.D., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. (JAIR) 37, 141–188 (2010)
Vogelaar, J.J.C.L., Verbeek, H.M.W., Luka, B., Aalst, W.M.P.: Comparing business processes to determine the feasibility of configurable models: a case study. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011. LNBIP, vol. 100, pp. 50–61. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28115-0_6
Hu, X., Cai, Z., Wiemer-Hastings, P., Graesser, A.C., McNamara, D.S.: Strengths, limitations, and extensions of LSA. In: Handbook of Latent Semantic Analysis, pp. 401–426. Lawrence Erlbaum Associates (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Schoknecht, A., Fischer, N., Oberweis, A. (2017). Process Model Search Using Latent Semantic Analysis. In: Dumas, M., Fantinato, M. (eds) Business Process Management Workshops. BPM 2016. Lecture Notes in Business Information Processing, vol 281. Springer, Cham. https://doi.org/10.1007/978-3-319-58457-7_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-58457-7_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58456-0
Online ISBN: 978-3-319-58457-7
eBook Packages: Computer ScienceComputer Science (R0)