Abstract
The size of execution data available for process mining analysis grows several orders of magnitude every couple of years. Extracting and selecting the relevant data to be analyzed on each case represents an open challenge in the field. This paper presents a systematic literature review on different approaches to query process data and establish their provenance. In addition, a new query language is proposed, which overcomes the limitations identified during the review. The proposal is based on a combination of data and process perspectives. It provides simple constructs to intuitively formulate questions. An implementation of the language is provided, together with examples of queries to be applied on different aspects of the process analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Vanwersch, R., Shahzad, K., Vanhaecht, K., Grefen, P., Pintelon, L., Mendling, J., Van Merode, G., Reijers, H.A.: Methodological support for business process redesign in health care: a literature review protocol. Int. J. Care Pathways 15(4), 119–126 (2011)
Moreau, L., Clifford, B., Freire, J., Futrelle, J., Gil, Y., Groth, P., Kwasnikowska, N., Miles, S., Missier, P., Myers, J., et al.: The open provenance model core specification (v1. 1). Future Gener. Comput. Syst. 27(6), 743–756 (2011)
Huang, X., Bao, Z., Davidson, S.B., Milo, T., Yuan, X.: Answering regular path queries on workflow provenance. In: 2015 IEEE Proceedings of the 31st International Conference on Data Engineering (ICDE), pp. 375–386. IEEE (2015)
Costa, F., Silva, V., De Oliveira, D., Ocaña, K., Ogasawara, E., Dias, J., Mattoso, M.: Capturing and querying workflow runtime provenance with PROV: a practical approach. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops, pp. 282–289. ACM (2013)
Cuevas-Vicenttin, V., Dey, S., Wang, M.L.Y., Song, T., Ludascher, B.: Modeling and querying scientific workflow provenance in the D-OPM. In: High Performance Computing, Networking, Storage and Analysis (SCC), pp. 119–128. IEEE (2012)
Sakka, M.A., Defude, B.: Towards a scalable semantic provenance management system. In: Hameurlain, A., Küng, J., Wagner, R. (eds.) Transactions on Large-Scale Data- and Knowledge-Centered Systems VII. LNCS, vol. 7720, pp. 96–127. Springer, Heidelberg (2012). doi:10.1007/978-3-642-35332-1_4
Chirigati, F., Freire, J.: Towards integrating workflow and database provenance. In: Groth, P., Frew, J. (eds.) IPAW 2012. LNCS, vol. 7525, pp. 11–23. Springer, Heidelberg (2012). doi:10.1007/978-3-642-34222-6_2
Gadelha, L.M., Wilde, M., Mattoso, M., Foster, I.: MTCProv: a practical provenance query framework for many-task scientific computing. Distrib. Parallel Databases 30(5–6), 351–370 (2012)
Lim, C., Lu, S., Chebotko, A., Fotouhi, F.: OPQL: A first OPM-level query language for scientific workflow provenance. In: 2011 IEEE International Conference on Services Computing (SCC), pp. 136–143. IEEE (2011)
Lim, C., Lu, S., Chebotko, A., Fotouhi, F.: Storing, reasoning, and querying OPM-compliant scientific workflow provenance using relational databases. Future Gener. Comput. Syst. 27(6), 781–789 (2011)
Liu, D.: XQuery meets Datalog: data relevance query for workflow trustworthiness. In: Research Challenges in Information Science (RCIS 2010), pp. 169–174. IEEE (2010)
Bowers, S., McPhillips, T., Ludäscher, B., Cohen, S., Davidson, S.B.: A model for user-oriented data provenance in pipelined scientific workflows. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 133–147. Springer, Heidelberg (2006). doi:10.1007/11890850_15
Solanki, M., Brewster, C.: A knowledge driven approach towards the validation of externally acquired traceability datasets in supply chain business processes. In: Janowicz, K., Schlobach, S., Lambrix, P., Hyvönen, E. (eds.) EKAW 2014. LNCS (LNAI), vol. 8876, pp. 503–518. Springer, Cham (2014). doi:10.1007/978-3-319-13704-9_38
Momotko, M., Subieta, K.: Process query language: a way to make workflow processes more flexible. In: Benczúr, A., Demetrovics, J., Gottlob, G. (eds.) ADBIS 2004. LNCS, vol. 3255, pp. 306–321. Springer, Heidelberg (2004). doi:10.1007/978-3-540-30204-9_21
Koksal, P., Arpinar, S.N., Dogac, A.: Workflow history management. ACM Sigmod Rec. 27(1), 67–75 (1998)
Poppe, O., Giessl, S., Rundensteiner, E.A., Bry, F.: The HIT model: workflow-aware event stream monitoring. In: Hameurlain, A., Küng, J., Wagner, R., Amann, B., Lamarre, P. (eds.) Transactions on Large-Scale Data- and Knowledge-Centered Systems XI. LNCS, vol. 8290, pp. 26–50. Springer, Heidelberg (2013). doi:10.1007/978-3-642-45269-7_2
Liu, D., Pedrinaci, C., Domingue, J.: Semantic enabled complex event language for business process monitoring. In: Proceedings of the 4th International Workshop on Semantic Business Process Management, pp. 31–34. ACM (2009)
Fazzinga, B., Flesca, S., Furfaro, F., Masciari, E., Pontieri, L., Pulice, C.: A framework supporting the analysis of process logs stored in either relational or NoSQL DBMSs. In: Esposito, F., Pivert, O., Hacid, M.-S., Raś, Z.W., Ferilli, S. (eds.) ISMIS 2015. LNCS (LNAI), vol. 9384, pp. 52–58. Springer, Cham (2015). doi:10.1007/978-3-319-25252-0_6
Radeschütz, S., Schwarz, H., Niedermann, F.: Business impact analysis: a framework for a comprehensive analysis and optimization of business processes. Comput. Sci. Res. Dev. 30(1), 69–86 (2015)
Backmann, M., Baumgrass, A., Herzberg, N., Meyer, A., Weske, M.: Model-driven event query generation for business process monitoring. In: Lomuscio, A.R., Nepal, S., Patrizi, F., Benatallah, B., Brandić, I. (eds.) ICSOC 2013. LNCS, vol. 8377, pp. 406–418. Springer, Cham (2014). doi:10.1007/978-3-319-06859-6_36
Song, L., Wang, J., Wen, L., Wang, W., Tan, S., Kong, H.: Querying process models based on the temporal relations between tasks. In: 2011 15th IEEE International Enterprise Distributed Object Computing Conference Workshops (EDOCW), pp. 213–222. IEEE (2011)
Beheshti, S.-M.-R., Benatallah, B., Motahari-Nezhad, H.R., Sakr, S.: A query language for analyzing business processes execution. In: Rinderle-Ma, S., Toumani, F., Wolf, K. (eds.) BPM 2011. LNCS, vol. 6896, pp. 281–297. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23059-2_22
Deutch, D., Milo, T.: Top-K projection queries for probabilistic business processes. In: Proceedings of the 12th International Conference on Database Theory. ACM (2009)
Baquero, A.V., Molloy, O.: Integration of event data from heterogeneous systems to support business process analysis. In: Fred, A., Dietz, J.L.G., Liu, K., Filipe, J. (eds.) IC3K 2012. CCIS, vol. 415, pp. 440–454. Springer, Heidelberg (2013). doi:10.1007/978-3-642-54105-6_29
Metzke, T., Rogge-Solti, A., Baumgrass, A., Mendling, J., Weske, M.: Enabling semantic complex event processing in the domain of logistics. In: Lomuscio, A.R., Nepal, S., Patrizi, F., Benatallah, B., Brandić, I. (eds.) ICSOC 2013. LNCS, vol. 8377, pp. 419–431. Springer, Cham (2014). doi:10.1007/978-3-319-06859-6_37
Ray, M., Liu, M., Rundensteiner, E., Dougherty, D.J., Gupta, C., Wang, S., Mehta, A., Ari, I.: Optimizing complex sequence pattern extraction using caching. In: 2011 IEEE Proceedings of the 27th International Conference on Data Engineering Workshops (ICDEW), pp. 243–248. IEEE (2011)
Räim, M., Ciccio, C., Maggi, F.M., Mecella, M., Mendling, J.: Log-based understanding of business processes through temporal logic query checking. In: Meersman, R., et al. (eds.) OTM 2014. LNCS, vol. 8841, pp. 75–92. Springer, Heidelberg (2014). doi:10.1007/978-3-662-45563-0_5
González López de Murillas, E., Reijers, H.A., van der Aalst, W.M.P.: Connecting databases with process mining: a meta model and toolset. In: Proceedings of the 17th International Conference on Enterprise, Business-Process and Information Systems Modeling, BPMDS (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
González López de Murillas, E., Reijers, H.A., van der Aalst, W.M.P. (2017). Everything You Always Wanted to Know About Your Process, but Did Not Know How to Ask. In: Dumas, M., Fantinato, M. (eds) Business Process Management Workshops. BPM 2016. Lecture Notes in Business Information Processing, vol 281. Springer, Cham. https://doi.org/10.1007/978-3-319-58457-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-58457-7_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58456-0
Online ISBN: 978-3-319-58457-7
eBook Packages: Computer ScienceComputer Science (R0)