Abstract
Dealing with average-sized event logs is considered a challenging task in process mining, in order to give value to event log data created by a wide variety of systems. An event log consists of a sequence of events for every case that was handled by the system. Discovery algorithms proposed in the literature work well in specific cases, but they usually fail in generic ones. Furthermore, there is no evidence that those existing strategies can handle logs with a large number of variants. We lack a generic approach to allow experts to explore event log data and decompose information into a series of smaller problems, to identify not only outliers, but also relations between the analyzed cases. In this chapter we propose a visual approach for filtering processes based on a low dimensionality representation of cases, a dissimilarity function based on both case attributes and case paths, and the use of entropy and silhouette to evaluate the uncertainty and quality, respectively, of each subset of cases. For each subset of cases, it is possible to reconstruct and evaluate each process model. Those contributions can be combined in an interactive tool to support process discovery. To demonstrate our tool, we use the event log from BPI Challenge 2017.
Supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior(CAPES).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
https://www.celonis.com/ last visited in July, 2018.
References
Mendling, J., Baesens, B., Bernstein, A., Fellmann, M.: Challenges of smart business process management: an introduction to the special issue (2017)
Tiwari, A., Turner, C.J., Majeed, B.: A review of business process mining: state-of-the-art and future trends. Bus. Process Manag. J. 14, 5–22 (2008)
Van der Aalst, W.M., Weijters, A.: Process mining: a research agenda. Comput. Ind. 53, 231–244 (2004)
Van der Aalst, W.M.: Process Mining: Data Science in Action. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49851-4
Van der Aalst, W., Weijters, T., Maruster, L.: Workflow mining: discovering process models from event logs. IEEE Trans. Knowl. Data Eng. 16, 1128–1142 (2004)
Verbeek, H., Van der Aalst, W., Munoz-Gama, J.: Divide and conquer: a tool framework for supporting decomposed discovery in process mining. Comput. J. 60, 1–26 (2017)
De Koninck, P., De Weerdt, J.: Multi-objective trace clustering: finding more balanced solutions. In: Dumas, M., Fantinato, M. (eds.) BPM 2016. LNBIP, vol. 281, pp. 49–60. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58457-7_4
Silva, L.J.S., et al.: Visual support to filtering cases for process discovery. In: ICEIS, no. 1, pp. 38–49 (2018)
Keim, D.A.: Visual exploration of large data sets. Commun. ACM 44, 38–44 (2001)
van Dongen, B.F., de Medeiros, A.K.A., Verbeek, H.M.W., Weijters, A.J.M.M., van der Aalst, W.M.P.: The ProM framework: a new era in process mining tool support. In: Ciardo, G., Darondeau, P. (eds.) ICATPN 2005. LNCS, vol. 3536, pp. 444–454. Springer, Heidelberg (2005). https://doi.org/10.1007/11494744_25
Günther, C.W., Rozinat, A.: Disco: discover your processes. BPM (Demos) 940, 40–44 (2012)
Günther, C.W., van der Aalst, W.M.P.: Fuzzy mining – adaptive process simplification based on multi-perspective metrics. In: Alonso, G., Dadam, P., Rosemann, M. (eds.) BPM 2007. LNCS, vol. 4714, pp. 328–343. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-75183-0_24
Sudhamani, Aruna Devi, T., Kumudavalli, M.: An informative and comparative study of process mining tools. Int. J. Sci. Eng. Res. 8, 8–10 (2017)
Low, W.Z., Van der Aalst, W.M., ter Hofstede, A.H., Wynn, M.T., De Weerdt, J.: Change visualisation: analysing the resource and timing differences between two event logs. Inf. Syst. 65, 106–123 (2017)
Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, vol. 10, pp. 707–710 (1966)
Jaccard, P.: Étude comparative de la distribution florale dans une portion des alpes et des jura. Bull. Soc. Vaudoise Sci. Nat. 37, 547–579 (1901)
Buja, A., McDonald, J., Michalak, J., Stuetzle, W.: Interactive data visualization using focusing and linking. In: Proceeding Visualization 1991, pp. 156–163. IEEE Computer Society Press (1991)
Becker, R.A., Cleveland, W.S., Hill, M.: Brushing scatterplots. Technometrics 29, 127–142 (1987)
Kruskal, J.B., Wish, M.: Multidimensional Scaling, vol. 31 (1978)
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, Hoboken (2012)
Shannon, C.E., Weaver, W.: A Mathematical Theory of Communication. University of Illinois Press, Champaign (1963)
Lopes, H., Barbosa, S.: Uncertainty measures and the concentration of probability density functions. In: Learning and Inferring: Festschrift for Alejandro Frery. College Publications (2015)
Strehl, A., Ghosh, J.: Cluster ensembles–a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583–617 (2002)
Buijs, J.C.A.M., van Dongen, B.F., van der Aalst, W.M.P.: On the role of fitness, precision, generalization and simplicity in process discovery. In: Meersman, R., et al. (eds.) OTM 2012. LNCS, vol. 7565, pp. 305–322. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33606-5_19
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis, vol. 344. Wiley, Hoboken (2009)
Verbeek, H., Buijs, J., Van Dongen, B., van der Aalst, W.M.: ProM 6: the process mining toolkit. Proc. BPM Demonstr. Track 615, 34–39 (2010)
Rozinat, A., Günther, C.W., Niks, R.: Process Mining and Automated Process Discovery Software for Professionals-Fluxicon Disco (2017). http://fluxicon.com/disco
Joia, P., Coimbra, D., Cuminato, J.A., Paulovich, F.V., Nonato, L.G.: Local affine multidimensional projection. IEEE Trans. Vis. Comput. Graph. 17, 2563–2571 (2011)
Pagliosa, P., Paulovich, F.V., Minghim, R., Levkowitz, H., Nonato, L.G.: Projection inspector: assessment and synthesis of multidimensional projections. Neurocomputing 150, 599–610 (2015)
Acknowledgements
We thank Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for partially financing this research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
González, S.F. et al. (2019). Visual Filtering Tools and Analysis of Case Groups for Process Discovery. In: Hammoudi, S., Śmiałek, M., Camp, O., Filipe, J. (eds) Enterprise Information Systems. ICEIS 2018. Lecture Notes in Business Information Processing, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-030-26169-6_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-26169-6_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26168-9
Online ISBN: 978-3-030-26169-6
eBook Packages: Computer ScienceComputer Science (R0)