Towards a Multi-parametric Visualisation Approach for Business Process Analytics

Bachhofner, Stefan; Kis, Isabella; Di Ciccio, Claudio; Mendling, Jan

doi:10.1007/978-3-319-60048-2_8


Part of the book series: Lecture Notes in Business Information Processing (LNBIP, volume 286)

Included in the following conference series:

International Conference on Advanced Information Systems Engineering

Abstract

Visualisation is an integral part of many scientific areas and is reportedly an important tool for learning and teaching. One reason for this is the picture superiority effect. Nevertheless, little research has been carried out so far on how to apply visualisation principles effectively to the emerging field of business process analytics. In this paper, a novel multi-parametric visualisation approach is proposed for that context. General visualisation principles are used to create, evaluate, and improve the approach during the design process. They are drawn from a wide range of fields and are synthesised from theory and empirical evidence.

1 Introduction

Visualisation is a powerful tool for understanding data. In statistics, an exploratory data analysis is performed before any statistical method is applied. Any data science process includes a step in which the data is explored visually. Medicine and cartography also pay particular attention to the colour scheme. However, in the context of Business Process Management (BPM), little research has been done to develop visualisation frameworks that effectively help domain experts and process analysts understand the performance of the examined processes. This is a missed opportunity, because information represented visually is more likely to be remembered due to the picture superiority effect [7, 11]. Business Process Management Systems (BPMSs) play an important role for process-aware organisations. However, BPMSs fall short on powerful process analysis tools, especially from the perspective of visualisation. At times, pie charts are used instead of representations that convey the information more accurately.

In this position paper, the importance of powerful visualisation tools in process science is emphasised. In particular, a set of general visualisation principles is presented. Building on these principles, we design a novel multi-parametric approach that visually depicts process execution dynamics on a process model, representing multiple performance metrics at once. The presented framework is based upon the results of a research project carried out in collaboration with PHACTUM Softwareentwicklung GmbH.

The remainder of the paper is organised as follows. Section 2 introduces the preliminaries from BPM and visualisation. Section 3 proposes our novel multi-parametric visualisation approach for process analytics. Section 4 concludes the paper and outlines further research.

2 Background

BPM is the art and science of overseeing how work is performed in an organisation to ensure consistent outcomes and to take advantage of improvement opportunities [5]. To that extent, the Internet of Events (IoE) [1] opens up new opportunities to process analysts who can rely on the efficient treatment of big data and various sources. Such opportunities include the automated processing of data by means of machine learning techniques and statistical methods, which benefit from the availability of large data sets.

Visualisation is the graphical representation of data or concepts [17]. The atomic building blocks of this representation are visual variables, as first described by Bertin [3] and subsequently clarified by Moody [12] (Fig. 1). Together they form the set of possible visual combinations, i.e., the design space [12]. The chosen visual variable has to preserve the structure of the underlying data [15]. For example, assume a categorical ordinal variable is given, such as quantiles or age categories (e.g., young, middle-aged and elderly). In both cases the categories imply an order that has to be maintained. Therefore, a visual variable has to be chosen where perceptual ordering is possible, as shown in Fig. 2. For example, the grey colour map is perceptually ordered because it only varies in brightness (Fig. 2(b)). In contrast, the widely used rainbow colour map is not perceptually ordered because readers usually do not sense an intuitive sorting of its colours (Fig. 2(a)). Another important principle is contextualisation, namely the focus+context paradigm [9, 18], which applies when one wants to focus on a part of a system while showing the context of the system as a whole.
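The difference between the two colour maps can be illustrated with a minimal Python sketch. It is only a rough proxy for perceptual ordering: it assumes that brightness can be approximated by a weighted sum of the RGB channels (Rec. 709 weights, without gamma correction), and the helper names `luminance`, `grey_map`, `rainbow_map` and `is_monotonic` are hypothetical, not taken from the paper.

```python
import colorsys


def luminance(rgb):
    """Approximate brightness of an RGB triple in [0, 1] (Rec. 709 weights)."""
    r, g, b = rgb
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def grey_map(n):
    """n grey levels from black to white."""
    return [(i / (n - 1),) * 3 for i in range(n)]


def rainbow_map(n):
    """n fully saturated hues from blue (hue 2/3) to red (hue 0)."""
    return [colorsys.hsv_to_rgb((2 / 3) * (1 - i / (n - 1)), 1.0, 1.0)
            for i in range(n)]


def is_monotonic(values):
    """True if the sequence never decreases or never increases."""
    pairs = list(zip(values, values[1:]))
    return all(a <= b for a, b in pairs) or all(a >= b for a, b in pairs)


grey_lum = [luminance(c) for c in grey_map(6)]
rainbow_lum = [luminance(c) for c in rainbow_map(6)]

print(is_monotonic(grey_lum))     # True: brightness grows steadily
print(is_monotonic(rainbow_lum))  # False: brightness rises, then falls
```

Under this approximation the grey ramp changes brightness in one direction only, whereas the rainbow ramp peaks around green and yellow and drops again towards red, so brightness alone does not suggest an order.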

Process mining tools such as Minit [14] and ProM [2] (see Note 1) use visualisation extensively to display values related to activities’ performance metrics, such as the frequency with which tasks are carried out (see, e.g., the inductive visual miner of ProM [10]). Recently, BPMSs such as Camunda (see Note 2) have also begun to show measured metrics on process models. However, little has been done in research and practice to visualise more than one performance metric. Visualising two or more metrics at once can prove beneficial because the user can identify patterns and relationships, as happens with mosaic plots, a multivariate visualisation for categorical data in statistics [16]. The following section clarifies this assumption with a use-case example.

3 Outline of the Approach

To illustrate our approach, a student loan application process extracted from [5] is used (Fig. 3). We assume that the process has been completed 100,000 times. The process starts when a loan application is received. First, the application is registered and then the applicant’s creditworthiness is checked. Then, the application is either conditionally approved or approved. “Conditionally approve student loan” has been completed 80,000 times and “Approve student loan” 20,000 times. Finally, the complex activity “Sign loan” is activated and the process is completed. The box plots of the simulated activity durations in days are reported in Fig. 4.

Table 1. Colour codes for categories $ c_i $

In our approach, the first step is to identify the variables that are of interest for the purpose of the analysis. In this example, we consider (i) the number of times the activities were executed, namely their frequency, and (ii) the time the activities need to complete. In addition, we want to show outliers with respect to time, to point out where exceptionally long- or short-lasting tasks took place in relation to the others. To detect the outliers, we classify the registered absolute time values into N categories for every activity, based upon the corresponding quantiles. In the following, we will refer to these categories as $ c_i $ where $ c_i \in \{c_1, \ldots , c_N\} $. $ c_i $ is the category of values between the $(i-1)$-th and the $i$-th quantile. $ c_1 $ and $ c_N $ refer to the outliers. In our example, we consider $N=6$.
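The classification step could be sketched in Python as follows. This is a sketch under stated assumptions rather than the procedure used in the paper: the text does not say which probabilities define the quantiles, so the cut points in `PROBS` are hypothetical, as are the function names, the linear interpolation rule and the toy data.

```python
from bisect import bisect_left


def quantile(sorted_values, p):
    """p-quantile (0 <= p <= 1) of pre-sorted data, by linear interpolation."""
    pos = p * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


def category_bounds(durations, probs):
    """Upper bounds lambda_1..lambda_N of the categories c_1..c_N."""
    data = sorted(durations)
    return [quantile(data, p) for p in probs]


def categorise(duration, bounds):
    """1-based index i of the category c_i that contains `duration`."""
    return min(bisect_left(bounds, duration), len(bounds) - 1) + 1


# Hypothetical cut points for N = 6: c_1 and c_6 hold the 5% shortest and
# the 5% longest executions (an assumption, not stated in the paper).
PROBS = [0.05, 0.25, 0.50, 0.75, 0.95, 1.00]

durations = [float(d) for d in range(1, 102)]  # toy data: 1.0 .. 101.0 days
bounds = category_bounds(durations, PROBS)

print(categorise(3.0, bounds))    # 1: a short-lasting outlier
print(categorise(50.0, bounds))   # 3
print(categorise(100.0, bounds))  # 6: a long-lasting outlier
```

Each duration falls into the first category whose upper bound is not smaller than it, so $ c_i $ covers the values above the $(i-1)$-th bound and up to the $i$-th bound.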

As previously stated, maintaining the consistency between the underlying structure of the data and the visual representation is essential. In our example, both duration quantiles and frequency are data for which a total order exists. To depict their values, we therefore choose two visual variables that allow for a perceptual ordering: the grey colour map to encode quantiles and the size to encode frequency. A third visual variable is implicitly considered because the information is displayed on the process model, hence the additional parameter is the activity for which the metrics are measured. In our example, a radial representation of the data is overlaid on the activity boxes of the model to that end.

Table 1 lists the colour codes assigned to $ c_i $. In the following, we provide an example of how the described categories $ c_i $ can be visually translated into the diameter of circles over activities, taking into account the execution frequency. Since the information is displayed on top of a process model, the maximum allowed diameter has to be pre-calculated on the basis of the box size of the activity label containers, for the sake of readability. We name this parameter $\bar{d}$. For example, assume that the maximum allowed diameter is equal to 180 units (see Note 3). The diameter $d$ chosen for an activity should not exceed the activity box. We recall that the diameter of the circles here represents the frequency of executed activities. Therefore, we scale it by the maximum frequency among all the activities in the process (in this example, 100,000). For the activity “Approve student loan”, e.g., we have $d = \frac{20000}{100000} \cdot \bar{d} = 0.2 \cdot 180 = 36$ units. For “Check debts”, instead, $d = 1.0 \cdot 180 = 180$ units. The following equation is then used to determine the diameter $d_{c_i}$ of every category $c_i$.

$$ d_{c_{i}} = \frac{\lambda_{i} - a}{b - a} \cdot d \tag{1} $$

where $ \lambda _{i} $ is the upper bound of category $ c_{i} $, i.e., the i-th quantile, a is the minimum activity duration (i.e., the 0-th quantile), and b is the maximum activity duration (i.e., the 6-th quantile). The formula applied to activity “Check debts”, e.g., results in the following diameters:

$ d_{c_2} = \frac{57.66257 - 40.69231}{71.9848 - 40.69231} \cdot 180 = 97.63478 $
$ d_{c_3} = \frac{64.61496 - 40.69231}{71.9848 - 40.69231} \cdot 180 = 137.6078 $
$ d_{c_4} = \frac{67.28645 - 40.69231}{71.9848 - 40.69231} \cdot 180 = 152.9745 $
$ d_{c_5} = \frac{69.24977 - 40.69231}{71.9848 - 40.69231} \cdot 180 = 164.2678 $

The results are depicted in Fig. 5. Observe that only four diameters were calculated because the diameter for the last category is always equal to d. Repeating these calculations for each activity and projecting the results on the process model leads to the result drawn in Fig. 6. Examining the figure, the outliers can be easily identified by looking for the areas of lowest and highest brightness. Both activities “Check debts” and “Sign loan” present outliers, but the latter stands out for its ratio of long-lasting executions, as opposed to the former. However, the frequency plays no role in that, as can be seen from the matching diameters of the superimposed circles.
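The two scaling steps, first by frequency and then by Eq. (1), can be reproduced with a short Python sketch. The function names are hypothetical. The quantile values are the rounded figures quoted for “Check debts”; the first quantile is not quoted in the text and is therefore left out. Because the inputs are rounded, the recomputed diameters may differ from the printed ones in the later decimals.

```python
def activity_diameter(frequency, max_frequency, d_bar):
    """Outer diameter d of an activity, scaled by its relative frequency."""
    return frequency / max_frequency * d_bar


def category_diameters(bounds, minimum, maximum, d):
    """Diameter d_{c_i} for each upper bound lambda_i, following Eq. (1)."""
    return [(lam - minimum) / (maximum - minimum) * d for lam in bounds]


D_BAR = 180  # maximum allowed diameter, in display units

d_approve = activity_diameter(20_000, 100_000, D_BAR)   # "Approve student loan"
d_check = activity_diameter(100_000, 100_000, D_BAR)    # "Check debts"

# Quantiles of "Check debts" as quoted in the text: 2nd to 5th, then the maximum.
A, B = 40.69231, 71.9848
check_debts_bounds = [57.66257, 64.61496, 67.28645, 69.24977, B]

diameters = category_diameters(check_debts_bounds, A, B, d_check)
for i, diameter in enumerate(diameters, start=2):
    print(f"d_c{i} = {diameter:.2f}")
```

The last value equals d by construction, since the upper bound of the last category is the maximum duration b. Drawing the circles from the largest to the smallest diameter, each in the grey level of its category, would then yield concentric rings like those described for Fig. 5.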

4 Conclusion

This paper has positioned our research endeavour in the visualisation of business process analytics using general visualisation principles based on theory and empirical evidence. In this context, an example has been proposed in which the activities’ execution times and their frequency are visualised simultaneously on a process model. Beyond the proposed example, a multi-parametric visualisation might be improved by considering additional parameters, e.g., a cost matrix depending on actual costs from accounting, or the extent to which a category is considered to be the least favourable to the business purposes. This matrix can then have an influence on the visualisation, e.g., by scaling the size of the graphical elements or modifying the colour scheme.

In our future research, we aim at implementing a prototype that applies those principles in practice, so as to perform experiments on case studies with researchers and practitioners in the area. Theoretical concepts to compute Process Performance Indicators (PPIs) on the basis of registered process data have recently been proposed in [8]. We will work to integrate the metrics devised in [8] with our approach. Studies on the influence of virtual and augmented reality on visualisation, and on how BPM can benefit from these new technologies, are in our future plans too, also in the light of the recent advancements in the area [6, 13].

Notes

1. www.minitlabs.com, www.promtools.org.
2. https://docs.camunda.org/manual/7.6/webapps/cockpit/bpmn/process-history-views/#heatmap.
3. By “unit” we mean any display or printing unit of measurement, such as pixels, centimetres, and the like.

References

1. van der Aalst, W.M.P.: Data scientist: the engineer of the future. In: Mertins, K., Bénaben, F., Poler, R., Bourrières, J.-P. (eds.) Enterprise Interoperability VI. PIC, vol. 7, pp. 13–26. Springer, Cham (2014). doi:10.1007/978-3-319-04948-9_2
2. van der Aalst, W.M.P., van Dongen, B.F., Günther, C.W., Rozinat, A., Verbeek, E., Weijters, T.: ProM: the process mining toolkit. In: BPM (Demos). CEUR Workshop Proceedings, vol. 489. CEUR-WS.org (2009)
3. Bertin, J.: Semiology of Graphics. University of Wisconsin Press, Madison (1983)
4. Borland, D., Taylor, R.M.: Rainbow color map (still) considered harmful. IEEE Comput. Graph. Appl. 27, 14–17 (2007)
5. Dumas, M., La Rosa, M., Mendling, J., Reijers, H.A.: Fundamentals of Business Process Management. Springer Publishing Company, Incorporated (2013)
6. Filonik, D., Rittenbruch, M., Foth, M.: DataChopin - designing interactions for visualisation composition in a co-located, cooperative environment. In: Luo, Y. (ed.) CDVE 2016. LNCS, vol. 9929, pp. 126–133. Springer, Cham (2016). doi:10.1007/978-3-319-46771-9_17
7. Goolkasian, P.: Pictures, words, and sounds: from which format are we best able to reason? J. Gen. Psychol. 127(4), 439–459 (2000)
8. Kis, I., Bachhofner, S., Di Ciccio, C., Mendling, J.: Towards a data-driven framework for measuring process performance. In: BPMDS (2017)
9. Lamping, J., Rao, R.: The hyperbolic browser: a focus+context technique for visualizing large hierarchies. J. Vis. Lang. Comput. 7(1), 33–55 (1996)
10. Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Process and deviation exploration with inductive visual miner. In: BPM (Demos), vol. 1295, p. 46 (2014)
11. Lidwell, W., Holden, K., Butler, J.: Universal Principles of Design: A Cross-Disciplinary Reference. Rockport Publishers, Gloucester (2003)
12. Moody, D.: The physics of notations: toward a scientific basis for constructing visual notations in software engineering. IEEE Trans. Softw. Eng. 35(6), 756–779 (2009)
13. Poppe, E., Brown, R., Recker, J., Johnson, D., Vanderfeesten, I.: Design and evaluation of virtual environments mechanisms to support remote collaboration on complex process diagrams. Inform. Syst. 66, 59–81 (2017)
14. Puchovsky, M., Di Ciccio, C., Mendling, J.: A case study on the business benefits of automated process discovery. In: SIMPDA, pp. 35–49 (2016)
15. Rogowitz, B.E., Treinish, L.A., Bryson, S.: How not to lie with visualization. Comput. Phys. 10(3), 268–273 (1996)
16. Theus, M.: Mosaic plots. Wiley Interdisciplinary Reviews: Computational Statistics 4(2), 191–198 (2012)
17. Tory, M., Möller, T.: Human factors in visualization research. IEEE Trans. Vis. Comput. Graph. 10(1), 72–84 (2004)
18. Turetken, O., Schuff, D., Sharda, R., Ow, T.T.: Supporting systems analysis and design through fisheye views. Commun. ACM 47(9), 72–77 (2004)


Author information

Authors and Affiliations

Vienna University of Economics and Business, Vienna, Austria
Stefan Bachhofner, Isabella Kis, Claudio Di Ciccio & Jan Mendling

Corresponding author

Correspondence to Stefan Bachhofner.

Editor information

Editors and Affiliations

Universität Duisburg-Essen, Essen, Germany
Andreas Metzger
University of Skövde, Skövde, Sweden
Anne Persson

About this paper

Cite this paper

Bachhofner, S., Kis, I., Di Ciccio, C., Mendling, J. (2017). Towards a Multi-parametric Visualisation Approach for Business Process Analytics. In: Metzger, A., Persson, A. (eds) Advanced Information Systems Engineering Workshops. CAiSE 2017. Lecture Notes in Business Information Processing, vol 286. Springer, Cham. https://doi.org/10.1007/978-3-319-60048-2_8

DOI: https://doi.org/10.1007/978-3-319-60048-2_8
Published: 02 June 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60047-5
Online ISBN: 978-3-319-60048-2
eBook Packages: Computer Science, Computer Science (R0)
