Skip to main content

Parallelization of Transition Counting for Process Mining on Multi-core CPUs and GPUs

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 281))

Abstract

Many process mining tools and techniques produce output models based on the counting of transitions between tasks or users in an event log. Although this counting can be performed in a forward pass through the event log, when analyzing large event logs according to different perspectives it may become impractical or time-consuming to perform multiple such passes. In this work, we show how transition counting can be parallelized by taking advantage of CPU multi-threading and GPU-accelerated computing. We describe the parallelization strategies, together with a set of experiments to illustrate the performance gains that can be expected with such parallelizations.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.win.tue.nl/bpi/doku.php?id=2016:challenge.

  2. 2.

    https://data.3tu.nl/repository/uuid:9b99a146-51b5-48df-aa70-288a76c82ec4.

References

  1. van der Aalst, W.M.P., Weijters, A.J.M.M., Maruster, L.: Workflow mining: Discovering process models from event logs. IEEE Trans. Knowl. Data Eng. 16, 1128–1142 (2004)

    Article  Google Scholar 

  2. Weijters, A.J.M.M., van der Aalst, W.M.P., de Medeiros, A.K.A.: Process mining with the HeuristicsMiner algorithm. Technical Report WP 166, Eindhoven University of Technology (2006)

    Google Scholar 

  3. Günther, C.W., Rozinat, A.: Disco: Discover your processes. In: BPM 2012 Demonstration Track, CEUR Workshop Proceedings, Vol. 940 (2012)

    Google Scholar 

  4. van der Aalst, W.M.P., Song, M.: Mining social networks: Uncovering interaction patterns in business processes. In: Desel, J., Pernici, B., Weske, M. (eds.) BPM 2004. LNCS, vol. 3080, pp. 244–260. Springer, Heidelberg (2004). doi:10.1007/978-3-540-25970-1_16

    Chapter  Google Scholar 

  5. van der Aalst, P.W.M., Reijers, A.H., Song, M.: Discovering social networks from event logs. Comput. Support. Coop. Work 14(6), 549–593 (2005)

    Article  Google Scholar 

  6. Rauber, T., Rünger, G.: Parallel Programming for Multicore and Cluster Systems. Springer, Heidelberg (2013)

    MATH  Google Scholar 

  7. Veiga, G.M., Ferreira, D.R.: Understanding spaghetti models with sequence clustering for ProM. In: Rinderle-Ma, S., Sadiq, S., Leymann, F. (eds.) BPM 2009. LNBIP, vol. 43, pp. 92–103. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12186-9_10

    Chapter  Google Scholar 

  8. Kundra, D., Juneja, P., Sureka, A.: Vidushi: Parallel implementation of alpha miner algorithm and performance analysis on CPU and GPU architecture. In: Reichert, M., Reijers, H.A. (eds.) BPM 2015. LNBIP, vol. 256, pp. 230–241. Springer, Cham (2016). doi:10.1007/978-3-319-42887-1_19

    Chapter  Google Scholar 

  9. Butenhof, D.R.: Programming with POSIX Threads. Addison-Wesley, Reading (1997)

    Google Scholar 

  10. Nickolls, J., Buck, I., Garland, M., Skadron, K.: Scalable parallel programming with CUDA. ACM Queue 6(2), 40–53 (2008)

    Article  Google Scholar 

  11. Ferreira, D.R., Vasilyev, E.: Using logical decision trees to discover the cause of process delays from event logs. Comput. Ind. 70, 194–207 (2015)

    Article  Google Scholar 

  12. van Dongen, B.F., van Der Aalst, W.M.P.: A meta model for process mining data. In: EMOI-INTEROP 2005, CEUR Workshop Proceedings, Vol. 160 (2005)

    Google Scholar 

  13. Verbeek, H.M.W., Buijs, J.C.A.M., van Dongen, B.F., Aalst, W.M.P.: XES, XESame, and ProM 6. In: Soffer, P., Proper, E. (eds.) CAiSE Forum 2010. LNBIP, vol. 72, pp. 60–75. Springer, Heidelberg (2011). doi:10.1007/978-3-642-17722-4_5

    Chapter  Google Scholar 

  14. Magro, W., Petersen, P., Shah, S.: Hyper-threading technology: Impact on compute-intensive workloads. Intel Technol. J. 6(1), 1–9 (2002)

    Google Scholar 

  15. Nickolls, J., Dally, W.J.: The GPU computing era. IEEE Micro 30(2), 56–69 (2010)

    Article  Google Scholar 

  16. Bell, N., Hoberock, J.: Thrust: A productivity-oriented library for CUDA. In: GPU Computing Gems, 359–371. Jade Edition. Morgan Kaufmann (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Diogo R. Ferreira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Ferreira, D.R., Santos, R.M. (2017). Parallelization of Transition Counting for Process Mining on Multi-core CPUs and GPUs. In: Dumas, M., Fantinato, M. (eds) Business Process Management Workshops. BPM 2016. Lecture Notes in Business Information Processing, vol 281. Springer, Cham. https://doi.org/10.1007/978-3-319-58457-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58457-7_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58456-0

  • Online ISBN: 978-3-319-58457-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics