Skip to main content

Ontology-Based Data Access for Extracting Event Logs from Legacy Data: The onprom Tool and Methodology

  • Conference paper
  • First Online:
Business Information Systems (BIS 2017)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 288))

Included in the following conference series:

Abstract

Process mining aims at discovering, monitoring, and improving business processes by extracting knowledge from event logs. In this respect, process mining can be applied only if there are proper event logs that are compatible with accepted standards, such as extensible event stream (XES). Unfortunately, in many real world set-ups, such event logs are not explicitly given, but instead are implicitly represented in legacy information systems. In this work, we exploit a framework and associated methodology for the extraction of XES event logs from relational data sources that we have recently introduced. Our approach is based on describing logs by means of suitable annotations of a conceptual model of the available data, and builds on the ontology-based data access (OBDA) paradigm for the actual log extraction. Making use of a real-world case study in the services domain, we compare our novel approach with a more traditional extract-transform-load based one, and are able to illustrate its added value. We also present a set of tools that we have developed and that support the OBDA-based log extraction framework. The tools are integrated as plugins of the ProM process mining suite.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://tinyurl.com/ovedwx4.

  2. 2.

    https://fluxicon.com/disco/.

  3. 3.

    http://www.celonis.de/en/.

  4. 4.

    http://www.minitlabs.com.

  5. 5.

    http://www.ebitmax.it.

  6. 6.

    http://www.markas.com/en/home.html.

  7. 7.

    http://www.bpmn.org/.

  8. 8.

    https://fluxicon.com/disco/.

  9. 9.

    In W3C terminology, a profile is a sublanguage.

  10. 10.

    It is important to notice that the possible absence of an actual value for Order_Date does not contrast with the class diagram of Fig. 2, which dictates that every purchase order has exactly one creation time. In fact, conceptual models are interpreted under incomplete information: the absence of the creation date for an order does not mean that the order has no creation date, but that such an order has a creation date that is not certainly known.

  11. 11.

    http://ontop.inf.unibz.it/.

  12. 12.

    In the left-hand side of a mapping, curly brackets are used to denote answer variables of the SQL query in the right-hand side.

  13. 13.

    In OWL terms, it is a data property.

  14. 14.

    http://www.promtools.org/.

  15. 15.

    http://optique-project.eu.

References

  1. van der Aalst, W.M.P., et al.: Process mining manifesto. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011. LNBIP, vol. 99, pp. 169–194. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28108-2_19

    Chapter  Google Scholar 

  2. van der Aalst, W.M.P.: Process Mining - Data Science in Action, 2nd edn. Springer, Heidelberg (2016)

    Book  Google Scholar 

  3. IEEE Computational Intelligence Society: IEEE Standard for eXtensible Event Stream (XES) for Achieving Interoperability in Event Logs and Event Streams. IEEE Std 1849-2016, i–50 (2016)

    Google Scholar 

  4. Verbeek, H.M.W., Buijs, J.C.A.M., Dongen, B.F., van der Aalst, W.M.P.: XES, XESame, and ProM 6. In: Soffer, P., Proper, E. (eds.) CAiSE Forum 2010. LNBIP, vol. 72, pp. 60–75. Springer, Heidelberg (2011). doi:10.1007/978-3-642-17722-4_5

    Chapter  Google Scholar 

  5. Günther, C.W., van der Aalst, W.M.P.: A generic import framework for process event logs. In: Eder, J., Dustdar, S. (eds.) BPM 2006. LNCS, vol. 4103, pp. 81–92. Springer, Heidelberg (2006). doi:10.1007/11837862_10

    Chapter  Google Scholar 

  6. van der Aalst, W.M.P.: Extracting event data from databases to unleash process mining. In: vom Brocke, J., Schmiedel, T. (eds.) BPM - Driving Innovation in a Digital World. Management for Professionals, pp. 105–128. Springer, Cham (2015). doi:10.1007/978-3-319-14430-6_8

    Google Scholar 

  7. Syamsiyah, A., van Dongen, B.F., van der Aalst, W.M.P.: DB-XES: enabling process discovery in the large. In: Proceedings of the 6th International Symposium on Data-Driven Process Discovery and Analysis (SIMPDA). CEUR, vol. 1757, pp. 63–77. ceur-ws.org (2016)

    Google Scholar 

  8. Calvanese, D., Montali, M., Syamsiyah, A., van der Aalst, W.M.P.: Ontology-driven extraction of event logs from relational databases. In: Reichert, M., Reijers, H.A. (eds.) BPM 2015. LNBIP, vol. 256, pp. 140–153. Springer, Cham (2016). doi:10.1007/978-3-319-42887-1_12

    Chapter  Google Scholar 

  9. Poggi, A., Lembo, D., Calvanese, D., Giacomo, G., Lenzerini, M., Rosati, R.: Linking data to ontologies. In: Spaccapietra, S. (ed.) Journal on Data Semantics X. LNCS, vol. 4900, pp. 133–173. Springer, Heidelberg (2008). doi:10.1007/978-3-540-77688-8_5

    Chapter  Google Scholar 

  10. Calvanese, D., Giacomo, G., Lembo, D., Lenzerini, M., Poggi, A., Rodriguez-Muro, M., Rosati, R.: Ontologies and databases: the DL-Lite approach. In: Tessaris, S., Franconi, E., Eiter, T., Gutierrez, C., Handschuh, S., Rousset, M.-C., Schmidt, R.A. (eds.) Reasoning Web 2009. LNCS, vol. 5689, pp. 255–356. Springer, Heidelberg (2009). doi:10.1007/978-3-642-03754-2_7

    Chapter  Google Scholar 

  11. Calvanese, D., Cogrel, B., Komla-Ebri, S., Kontchakov, R., Lanti, D., Rezk, M., Rodriguez-Muro, M., Xiao, G.: Ontop: answering SPARQL queries over relational databases. Semant. Web J. 8(3), 471–487 (2017). doi:10.3233/SW-160217

    Article  Google Scholar 

  12. Motik, B., Cuenca Grau, B., Horrocks, I., Wu, Z., Fokoue, A., Lutz, C.: OWL 2 Web Ontology Language profiles, 2nd edn. W3C Recommendation, W3C, December 2012. http://www.w3.org/TR/owl2-profiles/

  13. Antonioli, N., Castanò, F., Coletta, S., Grossi, S., Lembo, D., Lenzerini, M., Poggi, A., Virardi, E., Castracane, P.: Ontology-based data management for the Italian public debt. In: Proceedings of FOIS. Frontiers in Artificial Intelligence and Applications, vol. 267, pp. 372–385. IOS Press (2014)

    Google Scholar 

  14. Jiménez-Ruiz, E., Kharlamov, E., Zheleznyakov, D., Horrocks, I., Pinkel, C., Skjæveland, M.G., Thorstensen, E., Mora, J.: BootOX: bootstrapping OWL 2 ontologies and R2RML mappings from relational databases. In: Proceedings of ISWC Posters & Demonstrations Track. CEUR, vol. 1486. ceur-ws.org (2015)

    Google Scholar 

Download references

Acknowledgement

This research has been partially supported by the Euregio IPN12 “KAOS: Knowledge-Aware Operational Support” project, which is funded by the “European Region Tyrol-South Tyrol-Trentino” (EGTC) under the first call for basic research projects and by the UNIBZ internal project “OnProm”. We thank Ario Santoso for the development of the log extraction plug-in of onprom, and Wil van der Aalst for the interesting discussions and insights on the problem of extracting event logs from legacy information systems.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tahir Emre Kalayci .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Calvanese, D., Kalayci, T.E., Montali, M., Tinella, S. (2017). Ontology-Based Data Access for Extracting Event Logs from Legacy Data: The onprom Tool and Methodology. In: Abramowicz, W. (eds) Business Information Systems. BIS 2017. Lecture Notes in Business Information Processing, vol 288. Springer, Cham. https://doi.org/10.1007/978-3-319-59336-4_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-59336-4_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-59335-7

  • Online ISBN: 978-3-319-59336-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics