Abstract
Discovering good process models is essential for different process analysis tasks such as conformance checking and process improvements. Automated process discovery methods often overlook valuable domain knowledge. This knowledge, including insights from domain experts and detailed process documentation, remains largely untapped during process discovery. This paper leverages Large Language Models (LLMs) to integrate such knowledge directly into process discovery. We use rules derived from LLMs to guide model construction, ensuring alignment with both domain knowledge and actual process executions. By integrating LLMs, we create a bridge between process knowledge expressed in natural language and the discovery of robust process models, advancing process discovery methodologies significantly. To showcase the usability of our framework, we conducted a case study with the UWV employee insurance agency, demonstrating its practical benefits and effectiveness.
This research was supported by the research training group “Dataninja” (Trustworthy AI for Seamless Problem Solving: Next Generation Intelligence Joins Robust Data Analysis) funded by the German federal state of North Rhine-Westphalia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
van der Aa, H., Ciccio, C.D., Leopold, H., Reijers, H.A.: Extracting declarative process models from natural language. In: Advanced Information Systems Engineering - 31st International Conference, CAiSE 2019. Lecture Notes in Computer Science, vol. 11483, pp. 365–382. Springer (2019)
van der Aa, H., Rebmann, A., Leopold, H.: Natural language-based detection of semantic execution anomalies in event logs. Inf. Syst. 102, 101824 (2021)
Augusto, A., et al.: Automated discovery of process models from event logs: review and benchmark. IEEE Trans. Knowl. Data Eng. 31(4), 686–705 (2019)
Dixit, P.M., Verbeek, H.M.W., Buijs, J.C.A.M., van der Aalst, W.M.P.: Interactive data-driven process model construction. In: Conceptual Modeling - 37th International Conference, ER 2018. Lecture Notes in Computer Science, vol. 11157, pp. 251–265. Springer (2018)
van Eck, M.L., Lu, X., Leemans, S.J.J., van der Aalst, W.M.P.: PM\(\hat{\,}\)2: a process mining project methodology. In: Advanced Information Systems Engineering - 27th International Conference, CAiSE 2015. Lecture Notes in Computer Science, vol. 9097, pp. 297–313. Springer (2015)
Grohs, M., Abb, L., Elsayed, N., Rehse, J.: Large language models can accomplish business process management tasks. In: Business Process Management Workshops - BPM 2023 International Workshops, Revised Selected Papers. Lecture Notes in Business Information Processing, vol. 492, pp. 453–465. Springer (2023)
Huang, L., et al.: A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions. CoRR abs/2311.05232 (2023)
Klievtsova, N., Benzin, J., Kampik, T., Mangler, J., Rinderle-Ma, S.: Conversational process modelling: state of the art, applications, and implications in practice. In: Business Process Management Forum - BPM 2023 Forum. Lecture Notes in Business Information Processing, vol. 490, pp. 319–336. Springer (2023)
Kourani, H., Berti, A., Schuster, D., van der Aalst, W.M.P.: Process modeling with large language models. In: Enterprise, Business-Process and Information Systems Modeling - 25th International Conference, BPMDS 2024, and 29th International Conference, EMMSAD 2024. Lecture Notes in Business Information Processing, vol. 511, pp. 229–244. Springer (2024)
Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs containing infrequent behaviour. In: Business Process Management Workshops - BPM 2013 International Workshops, Revised Papers. Lecture Notes in Business Information Processing, vol. 171, pp. 66–78. Springer, Cham (2013)
Maggi, F.M., Bose, R.P.J.C., van der Aalst, W.M.P.: Efficient discovery of understandable declarative process models from event logs. In: Advanced Information Systems Engineering - 24th International Conference, CAiSE 2012. Lecture Notes in Computer Science, vol. 7328, pp. 270–285. Springer (2012)
Norouzifar, A., Dees, M., van der Aalst, W.M.P.: Imposing rules in process discovery: an inductive mining approach. In: Research Challenges in Information Science - 18th International Conference, RCIS 2024, Part I. Lecture Notes in Business Information Processing, vol. 513, pp. 220–236. Springer (2024)
Polyvyanyy, A., van der Aalst, W.M.P., ter Hofstede, A.H.M., Wynn, M.T.: Impact-driven process model repair. ACM Trans. Softw. Eng. Methodol. 25(4), 28:1–28:60 (2017)
Schuster, D., van Zelst, S.J., van der Aalst, W.M.P.: Utilizing domain knowledge in data-driven process discovery: a literature review. Comput. Ind. 137, 103612 (2022)
Schuster, D., van Zelst, S.J., van der Aalst, W.M.P.: Cortado: a dedicated process mining tool for interactive process discovery. SoftwareX 22, 101373 (2023)
Vidgof, M., Bachhofner, S., Mendling, J.: Large language models for business process management: Opportunities and challenges. In: Business Process Management Forum - BPM 2023 Forum. Lecture Notes in Business Information Processing, vol. 490, pp. 107–123. Springer (2023)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Norouzifar, A., Kourani, H., Dees, M., van der Aalst, W.M.P. (2025). Bridging Domain Knowledge and Process Discovery Using Large Language Models. In: Gdowska, K., Gómez-López, M.T., Rehse, JR. (eds) Business Process Management Workshops. BPM 2024. Lecture Notes in Business Information Processing, vol 534. Springer, Cham. https://doi.org/10.1007/978-3-031-78666-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-031-78666-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-78665-5
Online ISBN: 978-3-031-78666-2
eBook Packages: Computer ScienceComputer Science (R0)