Abstract
The automation of sequences of repetitive actions performed by human operators in interacting with software applications is crucial to prevent work from being perceived as alienating and boring. Robot applications can automatise these sequences once they have been identified. In this paper, we propose a two-step approach to mine sequences of actions that could be automated from log data produced by the interactions of a human operator with specific software applications. Since the number of possible sequences may be very high and not all the sequences are interesting to be automatised, we focus our mining process on sequences that meet precise patterns. First, Frequent Episode Mining algorithms are applied for extracting all the sequences of actions that occur with at least a minimum frequency. Then, we exploit fuzzy string matching based on the Levenshtein distance for filtering out the sequences that do not match established patterns. We evaluate the effectiveness of the approach using a benchmark dataset and present a case study on a real-world dataset of activity logs generated in the context of the AUTOMIA project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
References
Fournier-Viger, P., Chen, Y., Nouioua, F., Lin, J.C.-W.: Mining partially-ordered episode rules in an event sequence. In: Asian Conference on Intelligent Information and Database Systems, pp. 3–15. Springer (2021)
Fournier-Viger, P., et al.: The SPMF open-source data mining library version 2. In: Berendt, B., et al. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9853, pp. 36–40. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46131-1_8
Fournier-Viger, P., Yang, P., Lin, J.C.-W., Yun, U.: HUE-Span: fast high utility episode mining. In: Li, J., Wang, S., Qin, S., Li, X., Wang, S. (eds.) ADMA 2019. LNCS (LNAI), vol. 11888, pp. 169–184. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-35231-8_12
Gervasio, M., Myers, K.: Learning procedures by augmenting sequential pattern mining with planning knowledge. In: Proceedings of the 8th Annual Conference on Advances in Cognitive Systems (2020)
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Disc. 15(1), 55–86 (2007)
Huang, K.-Y., Chang, C.-H.: Efficient mining of frequent episodes from complex sequences. Inf. Syst. 33(1), 96–114 (2008)
Le, B., Duong, H., Truong, T., Fournier-Viger, P.: Fclosm, fgensm: two efficient algorithms for mining frequent closed and generator sequences using the local pruning strategy. Knowl. Inf. Syst. 53(1), 71–107 (2017)
Lenci, A.: Distributional semantics in linguistic and cognitive research. Italian J. Linguist. 20(1), 1–31 (2008)
Mannila, H., Toivonen, H., Inkeri Verkamo, A.: Discovery of frequent episodes in event sequences. Data Min. Knowl. Disc. 1(3), 259–289 (1997)
Pokou, Y.J.M., Fournier-Viger, P., Moghrabi, C.: Authorship attribution using small sets of frequent part-of-speech skip-grams. In: The Twenty-Ninth International Flairs Conference (2016)
Raissi, C., Poncelet, P., Teisseire, M.: Speed: mining maxirnal sequential patterns over data strearns. In: 2006 3rd International IEEE Conference Intelligent Systems, pp. 546–552 (2006)
Rathore, S., Dawar, S., Goyal, V., Patel, D.: Top-k high utility episode mining from a complex event sequence. In: Proceedings of the 21st International Conference on Management of Data, Computer Society of India (2016)
Tsai, C.-F., Lin, W.-C., Ke, S.-W.: Big data mining with parallel computing: a comparison of distributed and mapreduce methodologies. J. Syst. Softw. 122, 83–92 (2016)
Ulitzsch, E., He, Q., Pohl, S.: Using sequence mining techniques for understanding incorrect behavioral patterns on interactive tasks. J. Educ. Behav. Stat. 47(1), 3–35 (2022)
Ulitzsch, E., et al.: Combining clickstream analyses and graph-modeled data clustering for identifying common response processes. Psychometrika 86(1), 190–214 (2021)
Wu, C.-W., Lin, Y.-F., Yu, P.S., Tseng, V.S.: Mining high utility episodes in complex event sequences. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 536–544 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Dell’Oglio, P., Bondielli, A., Bechini, A., Marcelloni, F. (2023). Leveraging Sequence Mining for Robot Process Automation. In: Abraham, A., Pllana, S., Casalino, G., Ma, K., Bajaj, A. (eds) Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 717. Springer, Cham. https://doi.org/10.1007/978-3-031-35510-3_22
Download citation
DOI: https://doi.org/10.1007/978-3-031-35510-3_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35509-7
Online ISBN: 978-3-031-35510-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)