Abstract
Sequential pattern mining (SPM) is widely used for data mining and knowledge discovery in various application domains, such as medicine, e-commerce, and the World Wide Web. There has been much work on improving the execution time of SPM or enriching it via considering the time interval between items in sequences. However, no study has evaluated the sequence pattern variant (SPV) that is the original sequence containing frequent patterns including variants, and studied the factors that lead to the variants. Such a study is meaningful for medical tasks such as improving the quality of a disease’s treatment method. This paper proposes methods for evaluating SPVs and understanding variant factors based on a statistical approach while considering the safety and efficiency of sequences and the relating static and dynamic information of the variants. Our proposal is confirmed to be effective by experimentally evaluating the electronic medical record system’s real dataset and feedback from medical workers.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
The Section of Medical Information at Faculty of Medicine University of Miyazaki Hospital. http://www.med.miyazaki-u.ac.jp/home/jyoho/. Accessed 19 June 2019
Various Information of Medical Fee, Ministry of Health, Labour and Welfare (Japan). http://www.iryohoken.go.jp/shinryohoshu/kaitei/. Accessed 19 June 2019
Fournier Viger, P., Lin, C.W., Rage, U., Koh, Y.S., Thomas, R.: A survey of sequential pattern mining. Data Sci. Pattern Recogn. 1, 54–77 (2017)
Garg, N., Agarwal, S.: Process mining for clinical workflows. In: Proceedings of the International Conference on Advances in Information Communication Technology and Computing. pp. 5:1–5:5 (2016)
Hirate, Y., Yamana, H.: Sequential pattern mining with time intervals. In: Ng, W.-K., Kitsuregawa, M., Li, J., Chang, K. (eds.) PAKDD 2006. LNCS (LNAI), vol. 3918, pp. 775–779. Springer, Heidelberg (2006). https://doi.org/10.1007/11731139_90
Honda, Y., Kushima, M., Yamazaki, T., Araki, K., Yokota, H.: Detection and visualization of variants in typical medical treatment sequences. In: Begoli, E., Wang, F., Luo, G. (eds.) DMAH 2017. LNCS, vol. 10494, pp. 88–101. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67186-4_8
Han, J., et al.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings of the 17th International Conference on Data Engineering, pp. 215–224 (2001)
Le, H.H., et al.: Fast generation of clinical pathways including time intervals in sequential pattern mining on electronic medical record systems. In: Proceedings the 4th International Conference on Computer Science and Computational Intelligent, pp. 1726–1731 (2017)
Le, H.H., Kushima, M., Araki, K., Yokota, H.: Differentially private sequential pattern mining considering time interval for electronic medical record systems. In: Proceedings of the 23rd International Database Engineering and Applications Symposium, pp. 95–103 (2019)
Mooney, C., Roddick, J.: Sequential pattern mining: approaches and algorithms. ACM Comput. Surv. 45, 19:1–19:39 (2013)
Raju, V.P., Varma, G.S.: Mining closed sequential patterns in large sequence databases. Int. J. Database Manage. Syst. 7(1), 29–39 (2015)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499 (1994)
System, C.C.: Denshi Karte System WATATUMI (Electronic Medical Records WATATUMI. http://www.corecreate.com/02_01_izanami.html. Accessed 19 June 2019
Uragaki, K., et al.: Sequential pattern mining on electronic medical records with handling time intervals and the efficacy of medicines. In: Proceedings of the 2016 IEEE Symposium on Computers and Communication, pp. 20–25. IEEE (2016)
Yan, X., Han, J., Afshar, R.: CloSpan: mining closed sequential patterns in large datasets. In: Proceedings of the International Conference on Data Mining, pp. 166–177 (2003)
Chen, Y.-L., Chiang, M.-C., Ko, M.-T.: Discovering time-interval sequential patterns in sequence databases. Expert Syst. Appl. 25(3), 343–354 (2003)
Acknowledgement
This research has been supported by Health Labour Sciences Research Grant (Ministry of Health, Labour and Welfare, Japan) and the Kayamori Foundation of Information Science Advancement.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Le, H.H. et al. (2019). Analyzing Sequence Pattern Variants in Sequential Pattern Mining and Its Application to Electronic Medical Record Systems. In: Hartmann, S., Küng, J., Chakravarthy, S., Anderst-Kotsis, G., Tjoa, A., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2019. Lecture Notes in Computer Science(), vol 11707. Springer, Cham. https://doi.org/10.1007/978-3-030-27618-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-030-27618-8_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27617-1
Online ISBN: 978-3-030-27618-8
eBook Packages: Computer ScienceComputer Science (R0)