Abstract
Sequential pattern mining is a practical problem whose objective is to discover helpful informative patterns in a stored database such as market transaction databases. It covers many applications in different areas. Recently, a study that improved the runtime for mining patterns was proposed. It was called pseudo-IDLists and it helps prevent duplicate data from replicating during the mining process. However, the idea only works for the special type of sequential patterns, which are clickstream patterns. Direct applying the idea for sequential pattern mining is not feasible. Hence, we proposed adaptions and changes to the novel idea and proposed SUI (Sequential pattern mining Using IDList), a sequential pattern mining algorithm based on pseudo-IDLists. Via experiments on three test databases, we show that SUI is efficient and effective regarding runtime and memory consumption.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the International Conference on Data Engineering (ICDE), pp. 3–14 (1995)
Van, T., Vo, B., Le, B.: Mining sequential patterns with itemset constraints. Knowl. Inf. Syst. 57(2), 311–330 (2018). https://doi.org/10.1007/s10115-018-1161-6
Huynh, B., Trinh, C., Huynh, H., Van, T.T., Vo, B., Snasel, V.: An efficient approach for mining sequential patterns using multiple threads on very large databases. Eng. Appl. Artif. Intell. 74, 242–251 (2018)
Gan, W., Lin, J.C.-W., Fournier-Viger, P., Chao, H.-C., Yu, P.S.: HUOPM: high-utility occupancy pattern mining. IEEE Trans. Cybern. 50, 1195–1208 (2020)
Ao, X., Luo, P., Wang, J., Zhuang, F., He, Q.: Mining precise-positioning episode rules from event sequences. IEEE Trans. Knowl. Data Eng. 30, 530–543 (2018)
Cule, B., Feremans, L., Goethals, B.: Efficiently mining cohesion-based patterns and rules in event sequences. Data Min. Knowl. Disc. 33(4), 1125–1182 (2019). https://doi.org/10.1007/s10618-019-00628-0
Rahman, M.M., Ahmed, C.F., Leung, C.K.-S.: Mining weighted frequent sequences in uncertain databases. Inf. Sci. (Ny) 479, 76–100 (2019)
Mudrick, N.V., Azevedo, R., Taub, M.: Integrating metacognitive judgments and eye movements using sequential pattern mining to understand processes underlying multimedia learning. Comput. Hum. Behav. 96, 223–234 (2019)
Lee, G.H., Han, H.S.: Clustering of tourist routes for individual tourists using sequential pattern mining. J. Supercomput. 76(7), 5364–5381 (2019). https://doi.org/10.1007/s11227-019-03010-5
Tarus, J.K., Niu, Z., Yousif, A.: A hybrid knowledge-based recommender system for e-learning based on ontology and sequential pattern mining. Futur. Gener. Comput. Syst. 72, 37–48 (2017)
Dalmas, B., Fournier-Viger, P., Norre, S.: TWINCLE: a constrained sequential rule mining algorithm for event logs. Procedia Comput. Sci. 112, 205–214 (2017)
Huynh, H.M., Nguyen, L.T.T., Vo, B., Nguyen, A., Tseng, V.S.: Efficient methods for mining weighted clickstream patterns. Expert Syst. Appl. 142, 112993 (2019)
Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0014140
Pei, J., Han, J., Chen, Q., Hsu, M.-C., Mortazavi-Asl, B., Pinto, H., Dayal, U.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings of the International Conference on Data Engineering (ICDE), pp. 215–224 (2001)
Zaki, M.J.: SPADE: An efficient algorithm for mining frequent sequences. Mach. Learn. 42, 31–60 (2001). https://doi.org/10.1023/A:1007652502315
Fournier-Viger, P., Gomariz, A., Campos, M., Thomas, R.: Fast vertical mining of sequential patterns using co-occurrence information. In: Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 40–52 (2014)
Huynh, H.M., Nguyen, L.T.T., Vo, B., Yun, U., Oplatková, Z.K., Hong, T.-P.: Efficient algorithms for mining clickstream patterns using pseudo-IDLists. Futur. Gener. Comput. Syst. 107, 18–30 (2020)
Fournier-Viger, P., Lin, J.C.W., Gomariz, A., Gueniche, T., Soltani, A., Deng, Z., Lam, H.T.: The SPMF open-source data mining library version 2. In: Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 36–40 (2016)
Acknowledgements
This work was supported by the Internal Grant Agency of Tomas Bata University under the Projects no. IGA/CebiaTech/2020/001. The work was further supported by resources of A.I.Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin (ailab.fai.utb.cz).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Huynh, H.M., Pham, N.N., Oplatková, Z.K., Nguyen, L.T.T., Vo, B. (2020). Sequential Pattern Mining Using IDLists. In: Nguyen, N.T., Hoang, B.H., Huynh, C.P., Hwang, D., Trawiński, B., Vossen, G. (eds) Computational Collective Intelligence. ICCCI 2020. Lecture Notes in Computer Science(), vol 12496. Springer, Cham. https://doi.org/10.1007/978-3-030-63007-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-63007-2_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63006-5
Online ISBN: 978-3-030-63007-2
eBook Packages: Computer ScienceComputer Science (R0)