Abstract
Multilevel, cross-level and sequential knowledge plays a significant role in our several real-life aspects including market basket analysis, bioinformatics, texts mining etc. Many researchers have proposed various approaches for mining hierarchical patterns. However, some of the existing approaches generate many multilevel and cross-level frequent patterns that fail to fetch quality information. It is extremely difficult to extract any meaningful information from these large number of redundant patterns. There exist some approaches that mines multilevel and cross-level closed patterns but unfortunately, there is no cross-level closed pattern mining method proposed yet which maintain the sequence of itemsets. In this paper, we develop an algorithm, called CCSP (Cross-level Closed Sequential Pattern mining) to conduct cross-level hierarchical patterns that provide maximal information. Our work has made contributions in mining patterns, which express the mixed relationship between the generalized and specialized view of the transaction itemsets. We have extensively evaluated our proposed algorithm’s efficiency using a variety of real-life datasets and performing a large number of experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, pp. 3–14. IEEE (1995)
Agrawal, R., Srikant, R., et al.: Fast algorithms for mining association rules. In: Proceeding 20th International Conference very large data bases VLDB, vol. 1215, pp. 487–499 (1994)
Ayres, J., Flannick, J., Gehrke, J., Yiu, T.: Sequential pattern mining using a bitmap representation. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 429–435. ACM (2002)
Cong, S., Han, J., Padua, D.: Parallel mining of closed sequential patterns. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pp. 562–567. ACM (2005)
Gautam, P., Pardasani, K.: A novel approach for discovery multi level fuzzy association rule mining. arXiv preprint arXiv:1003.4068 (2010)
Goethals, B., Zaki, M.J.: Advances in frequent itemset mining implementations: report on fimi’03. ACM SIGKDD Explor. Newsl. 6(1), 109–117 (2004)
Han, J., Fu, Y.: Discovery of multiple-level association rules from large databases. In: VLDB, vol. 95, pp. 420–431 (1995)
Han, J., Fu, Y.: Mining multiple-level association rules in large databases. IEEE Trans. Knowl. Data Eng. 11(5), 798–805 (1999)
Hashem, T., Ahmed, C.F., Samiullah, M., Akther, S., Jeong, B.S., Jeon, S.: An efficient approach for mining cross-level closed itemsets and minimal association rules using closed itemset lattices. Expert Syst. Appl. 41(6), 2914–2938 (2014)
Leung, C.W.K., Chan, S.C.F., Chung, F.I.: An empirical study of a cross-level association rule mining approach to cold-start recommendations. Knowl.-Based Syst. 21(7), 515–529 (2008)
Pei, J., Han, J., Mao, R., et al.: Closet. In: ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery. vol. 4, pp. 21–30 (2000)
Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: Mining sequential patterns by pattern-growth: the prefixspan approach. IEEE Trans. Knowl. Data Eng. 16(11), 1424–1440 (2004)
Pokou, Y.J.M., Fournier-Viger, P., Moghrabi, C.: Authorship attribution using small sets of frequent part-of-speech skip-grams. In: FLAIRS Conference, pp. 86–91 (2016)
Pramono, Y.W.T., et al. In: 2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA), pp. 203–208. IEEE (2014)
Shrivastava, V.K., Kumar, P., Pardasani, K.: Fp-tree and cofi based approach for mining of multiple level association rules in large databases. arXiv preprint arXiv:1003.1821 (2010)
Srikant, R., Agrawal, R.: Mining generalized association rules (1995)
Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0014140
Thakur, R.S., Jain, R.C., Pardasani, K.R., India, V.: Mining level-crossing association rules from large databases (2005)
Thakur, R., Jain, R., Pardasani, K.: Mining level-crossing association rules from large databases. J. Comput. Sci. 2(1), 76–81 (2006)
Wan, Y., Liang, Y., Ding, L.Y.: Mining multilevel association rules with dynamic concept hierarchy. In: 2008 International Conference on Machine Learning and Cybernetics, vol. 1, pp. 287–292. IEEE (2008)
Wang, J., Han, J.: Bide: Efficient mining of frequent closed sequences. In: Proceedings 20th International Conference on 2004 Data Engineering, pp. 79–90. IEEE (2004)
Wang, J., Han, J., Pei, J.: Closet+: searching for the best strategies for mining frequent closed itemsets. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 236–245. ACM (2003)
Yan, X., Han, J., Afshar, R.: Clospan: Mining: closed sequential patterns in large datasets. In: Proceedings of the 2003 SIAM International Conference on Data Mining, pp. 166–177. SIAM (2003)
Zaki, M.J.: Spade: an efficient algorithm for mining frequent sequences. Mach. learn. 42(1), 31–60 (2001)
Zaki, M.J., Hsiao, C.J.: Charm: an efficient algorithm for closed itemset mining. In: Proceedings of the 2002 SIAM International Conference on Data Mining, pp. 457–473. SIAM (2002)
Zaki, M.J., Hsiao, C.J.: Efficient algorithms for mining closed itemsets and their lattice structure. IEEE Trans. Knowl. Data Eng. 17(4), 462–478 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Aman, R., Ahmed, C.F. (2018). Mining Cross-Level Closed Sequential Patterns. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2018. Lecture Notes in Computer Science(), vol 10933. Springer, Cham. https://doi.org/10.1007/978-3-319-95786-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-95786-9_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95785-2
Online ISBN: 978-3-319-95786-9
eBook Packages: Computer ScienceComputer Science (R0)