Abstract
Pattern mining has been more important in the solution of various data mining jobs over the years. The extraction of common patterns was the primary focus of pattern mining research for a long period of time, with the mining of rare patterns being neglected. Rare pattern mining is becoming more popular as researchers recognize the importance of rare patterns. The hyper-linked data structure is suitable to store sparse data set in the main memory and enables dynamic adjustment of links during the mining process using recursion. However, a sequential approach to discovering rare patterns from a large dataset is inefficient. Hence a CUDA-based parallel algorithm has been implemented to discover rare itemsets. The algorithm is tested using dense and sparse datasets on a GPU. The GPU initialization time affects the time taken to discover rare itemsets. The time taken to transfer data between CPU and GPU is significantly large and the parallel implementation of an algorithm with a recursive approach is unsuitable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
“Universal Turing Machine”. https://en.wikipedia.org/wiki/Universal_Turing_machine
Padhy, N., Mishra, P., Panigrahi, R. The survey of data mining applications and feature scope. ArXiv, abs/1211.5723 (2012)
Han, J., Cheng, H., Xin, D., et al.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Disc. 15, 55–86 (2007)
Borah, A., Nath, B.: Rare pattern mining: challenges and future perspectives. Complex Intel. Syst. 5(1), 1–23 (2018). https://doi.org/10.1007/s40747-018-0085-9
Borgelt, C.: Frequent item set mining. WIREs Data Mining Knowl. Discov. 2, 437–456 (2012). https://doi.org/10.1002/widm.1074
Szathmary, L., Napoli, A., Valtchev, P.: Towards rare itemset mining. In: 2007 19th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2007, vol. 1, pp. 305–312. IEEE (2007)
Liu, B., Hsu, W., Ma, Y.: Mining association rules with multiple minimum supports. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 337–341. ACM (1999)
Adda, M., Wu, L., Feng, Y.: Rare itemset mining. In: 2007 Sixth International Conference on Machine Learning and Applications, ICMLA 2007, pp. 73–80. IEEE (2007)
Cuzzocrea, A., Dayal, U. (eds.): LNCS, vol. 6862. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23544-3
Bhatt, U., Patel, P.: A novel approach for finding rare items based on multiple minimum support framework. Proc. Comput. Sci. 57, 1088–1095 (2015)
Lu, Y., Richter, F., Seidl, T.: Efficient infrequent pattern mining using negative itemset tree. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z.W. (eds.) Complex Pattern Mining. SCI, vol. 880, pp. 1–16. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-36617-9_1
Darrab, S., Broneske, D., Saake, G.: RPP algorithm: a method for discovering interesting rare itemsets. In: Tan, Y., Shi, Y., Tuba, M. (eds.) Data Mining and Big Data: 5th International Conference, DMBD 2020, Belgrade, Serbia, July 14–20, 2020, Proceedings, pp. 14–25. Springer Singapore, Singapore (2020). https://doi.org/10.1007/978-981-15-7205-0_2
Kanimozhi Selvi, C.S., Tamilarasi, A.: Mining rare itemset with automated support thresholds. J. Comput. Sci. 7(3), 394–399 (2011). https://doi.org/10.3844/jcssp.2011.394.399
Cui, Y., Gan, W., Lin, H., Zheng, W.: FRI-miner: fuzzy rare itemset mining. Appl. Intell. 52, 3387–3402 (2021). https://doi.org/10.1007/s10489-021-02574-1
Jian, L., Wang, C., Liu, Y., et al.: Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA). J. Supercomput. 64, 942–967 (2013)
Adil, S.H., Qamar, S.: Implementation of association rule mining using CUDA. Int. Conf. Emerging Technol. 2009, 332–336 (2009). https://doi.org/10.1109/ICET.2009.5353149
“SPMF Datasets”. http://www.philippe-fournier-viger.com/spmf/index.php?link=datasets.php
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yadavalli, G., Rai, S. (2023). Discovery of Rare Itemsets Using Hyper-Linked Data Structure: A Parallel Approach. In: Prabhu, S., Pokhrel, S.R., Li, G. (eds) Applications and Techniques in Information Security . ATIS 2022. Communications in Computer and Information Science, vol 1804. Springer, Singapore. https://doi.org/10.1007/978-981-99-2264-2_23
Download citation
DOI: https://doi.org/10.1007/978-981-99-2264-2_23
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2263-5
Online ISBN: 978-981-99-2264-2
eBook Packages: Computer ScienceComputer Science (R0)