Skip to main content

Discovery of Rare Itemsets Using Hyper-Linked Data Structure: A Parallel Approach

  • Conference paper
  • First Online:
Applications and Techniques in Information Security (ATIS 2022)

Abstract

Pattern mining has been more important in the solution of various data mining jobs over the years. The extraction of common patterns was the primary focus of pattern mining research for a long period of time, with the mining of rare patterns being neglected. Rare pattern mining is becoming more popular as researchers recognize the importance of rare patterns. The hyper-linked data structure is suitable to store sparse data set in the main memory and enables dynamic adjustment of links during the mining process using recursion. However, a sequential approach to discovering rare patterns from a large dataset is inefficient. Hence a CUDA-based parallel algorithm has been implemented to discover rare itemsets. The algorithm is tested using dense and sparse datasets on a GPU. The GPU initialization time affects the time taken to discover rare itemsets. The time taken to transfer data between CPU and GPU is significantly large and the parallel implementation of an algorithm with a recursive approach is unsuitable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. “Universal Turing Machine”. https://en.wikipedia.org/wiki/Universal_Turing_machine

  2. Padhy, N., Mishra, P., Panigrahi, R. The survey of data mining applications and feature scope. ArXiv, abs/1211.5723 (2012)

    Google Scholar 

  3. Han, J., Cheng, H., Xin, D., et al.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Disc. 15, 55–86 (2007)

    Article  MathSciNet  Google Scholar 

  4. Borah, A., Nath, B.: Rare pattern mining: challenges and future perspectives. Complex Intel. Syst. 5(1), 1–23 (2018). https://doi.org/10.1007/s40747-018-0085-9

    Article  Google Scholar 

  5. Borgelt, C.: Frequent item set mining. WIREs Data Mining Knowl. Discov. 2, 437–456 (2012). https://doi.org/10.1002/widm.1074

    Article  Google Scholar 

  6. Szathmary, L., Napoli, A., Valtchev, P.: Towards rare itemset mining. In: 2007 19th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2007, vol. 1, pp. 305–312. IEEE (2007)

    Google Scholar 

  7. Liu, B., Hsu, W., Ma, Y.: Mining association rules with multiple minimum supports. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 337–341. ACM (1999)

    Google Scholar 

  8. Adda, M., Wu, L., Feng, Y.: Rare itemset mining. In: 2007 Sixth International Conference on Machine Learning and Applications, ICMLA 2007, pp. 73–80. IEEE (2007)

    Google Scholar 

  9. Cuzzocrea, A., Dayal, U. (eds.): LNCS, vol. 6862. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23544-3

    Book  Google Scholar 

  10. Bhatt, U., Patel, P.: A novel approach for finding rare items based on multiple minimum support framework. Proc. Comput. Sci. 57, 1088–1095 (2015)

    Article  Google Scholar 

  11. Lu, Y., Richter, F., Seidl, T.: Efficient infrequent pattern mining using negative itemset tree. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z.W. (eds.) Complex Pattern Mining. SCI, vol. 880, pp. 1–16. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-36617-9_1

    Chapter  Google Scholar 

  12. Darrab, S., Broneske, D., Saake, G.: RPP algorithm: a method for discovering interesting rare itemsets. In: Tan, Y., Shi, Y., Tuba, M. (eds.) Data Mining and Big Data: 5th International Conference, DMBD 2020, Belgrade, Serbia, July 14–20, 2020, Proceedings, pp. 14–25. Springer Singapore, Singapore (2020). https://doi.org/10.1007/978-981-15-7205-0_2

    Chapter  Google Scholar 

  13. Kanimozhi Selvi, C.S., Tamilarasi, A.: Mining rare itemset with automated support thresholds. J. Comput. Sci. 7(3), 394–399 (2011). https://doi.org/10.3844/jcssp.2011.394.399

    Article  Google Scholar 

  14. Cui, Y., Gan, W., Lin, H., Zheng, W.: FRI-miner: fuzzy rare itemset mining. Appl. Intell. 52, 3387–3402 (2021). https://doi.org/10.1007/s10489-021-02574-1

    Article  Google Scholar 

  15. Jian, L., Wang, C., Liu, Y., et al.: Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA). J. Supercomput. 64, 942–967 (2013)

    Article  Google Scholar 

  16. Adil, S.H., Qamar, S.: Implementation of association rule mining using CUDA. Int. Conf. Emerging Technol. 2009, 332–336 (2009). https://doi.org/10.1109/ICET.2009.5353149

    Article  Google Scholar 

  17. “SPMF Datasets”. http://www.philippe-fournier-viger.com/spmf/index.php?link=datasets.php

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shwetha Rai .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yadavalli, G., Rai, S. (2023). Discovery of Rare Itemsets Using Hyper-Linked Data Structure: A Parallel Approach. In: Prabhu, S., Pokhrel, S.R., Li, G. (eds) Applications and Techniques in Information Security . ATIS 2022. Communications in Computer and Information Science, vol 1804. Springer, Singapore. https://doi.org/10.1007/978-981-99-2264-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-2264-2_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-2263-5

  • Online ISBN: 978-981-99-2264-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics