An efficient algorithm for mining frequent weighted itemsets using interval word segments

Applied Intelligence

Abstract

Mining frequent weighted itemsets (FWIs) from weighted-item transaction databases has recently attracted research interest. In real-world applications, sparse weighted-item transaction databases (SWITDs) are common. For example, a supermarket stocks many items, but each transaction contains only a few of them. In this paper, we propose an interval word segment (IWS) structure to store and process tidsets, enhancing the effectiveness of mining FWIs from SWITDs. The IWS structure allows the intersection of the tidsets of two itemsets to be computed quickly. A map array is proposed for storing a 1-bit index for words. From the map array, 1-bits are mapped to create the tidset of an itemset for faster calculation of the weighted support of itemsets. Experimental results on a number of SWITDs show that the method based on the IWS structure outperforms existing methods.
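
The abstract does not give the details of the IWS structure, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes that a tidset is a bit vector packed into 32-bit words, that only the run of words between the first and last non-zero word is stored (one plausible reading of "interval word segment"), and that the map array is a lookup table from a byte value to the positions of its 1-bits. It also assumes the weighted support definition commonly used in the FWI literature: the sum of the transaction weights of the transactions containing the itemset, divided by the sum of all transaction weights. All names (`Segment`, `make_segment`, `intersect`, `tids_of`, `weighted_support`, `BYTE_MAP`) are hypothetical.

```python
from dataclasses import dataclass
from typing import List

WORD_BITS = 32  # assumed word width; the abstract does not state it

# Assumed form of the "map array": for each byte value, the positions of its 1-bits.
BYTE_MAP = [[b for b in range(8) if (v >> b) & 1] for v in range(256)]


@dataclass
class Segment:
    """Tidset bit vector stored as a run of words beginning at word index `start`."""
    start: int
    words: List[int]


def make_segment(tids):
    """Pack 0-based transaction ids into the smallest covering run of words."""
    if not tids:
        return Segment(0, [])
    lo = min(tids) // WORD_BITS
    hi = max(tids) // WORD_BITS
    words = [0] * (hi - lo + 1)
    for t in tids:
        words[t // WORD_BITS - lo] |= 1 << (t % WORD_BITS)
    return Segment(lo, words)


def intersect(a, b):
    """AND only the words where both intervals overlap, then trim zero words."""
    lo = max(a.start, b.start)
    hi = min(a.start + len(a.words), b.start + len(b.words))  # exclusive bound
    words = [a.words[i - a.start] & b.words[i - b.start] for i in range(lo, hi)]
    first, last = 0, len(words)
    while first < last and words[first] == 0:
        first += 1
    while last > first and words[last - 1] == 0:
        last -= 1
    if first == last:
        return Segment(0, [])  # empty intersection
    return Segment(lo + first, words[first:last])


def tids_of(seg):
    """Recover transaction ids by looking up each byte of each word in BYTE_MAP."""
    out = []
    for i, w in enumerate(seg.words):
        base = (seg.start + i) * WORD_BITS
        for shift in range(0, WORD_BITS, 8):
            for b in BYTE_MAP[(w >> shift) & 0xFF]:
                out.append(base + shift + b)
    return out


def weighted_support(seg, tw):
    """Assumed definition: weights of covering transactions over total weight."""
    return sum(tw[t] for t in tids_of(seg)) / sum(tw)


if __name__ == "__main__":
    a = make_segment([1, 40, 70])    # tidset of a hypothetical itemset A
    b = make_segment([40, 41, 100])  # tidset of a hypothetical itemset B
    ab = intersect(a, b)
    print(tids_of(ab))
    print(weighted_support(ab, [0.5] * 128))
```

In this sketch the intersection touches only the words in which the two intervals overlap, which is where a sparse database would be expected to save work: tidsets whose intervals are disjoint are rejected without a single AND operation.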



Acknowledgments

This research is funded by the Vietnam National Foundation for Science and Technology Development (NAFOSTED) under grant number 102.05-2015.10.

Author information

Corresponding author

Correspondence to Bay Vo.

About this article

Cite this article

Nguyen, H., Vo, B., Nguyen, M. et al. An efficient algorithm for mining frequent weighted itemsets using interval word segments. Appl Intell 45, 1008–1020 (2016). https://doi.org/10.1007/s10489-016-0799-6

  • DOI: https://doi.org/10.1007/s10489-016-0799-6
