Skip to main content
Log in

TSPIN: mining top-k stable periodic patterns

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Discovering periodic patterns consists of identifying all sets of items (values) that periodically co-occur in a discrete sequence. Although traditional periodic pattern mining algorithms have multiple applications, they have two key limitations. First, they consider that a pattern is not periodic if the time difference between two of its successive occurrences is greater than a maxPer threshold. But this constraint is too strict, as a pattern may be discarded based on only two of its occurrences, although it may be usually periodic. Second, traditional algorithms use a constraint that the support (occurrence frequency) of a pattern must be no less than a minSup threshold. But setting that parameter is not intuitive. Hence, it is usually set by trial and error, which is time-consuming. This paper addresses the first limitation by introducing a concept of stability to find periodic patterns that have a stable periodic behavior. Then, the second limitation is addressed by proposing an algorithm named TSPIN (Top-k Stable Periodic pattern mINer) to find the top-k stable periodic patterns, where the user can directly specify the number of patterns k to be found rather than using the minSup threshold. Several experiments have been performed to assess TSPIN’s performance, and it was found that it is efficient and can discover patterns that reveal interesting insights in real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data Availability

The code and datasets will be integrated in the SPMF data mining library (http://www.philippe-fournier-viger.com/spmf) following the article acceptance.

References

  1. Afriyie MK, Nofong VM, Wondoh J, Abdel-Fatao H (2020) Mining non-redundant periodic frequent patterns. In: Proceedings of the 12th Asian conference on intelligent information and database systems. Springer, pp 321–331

  2. Agrawal R, Imielinski T, Swami AN (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 19th ACM SIGMOD international conference on management of data. ACM, pp 207–216

  3. Amphawan K, Lenca P, Surarerks A (2009) Mining top-k periodic-frequent pattern from transactional databases without support threshold. In: Proceedings of the 3rd international conference on advances in information technology, pp 18–29

  4. Amphawan K, Surarerks A, Lenca P (2010) Mining periodic-frequent itemsets with approximate periodicity using interval transaction-ids list tree. In: Proceedings of the 3rd international conference on knowledge discovery and data mining, pp 245–248

  5. Bodon F, Schmidt-Thieme L (2005) The relation of closed itemset mining, complete pruning strategies and item ordering in apriori-based FIM algorithms. In: Proceedings of the 9th European conference on principles and practice of knowledge discovery in databases. ACM, pp 437–444

  6. Chon KW, Hwang SH, Kim MS (2018) GMiner: a fast GPU-based frequent itemset mining method for large-scale data. Inform Sci 1(439):19–38

    Article  MathSciNet  Google Scholar 

  7. Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms. MIT press

  8. Dinh DT, Le B, Fournier-Viger P, Huynh VN (2018) An efficient algorithm for mining periodic high-utility sequential patterns. Appl Intell 48(12):4694–4714

  9. Fong ACM, Zhou B, Hui SC, Hong GY, Do T (2011) Web content recommender system based on consumer behavior modeling. IEEE Trans Consum Electron 57(2):962–969

    Article  Google Scholar 

  10. Fournier-Viger P, Gomariz A, Gueniche T, Soltani A, Wu C, Tseng VS (2014) SPMF: a Java open-source pattern mining library. J Mach Learn Res 15(1):3389–3393

    MATH  Google Scholar 

  11. Fournier-Viger P, Lin JCW, Duong QH, Dam TL, Sevcik L, Uhrin D, Voznak M (2017) PFPM: discovering periodic frequent patterns with novel periodicity measures. In: Proceedings of the 2nd Czech-China scientific conference 2016. IntechOpen

  12. Fournier-Viger P, Lin JCW, Duong QH, Dam TL (2016) PHM: mining periodic high-utility itemsets. In: Proceedings of the industrial conference on data mining, pp 64–79

  13. Fournier-Viger P, Lin JCW, Kiran RU, Koh YS, Thomas R (2017) A survey of sequential pattern mining. Data Sci Pattern Recogn 1(1):54–77

    Google Scholar 

  14. Fournier-Viger P, Lin JCW, Truong-Chi T, Nkambou R (2019) A survey of high utility itemset mining. In: High-utility pattern mining. Springer, Cham, pp 1–45

  15. Fournier-Viger P, Lin JCW, Vo B, Truong TC, Zhang J, Le HB (2017) A survey of itemset mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(4):e1207

  16. Fournier-Viger P, Li Z, Lin JCW, Kiran RU, Fujita H (2018) Discovering periodic patterns common to multiple sequences. In: Proceedings of the 20th international conference on data warehousing and knowledge discovery. Regensburg: Springer, pp 231–246

  17. Fournier-Viger P, Yang P, Lin JCW, Kiran RU (2019) Discovering stable periodic-frequent patterns in transactional data. In: Proceedings of the 32nd international conference on industrial, engineering and other applications of applied intelligent systems. Springer, pp 230–244

  18. Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: Proceedings of the 21st international symposium on methodologies for intelligent systems, pp 83–92

  19. Fournier-Viger P, Yang P, Lin C, Yun U (2019) HUE-Span: fast high utility episode mining. In: Proceedings of the 14th international conference on advanced data mining and applications, pp 169–184

  20. Gama J, Zliobaite I, Bifet A, Pechenizkiy M, Bouchachia H (2014) A survey on concept drift adaptation. J ACM Comput Surv 46(4):1–37

    Article  Google Scholar 

  21. Gouda K, Zaki MJ (2001) Efficiently mining maximal frequent itemsets. In: Proceedings of the 17th IEEE international conference on data mining. ACM, pp 163–170

  22. Grahne G, Zhu J (2005) Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans Knowl Data Eng 17(10):1347–1362

    Article  Google Scholar 

  23. Han J, Pei J, Yin Y, Mao R (2000) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Mining Knowl Discov 8(1):53–87

    Article  MathSciNet  Google Scholar 

  24. Huang K, Chang C (2008) Efficient mining of frequent episodes from complex sequences. Inf Syst 33:96–114

    Article  Google Scholar 

  25. Huang Y, Hsu CL, Tseng VS (2020) PURL: periodic user representation learning from temporal event records for personalized health management. In: Proceedings of the 7th IEEE international conference on big data and smart computing. IEEE, pp 358–365

  26. Islam MA, Acharjee UK (2020) Mining periodic patterns and accuracy calculation for activity monitoring using RF tag arrays. In: Proceedings of the international joint conference on computational intelligence. Springer, pp 85–95

  27. Kiran RU, Kitsuregawa M, Reddy PK (2016) Efficient discovery of periodic-frequent patterns in very large databases. J Syst Softw 112:110–121

    Article  Google Scholar 

  28. Kiran RU, Reddy PK (2010) Mining rare periodic-frequent patterns using multiple minimum supports. In: Proceedings of the 15th international conference on management of data, pp 7–8

  29. Kiran RU, Saideep C, Zettsu K, Toyoda M, Kitsuregawa M, Reddy PK (2019) Discovering partial periodic spatial patterns in spatiotemporal databases. In: Proceedings of the 2019 IEEE international conference on big data. IEEE, pp 233–238

  30. Kiran RU, Venkatesh JN, Fournier-Viger P, Toyoda M, Reddy PK, Kitsuregawa M (2017) Discovering periodic patterns in non-uniform temporal databases. In: Proceedings of the 21th Pacific-Asia conference on knowledge discovery and data mining, vol 2, pp 604–617

  31. Koh YS, Ravana SD (2016) Unsupervised rare pattern mining: a survey. ACM Trans Knowl Discov Data 10(4):45

    Article  Google Scholar 

  32. Kumar V, Valli Kumari V (2013) Incremental mining for regular frequent patterns in vertical format. Int J Eng Tech 5(2):1506–1511

    Google Scholar 

  33. Li H, Hai M, Zhang N, Zhu J, Wang Y, Cao H (2019) Probabilistic maximal frequent itemset mining methods over uncertain databases. Intell Data Analy 23(6):1219–1241

    Article  Google Scholar 

  34. Luna JM, Fournier-Viger P, Ventura S (2019) Frequent itemset mining: a 25 years review. WIREs Data Mining Knowl Discov Wiley 9(6):e1329

    Google Scholar 

  35. Manku GS (2016) Frequent itemset mining over data streams. In: Data stream management. Springer, Berlin, pp 209–219

  36. Mannila H, Toivonen H, Verkamo AI (1995) Discovering frequent episodes in sequences. In: Proceedings of the first international conference on knowledge discovery and data mining, pp 210–215

  37. Muthukrishnan S, Berg EVD, Wu Y (2007) Sequential change detection on data streams. In: Proceedings of the 7th IEEE intern. conf. on data mining workshops, pp 551–550

  38. Nofong VM (2015) Discovering productive periodic frequent patterns in transactional databases. In: Proceedings of the second international conference on data science, pp 141–150

  39. Nofong VM (2018) Fast and memory efficient mining of periodic frequent patterns. In: Proceedings of the 10th Asian conference onmodern approaches for intelligent information and database systems, pp 223–232

  40. Pasquier N, Bastide Y, Taouil R, Lakhal L (1999) Discovering frequent closed itemsets for association rules. In: Proceedings of the 7th international conference on database theory. ACM, pp 398–416

  41. Rashid MM, Gondal I, Kamruzzaman J (2013) Regularly frequent patterns mining from sensor data stream. In: Proceedings of the 20th international conference on neural information processing, pp 417–424

  42. Rashid MM, Karim MR, Jeong BS, Choi HJ (2012) Efficient mining regularly frequent patterns in transactional databases. In: Proceedings of the 17th international conference on database systems for advanced applications, pp 258–271

  43. Surana A, Kiran RU, Reddy PK (2012) An efficient approach to mine periodic-frequent patterns in transactional databases. In: Proceedings of the 16th Pacific-Asia conference on knowledge discovery and data mining, pp 254–266

  44. Tanbeer SK, Ahmed CF, Jeong BS, Lee YK (2009) Discovering periodic-frequent patterns in transactional databases. In: Proceedings of the 13rd Pacific-Asia conference on knowledge discovery and data mining, pp 242–253

  45. Tong YX, Chen L, She J (2015) Mining frequent itemsets in correlated uncertain databases. J Comput Sci Technol 30(4):696–712

    Article  MathSciNet  Google Scholar 

  46. Truong-Chi T, Fournier-Viger P (2019) A survey of high utility sequential pattern mining. In: High-utility pattern mining. Springer, Cham, pp 97–129

  47. Wong MH, Tseng VS, Tseng JC, Liu SW, Tsai CH (2017) Long-term user location prediction using deep learning and periodic pattern mining. In: Proceedings of the 12th International conference on advanced data mining and applications, pp 582–594

  48. Yun U, Kim D, Yoon E, Fujita H (2018) Damped window based high average utility pattern mining over data streams. Knowl-Based Syst 144:188–205

    Article  Google Scholar 

  49. Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Industr Electron 64(9):7239–7249

    Article  Google Scholar 

  50. Zaki MJ, Gouda K (2003) Fast vertical mining using diffsets. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 326–335

  51. Zhang R, Chen W, Hsu TC, Yang H, Chung YC (2019) ANG: a combination of Apriori and graph computing techniques for frequent itemsets mining. J Supercomput 6(2):646–61

    Article  Google Scholar 

  52. Zhang D, Lee K, Lee I (2019) Mining hierarchical semantic periodic patterns from GPS-collected spatio-temporal trajectories. Exp Syst Applic 122:85–101

    Article  Google Scholar 

Download references

Acknowledgements

This study was partly funded by the National Natural Science Foundation of China and the Harbin Institute of Technology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Philippe Fournier-Viger.

Ethics declarations

Conflict of interests

The authors declare that they have no conflict of interest and competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Special issue on Artificial intelligence in practice - from theory to application

Guest Editors: Franz Wotawa, Gerhard Friedrich and Ingo Pill

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fournier-Viger, P., Wang, Y., Yang, P. et al. TSPIN: mining top-k stable periodic patterns. Appl Intell 52, 6917–6938 (2022). https://doi.org/10.1007/s10489-020-02181-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-02181-6

Keywords

Navigation