Abstract
Discovering periodic patterns consists of identifying all sets of items (values) that periodically co-occur in a discrete sequence. Although traditional periodic pattern mining algorithms have multiple applications, they have two key limitations. First, they consider that a pattern is not periodic if the time difference between two of its successive occurrences is greater than a maxPer threshold. But this constraint is too strict, as a pattern may be discarded based on only two of its occurrences, although it may be usually periodic. Second, traditional algorithms use a constraint that the support (occurrence frequency) of a pattern must be no less than a minSup threshold. But setting that parameter is not intuitive. Hence, it is usually set by trial and error, which is time-consuming. This paper addresses the first limitation by introducing a concept of stability to find periodic patterns that have a stable periodic behavior. Then, the second limitation is addressed by proposing an algorithm named TSPIN (Top-k Stable Periodic pattern mINer) to find the top-k stable periodic patterns, where the user can directly specify the number of patterns k to be found rather than using the minSup threshold. Several experiments have been performed to assess TSPIN’s performance, and it was found that it is efficient and can discover patterns that reveal interesting insights in real data.
Similar content being viewed by others
Data Availability
The code and datasets will be integrated in the SPMF data mining library (http://www.philippe-fournier-viger.com/spmf) following the article acceptance.
References
Afriyie MK, Nofong VM, Wondoh J, Abdel-Fatao H (2020) Mining non-redundant periodic frequent patterns. In: Proceedings of the 12th Asian conference on intelligent information and database systems. Springer, pp 321–331
Agrawal R, Imielinski T, Swami AN (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 19th ACM SIGMOD international conference on management of data. ACM, pp 207–216
Amphawan K, Lenca P, Surarerks A (2009) Mining top-k periodic-frequent pattern from transactional databases without support threshold. In: Proceedings of the 3rd international conference on advances in information technology, pp 18–29
Amphawan K, Surarerks A, Lenca P (2010) Mining periodic-frequent itemsets with approximate periodicity using interval transaction-ids list tree. In: Proceedings of the 3rd international conference on knowledge discovery and data mining, pp 245–248
Bodon F, Schmidt-Thieme L (2005) The relation of closed itemset mining, complete pruning strategies and item ordering in apriori-based FIM algorithms. In: Proceedings of the 9th European conference on principles and practice of knowledge discovery in databases. ACM, pp 437–444
Chon KW, Hwang SH, Kim MS (2018) GMiner: a fast GPU-based frequent itemset mining method for large-scale data. Inform Sci 1(439):19–38
Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms. MIT press
Dinh DT, Le B, Fournier-Viger P, Huynh VN (2018) An efficient algorithm for mining periodic high-utility sequential patterns. Appl Intell 48(12):4694–4714
Fong ACM, Zhou B, Hui SC, Hong GY, Do T (2011) Web content recommender system based on consumer behavior modeling. IEEE Trans Consum Electron 57(2):962–969
Fournier-Viger P, Gomariz A, Gueniche T, Soltani A, Wu C, Tseng VS (2014) SPMF: a Java open-source pattern mining library. J Mach Learn Res 15(1):3389–3393
Fournier-Viger P, Lin JCW, Duong QH, Dam TL, Sevcik L, Uhrin D, Voznak M (2017) PFPM: discovering periodic frequent patterns with novel periodicity measures. In: Proceedings of the 2nd Czech-China scientific conference 2016. IntechOpen
Fournier-Viger P, Lin JCW, Duong QH, Dam TL (2016) PHM: mining periodic high-utility itemsets. In: Proceedings of the industrial conference on data mining, pp 64–79
Fournier-Viger P, Lin JCW, Kiran RU, Koh YS, Thomas R (2017) A survey of sequential pattern mining. Data Sci Pattern Recogn 1(1):54–77
Fournier-Viger P, Lin JCW, Truong-Chi T, Nkambou R (2019) A survey of high utility itemset mining. In: High-utility pattern mining. Springer, Cham, pp 1–45
Fournier-Viger P, Lin JCW, Vo B, Truong TC, Zhang J, Le HB (2017) A survey of itemset mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(4):e1207
Fournier-Viger P, Li Z, Lin JCW, Kiran RU, Fujita H (2018) Discovering periodic patterns common to multiple sequences. In: Proceedings of the 20th international conference on data warehousing and knowledge discovery. Regensburg: Springer, pp 231–246
Fournier-Viger P, Yang P, Lin JCW, Kiran RU (2019) Discovering stable periodic-frequent patterns in transactional data. In: Proceedings of the 32nd international conference on industrial, engineering and other applications of applied intelligent systems. Springer, pp 230–244
Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: Proceedings of the 21st international symposium on methodologies for intelligent systems, pp 83–92
Fournier-Viger P, Yang P, Lin C, Yun U (2019) HUE-Span: fast high utility episode mining. In: Proceedings of the 14th international conference on advanced data mining and applications, pp 169–184
Gama J, Zliobaite I, Bifet A, Pechenizkiy M, Bouchachia H (2014) A survey on concept drift adaptation. J ACM Comput Surv 46(4):1–37
Gouda K, Zaki MJ (2001) Efficiently mining maximal frequent itemsets. In: Proceedings of the 17th IEEE international conference on data mining. ACM, pp 163–170
Grahne G, Zhu J (2005) Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans Knowl Data Eng 17(10):1347–1362
Han J, Pei J, Yin Y, Mao R (2000) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Mining Knowl Discov 8(1):53–87
Huang K, Chang C (2008) Efficient mining of frequent episodes from complex sequences. Inf Syst 33:96–114
Huang Y, Hsu CL, Tseng VS (2020) PURL: periodic user representation learning from temporal event records for personalized health management. In: Proceedings of the 7th IEEE international conference on big data and smart computing. IEEE, pp 358–365
Islam MA, Acharjee UK (2020) Mining periodic patterns and accuracy calculation for activity monitoring using RF tag arrays. In: Proceedings of the international joint conference on computational intelligence. Springer, pp 85–95
Kiran RU, Kitsuregawa M, Reddy PK (2016) Efficient discovery of periodic-frequent patterns in very large databases. J Syst Softw 112:110–121
Kiran RU, Reddy PK (2010) Mining rare periodic-frequent patterns using multiple minimum supports. In: Proceedings of the 15th international conference on management of data, pp 7–8
Kiran RU, Saideep C, Zettsu K, Toyoda M, Kitsuregawa M, Reddy PK (2019) Discovering partial periodic spatial patterns in spatiotemporal databases. In: Proceedings of the 2019 IEEE international conference on big data. IEEE, pp 233–238
Kiran RU, Venkatesh JN, Fournier-Viger P, Toyoda M, Reddy PK, Kitsuregawa M (2017) Discovering periodic patterns in non-uniform temporal databases. In: Proceedings of the 21th Pacific-Asia conference on knowledge discovery and data mining, vol 2, pp 604–617
Koh YS, Ravana SD (2016) Unsupervised rare pattern mining: a survey. ACM Trans Knowl Discov Data 10(4):45
Kumar V, Valli Kumari V (2013) Incremental mining for regular frequent patterns in vertical format. Int J Eng Tech 5(2):1506–1511
Li H, Hai M, Zhang N, Zhu J, Wang Y, Cao H (2019) Probabilistic maximal frequent itemset mining methods over uncertain databases. Intell Data Analy 23(6):1219–1241
Luna JM, Fournier-Viger P, Ventura S (2019) Frequent itemset mining: a 25 years review. WIREs Data Mining Knowl Discov Wiley 9(6):e1329
Manku GS (2016) Frequent itemset mining over data streams. In: Data stream management. Springer, Berlin, pp 209–219
Mannila H, Toivonen H, Verkamo AI (1995) Discovering frequent episodes in sequences. In: Proceedings of the first international conference on knowledge discovery and data mining, pp 210–215
Muthukrishnan S, Berg EVD, Wu Y (2007) Sequential change detection on data streams. In: Proceedings of the 7th IEEE intern. conf. on data mining workshops, pp 551–550
Nofong VM (2015) Discovering productive periodic frequent patterns in transactional databases. In: Proceedings of the second international conference on data science, pp 141–150
Nofong VM (2018) Fast and memory efficient mining of periodic frequent patterns. In: Proceedings of the 10th Asian conference onmodern approaches for intelligent information and database systems, pp 223–232
Pasquier N, Bastide Y, Taouil R, Lakhal L (1999) Discovering frequent closed itemsets for association rules. In: Proceedings of the 7th international conference on database theory. ACM, pp 398–416
Rashid MM, Gondal I, Kamruzzaman J (2013) Regularly frequent patterns mining from sensor data stream. In: Proceedings of the 20th international conference on neural information processing, pp 417–424
Rashid MM, Karim MR, Jeong BS, Choi HJ (2012) Efficient mining regularly frequent patterns in transactional databases. In: Proceedings of the 17th international conference on database systems for advanced applications, pp 258–271
Surana A, Kiran RU, Reddy PK (2012) An efficient approach to mine periodic-frequent patterns in transactional databases. In: Proceedings of the 16th Pacific-Asia conference on knowledge discovery and data mining, pp 254–266
Tanbeer SK, Ahmed CF, Jeong BS, Lee YK (2009) Discovering periodic-frequent patterns in transactional databases. In: Proceedings of the 13rd Pacific-Asia conference on knowledge discovery and data mining, pp 242–253
Tong YX, Chen L, She J (2015) Mining frequent itemsets in correlated uncertain databases. J Comput Sci Technol 30(4):696–712
Truong-Chi T, Fournier-Viger P (2019) A survey of high utility sequential pattern mining. In: High-utility pattern mining. Springer, Cham, pp 97–129
Wong MH, Tseng VS, Tseng JC, Liu SW, Tsai CH (2017) Long-term user location prediction using deep learning and periodic pattern mining. In: Proceedings of the 12th International conference on advanced data mining and applications, pp 582–594
Yun U, Kim D, Yoon E, Fujita H (2018) Damped window based high average utility pattern mining over data streams. Knowl-Based Syst 144:188–205
Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Industr Electron 64(9):7239–7249
Zaki MJ, Gouda K (2003) Fast vertical mining using diffsets. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 326–335
Zhang R, Chen W, Hsu TC, Yang H, Chung YC (2019) ANG: a combination of Apriori and graph computing techniques for frequent itemsets mining. J Supercomput 6(2):646–61
Zhang D, Lee K, Lee I (2019) Mining hierarchical semantic periodic patterns from GPS-collected spatio-temporal trajectories. Exp Syst Applic 122:85–101
Acknowledgements
This study was partly funded by the National Natural Science Foundation of China and the Harbin Institute of Technology.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
The authors declare that they have no conflict of interest and competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special issue on Artificial intelligence in practice - from theory to application
Guest Editors: Franz Wotawa, Gerhard Friedrich and Ingo Pill
Rights and permissions
About this article
Cite this article
Fournier-Viger, P., Wang, Y., Yang, P. et al. TSPIN: mining top-k stable periodic patterns. Appl Intell 52, 6917–6938 (2022). https://doi.org/10.1007/s10489-020-02181-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-020-02181-6