skip to main content
research-article

Co-occurrence Order-preserving Pattern Mining with Keypoint Alignment for Time Series

Published: 12 June 2024 Publication History

Abstract

Recently, order-preserving pattern (OPP) mining has been proposed to discover some patterns, which can be seen as trend changes in time series. Although existing OPP mining algorithms have achieved satisfactory performance, they discover all frequent patterns. However, in some cases, users focus on a particular trend and its associated trends. To efficiently discover trend information related to a specific prefix pattern, this article addresses the issue of co-occurrence OPP mining (COP) and proposes an algorithm named COP-Miner to discover COPs from historical time series. COP-Miner consists of three parts: extracting keypoints, preparation stage, and iteratively calculating supports and mining frequent COPs. Extracting keypoints is used to obtain local extreme points of patterns and time series. The preparation stage is designed to prepare for the first round of mining, which contains four steps: obtaining the suffix OPP of the keypoint sub-time series, calculating the occurrences of the suffix OPP, verifying the occurrences of the keypoint sub-time series, and calculating the occurrences of all fusion patterns of the keypoint sub-time series. To further improve the efficiency of support calculation, we propose a support calculation method with an ending strategy that uses the occurrences of prefix and suffix patterns to calculate the occurrences of superpatterns. Experimental results indicate that COP-Miner outperforms the other competing algorithms in running time and scalability. Moreover, COPs with keypoint alignment yield better prediction performance.

References

[1]
Daichi Amagata and Takahiro Hara. 2019. Mining top-k co-occurrence patterns across multiple streams. IEEE Trans. Knowl. Data Eng. 29, 10 (2019), 2249–2262.
[2]
Xiang Ao, Haoran Shi, Jin Wang, Luo Zuo, Hongwei Li, and Qing He. 2019. Large-scale frequent episode mining from complex event sequences with hierarchies. ACM Trans. Intell. Syst. Technol. 10, 4 (2019), 36.
[3]
Bing Bai, Guiling Li, Senzhang Wang, Zongda Wu, and Wenhe Yan. 2021. Time series classification based on multi-feature dictionary representation and ensemble learning. Expert Syst. Appl. 169 (2021), 114162.
[4]
G. E. P. Box and David A. Pierce. 1970. Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. J. Am. Stat. Assoc. 65, 332 (1970), 1509–1526.
[5]
Kaushik Chakrabarti, Eamonn Keogh, Sharad Mehrotra, and Michael Pazzani. 2002. Locally adaptive dimensionality reduction for indexing large time series databases. ACM Trans. Datab. Syst. 27, 2 (2002), 188–228.
[6]
Shuhui Cheng, Youxi Wu, Yan Li, Fang Yao, and Fan Min. 2021. TWD-SFNN: Three-way decisions with a single hidden layer feedforward neural network. Inf. Sci. 579 (2021), 15–32.
[7]
Chenglong Dai, Jia Wu, Dechang Pi, Stefanie I. Becker, Lin Cui, Qin Zhang, and Blake Johnson. 2022. Brain EEG time-series clustering using maximum-weight clique. IEEE Trans. Cybernet. 52, 1 (2022), 357–371.
[8]
Xiangjun Dong, Ping Qiu, Jinhu Lü, Longbing Cao, and Tiantian Xu. 2019. Mining top-k useful negative sequential patterns via learning. IEEE Trans. Neural Netw. Learn. Syst. 30, 9 (2019), 2764–2778.
[9]
Philippe Fournier-Viger, Wensheng Gan, Youxi Wu, Mourad Nouioua, Wei Song, Tin C. Truong, and Hai Van Duong. 2022. Pattern mining: Current challenges and opportunities. In Proceedings of the Database Systems for Advanced Applications (DASFAA’22), 34–49.
[10]
Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao, and Philip S. Yu. 2019. A survey of parallel sequential pattern mining. ACM Trans. Knowl. Discov. Data 13, 3 (2019), 25.
[11]
Chao Gao, Lei Duan, Guozhu Dong, Haiqing Zhang, Hao Yang, and Changjie Tang. 2016. Mining top-k distinguishing sequential patterns with flexible gap constraints. In International Conference on Web-Age Information Management. Springer, 82–94.
[12]
Shaghayegh Gharghabi, Shima Imani, Anthony Bagnall, Amirali Darvishzadeh, and Eamonn Keogh. 2018. Matrix profile XII: MPdist: A novel time series distance measure to allow data mining in more challenging scenarios. In Proceedings of the IEEE International Conference on Data Mining. IEEE, 965–970.
[13]
Shaghayegh Gharghabi, Shima Imani, Anthony Bagnall, Amirali Darvishzadeh, and Eamonn Keogh. 2020. An ultra-fast time series distance measure to allow data mining in more complex real-world deployments. Data Min. Knowl. Discov. 34, 4 (2020), 1104–1135.
[14]
Dan Guo, Xuegang Hu, Fei Xie, and Xindong Wu. 2013. Pattern matching with wildcards and gap-length constraints based on a centrality-degree graph. Appl. Intell. 39, 1 (2013), 57–74.
[15]
Dan Guo, Ermao Yuan, Xuegang Hu, and Xindong Wu. 2018. Co-occurrence pattern mining based on a biological approximation scoring matrix. Pattern Anal. Appl. 21, 4 (2018), 977–996.
[16]
Nan Han, Shaojie Qiao, Kun Yue, Jianbin Huang, Tingting Tang Qiang He, Faliang Huang, Chunlin He, and Chang-An Yuan. 2022. Algorithms for trajectory points clustering in location-based social networks. ACM Trans. Intell. Syst. Technol. 13, 3 (2022), 43.
[17]
Zengyou He, Ziyao Wu, Guangyao Xu, Yan Liu, and Quan Zou. 2023. Decision tree for sequences. IEEE Trans. Knowl. Data Eng. 35, 1 (2023), 251–263.
[18]
Gengsen Huang, Wensheng Gan, Jian Weng, and Philip S. Yu. 2023. US-rule: Discovering utility-driven sequential rules. ACM Trans. Knowl. Discov. Data 17, 1 (2023), 10.
[19]
Eamonn Keogh and Chotirat Ann Ratanamahatana. 2005. Exact indexing of dynamic time warping. Knowl. Inf. Syst. 7 (2005), 358–386.
[20]
Jinil Kim, Peter Eades, Rudolf Fleischer, Seok-Hee Hong, Costas S. Iliopoulos, Kunsoo Park, Simon J. Puglisi, and Takeshi Tokuyama. 2014. Order-preserving matching. Theor. Comput. Sci. 525 (2014), 68–79.
[21]
Youngho Kim, Munseong Kang, Joong Chae Na, and Jeong Seop Sim. 2023. Order-preserving pattern matching with scaling. Inform. Process. Lett. 180 (2023), 106333.
[22]
Abdelmadjid Lahreche and Bachir Boucheham. 2021. A fast and accurate similarity measure for long time series classification based on local extrema and dynamic time warping. Expert Syst. Appl.168 (2021), 114374.
[23]
Wu Lee, Yuliang Shi, Hongfeng Sun, Lin Cheng, Kun Zhang, Xinjun Wang, and Zhiyong Chen. 2022. MSIPA: Multi-scale interval pattern-aware network for ICU transfer prediction. ACM Trans. Knowl. Discov. Data 16, 1 (2022), 1–17.
[24]
Chun Li, Qingyan Yang, Jianyong Wang, and Ming Li. 2012. Efficient mining of gap-constrained subsequences and its various applications. ACM Trans. Knowl. Discov. Data 6, 1 (2012), 2.
[25]
Qing Li, Jinghua Tan, Jun Wang, and Hsinchun Chen. 2021. A multimodal event-driven LSTM model for stock prediction using online news. IEEE Trans. Knowl. Data Eng. 33, 10 (2021), 3323–3337.
[26]
Qingzhe Li, Liang Zhao, Yi-Ching Lee, and Jessica Lin. 2020. Contrast pattern mining in paired multivariate time series of a controlled driving behavior experiment. ACM Trans. Spatial Algor. Syst. 6, 4 (2020), 25.
[27]
Yangfan Li, Kenli Li, Cen Chen, Xu Zhou, Zeng Zeng, and Keqin Li. 2022. Modeling temporal patterns with dilated convolutions for time-series forecasting. ACM Trans. Knowl. Discov. Data 16, 1 (2022), 1–22.
[28]
Yan Li, Lei Yu, Jing Liu, Lei Guo, Youxi Wu, and Xindong Wu. 2022. NetDPO: (Delta, gamma)-approximate pattern matching with gap constraints under one-off condition. Appl. Intell. 52, 11 (2022), 12155–12174.
[29]
Yan Li, Chang Zhang, Jie Li, Wei Song, Zhenlian Qi, Youxi Wu, and Xindong Wu. 2023. MCoR-Miner: Maximal co-occurrence nonoverlapping sequential rule mining. IEEE Trans. Knowl. Data Eng. 35, 9 (2023), 9531–9546.
[30]
Yan Li, Shuai Zhang, Lei Guo, Jing Liu, Youxi Wu, and Xindong Wu. 2022. NetNMSP: Nonoverlapping maximal sequential pattern mining. Appl. Intell. 52, 9 (2022), 9861–9884.
[31]
Zhenhui Li, Jiawei Han, Ming Ji, Lu-An Tang, Yintao Yu, Bolin Ding, Jae-Gil Lee, and Roland Kays. 2011. MoveMine: Mining moving object data for discovery of animal movement patterns. ACM Trans. Intell. Syst. Technol. 2, 4 (2011), 37.
[32]
Jessica Lin, Eamonn Keogh, Li Wei, and Stefano Lonardi. 2007. Experiencing SAX: A novel symbolic representation of time series. Data Min. Knowl. Discov. 15 (2007), 107–144.
[33]
Qi Lin, Wensheng Gan, Yongdong Wu, Jiahui Chen, and Chien-Ming Chen. 2022. Smart system: Joint utility and frequency for pattern classification. ACM Trans. Manage. Inf. Syst. 13, 4 (2022), 43.
[34]
Kunpeng Liu, Yanjie Fu, Le Wu, Xiaolin Li, Charu Aggarwal, and Hui Xiong. 2023. Automated feature selection: A reinforcement learning perspective. IEEE Trans. Knowl. Data Eng. 35, 3 (2023), 2272–2284.
[35]
Pengfei Ma, Youxi Wu, Yan Li, Lei Guo, He Jiang, Xingquan Zhu, and Xindong Wu. 2022. HW-Forest: Deep forest with hashing screening and window screening. ACM Trans. Knowl. Discov. Data 16, 6 (2022), 123.
[36]
Ioannis Mavroudopoulos and Anastasios Gounaris. 2023. SIESTA: A scalable infrastructure of sequential pattern analysis. IEEE Trans. Big Data 9, 3 (2023), 975–990.
[37]
Fan Min, Zhi-Heng Zhang, Wen jie Zhai, and Rong-Ping Shen. 2020. Frequent pattern discovery with tri-partition alphabets. Inf. Sci. 507 (2020), 715–732.
[38]
Saqib Nawaz, Philippe Fournier-Viger, M. Zohaib Nawaz, Guoting Chen, and Youxi Wu. 2022. MalSPM: Metamorphic malware behavior analysis and classification using sequential pattern mining. Comput. Secur. 118 (2022), 102741.
[39]
Saqib Nawaz, Philippe Fournier-Viger, Unil Yun, Youxi Wu, and Wei Song. 2022. Mining high utility itemsets with hill climbing and simulated annealing. ACM Trans. Manage. Inf. Syst. 13, 1 (2022), 4.
[40]
Filipe Rodrigues, Loulia Markou, and Francisco C. Pereira. 2019. Combining time-series and textual data for taxi demand prediction in event areas: A deep learning approach. Inf. Fusion 49 (2019), 120–129.
[41]
Marco Storace and Oscar De Feo. 2004. Piecewise-linear approximation of nonlinear dynamical systems. IEEE Trans. Circ. Syst. I: Regul. Pap. 51, 4 (2004), 830–842.
[42]
Tin Truong, Hai Duong, B. Le, and Philippe Fournier-Viger. 2019. FMaxCloHUSM: An efficient algorithm for mining frequent closed and maximal high utility sequences. Eng. Appl. Artif. Intell. 85 (2019), 1–20.
[43]
Bay Vo, Sang Pham, Tuong Le, and Zhi-Hong Deng. 2017. A novel approach for mining maximal frequent patterns. Expert Syst. Appl. 73 (2017), 178–186.
[44]
Lizhen Wang, Xuguang Bao, and Lihua Zhou. 2018. Redundancy reduction for prevalent co-location patterns. IEEE Trans. Knowl. Data Eng. 30, 1 (2018), 142–155.
[45]
Lizhen Wang, Yuan Fang, and Lihua Zhou. 2022. Preference-based spatial co-location pattern mining. Springer Singapore. DOI:
[46]
Wei Wang, Jing Tian, Fang Lv, Guodong Xin, Yingfan Ma, and Bailing Wang. 2021. Mining frequent pyramid patterns from time series transaction data with custom constraints. Comput. Secur. 100 (2021), 102088.
[47]
Yuehua Wang, Youxi Wu, Yan Li, Fang Yao, Philippe Fournier-Viger, and Xindong Wu. 2022. Self-adaptive nonoverlapping sequential pattern mining. Appl. Intell. 52, 6 (2022), 6646–6661.
[48]
Xindong Wu, Xingquan Zhu, and Minghui Wu. 2022. The evolution of search: Three computing paradigms. ACM Trans. Manage. Inf. Syst. 13, 2 (2022), 20.
[49]
Youxi Wu, Mingjie Chen, Yan Li, Jing Liu, Zhao Li, Jinyan Li, and Xindong Wu. 2023. ONP-Miner: One-off negative sequential pattern mining. ACM Trans. Knowl. Discov. Data 17, 3 (2023), 37.
[50]
Youxi Wu, Shuhui Cheng, Yan Li, Rongjie Lv, and Fan Min. 2023. STWD-SFNN: Sequential three-way decisions with a single hidden layer feedforward neural network. Inf. Sci. 632 (2023), 299–323.
[51]
Youxi Wu, Meng Geng, Yan Li, Lei Guo, Zhao Li, Philippe Fournier-Viger, Xingquan Zhu, and Xindong Wu. 2021. HANP-Miner: High average utility nonoverlapping sequential pattern mining. Knowl.-Bas. Syst. 229 (2021), 107361.
[52]
Youxi Wu, Qian Hu, Yan Li, Lei Guo, Xingquan Zhu, and Xindong Wu. 2023. OPP-Miner: Order-preserving sequential pattern mining for time series. IEEE Trans. Cybernet. 53, 5 (2023), 3288–3300.
[53]
Youxi Wu, Lanfang Luo, Yan Li, Lei Guo, Philippe Fournier-Viger, Xingquan Zhu, and Xindong Wu. 2022. NTP-Miner: Nonoverlapping three-way sequential pattern mining. ACM Trans. Knowl. Discov. Data 16, 3 (2022), 51.
[54]
Youxi Wu, Yufei Meng, Yan Li, Lei Guo, Xingquan Zhu, Philippe Fournier-Viger, and Xindong Wu. 2023. COPP-Miner: Top-k contrast order-preserving pattern mining for time series classification. IEEE Trans. Knowl. Data Eng. (2023). DOI:
[55]
Youxi Wu, Yao Tong, Xingquan Zhu, and Xindong Wu. 2018. NOSEP: Nonoverlapping sequence pattern mining with gap constraints. IEEE Trans. Cyber. 48, 10 (2018), 2809–2822.
[56]
Youxi Wu, Lingling Wang, Jiadong Ren, Wei Ding, and Xindong Wu. 2014. Mining sequential patterns with periodic wildcard gaps. App. Int. 41, 1 (2014), 99–116.
[57]
Youxi Wu, Xiaohui Wang, Yan Li, Lei Guo, Zhao Li, Ji Zhang, and Xindong Wu. 2022. OWSP-Miner: Self-adaptive one-off weak-gap strong pattern mining. ACM Trans. Manag. Inf. Sys. 13, 3 (2022), 25.
[58]
Youxi Wu, Yuehua Wang, Yan Li, Xingquan Zhu, and Xindong Wu. 2022. Top-k self-adaptive contrast sequential pattern mining. IEEE Trans. Cyber. 52, 11 (2022), 11819–11833.
[59]
Youxi Wu, Xiaoqian Zhao, Yan Li, Lei Guo, Xingquan Zhu, Philippe Fournier-Viger, and Xindong Wu. 2023. OPR-Miner: Order-preserving rule mining for time series. IEEE Trans. Know. Dat. Eng. 35, 11 (2023), 11722–11735.
[60]
Youxi Wu, Changrui Zhu, Yan Li, Lei Guo, and Xindong Wu. 2020. NetNCSP: Nonoverlapping closed sequential pattern mining. Know.-Bas. Sys. 196 (2020), 105812.
[61]
Fei Xie, Xindong Wu, and Xingquan Zhu. 2017. Efficient sequential pattern mining with wildcards for keyphrase extraction. Know.-Bas. Sys. 115 (2017), 27–39.
[62]
Surong Yan, Kwei-Jay Lin, Xiaolin Zheng, and Wenyu Zhang. 2020. Using latent knowledge to improve real-time activity recognition for smart IoT. IEEE Trans. Know. Dat. Eng. 32, 3 (2020), 574–587.
[63]
Jinyoung Yeo, Seungwon Hwang, Sungchul Kim, Eunyee Koh, and Nedim Lipka. 2020. Conversion prediction from clickstream: Modeling market prediction and customer predictability. IEEE Trans. Know. Dat. Eng. 32, 2 (2020), 246–259.
[64]
Jianming Zhan, Jin Ye, Weiping Ding, and Peide Liu. 2022. A novel three-way decision model based on utility theory in incomplete fuzzy decision systems. IEEE Trans. Fuzz. Sys. 30, 7 (2022), 2210–2226.
[65]
Shuainan Zhang, Yafeng He, Xuzhao Li, Wude Yang, and Ying Zhou. 2021. Biolabel-led research pattern positions the effects and mechanisms of Sophorae Tonkinensis radix et rhizome on lung diseases: A novel strategy for computer-aided herbal medicine research based on omics and bioinformatics. Comp. Bio. Med. 136 (2021), 104769.
[66]
Branislav Đurian, Jan Holub, Hannu Peltola, and Jorma Tarhio. 2010. Improving practical exact string matching. Inform. Process. Lett. 110, 4 (2010), 148–152.

Cited By

View all
  • (2025) TK-RNSP: Efficient Top- Repetitive Negative Sequential Pattern mining Information Processing & Management10.1016/j.ipm.2025.10407762:3(104077)Online publication date: May-2025
  • (2025)Co-location pattern mining using approximate Euclidean measureInformation Sciences10.1016/j.ins.2025.122000706(122000)Online publication date: Jul-2025
  • (2024)Scalable Order-Preserving Pattern Mining2024 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM59182.2024.00028(211-220)Online publication date: 9-Dec-2024
  • Show More Cited By

Index Terms

  1. Co-occurrence Order-preserving Pattern Mining with Keypoint Alignment for Time Series

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Management Information Systems
    ACM Transactions on Management Information Systems  Volume 15, Issue 2
    June 2024
    102 pages
    EISSN:2158-6578
    DOI:10.1145/3613621
    • Editor:
    • Heng Xu
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 June 2024
    Online AM: 13 April 2024
    Accepted: 30 March 2024
    Revised: 20 October 2023
    Received: 29 April 2023
    Published in TMIS Volume 15, Issue 2

    Check for updates

    Author Tags

    1. Pattern mining
    2. time series
    3. keypoint alignment
    4. order-preserving
    5. co-occurrence pattern

    Qualifiers

    • Research-article

    Funding Sources

    • Hebei Social Science Foundation Project

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)256
    • Downloads (Last 6 weeks)12
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025) TK-RNSP: Efficient Top- Repetitive Negative Sequential Pattern mining Information Processing & Management10.1016/j.ipm.2025.10407762:3(104077)Online publication date: May-2025
    • (2025)Co-location pattern mining using approximate Euclidean measureInformation Sciences10.1016/j.ins.2025.122000706(122000)Online publication date: Jul-2025
    • (2024)Scalable Order-Preserving Pattern Mining2024 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM59182.2024.00028(211-220)Online publication date: 9-Dec-2024
    • (2024)DCFA-iTimeNet: Dynamic cross-fusion attention network for interpretable time series predictionApplied Intelligence10.1007/s10489-024-05973-255:2Online publication date: 6-Dec-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media