Skip to main content
Log in

Mining partial periodic correlations in time series

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Recently, periodic pattern mining from time series data has been studied extensively. However, an interesting type of periodic pattern, called partial periodic (PP) correlation in this paper, has not been investigated. An example of PP correlation is that power consumption is high either on Monday or Tuesday but not on both days. In general, a PP correlation is a set of offsets within a particular period such that the data at these offsets are correlated with a certain user-desired strength. In the above example, the period is a week (7 days), and each day of the week is an offset of the period. PP correlations can provide insightful knowledge about the time series and can be used for predicting future values. This paper introduces an algorithm to mine time series for PP correlations based on the principal component analysis (PCA) method. Specifically, given a period, the algorithm maps the time series data to data points in a multidimensional space, where the dimensions correspond to the offsets within the period. A PP correlation is then equivalent to correlation of data when projected to a subset of the dimensions. The algorithm discovers, with one sequential scan of data, all those PP correlations (called minimum PP correlations) that are not unions of some other PP correlations. Experiments using both real and synthetic data sets show that the PCA-based algorithm is highly efficient and effective in finding the minimum PP correlations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Aref W, Elfeky M, Elmagarmid A (2004) Incremental, online, and merge mining of partial periodic patterns in time-series databases. IEEE Trans Knowledge Data Eng

  2. Berberidis C, Aref W, Atallah M, Vlahavas I, Elmagarmid A (2002) Multiple and partial periodicity mining in time series databases. In: Proceedings of the European conference on artificial intellegence, pp 370–374

  3. Box P, Jenkins M, Reinsel C (1994) Time series analysis, forecasting and control. Prentice-Hall

  4. Brown GR (1963) Smoothing, forecasting and prediction. Prentice-Hall

  5. Chakrabarti K, Mehrotra S (2000) Local dimensionality reduction: a new approach to indexing high dimensional spaces. In: Proceedings of VLDB, pp 89–100

  6. Chatfield C, Yar M (1988) Holt—Winters forecasting: some practical issues. The Statistician, pp 129–140

  7. Cui B, Ooi BC, Su J, Tan K-L (2003) Contorting high dimensional data for efficient main memory KNN processing. In: Proceedings of ACM-SIGMOD

  8. Elfeky M, Aref W, Elmagarmid A (2004) Using convolution to mine obscure periodic patterns in one pass. In: Proceedings of EDBT, pp 605–620

  9. Farnum RN, Stanton WL (1989) Quantitative forecasting methods. PWS-Kent

  10. Fox J (1997) Applied regression analysis, linear models, and related methods. Sage Publications, Thousand Oaks, CA

    Google Scholar 

  11. Gardner SE, McKenzie E (1985) Forecasting trends in time series. Manage Sci, pp 1237–1246

  12. Han J, Dong G, Yin Y (1999) Efficient mining of partial periodic patterns in time series database. In: Proceedings of ICDE, pp 106–115

  13. Han J, Gong W, Yin Y (1998) Mining segment-wise periodic patterns in time-related databases. In: Proceedings of international conference in knowledge discovery and data mining, pp 214–218

  14. Hipel K, McLeo AI (1994) Time series modelling of water resources and environmental systems. Elsevier

  15. Hui J, Ooi BC, Shen H, Yu C (2003) An adaptive and efficient dimensionality reduction algorithm for high-dimensional indexing. In: Proceedings of ICDE, pp 87–99

  16. Indyk P, Koudas N, Muthukrishnan S (2000) Identifying representative trends in massive time series data sets. In: Proceedings of VLDB, pp 363–372

  17. Jolliffe I (1986) Principal component analysis. Springer-Verlag

  18. Ma S, Hellerstein J (2001) Mining partially periodic event patterns with unknown periods. In: Proceedings of ICDE, pp 205–214

  19. Ord KJ, Koehler BA, Snyder DR (1997) Estimation and prediction for a class of dynamic nonlinear statistical models. J Am Stat Assoc 92:1621–1629

    Article  MATH  Google Scholar 

  20. Shlens J (in press) A tutorial on principal component analysis. http://www.snl.salk.edu/~shlens/pub/notes/pca.pdf

  21. Wang W, Yang J, Yu P (2001) Meta-patterns: revealing hidden periodic patterns. In: Proceedings of ICDM, pp 550–557

  22. Yang J, Wang W, Yu P (2001) Infominer: mining surprising periodic patterns. In: Proceedings of international conference in knowledge discovery and data mining, pp 395–400

  23. Yang J, Wang W, Yu P (2004) Discovering high order periodic patterns. Knowledge Inf Syst J (KAIS) 6(3):243–268

    Article  Google Scholar 

  24. Yang J, Wang W, Yu PS (2000) Mining asynchronous periodic patterns in time series data. In: Proceedings of international conference in knowledge discovery and data mining, pp 275–279

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhen He.

Additional information

Zhen He is a lecturer in the Department of Computer Science at La Trobe University. His main research areas are database systems optimization, time series mining, wireless sensor networks, and XML information retrieval. Prior to joining La Trobe University, he worked as a postdoctoral research associate in the University of Vermont. He holds Bachelors, Honors and Ph.D degrees in Computer Science from the Australian National University.

X. Sean Wang received his Ph.D degree in Computer Science from the University of Southern California in 1992. He is currently the Dorothean Chair Professor in Computer Science at the University of Vermont. He has published widely in the general area of databases and information security, and was a recipient of the US National Science Foundation Research Initiation and CAREER awards. His research interests include database systems, information security, data mining, and sensor data processing.

Byung Suk Lee is associate professor of Computer Science at the University of Vermont. His main research areas are database systems, data modeling, and information retrieval. He held positions in industry and academia: Gold Star Electric, Bell Communications Research, Datacom Global Communications, University of St. Thomas, and currently University of Vermont. He was also a visiting professor at Dartmouth College and a participating guest at Lawrence Livermore National Laboratory. He served on international conferences as a program committee member, a publicity chair, and a special session organizer, and also on US federal funding proposal review panel. He holds a BS degree from Seoul National University, MS from Korea Advanced Institute of Science and Technology, and Ph.D from Stanford University.

Alan C. H. Ling is an assistant professor at Department of Computer Science in University of Vermont. His research interests include combinatorial design theory, coding theory, sequence designs, and applications of design theory.

Rights and permissions

Reprints and permissions

About this article

Cite this article

He, Z., Wang, X.S., Lee, B.S. et al. Mining partial periodic correlations in time series. Knowl Inf Syst 15, 31–54 (2008). https://doi.org/10.1007/s10115-006-0051-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-006-0051-5

Keywords

Navigation