Skip to main content

Semi-supervised Multivariate Sequential Pattern Mining

  • Conference paper
  • First Online:
New Frontiers in Mining Complex Patterns (NFMCP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9607))

Included in the following conference series:

Abstract

Multivariate sequence analysis is of growing interest for learning on data with numerous correlated time-stamped sequences. It is characterized by correlations among dimensions of multivariate sequences and may not be separately analyzed as multiple independent univariate sequences. On the other hand, labeled data is usually expensive and difficult to obtain in many real-world applications. We present a graph-based semi-supervised learning framework for multivariate sequence classification. The framework explores the correlation within the multivariate sequences, and exploits additional information about the distribution of both labeled and unlabeled data to provide better predictive performance. We also develop an efficient method to extend the graph-based learning approach to out-of-sample prediction. We demonstrate the effectiveness of our approach on real-world multivariate sequence datasets from three domains.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Banko, Z., Abonyi, J.: Correlation based dynamic time warping of multivariate time series. Expert Syst. Appl. 39(17), 12814–12823 (2012)

    Article  Google Scholar 

  2. Belkin, M., Niyogi, P., Sindhwani, V.: On manifold regularization. In: Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (2005)

    Google Scholar 

  3. Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory (1998)

    Google Scholar 

  4. Chapelle, O., Schölkopf, B., Zien, A. (eds.): Semi-Supervised Learning. MIT Press, Cambridge (2006)

    Google Scholar 

  5. Chapelle, O., Vapnik, V.: Model selection for support vector machines. In: NIPS (1999)

    Google Scholar 

  6. Cristianini, N., Kandola, J., Elisseeff, A., ShaweTaylor, J.: On kernel target alignment. In: NIPS (2001)

    Google Scholar 

  7. Esling, P., Agon, C.: Time-series data mining. ACM Comput. Surv. 45(1), 1–34 (2012)

    Article  MATH  Google Scholar 

  8. Kadous, M.W.: Temporal Classification: Extending the Classification Paradigm to Multivariate Time Series. Ph.D. Thesis, University of New South Wales (2002)

    Google Scholar 

  9. Kelley, C.T.: Iterative Methods for Linear and Nonlinear Equations. Society for Industrial and Applied Mathematics, Philadelphia (1995)

    Book  MATH  Google Scholar 

  10. Krzanowski, W.J.: Between-groups comparison of principal components. J. Am. Stat. Assoc. 74, 703–707 (1979)

    Article  MathSciNet  MATH  Google Scholar 

  11. Kudo, M., Toyama, J., Shimbo, M.: Multidimensional curve classification using passing-through regions. Pattern Recogn. Lett. 20, 1103–1111 (1999)

    Article  Google Scholar 

  12. Lanckriet, G., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.: Learning the kernel matrix with semi-definite programming. In: Proceedings of the International Conference on Machine Learning (2002)

    Google Scholar 

  13. Li, C., Khan, L., Prabhakaran, B.: Real-time classification of variable length multi-attribute motions. Knowl. Inf. Syst. 10(2), 16317183 (2005)

    Google Scholar 

  14. Liao, T.W.: Clustering of time series data-a survey. Pattern Recogn. 38(11), 1857–1874 (2005)

    Article  MATH  Google Scholar 

  15. Marussy, K., Buza, K.: SUCCESS: a new approach for semi-supervised classification of time-series. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2013, Part I. LNCS, vol. 7894, pp. 437–447. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  16. Montero, P., Vilar, J.A.: Tsclust: An r package for time series clustering. J. Stat. Softw. 62(1), 1–43 (2014)

    Article  Google Scholar 

  17. Rath, T.M., Manmatha, R.: Lower-bounding of dynamic time warping distances formultivariate time series. Technical report MM-40, University of Massachusetts (2002)

    Google Scholar 

  18. Scudder, H.J.: Probability of error of some adaptive pattern-recognition machines. IEEE Trans. Inf. Theory 11, 363–371 (1965)

    Article  MathSciNet  MATH  Google Scholar 

  19. Seeger, M.: Learning with labeled and unlabeled data. Technical report, Institute for ANC, Edinburgh, UK (2001)

    Google Scholar 

  20. Shahabi, C., Yan, D.: Real-time pattern isolation and recognition over immersive sensor data streams. In: Proceedings of the 9th International Conference on Multi-Media Modeling (2003)

    Google Scholar 

  21. Sindhwani, V., Niyogi, P., Belkin, M.: Beyond the point cloud: from transductive to semi-supervised learning. In: Proceedings of the 22nd International Conference on Machine Learning (2005)

    Google Scholar 

  22. Smola, A.J., Kondor, R.: Kernels and regularization on graphs. In: Proceedings of the Conference on Learning Theory (2003)

    Google Scholar 

  23. Subakan, Y.C., Kurt, B., Cemgil, A.T., Sankur, B.: Probabilistic sequence clustering with spectral learning. Digit. Sig. Proc. 29, 1–19 (2014)

    Article  MathSciNet  Google Scholar 

  24. Wang, F., Zhang, C.: Label propagation through linear neighborhoods. In: Proceedings of the 23rd International Conference on Machine Learning (2006)

    Google Scholar 

  25. Wang, X., Mueen, A., Ding, H., Trajcevski, G., Scheuermann, P., Keogh, E.: Experimental comparison of representation methods and distance measures for time series data. Data Min. Knowl. Disc. 26(2), 275–309 (2012)

    Article  MathSciNet  Google Scholar 

  26. Wei, L., Keogh, F.J.: Semi-supervised time series classification. In: Proceedings of KDD, pp. 748–753 (2006)

    Google Scholar 

  27. Weng, X., Shen, J.: Classification of multivariate time series using locality preserving projection. Knowl.-based Syst. 21(7), 581–587 (2008)

    Article  Google Scholar 

  28. Xing, Z., Pei, J., Keogh, E.: A brief survey on sequence classification. SIGKDD Explor. 12(1), 40–48 (2010)

    Article  Google Scholar 

  29. Yang, K., Shahabi, C.: A pca-based similarity measure for multivariate time series. In: Proceedings of the 2nd ACM International Workshop on Multimedia Databases, pp. 65–74 (2004)

    Google Scholar 

  30. Zhu, X.: Semi-supervised learning literature survey. Technical report TR 1530, University of Wisconsin Madison, Department of Computer Sciences (2008)

    Google Scholar 

  31. Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhao Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Xu, Z., Funaya, K., Chen, H., Leoni, S. (2016). Semi-supervised Multivariate Sequential Pattern Mining. In: Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds) New Frontiers in Mining Complex Patterns. NFMCP 2015. Lecture Notes in Computer Science(), vol 9607. Springer, Cham. https://doi.org/10.1007/978-3-319-39315-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39315-5_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39314-8

  • Online ISBN: 978-3-319-39315-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics