Abstract
Multivariate sequence analysis is of growing interest for learning on data with numerous correlated time-stamped sequences. It is characterized by correlations among dimensions of multivariate sequences and may not be separately analyzed as multiple independent univariate sequences. On the other hand, labeled data is usually expensive and difficult to obtain in many real-world applications. We present a graph-based semi-supervised learning framework for multivariate sequence classification. The framework explores the correlation within the multivariate sequences, and exploits additional information about the distribution of both labeled and unlabeled data to provide better predictive performance. We also develop an efficient method to extend the graph-based learning approach to out-of-sample prediction. We demonstrate the effectiveness of our approach on real-world multivariate sequence datasets from three domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Banko, Z., Abonyi, J.: Correlation based dynamic time warping of multivariate time series. Expert Syst. Appl. 39(17), 12814–12823 (2012)
Belkin, M., Niyogi, P., Sindhwani, V.: On manifold regularization. In: Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (2005)
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory (1998)
Chapelle, O., Schölkopf, B., Zien, A. (eds.): Semi-Supervised Learning. MIT Press, Cambridge (2006)
Chapelle, O., Vapnik, V.: Model selection for support vector machines. In: NIPS (1999)
Cristianini, N., Kandola, J., Elisseeff, A., ShaweTaylor, J.: On kernel target alignment. In: NIPS (2001)
Esling, P., Agon, C.: Time-series data mining. ACM Comput. Surv. 45(1), 1–34 (2012)
Kadous, M.W.: Temporal Classification: Extending the Classification Paradigm to Multivariate Time Series. Ph.D. Thesis, University of New South Wales (2002)
Kelley, C.T.: Iterative Methods for Linear and Nonlinear Equations. Society for Industrial and Applied Mathematics, Philadelphia (1995)
Krzanowski, W.J.: Between-groups comparison of principal components. J. Am. Stat. Assoc. 74, 703–707 (1979)
Kudo, M., Toyama, J., Shimbo, M.: Multidimensional curve classification using passing-through regions. Pattern Recogn. Lett. 20, 1103–1111 (1999)
Lanckriet, G., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.: Learning the kernel matrix with semi-definite programming. In: Proceedings of the International Conference on Machine Learning (2002)
Li, C., Khan, L., Prabhakaran, B.: Real-time classification of variable length multi-attribute motions. Knowl. Inf. Syst. 10(2), 16317183 (2005)
Liao, T.W.: Clustering of time series data-a survey. Pattern Recogn. 38(11), 1857–1874 (2005)
Marussy, K., Buza, K.: SUCCESS: a new approach for semi-supervised classification of time-series. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2013, Part I. LNCS, vol. 7894, pp. 437–447. Springer, Heidelberg (2013)
Montero, P., Vilar, J.A.: Tsclust: An r package for time series clustering. J. Stat. Softw. 62(1), 1–43 (2014)
Rath, T.M., Manmatha, R.: Lower-bounding of dynamic time warping distances formultivariate time series. Technical report MM-40, University of Massachusetts (2002)
Scudder, H.J.: Probability of error of some adaptive pattern-recognition machines. IEEE Trans. Inf. Theory 11, 363–371 (1965)
Seeger, M.: Learning with labeled and unlabeled data. Technical report, Institute for ANC, Edinburgh, UK (2001)
Shahabi, C., Yan, D.: Real-time pattern isolation and recognition over immersive sensor data streams. In: Proceedings of the 9th International Conference on Multi-Media Modeling (2003)
Sindhwani, V., Niyogi, P., Belkin, M.: Beyond the point cloud: from transductive to semi-supervised learning. In: Proceedings of the 22nd International Conference on Machine Learning (2005)
Smola, A.J., Kondor, R.: Kernels and regularization on graphs. In: Proceedings of the Conference on Learning Theory (2003)
Subakan, Y.C., Kurt, B., Cemgil, A.T., Sankur, B.: Probabilistic sequence clustering with spectral learning. Digit. Sig. Proc. 29, 1–19 (2014)
Wang, F., Zhang, C.: Label propagation through linear neighborhoods. In: Proceedings of the 23rd International Conference on Machine Learning (2006)
Wang, X., Mueen, A., Ding, H., Trajcevski, G., Scheuermann, P., Keogh, E.: Experimental comparison of representation methods and distance measures for time series data. Data Min. Knowl. Disc. 26(2), 275–309 (2012)
Wei, L., Keogh, F.J.: Semi-supervised time series classification. In: Proceedings of KDD, pp. 748–753 (2006)
Weng, X., Shen, J.: Classification of multivariate time series using locality preserving projection. Knowl.-based Syst. 21(7), 581–587 (2008)
Xing, Z., Pei, J., Keogh, E.: A brief survey on sequence classification. SIGKDD Explor. 12(1), 40–48 (2010)
Yang, K., Shahabi, C.: A pca-based similarity measure for multivariate time series. In: Proceedings of the 2nd ACM International Workshop on Multimedia Databases, pp. 65–74 (2004)
Zhu, X.: Semi-supervised learning literature survey. Technical report TR 1530, University of Wisconsin Madison, Department of Computer Sciences (2008)
Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Xu, Z., Funaya, K., Chen, H., Leoni, S. (2016). Semi-supervised Multivariate Sequential Pattern Mining. In: Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds) New Frontiers in Mining Complex Patterns. NFMCP 2015. Lecture Notes in Computer Science(), vol 9607. Springer, Cham. https://doi.org/10.1007/978-3-319-39315-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-39315-5_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39314-8
Online ISBN: 978-3-319-39315-5
eBook Packages: Computer ScienceComputer Science (R0)