Abstract
Time series widely exist in many areas. In reality, the number of labeled time series data is often small and there is a huge number of unlabeled data. Manually labeling these unlabeled examples is time-consuming and expensive, and sometimes it is even impossible. To reduce manual cost and obtain high confident labeled training data for multivariate time series classification, in this paper a reverse nearest neighbor based active semi-supervised learning method is proposed. First, based on information entropy and distribution density of the training data, a sampling strategy is introduced to select the most informative examples for manual annotation. Second, in terms of the newly labeled example by experts, a reverse nearest neighbor based semi-supervised learning method is presented to automatically and accurately label some confident examples. We evaluate our work with a comprehensive set of experiments on diverse multivariate time series data. Experimental results show that our approach can obtain a confident labeled training data with less manual cost.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wei, L., Keogh, E.: Semi-supervised time series classification. In: KDD (2006)
Begum, N., Hu, B., Rakthanmanon, T., Keogh, E.: Towards a minimum description length based stopping criterion for semi-supervised time series classification. In: IEEE IRI (2013)
Nguyen, M.N., Li, X.-L., Ng, S.-K.: Positive unlabeled learning for time series classification. In: IJCAI (2011)
Nguyen, M.N., Li, X.-L., Ng, S.-K.: Ensemble based positive unlabeled learning for time series classification. In: Lee, S.-g., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012, Part I. LNCS, vol. 7238, pp. 243–257. Springer, Heidelberg (2012)
Mabel, G., Chrisoph, B., Isaac, T., Taneet, R., Kosé, B.: On the stopping criteria for k-nearest neighbor in positive unlabeled time series classification problems. Inf. Sci. 328, 42–59 (2016)
He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)
Yifan, F., Zhu, X., Li, B.: A survey on instance selection for active learning. Knowl. Inf. Sys. 35, 249–283 (2013)
Settles, B.: Active learning literature survey. Computer Sciences Technical report, University of Wisconsin–Madison, 26 January 2010
Guo, H., Wang, W.: An active learning-based SVM multi-class classification model. Pattern Recogn. 48(5), 1577–1597 (2015)
Huang, S.-J., Jinm, R., Zhou, Z.: Active learning by querying informative and representative examples. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 1936–1949 (2014)
Hady, M.F.A., Schwenker, F.: Combing committee-based semi-supervised learning and active learning. J. Comput. Sci. Technol. 25(4), 681–698 (2010)
Seung, H., Opper, M., Sompolinsky, H.: Query by committee. In: ACM Workshop on Computational Learning Theory, pp. 287–294 (1992)
He, G., Duan, Y., Li, Y., Qian, T., He, J., Jia, X.: Active learning for multivariate time series classification with positive unlabeled data. In: ICTAI (2015)
Zhu, J., Wang, H., Tsou, B.K., Ma, M.: Active learning with sampling by uncertainty and density for data annotations. IEEE Trans. Audio Speech Lang. Process. 18(6), 1323–1331 (2010)
Huang, H., He, Q., He, J., Ma, L.: RADRA: rare category detection iva computation of boundary degree. In: PAKDD (2011)
Xia, C., Hsu, W., Lee, M.L., Ooi, B.C.: BORDER: efficient computation of boundary points. IEEE Trans. Knowl. Data Eng. 18(3), 289–303 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Li, Y., He, G., Xia, X., Li, Y. (2016). A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science(), vol 9827. Springer, Cham. https://doi.org/10.1007/978-3-319-44403-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-44403-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44402-4
Online ISBN: 978-3-319-44403-1
eBook Packages: Computer ScienceComputer Science (R0)