A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification

Li, Yifei; He, Guoliang; Xia, Xuewen; Li, Yuanxiang

doi:10.1007/978-3-319-44403-1_17

Yifei Li¹⁵,
Guoliang He^15,16,
Xuewen Xia¹⁷ &
…
Yuanxiang Li^15,16

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9827))

Included in the following conference series:

International Conference on Database and Expert Systems Applications

1026 Accesses
2 Citations

Abstract

Time series widely exist in many areas. In reality, the number of labeled time series data is often small and there is a huge number of unlabeled data. Manually labeling these unlabeled examples is time-consuming and expensive, and sometimes it is even impossible. To reduce manual cost and obtain high confident labeled training data for multivariate time series classification, in this paper a reverse nearest neighbor based active semi-supervised learning method is proposed. First, based on information entropy and distribution density of the training data, a sampling strategy is introduced to select the most informative examples for manual annotation. Second, in terms of the newly labeled example by experts, a reverse nearest neighbor based semi-supervised learning method is presented to automatically and accurately label some confident examples. We evaluate our work with a comprehensive set of experiments on diverse multivariate time series data. Experimental results show that our approach can obtain a confident labeled training data with less manual cost.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Wei, L., Keogh, E.: Semi-supervised time series classification. In: KDD (2006)
Google Scholar
Begum, N., Hu, B., Rakthanmanon, T., Keogh, E.: Towards a minimum description length based stopping criterion for semi-supervised time series classification. In: IEEE IRI (2013)
Google Scholar
Nguyen, M.N., Li, X.-L., Ng, S.-K.: Positive unlabeled learning for time series classification. In: IJCAI (2011)
Google Scholar
Nguyen, M.N., Li, X.-L., Ng, S.-K.: Ensemble based positive unlabeled learning for time series classification. In: Lee, S.-g., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012, Part I. LNCS, vol. 7238, pp. 243–257. Springer, Heidelberg (2012)
Chapter Google Scholar
Mabel, G., Chrisoph, B., Isaac, T., Taneet, R., Kosé, B.: On the stopping criteria for k-nearest neighbor in positive unlabeled time series classification problems. Inf. Sci. 328, 42–59 (2016)
Article Google Scholar
He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)
Article Google Scholar
Yifan, F., Zhu, X., Li, B.: A survey on instance selection for active learning. Knowl. Inf. Sys. 35, 249–283 (2013)
Article Google Scholar
Settles, B.: Active learning literature survey. Computer Sciences Technical report, University of Wisconsin–Madison, 26 January 2010
Google Scholar
Guo, H., Wang, W.: An active learning-based SVM multi-class classification model. Pattern Recogn. 48(5), 1577–1597 (2015)
Article Google Scholar
Huang, S.-J., Jinm, R., Zhou, Z.: Active learning by querying informative and representative examples. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 1936–1949 (2014)
Article Google Scholar
Hady, M.F.A., Schwenker, F.: Combing committee-based semi-supervised learning and active learning. J. Comput. Sci. Technol. 25(4), 681–698 (2010)
Article Google Scholar
Seung, H., Opper, M., Sompolinsky, H.: Query by committee. In: ACM Workshop on Computational Learning Theory, pp. 287–294 (1992)
Google Scholar
He, G., Duan, Y., Li, Y., Qian, T., He, J., Jia, X.: Active learning for multivariate time series classification with positive unlabeled data. In: ICTAI (2015)
Google Scholar
Zhu, J., Wang, H., Tsou, B.K., Ma, M.: Active learning with sampling by uncertainty and density for data annotations. IEEE Trans. Audio Speech Lang. Process. 18(6), 1323–1331 (2010)
Article Google Scholar
Huang, H., He, Q., He, J., Ma, L.: RADRA: rare category detection iva computation of boundary degree. In: PAKDD (2011)
Google Scholar
Xia, C., Hsu, W., Lee, M.L., Ooi, B.C.: BORDER: efficient computation of boundary points. IEEE Trans. Knowl. Data Eng. 18(3), 289–303 (2006)
Article Google Scholar
http://archive.ics.uci.edu/ml/datasets.html
http://www.cs.cmu.edu/~bobski/
http://www.cs.ucr.edu/~eamonn/time_series_data/

Download references

Author information

Authors and Affiliations

State Key Laboratory of Software Engineering, Wuhan University, Wuhan, China
Yifei Li, Guoliang He & Yuanxiang Li
College of Computer Science, Wuhan University, Wuhan, China
Guoliang He & Yuanxiang Li
School of Software, East China Jiaotong University, Nanchang, China
Xuewen Xia

Authors

Yifei Li
View author publications
You can also search for this author in PubMed Google Scholar
Guoliang He
View author publications
You can also search for this author in PubMed Google Scholar
Xuewen Xia
View author publications
You can also search for this author in PubMed Google Scholar
Yuanxiang Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guoliang He .

Editor information

Editors and Affiliations

Clausthal University of Technology , Clausthal-Zellerfeld, Germany
Sven Hartmann
Victoria University of Wellington , Wellington, New Zealand
Hui Ma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, Y., He, G., Xia, X., Li, Y. (2016). A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science(), vol 9827. Springer, Cham. https://doi.org/10.1007/978-3-319-44403-1_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-44403-1_17
Published: 06 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44402-4
Online ISBN: 978-3-319-44403-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics