Skip to main content

A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification

  • Conference paper
  • First Online:
Database and Expert Systems Applications (DEXA 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9827))

Included in the following conference series:

Abstract

Time series widely exist in many areas. In reality, the number of labeled time series data is often small and there is a huge number of unlabeled data. Manually labeling these unlabeled examples is time-consuming and expensive, and sometimes it is even impossible. To reduce manual cost and obtain high confident labeled training data for multivariate time series classification, in this paper a reverse nearest neighbor based active semi-supervised learning method is proposed. First, based on information entropy and distribution density of the training data, a sampling strategy is introduced to select the most informative examples for manual annotation. Second, in terms of the newly labeled example by experts, a reverse nearest neighbor based semi-supervised learning method is presented to automatically and accurately label some confident examples. We evaluate our work with a comprehensive set of experiments on diverse multivariate time series data. Experimental results show that our approach can obtain a confident labeled training data with less manual cost.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wei, L., Keogh, E.: Semi-supervised time series classification. In: KDD (2006)

    Google Scholar 

  2. Begum, N., Hu, B., Rakthanmanon, T., Keogh, E.: Towards a minimum description length based stopping criterion for semi-supervised time series classification. In: IEEE IRI (2013)

    Google Scholar 

  3. Nguyen, M.N., Li, X.-L., Ng, S.-K.: Positive unlabeled learning for time series classification. In: IJCAI (2011)

    Google Scholar 

  4. Nguyen, M.N., Li, X.-L., Ng, S.-K.: Ensemble based positive unlabeled learning for time series classification. In: Lee, S.-g., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012, Part I. LNCS, vol. 7238, pp. 243–257. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  5. Mabel, G., Chrisoph, B., Isaac, T., Taneet, R., Kosé, B.: On the stopping criteria for k-nearest neighbor in positive unlabeled time series classification problems. Inf. Sci. 328, 42–59 (2016)

    Article  Google Scholar 

  6. He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)

    Article  Google Scholar 

  7. Yifan, F., Zhu, X., Li, B.: A survey on instance selection for active learning. Knowl. Inf. Sys. 35, 249–283 (2013)

    Article  Google Scholar 

  8. Settles, B.: Active learning literature survey. Computer Sciences Technical report, University of Wisconsin–Madison, 26 January 2010

    Google Scholar 

  9. Guo, H., Wang, W.: An active learning-based SVM multi-class classification model. Pattern Recogn. 48(5), 1577–1597 (2015)

    Article  Google Scholar 

  10. Huang, S.-J., Jinm, R., Zhou, Z.: Active learning by querying informative and representative examples. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 1936–1949 (2014)

    Article  Google Scholar 

  11. Hady, M.F.A., Schwenker, F.: Combing committee-based semi-supervised learning and active learning. J. Comput. Sci. Technol. 25(4), 681–698 (2010)

    Article  Google Scholar 

  12. Seung, H., Opper, M., Sompolinsky, H.: Query by committee. In: ACM Workshop on Computational Learning Theory, pp. 287–294 (1992)

    Google Scholar 

  13. He, G., Duan, Y., Li, Y., Qian, T., He, J., Jia, X.: Active learning for multivariate time series classification with positive unlabeled data. In: ICTAI (2015)

    Google Scholar 

  14. Zhu, J., Wang, H., Tsou, B.K., Ma, M.: Active learning with sampling by uncertainty and density for data annotations. IEEE Trans. Audio Speech Lang. Process. 18(6), 1323–1331 (2010)

    Article  Google Scholar 

  15. Huang, H., He, Q., He, J., Ma, L.: RADRA: rare category detection iva computation of boundary degree. In: PAKDD (2011)

    Google Scholar 

  16. Xia, C., Hsu, W., Lee, M.L., Ooi, B.C.: BORDER: efficient computation of boundary points. IEEE Trans. Knowl. Data Eng. 18(3), 289–303 (2006)

    Article  Google Scholar 

  17. http://archive.ics.uci.edu/ml/datasets.html

  18. http://www.cs.cmu.edu/~bobski/

  19. http://www.cs.ucr.edu/~eamonn/time_series_data/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guoliang He .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Li, Y., He, G., Xia, X., Li, Y. (2016). A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science(), vol 9827. Springer, Cham. https://doi.org/10.1007/978-3-319-44403-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-44403-1_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44402-4

  • Online ISBN: 978-3-319-44403-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics