Skip to main content

Combination of Active and Random Labeling Strategy in the Non-stationary Data Stream Classification

  • Conference paper
  • First Online:
Artificial Intelligence and Soft Computing (ICAISC 2020)

Abstract

A significant problem when building classifiers based on data stream is information about the correct label. Most algorithms assume access to this information without any restrictions. Unfortunately, this is not possible in practice because the objects can come very quickly and labeling all of them is impossible, or we have to pay for providing the correct label (e.g., to human expert). Hence, methods based on partially labeled data, including methods based on an active learning approach, are becoming increasingly popular, i.e., when the learning algorithm itself decides which of the objects are interesting to improve the quality of the predictive model effectively. In this paper, we propose a new method of active learning of data stream classifier. Its quality has been compared with benchmark solutions based on a large number of test streams, and the results obtained prove the usefulness of the proposed method, especially in the case of a low budget dedicated to the labeling of incoming objects.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/w4k2/bals.

References

  1. Abdallah, Z.S., Gaber, M.M., Srinivasan, B., Krishnaswamy, S.: Adaptive mobile activity recognition system with evolving data streams. Neurocomputing 150, 304–317 (2015)

    Article  Google Scholar 

  2. Bouguelia, M., Belaïd, Y., Belaïd, A.: An adaptive streaming active learning strategy based on instance weighting. Pattern Recogn. Lett. 70, 38–44 (2016)

    Article  Google Scholar 

  3. Cano, A., Zafra, A., Ventura, S.: A parallel genetic programming algorithm for classification. In: Corchado, E., Kurzyński, M., Woźniak, M. (eds.) HAIS 2011. LNCS (LNAI), vol. 6678, pp. 172–181. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21219-2_23

    Chapter  Google Scholar 

  4. Gaber, M.M.: Advances in data stream mining. Wiley Interdisc. Rev. Data Min. Knowl. Discovery 2(1), 79–85 (2012)

    Google Scholar 

  5. Krawczyk, B., Minku, L.L., Gama, J., Stefanowski, J., Woźniak, M.: Ensemble learning for data stream analysis: a survey. Inform. Fusion 37(Supplement C), 132–156 (2017)

    Google Scholar 

  6. Krawczyk, B., Pfahringer, B., Wozniak, M.: Combining active learning with concept drift detection for data stream mining. In: Abe, N., et al. (eds.) IEEE International Conference on Big Data, Big Data 2018, 10–13 December 2018, Seattle, WA, USA, pp. 2239–2244. IEEE (2018). https://doi.org/10.1109/BigData.2018.8622549

  7. Ksieniewicz, P., Woźniak, M., Cyganek, B., Kasprzak, A., Walkowiak, K.: Data stream classification using active learned neural networks. Neurocomputing353, 74–82 (2019). https://doi.org/10.1016/j.neucom.2018.05.130, http://www.sciencedirect.com/science/article/pii/S0925231219303248, Recent Advancements in Hybrid Artificial Intelligence Systems

  8. Ksieniewicz, P., Zyblewski, P.: Stream-learn-open-source python library for difficult data stream batch analysis. arXiv preprint arXiv:2001.11077 (2020)

  9. Kurlej, B., Woźniak, M.: Active learning approach to concept drift problem. Logic J. IGPL 20(3), 550–559 (2012)

    Article  MathSciNet  Google Scholar 

  10. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  11. Settles, B.: Active Learning. Morgan and Claypool Publishers, San Rafael (2012)

    MATH  Google Scholar 

  12. Shan, J., Zhang, H., Liu, W., Liu, Q.: Online active learning ensemble framework for drifted data streams. IEEE Trans. Neural Netw. Learn. Syst. 30(2), 486–498 (2019). https://doi.org/10.1109/TNNLS.2018.2844332

    Article  Google Scholar 

  13. Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with drifting streaming data. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 27–39 (2014)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the Polish National Science Centre under the grant No. 2017/27/B/ST6/01325 as well as by the statutory funds of the Department of Systems and Computer Networks, Wroclaw University of Science and Technology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michał Woźniak .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zyblewski, P., Ksieniewicz, P., Woźniak, M. (2020). Combination of Active and Random Labeling Strategy in the Non-stationary Data Stream Classification. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2020. Lecture Notes in Computer Science(), vol 12415. Springer, Cham. https://doi.org/10.1007/978-3-030-61401-0_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61401-0_54

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61400-3

  • Online ISBN: 978-3-030-61401-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics