Abstract
A significant problem when building classifiers based on data stream is information about the correct label. Most algorithms assume access to this information without any restrictions. Unfortunately, this is not possible in practice because the objects can come very quickly and labeling all of them is impossible, or we have to pay for providing the correct label (e.g., to human expert). Hence, methods based on partially labeled data, including methods based on an active learning approach, are becoming increasingly popular, i.e., when the learning algorithm itself decides which of the objects are interesting to improve the quality of the predictive model effectively. In this paper, we propose a new method of active learning of data stream classifier. Its quality has been compared with benchmark solutions based on a large number of test streams, and the results obtained prove the usefulness of the proposed method, especially in the case of a low budget dedicated to the labeling of incoming objects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Abdallah, Z.S., Gaber, M.M., Srinivasan, B., Krishnaswamy, S.: Adaptive mobile activity recognition system with evolving data streams. Neurocomputing 150, 304–317 (2015)
Bouguelia, M., Belaïd, Y., Belaïd, A.: An adaptive streaming active learning strategy based on instance weighting. Pattern Recogn. Lett. 70, 38–44 (2016)
Cano, A., Zafra, A., Ventura, S.: A parallel genetic programming algorithm for classification. In: Corchado, E., Kurzyński, M., Woźniak, M. (eds.) HAIS 2011. LNCS (LNAI), vol. 6678, pp. 172–181. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21219-2_23
Gaber, M.M.: Advances in data stream mining. Wiley Interdisc. Rev. Data Min. Knowl. Discovery 2(1), 79–85 (2012)
Krawczyk, B., Minku, L.L., Gama, J., Stefanowski, J., Woźniak, M.: Ensemble learning for data stream analysis: a survey. Inform. Fusion 37(Supplement C), 132–156 (2017)
Krawczyk, B., Pfahringer, B., Wozniak, M.: Combining active learning with concept drift detection for data stream mining. In: Abe, N., et al. (eds.) IEEE International Conference on Big Data, Big Data 2018, 10–13 December 2018, Seattle, WA, USA, pp. 2239–2244. IEEE (2018). https://doi.org/10.1109/BigData.2018.8622549
Ksieniewicz, P., Woźniak, M., Cyganek, B., Kasprzak, A., Walkowiak, K.: Data stream classification using active learned neural networks. Neurocomputing353, 74–82 (2019). https://doi.org/10.1016/j.neucom.2018.05.130, http://www.sciencedirect.com/science/article/pii/S0925231219303248, Recent Advancements in Hybrid Artificial Intelligence Systems
Ksieniewicz, P., Zyblewski, P.: Stream-learn-open-source python library for difficult data stream batch analysis. arXiv preprint arXiv:2001.11077 (2020)
Kurlej, B., Woźniak, M.: Active learning approach to concept drift problem. Logic J. IGPL 20(3), 550–559 (2012)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Settles, B.: Active Learning. Morgan and Claypool Publishers, San Rafael (2012)
Shan, J., Zhang, H., Liu, W., Liu, Q.: Online active learning ensemble framework for drifted data streams. IEEE Trans. Neural Netw. Learn. Syst. 30(2), 486–498 (2019). https://doi.org/10.1109/TNNLS.2018.2844332
Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with drifting streaming data. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 27–39 (2014)
Acknowledgements
This work was supported by the Polish National Science Centre under the grant No. 2017/27/B/ST6/01325 as well as by the statutory funds of the Department of Systems and Computer Networks, Wroclaw University of Science and Technology.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zyblewski, P., Ksieniewicz, P., Woźniak, M. (2020). Combination of Active and Random Labeling Strategy in the Non-stationary Data Stream Classification. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2020. Lecture Notes in Computer Science(), vol 12415. Springer, Cham. https://doi.org/10.1007/978-3-030-61401-0_54
Download citation
DOI: https://doi.org/10.1007/978-3-030-61401-0_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61400-3
Online ISBN: 978-3-030-61401-0
eBook Packages: Computer ScienceComputer Science (R0)