Online Semi-supervised Learning from Evolving Data Streams with Meta-features and Deep Reinforcement Learning

Vafaie, Parsa; Viktor, Herna; Paquet, Eric; Michalowski, Wojtek

doi:10.1007/978-3-030-95470-3_6

Parsa Vafaie¹⁶,
Herna Viktor¹⁶,
Eric Paquet^16,17 &
…
Wojtek Michalowski¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 13164))

Included in the following conference series:

International Conference on Machine Learning, Optimization, and Data Science

1783 Accesses

Abstract

Online semi-supervised learning (SSL) from data streams is an emerging area of research with many applications due to the fact that it is often expensive, time-consuming, and sometimes even unfeasible to collect labelled data from streaming domains. State-of-the-art online SSL algorithms use clustering techniques to maintain micro-clusters, or, alternatively, employ wrapper methods that utilize pseudo-labeling based on confidence scores. Current approaches may introduce false behaviour or make limited use of labelled instances, thus potentially leading to important information being overlooked. In this paper, we introduce the novel Online Reinforce SSL algorithm that uses various K Nearest Neighbour (KNN) classifiers to learn meta-features across diverse domains. Our Online Reinforce SSL algorithm features a meta-reinforcement learning agent trained on multiple-source streams obtained by extracting meta-features and subsequently transferring this meta-knowledge to our target domain. That is, the predictions of the KNN learners are used to select pseudo-labels for the target domain as instances arrive via an incremental learning paradigm. Extensive experiments on benchmark datasets demonstrate the value of our approach and confirm that Online Reinforce SSL outperforms both the state-of-the-art and a self-training baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Our repository is available at https://github.com/pvafaie/Online-Reinforce-SSL.

References

van Engelen, J.E., Hoos, H.H.: A survey on semi-supervised learning. Mach. Learn. 109(2), 373–440 (2019). https://doi.org/10.1007/s10994-019-05855-6
Article MathSciNet MATH Google Scholar
Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)
MATH Google Scholar
Ud Din, S., Shao, J., Kumar, J., Ali, W., Liu, J., Ye, Y.: Online reliable semi-supervised learning on evolving data streams. Inf. Sci. 525, 153–171 (2020). https://www.sciencedirect.com/science/article/pii/S0020025520302322
Hosseini, M.J., Gholipour, A., Beigy, H.: An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams. Knowl. Inf. Syst. 46(3), 567–597 (2015). https://doi.org/10.1007/s10115-015-0837-4
Article Google Scholar
Wang, Y., Li, T.: Improving semi-supervised co-forest algorithm in evolving data streams. Appl. Intell. 4(10), 3248–3262 (2018)
Article Google Scholar
Vafaie, P., Viktor, H., Michalowski, W.: Multi-class imbalanced semi-supervised learning from streams through online ensembles. In: International Conference on Data Mining Workshops (ICDMW) 2020, pp. 867–874 (2020)
Google Scholar
Floyd, S.L.A., Viktor, H.L.: Soft voting windowing ensembles for learning from partially labelled streams. In: Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds.) NFMCP 2019. LNCS (LNAI), vol. 11948, pp. 85–99. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-48861-1_6
Chapter Google Scholar
Hospedales, T., Antoniou, A., Micaelli, P., Storkey, A.: Meta-learning in neural networks: a survey, arXiv preprint arXiv:2004.05439 (2020)
Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning. PMLR, pp. 1126–1135 (2017)
Google Scholar
Mishra, N., Rohaninejad, M., Chen, X., Abbeel, P.: A simple neural attentive meta-learner, arXiv preprint arXiv:1707.03141 (2017)
Zha, D., Lai, K.-H., Wan, M., Hu, X.: Meta-AAD: active anomaly detection with deep reinforcement learning, arXiv preprint arXiv:2009.07415 (2020)
Settles, B.: Active learning. Synth. Lect. Artif. Intell. Mach. Learn. 6(1), 1–114 (2012)
MathSciNet MATH Google Scholar
Haque, A., Khan, L., Baron, M.: Sand: semi-supervised adaptive novel class detection and classification over data stream. In: Thirtieth AAAI Conference on Artificial Intelligence (2016)
Google Scholar
Wagner, T., Guha, S., Kasiviswanathan, S., Mishra, N.: Semi-supervised learning on data streams via temporal label propagation. In: International Conference on Machine Learning. PMLR, pp. 5095–5104 (2018)
Google Scholar
Shao, J., Huang, C., Yang, Q., Luo, G.: Reliable semi-supervised learning. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp. 1197–1202. IEEE (2016)
Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms, arXiv preprint arXiv:1707.06347 (2017)
Schulman, J., Moritz, P., Levine, S., Jordan, M., Abbeel, P.: High-dimensional continuous control using generalized advantage estimation, arXiv preprint arXiv:1506.02438 (2015)
Bifet, A., Gavaldà, R., Holmes, G., Pfahringer, B.: Machine Learning for Data Streams with Practical Examples in MOA. MIT Press (2018). https://moa.cms.waikato.ac.nz/book/
Vergara, A., Vembu, S., Ayhan, T., Ryan, M.A., Homer, M.L., Huerta, R.: Chemical gas sensor drift compensation using classifier ensembles. Sens. Actuators B: Chem. 166–167, 320–329 (2012). http://www.sciencedirect.com/science/article/pii/S0925400512002018
Dua, D., Graff, C.: UCI machine learning repository (2017). http://archive.ics.uci.edu/ml
Reiss, A., Stricker, D.: Introducing a new benchmarked dataset for activity monitoring. In: 2012 16th International Symposium on Wearable Computers, pp. 108–109 (2012)
Google Scholar
Raffin, A., Hill, A., Ernestus, M., Gleave, A., Kanervisto, A., Dormann, N.: Stable baselines3 (2019). https://github.com/DLR-RM/stable-baselines3

Download references

Author information

Authors and Affiliations

School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada
Parsa Vafaie, Herna Viktor & Eric Paquet
Digital Technologies, National Research Council, Ottawa, Canada
Eric Paquet
Telfer School of Management, University of Ottawa, Ottawa, Canada
Wojtek Michalowski

Authors

Parsa Vafaie
View author publications
You can also search for this author in PubMed Google Scholar
Herna Viktor
View author publications
You can also search for this author in PubMed Google Scholar
Eric Paquet
View author publications
You can also search for this author in PubMed Google Scholar
Wojtek Michalowski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Herna Viktor .

Editor information

Editors and Affiliations

University of Catania, Catania, Italy
Giuseppe Nicosia
Department of Computer Science, University of Reading, Reading, UK
Varun Ojha
Department of Computer Science, University of Oxford, Oxford, UK
Emanuele La Malfa
Cambridge Judge Business School, University of Cambridge, Cambridge, UK
Gabriele La Malfa
Department of Biochemistry, University of Cambridge, Cambridge, UK
Giorgio Jansen
Department of Industrial and Systems Engineering, University of Florida, Gainesville, FL, USA
Panos M. Pardalos
University of Catania, Catania, Italy
Giovanni Giuffrida
Department of Informatics, Dana-Farber Cancer Institute, Boston, MA, USA
Renato Umeton

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vafaie, P., Viktor, H., Paquet, E., Michalowski, W. (2022). Online Semi-supervised Learning from Evolving Data Streams with Meta-features and Deep Reinforcement Learning. In: Nicosia, G., et al. Machine Learning, Optimization, and Data Science. LOD 2021. Lecture Notes in Computer Science(), vol 13164. Springer, Cham. https://doi.org/10.1007/978-3-030-95470-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-95470-3_6
Published: 02 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95469-7
Online ISBN: 978-3-030-95470-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Online Semi-supervised Learning from Evolving Data Streams with Meta-features and Deep Reinforcement Learning