Abstract
Streaming is being increasingly demanded because it helps in analyzing data in real-time and in decision making. Over time, the number of existing devices increases continuously, generating a huge amount of data. Processing this data with traditional algorithms is impractical, so it is necessary to apply distributed algorithms in a Big Data context. In this paper, Apache Spark is used to implement some distributed versions of algorithms based on Extreme Learning Machine (ELM). In addition, these algorithms are evaluated with different real and synthetic datasets by performing two experiments. The first one tries to demonstrate that the performance of the distributed algorithms is the same as that of the sequential versions. The second experiment is a study about the behaviour of the algorithms in the presence of concept drift, an important research area within streaming.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cao, J., Lin, Z., Huang, G.B.: Voting base online sequential extreme learning machine for multi-class classification. In: 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), pp. 2327–2330. IEEE (2013). https://doi.org/10.1109/ISCAS.2013.6572344
Gama, J., Sebastião, R., Rodrigues, P.P.: On evaluating stream learning algorithms. Mach. Learn. 90(3), 317–346 (2013). https://doi.org/10.1007/s10994-012-5320-9
Haneen, A.A., Noraziah, A., Wahab, M.H.A.: A review on data stream classification. J. Phys. Conf. Ser., 012019 (2018). https://doi.org/10.1088/1742-6596/1018/1/012019
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No. 04CH37541), vol. 2, pp. 985–990. IEEE (2004). https://doi.org/10.1109/IJCNN.2004.1380068
Iwashita, A.S., Papa, J.P.: An overview on concept drift learning. IEEE Access 7, 1532–1547 (2019). https://doi.org/10.1109/ACCESS.2018.2886026
Lan, Y., Soh, Y.C., Huang, G.B.: Ensemble of online sequential extreme learning machine. Neurocomputing 72(13–15), 3391–3395 (2009). https://doi.org/10.1016/j.neucom.2009.02.013
Li, L., Sun, R., Cai, S., Zhao, K., Zhang, Q.: A review of improved extreme learning machine methods for data stream classification. Multimedia Tools Appl. 78(23), 33375–33400 (2019). https://doi.org/10.1007/s11042-019-7543-2
Liang, N.Y., Huang, G.B., Saratchandran, P., Sundararajan, N.: A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 17(6), 1411–1423 (2006). https://doi.org/10.1109/TNN.2006.880583
Puentes, F., Perez-Godoy, M.D., González, P., del Jesus, M.J.: An analysis of technological frameworks for data streams. In: PRAI (2020). under second review
Xu, S., Wang, J.: Dynamic extreme learning machine for data stream classification. Neurocomputing 238, 433–449 (2017). https://doi.org/10.1016/j.neucom.2016.12.078
Acknowledgements
This work was partially supported by the Spanish Ministry of Science and Innovation under project PID2019-107793GB-I00/AEI/10.13039/501100011033
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Puentes-Marchal, F., Pérez-Godoy, M.D., González, P., Jesus, M.J.D. (2021). Implementation of Data Stream Classification Neural Network Models Over Big Data Platforms. In: Rojas, I., Joya, G., Català, A. (eds) Advances in Computational Intelligence. IWANN 2021. Lecture Notes in Computer Science(), vol 12862. Springer, Cham. https://doi.org/10.1007/978-3-030-85099-9_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-85099-9_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85098-2
Online ISBN: 978-3-030-85099-9
eBook Packages: Computer ScienceComputer Science (R0)