Skip to main content

Implementation of Data Stream Classification Neural Network Models Over Big Data Platforms

  • Conference paper
  • First Online:
Advances in Computational Intelligence (IWANN 2021)

Abstract

Streaming is being increasingly demanded because it helps in analyzing data in real-time and in decision making. Over time, the number of existing devices increases continuously, generating a huge amount of data. Processing this data with traditional algorithms is impractical, so it is necessary to apply distributed algorithms in a Big Data context. In this paper, Apache Spark is used to implement some distributed versions of algorithms based on Extreme Learning Machine (ELM). In addition, these algorithms are evaluated with different real and synthetic datasets by performing two experiments. The first one tries to demonstrate that the performance of the distributed algorithms is the same as that of the sequential versions. The second experiment is a study about the behaviour of the algorithms in the presence of concept drift, an important research area within streaming.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://archive.ics.uci.edu/.

  2. 2.

    https://moa.cms.waikato.ac.nz/.

References

  1. Cao, J., Lin, Z., Huang, G.B.: Voting base online sequential extreme learning machine for multi-class classification. In: 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), pp. 2327–2330. IEEE (2013). https://doi.org/10.1109/ISCAS.2013.6572344

  2. Gama, J., Sebastião, R., Rodrigues, P.P.: On evaluating stream learning algorithms. Mach. Learn. 90(3), 317–346 (2013). https://doi.org/10.1007/s10994-012-5320-9

    Article  MathSciNet  MATH  Google Scholar 

  3. Haneen, A.A., Noraziah, A., Wahab, M.H.A.: A review on data stream classification. J. Phys. Conf. Ser., 012019 (2018). https://doi.org/10.1088/1742-6596/1018/1/012019

  4. Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No. 04CH37541), vol. 2, pp. 985–990. IEEE (2004). https://doi.org/10.1109/IJCNN.2004.1380068

  5. Iwashita, A.S., Papa, J.P.: An overview on concept drift learning. IEEE Access 7, 1532–1547 (2019). https://doi.org/10.1109/ACCESS.2018.2886026

    Article  Google Scholar 

  6. Lan, Y., Soh, Y.C., Huang, G.B.: Ensemble of online sequential extreme learning machine. Neurocomputing 72(13–15), 3391–3395 (2009). https://doi.org/10.1016/j.neucom.2009.02.013

    Article  Google Scholar 

  7. Li, L., Sun, R., Cai, S., Zhao, K., Zhang, Q.: A review of improved extreme learning machine methods for data stream classification. Multimedia Tools Appl. 78(23), 33375–33400 (2019). https://doi.org/10.1007/s11042-019-7543-2

    Article  Google Scholar 

  8. Liang, N.Y., Huang, G.B., Saratchandran, P., Sundararajan, N.: A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 17(6), 1411–1423 (2006). https://doi.org/10.1109/TNN.2006.880583

    Article  Google Scholar 

  9. Puentes, F., Perez-Godoy, M.D., González, P., del Jesus, M.J.: An analysis of technological frameworks for data streams. In: PRAI (2020). under second review

    Google Scholar 

  10. Xu, S., Wang, J.: Dynamic extreme learning machine for data stream classification. Neurocomputing 238, 433–449 (2017). https://doi.org/10.1016/j.neucom.2016.12.078

    Article  Google Scholar 

Download references

Acknowledgements

This work was partially supported by the Spanish Ministry of Science and Innovation under project PID2019-107793GB-I00/AEI/10.13039/501100011033

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fernando Puentes-Marchal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Puentes-Marchal, F., Pérez-Godoy, M.D., González, P., Jesus, M.J.D. (2021). Implementation of Data Stream Classification Neural Network Models Over Big Data Platforms. In: Rojas, I., Joya, G., Català, A. (eds) Advances in Computational Intelligence. IWANN 2021. Lecture Notes in Computer Science(), vol 12862. Springer, Cham. https://doi.org/10.1007/978-3-030-85099-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-85099-9_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-85098-2

  • Online ISBN: 978-3-030-85099-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics