Skip to main content

A MapReduce-Based ELM for Regression in Big Data

  • Conference paper
  • First Online:
Intelligent Data Engineering and Automated Learning – IDEAL 2016 (IDEAL 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9937))

Abstract

Regression is one of the most basic problems in machine learning. In big data era, for regression problem, extreme learning machine (ELM) can get better generalization performance and much fast training speed. However, the enlarging volume of dataset for training makes regression by ELM a challenging task, and it is hard to finish the training in a reasonable time or it will be out of memory. In this paper, through analyzing the theory of ELM, a MapReduce-Based ELM method is proposed. Under the MapReduce framework, ELM submodels are trained in every slave node parallelly. A combination method is designed to combine all the submodels as a complete model. The experiment results demonstrate that the MapReduce-Based ELM can efficient process big dataset on commodity hardware and it has a good performance on speedup under the cloud environment where the dataset is stored as data block in different machines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lynch, C.: Big data: how do your data grow. Nature 455, 28–29 (2008)

    Article  Google Scholar 

  2. Ayerid, B., Grana, M.: Hyperspectral image nonlinear unmixing and reconstruction by ELM regression ensemble. Neurocomputing 174, 299–309 (2016)

    Article  Google Scholar 

  3. Qiu, S.S., Gao, L.P., Wang, J.: Classification and regression of ELM, LVQ and SVM for E-nose data of strawberry juice. J. Food Eng. 144, 77–85 (2015)

    Article  Google Scholar 

  4. Sa, J.J.D., Backes, A.R.: ELM based signature for texture classification. Pattern Recogn. 51, 395–401 (2015)

    Google Scholar 

  5. Li, J.J., Wang, B.T., et al.: Probabilistic threshold query optimization based on threshold classification using ELM for uncertain data. Neurocomputing 174, 211–219 (2016)

    Article  Google Scholar 

  6. Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70, 489–501 (2006)

    Article  Google Scholar 

  7. Cambria, E., Huang, G.B.: Extreme learning machine. IEEE Intell. Syst. 28, 30–31 (2013)

    Article  Google Scholar 

  8. Ma, C., Ouyang, J.H., et al.: A novel kernel extreme learning machine algorithm based on self-adaptive artificial bee colony optimisation strategy. Int. J. Syst. Sci. 47, 1342–1357 (2016)

    Article  MATH  Google Scholar 

  9. Deng, W.Y., Ong, Y.S., Zheng, Q.H.: A fast reduced kernel extreme learning machine. Neural Netw. 76, 29–38 (2016)

    Article  Google Scholar 

  10. Huang, G.B., Chen, L.: Enhanced random search based incremental extreme learning machine. Neurocomputing 71, 3460–3468 (2008)

    Article  Google Scholar 

  11. Huang, G.B., Li, M.B., et al.: Incremental extreme learning machine with fully complex hidden nodes. Neurocomputing 71, pp. 576–583 (2008)

    Google Scholar 

  12. Lindstrom, A.: Generalized inverse of matrices and its applications. J. Oper. Res. Soc. 23, 598 (1972)

    Google Scholar 

  13. Xin, J.C., Wang, Z.Q., et al.: Elastic extreme learning machine for big data classification. Neurocomputing 149, 464–471 (2015)

    Article  Google Scholar 

  14. Wang, B.T., Huang, S., et al.: Parallel online sequential extreme learning machine based on MapReduce. Neurocomputing 149, 224–232 (2015)

    Article  Google Scholar 

  15. Wang, X.L., Chen, Y.Y., et al.: Parallelized extreme learning machine ensemble based on min-max modular network. Neurocomputing 128, 31–41 (2014)

    Article  Google Scholar 

  16. He, Q., Zhuang, F., Li, J., Shi, Z.: Parallel implementation of classification algorithms based on MapReduce. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds.) RSKT 2010. LNCS, vol. 6401, pp. 655–662. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  17. Ghemawat, S., Gobioff, H., Leung, S.T.: The google file system. In: Proceedings of the 19th ACM Symposium on Operating Systems Principles, pp. 29–43 (2003)

    Google Scholar 

  18. Borthakur, D.: The Hadoop Distributed File System, Architecture and Design (2007)

    Google Scholar 

  19. Hadoop Official Website: http://hadoop.apache.org

  20. Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN2004), vol. 2, pp. 985–990 (2004)

    Google Scholar 

Download references

Acknowledgment

This work is partially supported by the Natural Science Foundation of China & Key research and development program of China (51379198, 2016YFC0301404, 41176076, 31202036).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to T. H. Yan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Wu, B., Yan, T.H., Xu, X.S., He, B., Li, W.H. (2016). A MapReduce-Based ELM for Regression in Big Data. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2016. IDEAL 2016. Lecture Notes in Computer Science(), vol 9937. Springer, Cham. https://doi.org/10.1007/978-3-319-46257-8_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46257-8_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46256-1

  • Online ISBN: 978-3-319-46257-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics