A MapReduce-Based ELM for Regression in Big Data

Wu, B.; Yan, T. H.; Xu, X. S.; He, B.; Li, W. H.

doi:10.1007/978-3-319-46257-8_18


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9937)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning


Abstract

Regression is one of the most basic problems in machine learning. In the big data era, the extreme learning machine (ELM) can achieve better generalization performance and much faster training speed on regression problems. However, the growing volume of training data makes regression with ELM challenging: training either cannot finish in a reasonable time or runs out of memory. In this paper, based on an analysis of ELM theory, a MapReduce-based ELM method is proposed. Under the MapReduce framework, ELM submodels are trained in parallel on every slave node. A combination method is designed to merge all the submodels into a complete model. The experimental results demonstrate that the MapReduce-based ELM can efficiently process big datasets on commodity hardware and achieves good speedup in a cloud environment where the dataset is stored as data blocks on different machines.
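The abstract does not spell out how the submodels are combined, so the following is only a minimal single-process sketch of one common way to split ELM training across data blocks, not necessarily the authors' method. It assumes every block shares the same random hidden-layer weights, that each "map" step returns the partial matrices H^T H and H^T T for its block, and that the "reduce" step sums them before solving for the output weights. All function names, the sigmoid activation, the small ridge term, and the toy sine dataset are illustrative assumptions.

```python
import numpy as np

def hidden_output(X, W, b):
    """Sigmoid hidden-layer output H for inputs X with fixed random weights."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def map_block(X_block, T_block, W, b):
    """Map step (assumed): per-block statistics U = H^T H and V = H^T T."""
    H = hidden_output(X_block, W, b)
    return H.T @ H, H.T @ T_block

def reduce_stats(stats):
    """Reduce step (assumed): sum the per-block statistics."""
    U = sum(u for u, _ in stats)
    V = sum(v for _, v in stats)
    return U, V

def solve_beta(U, V, ridge=1e-3):
    """Output weights from (U + ridge * I) beta = V; ridge is an assumption."""
    return np.linalg.solve(U + ridge * np.eye(U.shape[0]), V)

rng = np.random.default_rng(0)
n_features, n_hidden = 1, 10
W = rng.normal(size=(n_features, n_hidden))  # shared random input weights
b = rng.normal(size=n_hidden)                # shared random biases

# Toy regression data standing in for a large dataset.
X = rng.uniform(-3, 3, size=(2000, n_features))
T = np.sin(X)

# Pretend each index block lives on a different machine.
blocks = np.array_split(np.arange(len(X)), 4)
stats = [map_block(X[idx], T[idx], W, b) for idx in blocks]
beta_parallel = solve_beta(*reduce_stats(stats))

# Reference: the same model trained on the whole dataset at once.
beta_single = solve_beta(*map_block(X, T, W, b))

H_all = hidden_output(X, W, b)
pred_parallel = H_all @ beta_parallel
pred_single = H_all @ beta_single
rmse = float(np.sqrt(np.mean((pred_parallel - T) ** 2)))
```

Because H^T H and H^T T are sums over training rows, adding the per-block results reproduces the full-data solution up to floating-point error, which is why the blockwise and single-pass predictions agree in this sketch. In a real Hadoop job the map and reduce functions would run on separate nodes instead of inside a list comprehension.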



Acknowledgment

This work is partially supported by the Natural Science Foundation of China and the Key Research and Development Program of China (51379198, 2016YFC0301404, 41176076, 31202036).

Author information

Authors and Affiliations

School of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, 310018, China
B. Wu, T. H. Yan & X. S. Xu
School of Information Science and Engineering College, Ocean University of China, Qingdao, 266100, China
B. He
School of Mechanical, Materials and Mechatronic Engineering, University of Wollongong, Wollongong, NSW, Australia
W. H. Li


Corresponding author

Correspondence to T. H. Yan.

Editor information

Editors and Affiliations

University of Manchester, Manchester, United Kingdom
Hujun Yin
Nanjing University, Nanjing, China
Yang Gao
Yangzhou University, Yangzhou, Jiangsu, China
Bin Li
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Daoqiang Zhang
Nanjing Normal University, Nanjing, China
Ming Yang
Yangzhou University, Yangzhou, Jiangsu, China
Yun Li
Ostfalia University of Applied Sciences, Wolfenbüttel, Germany
Frank Klawonn
University of Seville, Seville, Spain
Antonio J. Tallón-Ballesteros


Cite this paper

Wu, B., Yan, T.H., Xu, X.S., He, B., Li, W.H. (2016). A MapReduce-Based ELM for Regression in Big Data. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2016. IDEAL 2016. Lecture Notes in Computer Science, vol 9937. Springer, Cham. https://doi.org/10.1007/978-3-319-46257-8_18


DOI: https://doi.org/10.1007/978-3-319-46257-8_18
Published: 13 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46256-1
Online ISBN: 978-3-319-46257-8
eBook Packages: Computer Science, Computer Science (R0)
