Abstract
Regression is one of the most basic problems in machine learning. In the big data era, the extreme learning machine (ELM) can achieve good generalization performance and fast training for regression problems. However, the growing volume of training data makes ELM regression challenging: training may not finish in a reasonable time, or it may run out of memory. Based on an analysis of ELM theory, this paper proposes a MapReduce-based ELM method. Under the MapReduce framework, ELM submodels are trained in parallel on every slave node. A combination method is designed to merge all the submodels into a complete model. The experimental results demonstrate that the MapReduce-based ELM can efficiently process big datasets on commodity hardware and achieves good speedup in a cloud environment where the dataset is stored as data blocks on different machines.
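The abstract does not say how the submodels are combined, so the following is only a minimal sketch of one well-known way to train an ELM over data blocks, not the paper's method. It assumes a sigmoid hidden layer, random hidden weights shared by all blocks, and a combination step that sums the per-block matrices H^T H and H^T T before solving for the output weights. The function names, the `ridge` term, and the use of NumPy are all illustrative assumptions.

```python
import numpy as np

def hidden_output(X, W, b):
    """Sigmoid hidden-layer output H for inputs X of shape (n, d)."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def map_block(X_block, T_block, W, b):
    """Map step (assumed): per-block statistics H^T H and H^T T."""
    H = hidden_output(X_block, W, b)
    return H.T @ H, H.T @ T_block

def reduce_stats(stats):
    """Reduce step (assumed): sum the statistics from all blocks."""
    U = sum(s[0] for s in stats)
    V = sum(s[1] for s in stats)
    return U, V

def solve_beta(U, V, ridge=1e-3):
    """Output weights from the summed statistics.

    The small ridge term is an assumption added for numerical stability.
    """
    return np.linalg.solve(U + ridge * np.eye(U.shape[0]), V)

def train_elm_blocks(blocks, n_hidden, seed=0):
    """Train one ELM from a list of (X, T) blocks.

    The loop over blocks stands in for parallel map tasks on slave nodes.
    """
    rng = np.random.default_rng(seed)
    d = blocks[0][0].shape[1]
    W = rng.normal(size=(d, n_hidden))  # random input weights, never trained
    b = rng.normal(size=n_hidden)       # random hidden biases
    stats = [map_block(X, T, W, b) for X, T in blocks]
    U, V = reduce_stats(stats)
    return W, b, solve_beta(U, V)

def predict(X, W, b, beta):
    """Regression output of the trained ELM."""
    return hidden_output(X, W, b) @ beta
```

Because each block contributes only two small matrices whose size depends on the number of hidden nodes, the reduce step in this sketch never has to hold the full dataset in memory. Calling `train_elm_blocks` on a dataset split into several blocks should give the same model, up to floating-point rounding, as calling it on the unsplit dataset.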
Acknowledgment
This work is partially supported by the Natural Science Foundation of China and the Key Research and Development Program of China (51379198, 2016YFC0301404, 41176076, 31202036).
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Wu, B., Yan, T.H., Xu, X.S., He, B., Li, W.H. (2016). A MapReduce-Based ELM for Regression in Big Data. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2016. IDEAL 2016. Lecture Notes in Computer Science, vol 9937. Springer, Cham. https://doi.org/10.1007/978-3-319-46257-8_18
DOI: https://doi.org/10.1007/978-3-319-46257-8_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46256-1
Online ISBN: 978-3-319-46257-8
eBook Packages: Computer Science, Computer Science (R0)