Abstract
The scalability of machine learning (ML) algorithms has become increasingly important due to the ever-increasing size of datasets and the growing complexity of the models induced. Standard approaches to this issue generally involve developing parallel and distributed versions of the ML algorithms and/or reducing dataset sizes via sampling techniques. In this paper we describe an alternative approach that combines features of spatially structured evolutionary algorithms (SSEAs) with the well-known machine learning techniques of ensemble learning and boosting. The result is a powerful and robust framework for parallelizing ML methods that requires no changes to the ML methods themselves. We first describe the framework and illustrate its behavior on a simple synthetic problem, and then evaluate its scalability and robustness using several different ML methods on a set of benchmark problems from the UCI Machine Learning Repository.
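To make the general scheme concrete, here is a minimal sketch (in Python with scikit-learn) of a spatially structured ensemble of unmodified base learners. This is an illustration of the idea outlined in the abstract, not the authors' implementation: the grid size, the von Neumann neighborhood, the diffusion of misclassified examples to neighboring cells, and the choice of shallow decision trees as base learners are all assumptions made for this example.

```python
# Illustrative sketch only (NOT the paper's code). Each cell of a 2-D
# toroidal grid trains an off-the-shelf base learner on a local bootstrap
# sample; a boosting-like step passes each cell's misclassified examples
# to its von Neumann neighbors; prediction is a majority vote over cells.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.datasets import make_classification

rng = np.random.default_rng(0)
GRID = 4  # 4x4 grid of learners (assumption)

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

# Bootstrap a local training sample for every grid cell.
cells = {}
for i in range(GRID):
    for j in range(GRID):
        idx = rng.integers(0, len(X), size=len(X) // (GRID * GRID))
        cells[(i, j)] = idx

def neighbors(i, j):
    # von Neumann neighborhood on a torus (assumption)
    return [((i - 1) % GRID, j), ((i + 1) % GRID, j),
            (i, (j - 1) % GRID), (i, (j + 1) % GRID)]

models = {}
for _ in range(3):  # a few diffusion rounds (assumption)
    # Train each cell's (unchanged) base learner on its local sample.
    for cell, idx in cells.items():
        models[cell] = DecisionTreeClassifier(max_depth=3).fit(X[idx], y[idx])
    # Boosting-like step: send each cell's hard examples to its neighbors.
    new_cells = {c: list(idx) for c, idx in cells.items()}
    for (i, j), idx in cells.items():
        hard = idx[models[(i, j)].predict(X[idx]) != y[idx]]
        for nb in neighbors(i, j):
            new_cells[nb].extend(hard)
    cells = {c: np.asarray(ix) for c, ix in new_cells.items()}

def predict(Xq):
    # Ensemble prediction: majority vote over all cells.
    votes = np.stack([m.predict(Xq) for m in models.values()])
    return (votes.mean(axis=0) > 0.5).astype(int)

print("train accuracy:", (predict(X) == y).mean())
```

Because each cell trains an off-the-shelf learner on its own local sample, the per-cell training steps are embarrassingly parallel, which is the property such a framework exploits; only the data flowing between neighboring cells couples the learners.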
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kamath, U., Kaers, J., Shehu, A., De Jong, K.A. (2012). A Spatial EA Framework for Parallelizing Machine Learning Methods. In: Coello, C.A.C., Cutello, V., Deb, K., Forrest, S., Nicosia, G., Pavone, M. (eds) Parallel Problem Solving from Nature - PPSN XII. PPSN 2012. Lecture Notes in Computer Science, vol 7491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32937-1_21
DOI: https://doi.org/10.1007/978-3-642-32937-1_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32936-4
Online ISBN: 978-3-642-32937-1
eBook Packages: Computer Science, Computer Science (R0)