Controlling the parallel layer perceptron complexity using a multiobjective learning algorithm

Vieira, D. A. G.; Vasconcelos, J. A.; Caminhas, W. M.

doi:10.1007/s00521-006-0052-z

Original Article
Published: 22 April 2006

Neural Computing and Applications, volume 16, pages 317–325 (2007)

D. A. G. Vieira¹,
J. A. Vasconcelos¹ &
W. M. Caminhas¹

Abstract

This paper addresses complexity control of the parallel layer perceptron (PLP), that is, the bias and variance dilemma, using a multiobjective (MOBJ) training algorithm. To control bias and variance, the training process is rewritten as a bi-objective problem that minimizes both the training error and the norm of the weight vector, which is a measure of network complexity. The method is applied to regression and classification problems and compared with several other training procedures and topologies. The results show that the PLP MOBJ training algorithm generalizes well, outperforming traditional methods in the tested examples.
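
The bi-objective idea in the abstract (trade training error against weight norm) can be illustrated with a minimal sketch. This is not the authors' PLP MOBJ algorithm: it assumes a one-weight linear model, a weighted-sum scalarization with a hypothetical trade-off parameter `lam`, and made-up toy data, and it picks one candidate by validation error purely for illustration. All function and variable names are assumptions.

```python
# Illustrative sketch only: a one-weight linear model y ~ w * x, NOT a PLP.
# Objective 1: training squared error. Objective 2: squared weight norm.

def fit_weight(xs, ys, lam):
    """Minimise sum((y - w*x)^2) + lam * w^2 in closed form."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)

def squared_error(xs, ys, w):
    """Sum of squared residuals of the model y = w * x."""
    return sum((y - w * x) ** 2 for x, y in zip(xs, ys))

def pareto_candidates(xs, ys, lams):
    """Return (lam, w, training_error, squared_norm) per trade-off value."""
    out = []
    for lam in lams:
        w = fit_weight(xs, ys, lam)
        out.append((lam, w, squared_error(xs, ys, w), w * w))
    return out

def select_by_validation(cands, x_val, y_val):
    """Pick the candidate with the lowest validation error (an assumption)."""
    return min(cands, key=lambda c: squared_error(x_val, y_val, c[1]))

# Hypothetical toy data: noisy training targets, noise-free validation targets.
x_train = [1.0, 2.0, 3.0, 4.0]
y_train = [1.2, 1.9, 3.2, 3.9]
x_val = [1.5, 2.5, 3.5]
y_val = [1.5, 2.5, 3.5]

candidates = pareto_candidates(x_train, y_train, [0.0, 0.1, 1.0, 10.0])
best = select_by_validation(candidates, x_val, y_val)

for lam, w, err, norm in candidates:
    print(f"lam={lam:5.1f}  w={w:.4f}  train_err={err:.4f}  norm={norm:.4f}")
print("selected lam:", best[0])
```

As `lam` grows, the training error rises while the weight norm shrinks, so each value of `lam` yields a different compromise between the two objectives; a separate criterion is then needed to choose among them.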

Acknowledgments

This work was supported by CNPq (grants no. 350902/1997-6, no. 140009/2004-3), CAPES-COFECUB Project Cooperation no. 318/00-II and CAPES (grant no. 3421/04-0), Brazil.

Author information

Authors and Affiliations

Department of Electrical Engineering, Federal University of Minas Gerais, Campus da UFMG (Pampulha), CEP 30.270-010, Belo Horizonte, MG, Brazil
D. A. G. Vieira, J. A. Vasconcelos & W. M. Caminhas

Corresponding author

Correspondence to W. M. Caminhas.

About this article

Cite this article

Vieira, D.A.G., Vasconcelos, J.A. & Caminhas, W.M. Controlling the parallel layer perceptron complexity using a multiobjective learning algorithm. Neural Comput & Applic 16, 317–325 (2007). https://doi.org/10.1007/s00521-006-0052-z

Received: 17 August 2005
Accepted: 17 February 2006
Published: 22 April 2006
Issue Date: May 2007
DOI: https://doi.org/10.1007/s00521-006-0052-z
