DOI: 10.1145/2739482.2764662
poster

Avoiding Overfitting in Symbolic Regression Using the First Order Derivative of GP Trees

Published: 11 July 2015

Abstract

Genetic programming (GP) is widely used to construct models for control, classification, regression, and other applications; however, it has shortcomings, one of which is poor generalization. This paper proposes to improve GP generalization by controlling the first-order derivative of GP trees during evolution. To this end, a multi-objective GP is implemented in which the first-order derivative of GP trees serves as one of the objectives. The proposed method is evaluated experimentally on several benchmark problems. The experiments show that it yields compact solutions with reasonable accuracy on training data and better accuracy on test data.
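
The idea of treating the derivative as a second objective can be illustrated with a small sketch. The abstract does not say how the derivative is measured or aggregated, so everything below is an assumption made for illustration: the derivative is estimated by central differences, summarized as the mean absolute slope over the training inputs, and candidates are compared by plain Pareto dominance. The function names and the three toy candidate models are hypothetical and are not taken from the paper.

```python
import math

def training_error(model, xs, ys):
    """Mean squared error of `model` on the training points."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def derivative_magnitude(model, xs, h=1e-4):
    """Mean absolute first derivative, estimated by central differences.

    Assumption: averaging |f'(x)| over the training inputs is only one
    plausible way to turn the derivative into a scalar objective.
    """
    return sum(abs(model(x + h) - model(x - h)) / (2 * h) for x in xs) / len(xs)

def objectives(model, xs, ys):
    """Objective vector (error, derivative magnitude); both are minimized."""
    return (training_error(model, xs, ys), derivative_magnitude(model, xs))

def dominates(a, b):
    """True if objective vector `a` Pareto-dominates `b`."""
    return all(p <= q for p, q in zip(a, b)) and any(p < q for p, q in zip(a, b))

def pareto_front(population, xs, ys):
    """Return the candidates that no other candidate dominates."""
    scored = [(m, objectives(m, xs, ys)) for m in population]
    return [m for m, s in scored if not any(dominates(t, s) for _, t in scored)]

# Toy data: the target is y = x, sampled at five points.
xs = [0.0, 0.25, 0.5, 0.75, 1.0]
ys = list(xs)

def smooth(x):
    """Hypothetical candidate: the true linear relationship."""
    return x

def wiggly(x):
    """Hypothetical candidate: passes (almost exactly) through the training
    points but oscillates strongly between them."""
    return x + 0.5 * math.sin(8 * math.pi * x)

def flat(x):
    """Hypothetical candidate: a constant, with zero slope but larger error."""
    return 0.5

front = pareto_front([smooth, wiggly, flat], xs, ys)
```

In this toy setting `wiggly` fits the training points about as well as `smooth` but has a much larger slope, so it is dominated and dropped from the front, while `flat` survives as a trade-off between error and slope. This is the intuition behind penalizing steep models; the paper's actual objective definition and selection scheme may differ.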

      Published In

      GECCO Companion '15: Proceedings of the Companion Publication of the 2015 Annual Conference on Genetic and Evolutionary Computation
      July 2015
      1568 pages
      ISBN:9781450334884
      DOI:10.1145/2739482
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. derivative
      2. generalization
      3. genetic programming
      4. multi-objective optimization
      5. symbolic regression

      Qualifiers

      • Poster

      Conference

      GECCO '15
      Acceptance Rates

      Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

      Cited By

      • (2023) Shapley Value Based Feature Selection to Improve Generalization of Genetic Programming for High-Dimensional Symbolic Regression. Data Science and Machine Learning, 163-176. DOI: 10.1007/978-981-99-8696-5_12. Online publication date: 5-Dec-2023
      • (2020) Adaptive weighted splines. Proceedings of the 2020 Genetic and Evolutionary Computation Conference, 1003-1011. DOI: 10.1145/3377930.3390244. Online publication date: 25-Jun-2020
      • (2019) Structural Risk Minimization-Driven Genetic Programming for Enhancing Generalization in Symbolic Regression. IEEE Transactions on Evolutionary Computation 23:4, 703-717. DOI: 10.1109/TEVC.2018.2881392. Online publication date: Aug-2019
      • (2019) Improving Generalization of Genetic Programming for Symbolic Regression With Angle-Driven Geometric Semantic Operators. IEEE Transactions on Evolutionary Computation 23:3, 488-502. DOI: 10.1109/TEVC.2018.2869621. Online publication date: Jun-2019
      • (2019) d(Tree)-by-dx: Automatic and Exact Differentiation of Genetic Programming Trees. Hybrid Artificial Intelligent Systems, 133-144. DOI: 10.1007/978-3-030-29859-3_12. Online publication date: 4-Sep-2019
      • (2019) A Discrete Cosine Transform Based Evolutionary Algorithm and Its Application for Symbolic Regression. Intelligent Computing, 444-462. DOI: 10.1007/978-3-030-22871-2_30. Online publication date: 23-Jun-2019
      • (2017) Feature Selection to Improve Generalization of Genetic Programming for High-Dimensional Symbolic Regression. IEEE Transactions on Evolutionary Computation 21:5, 792-806. DOI: 10.1109/TEVC.2017.2683489. Online publication date: 1-Oct-2017
      • (2016) Improving Generalisation of Genetic Programming for Symbolic Regression with Structural Risk Minimisation. Proceedings of the Genetic and Evolutionary Computation Conference 2016, 709-716. DOI: 10.1145/2908812.2908842. Online publication date: 20-Jul-2016
      • (2016) Improving generalisation of genetic programming for high-dimensional symbolic regression with feature selection. 2016 IEEE Congress on Evolutionary Computation (CEC), 3793-3800. DOI: 10.1109/CEC.2016.7744270. Online publication date: Jul-2016
      • (2016) Genetic Programming with Embedded Feature Construction for High-Dimensional Symbolic Regression. Intelligent and Evolutionary Systems, 87-102. DOI: 10.1007/978-3-319-49049-6_7. Online publication date: 9-Nov-2016