Skip to main content

Importance-Based Pruning for Genetic Programming Based Symbolic Regression

  • Conference paper
  • First Online:
AI 2024: Advances in Artificial Intelligence (AI 2024)

Abstract

Pruning is a major research field in neural networks, enhancing their efficiency and generalization. The field of pruning approaches in genetic programming (GP) is continually evolving, with researchers actively exploring new techniques and approaches to optimise the performance of GP models. This research introduces a novel pruning algorithm for Genetic Programming-based Symbolic Regression (GPSR). The proposed method employs a weighting mechanism to identify and filter out unimportant subtrees in each generation. To achieve this, the method arranges all subtrees linearly and assigns weights to each subtree and terminal. It then uses Ordinary Least Squares (OLS) to optimize these weights, enabling the identification of unimportant subtrees and terminals for effective pruning. The algorithm’s effectiveness was evaluated using ten regression datasets, including high-dimensional and complex feature sets. Furthermore, comparisons were made with two other algorithms to evaluate its performance. The results indicate that the proposed approach not only achieves better learning and generalisation performance but also generates smaller trees compared to standard GP, thereby improving interpretability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. EUR-Lex - 32016R0679 - EN - EUR-Lex, https://eur-lex.europa.eu/eli/reg/2016/679/oj, doc ID: 32016R0679 Doc Sector: 3 Doc Title: Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation) (Text with EEA relevance) Doc Type: R Usr_lan: en

  2. scikit-learn: machine learning in Python — scikit-learn 1.4.0 documentation. https://scikit-learn.org/stable/

  3. Tech Note ECS Grid - Support | ECS | Victoria University of Wellington. https://ecs.wgtn.ac.nz/Support/TechNoteEcsGrid

  4. Al-Helali, B., Chen, Q., Xue, B., Zhang, M.: Genetic programming for feature selection based on feature removal impact in high-dimensional symbolic regression. IEEE Trans. Emerging Top. Comput. Intell., 1–14 (2024)

    Google Scholar 

  5. Alfaro-Cid, E., Esparcia-Alcázar, A., Sharman, K., Vega, F.F.D.: Prune and Plant: a new bloat control method for genetic programming. In: 2008 Eighth International Conference on Hybrid Intelligent Systems, pp. 31–35 (2008)

    Google Scholar 

  6. Castelli, M., Gonçalves, I., Manzoni, L., Vanneschi, L.: Pruning Techniques for Mixed Ensembles of Genetic Programming Models (2018)

    Google Scholar 

  7. Chen, Q., Xue, B., Browne, W., Zhang, M.: Evolutionary regression and modelling. In: Banzhaf, W., Machado, P., Zhang, M. (eds.) Handbook of Evolutionary Machine Learning, pp. 121–149. Springer Nature Singapore, Singapore (2024). https://doi.org/10.1007/978-981-99-3814-8_5

    Chapter  Google Scholar 

  8. Kinzett, D., Johnston, M., Zhang, M.: Numerical simplification for bloat control and analysis of building blocks in genetic programming. Evol. Intell. 2(4), 151–168 (2009)

    Article  Google Scholar 

  9. Koza, J.R.: Genetic programming as a means for programming computers by natural selection. Stat. Comput. 4, 87–112 (1994)

    Article  Google Scholar 

  10. Luke, S., Panait, L.: A comparison of bloat control methods for genetic programming. Evol. Comput. 14(3), 309–344 (2006)

    Article  Google Scholar 

  11. Planinić, L., Đurasević, M., Picek, S., Jakobovic, D.: Building the building blocks: from simplification to winning trees in genetic programming (2022)

    Google Scholar 

  12. Poli, R., Langdon, W., Mcphee, N.: A Field Guide to Genetic Programming (2008)

    Google Scholar 

  13. Rimas, M., Chen, Q., Zhang, M.: Bloating reduction in symbolic regression through function frequency-based tree substitution in genetic programming. In: Liu, T., Webb, G., Yue, L., Wang, D. (eds.) AI 2023: Advances in Artificial Intelligence: 36th Australasian Joint Conference on Artificial Intelligence, AI 2023, Brisbane, QLD, Australia, November 28–December 1, 2023, Proceedings, Part II, pp. 429–440. Springer Nature Singapore, Singapore (2024). https://doi.org/10.1007/978-981-99-8391-9_34

    Chapter  Google Scholar 

  14. Rockett, P.: Pruning of genetic programming trees using permutation tests. Evol. Intel. 13(4), 649–661 (2020)

    Article  Google Scholar 

  15. Silva, S., Costa, E.: Dynamic limits for bloat control in genetic programming and a review of past and current bloat theories. Genetic Programm. Evol. Mach. 10(2), 141–179 (2009)

    Google Scholar 

  16. Silva, S., Dignum, S., Vanneschi, L.: Operator equalisation for bloat free genetic programming and a survey of bloat control methods. Genet. Program Evolvable Mach. 13, 197–238 (2011)

    Article  Google Scholar 

  17. Uy, N.Q., Chu, T.H.: Semantic approximation for reducing code bloat in genetic programming. Swarm Evol. Comput. 58, 100729 (2020)

    Article  Google Scholar 

  18. Zhang, H., Chen, Q., Xue, B., Banzhaf, W., Zhang, M.: Modular multi-tree genetic programming for evolutionary feature construction for regression. IEEE Trans. Evol. Comput., 1 (2023)

    Google Scholar 

Download references

Acknowledgement

This work is supported in part by the Marsden Fund of New Zealand Government under Contract MFP-VUW2016 and MFP-VUW1913.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamad Rimas .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rimas, M., Chen, Q., Zhang, M. (2025). Importance-Based Pruning for Genetic Programming Based Symbolic Regression. In: Gong, M., Song, Y., Koh, Y.S., Xiang, W., Wang, D. (eds) AI 2024: Advances in Artificial Intelligence. AI 2024. Lecture Notes in Computer Science(), vol 15443. Springer, Singapore. https://doi.org/10.1007/978-981-96-0351-0_14

Download citation

  • DOI: https://doi.org/10.1007/978-981-96-0351-0_14

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-96-0350-3

  • Online ISBN: 978-981-96-0351-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics