skip to main content
10.1145/3638530.3654288acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Multi-Criterion Feature Selection Based on Clustering Symbolic Regression

Published: 01 August 2024 Publication History

Abstract

Feature selection is a highly effective technique for processing high-dimensional datasets. However, existing methods often prioritize the accuracy of constructed models as the main criterion, leading incomplete and unstable outcomes. In this study, the multi-criterion feature selection algorithm based on clustering symbolic regression (FSSR) is proposed to tackle the issue of simplistic criteria in feature selection. The involvement and sensitivity of features are considered as new criteria of feature selection in FSSR. These criteria can be calculated based on the occurrence frequency and partial derivatives in the expressions. Moreover, Pareto non-dominated sorting is incorporated to fully account for both criteria. Furthermore, a clustering symbolic regression algorithm is proposed to preserve local information during the process. By clustering the training set into subsets, the generalization of expressions is improved to enhance the accuracy of feature selection. When applied to the creep life dataset of high-temperature nickel-based alloys, a new FSSR algorithm with domain-specific prior knowledge is proposed for producing more accurate and interpretable results in feature selection

References

[1]
Q. Chen, B. Xue and M. Zhang. 2022. Genetic Programming for Instance Transfer Learning in Symbolic Regression. IEEE Transactions on Cybernetics 52
[2]
M. Mafarja and S. Mirjalili. 2018. Whale optimization approaches for wrapper feature selection. Applied Soft Computing 62
[3]
K. L. Chiew, C. L. Tan, K. Wong, K. S. C. Yong and W. K. Tiong. 2019. A new hybrid ensemble feature selection framework for machine learning-based phishing detection system. Information Sciences 484
[4]
Q. Liu, T. Odaka, J. Kuroiwa and H. Ogura. 2013. Application of an Artificial Fish Swarm Algorithm in Symbolic Regression. IEICE Transactions on Information and Systems E96.D
[5]
Q. Chen, B. Xue and M. Zhang. 2022. Rademacher Complexity for Enhancing the Generalization of Genetic Programming for Symbolic Regression. IEEE Transactions on Cybernetics 52
[6]
K. Peng, V. C. M. Leung and Q. Huang. 2018. Clustering Approach Based on Mini Batch Kmeans for Intrusion Detection System Over Big Data. IEEE Access 6
[7]
B. Tran, B. Xue and M. Zhang. 2018. A New Representation in PSO for Discretization-Based Feature Selection. IEEE Transactions on Cybernetics 48
[8]
M. Ghosh, R. Guha, R. Sarkar and A. Abraham. 2020. A wrapper-filter feature selection technique based on ant colony optimization. Neural Computing and Applications 32
[9]
R. Guha, M. Ghosh, S. Kapri, S. Shaw, S. Mutsuddi, V. Bhateja and R. Sarkar. 2021. Deluge based Genetic Algorithm for feature selection. Evolutionary Intelligence 14
[10]
Y. Shen, C. Wu, C. Liu, Y. Wu and N. Xiong. 2018. Oriented Feature Selection SVM Applied to Cancer Prediction in Precision Medicine. IEEE Access 6

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '24 Companion: Proceedings of the Genetic and Evolutionary Computation Conference Companion
July 2024
2187 pages
ISBN:9798400704956
DOI:10.1145/3638530
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 2024

Check for updates

Author Tags

  1. feature selection
  2. symbolic regression
  3. clustering

Qualifiers

  • Poster

Funding Sources

Conference

GECCO '24 Companion
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 38
    Total Downloads
  • Downloads (Last 12 months)38
  • Downloads (Last 6 weeks)3
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media