skip to main content
10.1145/3587828.3587856acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

FEATURE SELECTION METHOD BASED ON GENETIC ALGORITHM WITH WRAPPER-EMBEDDED TECHNIQUE FOR MEDICAL RECORD CLASSIFICATION

Published: 20 June 2023 Publication History

Abstract

Medical records, also known as Electronic Health Records, can be used to analyze medical data which contributes to greater knowledge of information related to disease, disease phase, and early identification of disease. The main problem in medical records is how to determine the features of the selected subset to make predictions on clinical data. The problem of classifying medical record data involves many features. This creates problems in determining which features have a correlation to the predicted results. Feature selection is an appropriate technique for selecting risk factors for disease in medical records. Feature selection requires optimization techniques so that the accuracy value can be increased. Therefore, feature selection using genetic algorithms has the ability to optimize the selected features. The feature selection technique, namely the wrapper, applies a learning algorithm to test the features to be selected. The embedded engineering approach to feature selection makes it possible to apply learning in the selection process. Genetic algorithm-based feature selection method with wrapper-embedded technique is expected to produce an effective feature subset. This technique evaluates features by using combinations in fitness calculations with the aim of assessing the selected features. The development of genetic algorithms by applying fitness value evaluation and elitist aims to maintain the fitness value in the next generation.

References

[1]
R. Ghorbani, R. Ghousi, A. Makui, dan A. Atashi, “A New Hybrid Predictive Model to Predict the Early Mortality Risk in Intensive Care Units on a Highly Imbalanced Dataset,” IEEE Access, vol. 8, hal. 141066–141079, 2020.
[2]
A. P. Pawlovsky dan H. Matsuhashi, “The use of a novel genetic algorithm in component selection for a kNN method for breast cancer prognosis,” Pan Am. Heal. Care Exch. PAHCE, vol. 2017-March, 2017.
[3]
A. Dewan dan M. Sharma, “Prediction of heart disease using a hybrid technique in data mining classification,” 2015 Int. Conf. Comput. Sustain. Glob. Dev. INDIACom 2015, hal. 704–706, 2015.
[4]
A. A. Kalinin dkk, “Deep learning in pharmacogenomics: From gene regulation to patient stratification,” Pharmacogenomics, vol. 19, no. 7. hal. 629–650, 2018.
[5]
T. Guo, L. Han, L. He, dan X. Yang, “A GA-based feature selection and parameter optimization for linear support higher-order tensor machine,” Neurocomputing, vol. 144, hal. 408–416, 2014.
[6]
J. Duan, S. Mao, J. Jin, Z. Zhou, L. Chen, dan C. L. P. Chen, “A Novel GA-Based Optimized Approach for Regional Multimodal Medical Image Fusion with Superpixel Segmentation,” IEEE Access, vol. 9, hal. 96353–96366, 2021.
[7]
M. H. Nguyen, P. Le Nguyen, K. Nguyen, V. A. Le, T. H. Nguyen, dan Y. Ji, “PM2.5 Prediction Using Genetic Algorithm-Based Feature Selection and Encoder-Decoder Model,” IEEE Access, vol. 9, hal. 57338–57350, 2021.
[8]
B. Guan, C. Zhang, dan J. Ning, “Genetic algorithm with a crossover elitist preservation mechanism for protein–ligand docking,” AMB Express, vol. 7, no. 1, 2017.
[9]
K. Liang, W. Dai, dan R. Du, “A Feature Selection Method Based on Improved Genetic Algorithm,” 2020 Glob. Reliab. Progn. Heal. Manag. PHM-Shanghai 2020, 2020.
[10]
V. Majidnezhad, “A novel hybrid of genetic algorithm and ANN for developing a high efficient method for vocal fold pathology diagnosis,” Eurasip J. Audio, Speech, Music Process., vol. 2015, no. 1, hal. 1–11, 2015.
[11]
Z. Sun, G. Bebis, dan R. Miller, “Object detection using feature subset selection,” Pattern Recognit., vol. 37, no. 11, hal. 2165–2176, 2004.
[12]
R. M. Luque, D. Elizondo, E. López-Rubio, dan E. J. Palomo, “GA-based feature selection approach in biometric hand systems,” Proc. Int. Jt. Conf. Neural Networks, hal. 246–253, 2011.
[13]
C. Peng, X. Wu, W. Yuan, X. Zhang, Y. Zhang, dan Y. Li, “MGRFE: Multilayer Recursive Feature Elimination Based on an Embedded Genetic Algorithm for Cancer Classification,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 18, no. 2, hal. 621–632, 2021.
[14]
E. Daniel, “Optimum Wavelet-Based Homomorphic Medical Image Fusion Using Hybrid Genetic-Grey Wolf Optimization Algorithm,” IEEE Sens. J., vol. 18, no. 16, hal. 6804–6811, 2018.
[15]
Z. Huang, C. Yang, X. Zhou, dan T. Huang, “A Hybrid Feature Selection Method Based on Binary State Transition Algorithm and ReliefF,” IEEE J. Biomed. Heal. Informatics, vol. 23, no. 5, hal. 1888–1898, 2019.
[16]
Z. Huang, W. Dong, H. Duan, dan J. Liu, “A Regularized Deep Learning Approach for Clinical Risk Prediction of Acute Coronary Syndrome Using Electronic Health Records,” IEEE Trans. Biomed. Eng., vol. 65, no. 5, hal. 956–968, 2018.
[17]
J. Dhar, “Multistage Ensemble Learning Model with Weighted Voting and Genetic Algorithm Optimization Strategy for Detecting Chronic Obstructive Pulmonary Disease,” IEEE Access, vol. 9, hal. 48640–48657, 2021.
[18]
Z. Huang, W. Dong, H. Duan, dan J. Liu, “A Regularized Deep Learning Approach for Clinical Risk Prediction of Acute Coronary Syndrome Using Electronic Health Records,” IEEE Trans. Biomed. Eng., vol. 65, no. 5, hal. 956–968, 2018.
[19]
S. R. Kamel, R. YaghoubZadeh, dan M. Kheirabadi, “Improving the performance of support-vector machine by selecting the best features by Gray Wolf algorithm to increase the accuracy of diagnosis of breast cancer,” J. Big Data, vol. 6, no. 1, 2019.
[20]
X. Y. Liu, Y. Liang, S. Wang, Z. Y. Yang, dan H. S. Ye, “A Hybrid Genetic Algorithm with Wrapper-Embedded Approaches for Feature Selection,” IEEE Access, vol. 6, hal. 22863–22874, 2018.
[21]
Z. Zhong, X. Yuan, S. Liu, Y. Yang, dan F. Liu, “Machine learning prediction models for prognosis of critically ill patients after open-heart surgery,” Sci. Rep., vol. 11, no. 1, hal. 1–10, 2021.
[22]
X. Zeng, S. Lin, dan C. Liu, “Multi-view Deep Learning Framework for Predicting Patient Expenditure in Healthcare,” 2021.
[23]
T. M. Le, T. M. Vo, T. N. Pham, dan S. V. T. Dao, “A Novel Wrapper-Based Feature Selection for Early Diabetes Prediction Enhanced with a Metaheuristic,” IEEE Access, vol. 9, hal. 7869–7884, 2021.
[24]
A. Rojas-Dominguez, L. C. Padierna, J. M. Carpio Valadez, H. J. Puga-Soberanes, dan H. J. Fraire, “Optimal Hyper-Parameter Tuning of SVM Classifiers with Application to Medical Diagnosis,” IEEE Access, vol. 6, hal. 7164–7176, 2017.
[25]
C. L. Huang dan C. J. Wang, “A GA-based feature selection and parameters optimizationfor support vector machines,” Expert Syst. Appl., vol. 31, no. 2, hal. 231–240, 2006.
[26]
P. B. Jensen, L. J. Jensen, dan S. Brunak, “Mining electronic health records: Towards better research applications and clinical care,” Nat. Rev. Genet., vol. 13, no. 6, hal. 395–405, 2012.
[27]
H. Alharthi, “Healthcare predictive analytics: An overview with a focus on Saudi Arabia,” J. Infect. Public Health, vol. 11, no. 6, hal. 749–756, 2018.
[28]
F. S. Roque dkk, “Using electronic patient records to discover disease correlations and stratify patient cohorts,” PLoS Comput. Biol., vol. 7, no. 8, 2011.
[29]
A. Zheng dan A. Casari, Feature Engineering for Machine Learning. .
[30]
K. J. Max Kuhn, Feature Engineering and Selection: A Practical Approach for Predictive Models. (Chapman & Hall/CRC Data Science Series) 1st Edition, 2019.
[31]
Z. Tao, L. Huiling, W. Wenwen, dan Y. Xia, “GA-SVM based feature selection and parameter optimization in hospitalization expense modeling,” Appl. Soft Comput. J., vol. 75, hal. 323–332, 2019.
[32]
S. B. Sakri, N. B. Abdul Rashid, dan Z. Muhammad Zain, “Particle Swarm Optimization Feature Selection for Breast Cancer Recurrence Prediction,” IEEE Access, vol. 6, hal. 29637–29647, 2018.
[33]
X. Gong, L. Liu, S. Fong, Q. Xu, T. Wen, dan Z. Liu, “Comparative research of swam intelligence clustering algorithms for analyzing medical data,” IEEE Access, vol. 7, hal. 137560–137569, 2019.
[34]
S. J. Lee, Z. Xu, T. Li, dan Y. Yang, “A novel bagging C4.5 algorithm based on wrapper feature selection for supporting wise clinical decision making,” J. Biomed. Inform., vol. 78, hal. 144–155, 2018.
[35]
E. Adua dkk, “Predictive model and feature importance for early detection of type II diabetes mellitus,” Transl. Med. Commun., vol. 6, no. 1, hal. 1–15, 2021.
[36]
M. Zhao, C. Fu, L. Ji, K. Tang, dan M. Zhou, “Feature selection and parameter optimization for support vector machines: A new approach based on genetic algorithm with feature chromosomes,” Expert Syst. Appl., vol. 38, no. 5, hal. 5197–5204, 2011.
[37]
L. Ali, I. Wajahat, N. Amiri Golilarz, F. Keshtkar, dan S. A. C. Bukhari, “LDA–GA–SVM: improved hepatocellular carcinoma prediction through dimensionality reduction and genetically optimized support vector machine,” Neural Comput. Appl., vol. 33, no. 7, hal. 2783–2792, 2021.
[38]
S. Mohan, C. Thirumalai, dan G. Srivastava, “Effective heart disease prediction using hybrid machine learning techniques,” IEEE Access, vol. 7, hal. 81542–81554, 2019.
[39]
M. S. Santos, P. H. Abreu, P. J. García-Laencina, A. Simão, dan A. Carvalho, “A new cluster-based oversampling method for improving survival prediction of hepatocellular carcinoma patients,” J. Biomed. Inform., vol. 58, hal. 49–59, 2015.
[40]
W. Książek, M. Gandor, dan P. Pławiak, “Comparison of various approaches to combine logistic regression with genetic algorithms in survival prediction of hepatocellular carcinoma,” Comput. Biol. Med., vol. 134, 2021.
[41]
W. Książek, F. Turza, dan P. Pławiak, “NCA-GA-SVM: A new two-level feature selection method based on neighborhood component analysis and genetic algorithm in hepatocellular carcinoma fatality prognosis,” Int. j. numer. method. biomed. eng., vol. 38, no. 6, hal. 1–16, 2022.

Index Terms

  1. FEATURE SELECTION METHOD BASED ON GENETIC ALGORITHM WITH WRAPPER-EMBEDDED TECHNIQUE FOR MEDICAL RECORD CLASSIFICATION
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Other conferences
          ICSCA '23: Proceedings of the 2023 12th International Conference on Software and Computer Applications
          February 2023
          385 pages
          ISBN:9781450398589
          DOI:10.1145/3587828
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 20 June 2023

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Electronic Medical Record
          2. Genetic Algorithms
          3. Wrapper-embedded

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          ICSCA 2023

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 56
            Total Downloads
          • Downloads (Last 12 months)19
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 14 Feb 2025

          Other Metrics

          Citations

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media