DOI: 10.1145/3587828.3587854
research-article

Improved Classification Accuracy by Feature Selection using Adaptive Support Method

Published: 20 June 2023

Abstract

The current explosion of data must be harnessed to support decision making in business and other domains. Data, which have become an asset, need to be analyzed and mined to find valuable information. The results of data analysis can be used to make predictions, one form of which is classification. High-dimensional data require a preprocessing stage so that model building is not overly complex and the analysis remains accurate. One preprocessing stage that needs attention is feature selection, which reduces the number of features without diminishing the accuracy of, or the information in, the data. Feature selection can also be performed using association rules, which treat the association relationships between items and the frequency of item occurrence as features. However, an obstacle in applying association rules is determining the minimum support value. Therefore, an adaptive support method is proposed to determine the minimum support value automatically based on the characteristics of the dataset. This study proposes a feature selection method using adaptive support. In experiments with three classifiers, the feature selection method using adaptive support achieved higher accuracy and F1-score values than the information gain method.
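
To make the idea of a data-driven minimum support concrete, the following is a minimal sketch only. The abstract does not state the adaptive support formula, so the threshold rule used here (the mean of all item supports) is an assumption chosen purely for illustration and is not the authors' method. The function names and the toy transactions are likewise hypothetical.

```python
from collections import Counter


def item_supports(transactions):
    """Support of each item: the fraction of transactions that contain it."""
    n = len(transactions)
    counts = Counter(item for t in transactions for item in set(t))
    return {item: count / n for item, count in counts.items()}


def adaptive_min_support(supports):
    """ASSUMPTION: use the mean item support as the threshold.

    The paper derives its threshold from dataset characteristics; the exact
    formula is not given in the abstract, so this rule is a stand-in.
    """
    return sum(supports.values()) / len(supports)


def select_features(transactions):
    """Keep the items whose support reaches the data-derived threshold."""
    supports = item_supports(transactions)
    threshold = adaptive_min_support(supports)
    selected = sorted(item for item, s in supports.items() if s >= threshold)
    return selected, threshold


# Toy data (hypothetical): each transaction is the set of items present in a record.
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "d"},
    {"a", "b", "c"},
]

selected, threshold = select_features(transactions)
print(selected, threshold)
```

In this sketch no threshold is supplied by hand: it is computed from the same support values it is then used to filter, which is the general property the abstract attributes to adaptive support.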

Published In

ICSCA '23: Proceedings of the 2023 12th International Conference on Software and Computer Applications
February 2023
385 pages
ISBN: 9781450398589
DOI: 10.1145/3587828

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

1. Accuracy
2. Adaptive Support
3. Classification
4. Feature Selection

Qualifiers

• Research-article
• Research
• Refereed limited

Funding Sources

• LPDP (Indonesia Endowment Fund for Education), Ministry of Finance, Republic Indonesia

Conference

ICSCA 2023
