Abstract
In this paper the modified version of cGAAM (a genetic algorithm for feature selection for clustering) is introduced. As it can be shown, the algorithm is able to find significant subsets of features in data sets that differ in size and number of classes. The common feature of the sets that were used to test the cGAAM is that the examples are provided with class labels. Due to this, although the clustering process was performed without the class labels, the chosen feature sets could be compared with feature subsets returned by Lasso method in terms of classification accuracy. The most important observation from the results presented in the paper is that the classification accuracy obtained with feature subsets returned by cGAAM was not only comparable with accuracy obtained with feature subsets returned by Lasso but almost always was higher than 80% (ionsphere dataset) and 90% (humanactivity dataset).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal R, Gehrke J, Gunopulos D, Raghavan P (1998) Automatic subspace clustering of high dimensional data for data mining applications, vol 27, no 2. ACM, pp 94–105
Burduk R (2012) Recognition task with feature selection and weighted majority voting based on interval-valued fuzzy sets. In: Computational collective intelligence, technologies and applications. Springer, Berlin, pp 204–209
Dy JG, Brodley CE (2000) Feature subset selection and order identification for unsupervised learning. In: ICML
Koprinska I (2010) Feature selection for brain–computer interfaces. In: New frontiers in applied data mining, pp 106–117
Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24(3):301–312
Rejer I (2013) Genetic algorithms in EEG feature selection for the classification of movements of the left and right hand. In: Proceedings of the 8th international conference on computer recognition systems CORES 2013. Springer, Heidelberg, pp 579–589
Rejer I (2015) Genetic algorithm with aggressive mutation for feature selection in BCI feature space. Pattern Anal Appl 18(3):485–492
Rejer I (2015) Genetic algorithms for feature selection for brain-computer interface. Int J Pattern Recogn Artif Intell 29(5):1559008 (World Scientific Publishing Company)
Rejer I, Twardochleb M (2018) Gamers’ involvement detection from fig EEG data with cGAAM-A method for feature selection for clustering. Expert Syst Appl 101:196–204
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc Ser B 58(1):267–288
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Rejer, I. (2020). cGAAM – An Algorithm for Simultaneous Feature Selection and Clustering. In: Burduk, R., Kurzynski, M., Wozniak, M. (eds) Progress in Computer Recognition Systems. CORES 2019. Advances in Intelligent Systems and Computing, vol 977. Springer, Cham. https://doi.org/10.1007/978-3-030-19738-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-19738-4_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19737-7
Online ISBN: 978-3-030-19738-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)