Abstract
Nowadays the amount of data that is collected in various settings is growing rapidly. These elaborate data records enable the training of machine learning models that can be used to extract insights and for making better informed decisions. When doing the data mining task, on one hand, feature selection is often used to reduce the dimensionality of the data. On the other hand, we need to decide the structure (parameters) of the model when building the model. However, feature selection and the parameters of the model may interact and affect the performance of the model. Therefore, it is difficult to decide the optimal parameter and the optimal feature subset without an exhaustive search of all the combination of the parameters and the feature subsets which is time-consuming. In this paper, we study how the interaction between feature selection and the parameters of a model affect the performance of the model through experiments on four data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bezdek, J.C.: Models for pattern recognition. In: Pattern Recognition with Fuzzy Objective Function Algorithms. Advanced Applications in Pattern Recognition, pp. 1–13. Springer, Boston (1981). https://doi.org/10.1007/978-1-4757-0450-1_1
Bolón-Canedo, V., Sánchez-Maroño, N., Alonso-Betanzos, A.: A review of feature selection methods on synthetic data. Knowl. Inf. Syst. 34(3), 483–519 (2013)
Bose, I., Mahapatra, R.K.: Business data mining a machine learning perspective. Inf. Manag. 39(3), 211–225 (2001)
Chen, P., Wilbik, A., van Loon, S., Boer, A.-K., Kaymak, U.: Finding the optimal number of features based on mutual information. In: Kacprzyk, J., Szmidt, E., Zadrożny, S., Atanassov, K.T., Krawczak, M. (eds.) IWIFSGN/EUSFLAT -2017. AISC, vol. 641, pp. 477–486. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-66830-7_43
Chiu, S.L.: Fuzzy model identification based on cluster estimation. J. Intell. Fuzzy Syst. 2(3), 267–278 (1994)
Cisco Visual Networking Index: The Zettabyte Era-trends and Analysis (2013). https://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/vni-hyperconnectivity-wp.pdf
Claesen, M., Simm, J., Popovic, D., Moreau, Y., De Moor, B.: Easy hyperparameter search using optunity. arXiv preprint arXiv:1412.1114 (2014)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3(Mar), 1157–1182 (2003)
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46(1), 389–422 (2002)
Jang, J.S.R., Sun, C.T., Mizutani, E.: Neuro-fuzzy and Soft Computing, a Computational Approach to Learning and Machine Intelligence. Prentice-Hall Inc., Upper Saddle River (1997)
Kawala, F., Douzal-Chouakria, A., Gaussier, E., Dimert, E.: Prédictions d’activité dans les réseaux sociaux en ligne. In: 4ième Conférence sur les Modèles et l’Analyse des Réseaux: Approches Mathématiques et Informatiques, p. 16 (2013)
Keogh, E., Mueen, A.: Curse of dimensionality. In: Liu L., Özsu, M.T. (eds.) Encyclopedia of Machine Learning, pp. 257–258. Springer, Boston (2011). https://doi.org/10.1007/978-0-387-39940-9
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97(1–2), 273–324 (1997)
Lichman, M.: UCI Machine Learning Repository (2013). http://archive.ics.uci.edu/ml
Liu, H., Motoda, H.: Feature Extraction, Construction and Selection: A Data Mining Perspective, vol. 453. Springer, New York (1998). https://doi.org/10.1007/978-1-4615-5725-8
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theor. 28(2), 129–137 (1982)
MejÃa-Lavalle, M., Sucar, E., Arroyo, G.: Feature selection with a perceptron neural net. In: Proceedings of the International Workshop on Feature Selection for Data Mining, pp. 131–135 (2006)
Pehro, D., Stork, D.: Pattern Classification. Wiley, New York (2001)
Takagi, T., Sugeno, M.: Fuzzy identification of systems and its applications to modeling and control. IEEE Trans. Syst. Man Cybern. 1, 116–132 (1985)
Vergara, J.R., Estévez, P.A.: A review of feature selection methods based on mutual information. Neural Comput. Appl. 24(1), 175–186 (2014)
Young, S.R., Rose, D.C., Karnowski, T.P., Lim, S.H., Patton, R.M.: Optimizing deep learning hyper-parameters through an evolutionary algorithm. In: Proceedings of the Workshop on Machine Learning in High-Performance Computing Environments, p. 4. ACM (2015)
Acknowledgement
This work is partially supported by Philips Research within the scope of the BrainBridge Program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Chen, P. et al. (2018). On the Interaction Between Feature Selection and Parameter Determination in Fuzzy Modelling. In: Medina, J., Ojeda-Aciego, M., Verdegay, J., Perfilieva, I., Bouchon-Meunier, B., Yager, R. (eds) Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications. IPMU 2018. Communications in Computer and Information Science, vol 855. Springer, Cham. https://doi.org/10.1007/978-3-319-91479-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-91479-4_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91478-7
Online ISBN: 978-3-319-91479-4
eBook Packages: Computer ScienceComputer Science (R0)