Abstract
Developing models and methods to manage data vagueness is a current effervescent research field. Some work has been done with supervised problems but unsupervised problems and uncertainty have still not been studied. In this work, an extension of the Fuzzy Mutual Information Feature Selection algorithm for unsupervised problems is outlined. This proposal is a two stage procedure. Firstly, it makes use of the fuzzy mutual information measure and Battiti’s feature selection algorithm and of a genetic algorithm to analyze the relationships between feature subspaces in a high dimensional space. The second stage uses a simple ad hoc heuristic with the aim to extract the most relevant relationships. It is concluded, given the results from the experiments carried out in this preliminary work, that it is possible to apply frequent pattern mining or similar methods in the second stage to reduce the dimensionality of the data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alcala-Fdez, J., Sanchez, L., Garcia, S., Jesus, M.J.D., Ventura, S., Garrell, J.M., Otero, J., Romero, C., Bacardit, J., Rivas, V.M., Fernandez, J.C., Herrera, F.: KEEL: A Software Tool to Assess Evolutionary Algorithms to Data Mining Problems. Soft Computing 13(3), 307–318 (2009)
Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Transactions on Neural Networks 5(4), 537–550 (1994)
Casillas, J., Cordon, O., Jesus, M.J.D., Herrera, F.: Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems. Information Sciences 136, 135–157 (2001)
Chow, T.W.S., Wang, P., Ma, E.W.M.: A New Feature Selection Scheme Using a Data Distribution Factor for Unsupervised Nominal Data. IEEE Transactions on Systems, Man and Cybernetics - PART B: Cybernetics 38(2), 499–509 (2008)
Conaire, C.O., Connor, N.E.: Unsupervised feature selection for detection using mutual information thresholding. In: Ninth International Workshop on Image Analysis for Multimedia Interactive Services (2008)
Ho, T.K.: The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(8), 832–844 (1998)
Hong, Y., Kwong, S., Chang, Y., Ren, Q.: Consensus unsupervised feature ranking from multiple views. Pattern Recognition Letters 29(5), 595–602 (2008)
Hu, Q., Yu, D., Xie, Z., Liu, J.: Fuzzy Probabilistic Approximation Spaces and Their Information Measures. IEEE Transactions on Fuzzy Systems 14(2), 191–201 (2006)
Jensen, R., Shen, Q.: Fuzzy-rough sets assisted attribute selection. IEEE Transactions on Fuzzy Systems 1(15), 73–89 (2007)
Marcelloni, F.: Feature selection based on a modified fuzzy c-means algorithm with supervision. Information Sciences 151 (2003)
Mitra, P., Murthy, C.A., Pal, S.K.: Unsupervised Feature Selection using Feature Similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(3), 301–312 (2002)
Roubus, J.A., Setnes, M., Abonyi, J.: Learning fuzzy classification rules from labelled data. Information Sciences 150, 77–93 (2003)
Sanchez, L., Suarez, M.R., Villar, J.R., Couso, I.: Mutual Information-based Feature Selection and Fuzzy Discretization of Vague Data. International Journal of Aproximate Reasoning (2008), http://dx.doi.org/10.1016/
Sanchez, L., Villar, J.R., Couso, I.: Proceedings of the 12th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems. EUSFLAT, Genetic Feature Selection for Fuzzy Discretized Data (2008)
Sedano, J., Villar, J.R., Corchado, E.S., Curiel, L., Bravo, P.M.: The application of a two-step AI model to an Automated Pneumatic Drilling Process. Accepted to be published in the International Journal of Computer Mathematics (2008)
Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: A review. Applied Soft Computing 9(1), 1–12 (2009)
Uncu, O., Turksen, I.: A novel feature selection approach: Combining feature wrappers and filters. Information Sciences 177, 449–466 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Villar, J.R., Suárez, M.R., Sedano, J., Mateos, F. (2009). Unsupervised Feature Selection in High Dimensional Spaces and Uncertainty. In: Corchado, E., Wu, X., Oja, E., Herrero, Á., Baruque, B. (eds) Hybrid Artificial Intelligence Systems. HAIS 2009. Lecture Notes in Computer Science(), vol 5572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02319-4_68
Download citation
DOI: https://doi.org/10.1007/978-3-642-02319-4_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02318-7
Online ISBN: 978-3-642-02319-4
eBook Packages: Computer ScienceComputer Science (R0)