Skip to main content

Unsupervised Feature Selection in High Dimensional Spaces and Uncertainty

  • Conference paper
Hybrid Artificial Intelligence Systems (HAIS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5572))

Included in the following conference series:

Abstract

Developing models and methods to manage data vagueness is a current effervescent research field. Some work has been done with supervised problems but unsupervised problems and uncertainty have still not been studied. In this work, an extension of the Fuzzy Mutual Information Feature Selection algorithm for unsupervised problems is outlined. This proposal is a two stage procedure. Firstly, it makes use of the fuzzy mutual information measure and Battiti’s feature selection algorithm and of a genetic algorithm to analyze the relationships between feature subspaces in a high dimensional space. The second stage uses a simple ad hoc heuristic with the aim to extract the most relevant relationships. It is concluded, given the results from the experiments carried out in this preliminary work, that it is possible to apply frequent pattern mining or similar methods in the second stage to reduce the dimensionality of the data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alcala-Fdez, J., Sanchez, L., Garcia, S., Jesus, M.J.D., Ventura, S., Garrell, J.M., Otero, J., Romero, C., Bacardit, J., Rivas, V.M., Fernandez, J.C., Herrera, F.: KEEL: A Software Tool to Assess Evolutionary Algorithms to Data Mining Problems. Soft Computing 13(3), 307–318 (2009)

    Article  Google Scholar 

  2. Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Transactions on Neural Networks 5(4), 537–550 (1994)

    Article  Google Scholar 

  3. Casillas, J., Cordon, O., Jesus, M.J.D., Herrera, F.: Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems. Information Sciences 136, 135–157 (2001)

    Article  MATH  Google Scholar 

  4. Chow, T.W.S., Wang, P., Ma, E.W.M.: A New Feature Selection Scheme Using a Data Distribution Factor for Unsupervised Nominal Data. IEEE Transactions on Systems, Man and Cybernetics - PART B: Cybernetics 38(2), 499–509 (2008)

    Article  Google Scholar 

  5. Conaire, C.O., Connor, N.E.: Unsupervised feature selection for detection using mutual information thresholding. In: Ninth International Workshop on Image Analysis for Multimedia Interactive Services (2008)

    Google Scholar 

  6. Ho, T.K.: The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(8), 832–844 (1998)

    Article  Google Scholar 

  7. Hong, Y., Kwong, S., Chang, Y., Ren, Q.: Consensus unsupervised feature ranking from multiple views. Pattern Recognition Letters 29(5), 595–602 (2008)

    Article  Google Scholar 

  8. Hu, Q., Yu, D., Xie, Z., Liu, J.: Fuzzy Probabilistic Approximation Spaces and Their Information Measures. IEEE Transactions on Fuzzy Systems 14(2), 191–201 (2006)

    Article  Google Scholar 

  9. Jensen, R., Shen, Q.: Fuzzy-rough sets assisted attribute selection. IEEE Transactions on Fuzzy Systems 1(15), 73–89 (2007)

    Article  Google Scholar 

  10. Marcelloni, F.: Feature selection based on a modified fuzzy c-means algorithm with supervision. Information Sciences 151 (2003)

    Google Scholar 

  11. Mitra, P., Murthy, C.A., Pal, S.K.: Unsupervised Feature Selection using Feature Similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(3), 301–312 (2002)

    Article  Google Scholar 

  12. Roubus, J.A., Setnes, M., Abonyi, J.: Learning fuzzy classification rules from labelled data. Information Sciences 150, 77–93 (2003)

    Article  MathSciNet  Google Scholar 

  13. Sanchez, L., Suarez, M.R., Villar, J.R., Couso, I.: Mutual Information-based Feature Selection and Fuzzy Discretization of Vague Data. International Journal of Aproximate Reasoning (2008), http://dx.doi.org/10.1016/

  14. Sanchez, L., Villar, J.R., Couso, I.: Proceedings of the 12th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems. EUSFLAT, Genetic Feature Selection for Fuzzy Discretized Data (2008)

    Google Scholar 

  15. Sedano, J., Villar, J.R., Corchado, E.S., Curiel, L., Bravo, P.M.: The application of a two-step AI model to an Automated Pneumatic Drilling Process. Accepted to be published in the International Journal of Computer Mathematics (2008)

    Google Scholar 

  16. Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: A review. Applied Soft Computing 9(1), 1–12 (2009)

    Article  Google Scholar 

  17. Uncu, O., Turksen, I.: A novel feature selection approach: Combining feature wrappers and filters. Information Sciences 177, 449–466 (2007)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Villar, J.R., Suárez, M.R., Sedano, J., Mateos, F. (2009). Unsupervised Feature Selection in High Dimensional Spaces and Uncertainty. In: Corchado, E., Wu, X., Oja, E., Herrero, Á., Baruque, B. (eds) Hybrid Artificial Intelligence Systems. HAIS 2009. Lecture Notes in Computer Science(), vol 5572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02319-4_68

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02319-4_68

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02318-7

  • Online ISBN: 978-3-642-02319-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics