Skip to main content

A Feature Selection Approach in Problems with a Great Number of Features

  • Conference paper
Computer Recognition Systems 2

Part of the book series: Advances in Soft Computing ((AINSC,volume 45))

  • 758 Accesses

Abstract

A misclassification rate is most often used as a feature selection criterion. However, in the cases, when the numerical force of the training set is not sufficiently large in relation to the number of features, the risk of choosing the noisy features is very high. It produces difference between error estimations derived on the basis of the training and testing sets, so the error rate estimation can not be sufficiently confident. Feature preselection based on analysis of dependences between features is recommended in such types of tasks. An advantage of this approach is shown in the paper. As a feature selection criterion the Pearson chi-square statistics has been used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Duda RO, Hart PE (2000) Pattern Classification (2nd ed.), Wiley

    Google Scholar 

  2. Chernoff H, Lehmann EL (1954) The use of maximum likelihood estimates in Χ2 tests for goodness-of-fit, The Annals of Mathematical Statistics

    Google Scholar 

  3. Rosner BA (2005) Fundamentals of Biostatistics, 6th ed., Duxbury Press

    Google Scholar 

  4. Tadeusiewicz R, Izworski A, Majewski J (1993) Biometria, Wyd. AGH

    Google Scholar 

  5. Jozwik A, Chmielewski L, Sklodowski M, Cudny W (1998) A parallel net of (1-NN, k-NN) classifier for optical inspection of surface defects in ferrites, Machine Graphics & Vision, vol. 7, no. 1–2, pp. 99–112

    Google Scholar 

  6. Newman DJ, Hettich S, Blake CL, Merz CJ (1998) UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html], Irvine, CA: University of California, Department of Information and Computer Science

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kosla, P. (2007). A Feature Selection Approach in Problems with a Great Number of Features. In: Kurzynski, M., Puchala, E., Wozniak, M., Zolnierek, A. (eds) Computer Recognition Systems 2. Advances in Soft Computing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75175-5_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75175-5_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75174-8

  • Online ISBN: 978-3-540-75175-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics