Skip to main content

The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 659))

Abstract

The article presents the author’s algorithm of dimensionality reduction of used data set, realized through Greedy Backward Feature Elimination. Results of the dimensionality reduction are verified in the process of classification for 2 selected data sets. These data sets contain the data for the realization of the multiclass classification. The article presents not only a description of the algorithm but also an example and the results of classification, carried out by selected classifier before and after the process of dimensionality reduction. At the end of article, a summary and the possibility of further work are provided.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Database mining: a performance perspective. IEEE Trans. Knowl. Data Eng. 5(6), 914–925 (1993)

    Article  Google Scholar 

  2. Aha, D., Kibler, D.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991)

    MATH  Google Scholar 

  3. Alpaydin, E., Kaynak, C.: DIGITS data set, UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/Optical+Recognition+of+Handwritten+Digits

  4. Arie, B.D.: Comparison of classification accuracy using Cohen’s weighted Kappa. Expert Syst. Appl. 34(2), 825–832 (2008)

    Article  Google Scholar 

  5. Costa, E., Lorena, A., Carvalho, A., Freitas, A.: A review of performance evaluation measures for hierarchical classifiers. In: AAAI-2007 Workshop, Vancouver, Canada, pp. 182–196 (2007)

    Google Scholar 

  6. Doak, J.: CSE-92-18—An Evaluation of Feature Selection Methods and Their Application to Computer Security. Technical report, UC Davis Dept of Computer Science (1992)

    Google Scholar 

  7. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28(2), 337–407 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  8. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)

    MATH  Google Scholar 

  9. Johnson, B.: URBAN data set, UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/Urban+Land+Cover

  10. Johnson, B.: High resolution urban land cover classification using a competitive multi-scale object-based approach. Remote Sens. Lett. 4(2), 131–140 (2013)

    Article  Google Scholar 

  11. Johnson, B., Xie, Z.: Classifying a high resolution image of an urban area using super-object information. ISPRS J. Photogrammetry Remote Sens. 83, 40–49 (2013)

    Article  Google Scholar 

  12. Josinski, H., Kostrzewa, D., Michalczuk, A., Switonski, A.: The exIWO metaheuristic for solving continuous and discrete optimization problems. Sci. World J. (2014). Article id 831,691

    Google Scholar 

  13. Josinski, H., Switonski, A., Jedrasiak, K., Kostrzewa, D.: Human identification based on gait motion capture data. In: IMECS 2012, pp. 507–510 (2012)

    Google Scholar 

  14. Kostrzewa, D., Josinski, H.: The exIWO metaheuristic—a recapitulation of the research on the join ordering problem. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds.) Beyond Databases, Architectures, and Structures, CCIS, vol. 424, pp. 10–19. Springer, Switzerland (2014)

    Chapter  Google Scholar 

  15. Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. Springer, Heidelberg (1998)

    Book  MATH  Google Scholar 

  16. Machine Learning Group at the University of Waikato: Weka 3. http://www.cs.waikato.ac.nz/~ml/weka/

  17. Mehrabian, A., Lucas, C.: A novel numerical optimization algorithm inspired from weed colonization. Ecol. Inform. 1(4), 355–366 (2006)

    Article  Google Scholar 

  18. Pahlavani, P., Delavar, M., Frank, A.: Using a modified invasive weed optimization algorithm for a personalized urban multi-criteria path optimization problem. Int. J. Appl. Earth Obs. Geoinf. 18, 313–328 (2012)

    Article  Google Scholar 

  19. Powers, D.: Evaluation: from precision, recall and F-score to ROC, informedness, markedness & correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)

    MathSciNet  Google Scholar 

  20. Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation for comparing classifiers. In: ICML 1998, Madison, USA, pp. 445–453 (1998)

    Google Scholar 

  21. Wu, X., Kumar, V., Quinlan, J., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G., Ng, A., Liu, B., Yu, P., Zhou, Z.H., Steinbach, M., Hand, D., Steinberg, D.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14, 1–37 (2008)

    Article  Google Scholar 

  22. Yang, J., Honavar, V.: Feature subset selection using a genetic algorithm. IEEE Intell. Syst. 13, 44–49 (1998)

    Article  Google Scholar 

Download references

Acknowledgements

This work was partly supported by BKM16/RAU2/507 and BK-219/RAU2/2016 grants from the Institute of Informatics, Silesian University of Technology, Poland.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel Kostrzewa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Kostrzewa, D., Brzeski, R. (2018). The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination. In: Gruca, A., Czachórski, T., Harezlak, K., Kozielski, S., Piotrowska, A. (eds) Man-Machine Interactions 5. ICMMI 2017. Advances in Intelligent Systems and Computing, vol 659. Springer, Cham. https://doi.org/10.1007/978-3-319-67792-7_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67792-7_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67791-0

  • Online ISBN: 978-3-319-67792-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics