The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination

Kostrzewa, Daniel; Brzeski, Robert

doi:10.1007/978-3-319-67792-7_39

Conference paper
First Online: 20 September 2017

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 659)

Abstract

This article presents the authors' algorithm for reducing the dimensionality of a data set by means of Greedy Backward Feature Elimination. The results of the reduction are verified through classification on two selected data sets, both of which pose multiclass classification problems. In addition to a description of the algorithm, the article gives an example and reports the classification results obtained with a selected classifier before and after dimensionality reduction. It closes with a summary and directions for further work.
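
The abstract names the technique but does not give its steps, so the following is only a minimal sketch of the general idea of greedy backward feature elimination, not the authors' specific procedure. It assumes a leave-one-out 1-nearest-neighbour accuracy as the scoring function, a rule that removes a feature whenever the score does not drop, and a small made-up three-class data set; the function names `loo_1nn_accuracy` and `greedy_backward_elimination` are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Row = Sequence[float]


def loo_1nn_accuracy(X: List[Row], y: List[int], features: List[int]) -> float:
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier
    that only looks at the columns listed in `features` (assumed scorer)."""
    correct = 0
    for i, xi in enumerate(X):
        best_dist, best_label = float("inf"), None
        for j, xj in enumerate(X):
            if i == j:
                continue  # never compare a sample with itself
            d = sum((xi[f] - xj[f]) ** 2 for f in features)
            if d < best_dist:
                best_dist, best_label = d, y[j]
        correct += best_label == y[i]
    return correct / len(X)


def greedy_backward_elimination(
    X: List[Row],
    y: List[int],
    score: Callable[[List[Row], List[int], List[int]], float] = loo_1nn_accuracy,
) -> Tuple[List[int], float]:
    """Start from all features; in each round drop the one feature whose
    removal yields the highest score, and stop once every removal would
    lower the score (ties favour the smaller subset)."""
    selected = list(range(len(X[0])))
    best = score(X, y, selected)
    while len(selected) > 1:
        # Score every subset obtained by removing exactly one feature.
        candidates = [
            (score(X, y, [g for g in selected if g != f]), f) for f in selected
        ]
        cand_score, to_drop = max(candidates, key=lambda c: c[0])
        if cand_score < best:
            break  # any further removal hurts, so keep the current subset
        selected.remove(to_drop)
        best = cand_score
    return selected, best


# Made-up data: column 0 separates the three classes, column 1 is
# large-scale noise that misleads the classifier, column 2 is constant.
X = [
    (0.0, 5.0, 1.0), (0.1, -5.0, 1.0),
    (1.0, 5.0, 1.0), (1.1, -5.0, 1.0),
    (2.0, 5.0, 1.0), (2.1, -5.0, 1.0),
]
y = [0, 0, 1, 1, 2, 2]

baseline = loo_1nn_accuracy(X, y, [0, 1, 2])
selected, best = greedy_backward_elimination(X, y)
print(baseline, selected, best)
```

On this toy data the noisy column dominates the distance when all features are used, so eliminating it and the constant column leaves only column 0. Because each round rescores every remaining candidate, the cost grows quadratically with the number of features, which is the usual trade-off of a greedy wrapper approach.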

Acknowledgements

This work was partly supported by BKM16/RAU2/507 and BK-219/RAU2/2016 grants from the Institute of Informatics, Silesian University of Technology, Poland.

Author information

Authors and Affiliations

Silesian University of Technology, Gliwice, Poland
Daniel Kostrzewa & Robert Brzeski

Authors

Daniel Kostrzewa
Robert Brzeski

Corresponding author

Correspondence to Daniel Kostrzewa.

Editor information

Editors and Affiliations

Institute of Informatics, Silesian University of Technology, Gliwice, Poland
Aleksandra Gruca, Tadeusz Czachórski, Katarzyna Harezlak, Stanisław Kozielski & Agnieszka Piotrowska

About this paper

Cite this paper

Kostrzewa, D., Brzeski, R. (2018). The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination. In: Gruca, A., Czachórski, T., Harezlak, K., Kozielski, S., Piotrowska, A. (eds) Man-Machine Interactions 5. ICMMI 2017. Advances in Intelligent Systems and Computing, vol 659. Springer, Cham. https://doi.org/10.1007/978-3-319-67792-7_39

DOI: https://doi.org/10.1007/978-3-319-67792-7_39
Published: 20 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67791-0
Online ISBN: 978-3-319-67792-7
eBook Packages: Engineering, Engineering (R0)