Combined kNN Classifier for Classification of Incomplete Data

Orczyk, Tomasz; Doroz, Rafal; Porwik, Piotr

doi:10.1007/978-3-030-19738-4_3

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 977)

Included in the following conference series:

International Conference on Computer Recognition Systems

Abstract

A common problem in data classification is incompleteness of the data, and it is not always possible to re-acquire the missing values. An alternative approach is to fill in the missing values using statistical methods. This, however, distorts the original data and may over-fit the classifier to the artificially generated values and, in consequence, lead to an overestimate of the classifier's accuracy in cross-validation tests. In this paper we propose a solution in which, for reference data consisting of both complete and incomplete records, the complete records serve as reference data for a standard classifier, while the whole set serves as reference data for a single-feature subspace classifier.
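The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the standard classifier and the single-feature classifiers are combined, so the sketch assumes simple pooling of neighbour votes with Euclidean distance, and it marks missing values with NaN. The function names `knn_vote` and `combined_knn_predict` are hypothetical.

```python
import numpy as np
from collections import Counter


def knn_vote(ref_X, ref_y, query, k):
    """Count the class labels of the k reference rows nearest to query."""
    if len(ref_X) == 0:
        return Counter()
    dists = np.linalg.norm(ref_X - query, axis=1)
    nearest = np.argsort(dists, kind="stable")[:k]
    return Counter(ref_y[nearest].tolist())


def combined_knn_predict(X, y, query, k=3):
    """Predict a label for query; NaN marks a missing value.

    Assumption (not stated in the abstract): votes from all
    component classifiers are pooled and the majority class wins.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    query = np.asarray(query, dtype=float)
    votes = Counter()

    # Standard kNN: reference set is the complete records only,
    # and it is used only when the query itself is complete.
    complete_rows = ~np.isnan(X).any(axis=1)
    if not np.isnan(query).any():
        votes.update(knn_vote(X[complete_rows], y[complete_rows], query, k))

    # Single-feature subspace kNN: one 1-D classifier per feature,
    # whose reference set is every record that has that feature.
    for j in range(X.shape[1]):
        if np.isnan(query[j]):
            continue
        has_j = ~np.isnan(X[:, j])
        votes.update(knn_vote(X[has_j, j:j + 1], y[has_j], query[j:j + 1], k))

    if not votes:
        return None  # no feature of the query is available
    return votes.most_common(1)[0][0]


nan = float("nan")
X = [[1.0, 1.0], [1.2, nan], [nan, 0.9], [5.0, 5.0], [nan, 5.2], [4.8, nan]]
y = [0, 0, 0, 1, 1, 1]
print(combined_knn_predict(X, y, [1.1, 1.0], k=1))
print(combined_knn_predict(X, y, [nan, 5.1], k=1))
```

In this toy data only two of the six reference records are complete, yet the incomplete ones still contribute through the per-feature classifiers, and a query with a missing value can be classified without imputation. Ties in the pooled vote are broken by the order in which classes were first seen, which is an arbitrary choice made for the sketch.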

Author information

Authors and Affiliations

Computer Systems Department, Institute of Computer Science, University of Silesia, Bedzinska 39, 41-200, Sosnowiec, Poland
Tomasz Orczyk, Rafal Doroz & Piotr Porwik

Corresponding author

Correspondence to Rafal Doroz.

Editor information

Editors and Affiliations

Faculty of Electronics, Wroclaw University of Science and Technology, Wrocław, Poland
Robert Burduk
Faculty of Electronics, Wroclaw University of Science and Technology, Wrocław, Poland
Marek Kurzynski
Faculty of Electronics, Wroclaw University of Science and Technology, Wrocław, Poland
Michał Wozniak

About this paper

Cite this paper

Orczyk, T., Doroz, R., Porwik, P. (2020). Combined kNN Classifier for Classification of Incomplete Data. In: Burduk, R., Kurzynski, M., Wozniak, M. (eds) Progress in Computer Recognition Systems. CORES 2019. Advances in Intelligent Systems and Computing, vol 977. Springer, Cham. https://doi.org/10.1007/978-3-030-19738-4_3

DOI: https://doi.org/10.1007/978-3-030-19738-4_3
Published: 08 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19737-7
Online ISBN: 978-3-030-19738-4
eBook Packages: Intelligent Technologies and Robotics (R0)
