Abstract
The k-nearest neighbor (KNN) rule is a simple and effective nonparametric classification algorithm in pattern classification. However, it suffers from several problems such as sensitivity to outliers and inaccurate classification decision rule. Thus, a local mean-based k-nearest neighbor classifier (LMKNN) was proposed to address these problems, which assigns the query sample with a class label based on the closest local mean vector among all classes. It is proven that the LMKNN classifier achieves better classification performance and is more robust to outliers than the classical KNN classifier. Nonetheless, the unreliable nearest neighbor selection rule and single local mean vector strategy in LMKNN classifier severely have negative effect on its classification performance. Considering these problems in LMKNN, we propose a globally adaptive k-nearest neighbor classifier based on local mean optimization, which utilizes the globally adaptive nearest neighbor selection strategy and the implementation of local mean optimization to obtain more convincing and reliable local mean vectors. The corresponding experimental results conducted on twenty real-world datasets demonstrated that the proposed classifier achieves better classification performance and is less sensitive to the neighborhood size \(k\) compared with other improved KNN-based classification methods.




Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Alcalá-Fdez J et al (2011) Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Multiple-Valued Logic Soft Comput 17:255–287
Almalawi A, Fahad A, Tari Z, Cheema M, Khalil I (2016) kNNVWC: an efficient k-nearest neighbors approach based on various-widths clustering. IEEE Trans Knowl Data Eng 28(1):68–81
Andoni A, Indyk P (2006) Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In: Proceedings of 47th annual IEEE symposium on foundations of computer science, pp 459–468
Bailey T, Jain AK, Bailey T (2010) A note on distance-weighted nearest neighbor rules. IEEE Trans Syst Man Cybern 8(4):311–313
Biswas N, Chakraborty S, Mullick SS, Das S (2018) A parameter independent fuzzy weighted k-nearest neighbor classifier. Pattern Recognit Lett 101:80–87
Blake C (1998) UCI repository of machine learning databases. ftp://ftp.ics.uci.edu/pub/machine-learning-databases. Accessed 2017
Cordón I, García S, Fernández A, Herrera F (2018) Imbalance: oversampling algorithms for imbalanced classification in R. Knowl Based Syst 161:329–341
Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Inf Theory 13(1):21–27
Fukunaga K, Narendra PM (1975) A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans Comput 100(7):750–753
García-Pedrajas N, Castillo JAD, Cerruela-García G (2015) A proposal for local k values for k-nearest neighbor rule. IEEE Trans Neural Netw Learn Syst 28(2):470–475
Gou J, Zhang Y, Du L, Xiong T (2012) A local mean-based k-nearest centroid neighbor classifier. Comput J 55(9):1058–1071
Gou J, Zhan Y, Rao Y, Shen X, Wang X, He W (2014) Improved pseudo nearest neighbor classification. Knowl Based Syst 70(C):361–375
Gou J, Ma H, Ou W, Zeng S, Rao Y, Yang H (2019) a generalized mean distance-based k-nearest neighbor classifier. Expert Syst Appl 115:356–372
Jégou H, Douze M, Schmid C (2011) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33(1):117–128
Jiang S, Pang G, Wu M, Kuang L (2012) An improved k-nearest neighbor algorithm for text categorization. Expert Syst Appl 39(1):1503–1509
Keller JM, Gray MR, Givens JJS (1985) A fuzzy k-nearest neighbor algorithm. IEEE Trans Syst Man Cybern 15(4):580–585
Li B, Chen YW, Chen Y (2008) The nearest neighbor algorithm of local probability centers. IEEE Trans Syst Man Cybern B Cybern 38(1):141–154
Lin Y, Li J, Lin M, Chen J (2014) A new nearest neighbor classifier via fusing neighborhood information. Neurocomputing 43(16):164–169
Liu H, Zhang S (2012) Noisy data elimination using mutual k-nearest neighbor for classification mining. J Syst Softw 85(5):1067–1074
Liu ZG, Pan Q, Dezert J (2013) A new belief-based k-nearest neighbor classification method. Pattern Recognit 46(3):834–844
Liu ZG, Pan Q, Dezert J, Mercier G (2014) Fuzzy-belief k-nearest neighbor classifier for uncertain data. In: Proceedings of 17th international conference on information fusion (Fusion 2014), pp 1–8
Lv Q, Josephson W, Wang Z, Charikar M, Li K (2007) Multiprobe lsh: efficient indexing for high-dimensional similarity search. In: Proceedings of 33rd international conference on very large data bases, pp 950–961
Ma HG, Gou JP, Wang XL, Ke J, Zeng SN (2017) Sparse coefficient-based k-nearest neighbor classification. IEEE Access 5:16618–16634
Mateos-García D, García-Gutiérrez J, Riquelme-Santos JC (2016) An evolutionary voting for k nearest neighbors. Expert Syst Appl 43:9–14
Mitani Y, Hamamoto Y (2006) A local mean-based nonparametric classifier. Pattern Recognit Lett 27(10):1151–1159
Ning Q, Zhu J, Zhong Z, Hoi Steven CH (2017) Scalable image retrieval by sparse product quantization. IEEE Trans Multimed 19(3):586–597
Pan Z, Wang Y, Ku W (2017a) A new general nearest neighbor classification based on the mutual neighborhood information. Knowl Based Syst 121:142–152
Pan Z, Wang Y, Ku W (2017b) A new k-harmonic nearest neighbor classifier based on the multi-local means. Expert Syst Appl 67:115–125
Rodger JA (2014) A fuzzy nearest neighbor neural network statistical model for predicting demand for natural gas and energy cost savings in public buildings. Expert Syst Appl 41(4):1813–1829
Sánchez JS, Pla F, Ferri FJ (1998) Improving the k-NCN classification rule through heuristic modifications. Pattern Recognit Lett 19(13):1165–1170
Sarkar M (2007) Fuzzy-rough nearest neighbor algorithms in classification. Fuzzy Sets Syst 158(19):2134–2152
Sproull RF (1991) Refinements to nearest-neighbor searching in k-dimensional trees. Algorithmica 6(1–6):579–589
Wagner T (1970) Convergence of the nearest neighbor rule. IEEE Trans Inf Theory IT 17(5):566–571
Xu Y, Zhu Q, Fan Z, Qiu M, Chen Y, Liu H (2013) Coarse to fine k nearest neighbor classifier. Pattern Recognit Lett 34(9):980–986
Yu Z, Chen H, Liu J, You J, Leung H, Han G (2016) Hybrid K-nearest neighbor classifier. IEEE Trans Cybern 46(6):1263–1275
Zeng Y, Yang Y, Zhao L (2008) Pseudo nearest neighbor rule for pattern classification. Expert Syst Appl 36(2):3587–3595
Zhang N, Yang J, Qian JJ (2012) Component-based global k-nn classifier for small sample size problems. Pattern Recognit Lett 33(13):1689–1694
Zhang S, Li X, Ming Z, Wang R (2018) Efficient knn classification with different numbers of nearest neighbors. IEEE Trans Neural Netw Learn Syst 29(5):1774–1785
Acknowledgements
This work is supported in part by the National Key Laboratory Foundation of China under Grant No. HTKJ2020KL504015, the National Natural Science Foundation of China under Grant No. U1903213, and the Key Science and Technology Program of Shaanxi Province under Grant No. 2020GY-005.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Human and animal rights
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pan, Z., Pan, Y., Wang, Y. et al. A new globally adaptive k-nearest neighbor classifier based on local mean optimization. Soft Comput 25, 2417–2431 (2021). https://doi.org/10.1007/s00500-020-05311-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-020-05311-x