Abstract
It is well known that in order to build a strong ensemble, the component learners should have high diversity as well as high accuracy. If perturbing the training set can cause significant changes in the component learners constructed, then Bagging can effectively improve accuracy. However, for stable learners such as nearest neighbor classifiers, perturbing the training set can hardly produce diverse component learners, and therefore Bagging does not work well. This paper adapts Bagging to nearest neighbor classifiers by injecting randomness into the distance metrics. In constructing the component learners, both the training set and the distance metric employed for identifying the neighbors are perturbed. A large-scale empirical study reported in this paper shows that the proposed BagInRand algorithm can effectively improve the accuracy of nearest neighbor classifiers.
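The abstract describes the core idea only at a high level: each component nearest neighbor learner is trained on a bootstrap sample and uses a randomly perturbed distance metric, and the components are combined by voting. The following minimal sketch illustrates one way this could look; the class name BagInRandSketch and the particular perturbation used (random feature weights in a weighted Euclidean distance) are illustrative assumptions, not the paper's exact scheme.

```python
# Illustrative sketch of the idea in the abstract: bootstrap sampling plus a
# randomly perturbed distance metric per component, combined by majority vote.
# The weighted-Euclidean perturbation below is an assumption for illustration.

import numpy as np


class BagInRandSketch:
    def __init__(self, n_components=10, k=1, rng=None):
        self.n_components = n_components   # number of component k-NN learners
        self.k = k                         # neighbors consulted per component
        self.rng = np.random.default_rng(rng)
        self.components_ = []              # list of (X_boot, y_boot, metric weights)

    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y)
        n, d = X.shape
        for _ in range(self.n_components):
            idx = self.rng.integers(0, n, size=n)      # bootstrap sample of the training set
            w = self.rng.uniform(0.0, 1.0, size=d)     # randomness injected into the metric
            self.components_.append((X[idx], y[idx], w))
        return self

    def _predict_one(self, Xb, yb, w, x):
        # weighted Euclidean distance using this component's random weights
        dist = np.sqrt(((Xb - x) ** 2 * w).sum(axis=1))
        nearest = np.argsort(dist)[: self.k]
        labels, counts = np.unique(yb[nearest], return_counts=True)
        return labels[np.argmax(counts)]

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        preds = []
        for x in X:
            votes = [self._predict_one(Xb, yb, w, x)
                     for Xb, yb, w in self.components_]
            labels, counts = np.unique(votes, return_counts=True)
            preds.append(labels[np.argmax(counts)])    # majority vote over components
        return np.array(preds)
```

Because a plain nearest neighbor classifier is stable under bootstrap sampling alone, the per-component metric randomization is what supplies the diversity that the ensemble relies on.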
Author information
Additional information
Supported by the National Outstanding Youth Foundation of China under Grant No.60325207, the Fok Ying Tung Education Foundation under Grant No.91067, and the Excellent Young Teachers Program of MOE of China.
Zhi-Hua Zhou received the B.Sc., M.Sc., and Ph.D. degrees in computer science from Nanjing University, China, in 1996, 1998, and 2000, respectively, all with the highest honor. He joined the Department of Computer Science & Technology of Nanjing University as a lecturer in 2001, and is currently a professor and leader of the LAMDA Group. His research interests include machine learning, data mining, pattern recognition, information retrieval, neural computing, and evolutionary computing. In these areas he has published over 40 technical papers in refereed international journals and conference proceedings. He has won the Microsoft Fellowship Award (1999), the National Excellent Doctoral Dissertation Award of China (2003), and the award of the National Outstanding Youth Foundation of China (2004). He is on the editorial boards of Artificial Intelligence in Medicine (Elsevier), Knowledge and Information Systems (Springer), and the International Journal of Data Warehousing and Mining (Idea Group). He served as the organising chair of the 7th Chinese Workshop on Machine Learning (2000), program co-chair of the 9th Chinese Conference on Machine Learning (2004), and program committee member for numerous international conferences. He is the vice chair of the Artificial Intelligence & Pattern Recognition Society of the China Computer Federation, a councilor of the Chinese Association of Artificial Intelligence (CAAI), the chief secretary of the CAAI Machine Learning Society, and a member of the IEEE and the IEEE Computer Society.
Yang Yu received the B.Sc. degree in computer science from Nanjing University, China, in 2004. He has received awards including the China Computer World Scholarship (2004) and a scholarship for outstanding undergraduates. He is now a member of the LAMDA Group and will pursue his M.Sc. degree at the Department of Computer Science & Technology of Nanjing University beginning in September 2005, supervised by Prof. Zhi-Hua Zhou. His research interests are in machine learning and evolutionary computing.
About this article
Cite this article
Zhou, ZH., Yu, Y. Adapt Bagging to Nearest Neighbor Classifiers. J Comput Sci Technol 20, 48–54 (2005). https://doi.org/10.1007/s11390-005-0005-5