Comparison of Instances Seletion Algorithms I. Algorithms Survey

Jankowski, Norbert; Grochowski, Marek

doi:10.1007/978-3-540-24844-6_90

Norbert Jankowski²² &
Marek Grochowski²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3070))

Included in the following conference series:

International Conference on Artificial Intelligence and Soft Computing

1515 Accesses
71 Citations

Abstract

Several methods were proposed to reduce the number of instances (vectors) in the learning set. Some of them extract only bad vectors while others try to remove as many instances as possible without significant degradation of the reduced dataset for learning. Several strategies to shrink training sets are compared here using different neural and machine learning classification algorithms. In part II (the accompanying paper) results on benchmarks databases have been presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Reducing Data Volume in Instance Based Learning

Unsupervised instance selection via conjectural hyperrectangles

Article 02 November 2022

A Fast and Efficient Algorithm for Filtering the Training Dataset

References

Duda, R.O., Hart, P.E., Stork, D.G.: Patter Classification and Scene Analysis, 2nd edn. Wiley, Chichester (1997)
Google Scholar
Aha, D.W., Kibler, D., Albert, M.K.: Aha. Machine Learning 6, 37–66 (1991)
Google Scholar
Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. Institute of Electrical and Electronics Engineers Transactions on Information Theory 13, 21–27 (1967)
MATH Google Scholar
Hart, P.E.: The condensed nearest neighbor rule. IEEE Transactions on Information Theory 14, 515–516 (1968)
Article Google Scholar
Ritter, G.L., Woodruff, H.B., Lowry, S.R., Isenhour, T.L.: An algorithm for a selective nearest neighbor decision rule. IEEE Transactions on Information Theory 21, 665–669 (1975)
Article MATH Google Scholar
Gates, G.: The reduced nearest neighbor rule. IEEE Transactions on Information Theory 18, 431–433 (1972)
Article Google Scholar
Wilson, D.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Transactions on Systems, Man, and Cybernetics 2, 408–421 (1972)
Article MATH Google Scholar
Cameron-Jones, R.M.: Instance selection by encoding length heuristic with random mutation hill climbing. In: Proceedings of the Eighth Australian Joint Conference on Artificial Intelligence, pp. 99–106 (1995)
Google Scholar
Bhattacharya, B.K., Poulsen, R.S., Toussaint, G.T.: Application of proximity graphs to editing nearest neighbor decision rule. In: International Symposium on Information Theory, Santa Monica (1981)
Google Scholar
Brighton, H., Mellish, C.: Advances in instance selection for instance-based learning algorithms. Data Mining and Knowledge Discovery 6, 153–172 (2002)
Article MATH MathSciNet Google Scholar
Skalak, D.B.: Prototype and feature selection by sampling and random mutation hill climbing algorithms. In: International Conference on Machine Learning, pp. 293–301 (1994)
Google Scholar
Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2002)
Google Scholar
Grab̧czewski, K., Duch, W.: A general purpose separability criterion for classification systems. In: 4th Conference on Neural Networks and Their Applications, pp. 203–208. Polish Neural Networks Society, Zakopane (1999)
Google Scholar
Adamczak, R., Duch, W., Jankowski, N.: New developments in the feature space mapping model. In: Third Conference on Neural Networks and Their Applications, pp. 65–70. Polish Neural Networks Society, Kule (1997)
Google Scholar
Jankowski, N., Kadirkamanathan, V.: Statistical control of RBF-like networks for classification. In: 7th International Conference on Artificial Neural Networks, pp. 385–390. Springer, Lausanne (1997)
Google Scholar
Tomek, I.: An experiment with the edited nearest-neighbor rule. IEEE Transactions on Systems, Man, and Cybernetics 6, 448–452 (1976)
Article MATH MathSciNet Google Scholar
Grochowski, M.: Wybór wektorów referencyjnych dla wybranych method klasyfikacji. Master’s thesis, Department of Informatics, Nicholas Copernicus University, Poland (2003)
Google Scholar
Jankowski, N.: Data regularization. In: Rutkowski, L., Tadeusiewicz, R. (eds.) Neural Networks and Soft Computing, Zakopane, Poland, pp. 209–214 (2000)
Google Scholar
Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithms. Machine Learning 38, 257–286 (2000)
Article MATH Google Scholar
Kohonen, T.: Learning vector quantization for pattern recognition. Technical Report TKK-F-A601, Helsinki University of Technology, Espoo, Finland (1986)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics, Nicholaus Copernicus University, ul. Grudziądzka 5, 87-100, Toruń, Poland
Norbert Jankowski & Marek Grochowski

Authors

Norbert Jankowski
View author publications
You can also search for this author in PubMed Google Scholar
Marek Grochowski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Artificial Intelligence, Academy of Humanities and Economics, Poland
Leszek Rutkowski
German Research Center of Artificial Intelligence (DFKI), Germany
Jörg H. Siekmann
Institute of Automatics, AGH University of Science and Technology, Al. Mickiewicza 30, PL-30-059, Kraków, Poland
Ryszard Tadeusiewicz
Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Initiative in Soft Computing (BISC), 94720-1776, Berkeley, CA
Lotfi A. Zadeh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jankowski, N., Grochowski, M. (2004). Comparison of Instances Seletion Algorithms I. Algorithms Survey. In: Rutkowski, L., Siekmann, J.H., Tadeusiewicz, R., Zadeh, L.A. (eds) Artificial Intelligence and Soft Computing - ICAISC 2004. ICAISC 2004. Lecture Notes in Computer Science(), vol 3070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24844-6_90

Download citation

DOI: https://doi.org/10.1007/978-3-540-24844-6_90
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22123-4
Online ISBN: 978-3-540-24844-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Comparison of Instances Seletion Algorithms I. Algorithms Survey

Abstract

Access this chapter

Preview

Similar content being viewed by others

Reducing Data Volume in Instance Based Learning

Unsupervised instance selection via conjectural hyperrectangles

A Fast and Efficient Algorithm for Filtering the Training Dataset

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Comparison of Instances Seletion Algorithms I. Algorithms Survey

Abstract

Access this chapter

Preview

Similar content being viewed by others

Reducing Data Volume in Instance Based Learning

Unsupervised instance selection via conjectural hyperrectangles

A Fast and Efficient Algorithm for Filtering the Training Dataset

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation