Abstract
A new LVQ-inspired adaptive method is introduced to optimize strings for the 1-NN classifier. The updating rule relies on the edit distance. Given an initial number of string prototypes and a training set, the algorithm builds supervised clusters by attaching training samples to prototypes. A prototype is then rewarded to get it closer to the members of its cluster. To this end, the prototype is updated according to the most frequent edit operations resulting from edit distance computations to all members of its cluster. The process reorganizes training samples into new clusters and continues until the convergence of prototypes is achieved. A series of learning/classification experiments is presented which show a better 1-NN performance of the new prototypes with respect to the initial ones, that were originally good for classification.
This work has been partially supported by the grant CTIDIA/2002/80 of Valencian OCYT and by the grant TIC2000-1703-CO3-01 of Spanish CICYT.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dasarathy, B.V. (ed.): Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)
Hart, P.E.: The condensed nearest neighbor rule. IEEE Transactions on Information Theory 14, 515–516 (1968)
Kohonen, T.: Improved versions of learning vector quantization. In: Proc. of the Int. Conf. on Neural Networks, San Diego, CA, vol. 1, pp. 545–550 (1990)
Bezdek, J.C., Reichherzer, T.R., Lim, G.S., Attikiouzel, Y.: Multiple-prototype classifier design. IEEE Trans. on System, Man and Cybernetics 28, 67–79 (1998)
Mollineda, R., Ferri, F., Vidal, E.: An efficient prototype merging strategy for the condensed 1-nn rule through class conditional hierarchical clustering. Pattern Recognition 35, 2771–2782 (2002)
Kohonen, T., Barna, G., Chrisley, R.: Statistical pattern recognition with neural networks: Benchmarking studies. In: Proc. IJCNN, San Diego, CA, vol. I, pp. 61–68. IEEE Computer Soc. Press, Los Alamitos (1988)
Wagner, R.A., Fischer, M.J.: The string-to-string correction problem. J. Assoc. Comput. Machinery 21, 168–173 (1974)
Lundsteen, C., Philip, J., Granum, E.: Quantitative analysis of 6895 digitized trypsin g-banded human chromosomes. Clinic Genetics 18, 355–370 (1980)
Martínez-Hinarejos, C.D., Juan, A., Casacuberta, F.: Median strings for k-nearest neighbour classification. Pattern Recognition Letters (2003)
Andreu, G., Crespo, A., Valiente, J.M.: Selecting the toroidal self-organizing feature maps (TSOFM) best organized to object recognition. In: Proceedings of ICNN 1997, Houston, Texas (USA), vol. 2, pp. 1341–1346. IEEE, Los Alamitos (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mollineda, R.A., Vidal, E., Martínez-Hinarejos, C. (2003). Adaptive Learning for String Classification. In: Perales, F.J., Campilho, A.J.C., de la Blanca, N.P., Sanfeliu, A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2003. Lecture Notes in Computer Science, vol 2652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44871-6_66
Download citation
DOI: https://doi.org/10.1007/978-3-540-44871-6_66
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40217-6
Online ISBN: 978-3-540-44871-6
eBook Packages: Springer Book Archive