Abstract
A monoid of strings (words) over a finite alphabet is considered. The notion of distance on strings is important in the problem of inductive learning related to artificial intelligence, in cryptography, and in some other fields of mathematics. The distance is defined as a minimum length of the transformation path that transforms one string into another. One example is the Levenstein distance, with the transformations being insertions, deletions, and substitutions of letters. A quadratic algorithm for calculating this distance is known to exist. In this paper, a more general case—insertion and deletion of words of arbitrary length—is considered. For this case, the problem of distance calculation turns out to be unsolvable. The basic results of this work are the formulation of the condition of computability of distance and the algorithm for distance calculation, which is polynomial in string length.
Similar content being viewed by others
References
Knuth, D. and Bendix, P.,Simple Word Problems in Universal Algebras. Computational Problems in Abstract Algebra, Oxford: Pergamon, 1967.
Book, R. and Otto, F.,String Rewriting Systems, Springer, 1993.
Abela, J., Topics in Evolving Transformation Systems,Master's Thesis, 1994.
Stephen, G., String Searching Algorithms,Lect. Notes Comput., Singapore: World Sci., 1994, vol. 3.
Kruskal, J. and Sankoff, D.,Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, Reading: Addison-Wesley, 1983.
Goldfarb, L., What is Distance and Why Do We Need the Metric Model for Pattern Learning?Pattern Recognit., 1992, vol. 25, pp. 431–438.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Golubitskii, O.D. Distance calculation on strings. Program Comput Soft 26, 97–99 (2000). https://doi.org/10.1007/BF02759195
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF02759195