Abstract:
This paper presents a solution for the problem of finding interchangeable words in the context of an input collection of strings. Interchangeable words are words that can...Show MoreMetadata
Abstract:
This paper presents a solution for the problem of finding interchangeable words in the context of an input collection of strings. Interchangeable words are words that can be replaced indistinctly in phrases or free text without deviating its actual meaning. Under restricted conditions, pairs of interchangeable might be useful for data deduplication, copy detection, software localization, among others. The calculation of the degree of interchangeability involves the accurate calculation of semantic similarity between pairs of words and the search for candidate pairs in the overall search space imposed by the input collection. The solution presented in this paper is composed by a search method for candidate pairs using the Levenshtein distance algorithm
Date of Conference: 28-31 October 2007
Date Added to IEEE Xplore: 31 January 2008
ISBN Information: