ABSTRACT
Semantic similarity and semantic relatedness are features of natural language that contribute to the challenge machines face when analyzing text. Although semantic relatedness remains a complex challenge, only a few ground-truth data sets exist. We argue that the available corpora used to evaluate the performance of natural language tools do not capture all elements of the phenomenon. We present a set of simple interventions that illustrate that 1) framing effects influence similarity perception, 2) the distribution of similarity across multiple users is important, and 3) semantic relatedness is asymmetric.
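Points 2) and 3) can be made concrete with a minimal sketch. The word pair, the 0-10 scale, the rating values, and the helper names `summarize` and `asymmetry` are all hypothetical, invented for illustration only; they are not taken from the paper or from any data set. The sketch reports the spread of ratings per ordered pair rather than a single mean, and compares the two presentation orders of the same pair.

```python
from statistics import mean, stdev

# Hypothetical ratings (0-10 scale) from five annotators. The values are
# invented purely for illustration and come from no real data set.
ratings = {
    ("tiger", "cat"): [7, 8, 6, 9, 7],
    ("cat", "tiger"): [5, 6, 4, 7, 5],
}

def summarize(scores):
    """Return the mean and sample standard deviation of one pair's ratings."""
    return mean(scores), stdev(scores)

def asymmetry(ratings, a, b):
    """Difference in mean rating between the orderings (a, b) and (b, a)."""
    return mean(ratings[(a, b)]) - mean(ratings[(b, a)])

for pair, scores in ratings.items():
    m, s = summarize(scores)
    print(pair, f"mean={m:.2f}", f"sd={s:.2f}")

print("asymmetry:", asymmetry(ratings, "tiger", "cat"))
```

A test set that stores only one averaged score per unordered pair cannot represent either quantity: the standard deviation is lost in the averaging, and the two orderings are collapsed into one entry.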
Index Terms
- Possible Confounds in Word-based Semantic Similarity Test Data