Abstract
Sentence similarity is the basis for many natural language processing tasks and it is studied in this paper by the tools of ontology and Wikipedia-based Wiktionary. If a word appears in the definition of another word in Wiktionary, the two words can be said to be related to each other. Based on this kind of knowledge from Wiktionary, a graph-based ontology is built. In the graph, nodes represent words, and if one word appears in the definition of the other, there is a line between them in the graph. And the line or degree as it is called is used to compute word similarity. Accordingly, word similarity is used to compute sentence similarity. In the paper, content words such as nouns, verbs, adjectives and adverbs are used to computer sentence similarity. Sentence similarity computed in this way is effective for natural language processing tasks such as question answering, information extraction, etc. And it is used in online chat robot “FreeTalker”.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bruening, B.: Ditransitive asymmetries and a theory of idiom formation. Linguist. Inq. 41(4), 519–562 (2010)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997)
Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003)
Dumais, S.T.: Latent semantic analysis. Ann. Rev. Inf. Sci. Technol. 38(1), 188–230 (2004)
Newman, M.E.J.: Clustering and preferential attachment in growing networks. Phys. Rev. E 64(2), 025102 (2001)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998)
Lin, H., Bilmes, J.: How to select a good training-data subset for transcription: submodular active selection for sequences. In Proceedings of Conference of the International Speech Communication Association (2009)
Lin, H., Bilmes, J.: Multi-document summarization via budgeted maximization of submodular functions. In: Proceedings of NAACL-HLT (2010)
Zhao, J., Lan, M., Niu, Z., Lu, Y.: Integrating word embeddings and traditional NLP features to measure textual entailment and semantic relatedness of sentence pairs. In: The International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July, pp. 1–7 (2015)
Han, X., Mao, X.: Improved algorithm of word semantic similarity. China Sciencepaper 11(2), 202–207 (2016)
Lyu, C., Lu, Y., Ji, D., Chen, B.: Deep learning for textual entailment recognition. In: 27th International Conference on Tools with Artificial Intelligence, pp. 154–161 (2015)
Wu, Y., Li, S., Liu, J., Guo, L., Wu, X.: NETASPNO: approximate strict pattern matching under nonoverlapping condition. IEEE Access 6(1), 24350–24361 (2018)
Wu, Y., Shen, C., Jiang, H., Wu, X.: Strict pattern matching under non-overlapping condition. Sci. China Inf. Sci. 60(1), 012–101 (2017)
Wu, Y., Tang, Z., Jiang, H., Wu, X.: Approximate pattern matching with gap constraints. J. Inf. Sci. 42(5), 639–658 (2016)
Wu, Y., Fu, S., Jiang, H., Wu, X.: Strict approximate pattern matching with general gaps. Appl. Intell. 42(3), 566–580 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, Z., Liu, X. (2020). Ontology-Based Computing of Sentence Similarity. In: Liu, Y., Wang, L., Zhao, L., Yu, Z. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2019. Advances in Intelligent Systems and Computing, vol 1075. Springer, Cham. https://doi.org/10.1007/978-3-030-32591-6_104
Download citation
DOI: https://doi.org/10.1007/978-3-030-32591-6_104
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32590-9
Online ISBN: 978-3-030-32591-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)