Abstract
Sentence similarity computing is a typical technology used in natural language processing, which aims at finding valuable information from documents. By adapting advanced technologies, such as machine learning and deep learning, current sentence similarity computing methods mainly deal with key words and structures of sentences. The main drawback of current methods is taking no consideration of the influence of sentences context. In this paper, we propose a frame semantics theory based computing method that is built upon FrameNet. By quantitatively analyzing the semantic relations among frames, sentences similarity can be calculated based on the frames that are evoked by the sentences. We also carry out experiments to evaluate the performance of this method with the help of a prototype software tool we developed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Allahyari, M., et al.: A brief survey of text mining: classification, clustering and extraction techniques. arXiv preprint arXiv:1707.02919 (2017)
Li, J., Liu, C., Yu, J.X., Chen, Y., Sellis, T., Culpepper, J.S.: Personalized influential topic search via social network summarization. In: IEEE International Conference on Data Engineering. IEEE Computer Society (2017)
Atkinson-Abutridy, J., Mellish, C., Aitken, S.: Combining information extraction with genetic algorithms for text mining. IEEE Intell. Syst. 19(3), 22–30 (2004)
Pawar, A., Mago, V.: Calculating the similarity between words and sentences using a lexical database and corpus statistics. arXiv preprint arXiv:1802.05667 (2018)
Farouk, M.: Measuring Sentences Similarity: A Survey. arXiv preprint arXiv:1910.03940 (2019)
Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Ruppenhofer, J., Ellsworth, M., Schwarzer-Petruck, M., Johnson, C.R., Scheffczyk, J.: FrameNet II: Extended Theory and Practice (2006)
Fillmore, C.J., Baker, C.F.: Frame semantics for text understanding. In: Proceedings of WordNet and Other Lexical Resources Workshop, NAACL (vol. 6) (June 2001)
Das, D., Schneider, N., Chen, D., Smith, N.A.: Probabilistic frame-semantic parsing. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 948–956. Association for Computational Linguistics (June 2010)
Wang, T., Truptil, S., Benaben, F.: An automatic model-to-model mapping and transformation methodology to serve model-based systems engineering. IseB 15(2), 323–376 (2016). https://doi.org/10.1007/s10257-016-0321-z
Li, J., Liu, C., Islam, M.: Keyword-based correlated network computation over large social media. In: IEEE International Conference on Data Engineering IEEE (2014)
Derczynski, L.: Complementarity, F-score, and NLP Evaluation. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp. 261–266 (May 2016)
Mihalcea, R., Corley, C., Strapparava, C.: Corpus-based and knowledge-based measures of text semantic similarity. In: AAAI, vol. 6, No. 2006, pp. 775–780 (July 2006)
Wang, Z., Mi, H., Ittycheriah, A.: Sentence similarity learning by lexical decomposition and composition. arXiv preprint arXiv:1602.07019 (2016)
Abdalgader, K., Skabar, A.: Short-text similarity measurement using word sense disambiguation and synonym expansion. In: Li, J. (ed.) AI 2010. LNCS (LNAI), vol. 6464, pp. 435–444. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17432-2_44
Lee, M.C., Chang, J.W., Hsieh, T.C.: A grammar-based semantic similarity algorithm for natural language sentences. Sci. World J. 2014 (2014)
Batanović, V., Bojić, D.: Using part-of-speech tags as deep-syntax indicators in determining short-text semantic similarity. Comput. Sci. Inf. Syst. 12(1), 1–31 (2015)
Putra, J.W.G., Tokunaga, T.: Evaluating text coherence based on semantic similarity graph. In: Proceedings of TextGraphs-11: the Workshop on Graph-based Methods for Natural Language Processing, pp. 76–85 (August 2017)
Ji, Y., Jacob, E.: Discriminative improvements to distributional sentence similarity. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (2013)
Acknowledgement
This work was supported by the National Natural Science Foundation of China (61872182), the Open Fund of the Ministry Key Laboratory for Safety-Critical Software Development and Verification (XCA1816401 and XCA19016-06).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, W., Wang, T., Yang, Z., Cao, J. (2020). A Context-Aware Computing Method of Sentence Similarity Based on Frame Semantics. In: Yang, X., Wang, CD., Islam, M.S., Zhang, Z. (eds) Advanced Data Mining and Applications. ADMA 2020. Lecture Notes in Computer Science(), vol 12447. Springer, Cham. https://doi.org/10.1007/978-3-030-65390-3_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-65390-3_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65389-7
Online ISBN: 978-3-030-65390-3
eBook Packages: Computer ScienceComputer Science (R0)