Abstract
This paper describes the development of a system for synonymous and quasi-synonymous paraphrasing and its practical applications, primarily in search engine optimization and information extraction. The system is part of the ETAP-3 multifunctional NLP environment created by the Laboratory of Computational Linguistics of the Kharkevich Institute for Information Transmission Problems. Combinatorial dictionaries of Russian, English, and some other languages, together with a rule-driven parser, constitute the core of ETAP-3, while a variety of generation modules serve a number of applications. The paraphrase generator, based on the apparatus of lexical functions, is one such module. We describe the general layout of the paraphrase generator and discuss an experiment that demonstrates its potential as a tool for search optimization.
This study was supported in part by the Russian Foundation for Basic Research under grant No. 08-06-00344, for which the authors are grateful.
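To make the idea of paraphrasing with lexical functions more concrete, here is a minimal sketch of how a query could be expanded with support-verb paraphrases. It is not the ETAP-3 implementation: the tiny lexicon, the `ARTICLE` table, and the function names `paraphrase_verb` and `expand_query` are all hypothetical and chosen for illustration. The sketch assumes two lexical functions from Meaning-Text Theory, S0 (the action noun of a verb) and Oper1 (a support verb for that noun), and applies the well-known equivalence V ⇔ Oper1(S0(V)) + S0(V). Unlike a real system built on a parser, it ignores syntax and government patterns.

```python
from typing import Dict, List

# Toy lexicon (hypothetical, for illustration only; not ETAP-3 data).
# S0 maps a verb to its action noun; OPER1 maps that noun to a support verb.
S0: Dict[str, str] = {
    "decide": "decision",
    "advise": "advice",
    "influence": "influence",
}
OPER1: Dict[str, str] = {
    "decision": "make",
    "advice": "give",
    "influence": "exert",
}
# Article to place before the noun (empty string means no article).
ARTICLE: Dict[str, str] = {"decision": "a", "advice": "", "influence": ""}


def paraphrase_verb(verb: str) -> List[str]:
    """Return the verb itself plus its support-verb paraphrase, if known."""
    variants = [verb]
    noun = S0.get(verb)
    if noun is not None and noun in OPER1:
        parts = [OPER1[noun], ARTICLE.get(noun, ""), noun]
        variants.append(" ".join(p for p in parts if p))
    return variants


def expand_query(query: str) -> List[str]:
    """Return the lowercased query followed by variants in which one known
    verb is replaced by its support-verb paraphrase."""
    tokens = query.lower().split()
    results = [" ".join(tokens)]
    for i, tok in enumerate(tokens):
        for variant in paraphrase_verb(tok)[1:]:
            results.append(" ".join(tokens[:i] + [variant] + tokens[i + 1:]))
    return results


if __name__ == "__main__":
    for variant in expand_query("Decide quickly"):
        print(variant)
```

In a search setting, the variants returned by `expand_query` could be submitted alongside the original query so that documents phrased with "make a decision" also match a query containing "decide". A realistic generator would rely on full dictionary entries and a syntactic parse rather than token substitution.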
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Apresjan, J.D., Boguslavsky, I.M., Iomdin, L.L., Cinman, L.L., Timoshenko, S.P. (2009). Semantic Paraphrasing for Information Retrieval and Extraction. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science, vol 5822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04957-6_44
DOI: https://doi.org/10.1007/978-3-642-04957-6_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04956-9
Online ISBN: 978-3-642-04957-6
eBook Packages: Computer Science, Computer Science (R0)