Semantic Paraphrasing for Information Retrieval and Extraction

  • Conference paper
Flexible Query Answering Systems (FQAS 2009)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5822)


Abstract

The paper is devoted to the development of a system of synonymous and quasi-synonymous paraphrasing and its practical applications, primarily in search engine optimization and information extraction. This system is part of the ETAP-3 multifunctional NLP environment created by the Laboratory of Computational Linguistics of the Kharkevich Institute for Information Transmission Problems. Combinatorial dictionaries of Russian, English and some other languages and a rule-driven parser constitute the core of ETAP-3, while a variety of generating modules are used in a number of applications. The paraphrase generator, based on the apparatus of lexical functions, is one such module. We describe the general layout of the paraphrase generator and discuss an experiment that demonstrates its potential as a tool for search optimization.
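
As a rough illustration of what paraphrasing "based on the apparatus of lexical functions" can mean, the sketch below applies one well-known paraphrasing rule from the Meaning-Text tradition, V0(X) <=> Oper1(X) + X (for example, "decide" <=> "make a decision"), to a lemmatized query. It is a minimal toy under stated assumptions and not the ETAP-3 implementation: the lexicon entries, the function name `paraphrase_lemmas`, and the flat list-of-lemmas representation are all hypothetical, and articles and inflection are ignored.

```python
from typing import Dict, List, Set

# Hypothetical toy lexicon (NOT ETAP-3 dictionary data).
# For each noun X: Oper1(X) is its support verb, V0(X) the corresponding verb.
LEXICON: Dict[str, Dict[str, str]] = {
    "decision": {"Oper1": "make", "V0": "decide"},
    "advice": {"Oper1": "give", "V0": "advise"},
    "analysis": {"Oper1": "perform", "V0": "analyze"},
}


def paraphrase_lemmas(lemmas: List[str]) -> Set[str]:
    """Return the input query plus variants from the rule V0(X) <=> Oper1(X) + X."""
    variants = {" ".join(lemmas)}
    for noun, lf in LEXICON.items():
        verb, support = lf["V0"], lf["Oper1"]
        for i, lemma in enumerate(lemmas):
            # Verb -> support verb + noun ("decide" -> "make decision").
            if lemma == verb:
                variants.add(" ".join(lemmas[:i] + [support, noun] + lemmas[i + 1:]))
            # Support verb + noun -> verb ("give advice" -> "advise").
            if lemma == support and lemmas[i + 1:i + 2] == [noun]:
                variants.add(" ".join(lemmas[:i] + [verb] + lemmas[i + 2:]))
    return variants


if __name__ == "__main__":
    for query in (["decide", "quickly"], ["give", "advice"]):
        print(sorted(paraphrase_lemmas(query)))
```

In a search setting, each returned variant could be issued as an additional query, so that documents phrased with the support-verb construction and documents phrased with the plain verb are both matched.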

This study was supported in part by the Russian Foundation of Basic Research under grant No. 08-06-00344, for which the authors are grateful.




Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Apresjan, J.D., Boguslavsky, I.M., Iomdin, L.L., Cinman, L.L., Timoshenko, S.P. (2009). Semantic Paraphrasing for Information Retrieval and Extraction. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science, vol 5822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04957-6_44

  • DOI: https://doi.org/10.1007/978-3-642-04957-6_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04956-9

  • Online ISBN: 978-3-642-04957-6

  • eBook Packages: Computer Science, Computer Science (R0)
