Skip to main content

Chinese Paraphrases Acquisition Based on Random Walk N Step

  • Conference paper
  • First Online:
  • 4565 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10102))

Abstract

Conventional “pivot-based” approach of acquiring paraphrasing from bilingual corpus has limitations, where only paraphrases within two steps were considered. We propose a graph based model of acquiring paraphrases from phrases translation table. This paper describes the way of constructing graph model from phrases translation table, a random walk algorithm based on N number of steps and a confidence metric for ranking the obtained results. Furthermore, we augment the model to be able to integrate more language pairs, for instance, exploiting English-Japanese phrases translation table for finding more potential Chinese paraphrases. We performed experiments on NTCIR Chinese-English and English-Japanese bilingual corpora and compared with the conventional method. The experimental results showed that the proposed model acquired more paraphrases, and performed more well after English-Japanese phrases translation was added into the graph model.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Barzilay, R., McKeown, K.R.: Extracting paraphrases from a parallel corpus. In: Proceedings of ACL/EACL, pp. 50–57 (2001)

    Google Scholar 

  2. Callison-Burch, C., Koehn, P’., Osborne, M.: Improved statistical machine translation using paraphrases. In: Proceedings of the HLT-NAACL, vol. 1, pp. 7–24. Association for Computational Linguistics, Morristown (2006)

    Google Scholar 

  3. Barzilay, R.: Information fusion for multi-document summarization: paraphrasing and generation. Ph.D. thesis, Columbia University (2003)

    Google Scholar 

  4. Zhou, L., Lin, C.Y., Munteanu, D.S., Hovy, E.: ParaEval: using paraphrases to evaluate summaries automatically. In: Proceedings of the HLT-NAACL, pp. 447–454. Association for Computational Linguistics, Morristown (2006)

    Google Scholar 

  5. Zukerman, I., Raskutti, B.: Lexical query paraphrasing for document retrieval. In: Proceedings of COLING, pp. 1–7. Association for Computational Linguistics, Morristown (2002)

    Google Scholar 

  6. Iordanskaja, L., Kittredge, R., Polguere, A.: Lexical selection and paraphrase in a meaning—text generation model. In: Paris, C.L., Swartout, W.R., Mann, W.C. (eds.) Natural Language Generation in Artificial Intelligence and Computational Linguistics, pp. 293–312. Springer, Heidelberg (1991)

    Chapter  Google Scholar 

  7. McKeown, K.R.: Paraphrasing using given and new information in a question-answer system. In: Proceedings of the ACL, pp. 67–72. Association for Computational Linguistics, Morristown (1979)

    Google Scholar 

  8. 赵世奇.基于统计的复述获取与生成技术研究[学位论文].哈尔滨.哈尔滨工业大学, pp. 1–9 (2009)

    Google Scholar 

  9. Kok, S., Brockett, C.: Hitting the Right Paraphrases in Good Time, pp. 45–153. ACM (2010)

    Google Scholar 

  10. 徐晓华.图上的随机游走学习[学位论文].南京.南京航空航天大学 (2008)

    Google Scholar 

Download references

Acknowledgments

The research work has been partially funded by the International Science and Technology Cooperation Program of China under grant No. 2014DFA11350, National Nature Science Foundation of China (Contract 61370130 and 61473294), and the Fundamental Research Funds for the Central Universities (2015JBM033).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yujie Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Ma, J., Zhang, Y., Xu, J., Chen, Y. (2016). Chinese Paraphrases Acquisition Based on Random Walk N Step. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_57

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50496-4_57

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50495-7

  • Online ISBN: 978-3-319-50496-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics