Zero Anaphora Resolution in Chinese and Its Application in Chinese-English Machine Translation

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2007)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4592)

Abstract

In this paper, we propose a learning classifier based on maximum entropy (ME) for resolving zero anaphora (ZA) in Chinese. Besides regular grammatical, lexical, positional and semantic features, we develop two innovative Web-based features that extract additional semantic information about ZA from the Web. Our study shows that the Web can be incorporated effectively as a knowledge source in the learning framework and that it significantly improves the framework's performance. For machine translation (MT), ZA resolution is treated as a pre-processing module that is detachable and MT-independent. The experimental results demonstrate a significant improvement in BLEU/NIST scores after ZA resolution is employed.
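
As a rough illustration of the kind of classifier the abstract refers to, the sketch below fits a two-class maximum entropy model (equivalent to logistic regression) by stochastic gradient ascent on the log-likelihood. It is not the authors' implementation: the feature names (`is_subject`, `same_sentence`, `bias`), the toy training pairs, and the hyperparameters are all assumptions made up for this example, and the paper's actual feature set, including its Web-based features, is not reproduced here.

```python
import math

def predict(weights, feats):
    """Return P(label = 1 | feats) under a two-class maximum entropy model."""
    score = sum(weights.get(name, 0.0) * value for name, value in feats.items())
    score = max(-30.0, min(30.0, score))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-score))

def train_maxent(examples, epochs=200, lr=0.5):
    """Fit feature weights by stochastic gradient ascent on the log-likelihood.

    examples: list of (feats, label) pairs; feats maps a feature name to a
    numeric value, and label is 1 if the candidate is the antecedent, else 0.
    """
    weights = {}
    for _ in range(epochs):
        for feats, label in examples:
            p = predict(weights, feats)
            for name, value in feats.items():
                # Gradient of the log-likelihood for one example: (label - p) * value
                weights[name] = weights.get(name, 0.0) + lr * (label - p) * value
    return weights

# Hypothetical toy data: each item describes one (zero anaphor, candidate) pair.
examples = [
    ({"bias": 1, "is_subject": 1, "same_sentence": 1}, 1),
    ({"bias": 1, "is_subject": 1, "same_sentence": 0}, 1),
    ({"bias": 1, "is_subject": 0, "same_sentence": 1}, 0),
    ({"bias": 1, "is_subject": 0, "same_sentence": 0}, 0),
]

weights = train_maxent(examples)
p_subject = predict(weights, {"bias": 1, "is_subject": 1, "same_sentence": 1})
p_other = predict(weights, {"bias": 1, "is_subject": 0, "same_sentence": 1})
print(p_subject > p_other)
```

In a resolution setting, a model of this shape would typically score every candidate antecedent for a detected zero anaphor and select the highest-scoring one; whether the paper does exactly this is not stated in the abstract.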


Editor information

Zoubida Kedad, Nadira Lammari, Elisabeth Métais, Farid Meziane, Yacine Rezgui

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Peng, J., Araki, K. (2007). Zero Anaphora Resolution in Chinese and Its Application in Chinese-English Machine Translation. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_32

  • DOI: https://doi.org/10.1007/978-3-540-73351-5_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73350-8

  • Online ISBN: 978-3-540-73351-5

  • eBook Packages: Computer Science, Computer Science (R0)
