Skip to main content

Naxi-Chinese Bilingual Word Alignment Method Based on Entity Constraint

  • Conference paper
Chinese Lexical Semantics (CLSW 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8229))

Included in the following conference series:

  • 2473 Accesses

Abstract

This paper proposes Naxi-Chinese bilingual word alignment method based on entity constraint for the characteristic that entity and entity is alignment in bilingual alignment. First, we mark the corpus of Naxi word segmentation and entity label, using Conditional Random Fields constructs the model of Naxi word segmentation and entity recognition to achieve Naxi word segmentation and entity recognition. Then, using entity alignment constraint relation of Naxi-Chinese sentence uses the specific marker to mark out bilingual entity of Naxi-Chinese sentence. Finally, we introduce features, such as fertility probabilities and distortion probabilities et al, and achieve the Naxi-Chinese bilingual alignment model by GIZA++. Naxi-Chinese bilingual word alignment experimental result shows the method proposed by this paper has better effect compared to only use IBM Models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Guo, H., Che, W., et al.: Web Embedding Fonts Technology of Naxi Pictographs. Computer Engineering 17(31), 203–207 (2005)

    Google Scholar 

  2. Guo, H.: The Design and Development of Outlines Font of Naxi Hieroglyphic and the Embedding Fonts Technology of Naxi Hieroglyphic. Kunming, Kunming University of Science and Technology (2005)

    Google Scholar 

  3. Zhao, F., Yu, Z., Xian, Y., et al.: Naxi-Chinese Bilingual Corpus and Building a Bilingual Corpus Alignment. Journal of Guangxi Normal Uniwersity: Natural Science Edition 27(01), 20–24 (2009)

    Google Scholar 

  4. Zhao, F.: Naxi-Chinese Bilingual Translation Method Research. Kunming, Kunming University of Science and Technology (2010)

    Google Scholar 

  5. Brown, P.F., Mercer, R.L., et al.: The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2), 263–311 (1993)

    Google Scholar 

  6. Och, F.J., Ney, H.: A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics 29(1), 19–51 (2003)

    Article  MATH  Google Scholar 

  7. Tang, J., Hong, M., Li, J., et al.: Tree-structured Conditional Random Fields for Semantic annotation. In: 5th International Semantic Web Conference, Athens, USA, pp. 640–653 (2006)

    Google Scholar 

  8. Zhao, H., Huang, C., Li, M.: An Improved Chinese Word Segmentation System with Conditional Random Field. In: Fifth Sighan Workingshop on Chinese Language Processing (SIGHAN-5), pp. 162–165 (2006)

    Google Scholar 

  9. Zheng, W.-N., Bo, P.-B., Liu, Y., Wang, W.-P.: Fast B-spline curve fitting by L-BFGS. Computer Aided Geometric Design 29(7), 448–462 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  10. Nagao, M.: A Framework of a Mechanical Translation between Japanese and English by Analogy Principle. In: Artificial and Human Intelligence, pp. 173–180. North-Holland, Amsterdam (1984)

    Google Scholar 

  11. Zhao, J.: A Survey on Named Entity Recognition, Disambiguation and Cross-Lingual Coreference Resolution. Journal of Chinese Information Processing 23(2), 3–17 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, X., Yu, Z., Guo, J., Pan, X., Mao, C. (2013). Naxi-Chinese Bilingual Word Alignment Method Based on Entity Constraint. In: Liu, P., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2013. Lecture Notes in Computer Science(), vol 8229. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45185-0_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-45185-0_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-45184-3

  • Online ISBN: 978-3-642-45185-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics