Skip to main content

Construction of a Database of Parallel Phrases in Chinese and Arabic

  • Conference paper
  • First Online:
Chinese Lexical Semantics (CLSW 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10709))

Included in the following conference series:

  • 1681 Accesses

Abstract

Parallel Corpora are a basic resource for research in bilingual and multilingual Natural Language Process (NLP), comparative linguistics, and translational studies. The basic unit of semantics in natural language is the phrase with fixed form and meaning. Phrasal alignment is a very important application for a parallel corpus. The current paper adopts a “analyze-analyze-match” strategy to select phrases for alignment from a parallel corpus of Chinese and Arabic. An aligned phrase database is built and the verbal phrases are compared between Chinese and Arabic. Verbal phrases in Arabic typically have three categories: Verb + Noun Phrase, Verb + Prepositional Phrase, and Particle + Verb. When aligned to Chinese phrases, each category of Arabic verb phrase corresponds to a number of phrase categories, including verb phrases, as well as noun phrases and prepositional phrases. The translational examples and rules derived from the aligned Arabic and Chinese phrases can have important supporting role in translational studies and comparative linguistics research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Akef, A. M., Yang E., Wang Y.: An analyzing of Arabic phrases for Chinese Arabic syntax phrase database study. In: The 12th China Workshop on Machine Translation (2016) (in Chinese).

    Google Scholar 

  2. Green S., Manning C. D.: Better Arabic parsing: Baselines, evaluations, and analysis. In: Proceedings of the 23rd International Conference on Computational Linguistics (2010).

    Google Scholar 

  3. Levy R., Manning C. D.: Is it harder to parse Chinese, or the Chinese Treebank? In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics (2003).

    Google Scholar 

  4. Liu C.: Overview of Parallel Corpus study. Journal of Yanshan University, 31(6), 120-121 (2007).

    Google Scholar 

  5. Och F J, Ney H.: A systematic comparison of various statistical alignment models. Computational linguistics, 29(1), 19–51 (2003).

    Google Scholar 

  6. Zhang C., Gao X.: The Translation Knowledge Acquisition Based on Phrase Evaluation. Harbin Institute of Technology Press (2012).

    Google Scholar 

  7. Zhou W.: Arabic Grammar Course. Shanghai Foreign Language Education Press (2011).

    Google Scholar 

  8. Homepage of the United Nations Parallel Corpus: https://conferences.unite.un.org/uncorpus, last accessed 2017/7/1.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Erhong Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Akef, A.M., Wang, Y., Yang, E. (2018). Construction of a Database of Parallel Phrases in Chinese and Arabic. In: Wu, Y., Hong, JF., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2017. Lecture Notes in Computer Science(), vol 10709. Springer, Cham. https://doi.org/10.1007/978-3-319-73573-3_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73573-3_49

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73572-6

  • Online ISBN: 978-3-319-73573-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics