Abstract
Parallel Corpora are a basic resource for research in bilingual and multilingual Natural Language Process (NLP), comparative linguistics, and translational studies. The basic unit of semantics in natural language is the phrase with fixed form and meaning. Phrasal alignment is a very important application for a parallel corpus. The current paper adopts a “analyze-analyze-match” strategy to select phrases for alignment from a parallel corpus of Chinese and Arabic. An aligned phrase database is built and the verbal phrases are compared between Chinese and Arabic. Verbal phrases in Arabic typically have three categories: Verb + Noun Phrase, Verb + Prepositional Phrase, and Particle + Verb. When aligned to Chinese phrases, each category of Arabic verb phrase corresponds to a number of phrase categories, including verb phrases, as well as noun phrases and prepositional phrases. The translational examples and rules derived from the aligned Arabic and Chinese phrases can have important supporting role in translational studies and comparative linguistics research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Akef, A. M., Yang E., Wang Y.: An analyzing of Arabic phrases for Chinese Arabic syntax phrase database study. In: The 12th China Workshop on Machine Translation (2016) (in Chinese).
Green S., Manning C. D.: Better Arabic parsing: Baselines, evaluations, and analysis. In: Proceedings of the 23rd International Conference on Computational Linguistics (2010).
Levy R., Manning C. D.: Is it harder to parse Chinese, or the Chinese Treebank? In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics (2003).
Liu C.: Overview of Parallel Corpus study. Journal of Yanshan University, 31(6), 120-121 (2007).
Och F J, Ney H.: A systematic comparison of various statistical alignment models. Computational linguistics, 29(1), 19–51 (2003).
Zhang C., Gao X.: The Translation Knowledge Acquisition Based on Phrase Evaluation. Harbin Institute of Technology Press (2012).
Zhou W.: Arabic Grammar Course. Shanghai Foreign Language Education Press (2011).
Homepage of the United Nations Parallel Corpus: https://conferences.unite.un.org/uncorpus, last accessed 2017/7/1.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Akef, A.M., Wang, Y., Yang, E. (2018). Construction of a Database of Parallel Phrases in Chinese and Arabic. In: Wu, Y., Hong, JF., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2017. Lecture Notes in Computer Science(), vol 10709. Springer, Cham. https://doi.org/10.1007/978-3-319-73573-3_49
Download citation
DOI: https://doi.org/10.1007/978-3-319-73573-3_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73572-6
Online ISBN: 978-3-319-73573-3
eBook Packages: Computer ScienceComputer Science (R0)