Abstract
Chinese V-N collocations can stand in one of two structural relations: verb-object or attributive-head. Both are widely used in Chinese language processing tasks, but long-distance and low-frequency collocations are often difficult to extract. A weighted mutual information (WMI) model and a rule-based method were designed to acquire V-N collocations by taking more syntactic structure features into consideration. The WMI model extracted verb-object collocations within clauses. By giving different weights to collocates in different positions, it reduced the interference of illegal collocates and highlighted the weight of long-distance collocates. The rule-based method used part-of-speech patterns to extract verb-object and attributive-head collocations and to infer implicit collocations. The experiments show that the WMI model improves the evaluation scores of long-distance collocations, while the rule-based method is more accurate in extracting and distinguishing the two kinds of collocations, including low-frequency ones.
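The abstract does not give the WMI formula, so the following is only a minimal sketch of the general idea of position-weighted mutual information, not the paper's model. It assumes that each clause is a list of (word, tag) pairs, that verbs and nouns carry the tags "v" and "n", that only nouns following the verb inside the same clause are candidate objects, and that the weighting scheme is supplied by the caller. The names `weighted_mi` and `weight_for_distance` are hypothetical.

```python
import math
from collections import Counter


def weighted_mi(clauses, weight_for_distance, verb_tag="v", noun_tag="n"):
    """Score (verb, noun) pairs with a position-weighted pointwise MI.

    clauses: list of clauses, each a list of (word, pos_tag) tuples.
    weight_for_distance: function mapping the verb-noun distance in tokens
        (1 = adjacent) to a weight; this scheme is an assumption, not the
        weighting used in the paper.
    """
    pair_weight = Counter()   # weighted co-occurrence mass per (verb, noun)
    word_count = Counter()    # plain frequency per (word, tag)
    total_tokens = 0

    for clause in clauses:
        total_tokens += len(clause)
        for word, tag in clause:
            if tag in (verb_tag, noun_tag):
                word_count[(word, tag)] += 1
        # Pair each verb with every later noun in the same clause.
        for i, (verb, tag_i) in enumerate(clause):
            if tag_i != verb_tag:
                continue
            for j in range(i + 1, len(clause)):
                noun, tag_j = clause[j]
                if tag_j == noun_tag:
                    pair_weight[(verb, noun)] += weight_for_distance(j - i)

    scores = {}
    for (verb, noun), weighted in pair_weight.items():
        if weighted <= 0:
            continue  # a zero weight means the position was ruled out
        expected = word_count[(verb, verb_tag)] * word_count[(noun, noun_tag)]
        # log2( P_w(v, n) / (P(v) * P(n)) ) with probabilities over tokens
        scores[(verb, noun)] = math.log2(weighted * total_tokens / expected)
    return scores


if __name__ == "__main__":
    toy_clauses = [
        [("吃", "v"), ("苹果", "n")],
        [("吃", "v"), ("了", "u"), ("苹果", "n")],
        [("看", "v"), ("书", "n")],
    ]
    # Example scheme only: weight grows with distance.
    for pair, score in weighted_mi(toy_clauses, lambda d: float(d)).items():
        print(pair, round(score, 3))
```

With a constant weight the score reduces to ordinary pointwise mutual information over clause-internal pairs. A weight that grows with distance raises the score of pairs separated by intervening words, which is one way to read "highlighting long-distance collocates".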
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qian, X. (2013). Automatic Extraction of Chinese V-N Collocations. In: Ji, D., Xiao, G. (eds) Chinese Lexical Semantics. CLSW 2012. Lecture Notes in Computer Science, vol 7717. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36337-5_24
DOI: https://doi.org/10.1007/978-3-642-36337-5_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36336-8
Online ISBN: 978-3-642-36337-5
eBook Packages: Computer Science (R0)