Corpus-Based Statistics of Pre-Qin Chinese

Li, Bin; Xi, Ning; Feng, Minxuan; Chen, Xiaohe

doi:10.1007/978-3-642-36337-5_16

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7717)

Included in the following conference series:

Workshop on Chinese Lexical Semantics

Abstract

Pre-Qin Chinese plays a key role in the history of the Chinese language. However, for lack of an annotated corpus, there is still no clear overview of Pre-Qin Chinese vocabulary. This paper introduces a corpus of 25 Pre-Qin classical texts with manual word segmentation and part-of-speech tagging. Character and word frequencies are then calculated from the corpus. Character entropy, word length in syllables, and words with multiple parts of speech are also analyzed statistically.
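
The statistics named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `word/TAG` line format, the tag labels, the function names, and the three toy lines are all assumptions made for illustration, and entropy is taken here as Shannon entropy over character frequencies, H = -Σ p(c) log2 p(c). Word length in characters stands in for syllable count, on the assumption that one character corresponds to one syllable.

```python
import math
from collections import Counter, defaultdict


def parse_tagged_line(line):
    """Split 'word/TAG word/TAG ...' into (word, tag) pairs.

    The slash-separated format is a hypothetical one chosen for this sketch.
    """
    pairs = []
    for token in line.split():
        word, sep, tag = token.rpartition("/")
        if not sep:  # token carries no tag
            word, tag = token, ""
        pairs.append((word, tag))
    return pairs


def character_entropy(words):
    """Shannon entropy in bits per character over all characters in `words`."""
    counts = Counter(ch for word in words for ch in word)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def corpus_statistics(lines):
    """Compute frequency, length, entropy, and multi-POS statistics."""
    pairs = [pair for line in lines for pair in parse_tagged_line(line)]
    words = [word for word, _ in pairs]

    word_freq = Counter(words)
    char_freq = Counter(ch for word in words for ch in word)
    # Distribution of distinct words (types) by length in characters.
    length_dist = Counter(len(word) for word in word_freq)

    tags_by_word = defaultdict(set)
    for word, tag in pairs:
        tags_by_word[word].add(tag)
    # Words observed with more than one part-of-speech tag.
    multi_pos = {w: tags for w, tags in tags_by_word.items() if len(tags) > 1}

    return {
        "word_freq": word_freq,
        "char_freq": char_freq,
        "length_dist": length_dist,
        "char_entropy": character_entropy(words),
        "multi_pos": multi_pos,
    }


if __name__ == "__main__":
    # Toy input with made-up tags; not taken from the paper's corpus.
    sample = ["子/n 曰/v", "學/v 而/c 時/n 習/v 之/r", "之/v 子/n"]
    stats = corpus_statistics(sample)
    print(stats["word_freq"].most_common(3))
    print(round(stats["char_entropy"], 3))
    print(stats["multi_pos"])
```

Counting lengths over word types rather than tokens is a design choice of this sketch. A token-based distribution would instead iterate over `words`.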

Author information

Authors and Affiliations

Research Center of Language and Informatics, Nanjing Normal University, 210097, Nanjing, China
Bin Li, Minxuan Feng & Xiaohe Chen
State Key Lab for Novel Software Technology, Nanjing University, 210023, Nanjing, China
Bin Li & Ning Xi

Editor information

Editors and Affiliations

Computer School, Wuhan University, 430072, Wuhan, China
Donghong Ji
College of Chinese Language and Literature, Wuhan University, 430072, Wuhan, China
Guozheng Xiao

About this paper

Cite this paper

Li, B., Xi, N., Feng, M., Chen, X. (2013). Corpus-Based Statistics of Pre-Qin Chinese. In: Ji, D., Xiao, G. (eds) Chinese Lexical Semantics. CLSW 2012. Lecture Notes in Computer Science, vol 7717. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36337-5_16

DOI: https://doi.org/10.1007/978-3-642-36337-5_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36336-8
Online ISBN: 978-3-642-36337-5
eBook Packages: Computer Science, Computer Science (R0)