Skip to main content

Developing a Syntactic and Semantic Annotation Tool for Research on Chinese Vocabulary

  • Conference paper
  • First Online:
Book cover Chinese Lexical Semantics (CLSW 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13250))

Included in the following conference series:

Abstract

Syntax and semantics are of great importance for vocabulary research. Large-scale corpus-based studies concerning the syntactic and semantic features of Chinese vocabulary are needed. To fill the gap, this study develops a syntactic and semantic annotation tool based on dependency grammar, and delves into the syntax and semantics of the most frequently used disyllabic visual verb kànjiàn ‘see’ as an example. First, this study developed a syntactic and semantic annotation tool of dependency grammar. Second, this study extracted and selected simple sentences with kànjiàn from large-scale corpora. Third, the tool was used to conduct syntactic and semantic dependency tagging and manual checking. Results show that kànjiàn can appear in six types of syntactic dependencies as a dependent, namely, Head (predicate), Attribute, Coordination, Verb-Object, and Subject-Verb. Moreover, the most frequent syntactic function of kànjiàn is being a predicate, which accounts for 85.4%. The semantic roles that kànjiàn collocates with fall into five categories, which are CONTENT, EXPERIENCER, PLACE, TIME, and MANNER. The two most frequent ones are CONTENT and EXPERIENCER. Further, this paper compared the results with previous Chinese resources. The outcomes of this study prove the importance of the syntactic and semantic annotation tool in analyzing Chinese vocabulary and help us have a deep understanding of the characteristics of Chinese visual verbs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.sogou.com/labs/.

  2. 2.

    http://bcc.blcu.edu.cn/.

  3. 3.

    http://ccl.pku.edu.cn:8080/ccl_corpus/.

  4. 4.

    The translation of these terms comes from this website: http://ltp.ai/docs/appendix.html. As we confirmed with the developer on April 26, 2022, the translations have some errors which mixed a dependency relation with the dependent in the relation. Since there is no new official translation, we only changed COO (coordinate) into COO (coordination) as it should be a noun not a verb or an adjective.

References

  1. Liu, H.: Theory and Practice of Dependency Grammar (yīcún yǔfǎ de lǐlùn yǔ shíjiàn). Science Press (kēxué chūbǎnshè), Beijing (2009)

    Google Scholar 

  2. De Marneffe, M.-C., Nivre, J.: Dependency grammar. Ann. Rev. Linguist. 5, 197–218 (2019)

    Google Scholar 

  3. Ahmad, F., Rahoman, M.-M.: Named entity classification using dependency grammar. In: 2017 20th International Conference of Computer and Information Technology (ICCIT), Dhaka, pp. 1–7. IEEE (2017)

    Google Scholar 

  4. Wang, S., Huang, C.-R.: Compositional operations of Mandarin Chinese perception verb “kàn”: a generative lexicon approach. In: Otoguro, R., Ishikawa, K., Umemoto, H., Yoshimoto, K., Harada, Y. (eds.) Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, pp. 707–714. Tohoku University, Sendai (2010)

    Google Scholar 

  5. Chen, C.: An analysis of semantic differentiation of construction of verbs “KAN”, “KANJIAN”, “KANDAO”, and “KANWAN” in Chinese (kàn kànjiàn kàndào yǔ kànwán de yǔyì chàyì tànjiū rènzhī gòushì yǔfǎ shìjiǎo). Foreign Lang. (wàiguó yǔwén) 36, 83–92 (2020)

    Google Scholar 

  6. Niu, R., Osborne, T.: Chunks are components: a dependency grammar approach to the syntactic structure of Mandarin. Lingua 224, 60–83 (2019)

    Article  Google Scholar 

  7. Feng, Z.: Tessnier’s grammaire de dépendance (tèsīníyēěr de cóngshǔ guānxì yǔfǎ). Foreign Linguist. (guówài yǔyánxué) 63–65 (1983)

    Google Scholar 

  8. Tang, X., Xiao, L.: Research on micro-blog topics mining model on dependency parsing (jīyú yīcún jùfǎ fènxī de wēibó zhǔtí wājué móxíng yánjiū). Inf. Sci. (qíngbào kēxué) 33, 61–65 (2015)

    Google Scholar 

  9. Qiu, L., Zhang, Y., Jin, P., Wang, H.: Multi-view Chinese treebanking. In: Tsujii, J.I., Hajic, J. (eds.) Proceedings of COLING 2014, The 25th International Conference on Computational Linguistics: Technical Papers, pp. 257–268. Dublin City University and Association for Computational Linguistics, Dublin (2014)

    Google Scholar 

  10. Liu, T., Ma, J.: Theories and methods of Chinese automatic syntactic parsing: a critical survey (hànyǔ zìdòng jùfǎ fènxī de lǐlùn yǔ fāngfǎ). Contemp. Linguist. (dāngdài yǔyánxué) 11, 100–112 (2009)

    Google Scholar 

  11. Liu, H., Feng, Z.: Probabilistic valency pattern theory for natural language processing (zìrán yǔyán chùlǐ de gàilǜ pèijià móshì lǐlùn). Linguist. Sci. (yǔyán kēxué) 32–41 (2007)

    Google Scholar 

  12. Liu, H.: The complexity of Chinese syntactic dependency networks. Phys. A Stat. Mech. Appl. 387, 3048–3058 (2008)

    Article  Google Scholar 

  13. Jiang, J., Ouyang, J., Liu, H.: Interlanguage: a perspective of quantitative linguistic typology. Lang. Sci. 74, 85–97 (2019)

    Article  Google Scholar 

  14. Pan, X., Chen, X., Liu, H.: Harmony in diversity: the language codes in English-Chinese poetry translation. Digit. Scholarsh. Humanit. 33, 128–142 (2018)

    Article  Google Scholar 

  15. Gao, S., Yan, W., Liu, H.: A quantitative study on syntactic functions of Chinese verbs based on dependency treebank (jīyú shùkù de xiàndài hànyǔ dòngcí jùfǎ gōngnéng de jìliàng yánjiū). Chin. Lang. Learn. (hànyǔ xuéxí) 5, 105–112 (2010)

    Google Scholar 

  16. Liu, B., Niu, Y., Liu, H.: A comparative study on style-related differences in syntactic functions of part of speech (hànyǔ cílèi jùfǎ gōngnéng de yǔtǐ chàyì yánjiū). Lang. Teach. Linguist. Stud. (yǔyán jiāoxué yǔ yánjiū) 5, 97–104 (2013)

    Google Scholar 

  17. Li, H., Liu, H.: A study of Chinese children’s acquisition of three-word sentences based on an annotated corpus (jīyú jùfǎ biāozhù yǔliàokù de hànyǔ értóng sāncíjù xídé yánjiū). Appl. Linguist. (yǔyán wénzì yīngyòng) 107–116 (2017)

    Google Scholar 

  18. Chen, X.: Dependency network syntax: from dependency treebanks to a classification of Chinese function words. In: Hajicova, E., Gerdes, K., Wanner, L. (eds.) Proceedings of the Second International Conference on Dependency Linguistics (DepLing 2013), pp. 41–50. Charles University, Prague (2013)

    Google Scholar 

  19. Tu, J., Zhu, M.: Imperative sentences classification based on dependency grammar (jīyú yīcún yǔfǎ de qíshǐjù fènlèi yánjiū). Comput. Appl. Softw. (jìsuànjī yīngyòng yǔ ruǎnjiàn) 36, 279–283+322 (2019)

    Google Scholar 

  20. Liu, B., Xu, C.: Quantitative analysis on verb valence evolution of Chinese. In: Gerdes, K., Kahane, S. (eds.) Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, SyntaxFest 2019), pp. 175–180. Association for Computational Linguistics (2019)

    Google Scholar 

  21. Wang, S.: Chinese Multiword Expressions. Springer, Singapore (2020). https://doi.org/10.1007/978-981-13-8510-0

    Book  Google Scholar 

  22. Wang, S., Luo, H.: Exploring the meanings and grammatical functions of idioms in teaching Chinese as a second language. Int. J. Appl. Linguist. 31, 283–300 (2021). https://doi.org/10.1111/ijal.12334

    Article  Google Scholar 

  23. Wang, S., Wu, L., Gong, Q.: The collocations of Chinese tactile adjectives. In: Liu, M., Kit, C., Su, Qi. (eds.) CLSW 2020. LNCS (LNAI), vol. 12278, pp. 711–733. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-81197-6_59

    Chapter  Google Scholar 

  24. Wang, S., Yin, J.: A comparative study of the collocations in legislative Chinese and general Chinese. In: Hong, J.-F., Zhang, Y., Liu, P. (eds.) CLSW 2019. LNCS (LNAI), vol. 11831, pp. 710–724. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-38189-9_72

    Chapter  Google Scholar 

  25. Fu, H.: The similarities, differences and restrictions of the meanings of “kan” and “kanjian” (kàn hé kànjiàn děng cíyì de tóngyì hé zhìyuē). Chin. Lang. Learn. (hànyǔ xuéxí) 1–5 (1993)

    Google Scholar 

  26. Xu, S.: The research of the visual Sensory verbs like “kan” in syntactic and semantics (xiàndài hànyǔ kànlèi shìjué gǎnguān dòngcí jùfǎ yǔyì fènxī). Jilin University, Changchun (2015)

    Google Scholar 

  27. Yin, D.: The evolution of visual verbs in ancient times (shànggǔ kànshìlèi dòngcí de yǎnbiàn guīlǜ). Seeker (qiúsuǒ) 200–201 (2008)

    Google Scholar 

  28. Ma, B., Chen, W.: The sound and meaning association of “kan” and a comparative study of Chinese and English (kàn de yīnyì liánxiǎng jíqí hànyīng bǐjiào yánjiū). Foreign Lang. Res. (wàiyǔ yánjiū) 23–24 (2006)

    Google Scholar 

  29. Han, Y.: Semantic projection and grammaticalization of Chinese visual verbs (hànyǔ shìjiào dòngcí de yǔyì tóushè jí yǔfǎhuà gòunǐ). Foreign Lang. Lit. Stud. (wàiguó yǔyán wénxué) 10–13 (2003)

    Google Scholar 

  30. Zeng, L.: The subjectivisation of “wo-kan”and “ni-kan” (wǒkàn yǔ nǐkàn de zhǔguānhuà). Chin. Lang. Learn. (hànyǔ xuéxí) 15–22 (2005)

    Google Scholar 

  31. Cui, Y.: The grammaticalization path of Chinese visual verb “jian” and its evolutionary motivation and mechanism (hànyǔ shìjiào dòngcí jiàn de yǔfǎhuà lùjìng jíqí yǎnbiàn dòngyīn hé jīzhì). J. Heilongjiang Inst. Educ. (Hēilóngjiāng jiāoyù xuéyuàn xuébào) 33, 118–119 (2014)

    Google Scholar 

  32. Jiang, S., Chi, C.: From visual verbs to disposal prepositions: the grammaticalization and semantic map of “wang” in Wenzhou Dialect (cóng shìjué dòngcí dào chùzhì jiècí wēnzhōu fāngyán wàng de yǔfǎhuà hé yǔyì dìtú). J. Chin. Lang. Hist. (hànyǔshǐ xuébào) 1–11 (2018)

    Google Scholar 

  33. Liu, T., Che, W., Li, Z.: Language technology platform (yǔyán jìshù píngtái). J. Chin. Inf. Process. (zhōngwén xìnxī xuébào) 25, 53–63 (2011)

    Google Scholar 

  34. Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Brodley, C.E., Danyluk, A.P. (eds.) Proceedings of the 18th International Conference on Machine Learning 2001 (ICML 2001), pp. 282–289. Morgan Kaufmann Publishers Inc., San Francisco (2001)

    Google Scholar 

  35. Che, W., Li, Z., Li, Y., Guo, Y., Qin, B., Liu, T.: Multilingual dependency-based syntactic and semantic parsing. In: Stevenson, S., Carreras, X. (eds.) Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task, pp. 49–54. Association for Computational Linguistics, Boulder (2009)

    Google Scholar 

  36. Dai, Y.: Exploration of action bivalent verbs in modern Chinese (xiàndài hànyǔ dòng zuòlèi èrjià dòngcí tànsuǒ). Stud. Chin. Lang. (zhōngguó yǔwén) 3–12 (1998)

    Google Scholar 

  37. Guo, D.: Modern Chinese Verb Classification Dictionary (xiàndài hànyǔ dòngcí fènlèi cídiǎn). Jilin Education Press (jílín jiāoyù chūbǎnshè), Changchun (1994)

    Google Scholar 

  38. Dictionary Editing Room of Institute of Linguistics, Chinese Academy of Social Sciences: The Contemporary Chinese Dictionary. The Commercial Press (shāngwù yìnshūguǎn), Beijing (2016)

    Google Scholar 

  39. Meng, C.: Chinese Verb Usage Dictionary (hànyǔ dòngcí yòngfǎ cídiǎn). The Commercial Press (shāngwù yìnshūguǎn), Beijing (1999)

    Google Scholar 

  40. Yuan, Y., Cao, H.: An introduction to the syntactic-semantic knowledge-base of Chinese verbs (dòngcí jùfǎ yǔyì xìnxī cídiǎn zhīshí nèiróng shuōmíngshū). In: Sun, M., Li, S., Zhang, Y., Liu, Y., He, S., Rao, G. (eds.) The Nineteenth China National Conference on Computational Linguistics (CCL 2020), pp. 518–527. The University of Hainan, Hainan (2020)

    Google Scholar 

Download references

Acknowledgements

This research is funded by the University of Macau (Project Number: MYRG2019-00013-FAH).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shan Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, S., Liu, X., Zhou, J. (2022). Developing a Syntactic and Semantic Annotation Tool for Research on Chinese Vocabulary. In: Dong, M., Gu, Y., Hong, JF. (eds) Chinese Lexical Semantics. CLSW 2021. Lecture Notes in Computer Science(), vol 13250. Springer, Cham. https://doi.org/10.1007/978-3-031-06547-7_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-06547-7_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-06546-0

  • Online ISBN: 978-3-031-06547-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics