Abstract
Semantic role labeling (SRL) is a fundamental task in Chinese language processing, but there are three major problems about the construction of SRL corpora. First, disagreements occurred in previous studies over the definition and number of semantic roles. Second, it is hard for static predicate frames to cover dynamic predicate usages. Third, it is unable to annotate the dropped semantic roles. Abstract Meaning Representation (AMR) is a new method which provides a better solution to the above problems. The researchers use 5,000 sentences in the Chinese AMR corpus to make a comparison between AMR and other SRL resources. Data analysis shows that within the framework of AMR, it is easier to annotate semantic roles based on simplified distinction between core and non-core roles. In addition, 1,045 tokens of dropped roles are annotated under this new framework. This study indicates that AMR offers a better solution for Chinese SRL and sentence meaning processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
The current CAMR corpus contains 10,149 sentences, and has been published at https://catalog.ldc.upenn.edu/LDC2019T07.
- 4.
The difference < 0 also contains the case of core roles being dropped, and the difference > 0 also contains the case of core roles having not being annotated. But these cases are negligible because they are few in number.
- 5.
Predicates without core roles in CAMR corpus are hard to be separated from other words, so the researchers ignore them currently.
References
Xue, N.: A Chinese semantic lexicon of senses and roles. Lang. Resour. Eval. 40(3–4), 395–403 (2006)
Kipper, K., Dang, H., Palmer, M.: Class-based construction of a verb lexicon. In: Proceedings of the 17th National Conference on Artificial Intelligence and 12th Conference on Innovative Applications of Artificial Intelligence, pp. 691–696 (2000)
Chen, K., Luo, C., Chang, M., et al.: Sinica Treebank. In: Abeillé, A. (ed.) Treebanks. Text, Speech and Language Technology, vol. 20, pp. 231–248. Springer, Dordrecht (2003). https://doi.org/10.1007/978-94-010-0201-1_13
Yuan, Y.: The fineness hierarchy of semantic roles and its application in NLP. J. Chin. Inf. Process. 21(4), 10–20 (2007)
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley framenet project. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, pp. 86–90 (1998)
Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: an annotated corpus of semantic roles. Comput. Linguist. 31(1), 71–106 (2005)
Xue, N., Palmer, M.: Adding semantic roles to the Chinese Treebank. Nat. Lang. Eng. 15(1), 143–172 (2009)
Banarescu, L., Bonial, C., Cai, S., et al.: Abstract meaning representation for sembanking. In: Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, pp. 178–186 (2013)
Li, B., Wen, Y., Bu, L., et al.: Annotating the Little Prince with Chinese AMRs. In: Proceedings of the 10th Linguistic Annotation Workshop (2016)
Bai, X., Xue, N.: Generalizing the semantic roles in the Chinese proposition bank. Lang. Resour. Eval. 50(3), 643–666 (2016)
Li, B., Wen, Y., Song, L., et al.: Construction of Chinese abstract meaning representation corpus with concept-to-word alignment. J. Chin. Inf. Process. 31(6), 93–102 (2017)
Weischedel, R., Hovy, E., Marcus, M., et al.: OntoNotes: a large training corpus for enhanced processing. In: Handbook of Natural Language Processing and Machine Translation (2011)
Cai, S., Knight, K.: Smatch: an evaluation metric for semantic feature structures. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 748–752 (2012)
Li, B., Wen, Y., Bu, L., et al.: A comparative analysis of the AMR graphs from english and Chinese corpus of the Little Prince. J. Chin. Inf. Process. 31(1), 50–57 (2017)
Xue, N., Xia, F., Chiou, F.D., et al.: The Penn Chinese Treebank: phrase structure annotation of a large corpus. Nat. Lang. Eng. 11(2), 207–238 (2005)
Acknowledgements
We thank the reviewers. This work is partially supported by project 18BYY127 under the National Social Science Foundation of China, project 61472191 under the National Science Foundation of China, and Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Song, L., Wen, Y., Ge, S., Li, B., Qu, W. (2020). An Easier and Efficient Framework to Annotate Semantic Roles: Evidence from the Chinese AMR Corpus. In: Hong, JF., Zhang, Y., Liu, P. (eds) Chinese Lexical Semantics. CLSW 2019. Lecture Notes in Computer Science(), vol 11831. Springer, Cham. https://doi.org/10.1007/978-3-030-38189-9_49
Download citation
DOI: https://doi.org/10.1007/978-3-030-38189-9_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38188-2
Online ISBN: 978-3-030-38189-9
eBook Packages: Computer ScienceComputer Science (R0)