Abstract
Dependency analysis is vital for spoken language understanding in spoken dialogue systems. However, existing research has mainly focused on western spoken languages, Japanese, and so on. Little research has been done for spoken Chinese in terms of dependency parsing. Therefore, the new spoken corpus, D-ESCSC (Dependency-Expressive Speech Corpus of Standard Chinese) is built by adding new dependency relations special to spoken Chinese based on a written Chinese annotation scheme. Since spoken Chinese contains typical ill-grammatical phenomena, e.g., translocation, repetition, duplication, and omission, the new atom feature related to punctuation and three feature templates are proposed to improve a graph-based dependency parser. Experimental results on spoken Chinese corpus show that the atom feature and three templates really work and the new parser outperforms the baseline parser. To our best knowledge, it is the first work to report dependency parsing results of spoken Chinese.
- Frederic Bechet, Alexis Nasr, and Benoit Favre. 2014. Adapting dependency parsing to spontaneous speech for open domain spoken language understanding. In Proceedings of the 15th Annual Conference of the International Speech Communication Association. 135--139.Google Scholar
- Bernd Bohnet. 2010. Very high accuracy and fast dependency parsing is not a contradiction. In Proceedings of the 23rd International Conference on Computational Linguistics. 89--97. Google ScholarDigital Library
- Xavier Carreras. 2007. Experiments with a higher-order projective dependency parser. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 957--961.Google Scholar
- Wanxiang Che, Zhenghua Li, and Ting Liu. 2010. LTP: A Chinese language technology platform. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations. 13--16. Google ScholarDigital Library
- Anoop Deoras and Ruhi Sarikaya. 2013. Deep belief network based semantic taggers for spoken language understanding. In Proceedings of the 14th Annual Conference of the International Speech Communication Association. 2713--2717.Google Scholar
- Timothy Dozat and Christopher D. Manning. 2017. Deep biaffine attention for neural dependency parsing. In Proceedings of the 5th International Conference on Learning Representations.Google Scholar
- Ryan McDonald Hao Zhang. 2014. Enforcing structural diversity in cube-pruned dependency parsing. In Proceedings of the ACL. 656--661.Google Scholar
- Larry Heck and Dilek Hakkani-Tur. 2012. Exploiting the semantic web for unsupervised spoken language understanding. In Proceedings of the Conference on Spoken Language Technology Workshop. 228--233.Google ScholarCross Ref
- Collins M. Koo, Carreras Pérez X. 2008. Simple semi-supervised dependency parsing. In Proceedings of the ACL. 595--603.Google Scholar
- Terry Koo and Michael Collins. 2010. Efficient third-order dependency parsers. In Proceedings of the ACL. 1--11. Google ScholarDigital Library
- Shuhei Kurita, Daisuke Kawahara, and Sadao Kurohashi. 2017. Neural joint model for transition-based Chinese syntactic analysis. In Proceedings of the ACL. 1204--1214.Google ScholarCross Ref
- Zhenghua Li, Ting Liu, and Wanxiang Che. 2012. Exploiting multiple treebanks for parsing with quasi-synchronous grammars. In Proceedings of the ACL. 675--684. Google ScholarDigital Library
- Zhenghua Li, Min Zhang, and Wenliang Chen. 2014. Ambiguity-aware ensemble training for semi-supervised dependency parsing. In Proceedings of the ACL. 457--467.Google ScholarCross Ref
- Ting Liu, Jinshan Ma, and Sheng Li. 2009. Building a dependency treebank for improving Chinese parser. Journal of Chinese Language and Computing 16, 4 (2009), 207--224.Google Scholar
- Xuefei Liu, Aijun Li, Yuan Jia, and Yiqing Zu. 2014. Syntactic annotation under dependency scheme on Chinese spontaneous speech. In Proceedings of the Conference on Co-ordination and Standardization of Speech Databases and Assessment Techniques. 1--6.Google Scholar
- Jianming Lu. 1980. The phenomenon of translocation in Chinese spoken language. Chinese Language 2 (1980), 28--41.Google Scholar
- Shigeki Matsubara, Nobuo Kawaguchi, and Yasuyoshi Inagaki. 2005. Robust dependency parsing of spontaneous Japanese spoken language. IEICE Transactions on Information and Systems 88, 3 (2005), 545--552. Google ScholarDigital Library
- Shigeki Matsubara, Shinichi Kimura, Nobuo Kawaguchi, Yukiko Yamaguchi, and Yasuyoshi Inagaki. 2002a. Example-based speech intention understanding and its application to in-car spoken dialogue system. In Proceedings of the 19th International Conference on Computational Linguistics. 1--7. Google ScholarDigital Library
- Shigeki Matsubara, Takahisa Murase, Nobuo Kawaguchi, and Yasuyoshi Inagaki. 2002b. Stochastic dependency parsing of spontaneous Japanese spoken language. In Proceedings of the 19th International Conference on Computational Linguistics. 1--6. Google ScholarDigital Library
- Ryan McDonald, Koby Crammer, and Fernando Pereira. 2005a. Online large-margin training of dependency parsers. In Meeting on Association for Computational Linguistics. 91--98. Google ScholarDigital Library
- Ryan McDonald, Fernando Pereira, Kiril Ribarov, and Jan Hajič. 2005b. Non-projective dependency parsing using spanning tree algorithms. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. 523--530. Google ScholarDigital Library
- Ryan T. McDonald and Fernando C. N. Pereira. 2006. Online learning of approximate dependency parsing algorithms. In Proceedings of European Chapter of the Association for Computational Linguistics. 81--88.Google Scholar
- Russell Moore, Andrew Caines, Calbert Graham, and Paula Buttery. 2015. Incremental dependency parsing and disfluency detection in spoken learner English. In Proceedings of the Conference on Text, Speech, and Dialogue. Springer, 470--479. Google ScholarDigital Library
- Joakim Nivre. 2003. An efficient algorithm for projective dependency parsing. In Proceedings of the 8th International Workshop on Parsing Technologies. 149--160.Google Scholar
- Joakim Nivre. 2007. Dependency parsing of spoken Swedish. Communication--Action--Meaning. A Festschrift to Jens Allwood (2007), 203--211.Google Scholar
- E. W. Noreen. 1989. Computer-intensive methods for testing hypotheses: An introduction. Computer (1989).Google Scholar
- Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Naoto Kato, and Yasuyoshi Inagaki. 2005. Incremental dependency parsing of japanese spoken monologue based on clause boundaries. In Proceedings of the 9th European Conference on Speech Communication and Technology. 3449--3452.Google Scholar
- Li Wang. 2007. A Preliminary Discussion on the Characteristics of Spoken Chinese. Master’s thesis. Tianjin Normal University, Tianjin, China.Google Scholar
- Xia Wang, Aijun Li, and Jianhua Tao. 2007. An expressive speech corpus of standard Chinese. In Proceedings of O-COCOSDA.Google Scholar
- Mohammed Sidi Yakoub, Sid-Ahmed Selouani, and Roger Nkambou. 2015. Mobile spoken dialogue system using parser dependencies and ontology. International Journal of Speech Technology 18, 3 (2015), 449--457. Google ScholarDigital Library
- Hiroyasu Yamada and Yuji Matsumoto. 2003. Statistical dependency analysis with support vector machines. In Proceedings of International Workshop on Parsing Technologies, Vol. 3. 195--206.Google Scholar
- Meishan Zhang, Yue Zhang, Wanxiang Che, and Ting Liu. 2013. Chinese parsing exploiting characters. In Proceedings of the ACL. 125--134.Google Scholar
- Yue Zhang and Joakim Nivre. 2011. Transition-based dependency parsing with rich non-local features. In Proceedings of the Association for Computational Linguistics: Human Language Technologies. 188--193. Google ScholarDigital Library
Index Terms
A Dependency Parser for Spontaneous Chinese Spoken Language
Recommendations
Robust Dependency Parsing of Spontaneous Japanese Spoken Language
Spontaneously spoken Japanese includes a lot of grammatically ill-formed linguistic phenomena such as fillers, hesitations, inversions, and so on, which do not appear in written language. This paper proposes a novel method of robust dependency parsing ...
Incremental Dependency Parsing and Disfluency Detection in Spoken Learner English
TSD 2015: Proceedings of the 18th International Conference on Text, Speech, and Dialogue - Volume 9302This paper investigates the suitability of state-of-the-art natural language processing NLP tools for parsing the spoken language of second language learners of English. The task of parsing spoken learner-language is important to the domains of ...
Neural Character-Level Syntactic Parsing for Chinese
In this work, we explore character-level neural syntactic parsing for Chinese with two typical syntactic formalisms: the constituent formalism and a dependency formalism based on a newly released character-level dependency treebank. Prior works in Chinese ...
Comments