Abstract
This paper introduces an novel framework for speech understanding using extended context-free grammars (ECFGs) by combining statistical methods and rule based knowledge. By only using 1st level labels a considerable lower expense of annotation effort can be achieved. In this paper we derive hierarchical non-deterministic automata from the ECFGs, which are transformed into transition networks (TNs) representing all kinds of labels. A sequence of recognized words is hierarchically decoded by using a Viterbi algorithm. In experiments the difference between a hand-labeled tree bank annotation and our approach is evaluated. The conducted experiments show the superiority of our proposed framework. Comparing to a hand-labeled baseline system (\(\widehat{=} 100\%\)) we achieve 95,4 % acceptance rate for complete sentences and 97.8 % for words. This induces an accuray rate of 95.1 % and error rate of 4.9 %, respectively F1-measure 95.6 % in a corpus of 1 300 sentences.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Thomae, M., Fabian, T., Lieb, R., Ruske, G.: A One-Stage Decoder for Interpretation of Natural Speech. In: Proc. NLP-KE 2003, Beijing, China (2003)
Lieb, R., Ruske, G.: Natural Dialogue Behaviour Using Complex Information Services in Cars. Institute for Human Machine Communication, Techn. University, Munich, Germany, Tech. Rep (2003)
Thomae, M., Fabian, T., Lieb, R., Ruske, G.: Hierarchical Language Models for One-Stage Speech Interpretation. In: Proc. Eurospeech, Lisbon, Portugal (2005)
Ward, W.: The CMU Air Travel Information Service: Understanding Spontaneous Speech. In: Proc. Workshop on Speech and Natural Language. Hidden Valley, Pennsylvania, pp. 127–129 (1990)
Blackburn, P., Bos, J.: Representation and Inference for Natural Language. Leland Stanford Junior University: CLSI Publications (2005)
Wang, Y., Acero, A.: Combination of CFG and N-Gram Modeling in Semantic Grammar Learning. In: Proc. Interspeech (2003)
Ward, W., Issar, S.: Recent Improvements in the CMU Spoken Language Understanding System. In: Proc. of ARPA Human Language Technology Workshop, pp. 213–216 (1994)
Brüggemann-Klein, A., Wood, D.: The Parsing of Extended Context-Free Grammars. In: HKUST Theoretical Computer Science Center Research Report, no. 08 (2002)
Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to automata theory, languages and computation. Addison-Wesley, Reading (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schwärzler, S., Schenk, J., Wallhoff, F., Ruske, G. (2008). Natural Language Understanding by Combining Statistical Methods and Extended Context-Free Grammars. In: Rigoll, G. (eds) Pattern Recognition. DAGM 2008. Lecture Notes in Computer Science, vol 5096. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69321-5_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-69321-5_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69320-8
Online ISBN: 978-3-540-69321-5
eBook Packages: Computer ScienceComputer Science (R0)