Abstract
Natural input dialog systems for entering address data in modern GPS units demand a significantly more robust extraction of slot information. In previous work, prosody was used to detect phrase boundaries (PB) which separate particular address parts. Only input without filler words was used. In this work, carrier sentences are allowed. A boosting approach provides twenty strong prosodic features which model the characteristic of PBs. We introduce the concept of a prosodically marked word boundary (PMB), which enables a better location of the provided information in natural input. Our results on a dataset of 5883 input samples reveal that about 67% of the found PBs indicate a PMB, while most of the remaining boundaries occur within compound words.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kaufhold, C., Nöth, E.: Using prosodic features for predicting phrase boundaries. In: Speech Prosody (2010)
Gorin, A.L., Riccardi, G., Wright, J.H.: How may I help you? Speech Communication 23, 113–127 (1997)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11 (2009)
Kießling, A.: Extraktion und Klassifikation prosodischer Merkmale in der automatischen Sprachverarbeitung. PhD thesis, Pattern Recognition Lab, University of Erlangen-Nuremberg (1996)
Batliner, A., Buckow, J., Niemann, H., Nöth, E., Warnke, V.: The prosody module. In: Wahlster, W. (ed.) Verbmobil: Foundations of Speech-to-Speech Translations, pp. 106–121. Springer, Heidelberg (2000)
Wahlster, W.: Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Germany (2000)
Maier, A., Hönig, F., Zeissler, V., Batliner, A., Körner, E., Yamanaka, N., Ackerman, P., Nöth, E.: A language-independent feature set for the automatic evaluation of prosody. In: Interspeech (2009)
Hönig, F., Batliner, A., Weilhammer, K., Nöth, E.: Islands of failure: Employing word accent information for pronunciation quality assessment of English l2 learners. In: Interspeech (2009)
Selting, M.: Lists as embedded structures and the prosody of list construction as an interactional resource. Journal of Pragmatics 39, 483–526 (2007)
Shukla, M., Nespor, M., Mehler, J.: An interaction between prosody and statistics in the segmentation of fluent speech. Cognitive Psychology 54, 1–32 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kaufhold, C., Stemmer, G., Nöth, E. (2011). Reliable Detection of Important Word Boundaries Using Prosodic Features. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science(), vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-23538-2_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23537-5
Online ISBN: 978-3-642-23538-2
eBook Packages: Computer ScienceComputer Science (R0)