Abstract
This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Martin, P.: Prosodic and rhythmic structures in French. Linguistics 25, 925–949 (1987)
Hupin, B., Simon, A.C.: Analyse phonostylistique du discours radiophonique. Expériences sur la mise en fonction professionnelle du phonostyle et sur le lien entre mélodicité et proximité du discours radiophonique. Recherches en communication 28, 103–121 (2009)
Goldman, J.-P., Auchlin, A., Simon, A.C., Avanzi, M.: Phonostylographe: un outil de description prosodique. Comparaison du style radiophonique et lu. Nouveaux Cahiers de Linguistique Franaise 28, 219–237 (2008)
Lacheret-Dujour, A., Obin, N., Avanzi, M.: Design and Evaluation of Shared Prosodic Annotation for French Spontaneous Speech: From Experts Knowledges to Non-Experts Annotations. In: Proceedings of the 4th Linguistic Annotation Workshop, Uppsala, Sweden (2010)
Segal, N., Bartkova, K.: Prosodic structure representation for boundary detection in spontaneous French. In: Proceedings of ICPhS 2007, Saarbrcken, pp. 1197–1200 (2007)
’t Hart, J., Collier, R., Cohen, A.: A Perceptual Study of Intonation. Cambridge U.P., London (1990)
Galliano, S., Gravier, G., Chaubard, L.: The Ester 2 evaluation campaign for rich transcription of French broadcasts. In: Proc. INTERSPEECH 2009, Brighton, UK, pp. 2583–2586 (2009)
Gravier, G., Adda, G., Paulsson, N., Carr, M., Giraudel, A., Galibert, O.: The ETAPE corpus for the evaluation of speech-based TV content processing in the French language. In: Proc. LREC 2012, Istanbul, Turkey (2012)
Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; extended advanced front-end feature extraction algorithm; compression Algorithms, ETSI ES 202 212 (2005)
de Calmès, M., Pérennou, G.: BDLEX: a Lexicon for Spoken and Written French. In: Proc. LREC 1998, Grenade, pp. 1129–1136 (1998)
Jouvet, D., Fohr, D., Illina, I.: Evaluating grapheme-to-phoneme converters in automatic speech recognition context. In: Proc. ICASSP 2012, Kyoto, Japan, pp. 4821–4824 (2012)
Sphinx (2011), http://cmusphinx.sourceforge.net/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Bartkova, K., Jouvet, D. (2013). Automatic Detection of the Prosodic Structures of Speech Utterances. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-01931-4_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer ScienceComputer Science (R0)