Visualization of Prosodic Knowledge Using Corpus Driven MEMOInt Intonation Modelling

Escudero-Mancebo, David; Cardeñoso-Payo, Valentín

doi:10.1007/11846406_81


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4188)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

In this work we show how MEMOInt, our corpus-driven intonation modelling methodology, can help in the graphical visualization of the complex relationships between the different prosodic features that shape the intonational aspects of natural speech. MEMOInt has already been used successfully to predict synthetic F0 contours in the presence of the usual data scarcity problems. Here we report on the possibilities of using the information gathered in the modelling phase to provide a graphical view of the relevance of the various prosodic features that affect typical F0 movements. The set of classes that group the intonation patterns found in the corpus can be structured in a tree that hierarchically relates the classes to the prosodic features of the input text. This visual outcome proves very useful for carrying out comparative linguistic studies of prosodic phenomena and for checking the correspondence between previous prosodic knowledge of a language and the real utterances found in a given corpus.

This work has been partially sponsored by the Spanish Government (MCYT project TIC2003-08382-C05-03) and by the Consejería de Educación (JCYL project VA053A05).
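
The idea of arranging intonation classes in a tree keyed by prosodic features can be illustrated with a small sketch. This is not the MEMOInt procedure, whose details are not given in the abstract; it is a generic greedy tree that, at each node, splits on the feature giving the lowest within-class scatter of the F0 contours. The corpus, the feature names (`sentence`, `position`), the three-sample contour representation and all function names are assumptions made up for the example.

```python
from statistics import mean

# Hypothetical toy corpus: prosodic feature labels paired with a coarse
# F0 contour (three samples in Hz). All values are invented.
CORPUS = [
    ({"sentence": "declarative", "position": "final"}, [120.0, 110.0, 90.0]),
    ({"sentence": "declarative", "position": "final"}, [125.0, 112.0, 95.0]),
    ({"sentence": "declarative", "position": "initial"}, [130.0, 150.0, 140.0]),
    ({"sentence": "declarative", "position": "initial"}, [128.0, 148.0, 138.0]),
    ({"sentence": "interrogative", "position": "final"}, [120.0, 140.0, 180.0]),
    ({"sentence": "interrogative", "position": "final"}, [118.0, 138.0, 176.0]),
    ({"sentence": "interrogative", "position": "initial"}, [132.0, 152.0, 141.0]),
    ({"sentence": "interrogative", "position": "initial"}, [131.0, 149.0, 139.0]),
]


def centroid(items):
    """Mean contour of a class, sample by sample."""
    return [mean(samples) for samples in zip(*(contour for _, contour in items))]


def scatter(items):
    """Sum of squared distances from each contour to the class centroid."""
    centre = centroid(items)
    return sum((v - c) ** 2 for _, contour in items for v, c in zip(contour, centre))


def partition(items, feature):
    """Group items by the value they take for one prosodic feature."""
    groups = {}
    for feats, contour in items:
        groups.setdefault(feats[feature], []).append((feats, contour))
    return groups


def build_tree(items, features):
    """Greedily split on the feature that minimises within-class scatter."""
    node = {"size": len(items), "centroid": centroid(items),
            "split": None, "children": {}}
    best = None
    for feature in features:
        groups = partition(items, feature)
        if len(groups) < 2:
            continue  # this feature does not separate the items
        cost = sum(scatter(group) for group in groups.values())
        if best is None or cost < best[0]:
            best = (cost, feature, groups)
    if best is not None and best[0] < scatter(items):
        _, feature, groups = best
        node["split"] = feature
        remaining = [f for f in features if f != feature]
        node["children"] = {value: build_tree(group, remaining)
                            for value, group in sorted(groups.items())}
    return node


def render(node, label="all", depth=0):
    """Return the tree as indented text lines, one class per line."""
    shape = ", ".join(f"{v:.0f}" for v in node["centroid"])
    lines = [f"{'  ' * depth}{label} (n={node['size']}, F0=[{shape}])"]
    for value, child in node["children"].items():
        lines.extend(render(child, f"{node['split']}={value}", depth + 1))
    return lines


tree = build_tree(CORPUS, ["sentence", "position"])
print("\n".join(render(tree)))
```

In such a view, features chosen near the root are the ones that separate the contour classes most strongly in the given data, which is the kind of relevance ordering a tree visualization makes readable at a glance.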

Author information

Authors and Affiliations

University of Valladolid, Valladolid, 47014, Spain
David Escudero-Mancebo & Valentín Cardeñoso-Payo

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Botanická 68a, CZ-602 00, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Department of Computer Graphics and Design, Masaryk University, Botanická 68a, 60200, Brno, Czech Republic
Karel Pala

About this paper

Cite this paper

Escudero-Mancebo, D., Cardeñoso-Payo, V. (2006). Visualization of Prosodic Knowledge Using Corpus Driven MEMOInt Intonation Modelling. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_81

DOI: https://doi.org/10.1007/11846406_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer Science, Computer Science (R0)
