A Novel Method for Constructing 3D Geometric Articulatory Models

Wei, Jianguo; Liu, Jie; Fang, Qiang; Lu, Wenhuan; Dang, Jianwu; Honda, Kiyoshi

doi:10.1007/s11265-015-1002-8

A Novel Method for Constructing 3D Geometric Articulatory Models

Published: 07 May 2015

Volume 82, pages 295–302, (2016)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Jianguo Wei^1,3,
Jie Liu³,
Qiang Fang²,
Wenhuan Lu¹,
Jianwu Dang^3,4 &
…
Kiyoshi Honda³

389 Accesses
6 Citations
Explore all metrics

Abstract

This study describes a novel method of constructing a geometric articulatory model based on magnetic resonance imaging data by taking the physiological boundaries of speech apparatus into account. Two improvements have been made to the modeling process: i) Images taken from different viewpoints are combined to improve the accuracy of outline annotation. ii) Speech organs’ meshes are modeled with reference to the anatomical structures. Both qualitative and quantitative evaluations indicated that the proposed method surpasses the conventional method. Based on the meshes of the speech organs associated with different articulations, the linear component analysis was used to extract the control parameters. Each speech organ can be described using three control parameters or fewer. After the reconstruction, the average error between model and real data was less than 1.0 mm. This is also the first effort made to construct a 3D vocal tract model based on Chinese MRI data. It will facilitate the theoretical study and practical use in Chinese-speech-production related issues.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-modal recording and modeling of vocal tract movements

Article 14 December 2015

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Segmentation and Analysis of Vocal Tract from MidSagittal Real-Time MRI

References

Guenther, F. H. (1994). A neural network model of speech acquisition and motor equivalent speech production. Biological Cybernetics, 72, 43–53.
Article Google Scholar
Guenther, F. H. (1995). Speech sound acquisition, coarticulation, and rate effects in a neural network model of speech production. Psychological Review, 102, 594–621.
Article Google Scholar
Guenther, F. H. (2006). Cortical interactions underlying the production of speech sounds. Journal of Communication Disorders, 39, 350–365.
Article Google Scholar
Fang, Q., et al. (2008). Investigation of the functional relationship of tongue muscles for the control of a physioloigcal articulatory model. in The 8th national conference of Phonetics. Beijing, China.
Fang, Q., Nishikido, A., & Dang, J. (2009). Feedforward control of a 3D physiological articulatory model for vowel production. Tsinghua Science and Technology, 14(5), 617–622.
Dang, J., & Honda, K. (2004). Construction and control of a physiological articulatory model. Journal of the Acoustical Society of America, 115(2), 853–870.
Article Google Scholar
Birkholz, P., Jackèl, D., & Kröger, B. J. (2007). Simulation of losses due to turbulence in the time-varying vocal system. IEEE Transactions on Audio, Speech and Language Processing, 15(4), 1218–1226.
Article Google Scholar
Perrier, P., Ma, L. & Payan, Y. (2005). Modeling the production of VCV sequences via the inversion if a biomechanical model of the tongue. in INTERSPEECH 2005. Lisbon, Portugal.
Mermelstein, P. (1973). Articulatory model for the study of speech production. Journal of the Acoustical Society of America, 53, 1070–1082.
Article Google Scholar
Maeda, S. (1990). Compensatory articulation during speech: evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model. In W. J. Hardcastle and A. Marchal (Eds.), Speech Production and Speech Modelling, Kluwer Academic, Dordrecht, 131–149.
Badin, P., & Serrurier, A. (2006). Three-dimensional modeling of speech organs: articulatory data and models. Transactions on Technical Committee of Psychological and Physiological Acoustics, 365(H-2006-77), 421–426.
Google Scholar
Engwall, O. (2003). Combining MRI, EMA and EPG measurements in a three-dimensional tongue model. Speech Communication, 41, 303–329.
Article Google Scholar
Rubin, P., & Baer, T. (1981). An articulatory synthesizer for perception research. Journal of the Acoustical Society of America, 70(2), 321–328.
Article Google Scholar
Birkholz, P., Jackèl, D., & Kröger, B. J. (2006). Construction and control of a three-dimensional vocal tract model, in ICASSP. p. 873–876.
Beautemps, D., Badin, P., & Bailly, G. (2001). Linear degrees of freedom in speech production: analysis of cineradio- and labio-film data and articulatory-acoustic modeling. Journal of the Acoustical Society of America, 109(5), 2165–2180.
Article Google Scholar
Badin, P., et al. (1998). A threedimensional linear articulatory model based on MRI data, in The 3rd ESCA/COCOSDA International Workshop on Speech Synthesis. p. 249–254.
Badin, P., et al. (2002). Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images. Journal of Phonetics, 30(3), 533–553.
Article Google Scholar
Masaki, S., Tiede, M. K., et al. (1996). MRI-based speech production study using a synchronized sampling method. Journal of Acoustic Society Japan (E), 20, 375–379.
Article Google Scholar
Beautemps, D., et al. (1996). Evaluation of an articulatory-acoustic model based on a reference subject, in The 4th ISSP.

Download references

Acknowledgments

This work was supported by the National Natural Science-Foundation of China (No. 61175016,61304250), Key Fund projects of 61233009 and financial support from CASS Innovation Project “teaching pronunciation models for speech research”.

Author information

Authors and Affiliations

School of Computer Software, Tianjin University, Tianjin, China
Jianguo Wei & Wenhuan Lu
Phonetics Lab., Institute of Linguistics, Chinese Academy of Social Sciences, Beijing, China
Qiang Fang
Tianjin Key Lab. of Cognitive Computing and Application, Tianjin University, Tianjin, China
Jianguo Wei, Jie Liu, Jianwu Dang & Kiyoshi Honda
School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan
Jianwu Dang

Authors

Jianguo Wei
View author publications
You can also search for this author in PubMed Google Scholar
Jie Liu
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Fang
View author publications
You can also search for this author in PubMed Google Scholar
Wenhuan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Jianwu Dang
View author publications
You can also search for this author in PubMed Google Scholar
Kiyoshi Honda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiang Fang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wei, J., Liu, J., Fang, Q. et al. A Novel Method for Constructing 3D Geometric Articulatory Models. J Sign Process Syst 82, 295–302 (2016). https://doi.org/10.1007/s11265-015-1002-8

Download citation

Received: 15 November 2014
Revised: 06 March 2015
Accepted: 15 April 2015
Published: 07 May 2015
Issue Date: February 2016
DOI: https://doi.org/10.1007/s11265-015-1002-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Novel Method for Constructing 3D Geometric Articulatory Models

Abstract

Access this article

Similar content being viewed by others

Multi-modal recording and modeling of vocal tract movements

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Segmentation and Analysis of Vocal Tract from MidSagittal Real-Time MRI

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Novel Method for Constructing 3D Geometric Articulatory Models

Abstract

Access this article

Similar content being viewed by others

Multi-modal recording and modeling of vocal tract movements

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Segmentation and Analysis of Vocal Tract from MidSagittal Real-Time MRI

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation