Subjective Tests and Automatic Sentence Modality Recognition with Recordings of Speech Impaired Children

Sztaho, David; Nagy, Katalin; Vicsi, Klara

doi:10.1007/978-3-642-12397-9_34

David Sztaho²⁰,
Katalin Nagy²⁰ &
Klara Vicsi²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5967))

2460 Accesses
1 Citations

Abstract

Prosody recognition experiments have been prepared in the Laboratory of Speech Acoustics, in which, among the others, we were searching for the possibilities of the recognition of sentence modalities. Due to our promising results in the sentence modality recognition, we adopted the method for children modality recognition, and looked for the possibility, how it can be used as an automatic feedback in an audio - visual pronunciation teaching and training system. Our goal was to develop a sentence intonation teaching and training system for speech handicapped children, helping them to learn the correct prosodic pronunciation of sentence. HMM models of modality types were built by training the recognizer with a correctly speaking children database. During the present work, a large database was collected from speech impaired children. Subjective tests were carried out with this database of speech impaired children, in order to examine how human listeners are able to categorize the heard recordings of sentence modalities. Then automatic sentence modality recognition experiments were done with the formerly trained HMM models. By the result of the subjective tests, the probability of acceptance of the sentence modality recognizer can be adjusted. Comparing the result of the subjective tests and the results of the automatic sentence modality recognition tests processed on the database of speech impaired children, it is showed that the automatic recognizer classified the recordings more strictly, but not worse. The introduced method could be implemented as a part of a speech teaching system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Speech Processing and Prosody

Investigating the Recognition of Non-articulatory Sounds by Using Statistical Tests and Support Vector Machine

Perceptual Analysis by Adults of the Speech of Children with Autism Spectrum Disorders, Down’s Syndrome, and Intellectual Disabilities

Article 01 May 2022

References

Vicsi, K.: Computer-Assisted Pronunciation Teaching and Training Methods Based on the Dynamic Spectro-Temporal Characteristics of Speech. In: Dynamics of Speech Production and Perception, pp. 283–304. IOS Press, Amsterdam (2006)
Google Scholar
de Bot, K.: Visual feedback of intonation: Effectiveness and induced practice behavior. Lang. Speech 26(4), 331–335 (1983)
Google Scholar
James, E.: The acquisition of prosodic features of speech using a speech visualizer. IRAL 14(3), 227–243 (1976)
Article Google Scholar
Vicsi, K., Csatári, F., Bakcsi, Z., Tantos, A.: Distance score evaluation of the visualized speech spectra at audio-visual articulation training. In: Proc. Eurospeech, pp. 1911–1914 (1999)
Google Scholar
ISTRA Indiana Speech Training Aid Features. Bloomington, IN: Communication Disorders Technology, Inc. (2003), http://www.comdistec.com/istra_faq.shtml
Vicsi, K., Szaszák, Gy.: Using Prosody for the Imporvement of ASR - Sentence Modality Recognition. In: Proc. of Interspeech2008, Bristol, ISCA Archive (2008), http://www.isca-speech.org/archive
The Snack Sound Toolkit, http://www.speech.kth.se/snack/
HTK Speech Recognition Toolkit, http://htk.eng.cam.ac.uk/
Szaszák, Gy., Vicsi, K.: Speech recognition supported by prosodic information for fixed stress languages. In: Proceeding of TSD conference Brno, pp. 262–269 (2000)
Google Scholar
Szaszák, Gy., Vicsi, K.: Using prosody in fixed stress languages for improvement of speech recognition. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 138–149. Springer, Heidelberg (2007)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Speech Acoustics, Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, Stoczek u. 2, 1111, Budapest, Hungary
David Sztaho, Katalin Nagy & Klara Vicsi

Authors

David Sztaho
View author publications
You can also search for this author in PubMed Google Scholar
Katalin Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Klara Vicsi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Second University of Naples, and IIASS, Via Pellegrino, 84019, Vietri sul Mare, SA, Italy
Anna Esposito
Centre for Language and Communication Studies, Trinity College, The University of Dublin, Dublin 2, Ireland
Nick Campbell & Carl Vogel &
Department of Computing Science & Mathematics, University of Stirling, FK9 4LA, Stirling, Scotland, UK
Amir Hussain
Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, P.O. Box 217, 7500 AE, Enschede, The Netherlands
Anton Nijholt

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sztaho, D., Nagy, K., Vicsi, K. (2010). Subjective Tests and Automatic Sentence Modality Recognition with Recordings of Speech Impaired Children. In: Esposito, A., Campbell, N., Vogel, C., Hussain, A., Nijholt, A. (eds) Development of Multimodal Interfaces: Active Listening and Synchrony. Lecture Notes in Computer Science, vol 5967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12397-9_34

Download citation

DOI: https://doi.org/10.1007/978-3-642-12397-9_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12396-2
Online ISBN: 978-3-642-12397-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics