Articulation Rate Recognition by Using Artificial Neural Networks

Szczurowska, Izabela; Kuniszyk-Jóźkowiak, Wiesława; Smołka, Elzbieta

doi:10.1007/978-3-540-75175-5_95

Articulation Rate Recognition by Using Artificial Neural Networks

Izabela Szczurowska³,
Wiesława Kuniszyk-Jóźkowiak⁴ &
Elzbieta Smołka⁴

Conference paper

553 Accesses

Part of the book series: Advances in Soft Computing ((AINSC,volume 45))

Abstract

This works concerns the problem of the application of artificial neural networks in the modelling of the hearing process. The aim of the research was to answer the question whether artificial neural networks are able to evaluate speech rate. Speech samples, first recorded during reading of a story with normal and next with slow articulation rate were used as research material. The experiment proceeded in two phases. In the first stage Kohonen network was used. The purpose of that network was to reduce the dimensions of the vector describing the input signals and to obtain the amplitude-time relationship. As a result of the analysis, an output matrix consisting of the neurons winning in a particular time frame was received. The matrix was taken as input for the following networks in the second phase of the experiment. Various types of artificial neural networks were examined with respect to their ability to classify correctly utterances with different speech rates into two groups. Good examination results were accomplished and classification correctness exceeded 88%.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Koopmans-vanBeinum FJ, vanDonzel ME (1996) Relationship between discourse structure and dynamic speech rate. In: Bunnell HT, Idsardi W (Eds), Proceedings ICSLP96, Vol 3, 1724–1727
Google Scholar
Zheng J, Franco H, Stolcke A (2000) Rate-dependent acoustic modeling for large vocabulary conversational speech recognition. In: Proceeding ISCA Tutorial and Research Workshop on Automatic Speech Recognition: Challenges for the new Millennium, Paris, France, 145–149
Google Scholar
Ververidis D, Kotropoulos C (2006) Emotional speech recognition: Resources, features and methods. Speech Communication 48, 1162–1181
Article Google Scholar
Verhasselt JP Martens JP (1996) A fast and reliable rate of speech detector, Proceedings ICSLP96, Vol 3, 2258–2261
Google Scholar
Guitar B, Kopff-Schaefer H, Donahu-Kilburg G, Bond L (1992) Parent verbal interactions and speech rate: A case study in stuttering. Journal of Speech and Hearing Research 35, 742–754
Google Scholar
Howell P, Sackin S (2000) Speech rate modification and its effects on fluency reversal in fluent speakers and people who stutter. Journal of Developmental and Physical Disabilities 12(4)
Google Scholar
Chen WY, Chen SH, Lin CHJ (1996) A speech recognition method based on the sequential Multi-layer Perceptrons. Neural Networks 9(4), 655–669
Article Google Scholar
Farrell K, Mamione R, Assaleh K (1994) Speaker recognition using neural networks and conventional classifiers. IEEE Transaction on Speech and Audio Processing 2(1), part 2, 194–205
Article Google Scholar
Kohonen T (2001) Self-Organizing Maps. Springer, Berlin, Heidelberg, New York
Google Scholar
Kestler HA, Schwenker F (2000) Classification of high-resolution ECG signals, In Howlett R, Jain L Radial basis function neural networks: theory and applications. Heidelberg: Physica-Verlag
Google Scholar
Schwenker F, Kestler HA, Palm G (2001) Three learning phases for radial-basisfunction networks. Neural Networks 14, 439–458.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Physics, Agricultural University, Akademicka 13, 20-950, Lublin, Poland
Izabela Szczurowska
Laboratory of Biocybernetics, Institute of Computer Science, Maria Curie-Sklodowska University, pl. Maria Curie-Sklodowska 1, 20-031, Lublin, Poland
Wiesława Kuniszyk-Jóźkowiak & Elzbieta Smołka

Authors

Izabela Szczurowska
View author publications
You can also search for this author in PubMed Google Scholar
Wiesława Kuniszyk-Jóźkowiak
View author publications
You can also search for this author in PubMed Google Scholar
Elzbieta Smołka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Electronics, Wroclaw University of Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
Marek Kurzynski , Edward Puchala , Michal Wozniak & Andrzej Zolnierek , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Szczurowska, I., Kuniszyk-Jóźkowiak, W., Smołka, E. (2007). Articulation Rate Recognition by Using Artificial Neural Networks. In: Kurzynski, M., Puchala, E., Wozniak, M., Zolnierek, A. (eds) Computer Recognition Systems 2. Advances in Soft Computing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75175-5_95

Download citation

DOI: https://doi.org/10.1007/978-3-540-75175-5_95
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75174-8
Online ISBN: 978-3-540-75175-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics