Detection of Speech Dynamics by Neuromorphic Units

Gómez-Vilda, Pedro; Ferrández-Vicente, José Manuel; Rodellar-Biarge, Victoria; Álvarez-Marquina, Agustín; Mazaira-Fernández, Luis Miguel; Martínez-Olalla, Rafael; Muñoz-Mulas, Cristina

doi:10.1007/978-3-642-02264-7_8


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5601)

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

Abstract

Speech and voice technologies are undergoing a profound review as new paradigms are sought to overcome specific problems that cannot be completely solved by classical approaches. Neuromorphic Speech Processing is an emerging area in which research turns to the natural neural processing of speech by the Human Auditory System in order to capture the basic mechanisms that solve difficult tasks efficiently. The present paper takes a further step in mimicking basic neural speech processing with simple neuromorphic units. Building on previous work, it shows how formant dynamics, and hence consonantal features, can be detected using a general neuromorphic unit that mimics the functionality of certain neurons found in the Upper Auditory Pathways. Using these simple building blocks, a General Speech Processing Architecture can be synthesized as a layered structure. Results from different simulation stages are provided, together with a discussion of implementation details. Conclusions and future work describe the functionality to be covered in the next research steps.
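
The notion of a unit that responds to formant dynamics can be illustrated with a minimal sketch. This is not the authors' neuromorphic unit, whose definition is not given in the abstract. It assumes a simple delay-and-coincidence scheme over a time-frequency array: a unit fires when activity in one frequency channel follows activity in the neighbouring channel one frame earlier. The names `fm_unit_response` and `make_sweep`, and the toy input format, are hypothetical.

```python
def fm_unit_response(spectrogram, direction):
    """Per-frame response of a hypothetical direction-selective unit.

    spectrogram: list of frames; each frame is a list of non-negative
                 channel activities ordered from low to high frequency.
    direction:   +1 for a unit tuned to rising frequency,
                 -1 for a unit tuned to falling frequency.
    """
    responses = [0.0]  # the first frame has no predecessor to compare with
    for t in range(1, len(spectrogram)):
        prev, curr = spectrogram[t - 1], spectrogram[t]
        total = 0.0
        for k in range(len(curr)):
            j = k - direction  # channel expected to be active one frame earlier
            if 0 <= j < len(prev):
                total += prev[j] * curr[k]  # coincidence of delayed and current activity
        responses.append(total)
    return responses


def make_sweep(n_frames, n_channels, start, step):
    """Toy formant track: one active channel per frame, moving by `step` channels."""
    frames = []
    for t in range(n_frames):
        frame = [0.0] * n_channels
        k = start + step * t
        if 0 <= k < n_channels:
            frame[k] = 1.0
        frames.append(frame)
    return frames


rising = make_sweep(5, 8, start=1, step=1)    # ascending transition
falling = make_sweep(5, 8, start=6, step=-1)  # descending transition
steady = make_sweep(5, 8, start=3, step=0)    # stable formant

for name, track in (("rising", rising), ("falling", falling), ("steady", steady)):
    up = sum(fm_unit_response(track, +1))
    down = sum(fm_unit_response(track, -1))
    print(name, "up-unit:", up, "down-unit:", down)
```

In this toy setting the rising-tuned unit responds only to the ascending track, the falling-tuned unit only to the descending one, and neither responds to the stable formant. A layered arrangement in the spirit of the abstract could feed such responses into further units, but that step is left out here.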

Author information

Authors and Affiliations

Grupo de Informática Aplicada al Tratamiento de Señal e Imagen, Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, s/n, 28660, Madrid, Spain
Pedro Gómez-Vilda, Victoria Rodellar-Biarge, Agustín Álvarez-Marquina, Luis Miguel Mazaira-Fernández, Rafael Martínez-Olalla & Cristina Muñoz-Mulas
Dpto. Electrónica, Tecnología de Computadoras, Univ. Politécnica de Cartagena, 30202, Cartagena, Spain
José Manuel Ferrández-Vicente

Editor information

Editors and Affiliations

Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia, E.T.S. de Ingeniería Informática, Juan del Rosal, 16, 28040, Madrid, Spain
José Mira & Félix de la Paz
Departamento de Electrónica, Tecnología de Computadores y Proyectos, Universidad Politécnica de Cartagena, Pl. Hospital, 1, 30201, Cartagena, Spain
José Manuel Ferrández
Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia, E.T.S. de Ingeniería Informática, Juan del Rosal, 16, 28040, Madrid, Spain
José R. Álvarez
Departamento de Electrónica, Tecnología de Computadoras y Proyectos, Universidad Politécnica de Cartagena, Pl. Hospital, 1, 30201, Cartagena, Spain
F. Javier Toledo

Cite this paper

Gómez-Vilda, P. et al. (2009). Detection of Speech Dynamics by Neuromorphic Units. In: Mira, J., Ferrández, J.M., Álvarez, J.R., de la Paz, F., Toledo, F.J. (eds) Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira’s Scientific Legacy. IWINAC 2009. Lecture Notes in Computer Science, vol 5601. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02264-7_8

DOI: https://doi.org/10.1007/978-3-642-02264-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02263-0
Online ISBN: 978-3-642-02264-7
eBook Packages: Computer Science, Computer Science (R0)
