Modelling speech processing and recognition in the auditory system with a three-stage architecture

Wesarg, T.; Brückner, B.; Schauer, C.

doi:10.1007/3-540-61510-5_115

Modelling speech processing and recognition in the auditory system with a three-stage architecture

T. Wesarg¹,
B. Brückner¹ &
C. Schauer¹

Poster Presentations 2
Conference paper
First Online: 01 January 2005

118 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1112))

Abstract

One approach to the construction of an engineered system for hearing and efficient speech recognition is the modeling of the human auditory system. We applied this approach to our speech recognition tasks using a coupled modeling concept (Fig. 1) which should reproduce this system in a plausible way (Brückner et al. [1]). Starting with a model of signal processing by the cochlea (Kates [4]), our coupled modeling concept contains a lateral inhibitory neural network (LIN) system (Shamma [2]) performing filter operations by spatial processing of the speech evoked activity in the auditory nerve, and a structured formal neural network (Brückner et al. [3]) for learning and recognition of the spectral representations of the speech stimuli provided by the LIN.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

References

B. Brückner and W. Zander, “Neurobiological modeling and structured neural networks”, Proc. Inter. Conf. Artificial Neural Networks, Amsterdam, Sept. 13–16, 1993, pp. 43–46.
Google Scholar
S. Shamma, “Spatial and Temporal Processing in Central Auditory Networks”, C. Koch and I. Segev (eds.): Methods in Neuronal Modeling, The MIT Press, Cambridge, Massachusetts, pp. 247–289, 1989.
Google Scholar
B. Brückner, T. Wesarg and C. Blumenstein, “Improvements of the modified Hypermap Architecture for Speech Recognition”, Proc. Inter. Conf. Neural Networks, Perth, Australia, Nov.22-Dec.l, 1995, vol. 5, pp. 2891–2895.
Google Scholar
J.M. Kates, “A time-domain digital cochlear model”, IEEE Transactions on Signal Processing, vol. 39, no. 12, pp. 2573–2592, December 1991.
Google Scholar
Teuvo Kohonen, “The hypermap architecture”, In: T. Kohonen, K. Mäkisara, O. Simula, and J. Kangas, editors, Artificial Neural Networks, pp. 1357–1360, Helsinki, 1991. Elsevier Science Publishers.
Google Scholar

Download references

Author information

Authors and Affiliations

Informatics, Federal Institute for Neurobiology, P.O.Box 1860, 39008, Magdeburg, Germany
T. Wesarg, B. Brückner & C. Schauer

Authors

T. Wesarg
View author publications
You can also search for this author in PubMed Google Scholar
B. Brückner
View author publications
You can also search for this author in PubMed Google Scholar
C. Schauer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Christoph von der Malsburg Werner von Seelen Jan C. Vorbrüggen Bernhard Sendhoff

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wesarg, T., Brückner, B., Schauer, C. (1996). Modelling speech processing and recognition in the auditory system with a three-stage architecture. In: von der Malsburg, C., von Seelen, W., Vorbrüggen, J.C., Sendhoff, B. (eds) Artificial Neural Networks — ICANN 96. ICANN 1996. Lecture Notes in Computer Science, vol 1112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61510-5_115

Download citation

DOI: https://doi.org/10.1007/3-540-61510-5_115
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61510-1
Online ISBN: 978-3-540-68684-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics