Abstract
The processing or the recognition of non stationary process with neural networks is a challenging and yet unsolved issue. The paper discuss the general pattern recognition framework using neural networks in relation with the understanding of the peripheral auditory system. We propose a short-time structure representation of speech for speech analysis and recognition. We give examples of neural networks architecture and applications that are designed to take into account the time structure of the process to be analysed.
Preview
Unable to display preview. Download preview PDF.
References
Alkon, D.L., Blackwell, K.T., Barbourg, G.S., Rigler, A.K. and Vogl, T.P.: Pattern-recognition by an artificial network derived from biologic neuronal systems. Biological Cybernetics, vol. 62 (1990) 363–376
Frisina, R. D. et al.: Differential encoding of rapid changes in sound amplitude by second-order auditory neurons. Exp. Brain Res., vol. 60 (1985) 417–422
Ho, T. V. and Rouat, J.: A Novelty Detector using a Network of Integrate and Fire Neurons. ICANN97.
Langner, G. and Schreiner, C.E.: Periodicity coding in the inferior colliculus of the cat. Neuronal mechanisms. Journal of Neurophysiology, vol.60, 6, (1988) 1799–1822
Patterson, R.D.: Auditory filter shapes derived with noise stimuli. Journal of the Acoustical Society of America, vol. 59, 3 (1976) 640–654
Rouat, J. and Garcia, M.: A prototype speech recogniser based on associative learning and nonlinear speech analysis. In Proc. of the Workshop on Computational Auditory Scene Analysis, International Joint Conference (IEEE-ACM) on Artificial Intelligence, (1995) 7–12. To be published in, Readings In Computational Auditory Scene Analysis, Edited by H. Okuno and D. Rosenthal, Erlbaum
Schreiner, C. E. and Urbas, J. V.: Representation of amplitude modulation in the auditory cortex of the cat. I. The anterior auditory field (AAF). Hearing Research, vol. 21 (1986) 227–241
Schreiner, C.E. and Langner, G.: Periodicity coding in the inferior colliculus of the cat. Topographical organization. Journal of Neurophysiology, vol. 60, 6 (1988) 1823–1840
Shannon, R. V et al.: Speech Recognition with Primarily Temporal Cues. Science, vol. 270 (1995) 303–304
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rouat, J. (1997). Spatio-temporal pattern recognition with neural networks: Application to speech. In: Gerstner, W., Germond, A., Hasler, M., Nicoud, JD. (eds) Artificial Neural Networks — ICANN'97. ICANN 1997. Lecture Notes in Computer Science, vol 1327. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0020130
Download citation
DOI: https://doi.org/10.1007/BFb0020130
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63631-1
Online ISBN: 978-3-540-69620-9
eBook Packages: Springer Book Archive