Speech coding with multilayer networks

Bengio, Yoshua; Cardin, Regis; Cosi, Piero; De Mori, Renato; Merlo, Ettore

doi:10.1007/978-3-642-76153-9_26

Speech coding with multilayer networks

Yoshua Bengio³,
Regis Cardin³,
Piero Cosi⁴,
Renato De Mori³ &
…
Ettore Merlo³

Conference paper

655 Accesses

Part of the book series: NATO ASI Series ((NATO ASI F,volume 68))

Abstract

A set of Multi-Layered Networks (MLN) for Automatic Speech Recognition (ASR) is proposed. Such a set allows to integrate information extracted with variable resolution in the time and in the frequency domain and to keep the number of links between nodes of the networks small in order to allow significant generalization during learning with a reasonable training set size.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jelinek, F.: The development of an experimental discrete dictation recognizer. IEEE Proceedings, pp. 1616–1624, (November 1984).
Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representation by error propagation. In Parallel Distributed Processing: Exploration in the Microstracture of Cognition, vol. 1, MIT Press, 318–362, (1986).
Google Scholar
Plout, D.C., Hintön, G.E.: Learning sets of filters using back propagation. Computer Speech and Language, vol. 2, 35–61, (1987).
Article Google Scholar
Hinton, G.E., Sejnowski, T.J.: Learning and relearning in Boltzmann machines. In Parallel Distributed Processing: Exploration in the Microstracture of Cognition, vol. 1, MIT Press, 282–317, (1986).
Google Scholar
Bourlard, H., Wellekens, C J.: Links between Markov models and multilayer perceptron. IEEE Conference on Neural Networks, Denver Co., (1988).
Google Scholar
Watrous, R.L., Shastri, L.: Learning phonetic features using connectionist networks. Proceedings of the 10th International Joint Conference on Artificial Intelligence, 851–854, (1987).
Google Scholar
Waibel, A., Hanazawa, T., Hinton, G.E., Shikano, K., Lang, K.: Phoneme recognition: neural networks vs hidden Markov models. IEEE Transactions on on Acoustics, Speech and Signal Processing, (1989).
Google Scholar
Gori, M., Bengio Y., De Mori, R.: BPS: A learning algorithm for capturing the dynamic nature of speech. In Proceedings ICNN-89, Washington, D. C, (1989).
Google Scholar
Bengio Y., De Mori, R.: Speaker normalization and automatic speech recognition using spectral lines and neural networks. In Proceedings of the Canadian Conference on Artificial Intelligence (CSCSI-88), Edmonton, Al., (May 1988).
Google Scholar
Cosi, P., Bengio Y., De Mori, R.: On the generalization capabilities of multilayer networks in the extraction of speech properties. In Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit Mi., (Aug. 1989).
Google Scholar
De Mori, R., Lam, L., Gilloux, M.: Learning and plan refinement in a knowledge-based system for automatic speech recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-9, No.2, 289–305, (1987).
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, Mc Gill University/CRIM, 3480 University Street, Montreal, Quebec, Canada, H3A2A7
Yoshua Bengio, Regis Cardin, Renato De Mori & Ettore Merlo
Centro di Studio per le Ricerche di Fonetica CNR, via G. Oberdan, 10, 35122, Padova, Italy
Piero Cosi

Authors

Yoshua Bengio
View author publications
You can also search for this author in PubMed Google Scholar
Regis Cardin
View author publications
You can also search for this author in PubMed Google Scholar
Piero Cosi
View author publications
You can also search for this author in PubMed Google Scholar
Renato De Mori
View author publications
You can also search for this author in PubMed Google Scholar
Ettore Merlo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Laboratoire de Recherche en Informatique, Université de Paris Sud, Bâtiment 490, F-91405, Orsay Cedex, France
Françoise Fogelman Soulié
Institut National Polytechnique de Grenoble, LTIRF, 46, avenue Félix Viallet, F-38031, Grenoble Cedex, France
Jeanny Hérault

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bengio, Y., Cardin, R., Cosi, P., De Mori, R., Merlo, E. (1990). Speech coding with multilayer networks. In: Soulié, F.F., Hérault, J. (eds) Neurocomputing. NATO ASI Series, vol 68. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76153-9_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-76153-9_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76155-3
Online ISBN: 978-3-642-76153-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics