
A Mixture of Recurrent Neural Networks for Speaker Normalisation


In spite of recent advances in automatic speech recognition, the performance of state-of-the-art speech recognisers fluctuates depending on the speaker. Speaker normalisation aims at reducing the differences between the acoustic space of a new speaker and the training acoustic space of a given speech recogniser, thereby improving recognition performance. Normalisation is based on an acoustic feature transformation, to be estimated from a small amount of speech signal. This paper introduces a mixture of recurrent neural networks as an effective regression technique to approach the problem. A suitable Viterbi-based time alignment procedure is proposed for generating the adaptation set. The mixture is compared with linear regression and with single-model connectionist approaches. Speaker-dependent and speaker-independent continuous speech recognition experiments on a large-vocabulary task, using Hidden Markov Models, are presented. Results show that the mixture improves recognition performance, yielding a 21% relative reduction of the word error rate, comparable with that obtained with model-adaptation approaches.
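To make the idea concrete, the following is a minimal sketch (not the authors' architecture) of a mixture of recurrent networks used as a frame-wise regression model: each Elman-style expert maps the acoustic features of a new speaker towards the training acoustic space, and a gating function blends the experts' outputs. All layer sizes, the softmax gate, and the initialisation are illustrative assumptions, not taken from the paper.

    # Illustrative sketch of a mixture of recurrent networks for
    # acoustic feature regression; sizes and gating are assumptions.
    import numpy as np

    rng = np.random.default_rng(0)

    def softmax(x):
        e = np.exp(x - x.max())
        return e / e.sum()

    class ElmanExpert:
        """One recurrent expert: h_t = tanh(Wx x_t + Wh h_{t-1} + b); y_t = Wo h_t."""
        def __init__(self, d_in, d_hidden, d_out):
            s = 1.0 / np.sqrt(d_hidden)
            self.Wx = rng.uniform(-s, s, (d_hidden, d_in))
            self.Wh = rng.uniform(-s, s, (d_hidden, d_hidden))
            self.b = np.zeros(d_hidden)
            self.Wo = rng.uniform(-s, s, (d_out, d_hidden))
            self.h = np.zeros(d_hidden)

        def step(self, x):
            self.h = np.tanh(self.Wx @ x + self.Wh @ self.h + self.b)
            return self.Wo @ self.h

    class MixtureOfRNNs:
        """Gated combination of recurrent experts for frame-wise feature mapping."""
        def __init__(self, d_in, d_hidden, d_out, n_experts):
            self.experts = [ElmanExpert(d_in, d_hidden, d_out)
                            for _ in range(n_experts)]
            self.Wg = rng.uniform(-0.1, 0.1, (n_experts, d_in))  # gating weights

        def transform(self, frames):
            """Map a sequence of acoustic frames (T, d_in) -> (T, d_out)."""
            out = []
            for x in frames:
                g = softmax(self.Wg @ x)  # per-frame mixing coefficients
                y = sum(gk * e.step(x) for gk, e in zip(g, self.experts))
                out.append(y)
            return np.array(out)

    # Toy usage: transform 50 frames of 13-dimensional cepstral features.
    mog = MixtureOfRNNs(d_in=13, d_hidden=16, d_out=13, n_experts=4)
    normalised = mog.transform(rng.normal(size=(50, 13)))
    print(normalised.shape)  # (50, 13)

In practice the experts and gate would be trained on an adaptation set of input/target frame pairs, which the paper obtains via its Viterbi-based time alignment procedure; the training loop is omitted here.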



Cite this article

Trentin, E., Giuliani, D. A Mixture of Recurrent Neural Networks for Speaker Normalisation. Neural Comput & Applic 10, 120–135 (2001). https://doi.org/10.1007/s005210170004
