Context dependent phone models for LSTM RNN acoustic modelling

Abstract:

Long Short-Term Memory Recurrent Neural Networks (LSTM RNNs), combined with hidden Markov models (HMMs), have recently been shown to outperform other acoustic models such as Gaussian mixture models (GMMs) and deep neural networks (DNNs) for large-scale speech recognition. We argue that using multi-state HMMs with LSTM RNN acoustic models is an unnecessary vestige of GMM-HMM and DNN-HMM modelling, since LSTM RNNs are able to predict output distributions through continuous, rather than piecewise-stationary, modelling of the acoustic trajectory. We demonstrate equivalent results for context-independent whole-phone and 3-state models, and show that minimum-duration modelling can lead to improved results. We go on to show that context-dependent whole-phone models can perform as well as context-dependent states, given a minimum-duration model.
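
One way to picture a minimum-duration constraint on a whole-phone model is a left-to-right chain of states that all share the same output label, with a self-loop only on the last state. The sketch below is illustrative and rests on assumptions that are not stated in the abstract: the function names `min_duration_topology` and `accepts`, the arc representation, and the choice of a three-frame minimum are all hypothetical, and the paper's actual topology may differ.

```python
def min_duration_topology(phone, min_frames):
    """Build a hypothetical whole-phone topology with a minimum duration.

    States are numbered 0..min_frames. Every arc consumes one frame and
    carries the same phone label, so the acoustic model needs only one
    output per phone. Only the last state has a self-loop.
    """
    if min_frames < 1:
        raise ValueError("min_frames must be at least 1")
    # Forward arcs: each one forces the phone to last one more frame.
    arcs = [(s, s + 1, phone) for s in range(min_frames)]
    # Self-loop on the final state: the phone may last arbitrarily longer.
    arcs.append((min_frames, min_frames, phone))
    return arcs, min_frames


def accepts(arcs, final_state, n_frames):
    """Return True if a phone lasting exactly n_frames frames is allowed."""
    states = {0}
    for _ in range(n_frames):
        states = {dst for (src, dst, _label) in arcs if src in states}
    return final_state in states


arcs, final_state = min_duration_topology("ae", 3)
for n in (1, 2, 3, 5):
    print(n, accepts(arcs, final_state, n))
```

With a minimum of three frames, durations of one or two frames are rejected while three or more are accepted. Because all arcs share one label, the constraint lives entirely in the decoding topology and adds no outputs to the network.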

Published in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 19-24 April 2015

Date Added to IEEE Xplore: 06 August 2015

Electronic ISBN: 978-1-4673-6997-8

DOI: 10.1109/ICASSP.2015.7178839

Conference Location: South Brisbane, QLD, Australia
