
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks


Abstract:

While Deep Neural Networks (DNNs) have achieved tremendous success for large vocabulary continuous speech recognition (LVCSR) tasks, training these networks is slow. Even today, the most common approach to train DNNs is via stochastic gradient descent, serially on one machine. Serial training, coupled with the large number of training parameters (i.e., 10-50 million) and speech data set sizes (i.e., 20-100 million training points), makes DNN training very slow for LVCSR tasks. In this work, we explore a variety of different optimization techniques to improve DNN training speed. This includes parallelization of the gradient computation during cross-entropy and sequence training, as well as reducing the number of parameters in the network using a low-rank matrix factorization. Applying the proposed optimization techniques, we show that DNN training can be sped up by a factor of 3 on a 50-hour English Broadcast News (BN) task with no loss in accuracy. Furthermore, using the proposed techniques, we are able to train DNNs on a 300-hour Switchboard (SWB) task and a 400-hour English BN task, showing improvements of 9-30% relative over a state-of-the-art GMM/HMM system while the number of parameters of the DNN is smaller than that of the GMM/HMM system.
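To illustrate the parameter-reduction idea mentioned in the abstract, the sketch below shows a generic low-rank factorization of a DNN's final weight matrix, replacing one large matrix with the product of two thin matrices. This is only a minimal illustration of the general technique, not the paper's implementation; the layer sizes and rank used here are assumed values chosen for demonstration.

import numpy as np

# Assumed, illustrative dimensions (not taken from the paper):
hidden_dim = 1024   # size of the last hidden layer
output_dim = 6000   # number of output targets (e.g., context-dependent states)
rank = 128          # rank of the factorization

rng = np.random.default_rng(0)

# Full-rank final weight matrix: hidden_dim x output_dim parameters.
W_full = rng.standard_normal((hidden_dim, output_dim)) * 0.01

# Low-rank replacement: W ~= A @ B, with A (hidden_dim x rank)
# and B (rank x output_dim), i.e., a linear bottleneck layer.
A = rng.standard_normal((hidden_dim, rank)) * 0.01
B = rng.standard_normal((rank, output_dim)) * 0.01

full_params = W_full.size
low_rank_params = A.size + B.size
print(f"full-rank params: {full_params:,}")
print(f"low-rank params:  {low_rank_params:,}")
print(f"reduction factor: {full_params / low_rank_params:.1f}x")

# The forward pass through the factorized layer is just two matrix products:
h = rng.standard_normal((1, hidden_dim))   # activations of the last hidden layer
logits = (h @ A) @ B

With these example sizes the factorized layer uses roughly 0.9 million parameters instead of about 6.1 million, a reduction of close to 7x for that layer; the actual savings in the paper depend on the network and rank chosen there.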
Published in: IEEE Transactions on Audio, Speech, and Language Processing (Volume: 21, Issue: 11, November 2013)
Page(s): 2267 - 2276
Date of Publication: 03 October 2013
