On parallelizability of stochastic gradient descent for speech DNNs

Abstract:

This paper compares the theoretical efficiency of model-parallel and data-parallel distributed stochastic gradient descent training of DNNs. For a typical Switchboard DNN with 46M parameters, the results are not pretty: With modern GPUs and interconnects, model parallelism is optimal with only 3 GPUs in a single server, while data parallelism with a minibatch size of 1024 does not even scale to 2 GPUs. We further show that data-parallel training efficiency can be improved by increasing the minibatch size (through a combination of AdaGrad and automatic adjustments of learning rate and minibatch size) and data compression. We arrive at an estimated possible end-to-end speed-up of 5 times or more. We do not address issues of robustness to process failure or other issues that might occur during training, nor of speed of convergence differences between ASGD and SGD parameter update patterns.
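The communication burden behind the data-parallel result can be illustrated with a back-of-the-envelope sketch. This is not the paper's cost model. The only figure taken from the abstract is the 46M parameter count. The 4 bytes per value (32-bit floats), the per-minibatch compute time, the link bandwidth, and both function names are assumptions chosen purely for illustration, and the model assumes communication is not overlapped with computation.

```python
def gradient_bytes(num_params: int, bytes_per_value: int = 4) -> int:
    """Bytes moved when one full gradient is exchanged.

    bytes_per_value=4 assumes uncompressed 32-bit floats (an assumption).
    """
    return num_params * bytes_per_value


def data_parallel_speedup(num_gpus: int,
                          compute_s: float,
                          exchange_bytes: float,
                          bandwidth_bytes_per_s: float) -> float:
    """Idealized speed-up over one GPU for a single minibatch.

    Each GPU computes on 1/num_gpus of the minibatch, then the gradient
    is exchanged once with no overlap between compute and communication.
    This is a deliberately simplified model, not the paper's analysis.
    """
    if num_gpus == 1:
        return 1.0  # nothing to exchange on a single GPU
    comm_s = exchange_bytes / bandwidth_bytes_per_s
    return compute_s / (compute_s / num_gpus + comm_s)


# 46M parameters is from the abstract; everything below it is hypothetical.
size = gradient_bytes(46_000_000)  # 184,000,000 bytes per exchange
HYPOTHETICAL_COMPUTE_S = 0.05      # assumed time per minibatch on one GPU
HYPOTHETICAL_BANDWIDTH = 6e9       # assumed link bandwidth in bytes/s

for k in (1, 2, 4):
    s = data_parallel_speedup(k, HYPOTHETICAL_COMPUTE_S, size,
                              HYPOTHETICAL_BANDWIDTH)
    print(f"{k} GPU(s): speed-up {s:.2f}")
```

Under these assumed numbers the fixed exchange cost outweighs the compute saved by splitting the minibatch. The sketch also shows why the two remedies named in the abstract help: a larger minibatch raises `compute_s` per exchange, and data compression lowers `exchange_bytes`.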

Published in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 04-09 May 2014

Date Added to IEEE Xplore: 14 July 2014

Electronic ISBN: 978-1-4799-2893-4

DOI: 10.1109/ICASSP.2014.6853593

Conference Location: Florence, Italy
