ABSTRACT
Training neural networks efficiently is a thoroughly researched topic that plays an important role in their adoption. Major advances have been made, including the use of multiple nodes to further reduce training time. However, training at scale usually means adding layers of complex deployment logic and parallelization concerns, distracting researchers from the core of their algorithms. This paper presents BK.Synapse, a framework that facilitates distributed training while maintaining clarity, simplicity, and user-friendliness. Its modular design allows flexible and straightforward deployment on a variety of hardware configurations. The framework is benchmarked in a case study: training a neural network for an object detection problem. Our results show clear improvements over conventional single-node training, achieved with very few modifications to the existing codebase. The resulting model also performs relatively well upon further testing.
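The abstract does not show BK.Synapse's own API, so as context for the "very few modifications" claim, the sketch below illustrates the general pattern such frameworks follow for data-parallel training, using Horovod with PyTorch (both among the related systems the paper discusses): initialize the communication layer, wrap the optimizer, and broadcast the initial state. The model, data, and hyperparameters are placeholders, not the paper's actual case study.

```python
# Hedged sketch only: BK.Synapse's API is not shown in the abstract.
# This uses Horovod's real PyTorch bindings to show the minimal-change
# pattern; the model, dataset, and hyperparameters are placeholders.
import torch
import torch.nn as nn
import torch.optim as optim
import horovod.torch as hvd

hvd.init()                                   # start this worker's communication context
torch.cuda.set_device(hvd.local_rank())      # pin each worker to one local GPU

model = nn.Linear(512, 10).cuda()            # placeholder model
optimizer = optim.SGD(model.parameters(),
                      lr=0.01 * hvd.size())  # conventional lr scaling by worker count

# Wrap the optimizer so gradients are averaged across workers each step.
optimizer = hvd.DistributedOptimizer(
    optimizer, named_parameters=model.named_parameters())

# Ensure all workers start from identical weights and optimizer state.
hvd.broadcast_parameters(model.state_dict(), root_rank=0)
hvd.broadcast_optimizer_state(optimizer, root_rank=0)

loss_fn = nn.CrossEntropyLoss()
for _ in range(100):                         # placeholder training loop
    x = torch.randn(32, 512).cuda()          # synthetic batch
    y = torch.randint(0, 10, (32,)).cuda()
    optimizer.zero_grad()
    loss_fn(model(x), y).backward()
    optimizer.step()                         # gradients are all-reduced here
```

A script like this is typically launched on multiple processes, e.g. `horovodrun -np 4 python train.py`; the single-node version differs only in the handful of Horovod calls above, which is the kind of low-friction migration the abstract describes.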