DOI: 10.1145/3578338.3593530

Characterizing the Performance of Accelerated Jetson Edge Devices for Training Deep Learning Models

Published: 19 June 2023

Abstract

Deep Neural Network (DNN) models are becoming ubiquitous in contemporary domains such as autonomous vehicles, smart cities, and healthcare. They help drones navigate, identify suspicious activity in safety-camera footage, and perform diagnostics over medical imaging. Fast DNN inferencing close to the data source is enabled by a growing class of accelerated edge devices, such as NVIDIA Jetson and Google Coral, which host low-power Graphics Processing Units (GPUs) and Tensor Processing Units (TPUs) alongside ARM CPUs in a compact form factor, offering a superior performance-to-energy ratio. For example, the NVIDIA Jetson AGX Xavier kit has a 512-core Volta GPU, an 8-core ARM CPU and 32 GB of LPDDR4x memory; it operates within 65 W of power, costs US$999, and is smaller than a paperback novel.
Recently, there has been a push towards training DNN models on the edge [2]. This is driven by the massive growth in data collected from edge devices in Cyber-Physical Systems (CPS) and the Internet of Things (IoT), the need to refresh models periodically, the bandwidth constraints of moving all this data to Cloud data centers for training, and a heightened emphasis on privacy by retaining data on the edge. This has led to techniques such as federated and geo-distributed learning, which train DNN models locally on the data held by each edge device and aggregate the resulting models centrally (sketched below). In this abstract, we summarise and highlight key results from our full paper [5].
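To make the local-training-plus-central-aggregation pattern concrete, here is a minimal sketch of one round of Federated Averaging (FedAvg) in PyTorch. It is an illustration of the general technique named above, not the training setup evaluated in the paper; the function name, hyperparameters, and the assumption of equally sized client datasets are all placeholders.

```python
import copy
import torch

def federated_round(global_model, client_loaders, local_epochs=1, lr=0.01):
    """One round of Federated Averaging (FedAvg): each client trains a copy
    of the global model on its own data, and the server averages the weights.
    Assumes equally sized client datasets, so an unweighted mean suffices."""
    client_states = []
    loss_fn = torch.nn.CrossEntropyLoss()
    for loader in client_loaders:
        local_model = copy.deepcopy(global_model)  # edge-local working copy
        opt = torch.optim.SGD(local_model.parameters(), lr=lr)
        local_model.train()
        for _ in range(local_epochs):
            for x, y in loader:  # raw data never leaves the device
                opt.zero_grad()
                loss_fn(local_model(x), y).backward()
                opt.step()
        client_states.append(local_model.state_dict())
    # Central aggregation: element-wise mean over the client weights.
    avg_state = {
        k: torch.stack([s[k].float() for s in client_states]).mean(dim=0)
        for k in client_states[0]
    }
    global_model.load_state_dict(avg_state)
    return global_model
```

Only model weights cross the network in this scheme; the training data stays resident on each edge device, which is the privacy motivation cited above.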

Supplemental Material

MP4 File
The growing capacity of GPU-accelerated edge devices like NVIDIA Jetson and techniques like federated learning motivate the need for a holistic characterization of DNN training on the edge. Training DNNs is resource-intensive and can stress an edge's GPU, CPU, memory and storage capacities. In this paper, we vary device and training parameters such as I/O pipelining and parallelism, storage media, mini-batch sizes and power modes, and examine their effect on CPU and GPU utilization, fetch stalls, training time, energy usage, and variability. Our analysis exposes several resource inter-dependencies and counter-intuitive insights, while also helping quantify known wisdom.
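As an illustration of the kind of parameter sweep described above, the sketch below times one training epoch while varying the mini-batch size and the number of DataLoader workers (the I/O pipelining and parallelism knob) in PyTorch. The dataset (CIFAR-10), model (ResNet-18), and hyperparameters are placeholder assumptions, not the paper's exact workloads; on a Jetson, the power mode would additionally be switched outside Python, e.g. with nvpmodel.

```python
import time
import torch
import torchvision
from torch.utils.data import DataLoader

def time_epoch(batch_size, num_workers, device="cuda"):
    """Train ResNet-18 on CIFAR-10 for one epoch and return wall-clock time,
    isolating the effect of mini-batch size and fetch parallelism."""
    ds = torchvision.datasets.CIFAR10("data", train=True, download=True,
                                      transform=torchvision.transforms.ToTensor())
    # num_workers controls parallel pre-processing/fetch pipelining;
    # pin_memory speeds up host-to-GPU copies.
    loader = DataLoader(ds, batch_size=batch_size, shuffle=True,
                        num_workers=num_workers, pin_memory=True)
    model = torchvision.models.resnet18(num_classes=10).to(device)
    opt = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
    loss_fn = torch.nn.CrossEntropyLoss()
    start = time.time()
    for x, y in loader:
        x, y = x.to(device, non_blocking=True), y.to(device, non_blocking=True)
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
    if device == "cuda":
        torch.cuda.synchronize()  # include queued GPU work in the timing
    return time.time() - start

# Sweep mini-batch sizes and DataLoader workers, one slice of the
# parameter space characterized in the paper.
for bs in (16, 32, 64):
    for workers in (0, 2, 4):
        print(f"batch={bs} workers={workers}: {time_epoch(bs, workers):.1f}s")
```

Utilization and energy would be sampled alongside such a sweep, e.g. using the tegrastats utility on Jetson devices to read CPU/GPU load and the on-board power sensors.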

References

[1] S. Baller, A. Jindal, M. Chadha, and M. Gerndt. 2021. DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices. In 2021 IEEE International Conference on Cloud Engineering (IC2E).
[2] Jiasi Chen and Xukan Ran. 2019. Deep Learning With Edge Computing: A Review. Proc. IEEE 107, 8 (2019).
[3] Stephan Holly, Alexander Wendt, and Martin Lechner. 2020. Profiling Energy Consumption of Deep Neural Networks on NVIDIA Jetson Nano. In 2020 11th International Green and Sustainable Computing Workshops (IGSC).
[4] Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, and Vijay Chidambaram. 2021. Analyzing and Mitigating Data Stalls in DNN Training. Proc. VLDB Endow. 14, 5 (2021).
[5] Prashanthi S. K., Sai Anuroop Kesanapalli, and Yogesh Simmhan. 2022. Characterizing the Performance of Accelerated Jetson Edge Devices for Training Deep Learning Models. Proc. ACM Meas. Anal. Comput. Syst. 6, 3 (December 2022).
[6] Yu Emma Wang, Gu-Yeon Wei, and David Brooks. 2019. Benchmarking TPU, GPU, and CPU Platforms for Deep Learning. arXiv preprint arXiv:1907.10701 (2019).


Published In

SIGMETRICS '23: Abstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
June 2023
123 pages
ISBN: 9798400700743
DOI: 10.1145/3578338
  • Also published in ACM SIGMETRICS Performance Evaluation Review, Volume 51, Issue 1 (SIGMETRICS '23), June 2023, 108 pages. ISSN: 0163-5999. DOI: 10.1145/3606376.
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.


Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. DNN training
  2. edge accelerators
  3. performance characterization

Qualifiers

  • Abstract

Data Availability

Supplemental video: https://dl.acm.org/doi/10.1145/3578338.3593530#SIGMETRICS23-sigmA059.mp4


Conference

SIGMETRICS '23

Acceptance Rates

Overall acceptance rate: 459 of 2,691 submissions (17%)
