Conferences >2022 27th Asia and South Paci...

PUMP: Profiling-free Unified Memory Prefetcher for Large DNN Model Support

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Modern DNNs are going deeper and wider to achieve higher accuracy. However, existing deep learning frameworks require the whole DNN model to fit into the GPU memory when ...Show More

Metadata

Abstract:

Modern DNNs are going deeper and wider to achieve higher accuracy. However, existing deep learning frameworks require the whole DNN model to fit into the GPU memory when training with GPUs, which puts an unwanted limitation on training large models. Utilizing NVIDIA Unified Memory (UM) could inherently support training DNN models beyond GPU memory capacity. However, naively adopting UM would suffer a significant performance penalty due to the delay of data transfer. In this paper, we propose PUMP, a Profiling-free Unified Memory Prefetcher. PUMP exploits GPU asynchronous execution for prefetch; that is, there exists a delay between the time that CPU launches a kernel and the time the kernel executes in GPU. PUMP extracts memory blocks accessed by the kernel when launching and swaps these blocks into GPU memory. Experimental results show PUMP achieves about 2x speedup on the average compared to the baseline that naively enables UM.

Published in: 2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC)

Date of Conference: 17-20 January 2022

Date Added to IEEE Xplore: 21 February 2022

ISBN Information:

ISSN Information:

DOI: 10.1109/ASP-DAC52403.2022.9712507

Conference Location: Taipei, Taiwan

Funding Agency:

Contents

References is not available for this document.

PUMP: Profiling-free Unified Memory Prefetcher for Large DNN Model Support

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

PUMP: Profiling-free Unified Memory Prefetcher for Large DNN Model Support

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?