Abstract:
Recent years have seen rapid adoption of GPUs in various types of platforms because of the tremendous throughput powered by massive parallelism. However, as the computing power of GPUs continues to grow at a rapid pace, it also becomes harder to utilize these additional resources effectively without support for GPU sharing. In this work, we designed and implemented Gemini, a user-space runtime scheduling framework that enables fine-grained GPU allocation control with support for multi-tenancy and elastic allocation, which are critical for cloud and resource providers. Our key idea is to introduce the concept of a kernel burst, which refers to a group of consecutive kernels launched together without being interrupted by synchronous events. Based on the characteristics of kernel bursts, we proposed a low-overhead event-driven monitor and a dynamic time-sharing scheduler to achieve our goals. Our experimental evaluation using five types of GPU applications shows that Gemini enables multi-tenant and elastic GPU allocation with less than 5% performance overhead. Furthermore, compared to static scheduling, Gemini achieves a 20%-30% performance improvement without requiring prior knowledge of applications.
Published in: IEEE Transactions on Cloud Computing (Volume 11, Issue 1, Jan.-March 2023)
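To make the abstract's scheduling idea concrete, the sketch below simulates a kernel-burst-based time-sharing scheduler. This is a minimal, hypothetical illustration under assumed names (Client, BurstScheduler) and parameters, not Gemini's actual implementation: each tenant reports the durations of its kernel bursts (consecutive kernel launches bounded by synchronization events), and the scheduler deducts that time from a per-tenant quota that is replenished every scheduling window in proportion to the tenant's requested share.

```python
# Hypothetical sketch of kernel-burst-based time sharing (not the paper's code).
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Client:
    name: str
    share: float                                   # requested fraction of GPU time
    quota_ms: float = 0.0                          # remaining time in current window
    bursts: deque = field(default_factory=deque)   # pending kernel-burst durations (ms)


class BurstScheduler:
    def __init__(self, clients, window_ms=100.0):
        self.clients = clients
        self.window_ms = window_ms
        self.replenish()

    def replenish(self):
        # Grant each client a slice of the window proportional to its share.
        for c in self.clients:
            c.quota_ms = c.share * self.window_ms

    def run_window(self):
        # Serve clients round-robin; a burst runs only while quota remains,
        # so tenants cannot exceed their allocation within a window.
        progress = True
        while progress:
            progress = False
            for c in self.clients:
                if c.bursts and c.quota_ms > 0:
                    burst = c.bursts.popleft()
                    c.quota_ms -= burst
                    print(f"{c.name}: ran burst {burst:.1f} ms, "
                          f"quota left {c.quota_ms:.1f} ms")
                    progress = True
        self.replenish()


if __name__ == "__main__":
    a = Client("tenant-A", share=0.7, bursts=deque([20.0, 25.0, 30.0]))
    b = Client("tenant-B", share=0.3, bursts=deque([10.0, 15.0]))
    BurstScheduler([a, b]).run_window()
```

Because allocation is enforced per window rather than fixed in advance, unused quota effectively flows to tenants that still have bursts pending, which is one plausible way to picture the elastic allocation the abstract describes.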