Throughput Maximization of Delay-Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism