Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism | IEEE Conference Publication | IEEE Xplore