CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU | IEEE Conference Publication | IEEE Xplore