Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition | IEEE Conference Publication | IEEE Xplore