Learning Differentiable Sparse and Low Rank Networks for Audio-Visual Object Localization | IEEE Conference Publication | IEEE Xplore