Commentary Paper on “Person Tracking With Audio-Visual Cues Using the Iterative Decoding Framework” | IEEE Conference Publication | IEEE Xplore