Audio-visual intent-to-speak detection for human-computer interaction | IEEE Conference Publication | IEEE Xplore