Watch, Listen, and Answer: Open-Ended VideoQA with Modulated Multi-Stream 3D ConvNets | IEEE Conference Publication | IEEE Xplore