Cited By
View all- Zhang HLiu MLiu ZSong XWang YNie LSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Multi-factor adaptive vision selection for egocentric video question answeringProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3694520(59310-59328)Online publication date: 21-Jul-2024
- Yu TFu KZhang JHuang QYu J(2024)Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question AnsweringIEEE Transactions on Image Processing10.1109/TIP.2024.339098433(3115-3129)Online publication date: 2024
- Xu FZhong ZZhu YZhou YLi G(2024)Appearance-Motion Dual-Stream Heterogeneous Network for VideoQAMultiMedia Modeling10.1007/978-3-031-53311-2_16(212-227)Online publication date: 28-Jan-2024
- Show More Cited By