Cited By
- P.J. J, Kovoor B (2025). Video Question Answering. Journal of Visual Communication and Image Representation, 105:C. DOI: 10.1016/j.jvcir.2024.104320. Online publication date: 11-Feb-2025.
- Wang Y, Liu M, Song X, Nie L (2024). Harnessing Representative Spatial-Temporal Information for Video Question Answering. ACM Transactions on Multimedia Computing, Communications, and Applications, 20:10 (1-20). DOI: 10.1145/3675399. Online publication date: 5-Jul-2024.
- Zhang Q, Chang C, Su M, Chang H, Roy D (2024). HMTV: hierarchical multimodal transformer for video highlight query on baseball. Multimedia Systems, 30:5. DOI: 10.1007/s00530-024-01479-6. Online publication date: 23-Sep-2024.