Cited By
View all- Kuang JShen YXie JLuo HXu ZLi RLi YCheng XLin XHan Y(2025)Natural Language Understanding and Inference with MLLM in Visual Question Answering: A SurveyACM Computing Surveys10.1145/371168057:8(1-36)Online publication date: 31-Jan-2025
- Wen HSong XChen XWei YNie LChua THui Yang GWang HHan SHauff CZuccon GZhang Y(2024)Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657727(229-239)Online publication date: 10-Jul-2024