Cited By
Sun G, Qin C, Wang J, Chen Z, Xu R, Tao Z (2024). SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant. Computer Vision – ECCV 2024, pp. 156-172. DOI: 10.1007/978-3-031-72673-6_9. Online publication date: 22-Oct-2024.
Cross-modal retrieval has been a compelling topic in the multimodal community. Recently, to mitigate the high cost of data collection, the co-occurred pairs (e.g., image and text) could be collected from the Internet as a large-scaled cross-modal ...
Semi-supervised medical image segmentation presents a compelling approach to streamline large-scale image analysis, alleviating annotation burdens while maintaining comparable performance. Despite recent strides in cross-supervised training ...
The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...
Association for Computing Machinery
New York, NY, United States