Scene-text Oriented Visual Entailment: Task, Dataset and Solution
Abstract
References
Index Terms
- Scene-text Oriented Visual Entailment: Task, Dataset and Solution
Recommendations
A Multilingual Approach to Scene Text Visual Question Answering
Document Analysis SystemsAbstractScene Text Visual Question Answering (ST-VQA) has recently emerged as a hot research topic in Computer Vision. Current ST-VQA models have a big potential for many types of applications but lack the ability to perform well on more than one language ...
Multimodal grid features and cell pointers for scene text visual question answering
Highlights- New model for scene text VQA that jointly reasons about textual and visual modalities.
- Attending on multi-modal features is better than attending separately to each modality.
- Grid features from an object detection backbone proves ...
AbstractThis paper presents a new model for the task of scene text visual question answering. In this task questions about a given image can only be answered by reading and understanding scene text. Current state of the art models for this task make use ...
Beyond visual semantics: Exploring the role of scene text in image understanding
Highlights- Images use visual and scene text to convey ideas.
- Jointly leveraging scene text and visual cues leads to robust semantic interpretation.
- Contextual encoding capture dynamics between co-occurring visual and text elements.
- Text ...
AbstractImages with visual and scene text content are ubiquitous in everyday life. However, current image interpretation systems are mostly limited to using only the visual features, neglecting to leverage the scene text content. In this paper, we ...
Comments
Information & Contributors
Information
Published In

- General Chairs:
- Abdulmotaleb El Saddik,
- Tao Mei,
- Rita Cucchiara,
- Program Chairs:
- Marco Bertini,
- Diana Patricia Tobon Vallejo,
- Pradeep K. Atrey,
- M. Shamim Hossain
Sponsors
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Funding Sources
- Guangxi Natural Science Foundation
- Open Research Fund of Guangxi Key Laboratory of Multimedia Communications and Network Technology
- Fundamental Research Funds for the Central Universities, SCUT
- Guangxi Scientific and Technological Bases and Talents Special Projects
- CAAI-Huawei MindSpore Open Fund and the Science and Technology Planning Project of Guangdong Province
- CCF-Zhipu AI Large Model Fund
- Guangxi Natural Science Foundation Key Project
- Open Research Fund of Key Laboratory of Big Data and Intelligent Robot (SCUT), Ministry of Education
- National Natural Science Foundation of China
Conference
Acceptance Rates
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 158Total Downloads
- Downloads (Last 12 months)79
- Downloads (Last 6 weeks)6
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in