Loading [a11y]/accessibility-menu.js
Recorrect Net: Visual Guidance for Image Captioning | IEEE Conference Publication | IEEE Xplore

Recorrect Net: Visual Guidance for Image Captioning


Abstract:

Most image caption methods directly learn the mapping relationship from image to text. In practice, however, paying attention to both sentence structure and visual conten...Show More

Abstract:

Most image caption methods directly learn the mapping relationship from image to text. In practice, however, paying attention to both sentence structure and visual content at the same time can be difficult. In this paper, we propose a model, called Re-correct Net, which aims to use the existing caption information by other captioners, to guide the visual content in the generation of new caption. In addition, to obtain the more accurate caption, our method uses the existing textured entity as additional prior knowledge. Experiments show that our model can be used as re-correct block after all captioner training, which is beneficial to improve the quality of caption and is also flexible.
Date of Conference: 17-19 November 2021
Date Added to IEEE Xplore: 04 January 2022
ISBN Information:

ISSN Information:

Conference Location: Beijing, China

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.