Scene Graph Refinement Network for Visual Question Answering

Abstract:

Visual Question Answering (VQA) aims to answer a free-form natural language question based on the visual clues in a given image. It is a difficult problem, as it requires understanding the fine-grained structured information of both language and image for compositional reasoning. To establish compositional reasoning, recent works attempt to introduce the scene graph into VQA. However, the generated scene graphs are usually quite noisy, which greatly limits question answering performance. Therefore, this paper proposes to refine the scene graphs to improve their effectiveness. Specifically, we present a novel Scene Graph Refinement network (SGR), which introduces a transformer-based refinement network to enhance the object and relation features for better classification. Moreover, as the question provides valuable clues for distinguishing whether the $\left\langle \mathit{subject, predicate, object} \right\rangle$ triplets are helpful or not, the SGR network exploits the semantic information presented in the questions to select the most relevant relations for question answering. Extensive experiments conducted on the GQA benchmark demonstrate the effectiveness of our method.
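The abstract does not specify how relations are scored against the question, so the following is only a minimal sketch of the general idea of question-guided relation selection, not the SGR network itself. It assumes that the question and each triplet have already been embedded as fixed-length vectors, and it swaps in plain cosine similarity with top-k selection as a stand-in for whatever learned scoring the paper uses. The names `cosine`, `select_relations`, and all toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def select_relations(question_vec, triplets, k=2):
    """Return the k triplets whose embeddings are most similar to the question.

    `triplets` is a list of ((subject, predicate, object), embedding) pairs.
    Cosine similarity is an illustrative assumption, not the paper's scoring rule.
    """
    scored = [(cosine(question_vec, vec), triplet) for triplet, vec in triplets]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [triplet for _, triplet in scored[:k]]


# Toy, hand-made embeddings (hypothetical values, for illustration only).
question_vec = [1.0, 0.0, 0.5]
triplets = [
    (("man", "holding", "cup"), [0.9, 0.1, 0.4]),
    (("tree", "behind", "fence"), [0.0, 1.0, 0.0]),
    (("cup", "on", "table"), [0.5, 0.2, 0.6]),
]

selected = select_relations(question_vec, triplets, k=2)
print(selected)
```

In this toy setup the triplet whose embedding is orthogonal to the question vector is dropped, which mirrors the stated goal of filtering out relations that do not help answer the question.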

Published in: IEEE Transactions on Multimedia (Volume: 25)

Page(s): 3950 - 3961

Date of Publication: 22 April 2022

DOI: 10.1109/TMM.2022.3169065
