Journals & Magazines >IEEE Transactions on Multimedia >Volume: 26

MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This article discusses the limitations of single- and two-modal salient object detection (SOD) methods and the emergence of multi-modal SOD techniques that integrate Visi...Show More

Metadata

Abstract:

This article discusses the limitations of single- and two-modal salient object detection (SOD) methods and the emergence of multi-modal SOD techniques that integrate Visible, Depth, or Thermal information. However, current multi-modal methods often rely on simple fusion techniques such as addition, multiplication and concatenation, to combine the different modalities, which is ineffective for challenging scenes, such as low illumination and background messy. To address this issue, we propose a novel multi-modal feature fusion network (MFFNet) for V-D-T salient object detection, where the two key points are the triple-modal deep fusion encoder and the progressive feature enhancement decoder. The MFFNet's triple-modal deep fusion (TDF) module is designed to integrate the features of the three modalities and explore their complementarity by utilizing mutual optimization during the encoding phase. In addition, the progressive feature enhancement decoder consists of the weighted context-enhanced feature (WCF) module, region optimization (RO) module and boundary perception (BP) module to produce region-aware and contour-aware features. After that, a multi-scale fusion (MF) module is proposed to integrate these features and generate high-quality saliency maps. We conduct extensive experiments on the VDT-2048 dataset, and our results show that the proposed MFFNet outperforms 12 state-of-the-art multi-modal methods.

Published in: IEEE Transactions on Multimedia ( Volume: 26)

Page(s): 2069 - 2081

Date of Publication: 03 July 2023

ISSN Information:

DOI: 10.1109/TMM.2023.3291823

Funding Agency:

Contents

References is not available for this document.

MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?