Attention-guided salient object detection using autoencoder regularization

Xu, Cheng; Liu, Xianhui; Zhao, Weidong

doi:10.1007/s10489-022-03917-2

Attention-guided salient object detection using autoencoder regularization

Published: 09 July 2022

Volume 53, pages 6481–6495, (2023)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Cheng Xu¹,
Xianhui Liu¹ &
Weidong Zhao¹

477 Accesses
1 Citation
Explore all metrics

Abstract

A saliency detection task simulates the attention mechanism of the human visual system, which focuses on what draws the most attention in a picture, and performs accurate localization and pixel-level segmentation of the object. Existing detection methods based on neural networks usually perform calculation of the object position information and edge information separately in each layer, resulting in calculation redundancy and insufficient utilization of information. To address this issue, this paper proposes an attention-guided detection network using an autoencoder (AGA-Net). First, using a proposed attention-guided multi-scale (AM) module, results from deep layers can be used to highlight features of the foreground and suppress features of the background and extract different scale features that are more relevant to the detection task. Second, a bi-refinement (BR) module composed of two sub-networks is proposed. One sub-network extracts information of the foreground to find redundant areas in the prediction results, and the other uses background information to supplement missing boundary information. Finally, the new model uses a variational autoencoder (VAE) branch to realize the image restoration task. It shares the encoder module with the object detection task and helps it escape from a local minimum in the converging process. Extensive experiments on six benchmark datasets were conducted and the proposed method is compared with 19 state-of-the-art methods, demonstrating that the new method has the best results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Visual saliency detection via invariant feature constrained stacked denoising autoencoder

Article 15 February 2023

Image saliency detection via multi-scale iterative CNN

Article 06 August 2019

Saliency Detection via CNN Coarse Learning and Compactness Based ELM Refinement

Data Availability

The datasets and materials used or analysed during the current study are available from the corresponding author on reasonable request.

Code Availability

The code used or analysed during the current study is available from the corresponding author on reasonable request.

References

Borji A, Cheng M-M, Hou Q, Jiang H, Li J (2019) Salient object detection: a survey. Comput Vis Media 5(2):117–150
Article Google Scholar
Wang L, Lu H, Ruan X, Yang M-H (2015) Deep networks for saliency detection via local estimation and global search. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3183–3192
Liu Y, Zhang X-Y, Bian J-W, Zhang L, Cheng M-M (2021) Samnet: stereoscopically attentive multi-scale network for lightweight salient object detection. IEEE Trans Image Process 30:3804–3814
Article Google Scholar
Ji Y, Zhang H, Zhang Z, Liu M (2021) Cnn-based encoder-decoder networks for salient object detection: a comprehensive review and recent advances. Inf Sci 546:835–857
Article MathSciNet Google Scholar
Qin X, Zhang Z, Huang C, Gao C, Dehghan M, Jagersand M (2019) Basnet: boundary-aware salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7479–7489
Pang Y, Zhao X, Zhang L, Lu H (2020) Multi-scale interactive network for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9413–9422
Wang S, Liu X, Zhao J, Liu Y, Liu S, Liu Y, Zhao J (2021) Computer auxiliary diagnosis technique of detecting cholangiocarcinoma based on medical imaging: a review. Comput Methods Programs Biomed 208:106265
Article Google Scholar
Shi L, Wang Z, Pan B, Shi Z (2020) An end-to-end network for remote sensing imagery semantic segmentation via joint pixel-and representation-level domain adaptation. IEEE Geosci Remote Sens Lett 18(11):1896–1900
Article Google Scholar
Dogan Y, Keles HY (2020) Semi-supervised image attribute editing using generative adversarial networks. Neurocomputing 401:338–352
Article Google Scholar
Chen S, Tan X, Wang B, Lu H, Hu X, Fu Y (2020) Reverse attention-based residual network for salient object detection. IEEE Trans Image Process 29:3763–3776
Article MATH Google Scholar
Fan D-P, Zhai Y, Borji A, Yang J, Shao L (2020) Bbs-net: Rgb-d salient object detection with a bifurcated backbone strategy network. In: Proceedings of the european conference on computer vision (ECCV), pp 275–292
Qin X, Zhang Z, Huang C, Dehghan M, Zaiane OR, Jagersand M (2020) U2-net: going deeper with nested u-structure for salient object detection. Pattern Recogn 106:107404
Article Google Scholar
Myronenko A (2019) 3d Mri brain tumor segmentation using autoencoder regularization. Brainlesion: glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries:311–320
He S, Lau RW, Liu W, Huang Z, Yang Q (2015) Supercnn: a superpixelwise convolutional neural network for salient object detection. Int J Comput Vis 115(3):330–344
Article MathSciNet Google Scholar
Hu T, Yang M, Yang W, Li A (2019) An end-to-end differential network learning method for semantic segmentation. Int J Mach Learn Cybern 10(7):1909–1924
Article Google Scholar
Zhang J, Zhang T, Dai Y, Harandi M, Hartley R (2018) Deep unsupervised saliency detection: a multiple noisy labeling perspective. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9029–9038
Hou Q, Cheng M-M, Hu X, Borji A, Tu Z, Torr PH (2017) Deeply supervised salient object detection with short connections. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3203–3212
Liu N, Han J, Yang M-H (2018) Picanet: learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3089–3098
Liu J-J, Hou Q, Cheng M-M, Feng J, Jiang J (2019) A simple pooling-based design for real-time salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3917–3926
Gehring J, Auli M, Grangier D, Yarats D, Dauphin YN (2017) Convolutional sequence to sequence learning. In: Proceedings of the 34th international conference on machine learning, pp 1243–1252
Zhang X, Wang T, Qi J, Lu H, Wang G (2018) Progressive attention guided recurrent network for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 714–722
Zhang L, Dai J, Lu H, He Y, Wang G (2018) A bi-directional message passing model for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1741–1750
Sun F, Li W, Guan Y (2019) Self-attention recurrent network for saliency detection. Multimed Tools Appl 78(21):30793–30807
Article Google Scholar
Zhang X, Wang T, Qi J, Lu H, Wang G (2018) Progressive attention guided recurrent network for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 714–722
Wang H, Dai L, Cai Y, Sun X, Chen L (2018) Salient object detection based on multi-scale contrast. Neural Netw 101:47–56
Article Google Scholar
Wang H, Fan R, Cai P, Liu M (2021) Sne-roadseg+: rethinking depth-normal translation and deep supervision for freespace detection. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, pp 1140–1145
Su J, Li J, Zhang Y, Xia C, Tian Y (2019) Selectivity or invariance: Boundary-aware salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3799–3808
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 248–255
Zhang C, Li G, Du S (2019) Multi-scale dense networks for hyperspectral remote sensing image classification. IEEE Trans Geosci Remote Sens 57(11):9201–9222
Article Google Scholar
Gao S, Cheng M-M, Zhao K, Zhang X-Y, Yang M-H, Torr PH (2021) Res2net: a new multi-scale backbone architecture. IEEE Trans Pattern Anal Mach Intell 43(2):652–662
Article Google Scholar
Wang L, Lu H, Wang Y, Feng M, Wang D, Yin B, Ruan X (2017) Learning to detect salient objects with image-level supervision. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 136–145
Yang C, Zhang L, Lu H, Ruan X, Yang M. -H. (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3166–3173
Li G, Yu Y (2015) Visual saliency based on multiscale deep features. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 5455–5463
Yan Q, Xu L, Shi J, Jia J (2013) Hierarchical saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1155–1162
Li Y, Hou X, Koch C, Rehg JM, Yuille AL (2014) The secrets of salient object segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 280–287
Movahedi V, Elder JH (2010) Design and perceptual validation of performance measures for salient object segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 49–56
Zhou H, Xie X, Lai J-H, Chen Z, Yang L (2020) Interactive two-stream decoder for accurate and fast saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9141–9150
Zhang J, Xie J, Barnes N (2020) Learning noise-aware encoder-decoder from noisy labels by alternating back-propagation for saliency detection. In: Proceedings of the european conference on computer vision (ECCV), pp 349–366
Liu Y, Gu Y-C, Zhang X-Y, Wang W, Cheng M-M (2020) Lightweight salient object detection via hierarchical visual perception learning. IEEE Trans Cybern 51(9):4439–4449
Article Google Scholar
Liu J-J, Hou Q, Cheng M-M (2020) Dynamic feature integration for simultaneous detection of salient object, edge, and skeleton. IEEE Trans Image Process 29:8652–8667
Article MATH Google Scholar
Zeng Y, Zhuge Y, Lu H, Zhang L, Qian M, Yu Y (2019) Multi-source weak supervision for saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6074–6083
Feng M, Lu H, Ding E (2019) Attentive feedback network for boundary-aware salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1623–1632
Wu R, Feng M, Guan W, Wang D, Lu H, Ding E (2019) A mutual learning method for salient object detection with intertwined multi-supervision. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8150– 8159
Wang T, Zhang L, Wang S, Lu H, Yang G, Ruan X, Borji A (2018) Detect globally, refine locally: a novel approach to saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3127–3135
Chen S, Tan X, Wang B, Hu X (2018) Reverse attention for salient object detection. In: Proceedings of the European conference on computer vision (ECCV), pp 234–250
Wang T, Borji A, Zhang L, Zhang P, Lu H (2017) A stagewise refinement model for detecting salient objects in images. In: Proceedings of the IEEE international conference on computer vision, pp 4019–4028
Luo Z, Mishra A, Achkar A, Eichel J, Li S, Jodoin P-M (2017) Non-local deep features for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6609–6617
Zhang P, Wang D, Lu H, Wang H, Yin B (2017) Learning uncertain convolutional features for accurate saliency detection. In: Proceedings of the IEEE international conference on computer vision, pp 212–221

Download references

Funding

National Key Research and Development Program of China. No. 2018YFB1404402.

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Tongji University, Shanghai, China
Cheng Xu, Xianhui Liu & Weidong Zhao

Authors

Cheng Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xianhui Liu
View author publications
You can also search for this author in PubMed Google Scholar
Weidong Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xianhui Liu.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xu, C., Liu, X. & Zhao, W. Attention-guided salient object detection using autoencoder regularization. Appl Intell 53, 6481–6495 (2023). https://doi.org/10.1007/s10489-022-03917-2

Download citation

Accepted: 20 June 2022
Published: 09 July 2022
Issue Date: March 2023
DOI: https://doi.org/10.1007/s10489-022-03917-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Attention-guided salient object detection using autoencoder regularization

Abstract

Access this article

Similar content being viewed by others

Visual saliency detection via invariant feature constrained stacked denoising autoencoder

Image saliency detection via multi-scale iterative CNN

Saliency Detection via CNN Coarse Learning and Compactness Based ELM Refinement

Data Availability

Code Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Attention-guided salient object detection using autoencoder regularization

Abstract

Access this article

Similar content being viewed by others

Visual saliency detection via invariant feature constrained stacked denoising autoencoder

Image saliency detection via multi-scale iterative CNN

Saliency Detection via CNN Coarse Learning and Compactness Based ELM Refinement

Data Availability

Code Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation