WUSL–SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection

Liu, Yan; Zhang, Yunzhou; Wang, Zhenyu; Ma, Rong; Qiu, Feng; Coleman, Sonya; Kerr, Dermot

doi:10.1007/s00521-023-08545-6

WUSL–SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection

Original Article
Published: 19 April 2023

Volume 35, pages 15837–15856, (2023)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Yan Liu¹,
Yunzhou Zhang ORCID: orcid.org/0000-0003-0610-3732²,
Zhenyu Wang¹,
Rong Ma³,
Feng Qiu²,
Sonya Coleman⁴ &
…
Dermot Kerr⁴

304 Accesses
1 Citation
Explore all metrics

Abstract

Deep learning methods for salient object detection (SOD) have been studied actively and promisingly. However, it is still challenging for the studies with two aspects. The first one is a single type of label from the network to convey limit information, which leads to the poor generalization ability of the network. The second one is the difficulty to improve the accuracy and detect details of target. To address these challenges, we develop a novel approach via joint weakly supervised, unsupervised and supervised learning for SOD (WUSL–SOD), which differs from existing methods just based on ground-truth or other sparse labels. Specifically, to optimize the objective of the image, the unsupervised learning module (ULM) is designed to generate coarse saliency feature and suppress background noises via attention guiding mechanism. Then, we propose the weakly supervised learning module (WLM) based on scribbles for producing relatively accurate saliency feature. Note that this structure is used to enhance the details and remedy the deficiency of scribbles in WLM. For further refining information from the ULM and WLM, we propose a supervised learning module (SLM), which is not only applied to process and refine information from the ULM and WLM, but also enhance the image details and capture the entire target area. Furthermore, we also exchange information between the SLM and the WLM to obtain more accurate saliency maps. Extensive experiments on five datasets demonstrate that the proposed approach can effectively outperform the state-of-the-art approaches and achieve real-time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 2

Fig. 13

Image saliency detection via multi-scale iterative CNN

Article 06 August 2019

SalNet: Edge Constraint Based End-to-End Model for Salient Object Detection

Deep Salient Object Detection via Hierarchical Network Learning

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request

References

Borji A, Cheng MM, Jiang H, Li J (2015) Salient object detection: a benchmark. IEEE Transact Image process 24(12):5706–5722
Article MathSciNet MATH Google Scholar
Borji A, Itti L (2012) Exploiting local and global patch rarities for saliency detection. In: 2012 IEEE conference on computer vision and pattern recognition, pp. 478–485. IEEE
Chen S, Tan X, Wang B, Hu X (2018) Reverse attention for salient object detection. In: proceedings of the european conference on computer vision (ECCV), pp. 234–250
Chen Z, Xu Q, Cong R, Huang Q (2020) Global context-aware progressive aggregation network for salient object detection. In: proceedings of the AAAI conference on artificial intelligence, 34, pp. 10599–10606
Cheng MM, Mitra NJ, Huang X, Torr PH, Hu SM (2014) Global contrast based salient region detection. IEEE Transact Pattern Anal Mach Intell 37(3):569–582
Article Google Scholar
Cheng MM, Zhang FL, Mitra NJ, Huang X, Hu SM (2010) Repfinder: finding approximately repeated scene elements for image editing. ACM Transact Graphics (TOG) 29(4):1–8
Article Google Scholar
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vision 88(2):303–338
Article Google Scholar
Fan DP, Cheng MM, Liu Y, Li T, Borji A (2017) Structure-measure: A new way to evaluate foreground maps. In: Proceedings of the IEEE international conference on computer vision, pp. 4548–4557
Fan DP, Gong C, Cao Y, Ren B, Cheng MM, Borji A (2018) Enhanced-alignment measure for binary foreground map evaluation. arXiv preprint arXiv:1805.10421
Fang H, Zhang D, Zhang Y, Chen M, Li J, Hu Y, Cai D, He X (2021) Salient object ranking with position-preserved attention. In: Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pp. 16331–16341
Feng M, Lu H, Ding E (2019) Attentive feedback network for boundary-aware salient object detection. In: Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition, pp. 1623–1632
Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3146–3154
Gao S, Zhang W, Wang Y, Guo Q, Zhang C, He Y, Zhang W (2022) Weakly-supervised salient object detection using point supervision. In: AAAI
Gao Y, Wang M, Tao D, Ji R, Dai Q (2012) 3-d object retrieval and recognition with hypergraph analysis. IEEE Transact Image Process 21(9):4290–4303
Article MathSciNet MATH Google Scholar
Hou Q, Cheng MM, Hu X, Borji A, Tu Z, Torr PH (2017) Deeply supervised salient object detection with short connections. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3203–3212
Huang Z, Wang X, Huang L, Huang C, Wei Y, Liu W (2019) Ccnet: Criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 603–612
Jiang P, Ling H, Yu J, Peng J (2013) Salient region detection by ufo: Uniqueness, focusness and objectness. In: Proceedings of the IEEE international conference on computer vision, pp. 1976–1983
Lee S, Lee M, Lee J, Shim H (2021) Railroad is not a train: Saliency as pseudo-pixel supervision for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 5495–5505
Li G, Xie Y, Lin L (2018) Weakly supervised salient object detection using image labels. In: proceedings of the AAAI conference on artificial intelligence, 32
Li G, Yu Y (2015)Visual saliency based on multiscale deep features. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5455–5463
Li G, Yu Y (2015) Visual saliency based on multiscale deep features. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5455–5463
Li G, Yu Y (2016) Visual saliency detection based on multiscale deep cnn features. IEEE Tansact Iage Pocess 25(11):5012–5024
Article MathSciNet MATH Google Scholar
Lin X, Wu Z, Chen G, Li G, Yu Y(2022) A causal debiasing framework for unsupervised salient object detection
Liu JJ, Hou Q, Cheng MM (2020) Dynamic feature integration for simultaneous detection of salient object, edge, and skeleton. IEEE Transact Image Process 29:8652–8667
Article MATH Google Scholar
Liu JJ, Hou Q, Cheng MM, Feng J, Jiang J (2019) A simple pooling-based design for real-time salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3917–3926
Liu JJ, Liu ZA, Peng P, Cheng MM (2021) Rethinking the u-shape structure for salient object detection. IEEE Transact Image Process 30:9030–9042
Article Google Scholar
Liu N, Han J, Yang MH (2018) Picanet: Learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3089–3098
Liu N, Zhang N, Wan K, Shao L, Han J (2021) Visual saliency transformer. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 4722–4732
Liu Y, Cheng MM, Zhang XY, Nie GY, Wang M (2022) DNA: Deeply supervised nonlinear aggregation for salient object detection. IEEE Transact Cybernet 52(7):6131–6142
Article Google Scholar
Liu Y, Wang P, Cao Y, Liang Z, Lau RW (2021) Weakly-supervised salient object detection with saliency bounding boxes. IEEE Transact Image Process 30:4423–4435
Article Google Scholar
Liu Y, Zhang Y, Liu S, Coleman S, Wang Z, Qiu F (2021) Salient object detection by aggregating contextual information. Pattern Recognit Lett
Boosting broader receptive fields for salient object detection (2023) Ma, M., Xia, C., Xie, C., Chen, X., Li. J. IEEE Transact Image Process 32:1026–1038
Google Scholar
Mohammadi S, Noori M, Bahri A, Majelan SG, Havaei M (2020) Cagnet: Content-aware guidance for salient object detection. Pattern Recognit 103:107303
Article Google Scholar
Mukherjee S, Valenzise G, Cheng I (2019) Potential of deep features for opinion-unaware, distortion-unaware, no-reference image quality assessment. In: International conference on smart multimedia, pp. 87–95. Springer
Nguyen DT, Dax M, Mummadi CK, Ngo TPN, Nguyen THP, Lou Z, Brox T (2019) Deepusps: Deep robust unsupervised saliency prediction with self-supervision. arXiv preprint arXiv:1909.13055
Pang Y, Zhao X, Zhang L, Lu H (2020) Multi-scale interactive network for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9413–9422
Piao Y, Ji W, Li J, Zhang M, Lu H (019) Depth-induced multi-scale recurrent attention network for saliency detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 7254–7263
Qin X, Zhang Z, Huang C, Dehghan M, Zaiane OR, Jagersand M (2020) U2-net: Going deeper with nested u-structure for salient object detection. Pattern Recognit 106:107404
Article Google Scholar
Qin X, Zhang Z, Huang C, Gao C, Dehghan M, Jagersand M (2019) Basnet: Boundary-aware salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 7479–7489
Qin Y, Lu H, Xu Y, Wang H (2015) Saliency detection via cellular automata. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 110–119
Sun F, Li W (2019) Saliency guided deep network for weakly-supervised image segmentation. Pattern Recognit Lett
Tang M, Djelouah A, Perazzi F, Boykov Y, Schroers C (2018) Normalized cut loss for weakly-supervised cnn segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1818–1827
Tong N, Lu H, Ruan X, Yang MH (2015) Salient object detection via bootstrap learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1884–1892
Wang L, Lu H, Ruan X, Yang MH (2015) Deep networks for saliency detection via local estimation and global search. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3183–3192
Wang L, Lu H, Wang Y, Feng M, Wang D, Yin B, Ruan X (2017) Learning to detect salient objects with image-level supervision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 136–145
Wang T, Borji A, Zhang L, Zhang P, Lu H (2017) A stagewise refinement model for detecting salient objects in images. In: Proceedings of the IEEE international conference on computer vision, pp. 4019–4028
Wang T, Zhang L, Wang S, Lu H, Yang G, Ruan X, Borji A (2018) Detect globally, refine locally: A novel approach to saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3127–3135
Wang W, Shen J, Dong X, Borji A (2018) Salient object detection driven by fixation prediction. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1711–1720
Wang W, Zhao S, Shen J, Hoi SC, Borji A (2019) Salient object detection with pyramid attention and salient edges. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 1448–1457
Wang Z, Zhang Y, Liu Y, Liu S, Coleman S, Kerr D (2021) Mfc-net: Multi-feature fusion cross neural network for salient object detection. Image Vision Comput. 104243
Wang Z, Zhang Y, Liu Y, Wang Z, Coleman S, Kerr D (2022) Tf-sod: a novel transformer framework for salient object detection. Neural Comput Appl 34(14):11789–11806
Article Google Scholar
Wei J, Wang S, Huang Q (2020) F\(^3\)net: Fusion, feedback and focus for salient object detection. In: Proceedings of the AAAI conference on artificial intelligence, 34, pp. 12321–12328
Wei Y, Liang X, Chen Y, Shen X, Cheng MM, Feng J, Zhao Y, Yan S (2016) Stc: A simple to complex framework for weakly-supervised semantic segmentation. IEEE transactions on pattern analysis and machine intelligence 39(11):2314–2320
Article Google Scholar
Wei Y, Wen F, Zhu W, Sun J (2012) Geodesic saliency using background priors. In: European conference on computer vision, pp. 29–42. Springer
Wu R, Feng M, Guan W, Wang D, Lu H, Ding E (2019) A mutual learning method for salient object detection with intertwined multi-supervision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8150–8159
Wu Z, Su L, Huang Q (2019) Cascaded partial decoder for fast and accurate salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3907–3916
Wu Z, Su L, Huang Q (2019) Stacked cross refinement network for edge-aware salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 7264–7273
Yan Q, Xu L, Shi J, Jia J (2013) Hierarchical saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1155–1162
Yang C, Zhang L, Lu H, Ruan X, Yang MH (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3166–3173
Yu S, Zhang B, Xiao J, Lim EG (2020) Structure-consistent weakly supervised salient object detection with local saliency coherence. arXiv preprint arXiv:2012.04404
Zeng Y, Feng M, Lu H, Yang G, Borji A (2018) An unsupervised game-theoretic approach to saliency detection. IEEE Transact Image Process 27(9):4545–4554
Article MathSciNet MATH Google Scholar
Zeng Y, Zhang P, Zhang J, Lin Z, Lu H (2019) Towards high-resolution salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 7234–7243
Zeng Y, Zhuge Y, Lu H, Zhang L, Qian M, Yu Y (2019) Multi-source weak supervision for saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 6074–6083
Zhai Y, Shah M (2006) Visual attention detection in video sequences using spatiotemporal cues. In: Proceedings of the 14th ACM international conference on Multimedia, pp. 815–824
Zhang D, Han J, Zhang Y (2017) Supervision by fusion: Towards unsupervised learning of deep salient object detector. In: Proceedings of the IEEE international conference on computer vision, pp. 4048–4056
Zhang D, Tian H, Han J (2021) Few-cost salient object detection with adversarial-paced learning. arXiv preprint arXiv:2104.01928
Zhang J, Yu X, Li A, Song P, Liu B, Dai Y (2020) Weakly-supervised salient object detection via scribble annotations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 12546–12555
Zhang J, Zhang T, Dai Y, Harandi M, Hartley R (2018) Deep unsupervised saliency detection: A multiple noisy labeling perspective. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 9029–9038
Zhang L, Ai J, Jiang B, Lu H, Li X (2017) Saliency detection via absorbing markov chain with learnt transition probability. IEEE Transact Image Process 27(2):987–998
Article MathSciNet MATH Google Scholar
Zhang N, Deng S, Ye H, Zhang W, Chen H (2022) Robust triple extraction with cascade bidirectional capsule network. Expert Syst Appl 187:115806
Article Google Scholar
Zhang Q, Zhang L, Wang D, Shi Y, Lin J (2021) Global and local information aggregation network for edge-aware salient object detection. J Visual Commun Image Represent 81:103350
Article Google Scholar
Zhang X, Wang T, Qi J, Lu H, Wang G (2018) Progressive attention guided recurrent network for salient object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 714–722
Zhao JX, Liu JJ, Fan DP, Cao Y, Yang J, Cheng MM (2019) Egnet: Edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8779–8788
Zhao R, Oyang W, Wang X (2016) Person re-identification by saliency learning. IEEE Transact Pattern Anal Mach Intell 39(2):356–370
Article Google Scholar
Zhao T, Wu X (2019) Pyramid feature attention network for saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3085–3094
Zhao X, Pang Y, Zhang L, Lu H, Zhang L (2020) Suppress and balance: A simple gated network for salient object detection. In: European conference on computer vision, pp. 35–51. Springer
Zhou H, Xie X, Lai JH, Chen Z, Yang L (2020) Interactive two-stream decoder for accurate and fast saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9141–9150
Zhou T, Brown M, Snavely N, Lowe DG (2017) Unsupervised learning of depth and ego-motion from video. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1851–1858
Zhou X, Fang H, Liu Z, Zheng B, Sun Y, Zhang J, Yan C (2021) Dense attention-guided cascaded network for salient object detection of strip steel surface defects. IEEE Transactions on Instrumentation and Measurement
Zhu D, Dai L, Luo Y, Zhang G, Shao X, Itti L, Lu J (2018) Multi-scale adversarial feature learning for saliency detection. Symmetry 10(10):457
Article Google Scholar
Zhu Y, Zhao C, Guo H, Wang J, Zhao X, Lu H (2018) Attention couplenet: Fully convolutional attention coupling network for object detection. IEEE Transact Image Process 28(1):113–126
Article MathSciNet Google Scholar
Zhuge M, Fan DP, Liu N, Zhang D, Xu D, Shao L (2022) Salient object detection via integrity learning. IEEE Transact Pattern Anal Mach Intell

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (No. 61973066, 61471110), Foundation of Key Laboratory of Aerospace System Simulation(6142002200301), Foundation of Key Laboratory of Equipment Reliability(61420030302), Major Science and technology innovation engineering projects of Shandong Province(2019JZZY010128) and Distinguished Creative Talent Program of Liaoning Colleges and Universities (LR2019027).

Author information

Authors and Affiliations

Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China
Yan Liu & Zhenyu Wang
College of Information Science and Engineering, Northeastern University, Shenyang, China
Yunzhou Zhang & Feng Qiu
Beijing Simulation Center, Beijing, China
Rong Ma
School of Computing, Engineering and Intelligent Systems, Ulster University, Coleraine, Northern Ireland, UK
Sonya Coleman & Dermot Kerr

Authors

Yan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yunzhou Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rong Ma
View author publications
You can also search for this author in PubMed Google Scholar
Feng Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Sonya Coleman
View author publications
You can also search for this author in PubMed Google Scholar
Dermot Kerr
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunzhou Zhang.

Ethics declarations

Conflict of interest

No potential conflict of interest was reported by the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liu, Y., Zhang, Y., Wang, Z. et al. WUSL–SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection. Neural Comput & Applic 35, 15837–15856 (2023). https://doi.org/10.1007/s00521-023-08545-6

Download citation

Received: 05 August 2022
Accepted: 28 March 2023
Published: 19 April 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s00521-023-08545-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

WUSL–SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection

Abstract

Access this article

Similar content being viewed by others

Image saliency detection via multi-scale iterative CNN

SalNet: Edge Constraint Based End-to-End Model for Salient Object Detection

Deep Salient Object Detection via Hierarchical Network Learning

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

WUSL–SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection

Abstract

Access this article

Similar content being viewed by others

Image saliency detection via multi-scale iterative CNN

SalNet: Edge Constraint Based End-to-End Model for Salient Object Detection

Deep Salient Object Detection via Hierarchical Network Learning

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation