Abstract
In the salient object detection (SOD) models based on convolutional neural network (CNN), the high-level semantic features and low-level features of the image are effectively fused and complementary, which can effectively improve the performance of SOD. However, there are usually great differences between high-level semantic features and low-level features, and low-level features are often rich in noise. How to make full use of different features and avoid noise interference is a hot issue for researchers. Different from the traditional methods, this paper proposes a novel feature extraction and fusion network (EFNet). By setting a middle-level feature extraction module as the medium for the fusion of high-level semantic features and low-level image features, this special module integrates the two by reducing the difference between low-level image features and deep semantic features; In addition, a feature enhancement module is applied to enhance the image features, and the proposed SOD method can obtain good performance. Experimental results on five benchmark datasets show that the proposed method outperforms 15 state-of-the-art methods on five important evaluation metrics. Code will be available at: https://github.com/dc3234/EFNet.
Similar content being viewed by others
References
Atrish A, Singh N, Kumar K , Kumar V (2017) An automated hierarchical framework for player recognition in sports image. In: Proceedings of the international conference on video and image processing, pp 103–108
Chen Z, Xu Q, Cong R, Huang Q (2020) Global context-aware progressive aggregation network for salient object detection. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 10599–10606
Cheng MM, Zhang FL, Mitra NJ, Huang X, Hu SM (2010) Repfinder: finding approximately repeated scene elements for image editing. ACM transactions on graphics (TOG) 29(4):1–8
Fang C, Tian H, Zhang D, Zhang Q, Han J, Han J (2021) Densely Nested Top-Down Flows for Salient Object Detection. arXiv:2102.09133
Feng M, Lu H, Ding E (2019) Attentive feedback network for boundary-aware salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1623–1632
Gu Y, Wang L, Wang Z, Liu Y, Cheng MM, Lu SP (2020) Pyramid constrained self-attention network for fast video salient object detection. In proceedings of the AAAI conference on artificial intelligence 34(07):10869–10876
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Hou Q, Cheng MM, Hu X, Borji A, Tu Z, Torr PH (2017) Deeply supervised salient object detection with short connections. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3203–3212
Kang D, Park S, Paik J (2020) Sd BAN: Salient object detection using bilateral attention network with dice coefficient loss. IEEE Access 8:104357–104370
Krishna R, Kumar K (2020) P-MEC: polynomial congruence based multimedia encryption technique over cloud. IEEE consumer electronics magazine
Kumar K (2021) Text query based summarized event searching interface system using deep learning over cloud. Multimed Tools Appl 80:11079–11094
Kumar K, Kumar A, Bahuguna A (2017) D-CAD: Deep and crowded anomaly detection. In: Inproceedings of the 7th International Conference on Computer and Communication Technology, pp 100–105
Kumar K, Shrimankar DD (2017) F-DES: Fast and deep event summarization. IEEE Transactions on Multimedia 20(2):323–334
Kumar K, Shrimankar DD, Singh N (2018) Eratosthenes sieve based key-frame extraction technique for event summarization in videos. Multimed Tools Appl 77(6):7383–7404
Kumar K, Shrimankar DD, Singh N (2019) Key-lectures: keyframes extraction in video lectures. In: Machine intelligence and signal analysis. Springer, Singapore, pp 453–459
Kumar A, Singh N, Kumar P, Vijayvergia A, Kumar K (2017) A novel superpixel based color spatial feature for salient object detection. In: 2017 conference on information and communication technology (CICT). IEEE, pp 1–5
Lee H, Kim D (2018, March) Salient region-based online object tracking. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, pp 1170–1177
Li Y, Hou X, Koch C, Rehg JM, Yuille AL (2014) The secrets of salient object segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 280–287
Li G, Yu Y (2015) Visual saliency based on multiscale deep features. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5455–5463
Liang Y, Qin G, Sun M, Yan J, Jiang H (2021) MAFNEt: Multi-style attention fusion network for salient object detection. Neurocomputing 422:22–33
Liu Y, Cheng MM, Zhang XY, Nie GY (2021) DNA: deeply supervised nonlinear aggregation for salient object detection. IEEE Transactions on Cybernetics
Liu N, Han J, Yang MH (2018) Picanet: Learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3089–3098
Liu JJ, Hou Q, Cheng MM (2020) Dynamic feature integration for simultaneous detection of salient object, edge, and skeleton. IEEE Trans Image Process 29:8652–8667
Liu JJ, Hou Q, Cheng MM, Feng J, Jiang J (2019) A simple pooling-based design for real-time salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3917–3926
Liu Y, Zhang XY, Bian JW, Zhang L, Cheng MM (2021) Samnet: Stereoscopically attentive multi-scale network for lightweight salient object detection. IEEE Trans Image Process 30:3804–3814
Luo W, Li Y, Urtasun R, Zemel R (2016) Understanding the effective receptive field in deep convolutional neural networks. In: Proceedings of the 30th International conference on neural information processing systems, pp 4905–4913
Ma YF, Lu L, Zhang HJ, Li M (2002, December) A user attention model for video summarization. In: Proceedings of the tenth ACM International conference on multimedia, pp 533–542
Mohammadi S, Noori M, Bahri A, Majelan SG, Havaei M (2020) CAGNEt: content-aware guidance for salient object detection. Pattern Recogn 103:107303
Noori M, Mohammadi S, Majelan SG, Bahri A, Havaei M (2020) DFNEt: Discriminative feature extraction and integration network for salient object detection. Eng Appl Artif Intell 89:103419
Pang Y, Zhao X, Zhang L, Lu H (2020) Multi-scale interactive network for salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 9413–9422
Qin X, Zhang Z, Huang C, Gao C, Dehghan M, Jagersand M (2019) Basnet: Boundary-aware salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7479–7489
Ravi V, Narasimhan H, Chakraborty C, Pham TD (2021) Deep learning-based meta-classifier approach for COVID-19 classification using CT scan and chest X-ray images
Ravi V, Narasimhan H, Pham TD (2021) EfficientNet-Based Convolutional Neural Networks for Tuberculosis Classification. In: Advances in Artificial Intelligence, Computation, and Data Science. Springer, Cham, pp 227–244
Sharma S, Kumar K (2021) ASL-3DCNN: American sign language recognition technique using 3-D convolutional neural networks. Multimed Tools Appl, 1–13
Sharma S, Kumar K, Singh N (2017) D-FES: Deep facial expression recognition system. In: 2017 conference on information and communication technology (CICT). IEEE, pp 1–6
Sharma S, Kumar K, Singh N (2020) Deep eigen space based asl recognition system. IETE J Res, 1–11
Simonyan K (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Wang L, Lu H, Wang Y, Feng M, Wang D, Yin B, Ruan X (2017) Learning to detect salient objects with image-level supervision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 136–145
Wei J, Wang S, Huang Q (2020) F3Net: fusion, Feedback and Focus for Salient Object Detection. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 12321–12328
Wu R, Feng M, Guan W, Wang D, Lu H, Ding E (2019) A mutual learning method for salient object detection with intertwined multi-supervision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8150–8159
Wu Z, Su L, Huang Q (2019) Cascaded partial decoder for fast and accurate salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3907–3916
Xi T, Zhao W, Wang H, Lin W (2016) Salient object detection with spatiotemporal background priors for video. IEEE Trans Image Process 26(7):3425–3436
Yan Q, Xu L, Shi J, Jia J (2013) Hierarchical saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1155–1162
Yang C, Zhang L, Lu H, Ruan X, Yang MH (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3166–3173
Zhang L, Dai J, Lu H, He Y, Wang G (2018) A bi-directional message passing model for salient object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1741–1750
Zhang P, Wang D, Lu H, Wang H, Ruan X (2017) Amulet: Aggregating multi-level convolutional features for salient object detection. In: Proceedings of the IEEE International conference on computer vision, pp 202–211
Zhao JX, Liu JJ, Fan DP, Cao Y, Yang J, Cheng MM (2019) EGNEt: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF International conference on computer vision, pp 8779–8788
Zhao X, Pang Y, Zhang L, Lu H, Zhang L (2020) Suppress and balance: a simple gated network for salient object detection. In: European conference on computer vision. Springer, Cham, pp 35–51
Zinkevich M, Weimer M, Smola AJ, Li L (2010) Parallelized stochastic gradient descent. In: NIPS, vol 4, p 4
Acknowledgements
This research project was supported by the Natural Science Foundation of Zhejiang Province of China (NO.LY19F030013).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Dai, C., Pan, C. & He, W. Feature extraction and fusion network for salient object detection. Multimed Tools Appl 81, 33955–33969 (2022). https://doi.org/10.1007/s11042-022-12394-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12394-1