Abstract
With the rapid advancement of modern medical technology, microscopy imaging systems have become one of the key technologies in medical image analysis. However, manual use of microscopes presents issues such as operator dependency, inefficiency, and time consumption. To enhance the efficiency and accuracy of medical image capture and reduce the burden of subsequent quantitative analysis, this paper proposes an improved microscope salient object detection algorithm based on U2-Net, incorporating deep learning technology. The improved algorithm first enhances the network’s key information extraction capability by incorporating the Convolutional Block Attention Module (CBAM) into U2-Net. It then optimizes network complexity by constructing a Simple Pyramid Pooling Module (SPPM) and uses Ghost convolution to achieve model lightweighting. Additionally, data augmentation is applied to the slides to improve the algorithm’s robustness and generalization. The experimental results show that the size of the improved algorithm model is 72.5 MB, which represents a 56.85% reduction compared to the original U2-Net model size of 168.0 MB. Additionally, the model’s prediction accuracy has increased from 92.24 to 97.13%, providing an efficient means for subsequent image processing and analysis tasks in microscopy imaging systems.
Graphical Abstract











Similar content being viewed by others
References
Qureshi I, Yan J, Abbas Q, Shaheed K, Riaz AB, Wahid A, Khan MW, Szczuko P (2023) Medical image segmentation using deep semantic-based methods: a review of techniques, applications and emerging trends. Inf Fusion 90:316–352
Yi S, Zhang B, Gong L, Xiaosong G (2022) Thoughts on the development strategy of digital medicine. Transp Med 36:131–133
Wang X, Yu S, Lim EG, Wong MLD (2024) Salient object detection: a mini review. Front Signal Process 4:1356793
Keren Fu, Jiang Y, Ji G-P, Zhou T, Zhao Q, Fan D-P (2022) Light field salient object detection: a review and benchmark. Comput Vis Med 8(4):509–534
Wang Y Wang, Wang R, Fan X, Wang T, He X (2023) Pixels, regions, and objects: multiple enhancement for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10031–10040
Talaei Khoei T, OuldSlimane H, Kaabouch N (2023) Deep learning: systematic review, models, challenges, and research directions. Neural Comput Appl 35(31):23103–23124
Zhang X, Zhai D, Li T, Zhou Y, Lin Y (2023) Image inpainting based on deep learning: a review. Inf Fusion 90:74–94
Dhamija T, Gupta A, Gupta S, Anjum, Katarya R, Singh G (2023) Semantic segmentation in medical images through transfused convolution and transformer networks. Appl Intell 53(1):1132–1148
Yu-Huan Wu, Gao S-H, Mei J, Jun Xu, Fan D-P, Zhang R-G, Cheng M-M (2021) Jcs: an explainable covid-19 diagnosis system by joint classification and segmentation. IEEE Trans Image Process 30:3113–3126
Shahin AI, Aly W, Aly S (2023) Mbtfcn: A novel modular fully convolutional network for MRI brain tumor multi-classification. Expert Syst Appl 212:118776
Zhang J, Zhang Y, Jin Y, Jilan Xu, Xiaowei Xu (2023) Mdu-net: multiscale densely connected u-net for biomedical image segmentation. Health Inf Sci Syst 11(1):13
Li J, Liu K, Hu Y, Zhang H, Heidari AA, Chen H, Zhang W, Algarni AD, Elmannai H (2023) Eres-UNet++: liver CT image segmentation based on high-efficiency channel attention and Res-UNet++. Comput Biol Med 158:106501
Huang H, Lin L, Tong R, Hu H, Zhang Q, Iwamoto Y, Han X, Chen YW, Wu J (2020) Unet 3+: a full-scale connected unet for medical image segmentation. In: ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 1055–1059. IEEE
Long J, Yang C, Song X, Zeng Z, Ren Y (2024) Polyp segmentation network based on lightweight model and reverse attention mechanisms. Int J Imaging Syst Technol 34(3):e23062
Li J, Han D, Wang X, Yi P, Yan L, Li X (2023) Multisensor medical-image fusion technique based on embedding bilateral filter in least squares and salient detection. Sensors 23(7):3490
Qin X, Zhang Z, Huang C, Dehghan M, Zaiane OR, Jagersand M (2020) U2-Net: going deeper with nested U-structure for salient object detection. Pattern Recog 106:107404
Hu S, Li H, Hao D (2024) Improved multistage edge-enhanced medical image segmentation network of U-Net. Comput Eng 50(4):286–293. https://doi.org/10.19678/j.issn.1000-3428.0067779
Qin C, Wang Y, Zhang J (2024) Medical image segmentation network based on convolution capsule encoder and multi-scale local feature co-occurrence. Appl Res Comput 41(4):1264–1269
Xu P, Liang Y, Li Y (2023) Medical image segmentation with fusion of multi-scale semantics and residual bottleneck attention. Comput Eng 49:162–170
Zhang X, Zhang S, Zhang D, Liu R (2023) Medical image segmentation model with introduced group attention. J Image Graphics 28:3231–3242
Zheng X, Tang P, Ai L, Liu D, Zhang Y, Wang B (2022) White blood cell detection using saliency detection and CenterNet: a two‐stage approach. J Biophotonics 16. https://doi.org/10.1002/jbio.202200174
You Z, Jiang M, Shi Z, Zhao M, Shi C, Du S, Hérard AS, Souedet N (2022) Delzescaux T Multiscale segmentation- and error-guided iterative convolutional neural network for cerebral neuron segmentation in microscopic images. Microsc Res Tech 85:3541–3552
Asha SB, Gopakumar G, Subrahmanyam GR (2024) Saliency and boundary guided segmentation framework for cell counting in microscopy images. Expert Syst Appl 124309. https://doi.org/10.1016/j.eswa.2024.124309
Ferreira DS, Ramalho GL, Torres D, Tobias AH, Rezende MT, Medeiros FN, Bianchi AG, Carneiro CM, Ushizima DM (2019) Saliency-driven system models for cell analysis with deep learning. Comput Methods Programs Biomed 182:105053
Celik G (2023) Detection of Covid-19 and other pneumonia cases from CT and X-ray chest images using deep learning based on feature reuse residual block and depthwise dilated convolutions neural network. Appl Soft Comput 133:109906
Araujo A, Norris W, Sim J (2019) Computing receptive fields of convolutional neural networks. Distill 4(11):e21
Wang P, Chen P, Yuan Y, Liu D, Huang Z, Hou X, Cottrell G (2018) Understanding convolution for semantic segmentation. In: 2018 IEEE winter conference on applications of computer vision (WACV), pp 1451–1460. Ieee
Niu Z, Zhong G, Hui Yu (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62
Siyu Lu, Liu M, Yin L, Yin Z, Liu X, Zheng W (2023) The multi-modal fusion in visual question answering: a review of attention mechanisms. PeerJ Comput Sci 9:e1400
de Santana Correia A, Colombini EL (2022) Attention, please! a survey of neural attention models in deep learning. Artif Intell Rev 55(8):6037–6124
Woo S, Park J, Lee JY, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Shah HA, Kang JM (2023) An optimized multi-organ cancer cells segmentation for histopathological images based on CBAM-residual U-Net. IEEE Access. https://doi.org/10.1109/ACCESS.2023.3295914
Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890
Peng J, Liu Y, Tang S, Hao Y, Chu L, Chen G, Wu Z, Chen Z, Yu Z, Du Y et al (2022) Pp-liteseg: a superior real-time semantic segmentation model. arXiv preprint arXiv:2204.02681
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, p 1580–1589
Yang L, Zhang R-Y, Li L, Xie X (2021) Simam: a simple, parameterfree attention module for convolutional neural networks. In: International conference on machine learning, pp 11863–11874. PMLR
Yin Y, Han Z, Jian M, Wang G-G, Chen L, Wang R (2023) AMSUnet: a neural network using atrous multi-scale convolution for medical image segmentation. Comput Biol Med 162:107120
Bouguezzi S, Faiedh H, Souani C. Slim mobilenet: an enhanced deep convolutional neural network. In: 2021 18th International Multi-Conference on Systems, Signals & Devices (SSD), pp 12–16. IEEE
Riaz Z, Khan B, Abdullah S, Khan S, Islam MS (2023) Lung tumor image segmentation from computer tomography images using MobileNetV2 and transfer learning. Bioengineering 10(8):981
AbdElaziz M, Dahou A, Alsaleh NA, Elsheikh AH, Saba AI, Ahmadein M (2021) Boosting COVID-19 image classification using MobileNetV3 and aquila optimizer algorithm. Entropy 23(11):1383
Acknowledgements
We would like to express our sincere gratitude to all the authors and colleagues involved in this study for their outstanding contributions to data collection and analysis. We also thank the Hubei Provincial Education Department for their guidance and encouragement throughout the research process.
Funding
We would like to express our sincere gratitude to the Hubei Provincial Education Department for their generous support and funding (Project No. D20201705). This project played an important role in advancing our research and achieving our goals.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, Y., Fang, R., Zhang, N. et al. An improved algorithm for salient object detection of microscope based on U2-Net. Med Biol Eng Comput 63, 383–397 (2025). https://doi.org/10.1007/s11517-024-03205-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-024-03205-w