Skip to main content
Log in

An improved algorithm for salient object detection of microscope based on U2-Net

  • Original Article
  • Published:
Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Abstract

With the rapid advancement of modern medical technology, microscopy imaging systems have become one of the key technologies in medical image analysis. However, manual use of microscopes presents issues such as operator dependency, inefficiency, and time consumption. To enhance the efficiency and accuracy of medical image capture and reduce the burden of subsequent quantitative analysis, this paper proposes an improved microscope salient object detection algorithm based on U2-Net, incorporating deep learning technology. The improved algorithm first enhances the network’s key information extraction capability by incorporating the Convolutional Block Attention Module (CBAM) into U2-Net. It then optimizes network complexity by constructing a Simple Pyramid Pooling Module (SPPM) and uses Ghost convolution to achieve model lightweighting. Additionally, data augmentation is applied to the slides to improve the algorithm’s robustness and generalization. The experimental results show that the size of the improved algorithm model is 72.5 MB, which represents a 56.85% reduction compared to the original U2-Net model size of 168.0 MB. Additionally, the model’s prediction accuracy has increased from 92.24 to 97.13%, providing an efficient means for subsequent image processing and analysis tasks in microscopy imaging systems.

Graphical Abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Qureshi I, Yan J, Abbas Q, Shaheed K, Riaz AB, Wahid A, Khan MW, Szczuko P (2023) Medical image segmentation using deep semantic-based methods: a review of techniques, applications and emerging trends. Inf Fusion 90:316–352

    Article  Google Scholar 

  2. Yi S, Zhang B, Gong L, Xiaosong G (2022) Thoughts on the development strategy of digital medicine. Transp Med 36:131–133

    Google Scholar 

  3. Wang X, Yu S, Lim EG, Wong MLD (2024) Salient object detection: a mini review. Front Signal Process 4:1356793

    Article  Google Scholar 

  4. Keren Fu, Jiang Y, Ji G-P, Zhou T, Zhao Q, Fan D-P (2022) Light field salient object detection: a review and benchmark. Comput Vis Med 8(4):509–534

    Article  Google Scholar 

  5. Wang Y Wang, Wang R, Fan X, Wang T, He X (2023) Pixels, regions, and objects: multiple enhancement for salient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10031–10040

  6. Talaei Khoei T, OuldSlimane H, Kaabouch N (2023) Deep learning: systematic review, models, challenges, and research directions. Neural Comput Appl 35(31):23103–23124

    Article  Google Scholar 

  7. Zhang X, Zhai D, Li T, Zhou Y, Lin Y (2023) Image inpainting based on deep learning: a review. Inf Fusion 90:74–94

    Article  Google Scholar 

  8. Dhamija T, Gupta A, Gupta S, Anjum, Katarya R, Singh G (2023) Semantic segmentation in medical images through transfused convolution and transformer networks. Appl Intell 53(1):1132–1148

    Article  Google Scholar 

  9. Yu-Huan Wu, Gao S-H, Mei J, Jun Xu, Fan D-P, Zhang R-G, Cheng M-M (2021) Jcs: an explainable covid-19 diagnosis system by joint classification and segmentation. IEEE Trans Image Process 30:3113–3126

    Article  Google Scholar 

  10. Shahin AI, Aly W, Aly S (2023) Mbtfcn: A novel modular fully convolutional network for MRI brain tumor multi-classification. Expert Syst Appl 212:118776

    Article  Google Scholar 

  11. Zhang J, Zhang Y, Jin Y, Jilan Xu, Xiaowei Xu (2023) Mdu-net: multiscale densely connected u-net for biomedical image segmentation. Health Inf Sci Syst 11(1):13

    Article  PubMed  PubMed Central  Google Scholar 

  12. Li J, Liu K, Hu Y, Zhang H, Heidari AA, Chen H, Zhang W, Algarni AD, Elmannai H (2023) Eres-UNet++: liver CT image segmentation based on high-efficiency channel attention and Res-UNet++. Comput Biol Med 158:106501

    Article  PubMed  Google Scholar 

  13. Huang H, Lin L, Tong R, Hu H, Zhang Q, Iwamoto Y, Han X, Chen YW, Wu J (2020) Unet 3+: a full-scale connected unet for medical image segmentation. In: ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 1055–1059. IEEE

  14. Long J, Yang C, Song X, Zeng Z, Ren Y (2024) Polyp segmentation network based on lightweight model and reverse attention mechanisms. Int J Imaging Syst Technol 34(3):e23062

    Article  Google Scholar 

  15. Li J, Han D, Wang X, Yi P, Yan L, Li X (2023) Multisensor medical-image fusion technique based on embedding bilateral filter in least squares and salient detection. Sensors 23(7):3490

    Article  PubMed  PubMed Central  Google Scholar 

  16. Qin X, Zhang Z, Huang C, Dehghan M, Zaiane OR, Jagersand M (2020) U2-Net: going deeper with nested U-structure for salient object detection. Pattern Recog 106:107404

    Article  Google Scholar 

  17. Hu S, Li H, Hao D (2024) Improved multistage edge-enhanced medical image segmentation network of U-Net. Comput Eng 50(4):286–293. https://doi.org/10.19678/j.issn.1000-3428.0067779

  18. Qin C, Wang Y, Zhang J (2024) Medical image segmentation network based on convolution capsule encoder and multi-scale local feature co-occurrence. Appl Res Comput 41(4):1264–1269

  19. Xu P, Liang Y, Li Y (2023) Medical image segmentation with fusion of multi-scale semantics and residual bottleneck attention. Comput Eng 49:162–170

    Google Scholar 

  20. Zhang X, Zhang S, Zhang D, Liu R (2023) Medical image segmentation model with introduced group attention. J Image Graphics 28:3231–3242

    Article  CAS  Google Scholar 

  21. Zheng X, Tang P, Ai L, Liu D, Zhang Y, Wang B (2022) White blood cell detection using saliency detection and CenterNet: a two‐stage approach. J Biophotonics 16. https://doi.org/10.1002/jbio.202200174

  22. You Z, Jiang M, Shi Z, Zhao M, Shi C, Du S, Hérard AS, Souedet N (2022) Delzescaux T Multiscale segmentation- and error-guided iterative convolutional neural network for cerebral neuron segmentation in microscopic images. Microsc Res Tech 85:3541–3552

    Article  PubMed  Google Scholar 

  23. Asha SB, Gopakumar G, Subrahmanyam GR (2024) Saliency and boundary guided segmentation framework for cell counting in microscopy images. Expert Syst Appl 124309. https://doi.org/10.1016/j.eswa.2024.124309

  24. Ferreira DS, Ramalho GL, Torres D, Tobias AH, Rezende MT, Medeiros FN, Bianchi AG, Carneiro CM, Ushizima DM (2019) Saliency-driven system models for cell analysis with deep learning. Comput Methods Programs Biomed 182:105053

    Article  PubMed  Google Scholar 

  25. Celik G (2023) Detection of Covid-19 and other pneumonia cases from CT and X-ray chest images using deep learning based on feature reuse residual block and depthwise dilated convolutions neural network. Appl Soft Comput 133:109906

    Article  PubMed  Google Scholar 

  26. Araujo A, Norris W, Sim J (2019) Computing receptive fields of convolutional neural networks. Distill 4(11):e21

    Article  Google Scholar 

  27. Wang P, Chen P, Yuan Y, Liu D, Huang Z, Hou X, Cottrell G (2018) Understanding convolution for semantic segmentation. In: 2018 IEEE winter conference on applications of computer vision (WACV), pp 1451–1460. Ieee

  28. Niu Z, Zhong G, Hui Yu (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62

    Article  Google Scholar 

  29. Siyu Lu, Liu M, Yin L, Yin Z, Liu X, Zheng W (2023) The multi-modal fusion in visual question answering: a review of attention mechanisms. PeerJ Comput Sci 9:e1400

    Article  Google Scholar 

  30. de Santana Correia A, Colombini EL (2022) Attention, please! a survey of neural attention models in deep learning. Artif Intell Rev 55(8):6037–6124

    Article  Google Scholar 

  31. Woo S, Park J, Lee JY, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19

  32. Shah HA, Kang JM (2023) An optimized multi-organ cancer cells segmentation for histopathological images based on CBAM-residual U-Net. IEEE Access. https://doi.org/10.1109/ACCESS.2023.3295914

  33. Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890

  34. Peng J, Liu Y, Tang S, Hao Y, Chu L, Chen G, Wu Z, Chen Z, Yu Z, Du Y et al (2022) Pp-liteseg: a superior real-time semantic segmentation model. arXiv preprint arXiv:2204.02681

  35. Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, p 1580–1589

  36. Yang L, Zhang R-Y, Li L, Xie X (2021) Simam: a simple, parameterfree attention module for convolutional neural networks. In: International conference on machine learning, pp 11863–11874. PMLR

  37. Yin Y, Han Z, Jian M, Wang G-G, Chen L, Wang R (2023) AMSUnet: a neural network using atrous multi-scale convolution for medical image segmentation. Comput Biol Med 162:107120

    Article  CAS  PubMed  Google Scholar 

  38. Bouguezzi S, Faiedh H, Souani C. Slim mobilenet: an enhanced deep convolutional neural network. In: 2021 18th International Multi-Conference on Systems, Signals & Devices (SSD), pp 12–16. IEEE

  39. Riaz Z, Khan B, Abdullah S, Khan S, Islam MS (2023) Lung tumor image segmentation from computer tomography images using MobileNetV2 and transfer learning. Bioengineering 10(8):981

    Article  PubMed  PubMed Central  Google Scholar 

  40. AbdElaziz M, Dahou A, Alsaleh NA, Elsheikh AH, Saba AI, Ahmadein M (2021) Boosting COVID-19 image classification using MobileNetV3 and aquila optimizer algorithm. Entropy 23(11):1383

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We would like to express our sincere gratitude to all the authors and colleagues involved in this study for their outstanding contributions to data collection and analysis. We also thank the Hubei Provincial Education Department for their guidance and encouragement throughout the research process.

Funding

We would like to express our sincere gratitude to the Hubei Provincial Education Department for their generous support and funding (Project No. D20201705). This project played an important role in advancing our research and achieving our goals.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Run Fang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Fang, R., Zhang, N. et al. An improved algorithm for salient object detection of microscope based on U2-Net. Med Biol Eng Comput 63, 383–397 (2025). https://doi.org/10.1007/s11517-024-03205-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11517-024-03205-w

Keywords