Skip to main content
Log in

CPSNet: a cyclic pyramid-based small lesion detection network

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The presence of small lesions is an important marker for determining whether a patient will develop malignant tumors. Clinical practitioners could easily overlook the presence of small lesions, meaning automated approaches are essential for screening test results. The use of deep learning-based detectors for this purpose has so far been suboptimal as small lesions easily lose the spatial information during the convolution operation, resulting in unsatisfactory detection accuracy and limited application in clinical decision making. In this paper, we propose a Cyclic Pyramid-based Small lesion detection Network (CPSNet), which iteratively enhances the features in the parallel layer of the Feature Parallel Network (FPN), the features learned in the loop are fused again with the initial FPN to compensate for the inadequacy problem in the initial training. In addition, we propose an aggregated dilation block (ADB) to capture small variations at different scales and a global attention block (GAB) to adaptively recalibrate the channel-based feature responses while focusing on the target spatial information and highlighting the most relevant feature channels. Extensive experiments on eight organs included in the DeepLesion dataset show that our method has a high detection accuracy(mAP=60.4) and a high overall sensitivity(80.5%), which is superior to the state-of-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Guo K, Chen T, Ren S, Li N, Hu M, Kang J (2022) Federated learning empowered real-time medical data processing method for smart healthcare. IEEE/ACM Trans Comput Biol Bioinforma

  2. Guo K, Shen C, Hu B, Hu M, Kui X (2022) Rsnet: relation separation network for few-shot similar class recognition. IEEE Trans Multimed

  3. Lee S-g, Bae JS, Kim H, Kim JH, Yoon S (2018) Liver lesion detection from weakly-labeled multi-phase ct volumes with a grouped single shot multibox detector. In: Medical image computing and computer assisted intervention-MICCAI 2018: 21st international conference, Granada, Spain, September 16-20, 2018, Proceedings, Part II 11, pp 693–701. Springer

  4. Chiao J-Y, Chen K-Y, Liao KY-K, Hsieh P-H, Zhang G, Huang T-C (2019) Detection and classification the breast tumors using mask r-cnn on sonograms. Medicine 98(19)

  5. Tao Q, Ge Z, Cai J, Yin J, See S (2019) Improving deep lesion detection using 3d contextual and spatial attention. In: Medical image computing and computer assisted intervention-MICCAI 2019: 22nd International Conference, Shenzhen, China, October 13–17, 2019, Proceedings, Part VI 22, pp. 185–193. Springer

  6. Cai J, Yan K, Cheng C-T, Xiao J, Liao C-H, Lu L, Harrison AP (2020) Deep volumetric universal lesion detection using light-weight pseudo 3d convolution and surface point regression. In: Medical image computing and computer assisted intervention-MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part IV 23, pp 3–13. Springer

  7. Li H, Chen L, Han H, Kevin Zhou S (2022) Satr: slice attention with transformer for universal lesion detection. In: International conference on medical image computing and computer-assisted intervention, pp 163–174. Springer

  8. Duan K, Bai S, Xie L, Qi H, Huang Q, Tian Q (2022) Centernet++ for object detection. arXiv preprint arXiv:2204.08394

  9. He C, Li K, Zhang Y, Tang L, Zhang Y, Guo Z, Li X (2023) Camouflaged object detection with feature decomposition and edge reconstruction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 22046–22055

  10. Liu Z, Han K, Xue K, Song Y, Liu L, Tang Y, Zhu Y (2022) Improving ct-image universal lesion detection with comprehensive data and feature enhancements. Multimedia Systems 28(5):1741–1752

    Article  Google Scholar 

  11. Girshick R, Donahue J, Darrell T, Malik, J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

  12. Uijlings JR, Van De Sande KE, Gevers T, Smeulders AW (2013) Selective search for object recognition. Int J Comput Vis 104:154–171

    Article  Google Scholar 

  13. Zitnick CL, Dollár P (2014) Edge boxes: locating object proposals from edges. In: Computer Vision-ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part V 13, pp 391–405. Springer

  14. Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90

    Article  Google Scholar 

  15. Yang J, He Y, Huang X, Xu J, Ye X, Tao G, Ni B (2020) Alignshift: bridging the gap of imaging thickness in 3d anisotropic volumes. In: Medical image computing and computer assisted intervention-MICCAI 2020: 23rd International Conference, Lima, Peru, October 4–8, 2020, Proceedings, Part IV 23, pp 562–572. Springer

  16. Yang J, He Y, Kuang K, Lin Z, Pfister H, Ni B (2021) Asymmetric 3d context fusion for universal lesion detection. In: Medical Image Computing and Computer Assisted Intervention-MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part V 24, pp 571–580. Springer

  17. Yang J, Huang X, He Y, Xu J, Yang C, Xu G, Ni B (2021) Reinventing 2d convolutions for 3d images. IEEE J Biomed Health Inform 25(8):3009–3018

    Article  Google Scholar 

  18. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788

  19. Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988

  20. Zhao P, Li H, Jin R, Zhou SK (2023) Diffuld: diffusive universal lesion detection. arXiv preprint arXiv:2303.15728

  21. Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708

  22. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  23. Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122

  24. Liu S, Huang D et al (2018) Receptive field block net for accurate and fast object detection. In: Proceedings of the European conference on computer vision (ECCV), pp 385–400

  25. Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473

  26. Cheng J, Dong L, Lapata M (2016) Long short-term memory-networks for machine reading. arXiv preprint arXiv:1601.06733

  27. Li Z, Peng C, Yu G, Zhang X, Deng Y, Sun J (2018) Detnet: A backbone network for object detection. arXiv preprint arXiv:1804.06215

  28. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30

  29. Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803

  30. Zhang H, Goodfellow I, Metaxas D, Odena A (2019) Self-attention generative adversarial networks. In: International conference on machine learning, pp 7354–7363. PMLR

  31. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  32. Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28

  33. Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767

  34. He K, Gkioxari G, Dollár P, Girshick R (2017) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969

  35. Tang Y-B, Yan K, Tang Y-X, Liu J, Xiao J, Summers RM (2019) Uldor: a universal lesion detector for ct scans with pseudo masks and hard negative example mining. In: 2019 IEEE 16th International symposium on biomedical imaging (ISBI 2019), pp 833–836. IEEE

  36. Yan K, Bagheri M, Summers RM (2018) 3d context enhanced region-based convolutional neural network for end-to-end lesion detection. In: Medical image computing and computer assisted intervention-MICCAI 2018: 21st International Conference, Granada, Spain, September 16–20, 2018, Proceedings, Part I, pp 511–519 . Springer

  37. Li F, Zhang H, Liu S, Guo J, Ni LM, Zhang L (2022) Dn-detr: accelerate detr training by introducing query denoising. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13619–13627

  38. Yan K, Wang X, Lu L, Summers RM (2017) Deeplesion: automated deep mining, categorization and detection of significant radiology image findings using largescale clinical lesion annotations. arXiv preprint arXiv:1710.01766

Download references

Acknowledgements

The authors would like to thank radiologists of the Medical Imaging Department of Affiliated Hospital of Jiangsu University. This work was supported by the National Natural Science Foundation of China (61976106); China Postdoctoral Science Foundation (2017M611737); Six talent peaks project in Jiangsu Province (DZXX-122); Key RESEARCH and development program for social development (SH2021056); Zhenjiang city key research and development plan (SH2020011); Jiangsu province emergency management science and technology project (YJGL-TG-2020-8).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yi Liu.

Ethics declarations

Declarations

This paper is the expanding version of conference paper that was published in International Conference on Artificial Intelligence and Security. The corresponding reference is Tang, Y., Liu, Z., Song, Y., Han, K., Su, J., Wang, W., ... & Zhang, J. Automatic CT Lesion Detection Based on Feature Pyramid Inference with Multi-scale Response In International Conference on Artificial Intelligence and Security, pp. 167-179, Springer, Cham, 2021.

Conflict of interest

The authors declare no confict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhu, Y., Liu, Z., Song, Y. et al. CPSNet: a cyclic pyramid-based small lesion detection network. Multimed Tools Appl 83, 39983–40001 (2024). https://doi.org/10.1007/s11042-023-17024-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17024-y

Keywords