Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development

Jiang, Yuncheng; Hu, Yiwen; Zhang, Zixun; Wei, Jun; Feng, Chun-Mei; Tang, Xuemei; Wan, Xiang; Liu, Yong; Cui, Shuguang; Li, Zhen

doi:10.1007/978-3-031-72111-3_69

Yuncheng Jiang^14,15,16,17,
Yiwen Hu^14,15,
Zixun Zhang^14,15,16,
Jun Wei^14,15,
Chun-Mei Feng¹⁶,
Xuemei Tang¹⁸,
Xiang Wan¹⁷,
Yong Liu¹⁶,
Shuguang Cui^14,15 &
…
Zhen Li^14,15

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 15008))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

1687 Accesses

Abstract

Endorectal ultrasound (ERUS) is an important imaging modality that provides high reliability for diagnosing the depth and boundary of invasion in colorectal cancer. However, the lack of a large-scale ERUS dataset with high-quality annotations hinders the development of automatic ultrasound diagnostics. In this paper, we collected and annotated the first benchmark dataset that covers diverse ERUS scenarios, i.e. colorectal cancer segmentation, detection, and infiltration depth staging. Our ERUS-10K dataset comprises 77 videos and 10,000 high-resolution annotated frames. Based on this dataset, we further introduce a benchmark model for colorectal cancer segmentation, named the Adaptive Sparse-context TRansformer (ASTR). ASTR is designed based on three considerations: scanning mode discrepancy, temporal information, and low computational complexity. For generalizing to different scanning modes, the adaptive scanning-mode augmentation is proposed to convert between raw sector images and linear scan ones. For mining temporal information, the sparse-context transformer is incorporated to integrate inter-frame local and global features. For reducing computational complexity, the sparse-context block is introduced to extract contextual features from auxiliary frames. Finally, on the benchmark dataset, the proposed ASTR model achieves a 77.6% Dice score in rectal cancer segmentation, largely outperforming previous state-of-the-art methods.

Y. Jiang, Y. Hu, Z. Zhang—Equal contributions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Rethinking Breast Lesion Segmentation in Ultrasound: A New Video Dataset and A Baseline Network

Optimized Breast Lesion Segmentation in Ultrasound Videos Across Varied Resource-Scant Environments

Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos

References

Chen, J., et al.: Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 (2021)
Cheng, X., et al.: Implicit motion handling for video camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13864–13873 (2022)
Google Scholar
Du, Y., Jiang, Y., Tan, S., Wu, X., Dou, Q., Li, Z., Li, G., Wan, X.: Arsdm: colonoscopy images synthesis with adaptive refinement semantic diffusion models. In: International conference on medical image computing and computer-assisted intervention, pp. 339–349. Springer (2023). https://doi.org/10.1007/978-3-031-43895-0_32
Favoriti, P., Carbone, G., Greco, M., Pirozzi, F., Pirozzi, R.E.M., Corcione, F.: Worldwide burden of colorectal cancer: a review. Updat. Surg. 68, 7–11 (2016)
Article Google Scholar
Gao, S.H., Cheng, M.M., Zhao, K., Zhang, X.Y., Yang, M.H., Torr, P.: Res2net: A new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652–662 (2019)
Article Google Scholar
Ghiasi, G., et al.: Simple copy-paste is a strong data augmentation method for instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2918–2928 (2021)
Google Scholar
Han, H., Liao, H., Zhang, D., Kong, W., Chen, F.: Thyroid nodule diagnosis in dynamic contrast-enhanced ultrasound via microvessel infiltration awareness. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 169–179. Springer (2023). https://doi.org/10.1007/978-3-031-43987-2_17
Hünerbein, M.: Endorectal ultrasound in rectal cancer. Colorectal Dis. 5(5), 402–405 (2003)
Article Google Scholar
Ji, G.P., et al.: Progressively normalized self-attention network for video polyp segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 142–152. Springer (2021). https://doi.org/10.1007/978-3-030-87193-2_14
Li, J., et al.: Dsmt-net: dual self-supervised multi-operator transformation for multi-source endoscopic ultrasound diagnosis. IEEE Trans. Med. Imaging (2023)
Google Scholar
Li, J., Zheng, Q., Li, M., Liu, P., Wang, Q., Sun, L., Zhu, L.: Rethinking breast lesion segmentation in ultrasound: a new video dataset and a baseline network. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 391–400. Springer (2022). https://doi.org/10.1007/978-3-031-16440-8_38
Li, J., Wang, W., Chen, J., Niu, L., Si, J., Qian, C., Zhang, L.: Video semantic segmentation via sparse temporal transformer. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 59–68 (2021)
Google Scholar
Lin, J., et al.: Shifting more attention to breast lesion segmentation in ultrasound videos. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 497–507. Springer (2023)
Google Scholar
Ma, J., He, Y., Li, F., Han, L., You, C., Wang, B.: Segment anything in medical images. Nat. Commun. 15(1), 654 (2024)
Google Scholar
Rieger, N., Tjandra, J., Solomon, M.: Endoanal and endorectal ultrasound: applications in colorectal surgery. ANZ J. Surg. 74(8), 671–675 (2004)
Article Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Sun, G., Liu, Y., Ding, H., Probst, T., Van Gool, L.: Coarse-to-fine feature mining for video semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3126–3137 (2022)
Google Scholar
Wang, Y., et al.: Deep attentive features for prostate segmentation in 3d transrectal ultrasound. IEEE Trans. Med. Imaging 38(12), 2768–2778 (2019)
Article Google Scholar
Wei, J., Hu, Y., Zhang, R., Li, Z., Zhou, S.K., Cui, S.: Shallow attention network for polyp segmentation. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12901, pp. 699–708. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87193-2_66
Chapter Google Scholar
Zhang, M., et al.: Dynamic context-sensitive filtering network for video salient object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1553–1563 (2021)
Google Scholar
Zheng, S., et al.: Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6881–6890 (2021)
Google Scholar

Download references

Acknowledgments

This work was supported by NSFC with Grant No. 62293482, by the Basic Research Project No. HZQB-KCZYZ-2021067 of Hetao Shenzhen HK S&T Cooperation Zone, by Shenzhen General Program No. JCYJ20220530143600001, by Shenzhen-Hong Kong Joint Funding No. SGDX20211123112401002, by the Shenzhen Outstanding Talents Training Fund 202002, by Guangdong Research Project No. 2017ZT07X152 and No. 2019CX01X104, by the Guangdong Provincial Key Laboratory of Future Networks of Intelligence (Grant No. 2022B1212010001), by the Guangdong Provincial Key Laboratory of Big Data Computing, %The Chinese University of Hong Kong, Shenzhen CHUK-Shenzhen, by the NSFC 61931024 &12326610, by Key Area R&D Program of Guangdong Province with grant No. 2018B030338001, by the Key Area R&D Program of Guangdong Province with grant No. 2018B030338001, by the Shenzhen Key Laboratory of Big Data and Artificial Intelligence (Grant No. ZDSYS201707251409055), by Tencent & Huawei Open Fund, by the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG2-TC-2021-003), and by the Agency for Science, Technology and Research (A*STAR) through its AME Programmatic Funding Scheme Under Project A20H4b0141.

Author information

Authors and Affiliations

Shenzhen Future Network of Intelligence Insititute (FNii), Shenzhen, China
Yuncheng Jiang, Yiwen Hu, Zixun Zhang, Jun Wei, Shuguang Cui & Zhen Li
SSE, The Chinese University of Hong Kong, Shenzhen, China
Yuncheng Jiang, Yiwen Hu, Zixun Zhang, Jun Wei, Shuguang Cui & Zhen Li
IHPC, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Yuncheng Jiang, Zixun Zhang, Chun-Mei Feng & Yong Liu
Shenzhen Research Institute of Big Data (SRIBD), Shenzhen, China
Yuncheng Jiang & Xiang Wan
Affiliated Hospital of North Sichuan Medical College, Sichuan, China
Xuemei Tang

Authors

Yuncheng Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yiwen Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zixun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Wei
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Mei Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xuemei Tang
View author publications
You can also search for this author in PubMed Google Scholar
Xiang Wan
View author publications
You can also search for this author in PubMed Google Scholar
Yong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuguang Cui
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhen Li .

Editor information

Editors and Affiliations

Children’s National Hospital/George Washington University, Washington, DC, USA
Marius George Linguraru
The Chinese University of Hong Kong, Hong Kong, China
Qi Dou
Technical University of Denmark, Kgs Lyngby, Denmark
Aasa Feragen
Imperial College London, London, UK
Stamatia Giannarou
Imperial College London, London, UK
Ben Glocker
Universitat de Barcelona, Barcelona, Spain
Karim Lekadir
Helmholtz Munich, Technical University of Munich and King’s College London, Munich, Germany
Julia A. Schnabel

Ethics declarations

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 5374 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, Y. et al. (2024). Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development. In: Linguraru, M.G., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. MICCAI 2024. Lecture Notes in Computer Science, vol 15008. Springer, Cham. https://doi.org/10.1007/978-3-031-72111-3_69

Download citation

DOI: https://doi.org/10.1007/978-3-031-72111-3_69
Published: 06 October 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72110-6
Online ISBN: 978-3-031-72111-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Rethinking Breast Lesion Segmentation in Ultrasound: A New Video Dataset and A Baseline Network

Optimized Breast Lesion Segmentation in Ultrasound Videos Across Varied Resource-Scant Environments

Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Disclosure of Interests

1 Electronic supplementary material

Supplementary material 1 (pdf 5374 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Rethinking Breast Lesion Segmentation in Ultrasound: A New Video Dataset and A Baseline Network

Optimized Breast Lesion Segmentation in Ultrasound Videos Across Varied Resource-Scant Environments

Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Disclosure of Interests

1 Electronic supplementary material

Supplementary material 1 (pdf 5374 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation