Abstract
Polyps detection plays an important role in colonoscopy, cancer diagnosis, and early treatment. Many efforts have been made to improve the encoder-decoder framework using the global feature with an attention mechanism to enhance local features, helping to effectively segment diversity polyps. However, using only global information derived from the last encoder block leads to the loss of regional information from intermediate layers. Furthermore, defining the boundaries of some polyps is challenging because there is visual interference between the benign region and the polyps at the border. To address these problems, we propose two novel modules: the Cascading Context module (CCM) and the Attention Balance module (BAM), aiming to build an effective polyp segmentation model. Specifically, CCM combines the extracted regional information of the current layer and the lower layer, then pours it into the upper layer - fusing regional and global information analogous to a waterfall pattern. The BAM uses the prediction output of the adjacent lower layer as a guide map to implement the attention mechanism for the three regions separately: the background, polyp, and boundary curve. BAM enhances local context information when deriving features from the encoder block. Our proposed approach is evaluated on three benchmark datasets with six evaluation metrics for segmentation quality and gives competitive results compared to other advanced methods, for both accuracy and efficiency. Code is available at https://github.com/ntcongvn/CCBANet.
T.-C. Nguyen, T.-P. Nguyen and G.-H. Diep—Equal contribution.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bernal, J., Sánchez, F.J., Fernández-Esparrach, G., Gil, D., Rodríguez, C., Vilariño, F.: Wm-dova maps for accurate polyp highlighting in colonoscopy: Validation vs saliency maps from physicians. Comput Med. Imaging Graph. 43, 99–111 (2015)
Fan, D.P., et al.: Pranet: parallel reverse attention network for polyp segmentation. In: Martel, A.L., Aet al. (eds.) Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, pp. 263–273. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59725-2_26
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on imagenet classification. CoRR abs/1502.01852 (2015), http://arxiv.org/abs/1502.01852
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018). https://doi.org/10.1109/CVPR.2018.00745
Jha, D., et al.: Resunet++: an advanced architecture for medical image segmentation. In: 2019 IEEE International Symposium on Multimedia (ISM), pp. 225–2255 (2019). https://doi.org/10.1109/ISM46123.2019.00049
Nguyen, D.V., Tran, H.T.T., Thang, T.C.: A delay-aware adaptation framework for cloud gaming under the computation constraint of user devices. In: Ro, Y.M., et al. (eds.) MMM 2020. LNCS, vol. 11962, pp. 27–38. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-37734-2_3
Li, H., Xiong, P., An, J., Wang, L.: Pyramid attention network for semantic segmentation (2018)
Lin, C.Y., Chiu, Y.C., Ng, H.F., Shih, T.K., Lin, K.H.: Global-and-local context network for semantic segmentation of street view images. Sensors 20(10) (2020). https://doi.org/10.3390/s20102907, https://www.mdpi.com/1424-8220/20/10/2907
Qin, X., Zhang, Z., Huang, C., Dehghan, M., Zaiane, O.R., Jagersand, M.: U2-net: going deeper with nested u-structure for salient object detection. Pattern Recogn. 106, 107404 (2020). https://doi.org/10.1016/j.patcog.2020.107404, http://dx.doi.org/10.1016/j.patcog.2020.107404
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Sung, H., et al.: Global cancer statistics 2020: globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. (2021)
Vázquez, D., et al.: A benchmark for endoluminal scene segmentation of colonoscopy images. J. Healthcare Eng. 2017 (2017)
Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018). https://doi.org/10.1109/CVPR.2018.00813
Zhang, R., Li, G., Li, Z., Cui, S., Qian, D., Yu, Y.: Adaptive context selection for polyp segmentation. In: Martel, A.L., et al. (eds.) Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, pp. 253–262. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59725-2_25
Zhang, Z., Liu, Q., Wang, Y.: Road extraction by deep residual u-net. IEEE Geosci. Remote Sens. Lett. 15(5), 749–753 (2018)
Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: UNet++: a nested u-net architecture for medical image segmentation. In: Stoyanov, D., et al. (eds.) DLMIA/ML-CDS -2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5_1
Acknowledgements
This research is funded by Vietnam National University HoChiMinh City (VNU-HCM) under grant number DS2020-42-01.
Gia-Han Diep was funded by Vingroup Joint Stock Company and supported by the Domestic Master/ PhD Scholarship Programme of Vingroup Innovation Foundation (VINIF), Vingroup Big Data Institute (VINBIGDATA), code VINIF.2020.ThS.JVN.04.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Nguyen, TC., Nguyen, TP., Diep, GH., Tran-Dinh, AH., Nguyen, T.V., Tran, MT. (2021). CCBANet: Cascading Context and Balancing Attention for Polyp Segmentation. In: de Bruijne, M., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2021. MICCAI 2021. Lecture Notes in Computer Science(), vol 12901. Springer, Cham. https://doi.org/10.1007/978-3-030-87193-2_60
Download citation
DOI: https://doi.org/10.1007/978-3-030-87193-2_60
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87192-5
Online ISBN: 978-3-030-87193-2
eBook Packages: Computer ScienceComputer Science (R0)