Abstract
Right ventricle segmentation plays an important role in the computer-aided diagnosis of heart diseases. However, due to the small area of right ventricle and limited dataset, the performances of the existing deep learning segmentation methods are not good enough. For some small areas of right ventricle that are difficult to segment, we apply a novel dual attention module on the decoding path of Dilated R2 U-net to extract better feature representations in this work. The dual attention module in this work is divided into position attention module and channel attention module. The positional attention module suppresses the irrelevant feature representations in the feature map and enhances the useful feature representations to improve the sensitivity and prediction accuracy of the model. The channel attention module enhances the interdependence of the feature representation of channels by gathering the information of the associated channels in the feature map. We use dilated convolutions to expand the receptive field of the model. By adding dual attention modules, our model shows higher precision than Dilated U-net on the Right Ventricle Segmentation Challenge (RVSC) test dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jonathan, L., Evan, S., Trevor, D.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: ECCV (2018)
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: ECCV (2014)
Zhao, H., et al.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)
Alom, M.Z., et al.: Recurrent residual convolutional neural network based on U-net (R2 U-net) for medical image segmentation. arXiv preprint arXiv:1802.06955 (2018)
Jin, Q., et al.: DUNet: a deformable network for retinal vessel segmentation. Knowl.-Based Syst. 178, 149–162 (2019)
Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: UNet++: a nested U-Net architecture for medical image segmentation. In: Stoyanov, D., et al. (eds.) DLMIA/ML-CDS -2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5_1
Bahdanau, D., Kyunghyun, C., Yoshua, B.: Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)
Fu, J., et al.: Dual attention network for scene segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2019)
Huang, Z., et al.: Ccnet: Criss-cross attention for semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision (2019)
Huang, L., et al.: Interlaced sparse self-attention for semantic segmentation. arXiv preprint arXiv:1907.12273 (2019)
Hu, J., Li, S., Gang, S.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)
Zhang, H., et al.: Context encoding for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)
Tran, P.V.: A fully convolutional neural network for cardiac segmentation in short-axis MRI. arXiv preprint arXiv:1604.00494 (2016)
Oktay, O., et al.: Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans. Med. Imaging 37.2, 384–395 (2017)
Painchaud, N., et al.: Cardiac segmentation with strong anatomical guarantees. In: IEEE Trans. Med. Imaging 39.11, 3703–3713 (2020)
Oktay, O., et al.: Attention U-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)
Yu, F., Vladlen, K.: Multi-scale context aggregation by Dilated convolutions. arXiv preprint arXiv:1511.07122 (2015)
Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. Icdar, vol. 3, no. 2003 (2003)
Acknowledgment
The study was supported in part by the National Natural Science Foundation of China under Grants 61801393, 61801391 and 61801395, and in part by the Fundamental Research Funds for the Central Universities under Grant 3102020QD1001.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, L., Cui, H., Yuwen, C., Zhang, Y. (2021). Dual Attention Guided R2 U-Net Architecture for Right Ventricle Segmentation in MRI Images. In: Peng, Y., Hu, SM., Gabbouj, M., Zhou, K., Elad, M., Xu, K. (eds) Image and Graphics. ICIG 2021. Lecture Notes in Computer Science(), vol 12889. Springer, Cham. https://doi.org/10.1007/978-3-030-87358-5_41
Download citation
DOI: https://doi.org/10.1007/978-3-030-87358-5_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87357-8
Online ISBN: 978-3-030-87358-5
eBook Packages: Computer ScienceComputer Science (R0)