Dual Attention Guided R2 U-Net Architecture for Right Ventricle Segmentation in MRI Images

Jiang, Lei; Cui, Hengfei; Yuwen, Chang; Zhang, Yanning

doi:10.1007/978-3-030-87358-5_41

Lei Jiang^14,15,
Hengfei Cui^14,15,
Chang Yuwen^14,15 &
…
Yanning Zhang¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12889))

Included in the following conference series:

International Conference on Image and Graphics

1741 Accesses
1 Citations

Abstract

Right ventricle segmentation plays an important role in the computer-aided diagnosis of heart diseases. However, due to the small area of right ventricle and limited dataset, the performances of the existing deep learning segmentation methods are not good enough. For some small areas of right ventricle that are difficult to segment, we apply a novel dual attention module on the decoding path of Dilated R2 U-net to extract better feature representations in this work. The dual attention module in this work is divided into position attention module and channel attention module. The positional attention module suppresses the irrelevant feature representations in the feature map and enhances the useful feature representations to improve the sensitivity and prediction accuracy of the model. The channel attention module enhances the interdependence of the feature representation of channels by gathering the information of the associated channels in the feature map. We use dilated convolutions to expand the receptive field of the model. By adding dual attention modules, our model shows higher precision than Dilated U-net on the Right Ventricle Segmentation Challenge (RVSC) test dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Jonathan, L., Evan, S., Trevor, D.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: ECCV (2018)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: ECCV (2014)
Google Scholar
Zhao, H., et al.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)
Google Scholar
Alom, M.Z., et al.: Recurrent residual convolutional neural network based on U-net (R2 U-net) for medical image segmentation. arXiv preprint arXiv:1802.06955 (2018)
Jin, Q., et al.: DUNet: a deformable network for retinal vessel segmentation. Knowl.-Based Syst. 178, 149–162 (2019)
Google Scholar
Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: UNet++: a nested U-Net architecture for medical image segmentation. In: Stoyanov, D., et al. (eds.) DLMIA/ML-CDS -2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5_1
Chapter Google Scholar
Bahdanau, D., Kyunghyun, C., Yoshua, B.: Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)
Fu, J., et al.: Dual attention network for scene segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2019)
Google Scholar
Huang, Z., et al.: Ccnet: Criss-cross attention for semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision (2019)
Google Scholar
Huang, L., et al.: Interlaced sparse self-attention for semantic segmentation. arXiv preprint arXiv:1907.12273 (2019)
Hu, J., Li, S., Gang, S.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Zhang, H., et al.: Context encoding for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Tran, P.V.: A fully convolutional neural network for cardiac segmentation in short-axis MRI. arXiv preprint arXiv:1604.00494 (2016)
Oktay, O., et al.: Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans. Med. Imaging 37.2, 384–395 (2017)
Google Scholar
Painchaud, N., et al.: Cardiac segmentation with strong anatomical guarantees. In: IEEE Trans. Med. Imaging 39.11, 3703–3713 (2020)
Google Scholar
Oktay, O., et al.: Attention U-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)
Yu, F., Vladlen, K.: Multi-scale context aggregation by Dilated convolutions. arXiv preprint arXiv:1511.07122 (2015)
Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. Icdar, vol. 3, no. 2003 (2003)
Google Scholar

Download references

Acknowledgment

The study was supported in part by the National Natural Science Foundation of China under Grants 61801393, 61801391 and 61801395, and in part by the Fundamental Research Funds for the Central Universities under Grant 3102020QD1001.

Author information

Authors and Affiliations

National Engineering Laboratory for Integrated Aero-Space-Ground -Ocean Big Data Application Technology, School of Computer Science, Northwestern Polytechnical University, Xi’an, 710072, China
Lei Jiang, Hengfei Cui, Chang Yuwen & Yanning Zhang
Centre for Multidisciplinary Convergence Computing (CMCC), School of Computer Science, Northwestern Polytechnical University, Xi’an, 710072, China
Lei Jiang, Hengfei Cui & Chang Yuwen

Authors

Lei Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Hengfei Cui
View author publications
You can also search for this author in PubMed Google Scholar
Chang Yuwen
View author publications
You can also search for this author in PubMed Google Scholar
Yanning Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hengfei Cui .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Tsinghua University, Beijing, China
Shi-Min Hu
Tampere University, Tampere, Finland
Moncef Gabbouj
Zhejiang University, Hangzhou, China
Kun Zhou
Technion – Israel Institute of Technology, Haifa, Israel
Michael Elad
Tsinghua University, Beijing, China
Kun Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, L., Cui, H., Yuwen, C., Zhang, Y. (2021). Dual Attention Guided R2 U-Net Architecture for Right Ventricle Segmentation in MRI Images. In: Peng, Y., Hu, SM., Gabbouj, M., Zhou, K., Elad, M., Xu, K. (eds) Image and Graphics. ICIG 2021. Lecture Notes in Computer Science(), vol 12889. Springer, Cham. https://doi.org/10.1007/978-3-030-87358-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-030-87358-5_41
Published: 30 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87357-8
Online ISBN: 978-3-030-87358-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics