A Cascade Attention Network for Liver Lesion Classification in Weakly-Labeled Multi-phase CT Images

Chen, Xiao; Lin, Lanfen; Hu, Hongjie; Zhang, Qiaowei; Iwamoto, Yutaro; Han, Xianhua; Chen, Yen-Wei; Tong, Ruofeng; Wu, Jian

doi:10.1007/978-3-030-33391-1_15

Xiao Chen^22,24,
Lanfen Lin²²,
Hongjie Hu²³,
Qiaowei Zhang²³,
Yutaro Iwamoto²⁴,
Xianhua Han²⁴,
Yen-Wei Chen^22,24,25,
Ruofeng Tong²² &
…
Jian Wu²²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11795))

Included in the following conference series:

2934 Accesses
5 Citations

Abstract

Focal liver lesion classification is important to the diagnostics of liver disease. In clinics, lesion type is usually determined by multi-phase contrast-enhanced CT images. Previous methods of automatic liver lesion classification are conducted on lesion-level, which rely heavily on lesion-level annotations. In order to reduce the burden of annotation, in this paper, we explore automatic liver lesion classification with weakly-labeled CT images (i.e. with only image-level labels). The major challenge is how to localize the region of interests (ROIs) accurately by using only coarse image-level annotations and accordingly make the right lesion classification decision. We propose a cascade attention network to address the challenge by two stages: Firstly, a dual-attention dilated residual network (DADRN) is proposed to generate a class-specific lesion localization map, which incorporates spatial attention and channel attention blocks for capturing the high-level feature map’s long-range dependencies and helps to synthesize a more semantic-consistent feature map, and thereby boosting weakly-supervised lesion localization and classification performance; Secondly, a multi-channel dilated residual network (MCDRN) embedded with a convolutional long short-term memory (CLSTM) block is proposed to extract temporal enhancement information and make the final classification decision. The experiment results show that our method could achieve a mean classification accuracy of 89.68%, which significantly mitigates the performance gap between weakly-supervised approaches and fully supervised counterparts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Roy, S., et al.: Three-dimensional spatiotemporal features for fast content-based retrieval of focal liver lesions. IEEE Trans. Biomed. Eng. 61(11), 2768–2778 (2014)
Article Google Scholar
Xu, Y., et al.: Combined density, texture and shape features of multi-phase contrast-enhanced CT images for CBIR of focal liver lesions: a preliminary study. Innov. Med. Healthc. 2015, 215–224 (2015)
Google Scholar
Yang, W., et al.: Content-based retrieval of focal liver lesions using bag-of-visual-words representations of single-and multiphase contrast-enhanced CT images. J. Digit. Imaging 25(6), 708–719 (2012)
Article Google Scholar
Diamant, I., et al.: Improved patch-based automated liver lesion classification by separate analysis of the interior and boundary regions. IEEE J. Biomed. Health Inf. 20(6), 1585–1594 (2016)
Article Google Scholar
Xu, Y., et al.: Texture-specific bag of visual words model and spatial cone matching-based method for the retrieval of focal liver lesions using multiphase contrast-enhanced CT images. Int. J. Comput. Assist. Radiol. Surg. 13(1), 151–164 (2018)
Article Google Scholar
Yasaka, K., et al.: Deep learning with convolutional neural network for differentiation of liver masses at dynamic contrast-enhanced CT: a preliminary study. Radiology 286(3), 887–896 (2017)
Article Google Scholar
Frid-Adar, M., Diamant, I., Klang, E., Amitai, M., Goldberger, J., Greenspan, H.: Modeling the intra-class variability for liver lesion detection using a multi-class patch-based CNN. In: Wu, G., Munsell, B.C., Zhan, Y., Bai, W., Sanroma, G., Coupé, P. (eds.) Patch-MI 2017. LNCS, vol. 10530, pp. 129–137. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67434-6_15
Chapter Google Scholar
Liang, D., et al.: Combining convolutional and recurrent neural networks for classification of focal liver lesions in multi-phase CT images. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11071, pp. 666–675. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00934-2_74
Chapter Google Scholar
Li, X., et al.: H-DenseUNet: hybrid densely connected UNet for liver and tumor segmentation from CT volumes. IEEE Trans. Med. Imaging 37(12), 2663–2674 (2018)
Article Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-Excitation networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2017)
Google Scholar
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Chapter Google Scholar
Yu, F., Koltun, V., Funkhouser, T.A.: Dilated residual networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 472–480 (2017)
Google Scholar
Selvaraju, R., Cogswell, M., Das, A., et al.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: Proceedings of ICCV 2017, pp. 618–626 (2017)
Google Scholar
Dong, C., Chen, Y., Lin, L., et al.: Segmentation of liver and spleen based on computational anatomy models. Comput. Biol. Med. 67, 146–160 (2015)
Article Google Scholar
Chen, X., et al.: A dual-attention dilated residual network for liver lesion classification and localization on CT images. In: Proceedings of IEEE ICIP 2019 (2019, in press)
Google Scholar

Download references

Acknowledgements

This work was supported in part by Major Scientific Research Project of Zhejiang Lab under the Grant No. 2018DG0ZX01, in part by the Science and Technology Support Program of Hangzhou under the Grant No. 20172011A038, and in part by the Grant-in Aid for Scientific Research from the Japanese Ministry for Education, Science, Culture and Sports (MEXT) under the Grant No. 18H03267.

Author information

Authors and Affiliations

College of Computer Science and Technology, Zhejiang University, Hangzhou, China
Xiao Chen, Lanfen Lin, Yen-Wei Chen, Ruofeng Tong & Jian Wu
Department of Radiology, Sir Run Run Shaw Hospital, Hangzhou, China
Hongjie Hu & Qiaowei Zhang
College of Information Science and Engineering, Ritsumeikan University, Kyoto, Japan
Xiao Chen, Yutaro Iwamoto, Xianhua Han & Yen-Wei Chen
Zhejiang Lab, Hangzhou, China
Yen-Wei Chen

Authors

Xiao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lanfen Lin
View author publications
You can also search for this author in PubMed Google Scholar
Hongjie Hu
View author publications
You can also search for this author in PubMed Google Scholar
Qiaowei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yutaro Iwamoto
View author publications
You can also search for this author in PubMed Google Scholar
Xianhua Han
View author publications
You can also search for this author in PubMed Google Scholar
Yen-Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ruofeng Tong
View author publications
You can also search for this author in PubMed Google Scholar
Jian Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Lanfen Lin , Hongjie Hu or Yen-Wei Chen .

Editor information

Editors and Affiliations

Shanghai Jiaotong University, Shanghai, China
Qian Wang
NVIDIA GmbH, Munich, Germany
Fausto Milletari
University of Houston, Houston, TX, USA
Hien V. Nguyen
Technical University Munich, Munich, Germany
Shadi Albarqouni
King's College London, London, UK
M. Jorge Cardoso
NVIDIA GmbH, Munich, Germany
Nicola Rieke
NVIDIA, Santa Clara, CA, USA
Ziyue Xu
Imperial College London, London, UK
Konstantinos Kamnitsas
Johns Hopkins University, Baltimore, MD, USA
Vishal Patel
University of Houston, Houston, TX, USA
Badri Roysam
UT Southwestern Medical Center, Dallas, TX, USA
Steve Jiang
Chinese Academy of Sciences, Beijing, China
Kevin Zhou
University of Arkansas, Fayetteville, AR, USA
Khoa Luu
University of Arkansas, Fayetteville, AR, USA
Ngan Le

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, X. et al. (2019). A Cascade Attention Network for Liver Lesion Classification in Weakly-Labeled Multi-phase CT Images. In: Wang, Q., et al. Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data. DART MIL3ID 2019 2019. Lecture Notes in Computer Science(), vol 11795. Springer, Cham. https://doi.org/10.1007/978-3-030-33391-1_15

Download citation

DOI: https://doi.org/10.1007/978-3-030-33391-1_15
Published: 13 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33390-4
Online ISBN: 978-3-030-33391-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)