Skip to main content

Long-Tailed Multi-label Retinal Diseases Recognition via Relational Learning and Knowledge Distillation

  • Conference paper
  • First Online:
Medical Image Computing and Computer Assisted Intervention – MICCAI 2022 (MICCAI 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13432))

Abstract

More and more people are suffering from ocular diseases, which may cause blindness if not treated promptly. However, it is not easy to diagnose these diseases for the barely visible clinical symptoms. Even though some computer-aided approaches have been developed to help ophthalmologists make an accurate diagnosis, there still exist some challenges to be solved. For example, one patient may suffer from more than one retinal disease and these diseases often exhibit a long-tailed distribution, making it difficult to be automatically classified. In this work, we propose a novel framework that utilizes the correlations among these diseases in a knowledge distillation manner. Specifically, we apply the correlations from three main aspects (i.e., multi-task relation, feature relation, and pathological region relation) to recognize diseases more exactly. Firstly, we take diabetic retinopathy (DR) lesion segmentation and severity grading as the downstream tasks to train the network backbone for the findings that segmentation may improve the classification. Secondly, the long-tailed dataset is divided into several subsets to train multiple teacher networks according to semantic feature relation, which can help reduce the label co-occurrence and class imbalance. Thirdly, an improved attention mechanism is adopted to explore relations among pathological regions. Finally, a class-balanced distillation loss is introduced to distill the multiple teacher models into a student model. Extensive experiments are conducted to validate the superiority of our proposed method. The results have demonstrated that we achieve state-of-the-art performance on the publicly available datasets. Code will be available at: https://github.com/liyiersan/RLKD.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rahmani, B., et al.: The cause-specific prevalence of visual impairment in an urban population: the Baltimore Eye Survey. Ophthalmology 103(11), 1721–1726 (1996)

    Article  Google Scholar 

  2. Abramoff, M.D., Fort, P.E., Han, I.C., Jayasundera, K.T., Sohn, E.H., Gardner, T.W.: Approach for a clinically useful comprehensive classification of vascular and neural aspects of diabetic retinal disease. Invest. Ophthal. Visual Sci. 59(1), 519–527 (2018)

    Article  Google Scholar 

  3. Li, X., Jia, M., Islam, M.T., Yu, L., Xing, L.: Self-supervised feature learning via exploiting multi-modal data for retinal disease diagnosis. IEEE Trans. Med. Imaging 39(12), 4023–4033 (2020)

    Article  Google Scholar 

  4. Li, X., et al.: Rotation-oriented collaborative self-supervised learning for retinal disease diagnosis. IEEE Trans. Med. Imaging 40(9), 2284–2294 (2021)

    Article  Google Scholar 

  5. He, J., Li, C., Ye, J., Qiao, Y., Gu, L.: Self-speculation of clinical features based on knowledge distillation for accurate ocular disease classification. Biomed. Signal Process. Control 67, 102491 (2021)

    Article  Google Scholar 

  6. Iscen, A., Araujo, A., Gong, B., Schmid, C.: Class-balanced distillation for long-tailed visual recognition. arXiv preprint arXiv:2104.05279 (2021)

  7. Weinreb, R.N., Aung, T., Medeiros, F.A.: The pathophysiology and treatment of glaucoma: a review. JAMA 311(18), 1901–1911 (2014)

    Article  Google Scholar 

  8. Hartigan, J.A., Wong, M.A.: Algorithm AS 136: K-means clustering algorithm. J. Royal Stat. Soc. Ser. C (Appl. Stat.) 28(1), 100–108 (1979)

    Google Scholar 

  9. Ju, L., Wang, X., Yu, Z., Wang, L., Zhao, X., Ge, Z.: Long-tailed multi-label retinal diseases recognition using hierarchical information and hybrid knowledge distillation. arXiv preprint arXiv:2111.08913 (2021)

  10. Zhou, Y., Wang, B., Huang, L., Cui, S., Shao, L.: A benchmark for studying diabetic retinopathy: segmentation, grading, and transferability. IEEE Trans. Med. Imaging 40(3), 818–828 (2020)

    Article  Google Scholar 

  11. Zhou, Y., Huang, L., Zhou, T., Shao, L.: CCT-Net: category-invariant cross-domain transfer for medical single-to-multiple disease diagnosis. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8260–8270 (2021)

    Google Scholar 

  12. Sharma, S., Yu, N., Fritz, M., Schiele, B.: Long-tailed recognition using class-balanced experts. In: Akata, Z., Geiger, A., Sattler, T. (eds.) DAGM GCPR 2020. LNCS, vol. 12544, pp. 86–100. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-71278-5_7

    Chapter  Google Scholar 

  13. Xiang, L., Ding, G., Han, J.: Learning from multiple experts: self-paced knowledge distillation for long-tailed classification. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12350, pp. 247–263. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58558-7_15

    Chapter  Google Scholar 

  14. Foo, A., Hsu, W., Lee, M.L., Lim, G., Wong, T.Y.: Multi-task learning for diabetic retinopathy grading and lesion segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 13267–13272 (2020)

    Google Scholar 

  15. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  16. Ju, L., et al.: Relational subsets knowledge distillation for long-tailed retinal diseases recognition. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12908, pp. 3–12. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87237-3_1

    Chapter  Google Scholar 

  17. Lin, T. Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)

    Google Scholar 

  18. Wu, T., Huang, Q., Liu, Z., Wang, Yu., Lin, D.: Distribution-balanced loss for multi-label classification in long-tailed datasets. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 162–178. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8_10

    Chapter  Google Scholar 

  19. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28

    Chapter  Google Scholar 

  20. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1

    Chapter  Google Scholar 

  21. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K. Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)

    Google Scholar 

  22. Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 618–626 (2017)

    Google Scholar 

  23. International competition on ocular disease intelligent recognition. https://odir2019.grand-challenge.org

Download references

Acknowledgements

This work is supported by Science and Technology Cooperation Project of The Xinjiang Production and Construction Corps (No. 2019BC008, 2017DB004).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hua Zou .

Editor information

Editors and Affiliations

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 596 KB)

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhou, Q., Zou, H., Wang, Z. (2022). Long-Tailed Multi-label Retinal Diseases Recognition via Relational Learning and Knowledge Distillation. In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds) Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13432. Springer, Cham. https://doi.org/10.1007/978-3-031-16434-7_68

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-16434-7_68

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-16433-0

  • Online ISBN: 978-3-031-16434-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics