Improved Automatic Diabetic Retinopathy Severity Classification Using Deep Multimodal Fusion of UWF-CFP and OCTA Images

El Habib Daho, Mostafa; Li, Yihao; Zeghlache, Rachid; Atse, Yapo Cedric; Le Boité, Hugo; Bonnin, Sophie; Cosette, Deborah; Deman, Pierre; Borderie, Laurent; Lepicard, Capucine; Tadayoni, Ramin; Cochener, Béatrice; Conze, Pierre-Henri; Lamard, Mathieu; Quellec, Gwenolé

doi:10.1007/978-3-031-44013-7_2

Mostafa El Habib Daho^13,14,
Yihao Li^13,14,
Rachid Zeghlache^13,14,
Yapo Cedric Atse^13,14,
Hugo Le Boité^15,16,
Sophie Bonnin¹⁷,
Deborah Cosette¹⁸,
Pierre Deman^19,20,
Laurent Borderie²⁰,
Capucine Lepicard²¹,
Ramin Tadayoni^15,17,
Béatrice Cochener^13,14,22,
Pierre-Henri Conze^14,23,
Mathieu Lamard^13,14 &
…
Gwenolé Quellec¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14096))

Included in the following conference series:

International Workshop on Ophthalmic Medical Image Analysis

754 Accesses

Abstract

Diabetic Retinopathy (DR), a prevalent and severe complication of diabetes, affects millions of individuals globally, underscoring the need for accurate and timely diagnosis. Recent advancements in imaging technologies, such as Ultra-WideField Color Fundus Photography (UWF-CFP) imaging and Optical Coherence Tomography Angiography (OCTA), provide opportunities for the early detection of DR but also pose significant challenges given the disparate nature of the data they produce. This study introduces a novel multimodal approach that leverages these imaging modalities to notably enhance DR classification. Our approach integrates 2D UWF-CFP images and 3D high-resolution 6$\,\times \,$6 mm$^3$ OCTA (both structure and flow) images using a fusion of ResNet50 and 3D-ResNet50 models, with Squeeze-and-Excitation (SE) blocks to amplify relevant features. Additionally, to increase the model’s generalization capabilities, a multimodal extension of Manifold Mixup, applied to concatenated multimodal features, is implemented. Experimental results demonstrate a remarkable enhancement in DR classification performance with the proposed multimodal approach compared to methods relying on a single modality only. The methodology laid out in this work holds substantial promise for facilitating more accurate, early detection of DR, potentially improving clinical outcomes for patients.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://evired.org/.
2.
https://pytorch.org/.

References

Early treatment diabetic retinopathy study design and baseline patient characteristics: Etdrs report number 7. Ophthalmology 98(5, Supplement), 741–756 (1991). https://doi.org/10.1016/S0161-6420(13)38009-9
Akhavan Aghdam, M., Sharifi, A., Pedram, M.M.: Combination of RS-fMRI and SMRI data to discriminate autism spectrum disorders in young children using deep belief network. J. Dig. Imaging 31, 895–903 (2018)
Article Google Scholar
Al-Absi, H.R., Islam, M.T., Refaee, M.A., Chowdhury, M.E., Alam, T.: Cardiovascular disease diagnosis from DXA scan and retinal images using deep learning. Sensors 22(12), 4310 (2022)
Article Google Scholar
El-Sappagh, S., Abuhmed, T., Islam, S.R., Kwak, K.S.: Multimodal multitask deep learning model for Alzheimer’s disease progression detection based on time series data. Neurocomputing 412, 197–215 (2020)
Article Google Scholar
Hao, X., et al.: Mixgen: a new multi-modal data augmentation (2023)
Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 7132–7141 (2018). https://doi.org/10.1109/CVPR.2018.00745
Lahsaini, I., El Habib Daho, M., Chikh, M.A.: Deep transfer learning based classification model for COVID-19 using chest CT-scans. Pattern Recogn. Lett. 152, 122–128 (2021). https://doi.org/10.1016/j.patrec.2021.08.035
Article Google Scholar
Li, J., et al.: Ultra-widefield color fundus photography combined with high-speed ultra-widefield swept-source optical coherence tomography angiography for non-invasive detection of lesions in diabetic retinopathy. Front. Public Health 10 (2022). https://doi.org/10.3389/fpubh.2022.1047608
Li, T., et al.: Applications of deep learning in fundus images: a review (2021). https://arxiv.org/abs/2101.09864
Li, Y., et al.: Multimodal information fusion for glaucoma and diabetic retinopathy classification. In: Antony, B., Fu, H., Lee, C.S., MacGillivray, T., Xu, Y., Zheng, Y. (eds.) OMIA 2022. LNCS, vol. 13576, pp. 53–62. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-16525-2_6
Lin, R., Hu, H.: Adapt and explore: multimodal mixup for representation learning. Available at SSRN (2023). https://doi.org/10.2139/ssrn.4461697
Liu, Z., et al.: Learning multimodal data augmentation in feature space (2023)
Google Scholar
Qian, X., et al.: A combined ultrasonic b-mode and color doppler system for the classification of breast masses using neural network. Eur. Radiol. 30, 3023–3033 (2020)
Article Google Scholar
Quellec, G., Al Hajj, H., Lamard, M., Conze, P.H., Massin, P., Cochener, B.: Explain: explanatory artificial intelligence for diabetic retinopathy diagnosis. Med. Image Anal. 72, 102118 (2021). https://doi.org/10.1016/j.media.2021.102118
Article Google Scholar
Shamshad, F., et al.: Transformers in medical imaging: a survey. Med. Image Anal. 88, 102802 (2023). https://doi.org/10.1016/j.media.2023.102802
Article Google Scholar
Silva, P.S., et al.: Diabetic retinopathy severity and peripheral lesions are associated with nonperfusion on ultrawide field angiography. Ophthalmology 122(12), 2465–2472 (2015). https://doi.org/10.1016/j.ophtha.2015.07.034
Article Google Scholar
Sleeman, W.C., Kapoor, R., Ghosh, P.: Multimodal classification: current landscape, taxonomy and future directions. ACM Comput. Surv. 55(7) (2022). https://doi.org/10.1145/3543848
Sun, Z., Yang, D., Tang, Z., et al.: Optical coherence tomography angiography in diabetic retinopathy: an updated review. Eye 35(11), 149–161 (2021). https://doi.org/10.1038/s41433-020-01233-y
Article Google Scholar
Teo, Z.L., et al.: Global prevalence of diabetic retinopathy and projection of burden through 2045: systematic review and meta-analysis. Ophthalmology 128(11), 1580–1591 (2021)
Article Google Scholar
Verma, V., et al.: Manifold mixup: better representations by interpolating hidden states (2019)
Google Scholar
Wisely, C.E., et al.: Convolutional neural network to identify symptomatic Alzheimer’s disease using multimodal retinal imaging. Br. J. Ophthalmol. 106(3), 388–395 (2022). https://doi.org/10.1136/bjophthalmol-2020-317659
Article Google Scholar
Wu, J., et al.: Gamma challenge: glaucoma grading from multi-modality images. arXiv preprint arXiv:2202.06511 (2022)
Xiong, J., et al.: Multimodal machine learning using visual fields and peripapillary circular oct scans in detection of glaucomatous optic neuropathy. Ophthalmology 129(2), 171–180 (2022)
Article MathSciNet Google Scholar
Yang, J., Zhang, B., Wang, E., et al.: Ultra-wide field swept-source optical coherence tomography angiography in patients with diabetes without clinically detectable retinopathy. BMC Ophthalmol. 21(1), 192 (2021). https://doi.org/10.1186/s12886-021-01933-3
Article Google Scholar
Zang, P., et al.: A diabetic retinopathy classification framework based on deep-learning analysis of oct angiography. Transl. Vision Sci. Technol. 11(7), 10–10 (2022)
Article Google Scholar
Zhang, H., Cissé, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. CoRR abs/1710.09412 (2017). https://arxiv.org/abs/1710.09412
Zhao, X., Chen, Y., Liu, S., Zang, X., Xiang, Y., Tang, B.: TMMDA: a new token mixup multimodal data augmentation for multimodal sentiment analysis. In: Proceedings of the ACM Web Conference 2023. WWW 2023, pp. 1714–1722. Association for Computing Machinery (2023). https://doi.org/10.1145/3543507.3583406
Zong, W., Lee, J.K., Liu, C., Carver, E.N., Feldman, A.M., Janic, E.A.: A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network. Med. Phys. 47(9), 4077–4086 (2020)
Article Google Scholar

Download references

Acknowledgements

The work takes place in the framework of Evired, an ANR RHU project. This work benefits from State aid managed by the French National Research Agency under “Investissement d’Avenir” program bearing the reference ANR-18-RHUS-0008.

Author information

Authors and Affiliations

Univ Bretagne Occidentale, Brest, France
Mostafa El Habib Daho, Yihao Li, Rachid Zeghlache, Yapo Cedric Atse, Béatrice Cochener & Mathieu Lamard
LaTIM UMR 1101, Inserm, Brest, France
Mostafa El Habib Daho, Yihao Li, Rachid Zeghlache, Yapo Cedric Atse, Béatrice Cochener, Pierre-Henri Conze, Mathieu Lamard & Gwenolé Quellec
Ophthalmology Department, Lariboisiere Hospital, APHP, Paris, France
Hugo Le Boité & Ramin Tadayoni
Paris Cité University, Paris, France
Hugo Le Boité
Ophthalmology Department, Rothschild Foundation Hospital, Paris, France
Sophie Bonnin & Ramin Tadayoni
Carl Zeiss Meditec Inc, Dublin, CA, USA
Deborah Cosette
ADCIS, Saint-Contest, 14280, France
Pierre Deman
Evolucare Technologies, Le Pecq, 78230, France
Pierre Deman & Laurent Borderie
AP-HP, Paris, France
Capucine Lepicard
Ophthalmology Department, CHRU Brest, Brest, France
Béatrice Cochener
IMT Atlantique, Brest, France
Pierre-Henri Conze

Authors

Mostafa El Habib Daho
View author publications
You can also search for this author in PubMed Google Scholar
Yihao Li
View author publications
You can also search for this author in PubMed Google Scholar
Rachid Zeghlache
View author publications
You can also search for this author in PubMed Google Scholar
Yapo Cedric Atse
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Le Boité
View author publications
You can also search for this author in PubMed Google Scholar
Sophie Bonnin
View author publications
You can also search for this author in PubMed Google Scholar
Deborah Cosette
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Deman
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Borderie
View author publications
You can also search for this author in PubMed Google Scholar
Capucine Lepicard
View author publications
You can also search for this author in PubMed Google Scholar
Ramin Tadayoni
View author publications
You can also search for this author in PubMed Google Scholar
Béatrice Cochener
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Henri Conze
View author publications
You can also search for this author in PubMed Google Scholar
Mathieu Lamard
View author publications
You can also search for this author in PubMed Google Scholar
Gwenolé Quellec
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Mostafa El Habib Daho or Yihao Li .

Editor information

Editors and Affiliations

Alfred Health, Melbourne, VIC, Australia
Bhavna Antony
Hong Kong University of Science and Technology, Hong Kong, Hong Kong
Hao Chen
Pazhou Lab, Guangzhou, China
Huihui Fang
A*STAR, Institute of High Performance Computing, Singapore, Singapore
Huazhu Fu
University of Washington, Seattle, WA, USA
Cecilia S. Lee
University of Liverpool, Liverpool, UK
Yalin Zheng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

El Habib Daho, M. et al. (2023). Improved Automatic Diabetic Retinopathy Severity Classification Using Deep Multimodal Fusion of UWF-CFP and OCTA Images. In: Antony, B., Chen, H., Fang, H., Fu, H., Lee, C.S., Zheng, Y. (eds) Ophthalmic Medical Image Analysis. OMIA 2023. Lecture Notes in Computer Science, vol 14096. Springer, Cham. https://doi.org/10.1007/978-3-031-44013-7_2

Download citation

DOI: https://doi.org/10.1007/978-3-031-44013-7_2
Published: 16 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44012-0
Online ISBN: 978-3-031-44013-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Improved Automatic Diabetic Retinopathy Severity Classification Using Deep Multimodal Fusion of UWF-CFP and OCTA Images