Abstract
Retinal optical coherence tomography (OCT) images provide crucial insights into the health of the posterior ocular segment. Therefore, the advancement of automated image analysis methods is imperative to equip clinicians and researchers with quantitative data, thereby facilitating informed decision-making. The application of deep learning (DL)-based approaches has gained extensive traction for executing these analysis tasks, demonstrating remarkable performance compared to labor-intensive manual analyses. However, the acquisition of retinal OCT images often presents challenges stemming from privacy concerns and the resource-intensive labeling procedures, which contradicts the prevailing notion that DL models necessitate substantial data volumes for achieving superior performance. Moreover, limitations in available computational resources constrain the progress of high-performance medical artificial intelligence, particularly in less developed regions and countries. This paper introduces a novel ensemble learning mechanism designed for recognizing retinal diseases under limited resources (e.g., data, computation). The mechanism leverages insights from multiple pre-trained models, facilitating the transfer and adaptation of their knowledge to retinal OCT images. This approach establishes a robust model even when confronted with limited labeled data, eliminating the need for an extensive array of parameters, as required in learning from scratch. Comprehensive experimentation on real-world datasets demonstrates that the ensemble models constructed by the proposed ensemble method show superior performance over the baseline models under sparse labeled data, especially the triple ensemble model, which achieves the accuracy of 92.06%, which is 8.27%, 7.99%, and 11.14% better than the three baseline models, respectively. In addition, compared with the three baseline models learned from scratch, the triple ensemble model has fewer trainable parameters, only 3.677M, which is lower than the three baseline models of 8.013M, 4.302M, and 20.158M, respectively.
Graphical abstract





Similar content being viewed by others
Notes
A large-scale image dataset comprising real-life images.
References
Lim LS, Mitchell P, Seddon JM, Holz FG, Wong TY (2012) Age-related macular degeneration. The Lancet 379(9827):1728–1738
Pead E, Megaw R, Cameron J, Fleming A, Dhillon B, Trucco E, MacGillivray T (2019) Automated detection of age-related macular degeneration in color fundus photography: a systematic review. Surv Ophthalmol 64(4):498–511
Bhagat N, Grigorian RA, Tutela A, Zarbin MA (2009) Diabetic macular edema: pathogenesis and treatment. Surv Ophthalmol 54(1):1–32
Moreira-Neto CA, Moult EM, Fujimoto JG, Waheed NK, Ferrara D et al (2018) Choriocapillaris loss in advanced age-related macular degeneration. J Ophthalmol 2018
Shao Y, Li X-R (2008) Effects of diabetic retinopathy on quality of life. [Zhonghua yan ke za Zhi] Chin J Ophthalmol 44(7):660–663
Drexler W, Fujimoto JG (2008) State-of-the-art retinal optical coherence tomography. Prog Retinal Eye Res 27(1):45–88
Drexler W, Sattmann H, Hermann B, Ko TH, Stur M, Unterhuber A, Scholda C, Findl O, Wirtitsch M, Fujimoto JG et al (2003) Enhanced visualization of macular pathology with the use of ultrahigh-resolution optical coherence tomography. Arch Ophthalmol 121(5):695–706
Hee MR, Puliafito CA, Duker JS, Reichel E, Coker JG, Wilkins JR, Schuman JS, Swanson EA, Fujimoto JG (1998) Topography of diabetic macular edema with optical coherence tomography. Ophthalmology 105(2):360–370
Bowd C, Zangwill LM, Berry CC, Blumenthal EZ, Vasile C, Sanchez-Galeana C, Bosworth CF, Sample PA, Weinreb RN (2001) Detecting early glaucoma by assessment of retinal nerve fiber layer thickness and visual function. Invest Ophthalmol Vis Sci 42(9):1993–2003
Ergun E, Hermann B, Wirtitsch M, Unterhuber A, Ko TH, Sattmann H, Scholda C, Fujimoto JG, Stur M, Drexler W (2005) Assessment of central visual function in Stargardt’s disease/fundus flavimaculatus with ultrahigh-resolution optical coherence tomography. Invest Ophthalmol Vis Sci 46(1):310–316
Chen S, Shu T, Zhao H, Tang YY (2023) Mask-CNN-transformer for real-time multi-label weather recognition. Knowl Based Syst 278:110881
Chen S, Shu T, Zhao H, Zhong G, Chen X (2023) TempEE: temporal-spatial parallel transformer for radar echo extrapolation beyond auto-regression. IEEE Trans Geosci Rem Sens
Chen S, Long G, Shen T, Zhou T, Jiang J (2023) Spatial-temporal prompt learning for federated weather forecasting. arXiv preprint arXiv:2305.14244
Usman M, Fraz MM, Barman SA (2017) Computer vision techniques applied for diagnostic analysis of retinal oct images: a review. Arch Comput Methods Eng 24:449–465
Sunnetci KM, Alkan A (2023) Biphasic majority voting-based comparative COVID-19 diagnosis using chest X-ray images. Expert Syst App 216:119430
Chen S, Ren S, Wang G, Huang M, Xue C (2023) Interpretable CNN-multilevel attention transformer for rapid recognition of pneumonia from chest X-ray images. IEEE J Biomed Health Informat
Peng H, Chen S, Niu N, Wang J, Yao Q, Wang L, Wang Y, Tang J, Wang G, Huang M et al (2023) Diffusion model in semi-supervised scleral segmentation. In: 2023 IEEE 6th International conference on pattern recognition and artificial intelligence (PRAI), pp 376–382. IEEE
Lee CS, Baughman DM, Lee AY (2017) Deep learning is effective for classifying normal versus age-related macular degeneration OCT images. Ophthalmol Retina 1(4):322–327
Huang L, He X, Fang L, Rabbani H, Chen X (2019) Automatic classification of retinal optical coherence tomography images with layer guided convolutional neural network. IEEE Signal Process Lett 26(7):1026–1030
Mohan R, Ganapathy K, Arunmozhi R (2022) Comparison of the proposed DCNN model with standard cnn architectures for retinal diseases classification. Journal of Population Therapeutics and Clinical Pharmacology= Journal de la Therapeutique des Populations et de la Pharmacologie Clinique 29(3):112–122
Sunija A, Kar S, Gayathri S, Gopi VP, Palanisamy P (2021) Octnet: a lightweight CNN for retinal disease classification from optical coherence tomography images. Comput Methods Prog Biomed 200:105877
Thomas A, Harikrishnan P, Krishna AK, Ponnusamy P, Gopi VP (2021) Automated detection of age-related macular degeneration from oct images using multipath CNN. J Comput Sci Eng 15(1):34–46
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
Lee T, Yoo S (2021) Augmenting few-shot learning with supervised contrastive learning. Ieee Access 9:61466–61474
Chen J, Zhuo L, Wei Z, Zhang H, Fu H, Jiang Y-G (2023) Knowledge driven weights estimation for large-scale few-shot image recognition. Pattern Recogn 142:109668
Zheng Z, Feng X, Yu H, Gao M (2022) Cooperative density-aware representation learning for few-shot visual recognition. Neurocomputing 471:208–218
Kim J, Chi M (2021) SAFFNet: self-attention-based feature fusion network for remote sensing few-shot scene classification. Remote Sens 13(13):2532
Zhu Y, Min W, Jiang S (2020) Attribute-guided feature learning for few-shot image recognition. IEEE Trans Multimed 23:1200–1209
Lim JY, Lim KM, Ooi SY, Lee CP (2021) Efficient-prototypicalnet with self knowledge distillation for few-shot learning. Neurocomputing 459:327–337
Wei M, Wu Q, Ji H, Wang J, Lyu T, Liu J, Zhao L (2023) A skin disease classification model based on densenet and convnext fusion. Electronics 12(2):438
Li W, Du L, Liao J, Yin D, Xu X (2021) Classification of COVID-19 images based on transfer learning and feature fusion. Imaging Sci J 69(1–4):133–142
Liu W, Ouyang H, Liu Q, Cai S, Wang C, Xie J, Hu W (2022) Image recognition for garbage classification based on transfer learning and model fusion. Math Prob Eng 2022
Ai Z, Huang X, Feng J, Wang H, Tao Y, Zeng F, Lu Y (2022) FN-OCT: disease detection algorithm for retinal optical coherence tomography based on a fusion network. Front Neuroinformat 16:876927
Latha V, Ashok L, Sreeni K (2021) Automated macular disease detection using retinal optical coherence tomography images by fusion of deep learning networks. In: 2021 National conference on communications (NCC), pp 1–6. IEEE
Kermany DS, Goldbaum M, Cai W, Valentim CC, Liang H, Baxter SL, McKeown A, Yang G, Wu X, Yan F et al (2018) Identifying medical diagnoses and treatable diseases by image-based deep learning. cell 172(5):1122–1131
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on computer vision and pattern recognition, pp 248–255. Ieee
Glorot X, Bordes A, Bengio Y (2011) Deep sparse rectifier neural networks. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 315–323. JMLR Workshop and Conference Proceedings
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Loshchilov I, Hutter F (2016) SGDR: stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983
Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45(4):427–437
Powers DM (2020) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint arXiv:2010.16061
Davis J, Goadrich M (2006) The relationship between precision-recall and ROC curves. In: Proceedings of the 23rd international conference on machine learning, pp 233–240
Oğuz FE, Alkan A, Schöler T (2023) Emotion detection from ECG signals with different learning algorithms and automated feature engineering. SIViP 1–9
El-Nouby A, Izacard G, Touvron H, Laptev I, Jegou H, Grave E (2021) Are large-scale datasets necessary for self-supervised pre-training? arXiv preprint arXiv:2112.10740
Chen H, Wang Y, Guo T, Xu C, Deng Y, Liu Z, Ma S, Xu C, Xu C, Gao W (2021) Pre-trained image processing transformer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12299–12310
Malla S, Alphonse P (2021) COVID-19 outbreak: an ensemble pre-trained deep learning model for detecting informative tweets. Appl Soft Comput 107:107495
Noor FA, Munzerin I, Iqbal AA, Islam T, Hossain E (2021) An ensemble learning based approach to autonomous COVID19 detection using transfer learning with the help of pre-trained deep neural network models. In: 2021 24th International conference on computer and information technology (ICCIT), pp 1–6. IEEE
Rahman Z, Hossain MS, Islam MR, Hasan MM, Hridhee RA (2021) An approach for multiclass skin lesion classification based on ensemble learning. Inform Med Unlocked 25:100659
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, J., Peng, H., Chen, S. et al. Ensemble learning for retinal disease recognition under limited resources. Med Biol Eng Comput 62, 2839–2852 (2024). https://doi.org/10.1007/s11517-024-03101-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-024-03101-3