Abstract
With the advancement in modern imaging techniques like CT scan, MRI, PET scan etc., a vast amount of data is generated every day in the field of healthcare. Big data contains hidden information, which necessitates the development of intelligent systems to analyze it and extract relevant information, allowing for accurate and cost-effective decisions in the medical field. By utilizing the untapped potential of the big data available in the medical field, very precise models can be developed for the medical diagnosis of retinal diseases. Optical coherence tomography (OCT) is a non-invasive imaging test that captures different, distinctive layers of the retina and optic nerve in a living eye to map and measure their thickness, that helps diagnose various retinal disorders. With the advancement of the application of deep learning-based techniques in the field of medical sciences, the use of convolutional neural network (CNN) based approaches for disease detection is gaining popularity. While the manual examination of 3D OCT images for the diagnosis of retinal disorders requires extensive time and expert intervention, the use of CNNs provides an effective automated option that provides results with higher accuracy while also reducing the time involved in the overall process. In this paper, we have implemented the aforementioned idea by proposing OCT-CNN, a CNN architecture, that automatically classifies retinal OCT images and identifies potential disorders in a living eye. Several techniques have been employed to enhance the performance of the proposed approach, including digital enhancement of the images, dropout regularization, adaptive learning rates, and early stopping of training to attain optimal performance. The performance of the proposed OCT-CNN is evaluated on the UCSD dataset against several popular deep CNN architectures and existing state-of-the-art approaches to automatic retinal OCT classification. The proposed OCT-CNN attains the best performance on all evaluated metrics, pushing the classification accuracies to 99.28% on CNV, 99.9% on DME, 99.38% on DRUSEN, and 100% on NORMAL images, indicating its superiority over existing state-of-the-art techniques.











Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Sun H, Liu Z, Wang G, Lian W, Ma J (2019) Intelligent analysis of medical big data based on deep learning. IEEE Access 7:142022–142037. https://doi.org/10.1109/ACCESS.2019.2942937
Davenport T, Kalakota R (2019) The potential for artificial intelligence in healthcare. Future Healthc J 6(2):94–98. https://doi.org/10.7861/futurehosp.6-2-94
Shafqat S, Fayyaz M, Khattak HA et al (2021) Leveraging deep learning for designing healthcare analytics heuristic for diagnostics. Neural Process Lett. https://doi.org/10.1007/s11063-021-10425-w
Najafabadi MM, Villanustre F, Khoshgoftaar TM et al (2015) Deep learning applications and challenges in big data analytics. J Big Data 2:1. https://doi.org/10.1186/s40537-014-0007-7
Marmor MF (2000) Mechanisms of fluid accumulation in retinal edema. In: 2nd International symposium on macular edema. Springer, Dordrecht, pp 35–45. https://doi.org/10.1007/978-94-011-4152-9_4
Rasti R, Rabbani H, Mehridehnavi A, Hajizadeh F (2018) Macular OCT classification using a multi-scale convolutional neural network ensemble. IEEE Trans Med Imaging 37(4):1024–1034. https://doi.org/10.1109/TMI.2017.2780115
Mathenge W (2014) Age-related macular degeneration. Community Eye Health 27(87):49–50
Romero-Aroca P (2013) Current status in diabetic macular edema treatments. World J Diabetes 4(5):165–169. https://doi.org/10.4239/wjd.v4.i5.165
Huang D, Swanson EA, Lin CP, Schuman JS, Stinson WG, Chang W, Hee MR, Flotte T, Gregory K, Puliafito CA (1991) Optical coherence tomography. Science 254(5035):1178–1181
Wu H, Gu X (2015) Towards dropout training for convolutional neural networks. Neural Netw 71:1–10
Liu Y-Y, Chen M, Ishikawa H, Wollstein G, Schuman JS, Rehg JM (2011) Automated macular pathology diagnosis in retinal OCT images using multi-scale spatial pyramid and local binary patterns in texture and shape encoding. Med Image Anal 15(5):748–759. https://doi.org/10.1016/j.media.2011.06.005
Global Burden of Disease Study (2013) Collaborators (2015) Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet 386:743–800
Litjens G, Kooi T, Bejnordi BE et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88
Kermany D, Zhang K, Goldbaum M (2018) Large dataset of labeled optical coherence tomography (oct) and chest X-ray images. Mendeley Data, v3. https://doi.org/10.17632/rscbjbr9sj
Liu YY, Chen M, Ishikawa H, Wollstein G, Schuman JS, Rehg JM (2011) Automated macular pathology diagnosis in retinal OCT images using multi-scale spatial pyramid and local binary patterns in texture and shape encoding. Med Image Anal 15(5):748–759
Farsiu S, Chiu SJ, O’Connell RV, Folgar FA, Yuan E, Izatt JA, Toth CA (2014) Age-related eye disease study 2 ancillary spectral domain optical coherence tomography study group. Quantitative classification of eyes with and without intermediate age-related macular degeneration using optical coherence tomography. Ophthalmology 121(1):162–172. https://doi.org/10.1016/j.ophtha.2013.07.013
Shi F, Cai N, Gu Y, Hu D, Ma Y, Chen Y, Chen X (2019) DeSpecNet: a CNN-based method for speckle reduction in retinal optical coherence tomography images. Phys Med Biol 64(17):175010
Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans Image Process 26(7):3142–3155
Srinivasan PP, Kim LA, Mettu PS, Cousins SW, Comer GM, Izatt JA, Farsiu S (2014) Fully automated detection of diabetic macular edema and dry age-related macular degeneration from optical coherence tomography images. Biomed Opt Express 5(10):3568–3577
Fang L, Jin Y, Huang L, Guo S, Zhao G, Chen X (2019) Iterative fusion convolutional neural networks for classification of optical coherence tomography images. J Vis Commun Image Represent 59:327–333
Fang L, Wang C, Li S, Rabbani H, Chen X, Liu Z (2019) Attention to lesion: lesion-aware convolutional neural network for retinal optical coherence tomography image classification. IEEE Trans Med Imaging 38(8):1959–1970
Huang L, He X, Fang L, Rabbani H, Chen X (2019) Automatic classification of retinal optical coherence tomography images with layer guided convolutional neural network. IEEE Signal Process Lett 26(7):1026–1030
Abirami MS, Vennila B, Suganthi K, Kawatra S, Vaishnava A (2021) Detection of choroidal neovascularization (CNV) in retina OCT images using VGG16 and DenseNet CNN. Wirel Pers Commun. https://doi.org/10.1007/s11277-021-09086-8
Nandy Pal M, Roy S, Banerjee M (2021) Content based retrieval of retinal OCT scans using twin CNN. Sādhanā 46:174. https://doi.org/10.1007/s12046-021-01701-5
Mishra SS, Mandal B, Puhan NB (2022) MacularNet: towards fully automated attention-based deep CNN for macular disease classification. SN Comput Sci 3:142. https://doi.org/10.1007/s42979-022-01024-0
Yoo TK, Choi JY, Kim HK (2021) Feasibility study to improve deep learning in OCT diagnosis of rare retinal diseases with few-shot classification. Med Biol Eng Comput 59:401–415. https://doi.org/10.1007/s11517-021-02321-1
Virdi G, Virdi T, Elkeeb A (2021) Optical coherence tomography analysis using deep learning: a synthetic data approach. Investig Ophthalmol Vis Sci 62(11):22
Liu X, Ali TK, Singh P, Shah A, McKinney SM, Ruamviboonsuk P, Turner AW, Keane PA, Chotcomwongse P, Nganthavee V, Chia M et al (2022) Deep learning to detect OCT-derived diabetic macular edema from color retinal photographs: a multicenter validation study. Ophthalmol Retina 6(5):398–410. https://doi.org/10.1016/j.oret.2021.12.021
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Scarpa G, Gargiulo M, Mazza A, Gaetano R (2018) A CNN-based fusion method for feature extraction from sentinel data. Remote Sens 10(2):236
Khalajzadeh H, Mansouri M, Teshnehlab M (2014) Face recognition using convolutional neural network and simple logistic classifier. In: Soft computing in industrial applications, pp 197–207
Arel I, Rose DC, Karnowski TP (2010) Deep machine learning-a new frontier in artificial intelligence research [research frontier]. IEEE Comput Intell Mag 5(4):13–18
Zuiderveld K (1994) Contrast limited adaptive histogram equalization. In: Graphics gems, pp 474–485
Zeiler MD, Fergus R (2014) Visualizing and understanding convolutional networks. In: European conference on computer vision, pp 818–833
Agarap AF (2018) Deep learning using rectified linear units (relu). arXiv preprint. arXiv:1803.08375
Rawat W, Wang Z (2017) Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput 29(9):2352–2449
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint. arXiv:1412.6980
Prechelt L (2012). Early stopping—but when? In: Montavon G, Orr GB, Müller KR (eds) Neural networks: tricks of the trade. Lecture notes in computer science, vol 7700. Springer, Berlin. https://doi.org/10.1007/978-3-642-35289-8_5
Yu XH, Chen GA, Cheng SX (1995) Dynamic learning rate optimization of the backpropagation algorithm. IEEE Trans Neural Netw 6(3):669–677
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint. arXiv:1704.04861
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint. arXiv:1409.1556
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826
Powers DM (2020) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint. arXiv:2010.16061
Bengio Y, Grandvalet Y (2004) No unbiased estimator of the variance of k-fold cross-validation. J Mach Learn Res 5:1089–1105
Funding
Not applicable.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Bansal, P., Harjai, N., Saif, M. et al. Utilization of big data classification models in digitally enhanced optical coherence tomography for medical diagnostics. Neural Comput & Applic 36, 225–239 (2024). https://doi.org/10.1007/s00521-022-07973-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07973-0