Abstract
Diabetic retinopathy (DR) is an eye disease caused by diabetes and can progress to certain degrees. Because DR’s the final stage can cause blindness, early detection is crucial to prevent visual disturbances. With the development of GPU technology, image classification and object detection can be done faster. Particularly on medical images, these processes play an important role in disease detection. In this work, we improved our previous work to detect diabetic retinopathy using Faster RCNN and attention layer. In the detection phase, firstly non-used area of DR image was extracted using compute unified device architecture with gradient-based edge detection method. Then Mask RCNN was used instead of faster region-based convolutional neural networks (Faster RCNN) to detect lesion areas more successful. With the proposed method, more successful results were obtained than the our previous work in DenseNet, MobileNet and ResNet networks. In addition, more successful results were obtained than other works in the literature in ACC and AUC metrics obtained by using VGG19.
Similar content being viewed by others
Data availability
Previously reported diabetic retinopathy datasets were used to support this study and are available at https://www.adcis.net/en/third-party/messidor/, https://www.kaggle.com/c/diabetic-retinopathy-detection/data, https://www.it.lut.fi/project/imageret/diaretdb0/, https://www.it.lut.fi/project/imageret/diaretdb1/, and https://ieee-dataport.org/open-access/indian-diabetic-retinopathy-image-dataset-idrid. These datasets are cited at relevant places within the text as references [41–44].
References
Doganay, S., Evereklioglu, C., Er, H., et al.: Comparison of serum NO, TNF-\(\alpha \), IL-1\(\beta \), sIL-2R, IL-6 and IL-8 levels with grades of retinopathy in patients with diabetes mellitus. Eye 16, 163–170 (2002). https://doi.org/10.1038/sj.eye.6700095
Jinfeng, G., Qummar, S., Junming, Z., Ruxian, Y., Khan, F.G.: Ensemble framework of deep CNNs for diabetic retinopathy detection. Comput. Intell. Neurosci. (2020). https://doi.org/10.1155/2020/8864698
Lee, D.K., Oh, S.J.: Variable block size motion estimation implementation on compute unified device architecture (CUDA). In: IEEE International Conference on Consumer Electronics (ICCE), vol. 2013, 633–634 (2013). https://doi.org/10.1109/ICCE.2013.6487048
Lecun, Y., Bengio, Y.: Convolutional networks for images, speech, and time-series. In: Arbib, M.A. (Ed.) The Handbook of Brain Theory and Neural Networks. MIT Press (1995)
Krizhevsky, A.: Convolutional Neural Networks for Object Classification in CUDA, EECE1742S: Programming Massively Parallel Multiprocessors Using CUDA (2009)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet Classification with Deep Convolutional Neural Networks. NIPS’12 Curran Associates Inc., USA (2012)
Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9, Boston, MA, USA (2015)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818–2826, Las Vegas, NV, USA (2016)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: The 3rd International Conference on Learning Representations (ICLR2015) (2015). arXiv:1409.1556
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, Las Vegas, NV, USA (2016)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA (2017)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587, Columbus, OH, USA (2014)
Girshick, R.B.: Fast R-CNN, Tech. Rep., CoRR (2015). arXiv:1504.08083
Wang, L., Wang, H., Huang, Y., Yan, B., Chang, Z., Liu, Z., Zhao, M., Cui, L., Song, J., Li, F.: Trends in the application of deep learning networks in medical image analysis: evolution between 2012 and 2020. Eur. J. Radiol. 146, 66 (2022)
Wei, H., Sehgal, A., Kehtarnavaz, N.: A deep learning-based smartphone app for real-time detection of retinal abnormalities in fundus images. In: Real-Time Image Processing and Deep Learning 2019, vol. 10996, p. 1099602. International Society for Optics and Photonics (2019)
Hacisoftaoglu, R.E., Karakaya, M., Sallam, A.B.: Deep learning frameworks for diabetic retinopathy detection with smartphone-based retinal imaging systems. Pattern Recognit. Lett. 135, 409–417 (2020)
Erciyas, A., Barışçı, N.: An effective method for detecting and classifying diabetic retinopathy lesions based on deep learning. Comput. Math. Methods Med. 34, 66 (2021)
Toğaçar, M.: Detection of retinopathy disease using morphological gradient and segmentation approaches in fundus images. Comput. Methods Programs Biomed. 214, 106579 (2022). https://doi.org/10.1016/j.cmpb.2021.106579
Bhuiyan, A., Govindaiah, A., Deobhakta, A., Hossain, M., Rosen, R., Smith, T.: Automated diabetic retinopathy screening for primary care settings using deep learning. Intell. Based Med. 5, 100045 (2021)
Zhang, C., Lei, T., Chen, P.: Diabetic retinopathy grading by a source-free transfer learning approach. Biomed. Signal Process. Control 73, 103423 (2022)
Gulshan, V., Peng, L., Coram, M., Stumpe, M.C., Wu, D., Narayanaswamy, A., et al.: Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316(22), 2402–2410 (2016)
Costa, P., Campilho, A.: Convolutional bag of words for diabetic retinopathy detection from eye fundus images. IPSJ Trans. Comput. Vision Appl. 9(1), 10 (2017)
Gargeya, R., Leng, T.: Automated identification of diabetic retinopathy using deep learning. Ophthalmology 124(7), 962–969 (2017)
Wang, Z., Yin, Y., Shi, J., Fang, W., Li, H., Wang, X.: Zoom-in-net: deep mining lesions for diabetic retinopathy detection. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 267–275. Springer (2017)
Chen, Y.-W., Wu, T.-Y., Wong, W.-H., Lee, C.-Y.: Diabetic retinopathy detection based on deep convolutional neural networks. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1030–1034. IEEE (2018)
Mansour, R.F.: Deep-learning-based automatic computer-aided diagnosis system for diabetic retinopathy. Biomed. Eng. Lett. 8(1), 41–57 (2018)
Quellec, G., Charrière, K., Boudi, Y., Cochener, B., Lamard, M.: Deep image mining for diabetic retinopathy screening. Med Image Anal. 39, 178–193 (2017). https://doi.org/10.1016/j.media.2017.04.012
Gutiérrez, J., Che, Z., Zhai, G., Callet, P.L.: Saliency4ASD: challenge, dataset and tools for visual attention modeling for autism spectrum disorder. Signal Process. Image Commun. 92, 116092 (2021)
Cai, Y., Droste, R., Sharma, H., Chatelain, P., Drukker, L., Papageorghiou, A.T., Noble, J.A.: Spatio-temporal visual attention modelling of standard biometry plane-finding navigation Med. Image Anal. 65, 101762 (2020)
Obeso, A.M., Benois-Pineau, J., Vázquez, M.S.G., Acosta, A.Á.R.: Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognit. 123, 108411 (2022)
Zhao, B., Wu, X., Feng, J., Peng, Q., Yan, S.: Diversified visual attention networks for fine-grained object classification. IEEE Trans. Multimed. 19(6), 1245–1256 (2017). https://doi.org/10.1109/TMM.2017.2648498
Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhutdinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. IEEE Trans. Multimed. 19(6), 66 (2017)
Liu, S., Wang, Q., Li, X.: Attention based network for remote sensing scene classification. In: IGARSS 2018—2018 IEEE International Geoscience and Remote Sensing Symposium, pp. 4740–4743 (2018). https://doi.org/10.1109/IGARSS.2018.8519232.
Cao, C., et al.: Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. In: IEEE International Conference on Computer Vision (ICCV), vol. 2015, pp. 2956–2964 (2015). https://doi.org/10.1109/ICCV.2015.338
Redmon, J., Divvala, S., Girshick,R., Farhadi, A.: ’You only look once: unified real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788 (2016)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y. et al.: ’Ssd: single shot multibox detector. In: European Conference on Computer Vision, pp. 21–37 (2016)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (2015), pp. 91–99
He, K., Gkioxari, G., Dollar, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2017), pp. 2961–2969
Kamble, V.V., Kokate, R.D.: Automated diabetic retinopathy detection using radial basis function. Procedia Comput. Sci. 167, 799–808 (2020)
Porwal, P., Pachade, S., Kamble , R., et al.: Indian diyabetic retinopathy image dataset (IDRid). In: IEEE Dataport (2018)
Kaggle Diabetic Retinopathy Detection Competition: https://www.kaggle.com/c/diabetic-retinopathy-detection
Decencière, E., Zhang, X., Cazuguel, G., et al.: Feedback on a publicly distributed image database: the Messidor database. Image Anal. Stereol. 33(3), 231–234 (2014)
Díaz-Pernil, D., Berciano, A., Peña-Cantillana, F., Gutiérrez-Naranjob, M.A.: Segmenting images with gradient-based edge detection using membrane computing. Pattern Recognit. Lett. 34(8), 846–855 (2013)
Erciyas, A., Erdal, E., Ergüzen, A.: Medical image processing with CUDA. In: International Advanced Researches & Engineering Congress (IAREC’17) (2017). ISBN: 978-605-245-037-6
Acknowledgements
We thank the editors, reviewers, and all cited authors.
Funding
No funds, grants, or other support was received.
Author information
Authors and Affiliations
Contributions
N.B. and A.E. designed the project, main conceptual ideas and project outline. A.E. solved all the technical details and did the numerical calculations for the proposed method and got the results. N.B., H.M.Ü., and H.P. took the lead in writing the article. A.E and N.B. wrote the article template. H.M.Ü and H.P. helped moderate the article. All authors provided critical feedback and assisted in research, analysis, and finalization of the manuscript. All authors discussed the results and contributed to the final article.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Erciyas, A., Barışçı, N., Ünver, H.M. et al. Improving detection and classification of diabetic retinopathy using CUDA and Mask RCNN. SIViP 17, 1265–1273 (2023). https://doi.org/10.1007/s11760-022-02334-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02334-9