
1 Introduction and Background

The automatic segmentation of blood vessels from retinal fundus images has gained interest in the image processing community, due to its applicability to several problems in different fields. Specifically, the automatic analysis of retinal blood vessels is a basic step both for the diagnosis of several retinal pathologies [2] (e.g. diabetic retinopathy, arteriosclerosis, hypertension, and various cardiovascular diseases), and for person verification in biometric systems, since the retinal vascular structure is different for each individual [15]. Retinal image segmentation remains a challenging task, due to the complex nature of vascular structures, illumination variations and the anatomical variability between subjects, and several methods have been proposed in the literature. Existing methods can generally be divided into two categories: supervised and unsupervised. In general, the performance of supervised methods is superior to that of unsupervised ones, but at the cost of lower speed and higher computational complexity. Supervised methods are based on machine learning techniques and require a manually annotated set of training images in order to classify each pixel as either vessel or non-vessel; they involve a k-NN classifier, a Support Vector Machine, a Bayesian classifier combined with features obtained through the multi-scale analysis of Gabor wavelets, AdaBoost, or a CNN, etc. [6, 8, 11, 17, 19,20,21]. Unsupervised segmentation methods work without any prior knowledge and are based on matched filtering, centerline tracking, mathematical morphology, or other rule-based techniques [3,4,5, 7, 12, 22,23,24].

In this paper a supervised method based on a CNN and on the use of directional filters, as proposed in [7], is presented. The method has been tested on the DRIVE [19] dataset and its performance has been compared to that of other methods in the literature. The experimental results confirm the effectiveness of the proposed approach.

The rest of the paper is organized as follows: in Sect. 2, the proposed CNN architecture is presented; Sect. 3 describes the experiments, together with the training strategy, and shows the obtained results; and finally in Sect. 4 some conclusions are drawn.

2 The Method

Our solution is based on a CNN used as a pixel classifier and on the introduction of directional filters. As done by the majority of authors in the field, only the green channel of the RGB retinal image is considered, since the vessels show the highest contrast in this channel. In this work, vessel segmentation is addressed as a pixel-level binary classification task. The network computes the probability of a pixel being a vessel, using as input a patch of the image, i.e. a square window centered on the pixel itself. The input image is then segmented by classifying all of its pixels. The CNN is trained on a large number of patches, in which the central pixel is annotated using the corresponding ground truth included in the dataset. Details about the training dataset are given in the next section.
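The patch-based scheme can be sketched as follows. This is an illustrative reconstruction in Python; the function names and the boundary handling are ours, not taken from the original implementation:

```python
import numpy as np

def extract_patch(green, r, c, size=27):
    """Extract the square window centered on pixel (r, c).

    `green` is the green channel of the retinal image; `size` is the
    patch side (27 in the experiments of Sect. 3). Returns None when
    the window does not fit entirely inside the image.
    """
    half = size // 2
    if (r - half < 0 or c - half < 0
            or r + half >= green.shape[0] or c + half >= green.shape[1]):
        return None
    return green[r - half:r + half + 1, c - half:c + half + 1]

def label_patch(ground_truth, r, c):
    """Label the patch after its central pixel: 1 = vessel, 0 = non-vessel."""
    return 1 if ground_truth[r, c] > 0 else 0
```

At test time, the same window is extracted around every pixel and classified, producing the segmented image.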

A CNN is organized in stacked trainable stages, called layers, each composed of processing units operating on the output of the previous layer. A CNN consists of a number of convolutional and sub-sampling layers, optionally followed by fully connected layers. A convolutional layer has k filters (or kernels) that produce k feature maps. Each map is then typically sub-sampled by means of pooling layers, a process that progressively reduces the spatial size of the representation, the number of parameters and the computation in the network. The pooling can be of different types (max, average, sum, etc.), but max pooling, in which the largest element of the current feature map within a window is kept, is the most commonly applied [16]. The output of the convolutional and pooling layers consists of high-level features of the input image. The purpose of the fully connected layers is to use these features to classify the input image into various classes based on the training dataset.
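As an illustration, max pooling over a \(3\times 3\) window with stride 2 (the configuration adopted by the pooling layers of our network) can be written as a naive sketch:

```python
import numpy as np

def max_pool(feature_map, window=3, stride=2):
    """Naive max pooling: slide a window x window box with the given
    stride and keep the largest value at each position."""
    h, w = feature_map.shape
    out_h = (h - window) // stride + 1
    out_w = (w - window) // stride + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = feature_map[i * stride:i * stride + window,
                                    j * stride:j * stride + window].max()
    return out
```

Note how a \(5\times 5\) map is reduced to \(2\times 2\): this is the progressive spatial reduction mentioned above.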

The layers of the proposed CNN architecture are:

  • A first non-learnable convolutional layer: to improve the performance of our method, we have introduced a new layer of directional filters that guides the training of the network toward the linear behavior which characterizes blood vessels. In fact, vessels are thin and elongated structures whose pixels are aligned along different directions. Thus, by fixing a number of different orientations and by taking into account the gray-levels in a suitable window centered on a pixel p, directional information can be computed for p. This directional information is then combined by the higher layers of the network to obtain more complex features. Differently from the filters of the other convolutional layers, the directional filters are not learned during the training process. They consist of twelve windows, each of size \(7\times 7\), like the ones presented in [7] (see Fig. 1), with each window representing a direction such that an angle of \(15^\circ \) separates two successive directions.

  • Five convolutional layers: all the filters of these layers have a size of \(3\times 3\) and a stride of 1. The first layer of this block learns 32 filters, the second and third learn 64 filters, and the fourth and fifth learn 128 filters. All the filters of these layers are initialized with Xavier initialization [9] and are followed by a rectified linear unit (ReLU) non-linearity [13], i.e. the output volume is max(0, e), where e is the outcome of the convolution.

  • Five max-pooling layers: max-pooling is performed after each convolutional layer. It is computed on a window of size \(3\times 3\) and the stride is set to 2.

  • Three fully connected layers: the first two fully connected layers learn 256 filters each, used to learn non-linear combinations of the features provided by the previous layers. Moreover, to mitigate overfitting, these two layers implement dropout regularization [18] with a ratio of 0.5. Finally, the last layer has 2 filters, since in our case the classification problem is binary. All the filters of these layers are initialized with values sampled from the N(0, 0.01) Gaussian distribution.
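A possible reconstruction of the directional filter bank of the first layer is sketched below: each \(7\times 7\) mask marks the pixels lying closest to a line through the window center, at one of twelve orientations spaced \(15^\circ \) apart. The exact coefficients of the filters in [7] may differ; this code only illustrates the idea.

```python
import numpy as np

def directional_filters(size=7, n_dirs=12):
    """Build a bank of line-shaped masks, one per orientation.

    Illustrative reconstruction: the mask for direction k approximates a
    line through the window center at angle k * 15 degrees, obtained by
    rasterizing points sampled densely along the line.
    """
    half = size // 2
    bank = []
    for k in range(n_dirs):
        theta = np.deg2rad(k * 180.0 / n_dirs)  # 15-degree steps
        mask = np.zeros((size, size))
        for t in np.linspace(-half, half, 8 * size):
            r = int(round(-t * np.sin(theta)))
            c = int(round(t * np.cos(theta)))
            mask[half + r, half + c] = 1.0
        bank.append(mask)
    return np.stack(bank)
```

Convolving the input patch with such a bank produces one response per orientation, which the learnable layers above can then combine.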

Table 1 summarizes the parameters of the CNN layers, where n-C stands for the Convolutional layer with non-learnable filters, C + P stands for the Convolutional layer followed by Max-pooling layer and FC stands for the Fully Connected layer.

Fig. 1. The 12 directional filters implemented in the first layer of the proposed CNN.

Table 1. Summary of the proposed CNN architecture

3 Experiments

3.1 Training Strategy and Parameters Setting

The training phase consists of an iterative presentation of the patches together with their associated labels. Patches are randomly extracted from the set of training images of the DRIVE dataset. In particular, DRIVE contains 20 training images and 20 testing images, and each image is associated with both a mask delimiting the Field of View (FOV) of the retinal image and two ground truths generated by two ophthalmologists. However, only patches completely contained in the FOV of the retinal images, and only the ground truths of the first expert, are taken into account. Each patch of an image I has a size of \(27\times 27\) and is labeled as vessel or non-vessel depending on whether its central pixel belongs to the foreground or background of the ground truth associated with I. The experiments have been performed considering two types of training sets: a non-balanced set, in which most of the patches are labeled as non-vessel, and a balanced set, including a balanced percentage of patches of the two classes. Precisely, the non-balanced training set is composed of 480,000 random patches, while the balanced set includes about 700,000 patches.
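A balanced set of patch centers of the kind described above can be drawn as in the following sketch. This is illustrative code: `sample_patch_centres` is our name, and the FOV-containment test is simplified to a check on the mask at the central pixel.

```python
import numpy as np

def sample_patch_centres(ground_truth, fov, n_per_class, half=13, rng=None):
    """Draw a class-balanced set of patch centers.

    Picks n_per_class vessel and n_per_class non-vessel pixels whose
    27x27 window (half = 13) lies entirely inside the image; `fov` is a
    boolean mask of the Field of View. Returns (row, col, label) tuples.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    h, w = ground_truth.shape
    inside = np.zeros_like(fov, dtype=bool)
    inside[half:h - half, half:w - half] = True
    valid = fov & inside
    centres = []
    for is_vessel in (True, False):
        rows, cols = np.nonzero(valid & (ground_truth.astype(bool) == is_vessel))
        idx = rng.choice(len(rows), size=n_per_class, replace=False)
        centres += [(r, c, int(is_vessel)) for r, c in zip(rows[idx], cols[idx])]
    return centres
```

The non-balanced set, by contrast, would simply sample centers uniformly over the valid region, so most patches end up labeled non-vessel.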

For the testing phase, patches are extracted from the DRIVE test images. In particular, for each pixel p of a test image I, the patch centered on p is obtained and it is involved in the testing phase only if it is completely contained in the FOV of I.

The number of training epochs is set to 15, while the batch size is equal to 256. The learning rate is initially set to \(10^{-2}\) and is decreased every six epochs by a factor of 10. To train our network, we used the NVIDIA Deep Learning GPU Training System [1] within the Caffe framework [10].
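The resulting step schedule can be written compactly as follows (a sketch; counting epochs from 0 is our assumption):

```python
def learning_rate(epoch, base_lr=1e-2, step=6, gamma=0.1):
    """Step schedule: start at 1e-2 and divide by 10 every six epochs."""
    return base_lr * gamma ** (epoch // step)
```

Over the 15 training epochs this yields three plateaus: \(10^{-2}\), \(10^{-3}\) and \(10^{-4}\).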

3.2 Results

A qualitative evaluation of the method is possible with reference to Fig. 2, where each row shows, from left to right, the input image, the ground truth of the first expert, and the result of our segmentation method.

Fig. 2. Results of our method: (a) original image, (b) ground truth, (c) results of the proposed method.

We have quantitatively evaluated our method for both training sets, balanced and non-balanced, in both cases with and without the max-pooling layers, and also with and without the directional filters. We have computed Accuracy, Sensitivity and Specificity, as done by the majority of researchers [14].
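These three measures derive from the pixel-level confusion matrix restricted to the FOV; a minimal sketch:

```python
import numpy as np

def segmentation_metrics(pred, truth):
    """Accuracy, sensitivity and specificity from binary vessel maps.

    `pred` and `truth` are boolean arrays where True marks vessel
    pixels; only pixels inside the FOV should be passed in.
    """
    tp = np.sum(pred & truth)    # vessel pixels correctly detected
    tn = np.sum(~pred & ~truth)  # background correctly rejected
    fp = np.sum(pred & ~truth)
    fn = np.sum(~pred & truth)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return accuracy, sensitivity, specificity
```

Sensitivity measures the fraction of vessel pixels recovered, while specificity measures the fraction of background pixels correctly left out; accuracy mixes both and is dominated by the background class, which is why all three are reported.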

Table 2. Results of the proposed method applying different training strategies

Quantitative results obtained by applying different training strategies are given in Table 2. For the non-balanced training set, the best result is obtained without the max-pooling layers with respect to all the considered measures. For the balanced training set, we obtained a better performance in terms of accuracy and specificity without max-pooling layers, while a lower performance was obtained only for sensitivity. Overall, the non-balanced training set combined with the network without max-pooling layers provided the best performance in terms of accuracy and specificity, and also gave a high sensitivity value.

To demonstrate how the introduction of directional filters in the network improves the performance of our method, we computed the considered measures both with and without directional filters for the NoBal/NoPool strategy (see Table 3). We observed that the use of directional filters produces a higher sensitivity value, while equivalent values are obtained for the remaining measures.

Table 3. Results of the proposed method with or without the directional filters (DF)
Table 4. Performance Comparisons

Finally, we also compared the performance of our method with that of other unsupervised and supervised methods in the literature. The average values of accuracy, sensitivity and specificity are reported in Table 4, where the highest values are in bold. Our method achieves a better sensitivity than all the other methods, a better accuracy than the supervised methods, and a lower performance only in terms of specificity with respect to some methods.

4 Conclusion

In this work, we have presented a supervised vessel segmentation method based on a Convolutional Neural Network. The adopted CNN architecture includes a specific layer to compute directional features; the introduction of this layer improves the performance of the network in terms of sensitivity. The method provides results that are satisfactory both qualitatively and quantitatively. The performance of the method has been evaluated on the DRIVE dataset in terms of accuracy, sensitivity and specificity. Comparisons have also been made with other unsupervised and supervised methods in the literature, showing that the proposed method achieves the highest performance in terms of sensitivity.