Abstract
Swelling of the optic nerve head (ONH) is most accurately quantitatively assessed via volumetric measures using 3D spectral-domain optical coherence tomography (SD-OCT). However, SD-OCT is not always available as its use is primarily limited to specialized eye clinics rather than in primary care or telemedical settings. Thus, there is still a need for severity assessment using more widely available 2D fundus photographs. In this work, we propose a machine-learning approach to locally estimate the degree of the optic disc swelling at each pixel location from only a 2D fundus photograph as the input. For training purposes, a thickness map of the swelling (reflecting the distance between the top and bottom surfaces of the ONH and surrounding retina) as measured from SD-OCT at each pixel location was used as the ground truth. A random-forest classifier was trained to output each thickness value from local fundus features pertaining to textural and color information. Eighty-eight image pairs of ONH-centered SD-OCT and registered fundus photographs from different subjects with optic disc swelling were used for training and evaluating the model in a leave-one-subject-out fashion. Comparing the thickness map from the proposed method to the ground truth via SD-OCT, a root-mean-square (RMS) error of 1.66 mm\(^3\) for the entire ONH region was achieved, and Spearman’s correlation coefficient was \(R=0.73\). Regional volumes for the nasal, temporal, inferior, superior, and peripapillary regions had RMS errors of 0.64 mm\(^3\), 0.61 mm\(^3\), 0.74 mm\(^3\), 0.71 mm\(^3\), and 1.30 mm\(^3\), respectively, suggesting that there is enough evidence in a singular color fundus photograph to estimate local swelling information.
Similar content being viewed by others
1 Introduction
For many years, color fundus photographs have been a common imaging modality for ophthalmologists to examine the back of the eye in cases of optic disc swelling Fig. 1(a) [5]. Traditionally, assessment of swelling via fundus photographs has been a very challenging task due to the specialized expertise required and the inability to compute volumetric measures of the swelling [8]. With the introduction of spectral-domain optical coherence tomography (SD-OCT) Fig. 1(b) and development of associated retinal-layer-segmentation algorithms [3, 10] Fig. 1(c), SD-OCT-based volumetric measures have recently shown great promise in overcoming many of the limitations of a traditional fundus-based assessment.
However, SD-OCT is not always available as its use is primarily limited to specialized eye clinics rather than in primary care or telemedical settings. Thus, the need for better approaches for the assessment of optic disc swelling via fundus photographs alone still exists. In this area, Echegaray et al. [2] proposed a decision-tree system that automatically grades/stages the optic disc swelling using image features directly extracted from monocular fundus photographs. However, this approach only provides ordinal severity grades rather than continuous volumetric measures. Tang et al. [9] demonstrated that the stereoscopic color fundus photographs have the potential to reconstruct the depth information and allow the volumetric estimation for the optic disc swelling, but the requirement of carefully acquired stereo images rather than monocular images limits its applicability. More recently, Agne et al. [1] proposed a regression approach, which can directly estimate the total retinal volume (TRV) at the optic-nerve-head (ONH) region by only inputting a single fundus photograph; however this approach doesn’t predict local thickness values as may be needed to compute regional volumes.
Thus, to overcome the limitations above, we propose a machine-learning method that estimates the local volumetric information by only requiring a single monocular color fundus photograph Fig. 1(a) as the input. The proposed method outputs a thickness map with \(200\times 200\) pixels covering \(6\times 6\) mm\(^2\) at the ONH region. Based on the resulting thickness map, the volumes of the peripapillary region, the nasal, temporal, inferior, and superior quadrants, as well as the TRV can be computed shown in Fig. 1(d). Results are quantitatively assessed using the root-mean-square errors between the model’s outputs and the OCT ground truths, as well as Spearman’s rank correlation coefficients. Visualizations of the predicted thickness maps are also provided for qualitative assessment.
2 Methods
2.1 Overview
For the purpose of a fair comparison between the two image modalities, the input fundus photographs were registered to the SD-OCT image domain and centered/cropped at the ONH Fig. 1(a). Next, the blood vessels were inpainted and the resulting fundus image had features pertaining to textural and color information extracted. Based on these selected features, random forest classifiers, which will be discussed more thoroughly in Sect. 3, were trained to estimate the depth information of the retina at the pixel level and be able to output an ONH thickness map that makes regional volumetric measurements computable Fig. 1(d).
2.2 Preprocessing
We first registered the input fundus photograph with the ONH-centered SD-OCT en-face image. In particular, the SD-OCT images were segmented using 2D/3D graph-theoretical algorithms [3, 10], and the en-face image was created by averaging the pixel intensities along each A-scan within the retinal pigment epithelium (RPE) complex. After that, we applied blood vessel inpainting on the ONH-centered/registered fundus photograph Fig. 2(a) to suppress the negative effects from blood vessels on the predicted thickness map. During the processes of vessel inpainting, a blood vessel probability map was computed using a deep learning based approach using U-Net [6] Fig. 2(b). Then, this vessel probability map was thresholded (\(p=0.5\)) into a blood vessel mask. Next, a binary morphology dilation (spherical filter size: \(r=1\) pixel) was used to ensure that the vessels were completely encompassed by the mask. By overlapping the cropped fundus photograph with the dilated blood vessel mask Fig. 2(c), a blood vessel inpainted image using second order interpolation was created Fig. 2(d).
2.3 Feature Extraction
In the processes of assessing optic disc swelling via fundus photographs, neuro-ophthalmologists grade the swelling severity by inspecting key observable features on the image. Similarly, in this work, several feature sets (categories include: image intensity, color representations, gradient, and texture information) were extracted from the inpainted image to help the proposed classifier to estimate the thickness information at pixel-level; a total of 58 features were used as listed in Table 1.
To quantify the textural information, Gabor filters [7] are commonly used to analyze image objects with specific combinations of frequencies, directions, and regions of interest. Here, Gabor magnitude responses were computed at 0\(^\circ \), 45\(^\circ \), 90\(^\circ \), and 135\(^\circ \) with wavelengths of two and four pixels at each orientation Fig. 3.
Textural features were also obtained via use of gray-level co-occurrence matrices (GLCM) which involves statistically considering the spatial relationship of pixels [4]. The GLCMs were computed for each pixel in the inpainted image at an offset of one pixel at the right using three different neighborhood sizes: \(10\times 10\), \(15\times 15\), and \(20\times 20\). For each GLCM, statistical properties, including variance, contrast, entropy Fig. 4(a), homogeneity Fig. 4(b), and inertia, were used to create different feature images.
In addition, entropy can also be used to evaluate the image information in a particular region of interest. In this work, both small and large sliding window sets were applied on the inpainted fundus photographs to compute image entropy in regionsFootnote 1. For the smaller windows, these computed entropy values were interpreted as quantitative indicators of the presence of homogeneous textures; for the larger windows, the computed entropy values indicated the presence of heterogeneous textures. The image gradient direction and magnitude were also included as features as well to enhance the borders among the regions with texture and/or color changes. Along with the red, green, and blue intensity values, the inpainted fundus image was also converted to the hue-saturation-value (HSV) color space, and the separate channel values were used as features as well. Differences in color contrast can be accentuated in the hue and saturation Fig. 4(c) channels, which are highly indicative of texture since an opaque texture is more associated with swelling. In addition, the histogram-equalized image Fig. 4(d) as well as an intensity mapped image with the top and bottom one percent of pixels saturated were used as features. Both images work to emphasize contrast in color between regions of differing colors or differing intensities.
3 Experimental Methods
A total of 88 subjects with optic disc swelling having both volumetric SD-OCT as well as color fundus images were used for experimental analysis. The true thickness information at the ONH (i.e., the ground truth) for each subject was calculated based on the segmented internal limiting membrane (ILM) and the lower bounding surface of the RPE complex in the SD-OCT image. With a total of 58 input features from the input registered fundus image, a random forest bagged ensemble regression model was trained using 500 learning cycles and feature importance was calculated as part of the training process. To reduce computational complexity, the model was trained on two and a half percent of the pixels in each image chosen randomly. Evaluation was performed in a leave-one-subject-out approach so that the model for each subject was obtained by training the classifier on the images from the remaining 87 subjects. After predictions were made for individual pixel locations, volumes were calculated for the peripapillary, nasal, temporal, inferior, and superior regions. The peripapillary region was defined as the region inside a central circle with radius 1.73 mm. The nasal, temporal, inferior, and superior regions were defined as the four interior quadrants of the peripappilary circle using the 135\(^\circ \) and 45\(^\circ \) lines as boundaries. The overall TRV was calculated as well. Errors and correlations were then calculated for each individual region.
4 Results
When comparing the total retinal volume (TRV) calculated from the retinal thickness predictions generated from the described model and the ground truth from OCT images, a root-mean-square-error of 1.66 mm\(^3\) was achieved. Spearman’s correlation coefficient was \(R=0.73\). When comparing regional volumes, the nasal, temporal, inferior, superior, and peripapillary regions had root-mean-square-errors of 0.64 mm\(^3\), 0.61 mm\(^3\), 0.74 mm\(^3\), 0.71 mm\(^3\), and 1.30 mm\(^3\), respectively. The correlations (R) were 0.71 (nasal), 0.72 (temporal), 0.61 (inferior), 0.65 (superior), and 0.75 (peripapillary). Examples of comparisons between the total retinal thickness maps from the SD-OCTs (i.e., ground truths) and from the monocular fundus photographs are shown in Fig. 5.
Average feature importance across all models was calculated by permuting the features and looking for change in the model error. Top features were found to be entropy in large neighborhoods as well as features that accentuate color change, such as hue, saturation Fig. 4(c), or the histogram equalized image Fig. 4(d). All features that had distinctly different values for the optic disc compared to the peripheral area were helpful in distinguishing swollen regions from non-swollen regions.
5 Discussion and Conclusion
In this preliminary study, we have shown that the proposed method demonstrates the monocular fundus photographs as a potentially lower cost but more available alternative to the SD-OCT in the primary care or telemedical settings in cases of assessment for optic disc swelling. Using features pertaining to textural and color information obtained directly from the fundus photographs, akin to properties neuro-ophthalmologists use, local thickness measurements can be estimated using a regression model trained on a variety of subjects with varying degrees of swelling. The ability to assess depth information at the swollen regions can help clinicians identify onset of diseases in earlier stages; for example, papilledema (a particular type of optic disc swelling due to increased intracranial pressure) often presents with swelling in the nasal quadrant relatively early [11]. In addition, the predicted thickness maps provide a future opportunity to create 3D retinal shape models directly from the 2D monocular fundus photographs. The retinal shape information is hypothetically helpful for distinguishing the different causes of the optic disc swelling.
The limitations of this work may include: (1) the lack of subjects with severe optic disc swelling causes the trained classifiers to slightly underestimate the retinal thickness at the extremely swollen regions, and (2) the thickness predictions at the regions with imperfect vessel inpainting are less accurate due to the inconsistent texture information between the swollen retinal tissue and the vessel residual. Potential future work can involve collecting more SD-OCT and fundus image pairs with various severity of optic disc swelling or creating 3D retinal shape models directly from the 2D fundus photographs to help identify causes of optic disc swelling.
Notes
- 1.
The small sliding window sizes include: \(5\times 5\), \(7\times 7\), \(11\times 11\), \(13\times 13\), \(15\times 15\), \(17\times 17\), \(21\times 21\), \(25\times 25\); the large sliding window sizes include: \(37\times 37\), \(49\times 49\), \(73\times 73\), \(101\times 101\).
References
Agne, J., Wang, J.K., Kardon, R.H., Garvin, M.K.: Determining degree of optic nerve edema from color fundus photography. In: Proceedings of SPIE 9414, Medical Imaging 2015: Computer-Aided Diagnosis, pp. 94140F\(\_\)1–94140F\(\_\)9 (2015). http://proceedings.spiedigitallibrary.org/proceeding.aspx?doi=10.1117/12.2081423
Echegaray, S., Zamora, G., Yu, H., Luo, W., Soliz, P., Kardon, R.: Automated analysis of optic nerve images for detection and staging of papilledema. Investig. Ophthalmol. Vis. Sci. 52(10), 7470–7478 (2011)
Garvin, M.K., Abràmoff, M.D., Wu, X., Russell, S.R., Burns, T.L., Sonka, M.: Automated 3-D intraretinal layer segmentation of macular spectral-domain optical coherence tomography images. IEEE Trans. Med. Imaging 28(9), 1436–1447 (2009)
Haralick, R.M., Dinstein, I., Shanmugam, K.: Textural features for image classification. IEEE Trans. Syst. Man Cybern. SMC 3(6), 610–621 (1973)
Hayreh, S.S.: Optic disc edema in raised intracranial pressure - v. pathogenesis. Archiv. Ophthalmol. 95(9), 1553–1565 (1977)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015 Part III. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Roslan, R., Jamil, N.: Texture feature extraction using 2-D Gabor filters. In: 2012 International Symposium on Computer Applications and Industrial Electronics (ISCAIE), pp. 173–178, December 2012
Scott, C.J., Kardon, R.H., Lee, A.G., Frisén, L., Wall, M.: Diagnosis and grading of papilledema in patients with raised intracranial pressure using optical coherence tomography vs clinical expert assessment using a clinical staging scale. Archiv. Ophthalmol. 128(6), 705–711 (2010)
Tang, L., Kardon, R.H., Wang, J.K., Garvin, M.K., Lee, K., Abràmoff, M.D.: Quantitative evaluation of papilledema from stereoscopic color fundus photographs. Investig. Ophthalmol. Vis. Sci. 53(8), 4490–4497 (2012)
Wang, J.K., Kardon, R.H., Kupersmith, M.J., Garvin, M.K.: Automated quantification of volumetric optic disc swelling in papilledema using spectral-domain optical coherence tomography. Investig. Ophthalmol. Vis. Sci. 53(7), 4069–4075 (2012)
Wang, J.K., Miri, M.S., Kardon, R.H., Garvin, M.K.: Automated 3-D region-based volumetric estimation of optic disc swelling in papilledema using spectral-domain optical coherence tomography. In: Proceedings of SPIE 8672, Medical Imaging 2013: Biomedical Applications in Molecular, Structural, and Functional Imaging, pp. 867214\(\_1\)–867214\(\_1\)8 (2013)
Acknowledgments
This study was supported, in part, by the Department of Veterans Affairs Merit Award I01 RX001786 and the National Institutes of Health R01 EY023279.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Johnson, S.S., Wang, JK., Islam, M.S., Thurtell, M.J., Kardon, R.H., Garvin, M.K. (2018). Local Estimation of the Degree of Optic Disc Swelling from Color Fundus Photography. In: Stoyanov, D., et al. Computational Pathology and Ophthalmic Medical Image Analysis. OMIA COMPAY 2018 2018. Lecture Notes in Computer Science(), vol 11039. Springer, Cham. https://doi.org/10.1007/978-3-030-00949-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-00949-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00948-9
Online ISBN: 978-3-030-00949-6
eBook Packages: Computer ScienceComputer Science (R0)