1 Introduction

The extraocular muscles (EOMs) implement eye movements. Magnetic resonance imaging (MRI) has shown that many forms of binocular misalignment (strabismus) are associated with anatomical abnormalities of the EOMs [1]. In clinical practice, EOM enlargement is a key quantity to examine when diagnosing several complex forms of strabismus [2], including thyroid eye disease [3, 4]. How to reliably and efficiently outline EOM boundaries in clinical MRI is therefore an important practical and research question. In all published studies to date, investigators have segmented EOM boundaries manually [1, 5, 6], which is labor-intensive and may introduce user-dependent artifacts.

Several computer-aided semi-automatic [7-9] and automatic [10, 11] segmentation methods have been developed. However, all of these methods use image pixels as the underlying representation primitive. Pixels are not the most natural representation of visual scenes: they ignore the local patterns shared among neighboring pixels and are subject to noise. It is more natural and efficient to process the image as perceptually meaningful patches, each containing many pixels that share similar features.

We propose a fully automatic EOM segmentation method based on superpixels, region adjacency graphs and Normalized Cuts, integrating prior shape information. Rather than processing images at the pixel level, the approach builds on local features of the EOMs. We treat small image patches obtained by superpixel over-segmentation [12-15] as the basic unit of all further image processing, such as filtering, detection and segmentation. We show that by building a region adjacency graph over the superpixels, we can robustly outline the eye socket boundary and the EOMs within it. The performance of our automatic method was evaluated against manual segmentation and showed high accuracy.

2 Related Work

Firbank et al. [8] showed the feasibility of segmenting EOMs with active contours. However, this approach is sensitive to the boundary initialization, since it can easily be trapped in local minima [16]. In addition, its accuracy is governed by the convergence criteria: higher accuracy requires tighter criteria and longer computation time [17]. Souza et al. proposed a mathematical morphology method to semi-automatically segment EOMs [9, 18]. They performed iterative grayscale closing operations to segment the orbital wall, which was then used as the region of interest. The EOMs were outlined within this region through a Laplacian of Gaussian detector and opening operations. However, the flat disk used for the morphology operations had a fixed size and operated only at the pixel level, and the number of iterations had to be carefully supervised. A more recent semi-automatic approach deformed 3D geometric template models of the EOMs to the MRI images of individual patients [19]. Image features of the EOMs were detected and filtered to guide fitting of the generic anatomical template. However, the template model has to be built from anatomical characteristics of the EOMs, and a global registration between the image sequence and the template model must be performed at the outset.

3 Methodology

3.1 Superpixel Over-Segmentation

The Superpixel algorithm [13] groups pixels with coherent intensities and spatial locations into patches. These superpixels provide a higher-level representation of the original image that can be used for further processing. The geometric shapes of superpixels are not restricted to rectangles; this flexibility lets them represent features more naturally by preserving the boundaries of the objects in the image. Accurate segmentation can then be performed by merging local superpixels that share similar features.

We applied the k-means algorithm to group nearby pixels into superpixels of uniform size [12]. Unlike other superpixelization methods [20, 21], the k-means method produces a more regular grid of superpixels, which is important for building the region adjacency graph. Figure 1(a) shows a T1-weighted quasi-coronal MRI image perpendicular to the long axis of the orbit, with 312 micron pixels and 2 mm plane thickness. Figure 1(b) illustrates the result of the superpixel over-segmentation. The superpixel boundaries preserve the true structure boundaries. More importantly, the shape and area characteristics of the EOMs and the eye socket are relatively consistent [9]. By construction of the algorithm, the number of pixels in each superpixel is nearly constant across the image.

Fig. 1.
figure 1

(a) A representative coronal MRI image of the eye; (b) labeled ocular structures to segment; (c) superpixel over-segmentation; (d) manually segmented structures overlaid on the superpixels.

3.2 Region Adjacency Graph Construction

Fig. 2.
figure 2

Region adjacency graph built from superpixels.

Human visual perception is good at recognizing individual objects even under varying intensities or textures. Lacking such priors, superpixel segmentation algorithms [12, 13, 22, 23] tend to over-segment the image (Fig. 1(c)(d)). The region adjacency graph (RAG) is a data structure common to many segmentation algorithms [24]. We used a RAG to encode the spatial connections between neighboring superpixels: each superpixel was defined as a node in a graph (Fig. 2) and connected through edges to all of its neighbors. The RAG was used to merge adjacent regions provided that they have similar intensity distributions. Let \(n_i\) be a node in the RAG with mean intensity \(I_i\), and let \(n_j\) be one of its neighbors. The edge weight between \(n_i\) and \(n_j\) is defined as \(w_{ij}=\exp \bigl (\frac{-\Vert I_{i}-I_{j}\Vert ^2}{\sigma ^2}\bigr )\), where \(\sigma ^2\) is the overall image variance; \(w_{ij}\) measures the intensity similarity between \(n_i\) and \(n_j\).
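A minimal sketch of this construction, assuming 4-connectivity between pixels; the helper name and toy image are ours:

```python
import numpy as np

def build_rag(image, labels):
    """Region adjacency graph: nodes are superpixels, edges weighted by
    w_ij = exp(-(I_i - I_j)^2 / sigma^2), sigma^2 = overall image variance."""
    n = labels.max() + 1
    # Mean intensity per superpixel (node attribute).
    means = (np.bincount(labels.ravel(), weights=image.ravel(), minlength=n)
             / np.bincount(labels.ravel(), minlength=n))
    sigma2 = image.var()
    # Adjacent label pairs: compare each pixel with its right and lower neighbor.
    pairs = set()
    for a, b in [(labels[:, :-1], labels[:, 1:]), (labels[:-1, :], labels[1:, :])]:
        mask = a != b
        pairs |= {tuple(sorted(p)) for p in zip(a[mask].tolist(), b[mask].tolist())}
    edges = {(i, j): float(np.exp(-(means[i] - means[j]) ** 2 / sigma2))
             for i, j in pairs}
    return means, edges

# Toy image: two flat superpixels with intensities 0 and 1.
labels = np.repeat(np.array([[0, 0, 1, 1]]), 4, axis=0)
image = labels.astype(float)
means, edges = build_rag(image, labels)
```

For this toy image the variance is 0.25, so the single edge has weight \(\exp(-1/0.25)=e^{-4}\), correctly marking the two regions as dissimilar.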

3.3 Normalized Cuts Segmentation

Fig. 3.
figure 3

Normalized Cuts segmentation produced regions labeled in different gray scales.

Superpixel segmentation is a bottom-up approach, as it merges individual pixels together. Once the superpixels are represented by the RAG, we apply the Normalized Cuts algorithm (Ncut) [25], a top-down approach, to partition the graph recursively until the ocular structures are found. Applying Normalized Cuts to superpixels rather than pixels is more robust to image noise. It is also substantially more computationally efficient, since the size of the affinity matrix and the complexity of the RAG used for image representation are greatly reduced.

In each division, the Ncut algorithm optimally divides one region into two subregions \(N_1\) and \(N_2\) by removing edges connecting them in RAG:

$$\begin{aligned} Ncut(N_1,N_2)=\frac{cut(N_1, N_2)}{assoc(N_1,N)}+\frac{cut(N_1,N_2)}{assoc(N_2,N)}. \end{aligned}$$
(1)

\( cut(N_1, N_2)=\sum _{n_i\in N_1, n_j\in N_2}w_{ij}\) computes the degree of dissimilarity between \(N_1\) and \(N_2\) as the summed weight of all removed edges. \(assoc(N_1,N)\) is the total edge weight from nodes in \(N_1\) to all nodes in the current region \(N\).
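The quantities in Eq. (1) can be checked on a toy weighted graph; the helper below is an illustrative sketch of the criterion only, not the recursive partitioning itself:

```python
import numpy as np

def ncut_value(W, N1, N2):
    """Ncut(N1, N2) of Eq. (1) for a symmetric weight matrix W:
    cut(N1,N2)/assoc(N1,N) + cut(N1,N2)/assoc(N2,N)."""
    cut = W[np.ix_(N1, N2)].sum()
    assoc1 = W[N1, :].sum()  # total weight from N1 to all nodes
    assoc2 = W[N2, :].sum()
    return cut / assoc1 + cut / assoc2

# Two tightly coupled pairs (0,1) and (2,3), weakly linked by edge 0-2.
W = np.array([[0.0, 1.0, 0.1, 0.0],
              [1.0, 0.0, 0.0, 0.0],
              [0.1, 0.0, 0.0, 1.0],
              [0.0, 0.0, 1.0, 0.0]])

good = ncut_value(W, [0, 1], [2, 3])  # cuts only the weak edge
bad = ncut_value(W, [0, 2], [1, 3])   # cuts both strong edges
```

The balanced partition that severs only the weak edge yields a much smaller Ncut value, which is what the recursive minimization exploits.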

Figure 3 shows the segmentation results after applying Normalized Cuts. The final nodes each contain many superpixels and are colored in gray scale. Normalized Cuts successfully highlights some of the ocular structures, such as the superior rectus muscle, the superior oblique muscle and the optic nerve. However, the lateral and medial rectus muscles had incomplete boundaries that merged with the orbital wall, making them difficult to segment. Further automatic operations are needed to resolve these discontinuities and label each structure.

3.4 Orbital Wall and Extraocular Muscle Segmentation

Segmentation of the orbital wall has been studied previously. Souza et al. [9] applied an iterative grayscale mathematical morphology operation to segment the orbits, but their method required the user to specify a flat disk template and the number of recursive erosion iterations. Firbank et al. [8] manually outlined the boundary around the eye socket. We propose an automatic method to extract the orbital wall using prior shape knowledge. Laplacian of Gaussian [26] and connected-component labeling methods are applied to detect the connected boundaries of the orbital wall, rectus muscles and optic nerve from the Normalized Cuts segmentation shown in Fig. 4(a).
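On a toy image, LoG edge detection followed by connected-component labeling looks roughly as follows; the sigma and threshold are assumptions chosen for the synthetic disc, not tuned values from the paper:

```python
import numpy as np
from scipy import ndimage

# Synthetic slice: a bright disc standing in for the orbit.
yy, xx = np.mgrid[0:64, 0:64]
img = ((xx - 32) ** 2 + (yy - 32) ** 2 < 12 ** 2).astype(float)

# Laplacian of Gaussian responds strongly at intensity boundaries.
log = ndimage.gaussian_laplace(img, sigma=2.0)
edge_mask = np.abs(log) > 0.03  # illustrative threshold

# Connected-component labeling groups each closed boundary into one region.
labeled, n_regions = ndimage.label(edge_mask)
```

Each label in `labeled` then corresponds to one candidate boundary, so closed structures like the orbital wall emerge as single components.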

Fig. 4.
figure 4

Region of interest extraction. (a) Initial boundaries after Ncut based on Fig. 3; (b) center (*) of each boundary; (c) optic nerve and orbit regions found by k-means clustering; (d) orbit and EOM boundaries identified using the convex hull; (e) region of interest in the original image; (f) region of interest in the superpixel image (Color figure online).

To extract the orbital wall, we used the prior knowledge that the eye socket is always located near the image center. The center of each region produced by Normalized Cuts was calculated and is shown in Fig. 4(b). The centers of the optic nerve and the orbital wall are the two centers closest to the image center, so these two regions can be identified from the boundary map by a nearest-neighbor rule, and any regions outside the orbital wall are removed (Fig. 4(c)). Two of the closed boundaries inside the orbital wall were identified as the superior and inferior rectus muscles (Fig. 4(c)). To segment the lateral and medial rectus muscles, whose boundaries were incomplete, the convex hull around the orbital wall was calculated, shown as the red closed curve in Fig. 4(d). The resulting closed orbital wall completes the initially discontinuous boundaries of the lateral and medial rectus muscles, which are in contact with the orbital wall. The convex hull served as the region of interest and preserves the natural boundary of the eye socket in the original image (Fig. 4(e)) and the superpixel image (Fig. 4(f)). Finally, region- and hole-filling algorithms were used to complete the EOM segmentation (Fig. 5(a)). Figure 5(b) shows the segmented boundaries overlaid on the MRI and superpixel images.
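The convex-hull completion step can be sketched as follows. The C-shaped arc is a stand-in for a boundary broken where a muscle touches the orbital wall, and scikit-image's `convex_hull_image` is our choice of implementation, not necessarily the paper's:

```python
import numpy as np
from skimage.morphology import convex_hull_image

# Incomplete boundary: an open ring with a gap on the right-hand side.
yy, xx = np.mgrid[0:64, 0:64]
r = np.sqrt((xx - 32) ** 2 + (yy - 32) ** 2)
arc = (np.abs(r - 20) < 1.5) & (xx < 45)

# The convex hull closes the gap, yielding a filled region whose border
# can serve as the completed boundary / region of interest.
hull = convex_hull_image(arc)
```

The hull spans the gap in the arc and encloses the interior, which is exactly the property used to close the discontinuous lateral and medial rectus boundaries against the orbital wall.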

Fig. 5.
figure 5

(a) Segmented extraocular muscles and optic nerve. (b) Superpixels mapped with the orbital wall and the segmented ocular structures.

4 Experiments and Results

4.1 Materials

Fig. 6.
figure 6

The shape error between the manually segmented boundaries and the superpixel over-segmented boundaries decreases as the number of superpixels n increases, for the orbit, superior rectus (SR), inferior rectus (IR), medial rectus (MR), lateral rectus (LR) and all four rectus (All) muscles.

T1-weighted MRI images of both eyes were acquired from 5 healthy subjects and provided by Dr. Joseph Demer at UCLA. Eight coronal images with a slice thickness of 2 mm were segmented for each eye. All images were digitized at \(256\times 256\) pixels with 16-bit gray-level resolution and a voxel size of 0.3 mm \(\times \) 0.3 mm \(\times \) 2.0 mm. Two operators independently and manually traced the ocular structure boundaries, which were used as ground truth for accuracy assessment.

4.2 Shape Error Analysis

One critical issue is choosing the number of superpixels n for the k-means algorithm. With too few superpixels, important structures may be missed; with too many, the superpixels lose their advantage of being representative, robust and efficient. To determine an appropriate n, we analyzed its influence on segmentation accuracy. We first manually segmented one set of MRI images by tracing the boundaries of the ocular structures, then applied superpixel over-segmentation to the same images while varying \(200<n<2600\). For each n, we overlaid the manually traced boundaries on the superpixel image (Fig. 1(c)(d)). The shape error between the manual and superpixel segmentations was computed with a boundary-based measurement [13] quantifying how close the superpixel boundaries are to the manual segmentation (approximating ground truth): the mean absolute distance between the superpixel boundaries and the ground-truth boundaries. Figure 6 plots the shape error as a function of n for the different ocular structures. Unsurprisingly, the error drops monotonically as the number of superpixels increases, reaching zero when each pixel becomes its own superpixel. Following the "elbow" selection criterion [27, 28], \(n=1800\) was chosen for our MRI dataset in all subsequent operations.
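The mean absolute boundary distance can be computed efficiently with a distance transform; a minimal sketch, with the function name and toy boundaries being ours:

```python
import numpy as np
from scipy import ndimage

def mean_boundary_distance(boundary, gt_boundary):
    """Mean absolute distance (in pixels) from each point of `boundary`
    to the nearest point of the ground-truth boundary."""
    # Distance from every pixel to the nearest ground-truth boundary pixel.
    dist_to_gt = ndimage.distance_transform_edt(~gt_boundary)
    return dist_to_gt[boundary].mean()

# Toy check: two vertical boundary lines two pixels apart.
gt = np.zeros((16, 16), dtype=bool)
gt[:, 5] = True
sp = np.zeros((16, 16), dtype=bool)
sp[:, 7] = True
err = mean_boundary_distance(sp, gt)  # every point lies 2 px from gt
```

As n grows, the superpixel boundary mask comes to include every true boundary pixel and this error tends to zero, matching the trend in Fig. 6.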

4.3 Performance Evaluation

Area-based [9] and volume-based [7, 8] metrics are commonly used to evaluate segmentation accuracy. One drawback is that neither considers the overlap between the ground truth and the computer-generated segmentation. We therefore assessed our approach with region-based metrics: Variation of Information (VI) [29], Probabilistic Rand Index (RI) [30] and the Segmentation Covering criterion (Covering) [31].

Variation of Information. The Variation of Information metric was introduced for the purpose of clustering comparison [29]. It measures the distance between two segmentations in terms of their average conditional entropy defined as

$$\begin{aligned} VI(S,S')=H(S)+H(S')-2I(S,S'), \end{aligned}$$
(2)

where H and I denote the entropy and the mutual information between the two segmentations S and \(S'\).
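VI can be computed directly from the joint label histogram of the two segmentations; a pure-NumPy sketch, with the function name being ours:

```python
import numpy as np

def variation_of_information(S, Sp):
    """VI(S, S') = H(S) + H(S') - 2 I(S, S'), from the joint label
    histogram of two segmentations (entropies in nats)."""
    S, Sp = np.ravel(S), np.ravel(Sp)
    joint = np.zeros((S.max() + 1, Sp.max() + 1))
    np.add.at(joint, (S, Sp), 1.0)
    joint /= S.size

    def H(dist):
        d = dist[dist > 0]
        return float(-(d * np.log(d)).sum())

    Hs, Hsp, Hj = H(joint.sum(1)), H(joint.sum(0)), H(joint.ravel())
    mutual = Hs + Hsp - Hj  # mutual information I(S, S')
    return Hs + Hsp - 2 * mutual

vi_same = variation_of_information([0, 0, 1, 1], [0, 0, 1, 1])  # identical -> 0
vi_diff = variation_of_information([0, 0, 1, 1], [0, 1, 0, 1])  # independent labels
```

Identical segmentations give VI = 0, and the independent labelings above give VI = 2 ln 2, the sum of the two conditional entropies.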

Rand Index. The Rand Index [30] was first proposed for general clustering evaluation. It compares the compatibility of label assignments between pairs of elements in the clusterings. The Rand Index between the automatic segmentation X and the manual segmentation G is the number of pixel pairs that have the same label in both X and G, plus the number of pairs with different labels in both segmentations, divided by the total number of pixel pairs.
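A direct pairwise sketch of this definition (the O(n²) loop is for clarity; a contingency-table formula scales better for full images):

```python
import numpy as np
from itertools import combinations

def rand_index(X, G):
    """Fraction of pixel pairs on which segmentations X and G agree:
    same label in both, or different labels in both."""
    X, G = np.ravel(X), np.ravel(G)
    pairs = list(combinations(range(X.size), 2))
    agree = sum((X[i] == X[j]) == (G[i] == G[j]) for i, j in pairs)
    return agree / len(pairs)

ri_same = rand_index([0, 0, 1, 1], [0, 0, 1, 1])  # perfect agreement
ri_diff = rand_index([0, 0, 1, 1], [0, 1, 0, 1])  # frequent disagreement
```

Perfect agreement gives 1.0; the second pairing agrees on only 2 of 6 pairs, giving 1/3.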

Segmentation Covering. We define the covering of segmentation S by segmentation \(S'\) as

$$\begin{aligned} C(S' \rightarrow S) = \frac{1}{N} \sum _{R \in S} \left| R \right| \cdot \max _{R' \in S'} O(R, R'), \end{aligned}$$
(3)

where N is the total number of pixels in the image and \(O(R,R')=\frac{\left| R \cap R' \right| }{\left| R \cup R' \right| }\) is the overlap between two regions R and \(R'\) [31].
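Eq. (3) translates directly into code; a pure-NumPy sketch, with the function name and toy segmentations being ours:

```python
import numpy as np

def covering(S_prime, S):
    """C(S' -> S) of Eq. (3): size-weighted best overlap O (intersection
    over union) of each region of S by any region of S'."""
    total = 0.0
    for r in np.unique(S):
        R = (S == r)
        best = max(np.logical_and(R, S_prime == rp).sum()
                   / np.logical_or(R, S_prime == rp).sum()
                   for rp in np.unique(S_prime))
        total += R.sum() * best
    return total / S.size

S = np.array([[0, 0], [1, 1]])                    # rows
c_same = covering(S, S)                           # identical segmentations
c_diff = covering(np.array([[0, 1], [0, 1]]), S)  # columns vs. rows
```

An identical segmentation covers itself perfectly (score 1.0), while the column split only partially overlaps each row region, so the score drops.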

Table 1. Evaluation of the proposed segmentation against manual segmentation using region-based metrics (Segmentation Covering, Rand Index and Variation of Information) for the four rectus muscles.

Table 1 summarizes the evaluation results using the region-based metrics. The IR, LR, MR and SR muscles of both left and right eyes were analyzed. For Segmentation Covering we computed two scores, at the optimal dataset scale (ODS) and the optimal image scale (OIS). We also compared the Rand Index and Variation of Information against the manual segmentation. A lower Variation of Information indicates greater similarity; our average value of 1.51 outperforms other image segmentation outcomes [32]. The Rand Index lies in the range [0, 1], with higher values indicating greater similarity between two segmentations. Across the different EOMs, the average Rand Index exceeds 0.82, showing the strong performance of our algorithm, and its standard deviation of about 0.03 demonstrates that our method produces consistent results for the four EOMs. For Segmentation Covering we computed the normalized overlap score in the range [0, 1], where larger values indicate a more accurate segmentation; the average value across EOMs is 0.78, consistent with the Rand Index result. In summary, as Table 1 shows, our automatic algorithm segments boundaries close to the manually segmented ones, illustrating the accuracy and effectiveness of our approach.

5 Conclusions

Extraocular muscle segmentation from MRI is an important and challenging task for clinical diagnosis. This study demonstrated an automatic method using superpixels, region adjacency graphs and Normalized Cuts. The results were compared to manual segmentations, and region-based evaluation metrics showed that our method segments boundaries accurately. In the future, we plan to improve the efficiency of Normalized Cuts on superpixels by accelerating it on the GPU, or to employ other, more efficient graph cut algorithms to segment the images. To improve the reliability of optic nerve identification, we will exploit the shape prior of the optic nerve and locate it by fitting circles to the segmented shapes. The method will also be applied to automatically reconstruct 3D patient-specific EOM models for use in clinical diagnosis and surgical planning.