Abstract
Intensity-based multi-atlas strategies have shown leading performance in segmenting healthy subjects, but when lesions are present, the abnormal lesion intensities affect the fusion result. Here, we propose a reformulated statistical fusion approach for multi-atlas segmentation that is applicable to both healthy and injured brains. This method avoids the interference of lesion intensities on the segmentation by incorporating two a priori masks to the Non-Local STAPLE statistical framework. First, we extend the theory to include a lesion mask, which improves the voxel correspondence between the target and the atlases. Second, we extend the theory to include a known label mask, that forces the label decision in case it is beforehand known and enables seamless integration of manual edits. We evaluate our method with simulated and MS patient images and compare our results with those of other state-of-the-art multi-atlas strategies: Majority vote, Non-local STAPLE, Non-local Spatial STAPLE and Joint Label Fusion. Quantitative and qualitative results demonstrate the improvement in the lesion areas.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Brain parcellation has become an essential tool for understanding neurological structural-functional associations at a millimeter scale. The resulting voxelwise tissue classifications are integral to identifying structural regions for connectomics, functional activations, quantitative/metabolical changes, diffusion connectivity, etcetera. These techniques require reproducible segmentations; however, manual delineation is time-consuming, exhibits poor reproducibility, and is subject to inter- and intra- operator variability. For these reasons, automatic brain parcellation has been widely studied [1,2,3]. Several automatic strategies have been proposed in the literature to segment brain structures, such as deformable, learning-based, region-based, etc. [4,5,6]; however, most of these methods are structure-specific and do not allow segmentation of the whole brain. In contrast, atlas-based strategies provide a whole parcellation when the atlases used have all the structures labelled.
In multi-atlas segmentation, a collection of atlases is registered to the target image and their labels are propagated and fused in the target image space, obtaining the final segmentation. Label fusion strategies based on intensities [7,8,9,10] have been demonstrated to be robust and provide good performance when dealing with healthy subjects. However, as most other state-of-the-art methods, they are designed to segment healthy subjects and their performance tends to be affected when segmenting brains hindered by tumors and lesions, for instance, as a result of multiple sclerosis (MS) [12].
Herein, we propose a novel statistical fusion algorithm that reformulates the non-local STAPLE (NLS) [8] statistical framework to handle (anatomical) MRI visible lesions. As in NLS, our method models the registered atlases as collections of volumetric patches with intensity and label information. To complement the non-local criteria, we introduce lesion mask information to resolve the imperfect correspondences between the healthy atlases and the lesioned target derived from inaccurate registrations. Additionally, a second mask is integrated into the estimation process, which forces the voxel label assignation in case it is known beforehand. For instance, this modification is useful when segmenting brains with tumors for which sub-regions are known. Together, these innovations enable inclusion of masks of abnormal anatomy and manually provided edits within modern statistical fusion approaches. We derive the theoretical basis governing our method and demonstrate segmentation improvement with respect to other multi-atlas strategies on the state of the art on both simulated and MS images.
2 Theory
Consider a target gray-level image (with lesions) represented as a vector \(I\in \mathrm I\!R^{N\times 1}\). Let \(T\in \mathcal {L}^{N\times 1}\) be the latent representation of the true target segmentation, where \(\mathcal {L}=\{0,\ldots ,L-1\}\) is the set of possible labels which can be assigned to a concrete voxel. Let \(M\in \{0,1\}^{N\times 1}\) be a binary lesion mask indicating whether a given voxel i of the target image contains or is part of a lesion and \(K\in \{0,1\}^{N\times 1}\) a second mask specifying if for a given voxel i of the target image, the true label is known, hence \(M_i=p\left( I_i\in lesion\right) \) and \(K_i=p\left( T_i=T_k\in \mathcal {L}^{N\times 1}\right) \). Note that both masks are optional and can be neglected if all voxels in the mask are set to 0. Consider a set R of registered healthy atlases with associated gray level images, \(A\in \mathrm I\!R^{N\times R}\), and propagated label decisions, \(D\in \mathcal {L}^{N\times R}\). Let \(\theta \in [0,1]^{R\times N\times L\times L}\) be the performance level parameters of the raters (registered atlases), defined voxel-wise. Each element of \(\theta \), \(\theta _{jis's}\), represents the probability that rater j observes label \(s'\) given that the true label is s at a given voxel i and the corresponding voxel \(i^{*}\) on the associated atlas−i.e., \(\theta _{jis's}=p\left( D_{i^{*}j}=s',A_j|T_i=s,I_i,M_i,K_i,\theta _{jis's}\right) \), where \(i^{*}\) is the voxel on atlas j that corresponds to the target voxel i.
2.1 Non-local Correspondence Model
Non-local STAPLE (NLS) [8] incorporates the concept of patch-based non-local correspondence based on the image intensities of both the target image I and the registered atlases A to the STAPLE framework. Although this concept has proven useful for matching healthy tissues to account for registration accuracy, we cannot rely on intensity similarities between the target lesion areas and the healthy atlases to rectify registration errors. Therefore, we assume that voxel correspondence inside the lesions cannot be further improved based on intensity and, hence, enforce the non-local weighting (\(\alpha _{ji'i}\)) between voxel i in the target image at voxel \(i'\) on the jth atlas as follows:
where \(\wp (\cdot )\) is the set of intensities in the patch neighborhood of a given intensity location. In this definition, \(\wp _{M_i}=\wp (1-M_i)\) is the masking term that excludes lesion voxels from the patch calculation and enforces the same patch neighborhood size/shape in both the atlas and the target, \(\Vert {\wp _{M_i}\circ (\wp (A_{i'j}-\wp (I_i))}\Vert ^{2}_2\) is the L2-norm between the atlas patch centered at \(i'\) and the target patch centered at i, \(\varepsilon _{i'i}^{2}\) is the Euclidean distance in physical space between i and \(i'\), \(\sigma _i\) and \(\sigma _d\) are the standard deviations of the intensity and distance weights, and \(Z_\alpha \) is a partition function that enforces the constraint that \(\sum _{i'\in \mathcal {N}(i)}{\alpha _{ji'i}}=1\), where \(\mathcal {N}(i)\) is the set of voxels in the search neighborhood of a given target voxel. \(\delta (i'=i)\) is the Dirac delta function, and \(\Vert \wp _{M_i}\Vert \) is the number of voxels in the patch neighborhood.
2.2 The Algorithm
If the exact voxel correspondences between the target and the atlases (non-local model) were known, the lesion mask, and the target and atlas intensity relationships could be ignored and the spatial STAPLE [11] definition of \(\theta \) could be used.
However, this correspondence is not known and we have to learn it with the model defined in Sect. 2.1. Note that using this model we can approximate the relationship by taking the expected value of \(p\left( D_{i^{*}j}=s',A_j | T_i=s,I_i,M_i,K_i,\theta _{jis's}\right) \) across the raters. Using an assumption of conditional independence between the labels, lesion mask and intensity, we approximate the density function as:
E-step. Let \(W\in \mathrm I\!R^{L\times N}\), where \(W_{si}^{(t)}\) represents the probability that the true label associated with voxel i is s at iteration t of the algorithm given the provided information and the performance level parameters.
Using Bayes’ rule to separate the prior label probability (\(p\left( T_i=s\right) \)) and assuming independence among the raters, we can rewrite this equation as follows:
where \(\delta \left( s'=s\right) \) is the Dirac delta function (probability that the known label for voxel i of the truth segmentation is s). Using the non-local correspondence model and the approximated density function, we obtain:
M-step. In this step, the calculated \(W_{si}^{(t)}\) is used to update \(\theta _{ji}^{(t+1)}\) by maximizing the expectation of the complete data log likelihood. As the complete data log likelihood is not observable, it is replaced by its conditional expectation given the observable data D, A, I, M, K using the current estimate \(\theta \).
As each row of \(\theta \) must sum one to be a valid probability mass function, we can maximize the performance level parameters for each element by using a Lagrange multiplier (\(\lambda \)) to formulate the constrained optimization problem.
By solving this equation, we obtain
2.3 Initialization and Priors
The voxel-wise prior \(p\left( T_i=s\right) \) was initialized using the weak log-odds majority vote, as in NLSS. The performance parameters, \(\theta _{jis's}\), were initialized assuming each atlas has high performance as: 1, if \(s=s'\); 0.95, if \(s=s'\); 0, if \(s\ne s'\); and \(\frac{0.05}{L-1}\), otherwise. The search neighborhood \(\mathcal {N}(\cdot )\) was set to \(7\times 7\times 7\), patch \(\wp (\cdot )\) dimensions to \(5\times 5\times 5\) and \(\sigma _i\) and \(\sigma _d\) were set to 0.25 and 1.5, respectively. Algorithm convergence was detected when the average change in the diagonal elements of \(\theta \) was below \(10^{-4}\).
3 Experiments and Results
The atlases used in our experiments were taken from the MICCAI 2012 Grand Challenge and Workshop on Multi-Atlas Labeling database [13]. This database consists of 35 T1-w MR images, obtained from the OASISFootnote 1 project and labeled by Neuromorphometrics, Inc.Footnote 2, and includes labels for the whole brain. PCA atlas selection was performed and only the 15 most similar atlases were used for segmentation. All images were histogram normalized and N4 bias field corrected before registration. All pair-wise registrations were performed using an initial affine registration (niftyregFootnote 3) followed by a non-rigid (ANTsFootnote 4) procedure. In all the registrations performed, the lesions were masked-out to avoid their intensities to interfere in the similarity metric calculation.
As benchmarks, we compare the proposed algorithm to majority vote (MV) [14], non-local STAPLE (NLS) [8], non-local Spatial STAPLE (NLSS) [9] and Joint Label Fusion (JLF) [10]. For a fair comparison, all the parameters that NLS and NLSS share with our algorithm were set to the same values. Also, JLF was executed with the same patch and neighborhood size.
3.1 Simulated Lesions
Evaluating the performance of segmentation algorithms on real lesioned images is not an easy task since there is a lack of public databases with ground truth for both lesions and structures. For this reason, in the first experiment, we simulated two sets of artificially lesioned images: (1) 10 with uniform intensity lesions, to test the proposed theory and, (2) 15 with lesion shapes, intensities and locations obtained from an in-house MS patient database, to simulate realistic cases. All the lesions were generated on random subjects from the MICCAI 2012 database. The lesion load of the generated images ranged from [33.49−119.74] mm\(^3\) in the first cohort and from [3.16−26.96] mm\(^3\) in the second one.
We evaluated the segmentation results quantitatively using a global Dice Similarity Coefficient (DSC) across all the structures as the main measure. As the lesion intensities not only necessarily affect the lesion area segmentation itself, but also the surrounding tissues, two measures were calculated: (1) DSC inside the lesion mask, and (2) DSC inside a mask that included three voxels of the contour. Note that \(\mathcal {N}(\cdot )\) was set to \(7\times 7\times 7\).
Figure 1(A) shows that, inside the lesion mask, our method performed significantly better than all the intensity-based strategies (JLF, NLS and NLSS) in both cohorts. However, the performance was similar to that of MV. This is due to the fact that we cannot trust the intensities inside the lesions, and we can only rely on an accurate registration (same as MV does). On the other hand, when the performance was analyzed around the lesion areas, our proposal was the one that provided the best results (similar to MV in the first cohort and to JLF in the second one). This behavior is depicted in Fig. 1(B), where JLF (b and h) misclassifies several structures inside the lesion areas, whereas in NLSS (c and i) the segmentation is being also affected in the surrounding structures.
For the evaluation of the manual edits (K mask integration), we segmented the first dataset again, this time feeding the algorithm with the same lesion mask for both M and K. The results showed, as expected, a DSC of 1 inside the lesion areas (M/K mask), whereas the mean DSC around the lesions was \(0.7901\pm 0.0463\), very similar to that of the first execution (\(0.7919\pm 0.0457\)), conserving a similar effect on the tissues surrounding the lesions.
3.2 MSSeg 2016 Challenge
For the second experiment, we qualitatively compared the fusion results obtained by the analyzed algorithms on a MS patient database (MSSeg 2016 challengeFootnote 5).
Figure 2 shows the segmentations obtained with all the analyzed multi-atlas strategies. As we can observe from Fig. 2(a), MS lesions are hypo-intense in T1-w modality, which makes its intensity profile similar of that of the gray matter (GM) and even sometimes similar to the cerebro-spinal fluid (CSF) which may affect the results of intensity-based algorithms. The lesions shown in Fig. 2(b), should be classified as white matter, however, the intensity-based algorithms of the state of the art, Fig. 2(f−h), tend to misclassify those regions as GM or CSF, whereas our method, Fig. 2(c), shows better classification results in those areas. When our method is fed with a K mask, Fig. 2(d), the lesion surrounding voxels remain practically the same as when the K mask is not used, Fig. 2(c), whereas the segmentation result inside the lesions agrees entirely with the labels imposed by this mask, as seen in Sect. 3.1.
4 Discussion
Accurate structural volume measurements are important in MS, since the atrophy of some structures such as the deep GM is relevant to the disease progression. However, we have shown that multi-atlas strategies based on intensities, which achieve good segmentation results on healthy subjects, are affected by lesions, and therefore corrupting real measures.
Herein, we have presented the theory to modify the non-local STAPLE framework to deal with MRI visible lesions. The experiments performed show that our proposal outperforms the state-of-the-art multi-atlas strategies in the lesion areas for both simulated and MS patient images.
Over-performance of MV compared to the state-of-the-art intensity-based strategies was observed on the experiments performed on the uniform intensity lesions database around the lesion areas. This behavior could be due to the fact that the other strategies are patch-based. These strategies consider mean patch differences to calculate the correspondences, hence the bright voxels of the lesions could bias the mean intensity, finding wrong atlas correspondences. Even though, this is an extreme case to test the proposed theory, it shows, combined with the over-performance of MV inside the lesion areas, the effect of the lesion intensities on the segmentation.
In this work, we have only focused on the segmentation performance of the lesion areas, since those are the ones concerning the proposed reformulation. Nonetheless, as these areas are better segmented with our strategy, the average whole brain segmentation performance slightly increases compared to the non-local STAPLE variants. This small improvement is due to lesions are small compared to the whole brain volume. For this reason, we believe that extending our theory to other methods of the literature, such as JLF, would be beneficial in terms of segmentation accuracy of the lesion areas but also of the whole brain.
References
Iglesias, J.E., Sabuncu, M.R.: Multi-atlas segmentation of biomedical images: a survey. Med. Image Anal. 24(1), 205–219 (2015)
Li, W., Wang, G., Fidon, L., Ourselin, S., Cardoso, M.J., Vercauteren, T.: On the compactness, efficiency, and representation of 3D convolutional networks: brain parcellation as a pretext task. In: Niethammer, M., et al. (eds.) IPMI 2017. LNCS, vol. 10265, pp. 348–360. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59050-9_28
Wachinger, C., Reuter, M., Klein, T.: DeepNAT: deep convolutional neural network for segmenting neuroanatomy. NeuroImage 170, 434–445 (2018)
González-Villà, S., Oliver, A., Valverde, S., Wang, L., Zwiggelaar, R., Lladó, X.: A review on brain structures segmentation in magnetic resonance imaging. Artif. Intell. Med. 73, 45–69 (2016)
Kushibar, K., et al.: Automated sub-cortical brain structure segmentation combining spatial and deep convolutional features. Med. Image Anal. 48, 177–186 (2018)
Dolz, J., Desrosiers, C., Ayed, I.B.: 3D fully convolutional networks for subcortical segmentation in MRI: a large-scale study. NeuroImage 170, 456–470 (2018)
Coupé, P., Manjón, J.V., Fonov, V., Pruessner, J., Robles, M., Collins, D.L.: Patch-based segmentation using expert priors: application to hippocampus and ventricle segmentation. NeuroImage 54(2), 940–954 (2011)
Asman, A.J., Landman, B.A.: Non-local statistical label fusion for multi-atlas segmentation. Med. Image Anal. 17(2), 194–208 (2013)
Huo, Y., Asman, A.J., Plassard, A.J., Landman, B.A.: Simultaneous total intracranial volume and posterior fossa volume estimation using multi-atlas label fusion. Hum. Brain Mapp. 38(2), 599–616 (2017)
Wang, H., Suh, J.W., Das, S.R., Pluta, J.B., Craige, C., Yushkevich, P.A.: Multi-atlas segmentation with joint label fusion. IEEE Trans. Pattern Anal. Mach. Intell. 35(3), 611–623 (2013)
Asman, A.J., Landman, B.A.: Formulating spatially varying performance in the statistical fusion framework. IEEE Trans. Med. Imaging 31(6), 1326–1336 (2012)
González-Villà, S., et al.: Evaluating the effect of multiple sclerosis lesions on automatic brain structure segmentation. NeuroImage Clin. 15, 228–238 (2017)
Landman, B., Warfield, S.: MICCAI 2012 workshop on multi-atlas labeling. In: MICCAI Grand Challenge and Workshop on Multi-Atlas Labeling, Nice, France (2012)
Sabuncu, M., Yeo, B., Van Leemput, K., Fischl, B., Golland, P.: A generative model for image segmentation based on label fusion. IEEE Trans. Med. Imaging 29(10), 1714–1729 (2010)
Acknowledgements
The authors would like to thank Jose Bernal for helpful discussion. Sandra González-Villà holds a UdG-BRGR2015 grant. This work has been supported by “La Fundació la Marató de TV3”, by Retos de Investigación TIN2014-55710-R and TIN2015-73563-JIN, by DPI2017-86696-R and by UdG mobility grant MOB17. This research was supported by NSF CAREER 1452485, NIH R01-EB017230, and the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06. We appreciate the NIH S10 Shared Instrumentation Grant 1S10OD020154-01 (Smith), Vanderbilt IDEAS grant (Holly-Bockelmann, Walker, Meliler, Palmeri, Weller), and ACCRE’s Big Data TIPs grant from Vanderbilt University. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health, National Science Foundation, or other sponsors.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
González-Villà, S., Huo, Y., Oliver, A., Lladó, X., Landman, B.A. (2018). Multi-atlas Parcellation in the Presence of Lesions: Application to Multiple Sclerosis. In: Bai, W., Sanroma, G., Wu, G., Munsell, B., Zhan, Y., Coupé, P. (eds) Patch-Based Techniques in Medical Imaging. Patch-MI 2018. Lecture Notes in Computer Science(), vol 11075. Springer, Cham. https://doi.org/10.1007/978-3-030-00500-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-00500-9_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00499-6
Online ISBN: 978-3-030-00500-9
eBook Packages: Computer ScienceComputer Science (R0)