Multi-view longitudinal CNN for multiple sclerosis lesion segmentation

https://doi.org/10.1016/j.engappai.2017.06.006Get rights and content

Highlights

  • A convolutional neural network based method for multiple sclerosis lesion segmentation is proposed.

  • The network utilizes longitudinal data, a novel contribution in the domain of MS lesion analysis.

  • The use of longitudinal data significantly improves segmentation accuracy.

  • State-of-the-art results are obtained on a public benchmark dataset.

  • Expert human level segmentation accuracy is obtained by the proposed method.

Abstract

In this work, a deep-learning based automated method for Multiple Sclerosis (MS) lesion segmentation is presented. Automatic segmentation of MS lesions is a challenging task due to their variability in shape, size, location and texture in Magnetic Resonance (MR) images. In the proposed scheme, MR intensities and White Matter (WM) priors are used to extract candidate lesion voxels, following which Convolutional Neural Networks (CNN) are utilized for false positive reduction and final segmentation result. The proposed network uses longitudinal data, a novel contribution in the domain of MS lesion analysis. The method obtained state-of-the-art results on the 2015 Longitudinal MS Lesion Segmentation Challenge dataset, and achieved a performance level equivalent to a trained human rater. Automatic segmentation methods, such as the one proposed, once proven in accuracy and robustness, can help diagnosis and patient follow-up while reducing the time consuming need of manual segmentation.

Introduction

Multiple Sclerosis is one of the most common non-traumatic neurological diseases in young adults. It is a chronic inflammatory disease in which the immune system attacks the central nervous system (CNS) and damages myelin, myelin producing cells and underlying nerve fibers. Damages of the myelin cause scarring of brain tissues, mostly in the white matter, which are termed MS Lesions. The impairment of the CNS due to MS ultimately leads to deficiency in sensation, movement and cognition.

MRI plays an important role in the diagnosis and treatment of MS. The Revised McDonald Criteria, which incorporate the combination of clinical characteristics and MRI features, have been devised for the purpose of diagnosis of MS (Polman et al., 2005). According to these criteria, MS can be diagnosed after a single clinical episode when MS lesions are visible in MRI. Early diagnosis is important due to the availability of therapies that slow the progression of the disease. Once MS has been diagnosed, subsequent MR scans, usually performed on a yearly basis, are used to track the progress of the disease and make further treatment decisions.

Lesions may appear, disappear or change size and shape between consecutive MR scans (Guttmann et al., 1995). It is therefore the case that temporal variability in WM tissues may provide a strong indication for the presence of MS lesions, as depicted in Fig. 1. Recently, lesion segmentation algorithms that utilize longitudinal information were shown to obtain enhanced results over algorithms that segment each time point independently (Roy et al., 2015).

Due to the clinical importance and the challenging nature of automatic MS lesion detection and segmentation, several challenges have been organized, such as the MS lesion segmentation challenge in MICCAI 2008 (Styner et al., 2008) and the Longitudinal MS lesion segmentation challenge in ISBI 2015 (Carass et al., 2017). In the latter challenge, competing teams were able to make use of longitudinal data as well as multiple contrast images in order to provide accurate automatic lesion segmentations. A variety of algorithms were proposed. Top performing methods used supervised classification frameworks, such as Random Forests (Geremia et al., 2010) and Deep Neural Networks (Brosch et al., 2016).

Convolutional Neural Networks have become increasingly popular following the 2012 ImageNet classification challenge, in which Alex Krizhevsky’s network won by a large margin (Krizhevsky et al., 2012). Since then, CNNs have been successfully used for additional applications, such as object detection and segmentation. Due to the complex structure and the enormous number of parameter of CNNs, understanding why they perform so well is not straightforward, and several works have been dedicated for this purpose Zeiler and Fergus (2014), Simonyan et al. (2013). In recent years, CNNs have also been used successfully for medical image analysis, in which volumetric data is commonly available. A multi-view CNN, in which candidate 3D volumes are decomposed to axial, coronal and sagittal views, achieved state-of-the-art accuracy in lymph node detection. This multi-view framework, compared to 3D CNNs, was able to reduce the computations required for training and testing and was robust to overfitting due to the reduced number of network weights (Roth et al., 2014). Both 2D and 3D CNNs were recently proposed for the segmentation of MS lesions (Carass et al., 2017).

This work presents a longitudinal Multi-View CNN for the MS lesion segmentation task. The input to the CNN are patches from multiple views, multiple contrast images and multiple time points. To the best of our knowledge, this is the first CNN that takes advantage of longitudinal data for MS lesion segmentation. The proposed segmentation method was evaluated on the dataset provided in the 2015 ISBI challenge, and achieved state-of-the-art accuracy on the test set. This method, which was trained on the relatively small number of patients available in the challenge training set, was able to generalize well on the test set and achieve human level performance.

The rest of the paper is organized as follows: The proposed segmentation method is detailed in Section 2. Evaluations on the 2015 ISBI dataset are presented in Section 3. Experimental results are detailed in Section 4. Finally, Discussion of various aspects of the segmentation method and concluding remarks are provided in Section 5.

Section snippets

Methods

There are three main phases in the proposed segmentation method: Pre-Processing, Candidate Extraction and CNN Prediction. The Pre-Processing phase consists of a set of commonly-used steps, including co-Registration, brain extraction, Bias field correction and Intensity normalization. In the Candidate Extraction phase, masks based on FLAIR and WM prior are generated and applied to the MR images. In the CNN Prediction phase, the multi-view CNN outputs a lesion probability for every voxel in the

Evaluation

The proposed segmentation algorithms were evaluated on the dataset of the 2015 Longitudinal Multiple Sclerosis Segmentation Challenge. The overall data is composed of two parts: (1) Training data consisting of longitudinal images from 5 patients; (2) Test data consisting of longitudinal images from 14 different patients. For each patient, the data includes T1-weighted, T2-weighted, PD-weighted, and T2-weighted FLAIR MRI with 4-6 time points acquired on a 3T MR scanner. T1-weighted images have

Experimental results

This section begins with an evaluation of the proposed system on the training set, which enables to obtain the design and parameters that yield optimal results. Sections 4.1–4.4 show several cross-validation evaluation experiments on the training dataset. These experiments were conducted on an overall amount of 21 cases. Using the optimal design and parameters found by cross validation, Section 4.5 provides experimental results on a separate test set of 61 cases.

Discussion and conclusion

This section addresses several of the design considerations in setting up the proposed system: A Patch Based solution was constructed . Today, most recent works that focus on object segmentation are often successful with the use of Fully convolutional networks (Long et al., 2015). These networks involve convolutions on the entire volume. In the proposed method, the candidate extraction stage eliminates the vast majority of voxels in the volume as possible lesion candidates. Therefore,

Acknowledgment

Part of this work was funded by the INTEL Collaborative Research Institute for Computational Intelligence (ICRI-CI) .

References (24)

  • CarassAaron et al.

    Longitudinal multiple sclerosis lesion segmentation: resource & challenge

    NeuroImage

    (2017)
  • JenkinsonMark et al.

    Improved optimization for the robust and accurate linear registration and motion correction of brain images

    Neuroimage

    (2002)
  • BroschTom et al.

    Deep 3D convolutional encoder networks with shortcuts for multiscale feature integration applied to multiple sclerosis lesion segmentation

    IEEE Transactions on Medical Imaging

    (2016)
  • Chollet, F., 2015. Keras, https://github.com/fchollet/keras/. (Accessed 3 January...
  • GeremiaEzequiel et al.

    Spatial decision forests for MS lesion segmentation in multi-channel MR images

  • GirshickRoss et al.

    Rich feature hierarchies for accurate object detection and semantic segmentation

  • GuttmannC.R. et al.

    The evolution of multiple sclerosis lesions on serial MR.

    American Journal of Neuroradiology

    (1995)
  • ISBI 2015. longitudinal MS lesion segmentation evaluation website, https://smart-stats-tools.org/node/26. (Accessed 3...
  • KrizhevskyAlex et al.

    Imagenet classification with deep convolutional neural networks

  • LongJonathan et al.

    Fully convolutional networks for semantic segmentation

  • MazziottaJohn et al.

    A probabilistic atlas and reference system for the human brain: international consortium for brain mapping (ICBM)

    Philosophical Transactions of the Royal Society, Series B (Biological Sciences

    (2001)
  • MechrezRoey et al.

    Patch-based segmentation with spatial consistency: application to MS lesions in brain MRI

    Journal of Biomedical Imaging

    (2016)
  • Cited by (68)

    • Liver lesion changes analysis in longitudinal CECT scans by simultaneous deep learning voxel classification with SimU-Net

      2023, Medical Image Analysis
      Citation Excerpt :

      These methods assume that the prior lesions segmentations are available as input; none performs lesion changes analysis at the voxel or lesion levels. Birenbaum and Greenspan (2017) describe a method for multiple sclerosis lesion detection and segmentation in pairs of MRI scans. The method detects and segments lesions in each scan separately with a single view CNN and then computes the lesions volume difference.

    • Realistic image normalization for multi-Domain segmentation

      2021, Medical Image Analysis
      Citation Excerpt :

      A plethora of pre-processing techniques exists to normalize medical images prior to any image analysis. A common approach, described as a standardization (Birenbaum and Greenspan, 2017; Kamnitsas et al., 2017b; Casamitjana et al., 2017; Chen et al., 2018), consists in normalizing each pixel intensity value in an input image by subtracting from it, the image average intensity and dividing it by the standard deviation. However, this simple strategy does not take into account the global statistics of the dataset.

    • A machine learning framework for accelerating the design process using CAE simulations: An application to finite element analysis in structural crashworthiness

      2021, Computer Methods in Applied Mechanics and Engineering
      Citation Excerpt :

      Machine learning (ML) within the field of artificial intelligence (AI) has emerged as a solution to cope with the challenges that have arisen during this data science revolution. ML has made substantial developments in the area of human–computer interactions through natural language processing [7,8], financial forecasting [9], and diagnostic tools for medical analysis [10,11]. ML and AI applications in the automotive sector have primarily focused on advanced driver assistance systems, vehicle controls, and autonomous driving systems [12,13].

    View all citing articles on Scopus
    View full text