Multi-view longitudinal CNN for multiple sclerosis lesion segmentation

doi:10.1016/j.engappai.2017.06.006

Engineering Applications of Artificial Intelligence

Volume 65, October 2017, Pages 111-118

https://doi.org/10.1016/j.engappai.2017.06.006 Get rights and content

Highlights

•
A convolutional neural network based method for multiple sclerosis lesion segmentation is proposed.
•
The network utilizes longitudinal data, a novel contribution in the domain of MS lesion analysis.
•
The use of longitudinal data significantly improves segmentation accuracy.
•
State-of-the-art results are obtained on a public benchmark dataset.
•
Expert human level segmentation accuracy is obtained by the proposed method.

Abstract

In this work, a deep-learning based automated method for Multiple Sclerosis (MS) lesion segmentation is presented. Automatic segmentation of MS lesions is a challenging task due to their variability in shape, size, location and texture in Magnetic Resonance (MR) images. In the proposed scheme, MR intensities and White Matter (WM) priors are used to extract candidate lesion voxels, following which Convolutional Neural Networks (CNN) are utilized for false positive reduction and final segmentation result. The proposed network uses longitudinal data, a novel contribution in the domain of MS lesion analysis. The method obtained state-of-the-art results on the 2015 Longitudinal MS Lesion Segmentation Challenge dataset, and achieved a performance level equivalent to a trained human rater. Automatic segmentation methods, such as the one proposed, once proven in accuracy and robustness, can help diagnosis and patient follow-up while reducing the time consuming need of manual segmentation.

Introduction

Multiple Sclerosis is one of the most common non-traumatic neurological diseases in young adults. It is a chronic inflammatory disease in which the immune system attacks the central nervous system (CNS) and damages myelin, myelin producing cells and underlying nerve fibers. Damages of the myelin cause scarring of brain tissues, mostly in the white matter, which are termed MS Lesions. The impairment of the CNS due to MS ultimately leads to deficiency in sensation, movement and cognition.

MRI plays an important role in the diagnosis and treatment of MS. The Revised McDonald Criteria, which incorporate the combination of clinical characteristics and MRI features, have been devised for the purpose of diagnosis of MS (Polman et al., 2005). According to these criteria, MS can be diagnosed after a single clinical episode when MS lesions are visible in MRI. Early diagnosis is important due to the availability of therapies that slow the progression of the disease. Once MS has been diagnosed, subsequent MR scans, usually performed on a yearly basis, are used to track the progress of the disease and make further treatment decisions.

Lesions may appear, disappear or change size and shape between consecutive MR scans (Guttmann et al., 1995). It is therefore the case that temporal variability in WM tissues may provide a strong indication for the presence of MS lesions, as depicted in Fig. 1. Recently, lesion segmentation algorithms that utilize longitudinal information were shown to obtain enhanced results over algorithms that segment each time point independently (Roy et al., 2015).

Due to the clinical importance and the challenging nature of automatic MS lesion detection and segmentation, several challenges have been organized, such as the MS lesion segmentation challenge in MICCAI 2008 (Styner et al., 2008) and the Longitudinal MS lesion segmentation challenge in ISBI 2015 (Carass et al., 2017). In the latter challenge, competing teams were able to make use of longitudinal data as well as multiple contrast images in order to provide accurate automatic lesion segmentations. A variety of algorithms were proposed. Top performing methods used supervised classification frameworks, such as Random Forests (Geremia et al., 2010) and Deep Neural Networks (Brosch et al., 2016).

Convolutional Neural Networks have become increasingly popular following the 2012 ImageNet classification challenge, in which Alex Krizhevsky’s network won by a large margin (Krizhevsky et al., 2012). Since then, CNNs have been successfully used for additional applications, such as object detection and segmentation. Due to the complex structure and the enormous number of parameter of CNNs, understanding why they perform so well is not straightforward, and several works have been dedicated for this purpose Zeiler and Fergus (2014), Simonyan et al. (2013). In recent years, CNNs have also been used successfully for medical image analysis, in which volumetric data is commonly available. A multi-view CNN, in which candidate 3D volumes are decomposed to axial, coronal and sagittal views, achieved state-of-the-art accuracy in lymph node detection. This multi-view framework, compared to 3D CNNs, was able to reduce the computations required for training and testing and was robust to overfitting due to the reduced number of network weights (Roth et al., 2014). Both 2D and 3D CNNs were recently proposed for the segmentation of MS lesions (Carass et al., 2017).

This work presents a longitudinal Multi-View CNN for the MS lesion segmentation task. The input to the CNN are patches from multiple views, multiple contrast images and multiple time points. To the best of our knowledge, this is the first CNN that takes advantage of longitudinal data for MS lesion segmentation. The proposed segmentation method was evaluated on the dataset provided in the 2015 ISBI challenge, and achieved state-of-the-art accuracy on the test set. This method, which was trained on the relatively small number of patients available in the challenge training set, was able to generalize well on the test set and achieve human level performance.

The rest of the paper is organized as follows: The proposed segmentation method is detailed in Section 2. Evaluations on the 2015 ISBI dataset are presented in Section 3. Experimental results are detailed in Section 4. Finally, Discussion of various aspects of the segmentation method and concluding remarks are provided in Section 5.

Section snippets

Methods

There are three main phases in the proposed segmentation method: Pre-Processing, Candidate Extraction and CNN Prediction. The Pre-Processing phase consists of a set of commonly-used steps, including co-Registration, brain extraction, Bias field correction and Intensity normalization. In the Candidate Extraction phase, masks based on FLAIR and WM prior are generated and applied to the MR images. In the CNN Prediction phase, the multi-view CNN outputs a lesion probability for every voxel in the

Evaluation

The proposed segmentation algorithms were evaluated on the dataset of the 2015 Longitudinal Multiple Sclerosis Segmentation Challenge. The overall data is composed of two parts: (1) Training data consisting of longitudinal images from 5 patients; (2) Test data consisting of longitudinal images from 14 different patients. For each patient, the data includes T1-weighted, T2-weighted, PD-weighted, and T2-weighted FLAIR MRI with 4-6 time points acquired on a 3T MR scanner. T1-weighted images have

Experimental results

This section begins with an evaluation of the proposed system on the training set, which enables to obtain the design and parameters that yield optimal results. Sections 4.1–4.4 show several cross-validation evaluation experiments on the training dataset. These experiments were conducted on an overall amount of 21 cases. Using the optimal design and parameters found by cross validation, Section 4.5 provides experimental results on a separate test set of 61 cases.

Discussion and conclusion

This section addresses several of the design considerations in setting up the proposed system: A Patch Based solution was constructed . Today, most recent works that focus on object segmentation are often successful with the use of Fully convolutional networks (Long et al., 2015). These networks involve convolutions on the entire volume. In the proposed method, the candidate extraction stage eliminates the vast majority of voxels in the volume as possible lesion candidates. Therefore,

Acknowledgment

Part of this work was funded by the INTEL Collaborative Research Institute for Computational Intelligence (ICRI-CI) .

References (24)

CarassAaron et al.
Longitudinal multiple sclerosis lesion segmentation: resource & challenge
NeuroImage
(2017)
JenkinsonMark et al.
Improved optimization for the robust and accurate linear registration and motion correction of brain images
Neuroimage
(2002)
BroschTom et al.
Deep 3D convolutional encoder networks with shortcuts for multiscale feature integration applied to multiple sclerosis lesion segmentation
IEEE Transactions on Medical Imaging
(2016)
Chollet, F., 2015. Keras, https://github.com/fchollet/keras/. (Accessed 3 January...
GeremiaEzequiel et al.
Spatial decision forests for MS lesion segmentation in multi-channel MR images
GirshickRoss et al.
Rich feature hierarchies for accurate object detection and semantic segmentation
GuttmannC.R. et al.
The evolution of multiple sclerosis lesions on serial MR.
American Journal of Neuroradiology
(1995)
ISBI 2015. longitudinal MS lesion segmentation evaluation website, https://smart-stats-tools.org/node/26. (Accessed 3...
KrizhevskyAlex et al.
Imagenet classification with deep convolutional neural networks
LongJonathan et al.
Fully convolutional networks for semantic segmentation

MazziottaJohn et al.

A probabilistic atlas and reference system for the human brain: international consortium for brain mapping (ICBM)

Philosophical Transactions of the Royal Society, Series B (Biological Sciences

(2001)

MechrezRoey et al.

Patch-based segmentation with spatial consistency: application to MS lesions in brain MRI

Journal of Biomedical Imaging

(2016)

Cited by (68)

Automatic polyp segmentation via image-level and surrounding-level context fusion deep neural network
2023, Engineering Applications of Artificial Intelligence
More than 95% of colorectal cancers are gradually transformed from polyps, so regular colonoscopy polyp examination plays an important role in cancer prevention and early treatment. However, automatic polyp segmentation remains a challenging task due to the low-contrast tissue environment and the small size and variety (e.g., shape, color, texture) of polyps. In this case, the rich context information in colonoscopy images is worth exploring to address the above issues. On the one hand, the image-level context with a global receptive field can be used to enhance the discrimination between the foreground and the background to alleviate the occult and indistinguishability of polyps in colonoscopy images. On the other hand, the surrounding-level context focused on the surrounding pathological region of the polyp has more detailed features that are beneficial for polyp segmentation. Therefore, we propose a novel network named ISCNet that aims to fuse image-level and surrounding-level context information for polyp segmentation. Specifically, we first introduce the Global-Guided Context Aggregation (GGCA) module to explicitly model the foreground and background of polyp segmentation through image-level context, thereby flexibly enhancing polyp-related features and suppressing background-related features. Then, we design the Diverse Surrounding Context Focus (DSCF) module to focus on the surrounding area of the polyp to extract diverse local contexts to refine the segmentation results. Finally, we fuse the feature maps derived from these two modules so that our ISCNet can enjoy the facilitation of both the image-level and surrounding-level context information. To verify the effectiveness of our method, we conduct comprehensive experimental evaluations on three challenging datasets. The quantitative and qualitative experimental results confirm that our ISCNet outperforms current state-of-the-art methods by a large margin. Our code is available at https://github.com/vvmedical/ISCNet.
Liver lesion changes analysis in longitudinal CECT scans by simultaneous deep learning voxel classification with SimU-Net
2023, Medical Image Analysis
Citation Excerpt :
These methods assume that the prior lesions segmentations are available as input; none performs lesion changes analysis at the voxel or lesion levels. Birenbaum and Greenspan (2017) describe a method for multiple sclerosis lesion detection and segmentation in pairs of MRI scans. The method detects and segments lesions in each scan separately with a single view CNN and then computes the lesions volume difference.
The identification and quantification of liver lesions changes in longitudinal contrast enhanced CT (CECT) scans is required to evaluate disease status and to determine treatment efficacy in support of clinical decision-making. This paper describes a fully automatic end-to-end pipeline for liver lesion changes analysis in consecutive (prior and current) abdominal CECT scans of oncology patients. The three key novelties are: (1) SimU-Net, a simultaneous multi-channel 3D R2U-Net model trained on pairs of registered scans of each patient that identifies the liver lesions and their changes based on the lesion and healthy tissue appearance differences; (2) a model-based bipartite graph lesions matching method for the analysis of lesion changes at the lesion level; (3) a method for longitudinal analysis of one or more of consecutive scans of a patient based on SimU-Net that handles major liver deformations and incorporates lesion segmentations from previous analysis. To validate our methods, five experimental studies were conducted on a unique dataset of 3491 liver lesions in 735 pairs from 218 clinical abdominal CECT scans of 71 patients with metastatic disease manually delineated by an expert radiologist. The pipeline with the SimU-Net model, trained and validated on 385 pairs and tested on 249 pairs, yields a mean lesion detection recall of 0.86±0.14, a precision of 0.74±0.23 and a lesion segmentation Dice of 0.82±0.14 for lesions > 5 mm. This outperforms a reference standalone 3D R2-UNet mdel that analyzes each scan individually by ∼50% in precision with similar recall and Dice score on the same training and test datasets. For lesions matching, the precision is 0.86±0.18 and the recall is 0.90±0.15. For lesion classification, the specificity is 0.97±0.07, the precision is 0.85±0.31, and the recall is 0.86±0.23. Our new methods provide accurate and comprehensive results that may help reduce radiologists' time and effort and improve radiological oncology evaluation.
Combining multi-view ensemble and surrogate lagrangian relaxation for real-time 3D biomedical image segmentation on the edge
2022, Neurocomputing
Real-time 3D biomedical image segmentation is always preferred considering the exponentially growing medical imaging data for the past decade. Recently deep learning has significantly boosted the performance of automatic medical image segmentation with high computation and memory requirements, especially for 3D biomedical images. Meanwhile, the privacy and security of patient data have always been the primary concern in medical applications among hospitals and clinics, and there also exists some applications which need real-time processing in clinic practice. Thus, 3D biomedical image segmentation is typically required to be performed locally (i.e. on the edge) with limited computation and memory resources. In this paper, we propose to combine multi-view ensemble and Surrogate Lagrangian relaxation (SLR) for real-time 3D biomedical image segmentation on the edge. Instead of directly dealing with 3D biomedical images, our segmentation conducts on the three 2D domains of the 3D images with an ensemble strategy. In addition, Surrogate Lagrangian relaxation is proposed to compress the model to enable high efficiency and real-time processing. Experiments on a typical edge Nvidia GPU show that our method achieves real-time processing which is $1.5 \times$ faster with an improvement of $9 %$ on accuracy compared with single-view models. It also saves $26 \times$ computational resources and $6 \times$ memory resources compared to 3D segmentation models.
Automatic detection of white matter hyperintensities via mask region-based convolutional neural networks using magnetic resonance images
2022, Deep Learning for Medical Applications with Unique Data
Because white matter hyperintensities (WMHs) are associated with many different types of brain disease or disorders, they need to be detected as early as possible. Accurate detection of WMHs occurring in the brain is important for physicians to decide on the appropriate treatment method and to determine the type, location, size, and boundary detection of the pathologic case with high accuracy. This study proposes a mask region-based convolutional neural network method for the automatic detection of WMHs on magnetic resonance (MR) scans. Three datasets, one of which is specific to this study and two of which are given publicly available, are provided for experimental studies. As a result of test set in the study, multiple sclerosis lesions and brain tumors are successfully detected on MR slices with a high mean average precision score of 0.94. In addition, precision and the Dice similarity coefficient have scores as 0.86 and 0.82, respectively.
Realistic image normalization for multi-Domain segmentation
2021, Medical Image Analysis
Citation Excerpt :
A plethora of pre-processing techniques exists to normalize medical images prior to any image analysis. A common approach, described as a standardization (Birenbaum and Greenspan, 2017; Kamnitsas et al., 2017b; Casamitjana et al., 2017; Chen et al., 2018), consists in normalizing each pixel intensity value in an input image by subtracting from it, the image average intensity and dividing it by the standard deviation. However, this simple strategy does not take into account the global statistics of the dataset.
Image normalization is a building block in medical image analysis. Conventional approaches are customarily employed on a per-dataset basis. This strategy, however, prevents the current normalization algorithms from fully exploiting the complex joint information available across multiple datasets. Consequently, ignoring such joint information has a direct impact on the processing of segmentation algorithms. This paper proposes to revisit the conventional image normalization approach by, instead, learning a common normalizing function across multiple datasets. Jointly normalizing multiple datasets is shown to yield consistent normalized images as well as an improved image segmentation when intensity shifts are large. To do so, a fully automated adversarial and task-driven normalization approach is employed as it facilitates the training of realistic and interpretable images while keeping performance on par with the state-of-the-art. The adversarial training of our network aims at finding the optimal transfer function to improve both, jointly, the segmentation accuracy and the generation of realistic images. We have evaluated the performance of our normalizer on both infant and adult brain images from the iSEG, MRBrainS and ABIDE datasets. The results indicate that our contribution does provide an improved realism to the normalized images, while retaining a segmentation accuracy at par with the state-of-the-art learnable normalization approaches.
A machine learning framework for accelerating the design process using CAE simulations: An application to finite element analysis in structural crashworthiness
2021, Computer Methods in Applied Mechanics and Engineering
Citation Excerpt :
Machine learning (ML) within the field of artificial intelligence (AI) has emerged as a solution to cope with the challenges that have arisen during this data science revolution. ML has made substantial developments in the area of human–computer interactions through natural language processing [7,8], financial forecasting [9], and diagnostic tools for medical analysis [10,11]. ML and AI applications in the automotive sector have primarily focused on advanced driver assistance systems, vehicle controls, and autonomous driving systems [12,13].
This paper presents a novel framework for predicting computer-aided engineering (CAE) simulation results using machine learning (ML). The framework is applied to finite element (FE) simulations of dynamic axial crushing of rectangular crush tubes that are typically used in vehicle crashworthiness applications. A virtual design of experiments that varies the size and wall thickness of the FE model is performed to generate the necessary training data. This process generates designs with varying numbers of nodes and elements that are handled by the ML system. However, the explicit design parameters and meshing techniques that were used to generate the training data remain unknown to the ML system. Instead, 3D convolutional neural networks (CNN) autoencoders are used to process the initial FE model data (i.e., nodes, elements, thickness, etc.) to automatically determine these features in an unsupervised manner. A voxelization strategy that operates on the mass of individual nodes is proposed to handle the unstructured nature of the nodes and elements while capturing variations in the wall thickness of the FE models. The flattened latent space generated by the 3D-CNN-autoencoder is then used as input into long-short term memory neural networks (LSTM-NN) to predict the force–displacement response as well as the deformation of the mesh. The training process of both the 3D-CNN-autoencoders and LSTM-NN is systematically studied to highlight the robustness of the framework. The proposed ML system utilizes only 16% of the simulations generated in the virtual design of experiments to achieve good predictive capability. Once trained, the proposed framework can predict the deformation of the mesh and resulting force–displacement response of a new design up to $\sim$ 330 and $\sim$ 2,960,000 times faster, respectively, than the conventional FE approach with good accuracy. This computational speed up offers design engineers and scientists a potential tool for accelerating the design exploration process with CAE simulation tools, such as FE analysis.

View all citing articles on Scopus

View full text

Multi-view longitudinal CNN for multiple sclerosis lesion segmentation

Highlights

Abstract

Introduction

Section snippets

Methods

Evaluation

Experimental results

Discussion and conclusion

Acknowledgment

NeuroImage

Neuroimage

Deep 3D convolutional encoder networks with shortcuts for multiscale feature integration applied to multiple sclerosis lesion segmentation

IEEE Transactions on Medical Imaging

Spatial decision forests for MS lesion segmentation in multi-channel MR images

Rich feature hierarchies for accurate object detection and semantic segmentation

The evolution of multiple sclerosis lesions on serial MR.

American Journal of Neuroradiology

Imagenet classification with deep convolutional neural networks

Fully convolutional networks for semantic segmentation

A probabilistic atlas and reference system for the human brain: international consortium for brain mapping (ICBM)

Philosophical Transactions of the Royal Society, Series B (Biological Sciences

Patch-based segmentation with spatial consistency: application to MS lesions in brain MRI

Journal of Biomedical Imaging