Direct automated quantitative measurement of spine by cascade amplifier regression network with manifold regularization

doi:10.1016/j.media.2019.04.012

Medical Image Analysis

Volume 55, July 2019, Pages 103-115

https://doi.org/10.1016/j.media.2019.04.012

Highlights

• A novel regression network named CARN is proposed to achieve automated quantitative measurement of the spine, which provides a reliable measurement for the clinical diagnosis and assessment of spinal diseases.
• The local structure-preserved manifold regularization (LSPMR) is proposed to generate discriminative feature embedding, which largely improves the performance of multiple indices estimation.
• The adaptive local shape-constrained manifold regularization (ALSCMR) is proposed to alleviate overfitting. This provides a novel approach for multi-output regression to improve the generalization of the multi-output regression network.

Abstract

Automated quantitative measurement of the spine (i.e., estimation of multiple indices such as the heights, widths, and areas of the vertebral bodies and discs) plays a significant role in the clinical diagnosis and assessment of spinal diseases such as osteoporosis, intervertebral disc degeneration, and lumbar disc herniation. It remains highly challenging, however, because of the variety of spine structures and the high dimensionality of the indices to be estimated. In this paper, we propose a novel cascade amplifier regression network (CARN) with manifold regularization, comprising local structure-preserved manifold regularization (LSPMR) and adaptive local shape-constrained manifold regularization (ALSCMR), to achieve accurate, direct, automated estimation of multiple indices. The CARN architecture is composed of a cascade amplifier network (CAN) for expressive feature embedding and a linear regression model for multiple indices estimation. The CAN produces an expressive feature embedding through cascaded amplifier units (AUs), which reuse features selectively by stimulating effective features and suppressing redundant features as feature maps propagate between adjacent layers. During training, the LSPMR is employed to obtain a discriminative feature embedding by keeping the local geometric structure of the latent feature space similar to that of the target output manifold. The ALSCMR is used to alleviate overfitting and generate realistic estimates by learning the distribution of the multiple indices. Experiments on T1-weighted MR images of 215 subjects and T2-weighted MR images of 20 subjects show that the proposed approach achieves mean absolute errors of 1.22 ± 1.04 mm and 1.24 ± 1.07 mm for the estimation of 30 lumbar spinal indices on the T1-weighted and T2-weighted spinal MR images, respectively. The proposed method has great potential in the clinical diagnosis and assessment of spinal diseases.

Introduction

The quantitative measurement of the spine (i.e., estimation of multiple indices such as the heights, widths, and areas of the vertebral bodies and discs) is a practical means for the clinical diagnosis and assessment of spinal diseases such as osteoporosis, intervertebral disc degeneration, and lumbar disc herniation. Among the indices to be estimated, the vertebral body height (VBH) and intervertebral disc height (IDH) are the most valuable for diagnosing and assessing these diseases. As shown in Fig. 1, the 30 estimated indices for the lumbar spine include 15 VBHs and 15 IDHs: each vertebral body (intervertebral disc) has 3 VBHs (IDHs), namely the anterior, middle, and posterior VBHs (IDHs). In clinical practice, the VBHs can be used to assess the vertebral fracture risk of osteoporotic patients (McCloskey et al., 2012; Tatoń et al., 2014) because the VBHs are correlated with bone strength. Furthermore, the IDH decreases with intervertebral disc degeneration (Jarman et al., 2014; Salamat et al., 2016) and lumbar disc herniation (Tunset et al., 2013).
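The layout of the 30 indices can be made concrete with a short sketch. Only the counts come from the text (15 heights of each kind, 3 per structure, hence 5 vertebral bodies and 5 discs); the labels `VB1` to `VB5` and `IVD1` to `IVD5` and the function `index_names` are hypothetical placeholders, since this excerpt does not name the lumbar levels.

```python
# Hypothetical naming scheme for the 30 lumbar indices.
# Only the counts are taken from the text; the labels are illustrative.
POSITIONS = ("anterior", "middle", "posterior")
N_LEVELS = 5  # 15 heights of each kind / 3 positions per structure


def index_names():
    """List one name per index: 15 VBHs followed by 15 IDHs."""
    names = []
    for kind, prefix in (("VBH", "VB"), ("IDH", "IVD")):
        for level in range(1, N_LEVELS + 1):
            for pos in POSITIONS:
                names.append(f"{prefix}{level}_{pos}_{kind}")
    return names


names = index_names()
print(len(names))  # 30
```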

Automated quantitative measurement of the spine is of significant clinical importance because it is time-saving, reproducible, and more consistent than manual measurement, but it remains an exceedingly difficult task due to the following challenges:

• It is difficult to obtain an expressive feature embedding for such a complex regression problem because of the high dimensionality of the estimated indices (as shown in Fig. 1(a)).
• A discriminative feature embedding is hard to generate because the boundary between the vertebral body (VB) and the intervertebral disc is highly ambiguous in an abnormal spine (as shown in Fig. 1(d)).
• The implicit correlations between different estimated indices are difficult to capture (as shown in Fig. 1(d), the heights of an abnormal disc and of the adjacent vertebral bodies are correlated because disc abnormality leads to simultaneous changes in the IDH and the adjacent VBH).
• The relationship between the spinal images and the estimated indices is complex because of image variability: images with the same estimated indices often look very different owing to inter-subject variations.
• Labeled data are insufficient, which can result in overfitting.

Existing work on multiple indices estimation of the spine falls into three categories: (1) manual measurement; (2) automated segmentation; (3) direct estimation.

Manual measurements quantify the spine by measuring the disc height in vitro (Brinckmann and Grootenboer, 1991), locating spinal landmarks on MRI (Tunset et al., 2013; Videman et al., 2014), or segmenting the discs and vertebral bodies on MRI (Videman et al., 2014). These manual methods are of limited use in clinical practice because they are time-consuming, tedious, nonreproducible, and susceptible to high inter-observer variability.

Automated segmentation-based methods focus on segmenting the intervertebral disc or vertebral body using active shape models (Castro et al., 2012), multi-atlas-based models (Wang and Forsberg, 2016), superpixel-based models (Barbieri et al., 2015), and deep-learning-based models (Korez et al., 2017). Although these methods segment the intervertebral discs and vertebral bodies accurately, the required indices cannot be read directly from the resulting segmentation.

In recent years, an increasing number of approaches have emerged for the direct quantitative measurement of anatomical structures without the need for segmentation. These methods have performed well in quantitative estimation tasks such as cardiac volume (Xue et al., 2017; Zhen et al., 2016; Zhen et al., 2014) and spinal curvature (Wu et al., 2017; Sun et al., 2017). Zhen et al. (2014) used multiple features and regression forests (Multi-features+RF) to jointly estimate the cardiac bi-ventricular volumes. Zhen et al. (2016) adopted a multi-scale convolutional deep belief network to learn an unsupervised cardiac image representation and regression forests (MCDBN+RF) to estimate bi-ventricular volumes. Xue et al. (2017) combined a convolutional neural network (CNN) and a recurrent neural network to exploit both temporal and spatial information for full quantification of the left ventricle. Sun et al. (2017) used the histogram of oriented gradients descriptor (Dalal and Triggs, 2005) and structured support vector regression (HOG+SSVR) to improve spinal curvature assessment by exploiting the intrinsic inter-output correlation under $\ell_{2,1}$-norm regularization and preserving the invariance of the local geometrical structure via manifold regularization.

Although these methods achieved promising performance in quantifying cardiac volume and spinal curvature, they are not suited to quantitative measurement of the spine because of the following limitations. 1) Lack of expressive and discriminative feature representation. Hand-crafted features cannot robustly capture task-aware spinal structures. A traditional CNN (Simonyan and Zisserman, 2014) cannot generate expressive and discriminative features for multiple indices estimation because it may lose effective features in the absence of an explicit structure for feature reuse. 2) Inability to learn the distribution of the estimated indices, which leads to unreasonable estimates and overfitting.

In this study, a cascade amplifier regression network (CARN) with manifold regularization is proposed for quantitative measurement of the spine from MR images. The CARN architecture comprises a cascade amplifier network (CAN) for expressive feature embedding and a linear regression model for multiple indices estimation; the loss function is constructed with manifold regularization, which includes local structure-preserved manifold regularization (LSPMR) and adaptive local shape-constrained manifold regularization (ALSCMR). In the CAN, the amplifier unit (AU) reuses selected features between adjacent layers. As shown in Fig. 2(b), the AU generates the selected feature by stimulating the effective features of the anterior layer and suppressing the redundant ones. The selected feature is then reused in the posterior layer through a concatenation operator. Because the CAN selectively reuses multi-level features to represent the complex spine, it yields an expressive feature embedding. Using the CAN, the MR images are embedded into a latent feature space. The high-dimensional indices lie on a target output manifold because of the correlations between them. To take advantage of the relationship between the latent feature space and the target output manifold, the LSPMR is proposed to generate a discriminative feature embedding that preserves the local geometrical structure of the target output manifold. Additionally, the ALSCMR is designed to restrict the output of the CARN to the target output manifold. As a result, the distribution of the estimated indices is close to the real distribution, which reduces the impact of outliers and alleviates overfitting. With the expressive and discriminative feature embedding produced by the CAN and LSPMR, together with the ALSCMR, a simple linear regression model, i.e., a fully connected network, is sufficient to produce accurate estimates.
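The selective feature reuse described above can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the authors' implementation: the tanh gate, the dense layers standing in for convolutions, and the names `amplifier_unit`, `layer_with_reuse`, `w_gate`, `b_gate`, and `w_layer` are all hypothetical.

```python
import numpy as np


def amplifier_unit(x, w_gate, b_gate):
    """Return a 'selected' feature for input x of shape (n, d).

    Assumption: the gate is tanh of a linear map, so it lies in (-1, 1).
    A positive gate value amplifies a feature, a negative one suppresses it.
    """
    gate = np.tanh(x @ w_gate + b_gate)  # gate, shape (n, d)
    return x + x * gate                  # multiplier, then adder


def layer_with_reuse(x, w_layer, w_gate, b_gate):
    """Concatenate the layer output with the selected input feature."""
    selected = amplifier_unit(x, w_gate, b_gate)
    hidden = np.maximum(x @ w_layer, 0.0)  # dense + ReLU stand-in for a conv layer
    return np.concatenate([hidden, selected], axis=1)


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
w_layer = rng.normal(size=(8, 16))
w_gate = rng.normal(size=(8, 8))
b_gate = np.zeros(8)

out = layer_with_reuse(x, w_layer, w_gate, b_gate)
print(out.shape)  # (4, 24)
```

With this choice of gate, each selected feature equals the input scaled by a factor between 0 and 2, so a zero gate passes the input through unchanged.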

The main contributions are as follows:

• A novel regression network named CARN is proposed to achieve automated quantitative measurement of the spine, which provides a reliable measurement for the clinical diagnosis and assessment of spinal diseases.
• The local structure-preserved manifold regularization (LSPMR) is proposed to generate discriminative feature embedding, which reduces the variability and largely improves the performance of multiple indices estimation.
• The adaptive local shape-constrained manifold regularization (ALSCMR) is proposed to alleviate overfitting. This provides a novel approach for multi-output regression to improve the generalization of the multi-output regression network.

In this work, we advance our preliminary attempt (Pang et al., 2018) on quantitative measurement of the spine in the following aspects:

• The LSPMR is proposed to obtain a discriminative feature embedding, which largely reduces the variability in multi-output regression and therefore achieves accurate multiple indices estimation.
• The robustness of the proposed CARN is validated by extended experiments on a larger dataset that contains 215 T1-weighted images and 20 T2-weighted images.
• The effectiveness of the proposed CARN is validated by comparing its performance with that of relevant machine-learning-based approaches.
• The loss weight of the local shape-constrained manifold regularization is determined adaptively for each sample. A sample with a larger reconstruction error under the local linear representation in the target output manifold is more likely to be an outlier and is therefore given a larger loss weight in the local shape-constrained manifold regularization to alleviate overfitting. As a result, the estimated indices are close to their real distribution.
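The adaptive weighting in the last item can be sketched as follows. This is an illustration rather than the paper's formulation, which this excerpt does not give: it assumes the local linear representation is computed as in locally linear embedding (each target reconstructed from its `k` nearest targets with weights summing to one), that the per-sample loss weight is the reconstruction error divided by its mean, and that the regularizer penalizes the distance between a prediction and the local reconstruction of its target. The names `lle_weights`, `alscmr_loss`, `k`, and `reg` are hypothetical.

```python
import numpy as np


def lle_weights(y, k=3, reg=1e-3):
    """Reconstruct each target from its k nearest targets (weights sum to 1)."""
    n = y.shape[0]
    alpha = np.zeros((n, n))
    err = np.zeros(n)
    dist = np.linalg.norm(y[:, None, :] - y[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)  # a sample is not its own neighbour
    for i in range(n):
        nbrs = np.argsort(dist[i])[:k]
        z = y[nbrs] - y[i]          # neighbours centred on sample i
        gram = z @ z.T
        gram += reg * (np.trace(gram) + 1e-12) * np.eye(k)  # stabilise the solve
        w = np.linalg.solve(gram, np.ones(k))
        w /= w.sum()
        alpha[i, nbrs] = w
        err[i] = np.sum((y[i] - w @ y[nbrs]) ** 2)
    return alpha, err


def alscmr_loss(y_pred, y_true, k=3):
    """Weighted distance between predictions and local target reconstructions."""
    alpha, err = lle_weights(y_true, k)
    sample_w = err / (err.mean() + 1e-12)  # larger error -> larger loss weight
    residual = y_pred - alpha @ y_true
    return float(np.mean(sample_w * np.sum(residual ** 2, axis=1)))


# Five targets on a line plus one off-line target that acts as an outlier.
y_true = np.array([[0, 0], [1, 1], [2, 2], [3, 3], [4, 4], [2, 10]], dtype=float)
alpha, err = lle_weights(y_true)
print(int(np.argmax(err)))  # 5, the off-line sample
```

In this toy example the off-line sample cannot be reconstructed from neighbours that all lie on the line, so it receives the largest weight and its prediction is pulled hardest toward the manifold.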

Section snippets

Cascade amplifier regression network architecture

The proposed CARN architecture achieves automated multiple indices estimation of the spine through an expressive feature embedding obtained by the CAN and a linear regression model. As shown in Fig. 2, the CAN is a network that provides an expressive feature embedding through selective feature reuse using a series of AUs. The AU in the CAN achieves selective feature reuse between adjacent layers using a gate, a multiplier, an adder, and a concatenation operator. The selected feature map is generated by stimulating

Loss function with manifold regularization

The loss function improves the accuracy of spinal indices estimation by combining a preliminary loss loss_p with the LSPMR loss loss_l and the ALSCMR loss loss_a. The preliminary loss is designed to minimize the distance between the estimated indices and the ground truth. As shown in Fig. 3, the LSPMR is employed to achieve a discriminative feature embedding by making the local geometrical structure of the latent feature space match that of the target output manifold. The ALSCMR is aimed
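How the three terms might be combined can be sketched as below. The exact formulas are not available in this excerpt, so everything here is an assumption: loss_p is taken as a mean squared error, loss_l as a Laplacian-eigenmaps-style penalty in which pairs that are close in the target space are pulled together in the latent space, and loss_a is passed in as a precomputed scalar. The names `target_similarity`, `lspmr_loss`, `total_loss`, `sigma`, `lam_l`, and `lam_a` are hypothetical.

```python
import numpy as np


def target_similarity(y, sigma=1.0):
    """Heat-kernel similarity between targets (assumed form), zero diagonal."""
    d2 = np.sum((y[:, None, :] - y[None, :, :]) ** 2, axis=2)
    s = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(s, 0.0)
    return s


def lspmr_loss(h, y, sigma=1.0):
    """Similarity-weighted average of squared latent distances."""
    s = target_similarity(y, sigma)
    d2_h = np.sum((h[:, None, :] - h[None, :, :]) ** 2, axis=2)
    return float(np.sum(s * d2_h) / (np.sum(s) + 1e-12))


def total_loss(y_pred, y_true, h, loss_a=0.0, lam_l=0.1, lam_a=0.1):
    """loss_p + lam_l * loss_l + lam_a * loss_a, with loss_a supplied by the caller."""
    loss_p = float(np.mean((y_pred - y_true) ** 2))
    loss_l = lspmr_loss(h, y_true)
    return loss_p + lam_l * loss_l + lam_a * loss_a


y = np.array([[0.0], [0.1], [5.0], [5.1]])
h_good = y.copy()                              # latent layout mirrors the targets
h_bad = np.array([[0.0], [5.0], [0.1], [5.1]])  # similar targets placed far apart
print(lspmr_loss(h_good, y) < lspmr_loss(h_bad, y))  # True
```

The comparison at the end shows the intended effect: an embedding whose neighbourhoods mirror those of the targets incurs a smaller penalty than one that separates samples with similar targets.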

Datasets

Two datasets are used. 1) The T1-weighted dataset, which includes 215 subjects, was collected from multiple centers and different manufacturers using the following parameters: repetition time (TR) $= 600$ ms; echo time (TE) $= 14$ ms; flip angle (FA) $= 90^{\circ}$. The subjects fall into four clinical groups: 101 patients with lumbar disc herniation (LDH), 18 patients with intervertebral disc degeneration (IDD), 29 patients with lumbar spondylolisthesis (LS), and 67 normal subjects. The

Results and analysis

The performance of the proposed method is evaluated on the T1 and T2 datasets separately because of the differences between T1-weighted and T2-weighted MR images.
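The abstract reports accuracy as a mean absolute error with a spread (for example 1.22 ± 1.04 mm). A figure of that form could be computed as in the sketch below, which assumes the absolute errors are pooled over all subjects and indices and that the spread is the population standard deviation; the excerpt does not state either choice, and the function name `mae_with_std` and the toy values are hypothetical.

```python
import numpy as np


def mae_with_std(y_pred, y_true):
    """Mean and standard deviation of absolute errors, pooled over all entries."""
    abs_err = np.abs(np.asarray(y_pred, dtype=float) - np.asarray(y_true, dtype=float))
    return float(abs_err.mean()), float(abs_err.std())


# Toy values in mm: rows are subjects, columns are indices.
y_true = np.array([[10.0, 12.0], [11.0, 13.0]])
y_pred = np.array([[11.0, 12.0], [11.0, 15.0]])
mae, std = mae_with_std(y_pred, y_true)
print(f"{mae:.2f} ± {std:.2f} mm")  # 0.75 ± 0.83 mm
```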

Conclusion

We have presented an accurate and robust method for automated quantitative measurement of the spine using CARN with manifold regularization. The CAN achieves an expressive feature embedding by reusing selected features. Feature selection is implemented by stimulating effective features and suppressing redundant ones as feature maps propagate between adjacent layers. Whether a feature is effective or redundant is learned automatically during training. The LSPMR enhances the

Acknowledgments

Computations were performed using the data analytics Cloud at SHARCNET (http://www.sharcnet.ca) provided through the Southern Ontario Smart Computing Innovation Platform (SOSCIP); the SOSCIP consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario. Financial support for this work was partly provided by the China Scholarship Council (no. 201708440350), the National Natural Science Foundation of China (no. U1501256), and the Science and

References (40)

E. McCloskey et al. Fracture risk assessment. Clin. Biochem. (2012)
T. Videman et al. Aging changes in lumbar discs and vertebrae and their interaction: a 15-year follow-up study. Spine J. (2014)
C. Wang et al. Segmentation of Intervertebral Discs in 3D MRI Data Using Multi-atlas Based Registration. Lecture Notes in Computer Science (2016)
P.A. Yushkevich et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage (2006)
X. Zhen et al. Multi-scale deep networks and regression forests for direct bi-ventricular volume estimation. Med. Image Anal. (2016)
X. Zhen et al. Direct and simultaneous estimation of cardiac four chamber volumes by multioutput sparse regression. Med. Image Anal. (2017)
P.D. Barbieri et al. Vertebral body segmentation of spine MR images using superpixels. Computer-Based Medical Systems (CBMS), 2015 IEEE 28th International Symposium on (2015)
M. Belkin et al. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. (2003)
P. Brinckmann et al. Change of disc height, radial disc bulge, and intradiscal pressure from discectomy. An in vitro investigation on human lumbar discs. Spine (1991)
I. Castro et al. 3D reconstruction of intervertebral discs from T1-weighted magnetic resonance images. Biomedical Imaging (ISBI), 2012 9th IEEE International Symposium on (2012)
K. Cho et al. Learning phrase representations using RNN encoder–decoder for statistical machine translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014)
N. Dalal et al. Histograms of oriented gradients for human detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) (2005)
K. Hara et al. Growing Regression Forests by Classification: Applications to Object Pose Estimation. Computer Vision – ECCV 2014 (2014)
K. He et al. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
S. Hochreiter et al. Long short-term memory. Neural Comput. (1997)
G. Huang et al. Densely connected convolutional networks. IEEE Conference on Computer Vision and Pattern Recognition (2017)
M. Huang et al. Brain tumor segmentation based on local independent projection-based classification. IEEE Trans. Biomed. Eng. (2014)
J.P. Jarman et al. Intervertebral disc height loss demonstrates the threshold of major pathological changes during degeneration. Eur. Spine J. (2014)
R. Korez et al. Intervertebral disc segmentation in MR images with 3D convolutional networks. Medical Imaging 2017: Image Processing (2017)
W. Liu et al. Large graph construction for scalable semi-supervised learning. Proceedings of the 27th International Conference on Machine Learning (ICML-10) (2010)
