3D Deep Learning for Efficient and Robust Landmark Detection in Volumetric Data

Zheng, Yefeng; Liu, David; Georgescu, Bogdan; Nguyen, Hien; Comaniciu, Dorin

doi:10.1007/978-3-319-24553-9_69

Yefeng Zheng¹⁷,
David Liu¹⁷,
Bogdan Georgescu¹⁷,
Hien Nguyen¹⁷ &
…
Dorin Comaniciu¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9349))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

11k Accesses
58 Citations
6 Altmetric

Abstract

Recently, deep learning has demonstrated great success in computer vision with the capability to learn powerful image features from a large training set. However, most of the published work has been confined to solving 2D problems, with a few limited exceptions that treated the 3D space as a composition of 2D orthogonal planes. The challenge of 3D deep learning is due to a much larger input vector, compared to 2D, which dramatically increases the computation time and the chance of over-fitting, especially when combined with limited training samples (hundreds to thousands), typical for medical imaging applications. To address this challenge, we propose an efficient and robust deep learning algorithm capable of full 3D detection in volumetric data. A two-step approach is exploited for efficient detection. A shallow network (with one hidden layer) is used for the initial testing of all voxels to obtain a small number of promising candidates, followed by more accurate classification with a deep network. In addition, we propose two approaches, i.e., separable filter decomposition and network sparsification, to speed up the evaluation of a network. To mitigate the over-fitting issue, thereby increasing detection robustness, we extract small 3D patches from a multi-resolution image pyramid. The deeply learned image features are further combined with Haar wavelet features to increase the detection accuracy. The proposed method has been quantitatively evaluated for carotid artery bifurcation detection on a head-neck CT dataset from 455 patients. Compared to the state-of-the-art, the mean error is reduced by more than half, from 5.97 mm to 2.64 mm, with a detection speed of less than 1 s/volume.

Download to read the full chapter text

Chapter PDF

Robust Landmark Detection in Volumetric Data with Efficient 3D Deep Learning

A Cascade Regression Model for Anatomical Landmark Detection

Dense Volume-to-Volume Vascular Boundary Detection

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Zhan, Y., Dewan, M., Harder, M., Krishnan, A., Zhou, X.S.: Robust automatic knee MR slice positioning through redundant and hierarchical anatomy detection. IEEE Trans. Medical Imaging 30(12), 2087–2100 (2010)
Article Google Scholar
Schwing, A.G., Zheng, Y.: Reliable extraction of the mid-sagittal plane in 3D brain MRI via hierarchical landmark detection. In: Proc. Int’l Sym. Biomedical Imaging, pp. 213–216 (2014)
Google Scholar
Liu, D., Zhou, S., Bernhardt, D., Comaniciu, D.: Vascular landmark detection in 3D CT data. In: Proc. of SPIE Medical Imaging, pp. 1–7 (2011)
Google Scholar
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. The Journal of Machine Learning Research 11, 3371–3408 (2010)
MathSciNet MATH Google Scholar
Prasoon, A., Petersen, K., Igel, C., Lauze, F., Dam, E., Nielsen, M.: Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013, Part II. LNCS, vol. 8150, pp. 246–253. Springer, Heidelberg (2013)
Chapter Google Scholar
Roth, H.R., et al.: A new 2.5D representation for lymph node detection using random sets of deep convolutional neural network observations. In: Golland, P., Hata, N., Barillot, C., Hornegger, J., Howe, R. (eds.) MICCAI 2014, Part I. LNCS, vol. 8673, pp. 520–527. Springer, Heidelberg (2014)
Google Scholar
Rigamonti, R., Sironi, A., Lepetit, V., Fua, P.: Learning separable filters. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2754–2761 (2013)
Google Scholar
Denton, E., Zaremba, W., Bruna, J., LeCun, Y., Fergus, R.: Exploiting linear structure within convolutional networks for efficient evaluation. In: Advances in Neural Information Processing Systems, pp. 1–11 (2014)
Google Scholar
Acar, E., Dunlavy, D.M., Kolda, T.G.: A scalable optimization approach for fitting canonical tensor decompositions. Journal of Chemometrics 25(2), 67–86 (2011)
Article Google Scholar
Tu, Z.: Probabilistic boosting-tree: Learning discriminative methods for classification, recognition, and clustering. In: Proc. Int’l Conf. Computer Vision, pp. 1589–1596 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Imaging and Computer Vision, Siemens Corporate Technology, Princeton, NJ, USA
Yefeng Zheng, David Liu, Bogdan Georgescu, Hien Nguyen & Dorin Comaniciu

Authors

Yefeng Zheng
View author publications
You can also search for this author in PubMed Google Scholar
David Liu
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Georgescu
View author publications
You can also search for this author in PubMed Google Scholar
Hien Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Dorin Comaniciu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yefeng Zheng .

Editor information

Editors and Affiliations

TU München, Garching, Germany
Nassir Navab
Lehrstuhl Informatik 5, University of Erlangen-Nuremberg, Erlangen, Germany
Joachim Hornegger
Brigham and Women's Hospital, Boston, Massachusetts, USA
William M. Wells
University of Sheffield, Sheffield, Suffolk, United Kingdom
Alejandro Frangi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zheng, Y., Liu, D., Georgescu, B., Nguyen, H., Comaniciu, D. (2015). 3D Deep Learning for Efficient and Robust Landmark Detection in Volumetric Data. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds) Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science(), vol 9349. Springer, Cham. https://doi.org/10.1007/978-3-319-24553-9_69

Download citation

DOI: https://doi.org/10.1007/978-3-319-24553-9_69
Published: 18 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24552-2
Online ISBN: 978-3-319-24553-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics