Abstract
Recently, deep learning has demonstrated great success in computer vision with the capability to learn powerful image features from a large training set. However, most of the published work has been confined to solving 2D problems, with a few limited exceptions that treated the 3D space as a composition of 2D orthogonal planes. The challenge of 3D deep learning is due to a much larger input vector, compared to 2D, which dramatically increases the computation time and the chance of over-fitting, especially when combined with limited training samples (hundreds to thousands), typical for medical imaging applications. To address this challenge, we propose an efficient and robust deep learning algorithm capable of full 3D detection in volumetric data. A two-step approach is exploited for efficient detection. A shallow network (with one hidden layer) is used for the initial testing of all voxels to obtain a small number of promising candidates, followed by more accurate classification with a deep network. In addition, we propose two approaches, i.e., separable filter decomposition and network sparsification, to speed up the evaluation of a network. To mitigate the over-fitting issue, thereby increasing detection robustness, we extract small 3D patches from a multi-resolution image pyramid. The deeply learned image features are further combined with Haar wavelet-like features to increase the detection accuracy. The proposed method has been quantitatively evaluated for carotid artery bifurcation detection on a head-neck CT dataset from 455 patients. Compared to the state of the art, the mean error is reduced by more than half, from 5.97 mm to 2.64 mm, with a detection speed of less than 1 s/volume.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhan Y, Dewan M, Harder M, Krishnan A, Zhou XS (2011) Robust automatic knee MR slice positioning through redundant and hierarchical anatomy detection. IEEE Trans Med Imag 30(12):2087–2100
Schwing AG, Zheng Y (2014) Reliable extraction of the mid-sagittal plane in 3D brain MRI via hierarchical landmark detection. In: Proceedings of the international symposium on biomedical imaging, pp 213–216
Zheng Y, Tek H, Funka-Lea G, Zhou SK, Vega-Higuera F, Comaniciu D (2011) Efficient detection of native and bypass coronary ostia in cardiac CT volumes: anatomical versus pathological structures. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 403–410
Liu D, Zhou S, Bernhardt D, Comaniciu D (2011) Vascular landmark detection in 3D CT data. In: Proceedings of the SPIE medical imaging, pp 1–7
Zheng Y, Lu X, Georgescu B, Littmann A, Mueller E, Comaniciu D (2009) Robust object detection using marginal space learning and ranking-based multi-detector aggregation: application to automatic left ventricle detection in 2D MRI images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1343–1350
Zheng Y, John M, Liao R, Nottling A, Boese J, Kempfert J, Walther T, Brockmann G, Comaniciu D (2012) Automatic aorta segmentation and valve landmark detection in C-arm CT for transcatheter aortic valve implantation. IEEE Trans Med Imaging 31(12):2307–2321
Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11:3371–3408
Yan Z, Zhan Y, Peng Z, Liao S, Shinagawa Y, Metaxas DN, Zhou, XS (2015) Bodypart recognition using multi-stage deep learning. In: Proceedings of the information processing in medical imaging, pp 449–461
Liu F, Yang L (2015) A novel cell detection method using deep convolutional neural network and maximum-weight independent set. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 349–357
Roth HR, Lu L., Seff A, Cherry KM, Hoffman J, Wang S, Liu J, Turkbey E, Summers RM (2014) A new 2.5D representation for lymph node detection using random sets of deep convolutional neural network observations. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 520–527
Carneiro G, Nascimento JC, Freitas A (2012) The segmentation of the left ventricle of the heart from ultrasound data using deep learning architectures and derivative-based search methods. IEEE Trans Image Process 21(3):968–982
Ghesu FC, Krubasik E, Georgescu B, Singh V, Zheng Y, Hornegger J, Comaniciu D (2016) Marginal space deep learning: efficient architecture for volumetric image parsing. IEEE Trans Med Imag 35(5):1217–1228
Cheng X, Zhang L, Zheng Y (2016) Deep similarity learning for multimodal medical images. Comput Methods Biomech Biomed Eng Imaging Vis 4:1–5
Miao S, Wang ZJ, Zheng Y, Liao R (2016) Real-time 2D/3D registration via CNN regression. In: Proceedings of the IEEE international symposium on biomedical imaging, pp 1–4
Prasoon A, Petersen K, Igel C, Lauze F, Dam E, Nielsen M (2013) Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network. In: Proceedings of the international conference on medical image computing and computer assisted intervention, vol 8150, pp 246–253
Rigamonti R, Sironi A, Lepetit V, Fua P (2013) Learning separable filters. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2754–2761
Denton E, Zaremba W, Bruna J, LeCun Y, Fergus R (2014) Exploiting linear structure within convolutional networks for efficient evaluation. In: Advances in neural information processing systems, pp 1–11
Zheng Y, Liu D, Georgescu B, Nguyen H, Comaniciu D (2015) 3D deep learning for efficient and robust landmark detection in volumetric data. In: Proceedings of the international conference medical image computing and computer assisted intervention, pp 565–572
Acar E, Dunlavy DM, Kolda TG (2011) A scalable optimization approach for fitting canonical tensor decompositions. J Chemom 25(2):67–86
Tu Z (2005) Probabilistic boosting-tree: learning discriminative methods for classification, recognition, and clustering. In: Proceedings of the international conference on computer vision, pp 1589–1596
Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139
Liu D, Zhou S, Bernhardt D, Comaniciu D (2010) Search strategies for multiple landmark detection by submodular maximization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2831–2838
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Zheng, Y., Liu, D., Georgescu, B., Nguyen, H., Comaniciu, D. (2017). Robust Landmark Detection in Volumetric Data with Efficient 3D Deep Learning. In: Lu, L., Zheng, Y., Carneiro, G., Yang, L. (eds) Deep Learning and Convolutional Neural Networks for Medical Image Computing. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-42999-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-42999-1_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42998-4
Online ISBN: 978-3-319-42999-1
eBook Packages: Computer ScienceComputer Science (R0)