Skip to main content

Robust Landmark Detection in Volumetric Data with Efficient 3D Deep Learning

  • Chapter
  • First Online:
Deep Learning and Convolutional Neural Networks for Medical Image Computing

Abstract

Recently, deep learning has demonstrated great success in computer vision with the capability to learn powerful image features from a large training set. However, most of the published work has been confined to solving 2D problems, with a few limited exceptions that treated the 3D space as a composition of 2D orthogonal planes. The challenge of 3D deep learning is due to a much larger input vector, compared to 2D, which dramatically increases the computation time and the chance of over-fitting, especially when combined with limited training samples (hundreds to thousands), typical for medical imaging applications. To address this challenge, we propose an efficient and robust deep learning algorithm capable of full 3D detection in volumetric data. A two-step approach is exploited for efficient detection. A shallow network (with one hidden layer) is used for the initial testing of all voxels to obtain a small number of promising candidates, followed by more accurate classification with a deep network. In addition, we propose two approaches, i.e., separable filter decomposition and network sparsification, to speed up the evaluation of a network. To mitigate the over-fitting issue, thereby increasing detection robustness, we extract small 3D patches from a multi-resolution image pyramid. The deeply learned image features are further combined with Haar wavelet-like features to increase the detection accuracy. The proposed method has been quantitatively evaluated for carotid artery bifurcation detection on a head-neck CT dataset from 455 patients. Compared to the state of the art, the mean error is reduced by more than half, from 5.97 mm to 2.64 mm, with a detection speed of less than 1 s/volume.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhan Y, Dewan M, Harder M, Krishnan A, Zhou XS (2011) Robust automatic knee MR slice positioning through redundant and hierarchical anatomy detection. IEEE Trans Med Imag 30(12):2087–2100

    Article  Google Scholar 

  2. Schwing AG, Zheng Y (2014) Reliable extraction of the mid-sagittal plane in 3D brain MRI via hierarchical landmark detection. In: Proceedings of the international symposium on biomedical imaging, pp 213–216

    Google Scholar 

  3. Zheng Y, Tek H, Funka-Lea G, Zhou SK, Vega-Higuera F, Comaniciu D (2011) Efficient detection of native and bypass coronary ostia in cardiac CT volumes: anatomical versus pathological structures. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 403–410

    Google Scholar 

  4. Liu D, Zhou S, Bernhardt D, Comaniciu D (2011) Vascular landmark detection in 3D CT data. In: Proceedings of the SPIE medical imaging, pp 1–7

    Google Scholar 

  5. Zheng Y, Lu X, Georgescu B, Littmann A, Mueller E, Comaniciu D (2009) Robust object detection using marginal space learning and ranking-based multi-detector aggregation: application to automatic left ventricle detection in 2D MRI images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1343–1350

    Google Scholar 

  6. Zheng Y, John M, Liao R, Nottling A, Boese J, Kempfert J, Walther T, Brockmann G, Comaniciu D (2012) Automatic aorta segmentation and valve landmark detection in C-arm CT for transcatheter aortic valve implantation. IEEE Trans Med Imaging 31(12):2307–2321

    Article  Google Scholar 

  7. Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11:3371–3408

    MathSciNet  MATH  Google Scholar 

  8. Yan Z, Zhan Y, Peng Z, Liao S, Shinagawa Y, Metaxas DN, Zhou, XS (2015) Bodypart recognition using multi-stage deep learning. In: Proceedings of the information processing in medical imaging, pp 449–461

    Google Scholar 

  9. Liu F, Yang L (2015) A novel cell detection method using deep convolutional neural network and maximum-weight independent set. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 349–357

    Google Scholar 

  10. Roth HR, Lu L., Seff A, Cherry KM, Hoffman J, Wang S, Liu J, Turkbey E, Summers RM (2014) A new 2.5D representation for lymph node detection using random sets of deep convolutional neural network observations. In: Proceedings of the international conference on medical image computing and computer assisted intervention, pp 520–527

    Google Scholar 

  11. Carneiro G, Nascimento JC, Freitas A (2012) The segmentation of the left ventricle of the heart from ultrasound data using deep learning architectures and derivative-based search methods. IEEE Trans Image Process 21(3):968–982

    Article  MathSciNet  Google Scholar 

  12. Ghesu FC, Krubasik E, Georgescu B, Singh V, Zheng Y, Hornegger J, Comaniciu D (2016) Marginal space deep learning: efficient architecture for volumetric image parsing. IEEE Trans Med Imag 35(5):1217–1228

    Article  Google Scholar 

  13. Cheng X, Zhang L, Zheng Y (2016) Deep similarity learning for multimodal medical images. Comput Methods Biomech Biomed Eng Imaging Vis 4:1–5

    Google Scholar 

  14. Miao S, Wang ZJ, Zheng Y, Liao R (2016) Real-time 2D/3D registration via CNN regression. In: Proceedings of the IEEE international symposium on biomedical imaging, pp 1–4

    Google Scholar 

  15. Prasoon A, Petersen K, Igel C, Lauze F, Dam E, Nielsen M (2013) Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network. In: Proceedings of the international conference on medical image computing and computer assisted intervention, vol 8150, pp 246–253

    Google Scholar 

  16. Rigamonti R, Sironi A, Lepetit V, Fua P (2013) Learning separable filters. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2754–2761

    Google Scholar 

  17. Denton E, Zaremba W, Bruna J, LeCun Y, Fergus R (2014) Exploiting linear structure within convolutional networks for efficient evaluation. In: Advances in neural information processing systems, pp 1–11

    Google Scholar 

  18. Zheng Y, Liu D, Georgescu B, Nguyen H, Comaniciu D (2015) 3D deep learning for efficient and robust landmark detection in volumetric data. In: Proceedings of the international conference medical image computing and computer assisted intervention, pp 565–572

    Google Scholar 

  19. Acar E, Dunlavy DM, Kolda TG (2011) A scalable optimization approach for fitting canonical tensor decompositions. J Chemom 25(2):67–86

    Article  Google Scholar 

  20. Tu Z (2005) Probabilistic boosting-tree: learning discriminative methods for classification, recognition, and clustering. In: Proceedings of the international conference on computer vision, pp 1589–1596

    Google Scholar 

  21. Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139

    Article  MathSciNet  MATH  Google Scholar 

  22. Liu D, Zhou S, Bernhardt D, Comaniciu D (2010) Search strategies for multiple landmark detection by submodular maximization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2831–2838

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yefeng Zheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Zheng, Y., Liu, D., Georgescu, B., Nguyen, H., Comaniciu, D. (2017). Robust Landmark Detection in Volumetric Data with Efficient 3D Deep Learning. In: Lu, L., Zheng, Y., Carneiro, G., Yang, L. (eds) Deep Learning and Convolutional Neural Networks for Medical Image Computing. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-42999-1_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42999-1_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42998-4

  • Online ISBN: 978-3-319-42999-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics