Predicting Semantic Descriptions from Medical Images with Convolutional Neural Networks

Schlegl, Thomas; Waldstein, Sebastian M.; Vogl, Wolf-Dieter; Schmidt-Erfurth, Ursula; Langs, Georg

doi:10.1007/978-3-319-19992-4_34

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9123)

Included in the following conference series:

International Conference on Information Processing in Medical Imaging

Abstract

Learning representative computational models from medical imaging data requires large training data sets. Often, voxel-level annotation is infeasible for sufficient amounts of data. An alternative to manual annotation is to use the enormous amount of knowledge encoded in imaging data and the corresponding reports generated during clinical routine. Weakly supervised learning approaches can link volume-level labels to image content, but they suffer from the typical label distributions in medical imaging data, where only a small part consists of clinically relevant abnormal structures. In this paper, we propose to use a semantic representation of clinical reports as a learning target that is predicted from imaging data by a convolutional neural network. We demonstrate how we can learn accurate voxel-level classifiers based on weak volume-level semantic descriptions on a set of 157 optical coherence tomography (OCT) volumes. We specifically show how semantic information increases classification accuracy for intraretinal cystoid fluid (IRC), subretinal fluid (SRF), and normal retinal tissue, and how the learning algorithm links semantic concepts to image content and geometry.
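As a rough illustration of what a "semantic representation of clinical reports as a learning target" could look like, the sketch below encodes the concepts mentioned in one report as a multi-hot vector and scores a predicted probability vector against it with binary cross-entropy. The concept vocabulary, the function names, and the choice of loss are assumptions made for this example only; the abstract does not specify the paper's actual encoding or training objective.

```python
import math

# Hypothetical concept vocabulary (an assumption for illustration only,
# not the label set used in the paper).
CONCEPTS = ["IRC", "SRF", "central", "peripheral"]


def encode_report(concepts_in_report):
    """Return a multi-hot target vector over CONCEPTS for one volume-level report."""
    present = set(concepts_in_report)
    unknown = present - set(CONCEPTS)
    if unknown:
        raise ValueError(f"unknown concepts: {sorted(unknown)}")
    return [1.0 if concept in present else 0.0 for concept in CONCEPTS]


def binary_cross_entropy(target, predicted, eps=1e-7):
    """Mean binary cross-entropy between a multi-hot target and predicted probabilities."""
    if len(target) != len(predicted):
        raise ValueError("target and predicted must have the same length")
    total = 0.0
    for t, p in zip(target, predicted):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
    return total / len(target)


# A report mentioning subretinal fluid in a central location.
target = encode_report(["SRF", "central"])
# Stand-in for the output of a network; here an uninformative prediction.
loss = binary_cross_entropy(target, [0.5, 0.5, 0.5, 0.5])
```

In a setup like this, a network would be trained to output one probability per concept for each input, so that the volume-level report supplies the supervision in place of voxel-level annotation.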

T. Schlegl—This work has received funding from the European Union FP7 (KHRESMOI FP7-257528, VISCERAL FP7-318068) and the Austrian Federal Ministry of Science, Research and Economy.

Author information

Authors and Affiliations

Computational Imaging Research Lab, Department of Biomedical Imaging and Image-guided Therapy, Medical University, Vienna, Austria
Thomas Schlegl, Wolf-Dieter Vogl & Georg Langs
Christian Doppler Laboratory for Ophthalmic Image Analysis, Vienna Reading Center, Department of Ophthalmology and Optometry, Medical University Vienna, Vienna, Austria
Sebastian M. Waldstein, Wolf-Dieter Vogl & Ursula Schmidt-Erfurth

Corresponding author

Correspondence to Thomas Schlegl.

Editor information

Editors and Affiliations

Centre for Medical Image Computing, University College London, London, United Kingdom
Sebastien Ourselin
Centre for Medical Image Computing, University College London, London, United Kingdom
Daniel C. Alexander
Dept. of Radiology, Harvard Medical School Brigham and Women's Hospital, Boston, Massachusetts, USA
Carl-Fredrik Westin
Centre for Medical Image Computing, University College London, London, United Kingdom
M. Jorge Cardoso

About this paper

Cite this paper

Schlegl, T., Waldstein, S.M., Vogl, WD., Schmidt-Erfurth, U., Langs, G. (2015). Predicting Semantic Descriptions from Medical Images with Convolutional Neural Networks. In: Ourselin, S., Alexander, D., Westin, CF., Cardoso, M. (eds) Information Processing in Medical Imaging. IPMI 2015. Lecture Notes in Computer Science, vol 9123. Springer, Cham. https://doi.org/10.1007/978-3-319-19992-4_34

DOI: https://doi.org/10.1007/978-3-319-19992-4_34
Published: 23 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19991-7
Online ISBN: 978-3-319-19992-4
eBook Packages: Computer Science, Computer Science (R0)
