SHAMANN: Shared Memory Augmented Neural Networks

Bercea, Cosmin I.; Pauly, Olivier; Maier, Andreas; Ghesu, Florin C.

doi:10.1007/978-3-030-20351-1_65

Cosmin I. Bercea²⁰,
Olivier Pauly²¹,
Andreas Maier¹⁹ &
…
Florin C. Ghesu^18,19

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11492))

Included in the following conference series:

International Conference on Information Processing in Medical Imaging

5463 Accesses
2 Citations

Abstract

Current state-of-the-art methods for semantic segmentation use deep neural networks to learn the segmentation mask from the input image signal as an image-to-image mapping. While these methods effectively exploit global image context, the learning and computational complexities are high. We propose shared memory augmented neural network actors as a dynamically scalable alternative. Based on a decomposition of the image into a sequence of local patches, we train such actors to sequentially segment each patch. To further increase the robustness and better capture shape priors, an external memory module is shared between different actors, providing an implicit mechanism for image information exchange. Finally, the patch-wise predictions are aggregated to a complete segmentation mask. We demonstrate the benefits of the new paradigm on a challenging lung segmentation problem based on X-Ray images, as well as on two synthetic tasks based on MNIST. On the X-Ray data, our method achieves state-of-the-art accuracy with a significantly reduced model size, 3–5 times compared to reference methods. In addition, we reduce the number of failure cases by at least half.

C. I. Bercea and O. Pauly—Contributed to this work during their time at Siemens Healthineers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2481–2495 (2017)
Article Google Scholar
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR abs/1409.0473 (2014)
Google Scholar
Ghesu, F.C., et al.: Marginal space deep learning: efficient architecture for volumetric image parsing. IEEE Trans. Med. Imaging 35(5), 1217–1228 (2016)
Article Google Scholar
Glocker, B., Pauly, O., Konukoglu, E., Criminisi, A.: Joint classification-regression forests for spatially structured multi-object segmentation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7575, pp. 870–881. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33765-9_62
Chapter Google Scholar
Graves, A., et al.: Hybrid computing using a neural network with dynamic external memory. Nature 538(7626), 471–476 (2016)
Article Google Scholar
He, K., Gkioxari, G., Dollár, P., Girshick, R.B.: Mask R-CNN. In: Proceedings of the International Conference on Computer Vision, pp. 2980–2988. IEEE (2017)
Google Scholar
Heimann, T., Meinzer, H.P.: Statistical shape models for 3D medical image segmentation: a review. Med. Image Anal. 13(4), 543–563 (2009)
Article Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Iglesias, J.E., Sabuncu, M.R.: Multi-atlas segmentation of biomedical images: a survey. Med. Image Anal. 24(1), 205–219 (2015)
Article Google Scholar
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Milletari, F., et al.: Hough-CNN: deep learning for segmentation of deep brain regions in MRI and ultrasound. Comput. Vis. Image Underst. 164, 92–102 (2017)
Article Google Scholar
van den Oord, A., Kalchbrenner, N., Kavukcuoglu, K.: Pixel recurrent neural networks. In: Proceedings of the International Conference on Machine Learning, vol. 48, pp. 1747–1756 (2016)
Google Scholar
Pritzel, A., et al.: Neural episodic control. In: Proceedings of the International Conference on Machine Learning, vol. 70, pp. 2827–2836 (2017)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(4), 640–651 (2017)
Article Google Scholar
Sprechmann, P., et al.: Memory-based parameter adaptation. In: International Conference on Learning Representations (2018)
Google Scholar
Sukhbaatar, S., Weston, J., Fergus, R.: End-to-end memory networks. In: Advances in Neural Information Processing Systems, pp. 2440–2448. Curran Associates, Inc. (2015)
Google Scholar
Vinyals, O., Blundell, C., Lillicrap, T., Kavukcuoglu, K., Wierstra, D.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems, pp. 3630–3638. Curran Associates, Inc. (2016)
Google Scholar
Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2015)
Google Scholar
Visin, F., et al.: ReSeg: a recurrent neural network-based model for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 426–433 (2016)
Google Scholar
Wang, H., Suh, J.W., Das, S.R., Pluta, J., Craige, C., Yushkevich, P.A.: Multi-atlas segmentation with joint label fusion. IEEE Trans. Pattern Anal. Mach. Intell. 35(3), 611–623 (2013)
Article Google Scholar
Wang, M., Lu, Z., Li, H., Liu, Q.: Memory-enhanced decoder for neural machine translation. In: Empirical Methods in Natural Language Processing (2016)
Google Scholar
Wang, X., Peng, Y., Lu, L., Lu, Z., Bagheri, M., Summers, R.M.: ChestX-Ray8: hospital-scale chest X-Ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3462–3471 (2017)
Google Scholar
Werbos, P.J.: Backpropagation through time: what it does and how to do it. Proc. IEEE 78(10), 1550–1560 (1990)
Article Google Scholar
Yang, D., et al.: Automatic liver segmentation using an adversarial image-to-image network. In: Descoteaux, M., Maier-Hein, L., Franz, A., Jannin, P., Collins, D.L., Duchesne, S. (eds.) MICCAI 2017. LNCS, vol. 10435, pp. 507–515. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66179-7_58
Chapter Google Scholar
Yu, F., Koltun, V., Funkhouser, T.A.: Dilated residual networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 636–644 (2017)
Google Scholar
Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6230–6239 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Digital Technology and Innovation, Siemens Healthineers, Erlangen, Germany
Florin C. Ghesu
Pattern Recognition Lab, Friedrich-Alexander-Universität, Erlangen, Germany
Andreas Maier & Florin C. Ghesu
Robert Bosch GmbH, Corporate Research–Computer Vision, Hildesheim, Germany
Cosmin I. Bercea
Google Cloud AI, Munich, Germany
Olivier Pauly

Authors

Cosmin I. Bercea
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Pauly
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Maier
View author publications
You can also search for this author in PubMed Google Scholar
Florin C. Ghesu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Florin C. Ghesu .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong, China
Albert C. S. Chung
Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
James C. Gee
Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
Paul A. Yushkevich
Department of Natural Language Processing, Baidu Inc., Shenzhen, China
Siqi Bao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bercea, C.I., Pauly, O., Maier, A., Ghesu, F.C. (2019). SHAMANN: Shared Memory Augmented Neural Networks. In: Chung, A., Gee, J., Yushkevich, P., Bao, S. (eds) Information Processing in Medical Imaging. IPMI 2019. Lecture Notes in Computer Science(), vol 11492. Springer, Cham. https://doi.org/10.1007/978-3-030-20351-1_65

Download citation

DOI: https://doi.org/10.1007/978-3-030-20351-1_65
Published: 22 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-20350-4
Online ISBN: 978-3-030-20351-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics