Medical image semantic segmentation based on deep learning

Jiang, Feng; Grigorev, Aleksei; Rho, Seungmin; Tian, Zhihong; Fu, YunSheng; Jifara, Worku; Adil, Khan; Liu, Shaohui

doi:10.1007/s00521-017-3158-6

Medical image semantic segmentation based on deep learning

Neural Computing in Next Generation Virtual Reality Technology
Published: 15 July 2017

Volume 29, pages 1257–1265, (2018)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Feng Jiang¹,
Aleksei Grigorev¹,
Seungmin Rho²,
Zhihong Tian^3,1,
YunSheng Fu³,
Worku Jifara¹,
Khan Adil¹ &
…
Shaohui Liu¹

3763 Accesses
84 Citations
12 Altmetric
Explore all metrics

Abstract

The image semantic segmentation has been extensively studying. The modern methods rely on the deep convolutional neural networks, which can be trained to address this problem. A few years ago networks require the huge dataset to be trained. However, the recent advances in deep learning allow training networks on the small datasets, which is a critical issue for medical images, since the hospitals and research organizations usually do not provide the huge amount of data. In this paper, we address medical image semantic segmentation problem by applying the modern CNN model. Moreover, the recent achievements in deep learning allow processing the whole image per time by applying concepts of the fully convolutional neural network. Our qualitative and quantitate experiment results demonstrated that modern CNN can successfully tackle the medical image semantic segmentation problem.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Review on Progress in Semantic Image Segmentation and Its Application to Medical Images

Article 31 July 2021

A Survey of Semantic Segmentation on Biomedical Images Using Deep Learning

A Study on Deep Learning Models for Medical Image Segmentation

References

Shotton J, Winn J, Rother C Crimininsi A (2006) TextonBoost: joint appearance, shape and context modeling for multi-class object recognition and segmentation. In Proceedings of European conference on computer vision, vol 3951, Chapter 1, pp. 1–15
Jiang J, Trundle P, Ren J (2010) Medical image analysis with artificial neural networks. Comput Med Imaging Graph 34(8):617–631
Article Google Scholar
Zheng S, Jayasumana S, Romera-Paredes B, Vineet V, Su Z, Du D, Huang C, Torr P (2015) Conditional random fields as recurrent neural networks. In: Proceedings of the ICCV, pp 1529–1537
Long J, Shelhamer E, Darrell T (2015) [Slices] fully convolutional networks for semantic segmentation. In: Cvpr 2015
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the NIPS, pp 1–9
Girshick R, Donahue J, Darrell T (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE conference on computer vision pattern recognition, pp 580–587
Yan Z, Zhang H, Jia Y, Breuel T, Yu Y (2016) Combining the best of convolutional layers and recurrent layers: a hybrid network for semantic segmentation. arXiv:1603.04871
Visin F, Ciccone M, Romero A, Kastner K, Kyunghyun C, Bengio Y, Matteucci M, Courville A (2016) ReSeg: a recurrent neural network-based model for semantic segmentation. In IEEE conference on computer vision pattern recognition workshops
Pinheiro PHO, Collobert R (2014) Recurrent convolutional neural networks for scene Labeling. In: Proceedings of the 31st international conference on Machine Learning, pp 82–90
Chen B-W, Wang J-C, Wang J-F (2009) A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Trans Multimedia 11(2):295–312
Article Google Scholar
Chen B-W, Chen C-Y, Wang J-F (2013) Smart homecare surveillance system: behavior identification based on state transition support vector machines and sound directivity pattern analysis. IEEE Trans Syst Man Cybern Syst 43(6):1279–1289
Article Google Scholar
Chen B-W, Tsai A-C, Wang J-F (2009) Structuralized context-aware content and scalable resolution support for wireless VoD services. IEEE Trans Consum Electron 55(2):713–720
Article Google Scholar
Chen L-C, Barron JT Papandreou G Murphy K Yuille AL (2015) Semantic image segmentation with task-specific edge detection using CNNs and a discriminatively trained domain transform. p 12
Gastal ESL, Oliveira MM (2011) Domain transform for edge-aware image and video processing. ACM Trans Graph 30(4):1
Article Google Scholar
Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2014) Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: Iclr, pp 1–14
Ngo TA, Carneiro G (2015) Lung segmentation in chest radiographs using distance regularized level set and deep-structured learning and inference. In: IEEE international conference on image processing (ICIP), pp 2140–2143
Wolf I, Böttger T, Grunewald K, Schöbinger M, Fink C, Risse F, Kauczor HU, Meinzer HP (2007) Implementation and evaluation of a new workflow for registration and segmentation of pulmonary MRI data for regional lung perfusion assessment. Phys Med Biol 52(5):1261–1275
Article Google Scholar
Candemir S, Jaeger S, Palaniappan K, Musco JP, Singh RK, Xue Z, Karargyris A, Antani S, Thoma G, McDonald CJ (2014) Lung segmentation in chest radiographs using anatomical atlases with nonrigid registration. IEEE Trans Med Imaging 33(2):577–590
Article Google Scholar
Chae S-H, Lee J, Won C, Pan SB (2014) Lung segmentation using prediction-based segmentation improvement for chest tomosynthesis. Int J Biosci Biotechnol 6(3):81–90
Google Scholar
Li C, Xu C, Gui C, Fox MD (2010) Distance regularized level set evolution and its application to image segmentation. IEEE Trans Image Process 19(12):3243–3254
Article MathSciNet MATH Google Scholar
Zhang W, Zeng S, Wang D, Xue X (2015) Weakly supervised semantic segmentation for social images. In: Proceedings of the IEEE computer society conference on computer vision pattern recognition, vol 07, 12-June, pp. 2718–2726
Papandreou G, Chen L-C, Murphy KP, Yuille AL (2015) Weakly-and semi-supervised learning of a deep convolutional network for semantic image segmentation. In Proceedings of the ICCV, pp 1742–1750
Vezhnevets A, Buhmann JM (2010) Towards weakly supervised semantic segmentation by means of multiple instance and multitask learning. In: Proceedings of the IEEE computer society conference on computer vision pattern recognition, pp 3249–3256
Xu J, Schwing AG, Urtasun R (2014) Tell me what you see and i will show you where it is. In: 2014 IEEE conference on computer vision pattern recognition (CVPR), pp 3190–3197
Rajchl M, Lee MCH, Oktay O, Kamnitsas K, Passerat-palmbach J, Bai W, Kainz B, Rueckert D (2017) DeepCut: object segmentation from bounding box annotations using convolutional neural networks. IEEE Trans Med Imaging 36(2):674–683
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. ImageNet Chall, pp 1–10. arXiv:1409.1556
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. Proceedings of the IEEE computer society conference computer vision pattern recognition vol 07, 12-June, pp 1–9
Zaremba W, Sutskever I, Vinylas O (2014) Recurrent neural network regularization. Arxiv preprint arXiv:1409.2329
Jia Y, Shelhamer E, Dohanue J et al. (2014) Caffe: convolutional architecture fpr fast feature embedding. arXiv:1408.5093
Sutskever I, Martens J, Dahl GE (2013) On the importance of initialization and momentum in deep learning. In Jwml W&Cp, vol 28, issue 2010, pp 1139–1147
Bottou L (2012) Stochastic gradient descent tricks. In: Montavon G, Orr GB, Müller KR (eds) Neural networks: tricks of the trade. Lecture notes in computer science, vol 7700. Springer, Berlin, Heidelberg

Download references

Acknowledgements

This work is partially funded by the MOE—Microsoft Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, the Major State Basic Research Development Program of China (973 Program 2015CB351804), and the National Natural Science Foundation of China under Grant Nos. 61572155, 61672188, and 61572153. We would also like to acknowledge NVIDIA Corporation who kindly provided two sets of GPU.

Author information

Feng Jiang and Aleksei Grigorev have contributed equally to this work.

Authors and Affiliations

Department Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, China
Feng Jiang, Aleksei Grigorev, Zhihong Tian, Worku Jifara, Khan Adil & Shaohui Liu
Department of Media Software, Sungkyul University, Anyang, Korea
Seungmin Rho
Institute of Computer Application, Chinese Academy of Engineering Physics, Mianyang, 621900, China
Zhihong Tian & YunSheng Fu

Authors

Feng Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Aleksei Grigorev
View author publications
You can also search for this author in PubMed Google Scholar
Seungmin Rho
View author publications
You can also search for this author in PubMed Google Scholar
Zhihong Tian
View author publications
You can also search for this author in PubMed Google Scholar
YunSheng Fu
View author publications
You can also search for this author in PubMed Google Scholar
Worku Jifara
View author publications
You can also search for this author in PubMed Google Scholar
Khan Adil
View author publications
You can also search for this author in PubMed Google Scholar
Shaohui Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Feng Jiang.

Ethics declarations

Conflict of interest

The authors declared that they have no conflicts of interest to this work. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jiang, F., Grigorev, A., Rho, S. et al. Medical image semantic segmentation based on deep learning. Neural Comput & Applic 29, 1257–1265 (2018). https://doi.org/10.1007/s00521-017-3158-6

Download citation

Received: 03 November 2016
Accepted: 07 July 2017
Published: 15 July 2017
Issue Date: March 2018
DOI: https://doi.org/10.1007/s00521-017-3158-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Medical image semantic segmentation based on deep learning

Abstract

Access this article

Similar content being viewed by others

A Review on Progress in Semantic Image Segmentation and Its Application to Medical Images

A Survey of Semantic Segmentation on Biomedical Images Using Deep Learning

A Study on Deep Learning Models for Medical Image Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Medical image semantic segmentation based on deep learning

Abstract

Access this article

Similar content being viewed by others

A Review on Progress in Semantic Image Segmentation and Its Application to Medical Images

A Survey of Semantic Segmentation on Biomedical Images Using Deep Learning

A Study on Deep Learning Models for Medical Image Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation