Elsevier

Neurocomputing

Volume 451, 3 September 2021, Pages 1-11
Neurocomputing

A dense connection encoding–decoding convolutional neural network structure for semantic segmentation of thymoma

https://doi.org/10.1016/j.neucom.2021.04.023Get rights and content

Highlights

  • Three-channel pseudo-color images preprocessing method is designed by concatenating different CT windows.

  • A dense skip connection encoding–decoding model (DSC-Net) is proposed to perform automatic segmentation of thymoma base on a deep convolutional neural network.

  • Dense connections are introduced into the architecture of the encoding path across different level feature maps in the DSC-Net.

  • Different level skip connections are designed between the encoding and decoding path in the DSC-Net.

Abstract

Accurately positioning and segmenting thymoma from computed tomography (CT) images is of great importance for an image-driven thymoma analysis. In clinical practice, the diagnosis and segmentation of thymomas for radiologists are time-consuming and inefficient tasks. Thus, it is necessary to develop a method to accurately and efficiently realize automatic segmentation of thymoma. Here, a dense skip connection encoding–decoding model (DSC-Net), which is a deep convolutional neural network, was proposed to perform automatic segmentation of thymoma with the ability to fuse feature maps under receptive fields of different scales. An image preprocessing method was also proposed to provide much more texture information and enhance the contrast between thymoma and its surrounding tissues. A total of 310 subjects who underwent contrast-enhanced CT scanning were included in this ethically-approved retrospective study. All of the CT slices were manually labeled by four experienced radiologists, and 80% of images were included in the training set and the rest were included in the testing set. The performance of segmentation was evaluated by calculating the accuracy, intersection over union (IoU), and Boundary F1 contour matching score (BFScore) between the predicted segmentation and the manual labels. For segmentation of thymoma in the testing set, the accuracy, IoU and BFScore were 92.96%, 87.86% and 0.9087 respectively. Compared to the U-Net method, the DSC-Net model improved IoU by 3.94%. In addition, the efficacy and robustness of DSC-Net in segmentation of different patients and different types of thymoma classified by the WHO histological classification criteria were verified. The proposed preprocessing method and DSC-Net demonstrated improved performance in segmentation of thymomas, suggesting the ability to provide consistent delineation and assist radiologists in their clinical applications.

Introduction

Mediastinal tumors are common clinical chest diseases, of which thymoma is the most common primary anterior mediastinal tumor, accounting for 20% to 40% of all mediastinal tumors in adults [1]. However, thymoma has not been thoroughly studied [1], [2]. Whether the cancerous part can be accurately identified and segmented during the diagnostic process is of great significance for doctors to determine the subsequent treatment strategy [3]. Current technologies often require manual inspections and segmentation by radiologists, which are time-consuming, labor intensive, and prone to error. Therefore, it is urgent to realize the automatic segmentation and diagnosis of thymoma. Computer-aided diagnosis, particularly with the artificial intelligence technique using deep learning, is increasingly being employed in clinical diagnosis [4], [5], [6].

Automatic segmentation methods based on medical images mainly include morphological methods, statistical methods and convolutional neural network (CNN)-based methods. Tang et al. [7] proposed an image segmentation algorithm by improving the traditional threshold method. Hui-Yan et al. [8] combined the regional growth algorithm and OSTU algorithm to effectively segment abdominal MRI images with fuzzy edges. Joan et al. [9] used an iterative regional growth algorithm to effectively segment the chromatin in ovarian cells. Angelina et al. [10] proposed a new region growing and merging algorithm of the medical image segmentation algorithm combined with the genetic algorithm. At the same time, image segmentation methods based on fuzzy clustering and their improved algorithms have been widely used in the medical field, such as the automatic intuitive fuzzy clustering method (FCM) for brain MRI image segmentation [11], and the FCM clustering method with bilateral filtering for medical image segmentation [12].

In recent years, with the improvement of the computing power and the increase in the amount of medical imaging data, the development of automatic segmentation method based on CNN has become a hot research field. U-Net [13] was proposed for the cell segmentation task of electron microscopy images. The structures of encoding–decoding path and skip connection between the encoding and decoding paths have excellent performance in the segmentation of medical images. Thus, U-Net based image segmentation has been further applied to the segmentation of various medical images such as retinal blood vessels [14], lung parenchyma [15] and pancreas [16]. One of the research contents is the improvement of the encoding–decoding structure. One study replaced convolutional blocks in U-Net with a ResNet block to perform segmentation of liver tumors [17]. Li et al. proposed H-Dense U-Net [18], in which dense blocks were used to replace the convolution operation in the encoding path. It solves the difficulty of calculation by training a network with a deep structure. The network can retain low-level spatial features to better explore the contextual information between slices. Another research direction is the skip connections, U-Net++ [19], which replaces skip connections with a series of long and short connections, captures features of different levels, and integrates them through feature concatenation. MultiResUNet [20] added the residual blocks into the encoding path, decoding path and skip connection respectively, and replaced skip connections with respaths, the features of the encoder that perform additional convolution operations before concatenating with the corresponding features in the decoder. These studies aimed to retain the effective features to the greatest extent in the down-sampling process and locate the active area as accurately as possible in the up-sampling process. Among them, residual modules and dense connections play important roles. However, adding residual connections only to the encoding path would not compensate for the lost information during upsampling, and adding dense connections only to skip connections would not retain low-level features during downsampling. Therefore, it is important to retain and recover effective features in both processes of subsampling and upsampling.

Thus, in order to realize fast and accurate segmentation of thymoma, we proposed a method of dense feature connections, designed an encoding–decoding structure based on the U-Net model, and applied the dense feature connection method to both the encoding path and decoding path. Dense connections were introduced into the architecture of the encoding path across different level feature maps, which could compensate for the loss during the down-sampling process. Meanwhile, different level skip connections were designed between the encoding and decoding paths to introduce more abundant feature information from the encoding path to the decoding path by building more effective connections.

In addition, unlike 8-bit natural images, medical images, including CT images, usually have 10–12 bit depth. Direct mapping of the medical images to 8-bit images would result in information loss; thus, we proposed an image preprocessing method. Using different CT windows, three-channel pseudo-color images were designed to provide more texture information and enhance the contrast between thymoma and its surrounding tissues.

Section snippets

Data acquisition

The experimental protocol was approved by the Institutional Review Board of Tangdu Hospital affiliated with the Air Force Medical University in Xi’an, China. About 310 patients (male/female: 149/161; mean age ± standard deviation: 50.23 ± 13.92 years), who underwent contrast-enhanced CT scanning and were confirmed with thymoma before surgery were retrospectively enrolled (the demographic distribution characteristics of the dataset and the information of the World Health Organization (WHO)

Training details

Three hundred and ten patients who were histologically confirmed with thymoma were included in the current study, and the CT images of 247 patients (80%) were randomly selected to constitute the training set, and the remaining 63 patients (20%) constituted the testing set.

During the training, the cross-entropy loss function was chosen as the loss function and the RMSProp Optimizer was chosen for optimization. Throughout the training, 200 epochs were set, and in each epoch, the batch size was

Performance of DSC-Net and image preprocessing method

DSC-Net was trained with the three-channel pseudo-color data set to verify the overall performance of the network structure and image preprocessing method. For the training set, the accuracy was 92.23%, IoU was 88.77% and the BFScore was 0.9435; for the testing set, accuracy was 92.96%, IoU was 87.86% and the BFScore was 0.9087 (Table 2). The three parameters showed a similarity between training and testing, suggesting that the network model was fully trained without over-fitting. On the

Discussion

In the current study, in order to realize the automatic segmentation of thymoma, we proposed a DSC-Net model and an image preprocessing method. This model showed a strong feature learning ability in thorax-enhanced CT images and obtained encouraging performance for the automatic segmentation of thymoma with accuracy = 92.96%, IoU = 87.86%, and BFScore = 0.9087. The speed of 0.04 s per slice of the CT image showed the efficacy of DSC-Net in reducing the radiologists’ workload.

For the

Conclusion

In the current study, we proposed a dense skip connection network structure and a method of image preprocessing to realize the accurate segmentation of thymoma. The dense connections in the encoding path and between the encoding and decoding paths could recover the loss of information in the down-sampling process. The image preprocessing method could effectively convert medical images into three-channel pseudo-color images and provide more information than a single-channel. DSC-Net showed

Funding

This work was supported by the National Natural Science Foundation of Chinaunder Grant Nos. 61431013, 31670828 and 81730016; the Open Funding Project of National Key Laboratory of Human Factors Engineering, Grant No. 6142222190103; National Clinical Research Center for Digestive Diseases, Xi’an, China under Grant No. 2015BAI13B07.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

Jingyuan Li, Wenfang Sun, and Gang Xing designed the structure of the convolutional neural network; Jingyuan Li and Wenfang Sun designed the image preprocessing method; Jingyuan Li, Wenfang Sun, Gang Xing, Yi Zhang, Guangbin Cui, and Karen M. von Deneen designed the experimental process; Xiulong Feng and Wen Wang collected the CT scans; Jingyuan Li, Wenfang Sun, Yi Zhang, and Karen M. von Deneen wrote and revised the article.

Dr. Jingyuan Li received her Bachelor degree in Optoelectronic Information Science and Engineering in 2017 and is pursuing her PhD study major in Information and Communication Engineering at Xidian University. Jingyuan Li’s research direction is radiomics, mainly focusing on the detection and segmentation of lesions, the staging and classification of cancer, and the prognosis of a therapeutic schedule based on deep learning using medical imaging and clinical data. She is committed to applying

References (30)

  • E.M. Marom

    Imaging thymoma

    J. Thorac. Oncol.

    (2010)
  • M. Havaei et al.

    Brain tumor segmentation with deep neural networks

    Med. Image Anal.

    (2017)
  • P. Kaur et al.

    RETRACTED: A robust kernelized intuitionistic fuzzy c-means clustering algorithm in segmentation of noisy medical images

    Pattern Recogn. Lett.

    (2013)
  • E.M. Marom

    Advances in thymoma imaging

    J. Thorac. Imag.

    (2013)
  • M.F.K. Benveniste et al.

    Role of imaging in the diagnosis, staging, and treatment of thymoma

    Radiographics

    (2011)
  • A. Barbu et al.

    Automatic detection and segmentation of lymph nodes from CT data

    IEEE Trans. Med. Imag.

    (2012)
  • S. Wang, Z. M, Z. Liu, Z. Liu, D. Gu, Central focused convolutional neural networks: Developing a data-driven model for...
  • X.D.E.A. Tang

    Fast image segmentation method based on threshold

    Control Decis. Conf.

    (2008)
  • H.Y.S.I. Jiang

    Medical image segmentation based on improved ostu algorithm and regional growth algorithm

    J. Northeastern Univ.

    (2006)
  • H.S. Wu et al.

    Region growing segmentation of chromatin clumps of ovarian cells using adaptive gradients

    J. Imag. Sci. Technol.

    (2004)
  • S. Angelina, L.P. Suresh, S.H.K. Veni, Image segmentation based on genetic algorithm for region growth and region...
  • Y. Liu et al.
  • O. Ronneberger et al.

    U-Net: Convolutional networks for biomedical image segmentation

    Med. Image Comput. Comput. Assist. Intervent. (MICCAI)

    (2015)
  • X. Gao, Y. Cai, C. Qiu, Y. Cui, Retinal blood vessel segmentation based on the Gaussian matched filter and U-net, 10th...
  • B. Ait Skourt, A. El Hassani, A. Majda, Lung CT image segmentation using deep neural networks, Procedia Comput. Sci....
  • Cited by (0)

    Dr. Jingyuan Li received her Bachelor degree in Optoelectronic Information Science and Engineering in 2017 and is pursuing her PhD study major in Information and Communication Engineering at Xidian University. Jingyuan Li’s research direction is radiomics, mainly focusing on the detection and segmentation of lesions, the staging and classification of cancer, and the prognosis of a therapeutic schedule based on deep learning using medical imaging and clinical data. She is committed to applying deep learning methods in clinical applications to reduce stress on medical staff and improve diagnostic efficiency.

    Dr. Wenfang Sun received his MS and PhD degrees from Xidian University, Xi’an, in 2006 and 2008 respectively. He is currently an Associate Professor at Xidian University. His research interests include pattern recognition and intelligent systems, medical image processing, and remote sensing data processing.

    Xiulong Feng received his MS degree at the Air Force Medical University in 2020. He is currently a radiologist in the Department of Radiology, Tangdu Hospital, Air Force Medical University. His research interests include neuroimaging and molecular imaging pathological basis of chest and abdominal diseases.

    Gang Xing received his Bachelor degree in biomedical engineering from Xidian University, Xi’an, China, in 2019 and is pursuing his MS study major in biomedical engineering at Xi’an Jiaotong University, Xi’an, China. His research direction is medical image analysis.

    Dr. Karen M. von Deneen received her Bachelor degree at Morehead State University in 1998, received her MS degree at Oregon State University in 2002 and received her PhD degree at the University of Florida in 2009. She is currently an Associate Professor at the School of Life Science and Technology, Xidian University. Her research direction is the study of the Neural Mechanisms of Acupuncture Treatment on Obesity.

    Dr. Wen Wang serves as the deputy chairman of the Youth Committee of the Radiology Specialty Committee of the Chinese Research Hospital Association, is a member of the Standing Committee of the Integrated Imaging Specialty Committee of the Chinese Medical Doctors Association of the Chinese Medical Doctor Association, and is a member of the Teaching Steering Committee of the Air Force Military Medical University. His research interests include imaging of neurological diseases and medical image processing.

    Dr. Yi Zhang received his Bachelor degree in Electronic Engineering in 2003, and then received his MS and PhD degrees in Pattern Recognition and Intelligent Control in 2006 and 2009 respectively. He completed all of his studies at Xidian University, where he also worked as a Professor since 2015. He is the director of the Center for Brain Imaging at the School of Life Science and Technology at Xidian University. He also worked as a visiting scholar in the Department of Psychiatry at the McKnight Brain Institute of the University of Florida in the United States from 2010-2011. His research interests include medical image processing and analysis and neural mechanism investigation.

    Dr. Guangbin Cui received his MD degree at the Air Force Medical University in 1993. He is currently the Director of Radiology and the Director of the Department of Radiology and Nuclear Medicine at Tangdu Hospital of the Air Force Medical University. He is also the Chief Physician, Professor, and Doctoral Tutor. His research interests include neuroimaging and molecular imaging pathological basis of chest and abdominal diseases.

    1

    Jingyuan Li, Wenfang Sun and Xiulong Feng contributed equally to this work.

    View full text