MD-Unet: a deformable network for nasal cavity and paranasal sinus tumor segmentation

Li, Fu-hao; Zhao, Xi-mei

doi:10.1007/s11760-021-02073-3

Original Paper
Published: 15 January 2022

Volume 16, pages 1225–1233 (2022)
Signal, Image and Video Processing

Abstract

In recent years, the rapid development of deep learning has brought breakthrough progress to medical image segmentation, and U-Net has become the most prominent and popular deep network architecture in the field. Despite its excellent overall performance, our experiments on nasal cavity and paranasal sinus tumor datasets showed that the classical U-Net architecture is insensitive to image details and suffers from a separation between local and global consistency. To address these problems, we propose an improved multi-scale neural network based on deformable convolution, called Multi-scale Deformable U-Net (MD-Unet). Because nasal cavity and paranasal sinus tumors exhibit spatial deformation, deformable convolution is used to adapt the receptive field to the shape of the object and to extract features at different scales for fusion, so that the network fully learns image details and improves its feature extraction ability. We also use the Tversky loss function to address sample imbalance in the dataset, obtaining high sensitivity and generalization ability. Experimental results show that the proposed algorithm improves the segmentation accuracy of nasal cavity and paranasal sinus tumors by 5.75%, 3.30%, 1.22% and 0.56% compared with U-Net, Res-Unet, Attention U-Net, and UNet++, respectively.
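
To illustrate how a Tversky loss can counter sample imbalance, the following is a minimal NumPy sketch of the standard formulation, TI = TP / (TP + alpha * FP + beta * FN), with loss = 1 - TI. It is not the authors' implementation: the function name tversky_loss, the weights alpha = 0.3 and beta = 0.7, and the smoothing term eps are assumptions chosen for illustration, since the abstract does not state the values used. Setting beta larger than alpha penalizes missed tumor pixels more than false alarms, which is one way such a loss can favor sensitivity.

```python
import numpy as np


def tversky_loss(y_true, y_pred, alpha=0.3, beta=0.7, eps=1e-7):
    """Tversky loss for a binary mask (illustrative sketch).

    alpha weights false positives, beta weights false negatives.
    The default weights are assumptions, not values from the paper.
    """
    y_true = np.asarray(y_true, dtype=np.float64).ravel()
    y_pred = np.asarray(y_pred, dtype=np.float64).ravel()

    # Soft counts, so y_pred may hold probabilities in [0, 1].
    tp = np.sum(y_true * y_pred)
    fp = np.sum((1.0 - y_true) * y_pred)
    fn = np.sum(y_true * (1.0 - y_pred))

    # eps keeps the ratio defined when the mask and prediction are both empty.
    index = (tp + eps) / (tp + alpha * fp + beta * fn + eps)
    return 1.0 - index


if __name__ == "__main__":
    truth = [1, 1, 0, 0]
    missed_pixel = [1, 0, 0, 0]  # one false negative
    extra_pixel = [1, 1, 1, 0]   # one false positive
    print("loss with a missed tumor pixel:", tversky_loss(truth, missed_pixel))
    print("loss with a false alarm pixel: ", tversky_loss(truth, extra_pixel))
```

With alpha = beta = 0.5 the Tversky index reduces to the Dice coefficient, so the two weights can be read as a tunable trade-off between precision and recall around the Dice baseline.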

Author information

Authors and Affiliations

College of Computer Science and Technology, Qingdao University, Qingdao, 266071, China
Fu-hao Li & Xi-mei Zhao
Shandong Province Key Laboratory of Digital Medicine and Computer Aided Surgery, Qingdao, 266000, China
Xi-mei Zhao

Corresponding author

Correspondence to Fu-hao Li.

About this article

Cite this article

Li, Fh., Zhao, Xm. MD-Unet: a deformable network for nasal cavity and paranasal sinus tumor segmentation. SIViP 16, 1225–1233 (2022). https://doi.org/10.1007/s11760-021-02073-3

Received: 29 April 2021
Revised: 17 October 2021
Accepted: 27 October 2021
Published: 15 January 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s11760-021-02073-3
