Skip to main content
Log in

MD-Unet: a deformable network for nasal cavity and paranasal sinus tumor segmentation

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

In recent years, with the rapid development of deep learning, medical image segmentation has brought breakthrough progress. U-Net has become the most prominent and popular deep network architecture in the field. Despite its overall excellent performance in medical image segmentation, we found that the classical U-Net architecture was insensitive to image details and had the problem of local and global consistency separation through experiments on nasal and paranasal sinus tumor datasets. To solve these problems, we propose an improved multi-scale neural network based on deformable convolution, called Multi-scale Deformable U-Net (MD-Unet). According to the spatial deformation characteristics of nasal cavity and paranasal sinus tumors, deformable convolution can be used to adaptively obtain the receiving field according to the shape of objects and extract features of different scales for fusion, so as to fully learn image details and improve feature extraction ability. We also use Tversky loss function to solve the problem of sample imbalance in the dataset, and obtain high sensitivity and generalization ability. Experimental results show that the proposed algorithm can effectively improve the segmentation accuracy of nasal and paranasal sinus tumors by 5.75%, 3.30%, 1.22% and 0.56% compared with U-Net, Res-Unet, Attention U-Net, and UNet++, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Li, W.D., Liu, W.J.: Clinical features, pathological classification and influencing factors of tumors in nasal cavity and paranasal sinuses. China Pract. Med. 5, 60–61 (2015)

    Google Scholar 

  2. Ma, Q., Yao, X.J., Qian, B.: Diagnostic value of CT tomography combined with CD24 detection in the early sinus carcinoma. CT Theory Appl. 27(4), 537–542 (2018)

    Google Scholar 

  3. Rouhi, R., Jafari, M., Kasaei, S., et al.: Benign and malignant breast tumors classification based on region growing and cnn segmentation. Exp. Syst. Appl. 42(3), 990–1002 (2015)

    Article  Google Scholar 

  4. Yang, J.F., Qiao, P.R., Li, Y.M., et al.: Review of machine learning classification problems and algorithms research. Stat. Dec. 35(6), 36–40 (2019)

    Google Scholar 

  5. Panigrahi, S., Nanda, A., Swarnkar, T.: Deep learning approach for image classification. In: Proceedings of the 2nd International Conference on Data Science and Business Analytics. IEEE Computer Society, 14(2): 97–101 (2018).

  6. Passera, K., Potepan, P., Setti, E., et al: A fuzzy-C-means clustering algorithm for a volumetric analysis of paranasal sinus and nasal cavity cancers. In: International Conference of the IEEE Engineering in Medicine and Biology Society, New York, pp. 3078–3081 (2006).

  7. Passera, K. M., Potepan, P., Brambilla, L., et al: ITAC volume assessment through a Gaussian hidden Markov random field model-based algorithm. In: International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 1218–1221 (2008).

  8. Yann, L.C., Bottou, L., Bengio, Y.S., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  9. Gao, H., Yao, D., Yang, Y., et al.: Multiscale 3-D-CNN based on spatial-spectral joint feature extraction for hyperspectral remote sensing images classification. J. Elect. Imag. (2020). https://doi.org/10.1117/1.JEI.29.1.013007

    Article  Google Scholar 

  10. Anwar, S.M., Majid, M., Qayyum, A., et al.: Medical image analysis using convolutional neural networks: a review. J. Med. Syst. 42(11), 226–234 (2018)

    Article  Google Scholar 

  11. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: International Conference on Medical image computing and computer-assisted intervention. Springer, New York, pp. 234–241 (2015).

  12. Litjens, G., Kooi, T., Bejnordi, B.E., et al.: A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017)

    Article  Google Scholar 

  13. Xiao, L., Lu, C., Wang, Y.Y., et al.: A primary analysis on CT and MRI features of common malignant sinonasal tumors. J. Pract. Med. 33(06), 986–989 (2017)

    Google Scholar 

  14. Dai, J. F., Qi, H. Z., Xiong, Y. W., et al.: Deformable convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp. 764–773 (2017).

  15. Salehi, S. S. M., Erdogmus, D., Gholipour, A.: Tversky loss function for image segmentation using 3D fully convolutional deep networks. In: International Workshop on Machine Learning in Medical Imaging, Springer, Cham, pp. 379–387 (2017).

  16. Lee, F.K.H., Yeung, D.K.W., King, A.D., et al.: Segmentation of NasoPharyngeal Carcinoma (NPC) lesions in MR images. Int. J. Rad. Oncol. Biol. Phys. 61(2), 608–620 (2005)

    Article  Google Scholar 

  17. Zhou, J., Chan, K.L., Xu, P., et al.: Nasopharyngeal carcinoma lesion segmentation from MR images by support vector machine. In: Proceedings of the 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro. Piscataway, NJ: IEEE, pp. 1364–1367 (2006).

  18. Ritthipravat, P., Tatanun, C., Bhongmakapat, T., et al.: Automatic segmentation of nasopharyngeal carcinoma from CT images. In: Proceedings of the 2008 International Conference on Biomedical Engineering and Informatics. Washington, DC: IEEE Computer Society, pp. 18–22 (2008).

  19. Tatanun, C., Ritthipravat, P., Bhongmakapat, T., et al.: Automatic segmentation of nasopharyngeal carcinoma from CT images: region growing based technique. In: Proceedings of the 2010 2nd International Conference on Signal Processing System. Washington, DC: IEEE Computer Society, pp. 537–541 (2010).

  20. Ibtehaz, N., Rahman, M.S.: MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 121, 74–87 (2019)

    Article  Google Scholar 

  21. Drozdzal, M., Vorontsov, E., Chartrand, G., et al.: The importance of skip connections in biomedical image segmentation. In: Deep Learning and Data Labeling for Medical Applications, Springer, pp. 179–187 (2016).

  22. Lin, T. Y., Dollar, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, pp. 936–944 (2017).

  23. Zhang, W.L., Li, R.J., Deng, H.T., et al.: Deep convolutional neural networks for multi-modality isointense infant brain image segmentation. Neuroimage 108, 214–224 (2015)

    Article  Google Scholar 

  24. Jin, Q., Meng, Z., Pham, T.D., et al.: DUNet: a deformable network for retinal vessel segmentation. Knowl. Based Syst. 178, 149–162 (2019)

    Article  Google Scholar 

  25. Chollet, F., et al.: Keras. GitHub. https://github.com/fchollet/keras (2015).

  26. Abadi, M., Barham, P., Chen, J. M., et al.: Tensorflow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, Savannah, pp. 265–283 (2016).

  27. Kingma, D., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014).

  28. Xiao, X., Lian, S., Luo, Z., et al.: Weighted Res-UNet for high-quality retina vessel segmentation. In: International Conference on Information Technology in Medicine and Education, Hangzhou, pp. 327–331 (2018).

  29. Oktay, O., Schlemper, J., Folgoc, L. L., et al.: Attention U-Net: learning where to look for the pancreas. In: Medical Imaging with Deep Learning, London, pp. 137–142 (2018).

  30. Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., et al.: UNet++: a nested U-Net architecture for medical image segmentation. Lect. Notes Comput. Sci. 11045, 3–11 (2018)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fu-hao Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Fh., Zhao, Xm. MD-Unet: a deformable network for nasal cavity and paranasal sinus tumor segmentation. SIViP 16, 1225–1233 (2022). https://doi.org/10.1007/s11760-021-02073-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-021-02073-3

Keywords

Navigation