Abstract
In recent years, with the rapid development of deep learning, medical image segmentation has brought breakthrough progress. U-Net has become the most prominent and popular deep network architecture in the field. Despite its overall excellent performance in medical image segmentation, we found that the classical U-Net architecture was insensitive to image details and had the problem of local and global consistency separation through experiments on nasal and paranasal sinus tumor datasets. To solve these problems, we propose an improved multi-scale neural network based on deformable convolution, called Multi-scale Deformable U-Net (MD-Unet). According to the spatial deformation characteristics of nasal cavity and paranasal sinus tumors, deformable convolution can be used to adaptively obtain the receiving field according to the shape of objects and extract features of different scales for fusion, so as to fully learn image details and improve feature extraction ability. We also use Tversky loss function to solve the problem of sample imbalance in the dataset, and obtain high sensitivity and generalization ability. Experimental results show that the proposed algorithm can effectively improve the segmentation accuracy of nasal and paranasal sinus tumors by 5.75%, 3.30%, 1.22% and 0.56% compared with U-Net, Res-Unet, Attention U-Net, and UNet++, respectively.











Similar content being viewed by others
References
Li, W.D., Liu, W.J.: Clinical features, pathological classification and influencing factors of tumors in nasal cavity and paranasal sinuses. China Pract. Med. 5, 60–61 (2015)
Ma, Q., Yao, X.J., Qian, B.: Diagnostic value of CT tomography combined with CD24 detection in the early sinus carcinoma. CT Theory Appl. 27(4), 537–542 (2018)
Rouhi, R., Jafari, M., Kasaei, S., et al.: Benign and malignant breast tumors classification based on region growing and cnn segmentation. Exp. Syst. Appl. 42(3), 990–1002 (2015)
Yang, J.F., Qiao, P.R., Li, Y.M., et al.: Review of machine learning classification problems and algorithms research. Stat. Dec. 35(6), 36–40 (2019)
Panigrahi, S., Nanda, A., Swarnkar, T.: Deep learning approach for image classification. In: Proceedings of the 2nd International Conference on Data Science and Business Analytics. IEEE Computer Society, 14(2): 97–101 (2018).
Passera, K., Potepan, P., Setti, E., et al: A fuzzy-C-means clustering algorithm for a volumetric analysis of paranasal sinus and nasal cavity cancers. In: International Conference of the IEEE Engineering in Medicine and Biology Society, New York, pp. 3078–3081 (2006).
Passera, K. M., Potepan, P., Brambilla, L., et al: ITAC volume assessment through a Gaussian hidden Markov random field model-based algorithm. In: International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 1218–1221 (2008).
Yann, L.C., Bottou, L., Bengio, Y.S., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Gao, H., Yao, D., Yang, Y., et al.: Multiscale 3-D-CNN based on spatial-spectral joint feature extraction for hyperspectral remote sensing images classification. J. Elect. Imag. (2020). https://doi.org/10.1117/1.JEI.29.1.013007
Anwar, S.M., Majid, M., Qayyum, A., et al.: Medical image analysis using convolutional neural networks: a review. J. Med. Syst. 42(11), 226–234 (2018)
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: International Conference on Medical image computing and computer-assisted intervention. Springer, New York, pp. 234–241 (2015).
Litjens, G., Kooi, T., Bejnordi, B.E., et al.: A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017)
Xiao, L., Lu, C., Wang, Y.Y., et al.: A primary analysis on CT and MRI features of common malignant sinonasal tumors. J. Pract. Med. 33(06), 986–989 (2017)
Dai, J. F., Qi, H. Z., Xiong, Y. W., et al.: Deformable convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp. 764–773 (2017).
Salehi, S. S. M., Erdogmus, D., Gholipour, A.: Tversky loss function for image segmentation using 3D fully convolutional deep networks. In: International Workshop on Machine Learning in Medical Imaging, Springer, Cham, pp. 379–387 (2017).
Lee, F.K.H., Yeung, D.K.W., King, A.D., et al.: Segmentation of NasoPharyngeal Carcinoma (NPC) lesions in MR images. Int. J. Rad. Oncol. Biol. Phys. 61(2), 608–620 (2005)
Zhou, J., Chan, K.L., Xu, P., et al.: Nasopharyngeal carcinoma lesion segmentation from MR images by support vector machine. In: Proceedings of the 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro. Piscataway, NJ: IEEE, pp. 1364–1367 (2006).
Ritthipravat, P., Tatanun, C., Bhongmakapat, T., et al.: Automatic segmentation of nasopharyngeal carcinoma from CT images. In: Proceedings of the 2008 International Conference on Biomedical Engineering and Informatics. Washington, DC: IEEE Computer Society, pp. 18–22 (2008).
Tatanun, C., Ritthipravat, P., Bhongmakapat, T., et al.: Automatic segmentation of nasopharyngeal carcinoma from CT images: region growing based technique. In: Proceedings of the 2010 2nd International Conference on Signal Processing System. Washington, DC: IEEE Computer Society, pp. 537–541 (2010).
Ibtehaz, N., Rahman, M.S.: MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 121, 74–87 (2019)
Drozdzal, M., Vorontsov, E., Chartrand, G., et al.: The importance of skip connections in biomedical image segmentation. In: Deep Learning and Data Labeling for Medical Applications, Springer, pp. 179–187 (2016).
Lin, T. Y., Dollar, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, pp. 936–944 (2017).
Zhang, W.L., Li, R.J., Deng, H.T., et al.: Deep convolutional neural networks for multi-modality isointense infant brain image segmentation. Neuroimage 108, 214–224 (2015)
Jin, Q., Meng, Z., Pham, T.D., et al.: DUNet: a deformable network for retinal vessel segmentation. Knowl. Based Syst. 178, 149–162 (2019)
Chollet, F., et al.: Keras. GitHub. https://github.com/fchollet/keras (2015).
Abadi, M., Barham, P., Chen, J. M., et al.: Tensorflow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, Savannah, pp. 265–283 (2016).
Kingma, D., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014).
Xiao, X., Lian, S., Luo, Z., et al.: Weighted Res-UNet for high-quality retina vessel segmentation. In: International Conference on Information Technology in Medicine and Education, Hangzhou, pp. 327–331 (2018).
Oktay, O., Schlemper, J., Folgoc, L. L., et al.: Attention U-Net: learning where to look for the pancreas. In: Medical Imaging with Deep Learning, London, pp. 137–142 (2018).
Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., et al.: UNet++: a nested U-Net architecture for medical image segmentation. Lect. Notes Comput. Sci. 11045, 3–11 (2018)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, Fh., Zhao, Xm. MD-Unet: a deformable network for nasal cavity and paranasal sinus tumor segmentation. SIViP 16, 1225–1233 (2022). https://doi.org/10.1007/s11760-021-02073-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-021-02073-3