Abstract
Biomedical image segmentation plays an increasingly important role in medical diagnosis. However, it remains a challenging task to segment the medical images due to their diversity of structures. Convolutional networks (CNNs) commonly uses pooling to enlarge the receptive field, which usually results in irreversible information loss. In order to solve this problem, we rethink the alternative method of pooling operation. In this paper, we embed the wavelet transform into the U-Net architecture to achieve the purpose of down-sampling and up-sampling which we called wavelet U-Net (WU-Net). Specifically, in the encoder module, we use discrete wavelet transform (DWT) to replace the pooling operation to reduce the resolution of the image, and use inverse wavelet transform (IWT) to gradually restore the resolution in the decoder module. Besides, we use Densely Cross-level Connection strategy to encourage feature re-use and to enhance the complementarity between cross-level information. Furthermore, in Attention Feature Fusion module (AFF), we introduce the channel attention mechanism to select useful feature maps, which can effectively improve the segmentation performance of the network. We evaluated this model on the digital retinal images for vessel extraction (DRIVE) dataset and the child heart and health study (CHASEDB1) dataset. The results show that the proposed method outperforms the classic U-Net method and other state-of-the-art methods on both datasets.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alom, M.Z., Hasan, M., Yakopcic, C., Taha, T.M., Asari, V.K.: Recurrent residual convolutional neural network based on U-net(R2U-Net) for medical image segmentation (2018)
Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: a deep convolutional encoder-decoder architecture for scene segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017). https://doi.org/10.1109/TPAMI.2016.2644615
Chaabane, C.B., Mellouli, D., Hamdani, T.M., Alimi, A.M., Abraham, A.: Wavelet convolutional neural networks for handwritten digits recognition. In: Abraham, A., Muhuri, P.K., Muda, A.K., Gandhi, N. (eds.) HIS 2017. AISC, vol. 734, pp. 305–310. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76351-4_31
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2017). https://doi.org/10.1109/TPAMI.2017.2699184
Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. Computer Vision and Pattern Recognition (2017)
Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European Conference on Computer Vision, pp. 801–818 (2018)
Chui, C.K.: An Introduction to Wavelets. Academic Press, New York (1992)
Fujieda, S., Takayama, K., Hachisuka, T.: Wavelet convolutional neural networks for texture classification (2017)
Fujieda, S., Takayama, K., Hachisuka, T.: Wavelet convolutional neural networks (2018)
Mallat, M.S.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 11, 674–693 (1989)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018). https://doi.org/10.1109/TPAMI.2019.2913372
Huang, G., Liu, Z., Laurens, V.D.M., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017). https://doi.org/10.1109/CVPR.2017.243
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25, no. 2 (2012). https://doi.org/10.1145/3065386
Liu, P., Zhang, H., Lian, W., Zuo, W.: Multi-level wavelet convolutional neural networks (2019)
Ma, L., Stückler, J., Wu, T., Cremers, D.: Detailed dense inference with convolutional neural networks via discrete wavelet transform (2018)
Mallat, S.: A wavelet tour of signal processing (2009)
Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1520–1528 (2015)
Oktay, O., et al.: Attention u-net: learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)
Rieder, P., Götze, J., Nossek, J.A.: Multiwavelet transforms based on several scaling functions (1994)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Staal, J., Abramoff, M., Niemeijer, M., Viergever, M., van Ginneken, B.: Ridge-based vessel segmentation in color images of the retina. IEEE Trans. Med. Imaging 23(4), 501–509 (2004)
Strang, G., Strela, V.: Short wavelets and matrix dilation equations. IEEE Trans. Signal Process. 43(1), 108–115 (1995)
Williams, T., Li, R.: Advanced image classification using wavelets and convolutional neural networks (2017). https://doi.org/10.1109/ICMLA.2016.0046
Yang, H., et al.: CLCI-Net: cross-level fusion and context inference networks for lesion segmentation of chronic stroke. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11766, pp. 266–274. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32248-9_30
Zhuang, J.: Laddernet: multi-path networks based on u-net for medical image segmentation. arXiv preprint arXiv:1810.07810 (2018)
Acknowledgments
This work was supported by the National Natural Science Foundation of China grants 11871328, 61976134, 11601315 and 11701357.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, Y., Wang, Y., Leng, T., Zhijie, W. (2020). Wavelet U-Net for Medical Image Segmentation. In: Farkaš, I., Masulli, P., Wermter, S. (eds) Artificial Neural Networks and Machine Learning – ICANN 2020. ICANN 2020. Lecture Notes in Computer Science(), vol 12396. Springer, Cham. https://doi.org/10.1007/978-3-030-61609-0_63
Download citation
DOI: https://doi.org/10.1007/978-3-030-61609-0_63
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61608-3
Online ISBN: 978-3-030-61609-0
eBook Packages: Computer ScienceComputer Science (R0)