Attention-guided multi-path cross-CNN for underwater image super-resolution

Zhang, Yan; Yang, Shangxue; Sun, Yemei; Liu, Shudong; Li, Xianguo

doi:10.1007/s11760-021-01969-4

Attention-guided multi-path cross-CNN for underwater image super-resolution

Original Paper
Published: 06 July 2021

Volume 16, pages 155–163, (2022)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Yan Zhang¹,
Shangxue Yang¹,
Yemei Sun¹,
Shudong Liu¹ &
…
Xianguo Li²

899 Accesses
1 Altmetric
Explore all metrics

Abstract

Raw underwater images usually suffer from quality degradation, and their resolutions are lower. To obtain high-resolution underwater images, some super-resolution (SR) algorithms have achieved great visual effect based on the excellent ability of deep convolution neural network. However, most previous works fail to consider the full use of inner information. Besides, simply widening and deepening the network contribute little to performance improvement. In this paper, an attention-guided multi-path cross-convolution neural network (AMPCNet) is proposed for underwater image SR. We present a multi-path cross (MPC)-module, which contains residual blocks and dilated blocks, to enhance the model’s learning capacity and increase abstract feature representation. Specifically, the design of the cross-connections realizes the mutual fusion of local features learned by residual blocks and multi-scale features acquired by dilated blocks. And the combination of dilated convolution and ordinary convolution achieves a trade-off between performance and efficiency. Furthermore, an attention block is presented to adaptively rescale the channel-wise features for more discriminative representations. Finally, the upsample block helps the reconstruction of HR images. Experimental results demonstrate the superiority of our AMPCNet network in terms of both quantitative metrics and visual quality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Towards domain adaptation underwater image enhancement and restoration

Article 15 February 2024

Underwater image enhancement by combining multi-attention with recurrent residual convolutional U-Net

Article 30 January 2024

Optimized UNet framework with a joint loss function for underwater image enhancement

Article Open access 01 March 2025

References

Anwar, S., Li, C.: Diving deeper into underwater image enhancement: a survey. ArXiv190707863 Cs Eess. (2019)
Li, C., Anwar, S., Porikli, F.: Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recognit. 98, 107038 (2020)
Article Google Scholar
Islam, J., Luo, P., Sattar, J.: Simultaneous enhancement and super-resolution of underwater imagery for improved visual perception. 14
Chen, Y., Sun, J., Jiao, W., Zhong, G.: Recovering super-resolution generative adversarial network for underwater images. In: Gedeon, T., Wong, K.W., Lee, M. (eds.) Neural Information Processing, pp. 75–83. Springer International Publishing, Cham (2019)
Chapter Google Scholar
Islam, M.J., Enan, S.S., Luo, P., Sattar, J.: Underwater image super-resolution using deep residual multipliers. ArXiv190909437 Cs Eess (2020)
Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) Computer Vision – ECCV 2014, pp. 184–199. Springer International Publishing, Cham (2014)
Chapter Google Scholar
Kim, J., Lee, J.K., Lee, K.M.: Accurate image super-resolution using very deep convolutional networks. ArXiv151104587 Cs. (2016)
Dai, T., Cai, J., Zhang, Y., Xia, S.-T., Zhang, L.: Second-order attention network for single image super-resolution. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 11057–11066. IEEE, Long Beach, CA, USA (2019)
Jiang, Q., Zhou, W., Chai, X., Yue, G., Shao, F., Chen, Z.: A full-reference stereoscopic image quality measurement via hierarchical deep feature degradation fusion. IEEE Trans. Instrum. Meas. 69, 9784–9796 (2020). https://doi.org/10.1109/TIM.2020.3005111
Article Google Scholar
Chen, Y., Li, W., Xia, M., Li, Q., Yang, K.: Super-resolution reconstruction for underwater imaging. Opt. Appl. 41, 841–853 (2011)
Google Scholar
Lu, H., Li, Y., Nakashima, S., Kim, H., Serikawa, S.: Underwater image super-resolution by descattering and fusion. IEEE Access. 5, 670–679 (2017)
Article Google Scholar
Chen, Y., Niu, K., Zeng, Z., Pan, Y.: A wavelet based deep learning method for underwater image super resolution reconstruction. IEEE Access. (2020)
Tian, C., Xu, Y., Zuo, W.: Image denoising using deep CNN with batch renormalization. Neural Netw. 121, 461–473 (2020). https://doi.org/10.1016/j.neunet.2019.08.022
Article Google Scholar
Chu, J., Li, X., Zhang, J., Lu, W.: Super-resolution using multi-channel merged convolutional network. Neurocomputing (2019). https://doi.org/10.1016/j.neucom.2019.04.089
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. ArXiv151203385 Cs. (2015)
Liu, J., Zhang, W., Tang, Y., Tang, J., Wu, G.: Residual feature aggregation network for image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 2359–2368 (2020)
Jiang, Q., Gao, W., Wang, S., Yue, G., Shao, F., Ho, Y.-S., Kwong, S.: Blind image quality measurement by exploiting high-order statistics with deep dictionary encoding network. IEEE Trans. Instrum. Meas. 69, 7398–7410 (2020). https://doi.org/10.1109/TIM.2020.2984928
Article Google Scholar
Cai, J., Meng, Z., Ho, C.M.: Residual channel attention generative adversarial network for image super-resolution and noise reduction. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1852–1861. IEEE, Seattle, WA, USA (2020)
Ledig, C., Theis, L., Huszar, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A., Tejani, A., Totz, J., Wang, Z., Shi, W.: Photo-realistic single image super-resolution using a generative adversarial network. ArXiv160904802 Cs Stat. (2017)
Lim, B., Son, S., Kim, H., Nah, S., Lee, K.M.: Enhanced deep residual networks for single image super-resolution. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1132–1140. IEEE, Honolulu, HI, USA (2017)
Tian, C., Xu, Y., Li, Z., Zuo, W., Fei, L., Liu, H.: Attention-guided CNN for image denoising. Neural Netw. 124, 117–129 (2020). https://doi.org/10.1016/j.neunet.2019.12.024
Article Google Scholar
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. ArXiv151107122 Cs. (2016)
Shi, W., Jiang, F., Zhao, D.: Single image super-resolution with dilated convolution based multi-scale information learning inception module. ArXiv170707128 Cs. (2017)
Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., Lu, H.: Dual attention network for scene segmentation. Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019)
Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Liu, W.: Ccnet: criss-cross attention for semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 603–612 (2019)
Chen, S., Tan, X., Wang, B., Hu, X.: Reverse attention for salient object detection. In: Proceedings of the European Conference on Computer Vision (ECCV). pp. 234–250 (2018)
Fan, D.-P., Wang, W., Cheng, M.-M., Shen, J.: Shifting more attention to video salient object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 8554–8564 (2019)
Wang, J., Liu, W., Xing, W., Wang, L., Zhang, S.: Attention shake siamese network with auxiliary relocation branch for visual object tracking. Neurocomputing. (2020)
Shen, J., Tang, X., Dong, X., Shao, L.: Visual object tracking by hierarchical attention siamese network. IEEE Trans. Cybern. (2019)
Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., Fu, Y.: Image super-resolution using very deep residual channel attention networks. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision –ECCV 2018, pp. 294–310. Springer International Publishing, Cham (2018). https://doi.org/10.1016/j.neucom.2019.08.070
Chapter Google Scholar
Bai, F., Lu, W., Huang, Y., Zha, L., Yang, J.: Densely convolutional attention network for image super-resolution. Neurocomputing 368, 25–33 (2019). https://doi.org/10.1016/j.neucom.2019.11.044
Article Google Scholar
Xue, S., Qiu, W., Liu, F., Jin, X.: Wavelet-based residual attention network for image super-resolution. Neurocomputing 382, 116–126 (2020). https://doi.org/10.1016/j.neucom.2019.06.078
Article Google Scholar
Wang, X., Gu, Y., Gao, X., Hui, Z.: Dual residual attention module network for single image super resolution. Neurocomputing 364, 269–279 (2019)
Article Google Scholar
Mehta, S., Rastegari, M., Caspi, A., Shapiro, L., Hajishirzi, H.: ESPNet: efficient spatial pyramid of dilated convolutions for semantic segmentation. ArXiv180306815 Cs. (2018)
Wang, Z., Ji, S.: Smoothed dilated convolutions for improved dense prediction. ArXiv180808931 Cs. (2019) https://doi.org/10.1145/3219819.3219944
Ziegler, T., Fritsche, M., Kuhn, L., Donhauser, K.: Efficient smoothing of dilated convolutions for image segmentation. ArXiv190307992 Cs. (2019)
Chen, L.-C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. ArXiv170605587 Cs. (2017)
Shi, W., Caballero, J., Huszar, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., Wang, Z.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1874–1883. IEEE, Las Vegas, NV, USA (2016)
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: 2015 IEEE International Conference on Computer Vision (ICCV). pp. 1026–1034. IEEE, Santiago, Chile (2015)
Zhang, K., Zuo, W., Zhang, L.: Deep plug-and-play super-resolution for arbitrary Blur Kernels. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1671–1681. IEEE, Long Beach, CA, USA (2019)
Panetta, K., Gao, C., Agaian, S.: Human-visual-system-inspired underwater image quality measures. IEEE J. Ocean. Eng. 41, 541–551 (2016). https://doi.org/10.1109/JOE.2015.2469915
Article Google Scholar
Islam, M.J., Xia, Y., Sattar, J.: Fast underwater image enhancement for improved visual perception. arXiv:1903.09766 [cs]. (2020)

Download references

Acknowledgements

This work was supported by the Scientific Research Project of Tianjin Municipal Education Commission [grant number 2019KJ105]. The authors also acknowledge the anonymous reviewers for their helpful comments on the manuscript.

Author information

Authors and Affiliations

School of Computer and Information Engineering, Tianjin Chengjian University, Tianjin, China
Yan Zhang, Shangxue Yang, Yemei Sun & Shudong Liu
Tianjin Key Laboratory of Optoelectronic Detection Technology and System, Tianjin, China
Xianguo Li

Authors

Yan Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Shangxue Yang
View author publications
You can also search for this author inPubMed Google Scholar
Yemei Sun
View author publications
You can also search for this author inPubMed Google Scholar
Shudong Liu
View author publications
You can also search for this author inPubMed Google Scholar
Xianguo Li
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Yemei Sun.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, Y., Yang, S., Sun, Y. et al. Attention-guided multi-path cross-CNN for underwater image super-resolution. SIViP 16, 155–163 (2022). https://doi.org/10.1007/s11760-021-01969-4

Download citation

Received: 27 September 2020
Revised: 31 March 2021
Accepted: 04 June 2021
Published: 06 July 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s11760-021-01969-4

Keyword

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Attention-guided multi-path cross-CNN for underwater image super-resolution

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Towards domain adaptation underwater image enhancement and restoration

Underwater image enhancement by combining multi-attention with recurrent residual convolutional U-Net

Optimized UNet framework with a joint loss function for underwater image enhancement

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keyword

Subscribe and save

Buy Now