Skip to main content

Multi-scale Attention Conditional GAN for Underwater Image Enhancement

  • Conference paper
  • First Online:
Advances in Computer Graphics (CGI 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14495))

Included in the following conference series:

  • 226 Accesses

Abstract

Underwater image enhancement (UIE) has achieved impressive achievements in various marine tasks, such as aquaculture and biological monitoring. However, complex underwater scenarios impede current UIE method application development. Some UIE methods utilize convolutional neural network (CNN) based models to improve the quality of degradation images, but these methods fail to capture multi-scale high-level features, leading to sub-optimal results. To address these issues, we propose a multi-scale attention conditional generative adversarial network (GAN), dubbed Mac-GAN, to recover the degraded underwater images by utilizing an encoder-decoder structure. Concretely, a novel multi-scale conditional GAN architecture is utilized to aggregate the multi-scale features and reconstruct the high-quality underwater images with high perceptual information. Different from the reference model, a novel attention module (AMU) is designed to integrate associated features among the channels for the UIE tasks and embedded after the down sampling layer, effectively suppressing non-significant features to improve the extraction effect of multi-scale features. Meanwhile, perceptual loss and total variation loss are utilized to enhance smoothness and suppress artifacts. Extensive experiments demonstrate that our proposed model achieves remarkable results in terms of qualitative and quantitative metrics, such as 0.7dB improvement in PSNR metrics and 0.8dB improvement in UIQM metrics. Moreover, Mac-GAN can generate a pleasing visual result without obvious over-enhancement and over-saturation over the comparison of UIE methods. A detailed set of ablation experiments analyzes core components’ contribution to the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fabbri, C., Islam, M.J., Sattar, J.: Enhancing underwater imagery using generative adversarial networks. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 7159–7165(2018)

    Google Scholar 

  2. Li, C., Anwar, S., Porikli, F.: Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recogn. 98, 107038 (2020)

    Article  Google Scholar 

  3. Li, C., Anwar, S., Hou, J., et al.: Underwater image enhancement via medium transmission-guided multi-color space embedding. In: 2021 IEEE Transactions on Image Processing, vol. 30, pp. 4985–5000 (2021)

    Google Scholar 

  4. Cosmin, A., et al.: Enhancing underwater images and videos by fusion. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 81–88 (2012)

    Google Scholar 

  5. Peng, Y.-T., Cosman, P.C., et al.: Underwater image restoration based on image blurriness and light absorption. In: 2017 IEEE Transactions on Image Processing, vol. 26, pp. 1579–1594 (2017)

    Google Scholar 

  6. Jian, S., Wen, W.: Study on underwater image denoising algorithm based on wavelet transform. J. Phys. Conf. Ser. 806, 012006 (2017)

    Article  Google Scholar 

  7. Drews, P., Nascimento, E., Moraes, F., et al.: Transmission estimation in underwater single images. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 825–830 (2013)

    Google Scholar 

  8. Wang, Y., Zhang, J., Cao, Y., et al.: A deep CNN method for underwater image enhancement. In: 2017 IEEE International Conference on Image Processing (ICIP), pp. 1382–1386 (2017)

    Google Scholar 

  9. Hou, M., Liu, R., Fan, X., et al.: Joint residual learning for underwater image enhancement. In: 2018 IEEE International Conference on Image Processing (ICIP), pp. 4043–4047 (2018)

    Google Scholar 

  10. Guo, Y., Li, H., Zhuang, P.: Underwater image enhancement using a multiscale dense generative adversarial network. IEEE J. Oceanic Eng. 45, 862–870 (2019)

    Article  Google Scholar 

  11. Liu, X., Gao, Z., Chen, B.M.: MLFcGAN: multilevel feature fusion-based conditional GAN for underwater image color correction. IEEE Geosci. Remote Sens. Lett. 17, 1488–1492 (2019)

    Article  Google Scholar 

  12. Li, C., Guo, C., Ren, W., et al.: An underwater image enhancement benchmark dataset and beyond. In: 2019 IEEE Transactions on Image Processing, vol. 29, pp. 4376–4389 (2019)

    Google Scholar 

  13. Jamadandi, A., Mudenagudi, U.: Exemplar-based underwater image enhancement augmented by wavelet corrected transforms. In: 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) workshops, pp. 11–17 (2019)

    Google Scholar 

  14. Islam, M.J., Xia, Y., Sattar, J.: Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Let. 5, 3227–3234 (2020)

    Article  Google Scholar 

  15. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 42, pp. 7132–7141 (2018)

    Google Scholar 

  16. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention, vol. 9351, pp. 234–241 (2015)

    Google Scholar 

  17. Isola, P., Zhu, J.Y., Zhou, T., et al.: Image-to-image translation with conditional adversarial networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1125–1134 (2017)

    Google Scholar 

  18. Liu, Y., Shao, Z., Teng, Y., et al.: NAM: normalization-based attention module. In: 2021 Conference on Neural Information Processing Systems (NeurIPS) Workshops (2021)

    Google Scholar 

  19. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: 2015 International Conference on Machine Learning (ICML), vol. 37 (2015)

    Google Scholar 

  20. Gulrajani, I., Ahmed, F., Arjovsky, M., et al.: Improved training of wasserstein gans. In: 2017 Advances in Neural Information Processing Systems (NIPS), vol. 30 (2017)

    Google Scholar 

  21. Shao, Y., Li, L., Ren, W., et al.: Domain adaptation for image dehazing, in 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2808–2817 (2020)

    Google Scholar 

  22. Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15, 3440–3451 (2006)

    Article  Google Scholar 

  23. Wang, Z., Bovik, A.C., Sheikh, H.R., et al.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13 (2004)

    Google Scholar 

  24. Panetta, K., Gao, C., Agaian, S.: Human-visual-system-inspired underwater image quality measures. IEEE J. Oceanic Eng. 41(3), 541–551 (2016). https://doi.org/10.1109/JOE.2015.2469915

    Article  Google Scholar 

  25. Yang, M., Sowmya, A.: An underwater color image quality evaluation metric. IEEE J. Oceanic Eng. 246062–246071(2015)

    Google Scholar 

  26. Liu, R., Fan, X., Zhu, M., Hou, M., Luo, Z.: Real-world underwater enhancement: challenges, benchmarks, and solutions under natural light. IEEE Trans. Circuits Syst. Video Technol. 30, 4861–4875 (2020)

    Article  Google Scholar 

  27. Li, C., Anwar, S., Hou, J., et al.: Underwater image enhancement via medium transmission-guided multi-color space embedding. IEEE Trans. Image Process. 30, 4985–5000 (2021)

    Article  Google Scholar 

  28. Lin, X., Sun, S., Huang, W., et al.: EAPT: efficient attention pyramid transformer for image processing. IEEE Trans. Multimedia 25, 50–61 (2023)

    Article  Google Scholar 

  29. Li, L., Tang, J., Ye, Z., et al.: Unsupervised face super-resolution via gradient enhancement and semantic guidance. Vis. Comput. 37, 2855–2867 (2021)

    Article  Google Scholar 

  30. Guo, Z., Shao, M., Li, S.: Image-to-image translation using an offset-based multi-scale codes GAN encoder. Visual Comput. 1–17 (2023)

    Google Scholar 

  31. Zhang, Y., Han, S., Zhang, Z., et al.: CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis. Vis. Comput. 39(4), 1283–1293 (2023)

    Google Scholar 

Download references

Acknowledgments

This study is supported by Key-Area Research and Development Program of Guangdong Province - Ecological engineering breeding technology and model in seawater ponds (2020B0202010009). We thank editor and the anonymous reviewers who reviewed this paper for their valuable suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenbo Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, Y., Li, F., Li, Z. (2024). Multi-scale Attention Conditional GAN for Underwater Image Enhancement. In: Sheng, B., Bi, L., Kim, J., Magnenat-Thalmann, N., Thalmann, D. (eds) Advances in Computer Graphics. CGI 2023. Lecture Notes in Computer Science, vol 14495. Springer, Cham. https://doi.org/10.1007/978-3-031-50069-5_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-50069-5_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-50068-8

  • Online ISBN: 978-3-031-50069-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics