Abstract
Underwater image enhancement (UIE) has achieved impressive achievements in various marine tasks, such as aquaculture and biological monitoring. However, complex underwater scenarios impede current UIE method application development. Some UIE methods utilize convolutional neural network (CNN) based models to improve the quality of degradation images, but these methods fail to capture multi-scale high-level features, leading to sub-optimal results. To address these issues, we propose a multi-scale attention conditional generative adversarial network (GAN), dubbed Mac-GAN, to recover the degraded underwater images by utilizing an encoder-decoder structure. Concretely, a novel multi-scale conditional GAN architecture is utilized to aggregate the multi-scale features and reconstruct the high-quality underwater images with high perceptual information. Different from the reference model, a novel attention module (AMU) is designed to integrate associated features among the channels for the UIE tasks and embedded after the down sampling layer, effectively suppressing non-significant features to improve the extraction effect of multi-scale features. Meanwhile, perceptual loss and total variation loss are utilized to enhance smoothness and suppress artifacts. Extensive experiments demonstrate that our proposed model achieves remarkable results in terms of qualitative and quantitative metrics, such as 0.7dB improvement in PSNR metrics and 0.8dB improvement in UIQM metrics. Moreover, Mac-GAN can generate a pleasing visual result without obvious over-enhancement and over-saturation over the comparison of UIE methods. A detailed set of ablation experiments analyzes core components’ contribution to the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fabbri, C., Islam, M.J., Sattar, J.: Enhancing underwater imagery using generative adversarial networks. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 7159–7165(2018)
Li, C., Anwar, S., Porikli, F.: Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recogn. 98, 107038 (2020)
Li, C., Anwar, S., Hou, J., et al.: Underwater image enhancement via medium transmission-guided multi-color space embedding. In: 2021 IEEE Transactions on Image Processing, vol. 30, pp. 4985–5000 (2021)
Cosmin, A., et al.: Enhancing underwater images and videos by fusion. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 81–88 (2012)
Peng, Y.-T., Cosman, P.C., et al.: Underwater image restoration based on image blurriness and light absorption. In: 2017 IEEE Transactions on Image Processing, vol. 26, pp. 1579–1594 (2017)
Jian, S., Wen, W.: Study on underwater image denoising algorithm based on wavelet transform. J. Phys. Conf. Ser. 806, 012006 (2017)
Drews, P., Nascimento, E., Moraes, F., et al.: Transmission estimation in underwater single images. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 825–830 (2013)
Wang, Y., Zhang, J., Cao, Y., et al.: A deep CNN method for underwater image enhancement. In: 2017 IEEE International Conference on Image Processing (ICIP), pp. 1382–1386 (2017)
Hou, M., Liu, R., Fan, X., et al.: Joint residual learning for underwater image enhancement. In: 2018 IEEE International Conference on Image Processing (ICIP), pp. 4043–4047 (2018)
Guo, Y., Li, H., Zhuang, P.: Underwater image enhancement using a multiscale dense generative adversarial network. IEEE J. Oceanic Eng. 45, 862–870 (2019)
Liu, X., Gao, Z., Chen, B.M.: MLFcGAN: multilevel feature fusion-based conditional GAN for underwater image color correction. IEEE Geosci. Remote Sens. Lett. 17, 1488–1492 (2019)
Li, C., Guo, C., Ren, W., et al.: An underwater image enhancement benchmark dataset and beyond. In: 2019 IEEE Transactions on Image Processing, vol. 29, pp. 4376–4389 (2019)
Jamadandi, A., Mudenagudi, U.: Exemplar-based underwater image enhancement augmented by wavelet corrected transforms. In: 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) workshops, pp. 11–17 (2019)
Islam, M.J., Xia, Y., Sattar, J.: Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Let. 5, 3227–3234 (2020)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 42, pp. 7132–7141 (2018)
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention, vol. 9351, pp. 234–241 (2015)
Isola, P., Zhu, J.Y., Zhou, T., et al.: Image-to-image translation with conditional adversarial networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1125–1134 (2017)
Liu, Y., Shao, Z., Teng, Y., et al.: NAM: normalization-based attention module. In: 2021 Conference on Neural Information Processing Systems (NeurIPS) Workshops (2021)
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: 2015 International Conference on Machine Learning (ICML), vol. 37 (2015)
Gulrajani, I., Ahmed, F., Arjovsky, M., et al.: Improved training of wasserstein gans. In: 2017 Advances in Neural Information Processing Systems (NIPS), vol. 30 (2017)
Shao, Y., Li, L., Ren, W., et al.: Domain adaptation for image dehazing, in 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2808–2817 (2020)
Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15, 3440–3451 (2006)
Wang, Z., Bovik, A.C., Sheikh, H.R., et al.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13 (2004)
Panetta, K., Gao, C., Agaian, S.: Human-visual-system-inspired underwater image quality measures. IEEE J. Oceanic Eng. 41(3), 541–551 (2016). https://doi.org/10.1109/JOE.2015.2469915
Yang, M., Sowmya, A.: An underwater color image quality evaluation metric. IEEE J. Oceanic Eng. 246062–246071(2015)
Liu, R., Fan, X., Zhu, M., Hou, M., Luo, Z.: Real-world underwater enhancement: challenges, benchmarks, and solutions under natural light. IEEE Trans. Circuits Syst. Video Technol. 30, 4861–4875 (2020)
Li, C., Anwar, S., Hou, J., et al.: Underwater image enhancement via medium transmission-guided multi-color space embedding. IEEE Trans. Image Process. 30, 4985–5000 (2021)
Lin, X., Sun, S., Huang, W., et al.: EAPT: efficient attention pyramid transformer for image processing. IEEE Trans. Multimedia 25, 50–61 (2023)
Li, L., Tang, J., Ye, Z., et al.: Unsupervised face super-resolution via gradient enhancement and semantic guidance. Vis. Comput. 37, 2855–2867 (2021)
Guo, Z., Shao, M., Li, S.: Image-to-image translation using an offset-based multi-scale codes GAN encoder. Visual Comput. 1–17 (2023)
Zhang, Y., Han, S., Zhang, Z., et al.: CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis. Vis. Comput. 39(4), 1283–1293 (2023)
Acknowledgments
This study is supported by Key-Area Research and Development Program of Guangdong Province - Ecological engineering breeding technology and model in seawater ponds (2020B0202010009). We thank editor and the anonymous reviewers who reviewed this paper for their valuable suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Li, Y., Li, F., Li, Z. (2024). Multi-scale Attention Conditional GAN for Underwater Image Enhancement. In: Sheng, B., Bi, L., Kim, J., Magnenat-Thalmann, N., Thalmann, D. (eds) Advances in Computer Graphics. CGI 2023. Lecture Notes in Computer Science, vol 14495. Springer, Cham. https://doi.org/10.1007/978-3-031-50069-5_38
Download citation
DOI: https://doi.org/10.1007/978-3-031-50069-5_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-50068-8
Online ISBN: 978-3-031-50069-5
eBook Packages: Computer ScienceComputer Science (R0)