Skip to main content
Log in

DPMFformer: an underwater image enhancement network based on deep pooling and multi-scale fusion transformer

  • Research
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Due to light absorption and scattering, underwater images often suffer from color distortion, low contrast, and blurred details, seriously affects the effectiveness of advanced computer vision tasks. To address these degradation issues, this paper proposes an innovative underwater image enhancement algorithm, Deep Pooling and Multi-Scale Fusion Transformer (DPMFformer). The algorithm is composed of four key modules: the Dual-Balanced Multiscale Fusion Module (DBMF), the Deep Pooling Self-Attention Transformer (DPST), the Wavelet Sampling (WS), and the Global Spatial Feature Self-Attention Transformer (GSFAT). The DBMF module employs trainable color modules to simulate the grey-scale world theory, achieving inter-channel color balance. The DPST module enhances the network’s ability to extract information from feature regions through a deep-pooling layer and spatial attention mechanism. The WS module utilizes Harr wavelet sampling instead of conventional up- and down-sampling, preserving low-frequency information while improving the up-sampling outcome. The GSFAT module combines Swin Transformer (SwinT) and Position Embedding Cascading Transformer (PCET), enhancing the extraction of global information through position embedding and a sliding window self-attention mechanism, thereby improving the attention on the degraded regions of the image. Experimental results show that the proposed DPMFfomer is superior to existing underwater image enhancement methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

No datasets were generated or analysed during the current study.

References

Download references

Funding

This work was supported by Special projects in universities' key fields of Guangdong Province (2023ZDZX3017), 2022 Tertiary Education Scientific research project of Guangzhou Municipal Education Bureau (202234607), the National Natural Science Foundation of China (52101358). The General Universities' Key Scientific Research Platform Project of Guangdong Province(2023KSYS009).

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Dan Xiang, Wenlei Yang, Zebin Zhou, Jinwen Zhang, Jianxin Li, Jing Ling and Jian Ouyang. The first draft of the manuscript was written by Wenlei Yang and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jian Ouyang or Jing Ling.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Communicated by: Hassan Babaie

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiang, D., Yang, W., Zhou, Z. et al. DPMFformer: an underwater image enhancement network based on deep pooling and multi-scale fusion transformer. Earth Sci Inform 18, 61 (2025). https://doi.org/10.1007/s12145-024-01573-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12145-024-01573-3

Keywords