19 February 2024 Attention-guided hybrid transformer-convolutional neural network for underwater image super-resolution
Zihan Zhan, Chaofeng Li, Yuqi Zhang
Author Affiliations +
Abstract

Underwater images suffer from localized distortion and blurred degradation of edge structures due to light absorption and scattering by water. However, existing super-resolution (SR) methods for underwater images cannot effectively solve the above problems and encounter model sizes that are too large. To this end, we propose an attention-guided hybrid transformer-CNN network (AHTCN) to improve the SR reconstruction of underwater images through the interaction of local and multiscale global information, as well as the long-range dependencies modeling capability. Specifically, AHTCN mainly consists of several cascaded transformer-CNN feature extraction blocks (TCFEB) and an image reconstruction module. In TCFEB, the designed attention-based channel separation mechanism can adaptively separate the weighted features while reducing the number of model parameters and then extract the local details and global structural information at different scales through the dual-stream structure. Moreover, we replace the feedforward layer in the transformer with the blueprint separable convolutional feedforward layer and propose an enhanced pyramid pooling transformer layer, which helps to strengthen the feature perception of the model. Experimental results demonstrate that AHTCN outperforms the state-of-the-art algorithms in terms of both subjective visual effects and objective quality assessment, while requiring fewer parameters.

© 2024 SPIE and IS&T
Zihan Zhan, Chaofeng Li, and Yuqi Zhang "Attention-guided hybrid transformer-convolutional neural network for underwater image super-resolution," Journal of Electronic Imaging 33(1), 013044 (19 February 2024). https://doi.org/10.1117/1.JEI.33.1.013044
Received: 23 October 2023; Accepted: 1 February 2024; Published: 19 February 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Feature extraction

Performance modeling

Image restoration

Lithium

Visualization

Image quality

Back to Top