Abstract
The objective of image dehazing algorithms is to extract latent clear image details from degraded images obscured by inhomogeneous haze, improving the quality of the degraded images. In the image dehazing task, transformer-based models demonstrate effective spatial aggregation capability through multi-head self-attention. However, those models tend to overlook the effective interaction between channel information and lack the utilization of feature information of other layers in the image reconstruction stage. This paper proposes a novel dehazing model called MFFormer, which comprises a feature interaction block (FIB) and a multi-level feature-boosted module (MFBF). Specifically, the FIB provides the model with global channel feature information and depth spatial feature information through channel interaction operation, allowing the model to efficiently model degraded regions of images. The MFBF integrates shallow feature information and deep feature information at different scales, enhancing the role of shallow features in the image reconstruction stage. Furthermore, we propose a content-guided contrastive regularization that focus on optimizing the model with shallow hidden features to recover more image details. Experimental results on synthetic and real-world datasets demonstrate that the proposed MFFormer achieves superior dehazing results with a smaller number of parameters compared to the state-of-the-art models.The code is released in https://github.com/Dudragon1/MFFormer.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03665-5/MediaObjects/11760_2024_3665_Fig9_HTML.png)
Similar content being viewed by others
Data Availability
No datasets were generated or analysed during the current study.
References
Nayar, S.K., Narasimhan, S.G.: Vision in bad weather. In: Proceedings of the seventh IEEE international conference on computer vision. IEEE, 2: 820–827 (1999)
Narasimhan, S.G., Nayar, S.K.: Vision and the atmosphere. Int. J. Comput. Vision 48, 233–254 (2002)
McCartney, E.J.: Optics of the atmosphere: scattering by molecules and particles. New York, (1976)
He, K., Sun, J., Tang, X.: Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell. 33(12), 2341–2353 (2010)
Zhu, Q., Mai, J., Shao, L.: A fast single image haze removal algorithm using color attenuation prior. IEEE Trans. Image Process. 24(11), 3522–3533 (2015)
Berman, D., Avidan, S.: Non-local image dehazing. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 1674-1682 (2016)
Tang, K., Yang, J., Wang, J.: Investigating haze-relevant features in a learning framework for image dehazing. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2995–3000 (2014)
Qu, Y., Chen, Y., Huang, J., et al.: Enhanced pix2pix dehazing network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 8160–8168 (2019)
Li, B., Peng, X., Wang, Z., et al.: Aod-net: All-in-one dehazing network. In: Proceedings of the IEEE international conference on computer vision. 4770–4778 (2017)
Zhao, S., Zhang, L., Shen, Y., et al.: RefineDNet: a weakly supervised refinement framework for single image dehazing[J]. IEEE Trans. Image Process. 30, 3391–3404 (2021)
Liu, X., Ma, Y., Shi, Z., et al.: Griddehazenet: Attention-based multi-scale network for image dehazing. In: Proceedings of the IEEE/CVF international conference on computer vision. 7314-7323 (2019)
Qian, W., Zhou, C., Zhang, D.: FAOD-Net: a fast AOD-Net for dehazing single image[J]. Math. Probl. Eng. 2020, 1–11 (2020)
Su, Y.Z., He, C., Cui, Z.G., et al.: Physical model and image translation fused network for single-image dehazing[J]. Pattern Recogn. 142, 109700 (2023)
Wang, N., Cui, Z., Su, Y., et al.: Multiscale supervision-guided context aggregation network for single image dehazing. IEEE Signal Process. Lett. 29, 70–74 (2021)
Dong, H., Pan, J., Xiang, L., et al.: Multi-scale boosted dehazing network with dense feature fusion. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2157-2167 (2020)
Hu, G., Tan, A., He, L., et al.: Pyramid feature boosted network for single image dehazing. Int. J. Mach. Learn. Cybern. 14(6), 2099–2110 (2023)
Ye, T., Jiang, M., Zhang, Y., et al.: Perceiving and modeling density is all you need for image dehazing. arxiv preprint arxiv:2111.09733, (2021)
Song, Y., He, Z., Qian, H., et al.: Vision transformers for single image dehazing. IEEE Trans. Image Process. 32, 1927–1941 (2023)
Qiu, Y., Zhang, K., Wang, C., et al.: MB-TaylorFormer: Multi-branch efficient transformer expanded by Taylor formula for image dehazing. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 12802-12813 (2023)
Liu, J., Yuan, H., Yuan, Z., et al.: Visual transformer with stable prior and patch-level attention for single image dehazing[J]. Neurocomputing 551, 126535 (2023)
Guo, C.L., Yan, Q., Anwar, S., et al.: Image dehazing transformer with transmission-aware 3d position embedding. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 5812-5820 (2022)
Cui, Y., Knoll, A.: Exploring the potential of channel interactions for image restoration[J]. Knowl.-Based Syst. 282, 111156 (2023)
Zamir, S.W., Arora, A., Khan, S.: Learning enriched features for real image restoration and enhancement. In: Computer Vision-ECCV, et al.: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXV 16. Springer International Publishing 2020, 492–511 (2020)
Wu, H., Qu, Y., Lin, S., et al.: Contrastive learning for compact single image dehazing. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10551-10560 (2021)
Qin, X., Wang, Z., Bai, Y., et al.: FFA-Net: Feature fusion attention network for single image dehazing. In: Proceedings of the AAAI conference on artificial intelligence. 34(07): 11908-11915 (2020)
Dong, J., Pan, J., Physics-based feature dehazing networks. In: Computer Vision-ECCV,: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXX 16. Springer International Publishing 2020, 188–204 (2020)
Wang, N., Cui, Z., Su, Y., et al.: Prior-guided multiscale network for single-image dehazing. IET Image Proc. 15(13), 3368–3379 (2021)
Cui, Z., Wang, N., Su, Y., et al.: ECANet: enhanced context aggregation network for single image dehazing. SIViP 17(2), 471–479 (2023)
Lan, Y., Cui, Z., Su, Y., et al.: Online knowledge distillation network for single image dehazing. Sci. Rep. 12(1), 14927 (2022)
Tu, Z., Talebi, H., Zhang, H., et al.: Maxim: Multi-axis mlp for image processing. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 5769-5780 (2022)
Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arxiv preprint arxiv:2010.11929, (2020)
Liu, Z., Lin, Y., Cao, Y., et al.: Swin transformer: Hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF international conference on computer vision. 10012-10022 (2021)
Park, N., Kim, S.: How do vision transformers work? arxiv preprint arxiv:2202.06709, (2022)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 7132-7141 (2018)
Chen, Z., He, Z., Lu, Z. M.: DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attention. IEEE Transactions on Image Processing, (2024)
Sun, H., Li, B., Dan, Z., et al.: Multi-level feature interaction and efficient non-local information enhanced channel attention for image dehazing[J]. Neural Netw. 163, 10–27 (2023)
Ronneberger, O., Fischer, P., Brox, T., U-net: Convolutional networks for biomedical image segmentation. Medical image computing and computer-assisted intervention-MICCAI,: 18th international conference, Munich, Germany, October 5–9, 2015, proceedings, part III 18. Springer International Publishing 2015, 234–241 (2015)
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2117-2125 (2017)
Romano, Y., Elad, M.: Boosting of image denoising algorithms. SIAM J. Imag. Sci. 8(2), 1187–1219 (2015)
Li, B., Ren, W., Fu, D., et al.: Benchmarking single-image dehazing and beyond. IEEE Trans. Image Process. 28(1), 492–505 (2018)
Ancuti, C.O., Ancuti, C., Sbert, M., Dense-haze: A benchmark for image dehazing with dense-haze and haze-free images[C], et al.: IEEE international conference on image processing (ICIP). IEEE 2019, 1014–1018 (2019)
Ancuti, C.O., Ancuti, C., Timofte, R.: NH-HAZE: An image dehazing benchmark with non-homogeneous hazy and haze-free images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 444-445 (2020)
Bai, H., Pan, J., Xiang, X., et al.: Self-guided image dehazing using progressive feature fusion. IEEE Trans. Image Process. 31, 1217–1229 (2022)
Lu, L.P., Xiong, Q., Chu, D.F., et al.: MixDehazeNet: Mix structure block for image dehazing network. arxiv preprint arxiv:2305.17654, (2023)
Luo, Z., Gustafsson, F.K., Zhao, Z., et al.: Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models. arxiv preprint arxiv:2404.09732, (2024)
Author information
Authors and Affiliations
Contributions
X.G. contributed to the conception of the study; D.H. performed the experiments and contributed significantly to analysis and manuscript preparation; Z.Z. helped perform the analysis of implementation methods. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gong, X., Du, H. & Zheng, Z. MFFormer: multi-level boosted transformer expanded by feature interaction block. SIViP 19, 96 (2025). https://doi.org/10.1007/s11760-024-03665-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-024-03665-5