Semantic-Driven Context Aggregation Network for Underwater Image Enhancement

Shi, Dongxiang; Ma, Long; Liu, Risheng; Fan, Xin; Luo, Zhongxuan

doi:10.1007/978-3-030-88010-1_3

Semantic-Driven Context Aggregation Network for Underwater Image Enhancement

Dongxiang Shi¹⁶,
Long Ma¹⁶,
Risheng Liu¹⁷,
Xin Fan¹⁷ &
…
Zhongxuan Luo¹⁶

Conference paper
First Online: 22 October 2021

2301 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 13021))

Abstract

Recently, underwater image enhancement has attracted broad attention due to its potential in ocean exploitation. Unfortunately, limited to the hand-crafted subjective ground truth for matching low-quality underwater images, existing techniques are less robust for some unseen scenarios and may be unfriendly to semantic-related vision tasks. To handle these issues, we aim at introducing the high-level semantic features extracted from a pre-trained classification network into the image enhancement task for improving robustness and semantic-sensitive potency. To be specific, we design an encoder-aggregation-decoder architecture for enhancement, in which a context aggregation residual block is tailored to improve the representational capacity of the original encoder-decoder. Then we introduce a multi-scale feature transformation module that transforms the extracted multi-scale semantic-level features, to improve the robustness and endow the semantic-sensitive property for the encoder-aggregation-decoder network. In addition, during the training phase, the pre-trained classification network is fixed to avoid introducing training costs. Extensive experiments demonstrate the superiority of our method against other state-of-the-art methods. We also apply our method into the salient object detection task to reveal our excellent semantic-sensitive ability.

This work is partially supported by the National Natural Science Foundation of China (Nos. 61922019, 61733002, and 61672125), LiaoNing Revitalization Talents Program (XLYC1807088), and the Fundamental Research Funds for the Central Universities.

D. Shi—Author is a student.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Ancuti, C., Ancuti, C.O., Haber, T., Bekaert, P.: Enhancing underwater images and videos by fusion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 81–88 (2012)
Google Scholar
Chiang, J.Y., Chen, Y.C.: Underwater image enhancement by wavelength compensation and dehazing. IEEE Trans. Image Process. 21(4), 1756–1769 (2011)
Article MathSciNet Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition, pp. 248–255 (2009)
Google Scholar
Deng, S., et al.: Detail-recovery image deraining via context aggregation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 14560–14569 (2020)
Google Scholar
Drews, P.L., Nascimento, E.R., Botelho, S.S., Campos, M.F.M.: Underwater depth estimation and image restoration based on single images. IEEE Comput. Graphics Appl. 36(2), 24–35 (2016)
Article Google Scholar
Ebner, M.: Color constancy, vol. 7. John Wiley & Sons (2007)
Google Scholar
Fu, X., Fan, Z., Ling, M., Huang, Y., Ding, X.: Two-step approach for single underwater image enhancement. In: 2017 International Symposium on Intelligent Signal Processing and Communication Systems, pp. 789–794 (2017)
Google Scholar
Fu, Z., Fu, X., Huang, Y., Ding, X.: Twice mixing: a rank learning based quality assessment approach for underwater image enhancement. arXiv preprint arXiv:2102.00670 (2021)
Galdran, A., Pardo, D., Picón, A., Alvarez-Gila, A.: Automatic red-channel underwater image restoration. J. Vis. Commun. Image Represent. 26, 132–145 (2015)
Article Google Scholar
He, K., Sun, J., Tang, X.: Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell. 33(12), 2341–2353 (2010)
Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Google Scholar
Hummel, R.: Image enhancement by histogram transformation. Comput. Graph. Image Process. 6(2), 184–195 (1977)
Article MathSciNet Google Scholar
Islam, M.J., Wang, R., de Langis, K., Sattar, J.: Svam: saliency-guided visual attention modeling by autonomous underwater robots. arXiv preprint arXiv:2011.06252 (2020)
Islam, M.J., Xia, Y., Sattar, J.: Fast underwater image enhancement for improved visual perception. IEEE Robot. Automation Lett. 5(2), 3227–3234 (2020)
Article Google Scholar
Li, C., Anwar, S., Porikli, F.: Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recogn. 98, 107038 (2020)
Google Scholar
Li, C., Guo, C., Ren, W., Cong, R., Hou, J., Kwong, S., Tao, D.: An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process. 29, 4376–4389 (2019)
Article Google Scholar
Li, C., Guo, J., Guo, C.: Emerging from water: underwater image color correction based on weakly supervised color transfer. IEEE Signal Process. Lett. 25(3), 323–327 (2018)
Article Google Scholar
Liu, R., Fan, X., Hou, M., Jiang, Z., Luo, Z., Zhang, L.: Learning aggregated transmission propagation networks for haze removal and beyond. IEEE Trans. Neural Networks Learn. Syst. 30(10), 2973–2986 (2018)
Article Google Scholar
Liu, R., Fan, X., Zhu, M., Hou, M., Luo, Z.: Real-world underwater enhancement: challenges, benchmarks, and solutions under natural light. IEEE Trans. Circuits Syst. Video Technol. 30(12), 4861–4875 (2020)
Article Google Scholar
Liu, R., Ma, L., Zhang, J., Fan, X., Luo, Z.: Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10561–10570 (2021)
Google Scholar
Ma, L., Liu, R., Zhang, X., Zhong, W., Fan, X.: Video deraining via temporal aggregation-and-guidance. In: 2021 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2021)
Google Scholar
Mittal, A., Soundararajan, R., Bovik, A.C.: Making a “completely blind” image quality analyzer. IEEE Signal Process. Lett. 20(3), 209–212 (2012)
Google Scholar
Panetta, K., Gao, C., Agaian, S.: Human-visual-system-inspired underwater image quality measures. IEEE J. Oceanic Eng. 41(3), 541–551 (2015)
Article Google Scholar
Van De Weijer, J., Gevers, T., Gijsenij, A.: Edge-based color constancy. IEEE Trans. Image Process. 16(9), 2207–2214 (2007)
Article MathSciNet Google Scholar
Wang, Y., Zhang, J., Cao, Y., Wang, Z.: A deep cnn method for underwater image enhancement. In: IEEE International Conference on Image Processing, pp. 1382–1386 (2017)
Google Scholar
Wei, J., Wang, S., Huang, Q.: F\(^3\)net: Fusion, feedback and focus for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12321–12328 (2020)
Google Scholar
Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imaging 3(1), 47–57 (2016)
Article Google Scholar
Zuiderveld, K.: Contrast limited adaptive histogram equalization. Graphics gems, pp. 474–485 (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Software, Dalian University of Technology, Dalian, 116024, China
Dongxiang Shi, Long Ma & Zhongxuan Luo
DUT-RU International School of Information Science and Engineering, Dalian University of Technology, Dalian, 116024, China
Risheng Liu & Xin Fan

Authors

Dongxiang Shi
View author publications
You can also search for this author in PubMed Google Scholar
Long Ma
View author publications
You can also search for this author in PubMed Google Scholar
Risheng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Zhongxuan Luo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhongxuan Luo .

Editor information

Editors and Affiliations

University of Science and Technology Beijing, Beijing, China
Huimin Ma
Chinese Academy of Sciences, Beijing, China
Liang Wang
Tsinghua University, Beijing, China
Changshui Zhang
Zhejiang University, Hangzhou, China
Fei Wu
Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Hunan University, Changsha, China
Yaonan Wang
Sun Yat-Sen University, Guangzhou, Guangdong, China
Jianhuang Lai
Beijing Jiaotong University, Beijing, China
Yao Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shi, D., Ma, L., Liu, R., Fan, X., Luo, Z. (2021). Semantic-Driven Context Aggregation Network for Underwater Image Enhancement. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13021. Springer, Cham. https://doi.org/10.1007/978-3-030-88010-1_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-88010-1_3
Published: 22 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88009-5
Online ISBN: 978-3-030-88010-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics