Abstract
Recently, underwater image enhancement has attracted broad attention due to its potential in ocean exploitation. Unfortunately, limited to the hand-crafted subjective ground truth for matching low-quality underwater images, existing techniques are less robust for some unseen scenarios and may be unfriendly to semantic-related vision tasks. To handle these issues, we aim at introducing the high-level semantic features extracted from a pre-trained classification network into the image enhancement task for improving robustness and semantic-sensitive potency. To be specific, we design an encoder-aggregation-decoder architecture for enhancement, in which a context aggregation residual block is tailored to improve the representational capacity of the original encoder-decoder. Then we introduce a multi-scale feature transformation module that transforms the extracted multi-scale semantic-level features, to improve the robustness and endow the semantic-sensitive property for the encoder-aggregation-decoder network. In addition, during the training phase, the pre-trained classification network is fixed to avoid introducing training costs. Extensive experiments demonstrate the superiority of our method against other state-of-the-art methods. We also apply our method into the salient object detection task to reveal our excellent semantic-sensitive ability.
This work is partially supported by the National Natural Science Foundation of China (Nos. 61922019, 61733002, and 61672125), LiaoNing Revitalization Talents Program (XLYC1807088), and the Fundamental Research Funds for the Central Universities.
D. Shi—Author is a student.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ancuti, C., Ancuti, C.O., Haber, T., Bekaert, P.: Enhancing underwater images and videos by fusion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 81–88 (2012)
Chiang, J.Y., Chen, Y.C.: Underwater image enhancement by wavelength compensation and dehazing. IEEE Trans. Image Process. 21(4), 1756–1769 (2011)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition, pp. 248–255 (2009)
Deng, S., et al.: Detail-recovery image deraining via context aggregation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 14560–14569 (2020)
Drews, P.L., Nascimento, E.R., Botelho, S.S., Campos, M.F.M.: Underwater depth estimation and image restoration based on single images. IEEE Comput. Graphics Appl. 36(2), 24–35 (2016)
Ebner, M.: Color constancy, vol. 7. John Wiley & Sons (2007)
Fu, X., Fan, Z., Ling, M., Huang, Y., Ding, X.: Two-step approach for single underwater image enhancement. In: 2017 International Symposium on Intelligent Signal Processing and Communication Systems, pp. 789–794 (2017)
Fu, Z., Fu, X., Huang, Y., Ding, X.: Twice mixing: a rank learning based quality assessment approach for underwater image enhancement. arXiv preprint arXiv:2102.00670 (2021)
Galdran, A., Pardo, D., Picón, A., Alvarez-Gila, A.: Automatic red-channel underwater image restoration. J. Vis. Commun. Image Represent. 26, 132–145 (2015)
He, K., Sun, J., Tang, X.: Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell. 33(12), 2341–2353 (2010)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Hummel, R.: Image enhancement by histogram transformation. Comput. Graph. Image Process. 6(2), 184–195 (1977)
Islam, M.J., Wang, R., de Langis, K., Sattar, J.: Svam: saliency-guided visual attention modeling by autonomous underwater robots. arXiv preprint arXiv:2011.06252 (2020)
Islam, M.J., Xia, Y., Sattar, J.: Fast underwater image enhancement for improved visual perception. IEEE Robot. Automation Lett. 5(2), 3227–3234 (2020)
Li, C., Anwar, S., Porikli, F.: Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recogn. 98, 107038 (2020)
Li, C., Guo, C., Ren, W., Cong, R., Hou, J., Kwong, S., Tao, D.: An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process. 29, 4376–4389 (2019)
Li, C., Guo, J., Guo, C.: Emerging from water: underwater image color correction based on weakly supervised color transfer. IEEE Signal Process. Lett. 25(3), 323–327 (2018)
Liu, R., Fan, X., Hou, M., Jiang, Z., Luo, Z., Zhang, L.: Learning aggregated transmission propagation networks for haze removal and beyond. IEEE Trans. Neural Networks Learn. Syst. 30(10), 2973–2986 (2018)
Liu, R., Fan, X., Zhu, M., Hou, M., Luo, Z.: Real-world underwater enhancement: challenges, benchmarks, and solutions under natural light. IEEE Trans. Circuits Syst. Video Technol. 30(12), 4861–4875 (2020)
Liu, R., Ma, L., Zhang, J., Fan, X., Luo, Z.: Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10561–10570 (2021)
Ma, L., Liu, R., Zhang, X., Zhong, W., Fan, X.: Video deraining via temporal aggregation-and-guidance. In: 2021 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2021)
Mittal, A., Soundararajan, R., Bovik, A.C.: Making a “completely blind” image quality analyzer. IEEE Signal Process. Lett. 20(3), 209–212 (2012)
Panetta, K., Gao, C., Agaian, S.: Human-visual-system-inspired underwater image quality measures. IEEE J. Oceanic Eng. 41(3), 541–551 (2015)
Van De Weijer, J., Gevers, T., Gijsenij, A.: Edge-based color constancy. IEEE Trans. Image Process. 16(9), 2207–2214 (2007)
Wang, Y., Zhang, J., Cao, Y., Wang, Z.: A deep cnn method for underwater image enhancement. In: IEEE International Conference on Image Processing, pp. 1382–1386 (2017)
Wei, J., Wang, S., Huang, Q.: F\(^3\)net: Fusion, feedback and focus for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12321–12328 (2020)
Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imaging 3(1), 47–57 (2016)
Zuiderveld, K.: Contrast limited adaptive histogram equalization. Graphics gems, pp. 474–485 (1994)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Shi, D., Ma, L., Liu, R., Fan, X., Luo, Z. (2021). Semantic-Driven Context Aggregation Network for Underwater Image Enhancement. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13021. Springer, Cham. https://doi.org/10.1007/978-3-030-88010-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-88010-1_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88009-5
Online ISBN: 978-3-030-88010-1
eBook Packages: Computer ScienceComputer Science (R0)