skip to main content
10.1145/3503161.3547998acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Phase-based Memory Network for Video Dehazing

Published: 10 October 2022 Publication History

Abstract

Video dehazing using deep-learning based methods has just received increasing attention in recent years. However, most existing methods tackle temporal consistency in the color domain only, which are less sensitive to small and imperceptible motions in a video, due to fog's drift and diffusion. In this work, we investigate in the frequency domain, which enables us to capture small motions effectively, and find that the phase component contains more semantic structures yet less haze information than the amplitude component of the hazy image. Based on these observations, we propose a novel phase-based memory network (PM-Net) to integrate the phase and color memory information for boosting video dehazing. Apart from the color memory from consecutive video frames, our PM-Net constructs a phase memory, which stores phase features of past video frames, and devise a cross-modal memory read (CMR) module, which fully leverages features from the color memory and the phase memory to boost features extracted from the current video frame for dehazing. Experimental results on the benchmark dataset of real hazy videos and a newly collected dataset of synthetic videos, show that the proposed PM-Net clearly outperforms the state-of-the-art image and video dehazing methods. Code is available at https://github.com/liuye123321/PM-Net.

Supplementary Material

MP4 File (MM22-fp1023.mp4)
This is the presentation for "Phase-based Memory Network for Video Dehazing".

References

[1]
Bolun Cai, Xiangmin Xu, Kui Jia, Chunmei Qing, and Dacheng Tao. 2016. DehazeNet: An end-to-end system for single image haze removal. TIP 25, 11 (2016), 5187--5198.
[2]
Chen Chen, Minh N Do, and Jue Wang. 2016. Robust image and video dehazing with visual artifact suppression via gradient residual minimization. In ECCV. Springer, 576--591.
[3]
Zijun Deng, Lei Zhu, Xiaowei Hu, Chi-Wing Fu, Xuemiao Xu, Qing Zhang, Jing Qin, and Pheng-Ann Heng. 2019. Deep multi-model fusion for single-image dehazing. In ICCV. 2453--2462.
[4]
Hang Dong, Jinshan Pan, Lei Xiang, Zhe Hu, Xinyi Zhang, Fei Wang, and Ming- Hsuan Yang. 2020. Multi-scale boosted dehazing network with dense feature fusion. In CVPR. 2157--2167.
[5]
Jiangxin Dong and Jinshan Pan. 2020. Physics-Based Feature Dehazing Networks. In ECCV. 188--204.
[6]
Jun Guo and Hongyang Chao. 2016. Building dual-domain representations for compression artifacts reduction. In ECCV. Springer, 628--644.
[7]
Tiantong Guo, Hojjat Seyed Mousavi, Tiep Huu Vu, and Vishal Monga. 2017. Deep wavelet prediction for image super-resolution. In CVPR. 104--113.
[8]
Kaiming He, Jian Sun, and Xiaoou Tang. 2011. Single image haze removal using dark channel prior. TPAMI 33, 12 (2011), 2341--2353.
[9]
Ming Hong, Yuan Xie, Cuihua Li, and Yanyun Qu. 2020. Distilling image dehazing with heterogeneous task imitation. In CVPR. 3462--3471.
[10]
Huaibo Huang, Ran He, Zhenan Sun, and Tieniu Tan. 2017. Wavelet-srnet: A wavelet-based cnn for multi-scale face super resolution. In ICCV. 1689--1697.
[11]
Michal Irani and Shmuel Peleg. 1993. Motion analysis for image enhancement: Resolution, occlusion, and transparency. Journal of visual communication and image representation 4, 4 (1993), 324--335.
[12]
H. Ji and C Fermüller. 2009. Robust Wavelet-Based Super-Resolution Reconstruction: Theory and Algorithm. TPAMI 31, 4 (2009), 649--660.
[13]
Jiawan, Zhang, Liang, Li, Yi, Zhang, Guoqiang, Yang, Xiaochun, and CaoJizhou. 2011. Video dehazing with spatial and temporal coherence. Visual Computer 27, 6--8 (2011), 749--757.
[14]
Jin-Hwan Kim, Won-Dong Jang, Jae-Young Sim, and Chang-Su Kim. 2013. Optimized contrast enhancement for real-time image and video dehazing. Journal of Visual Communication and Image Representation 24, 3 (2013), 410--425.
[15]
Neeraj Kumar, Ruchika Verma, and Amit Sethi. 2017. Convolutional neural networks for wavelet domain super resolution. Pattern Recognition Letters 90 (2017), 65--71.
[16]
Boyun Li, Yuanbiao Gou, Jerry Zitao Liu, Hongyuan Zhu, Joey Tianyi Zhou, and Xi Peng. 2020. Zero-shot image dehazing. TIP 29 (2020), 8457--8466.
[17]
Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, and Dan Feng. 2017. AODNet: An all-in-one network for dehazing and beyond. In ICCV.
[18]
Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, and Dan Feng. 2018. Endto- end united video dehazing and detection. In AAAI, Vol. 32.
[19]
Yiyi Liao, Jun Xie, and Andreas Geiger. 2021. KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D. arXiv preprint arXiv:2109.13410 (2021).
[20]
Pengju Liu, Hongzhi Zhang, Kai Zhang, Liang Lin, and Wangmeng Zuo. 2018. Multi-level wavelet-CNN for image restoration. In CVPRW. 773--782.
[21]
Wei Liu, Andrew Rabinovich, and Alexander C Berg. 2016. ParseNet: Looking wider to see better. In ICLR.
[22]
Wei Liu, Fei Zhou, Tao Lu, Jiang Duan, and Guoping Qiu. 2020. Image defogging quality assessment: Real-world database and method. IEEE Transactions on Image Processing 30 (2020), 176--190.
[23]
Xiaohong Liu, Yongrui Ma, Zhihao Shi, and Jun Chen. 2019. Griddehazenet: Attention-based multi-scale network for image dehazing. In ICCV. 7313--7322.
[24]
Xiaohong Liu, Yongrui Ma, Zhihao Shi, and Jun Chen. 2019. Griddehazenet: Attention-based multi-scale network for image dehazing. In ICCV. 7314--7323.
[25]
Xing Liu, Masanori Suganuma, Zhun Sun, and Takayuki Okatani. 2019. Dual residual networks leveraging the potential of paired operations for image restoration. In CVPR. 7007--7016.
[26]
Ye Liu, Lei Zhu, Shunda Pei, Huazhu Fu, Jing Qin, Qing Zhang, Liang Wan, and Wei Feng. 2021. From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data. In ACM MM. 50--58.
[27]
Simone Meyer, Oliver Wang, Henning Zimmer, Max Grosse, and Alexander Sorkine-Hornung. 2015. Phase-based frame interpolation for video. In CVPR. 1410--1418.
[28]
Shree K Nayar and Srinivasa G Narasimhan. 1999. Vision in bad weather. In ICCV. 820--827.
[29]
SeoungWug Oh, Joon-Young Lee, Ning Xu, and Seon Joo Kim. 2019. Video Object Segmentation Using Space-Time Memory Networks. In ICCV.
[30]
Xu Qin, Zhilin Wang, Yuanchao Bai, Xiaodong Xie, and Huizhu Jia. 2020. FFANet: Feature Fusion Attention Network for Single Image Dehazing. In AAAI. 11908--11915.
[31]
Wenqi Ren, Si Liu, Hua Zhang, Jinshan Pan, Xiaochun Cao, and Ming-Hsuan Yang. 2016. Single image dehazing via multi-scale convolutional neural networks. In ECCV. 154--169.
[32]
Wenqi Ren, Lin Ma, Jiawei Zhang, Jinshan Pan, Xiaochun Cao, Wei Liu, and Ming-Hsuan Yang. 2018. Gated fusion network for single image dehazing. In CVPR. 3253--3261.
[33]
Wenqi Ren, Jingang Zhang, Xiangyu Xu, Lin Ma, Xiaochun Cao, Gaofeng Meng, and Wei Liu. 2018. Deep video dehazing with semantic segmentation. TIP 28, 4 (2018), 1895--1908.
[34]
Christos Sakaridis, Dengxin Dai, and Luc Van Gool. 2017. Semantic Foggy Scene Understanding with Synthetic Data. IJCV (2017).
[35]
Varun Santhaseelan and Vijayan K Asari. 2015. Utilizing local phase information to remove rain from video. IJCV 112, 1 (2015), 71--89.
[36]
Yuanjie Shao, Lerenhan Li, Wenqi Ren, Changxin Gao, and Nong Sang. 2020. Domain Adaptation for Image Dehazing. In CVPR. 2805--2814.
[37]
Yafei Song, Jia Li, Xiaogang Wang, and Xiaowu Chen. 2018. Single Image Dehazing Using Ranking Convolutional Neural Network. TMM 20, 6 (2018), 1548--1560.
[38]
Neal Wadhwa, Michael Rubinstein, Frédo Durand, and William T Freeman. 2013. Phase-based video motion processing. TOG 32, 4 (2013), 1--10.
[39]
Xintao Wang, Kelvin CK Chan, Ke Yu, Chao Dong, and Chen Change Loy. 2019. Edvr: Video restoration with enhanced deformable convolutional networks. In CVPR. 0--0.
[40]
Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. TIP 13, 4 (2004), 600--612.
[41]
Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. 2021. SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. arXiv preprint arXiv:2105.15203 (2021).
[42]
Dong Yang and Jian Sun. 2018. Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing. In ECCV. 729--746.
[43]
Feng Yu, Chunmei Qing, Xiangmin Xu, and Bolun Cai. 2016. Image and video dehazing using view-based cluster segmentation. In 2016 Visual Communications and Image Processing (VCIP). 1--4.
[44]
He Zhang and Vishal M Patel. 2018. Densely connected pyramid dehazing network. In CVPR. 3194--3203.
[45]
Xinyi Zhang, Hang Dong, Jinshan Pan, Chao Zhu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, and Fei Wang. 2021. Learning to restore hazy video: A new real-world dataset and a new method. In CVPR. 9239--9248.
[46]
Yulun Zhang, Kunpeng Li, Kai Li, LichenWang, Bineng Zhong, and Yun Fu. 2018. Image super-resolution using very deep residual channel attention networks. In ECCV. 294--310.
[47]
Zhisheng Zhong, Tiancheng Shen, Yibo Yang, Zhouchen Lin, and Chao Zhang. 2018. Joint sub-bands learning with clique structures for wavelet domain superresolution. Advances in neural information processing systems 31 (2018).
[48]
Lei Zhu, Zijun Deng, Xiaowei Hu, Haoran Xie, Xuemiao Xu, Jing Qin, and Pheng- Ann Heng. 2021. Learning gated non-local residual for single-image rain streak removal. TCSVT 31, 6 (2021), 2147--2159.
[49]
Lei Zhu, Chi-Wing Fu, Dani Lischinski, and Pheng-Ann Heng. 2017. Joint bi-layer optimization for single-image rain streak removal. In ICCV. 2526--2534.

Cited By

View all
  • (2024)ULD-CycleGAN: An Underwater Light Field and Depth Map-Optimized CycleGAN for Underwater Image EnhancementIEEE Journal of Oceanic Engineering10.1109/JOE.2024.342862449:4(1275-1288)Online publication date: Oct-2024
  • (2024)Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.02467(26109-26119)Online publication date: 16-Jun-2024
  • (2024)Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.02419(25606-25616)Online publication date: 16-Jun-2024
  • Show More Cited By

Index Terms

  1. Phase-based Memory Network for Video Dehazing

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '22: Proceedings of the 30th ACM International Conference on Multimedia
    October 2022
    7537 pages
    ISBN:9781450392037
    DOI:10.1145/3503161
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cross-modal
    2. memory
    3. phase feature
    4. video dehazing

    Qualifiers

    • Research-article

    Funding Sources

    • The Hong Kong Polytechnic University under Project of Strategic Importance
    • The National Natural Science Foundation of China

    Conference

    MM '22
    Sponsor:

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)71
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)ULD-CycleGAN: An Underwater Light Field and Depth Map-Optimized CycleGAN for Underwater Image EnhancementIEEE Journal of Oceanic Engineering10.1109/JOE.2024.342862449:4(1275-1288)Online publication date: Oct-2024
    • (2024)Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.02467(26109-26119)Online publication date: 16-Jun-2024
    • (2024)Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.02419(25606-25616)Online publication date: 16-Jun-2024
    • (2024)Multi-modal graph context extraction and consensus-aware learning for emotion recognition in conversationKnowledge-Based Systems10.1016/j.knosys.2024.111954298(111954)Online publication date: Aug-2024
    • (2024)Triplane-Smoothed Video Dehazing with CLIP-Enhanced GeneralizationInternational Journal of Computer Vision10.1007/s11263-024-02161-0133:1(475-488)Online publication date: 1-Aug-2024
    • (2024)Masked frequency-color fusion network for video instance-level hazy lane detectionThe Visual Computer10.1007/s00371-024-03671-1Online publication date: 14-Oct-2024
    • (2024)Self-Supervised Video Desmoking for Laparoscopic SurgeryComputer Vision – ECCV 202410.1007/978-3-031-73220-1_18(307-324)Online publication date: 3-Nov-2024
    • (2023)Single-Image Dehazing Based on Improved Bright Channel Prior and Dark Channel PriorElectronics10.3390/electronics1202029912:2(299)Online publication date: 6-Jan-2023
    • (2023)Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.01214(13154-13164)Online publication date: 1-Oct-2023
    • (2023)Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52729.2023.01731(18053-18062)Online publication date: Jun-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media