research-article

Phase-based Memory Network for Video Dehazing

Authors:

Lei Zhu

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 5427 - 5435

https://doi.org/10.1145/3503161.3547998

Published: 10 October 2022

Abstract

Video dehazing using deep-learning-based methods has received increasing attention in recent years. However, most existing methods enforce temporal consistency in the color domain only, which is less sensitive to small, imperceptible motions in a video caused by the drift and diffusion of fog. In this work, we investigate the frequency domain, which enables us to capture small motions effectively, and find that the phase component contains more semantic structure yet less haze information than the amplitude component of a hazy image. Based on these observations, we propose a novel phase-based memory network (PM-Net) that integrates phase and color memory information to boost video dehazing. Apart from the color memory built from consecutive video frames, our PM-Net constructs a phase memory, which stores phase features of past video frames, and devises a cross-modal memory read (CMR) module, which fully leverages features from the color memory and the phase memory to boost the features extracted from the current video frame for dehazing. Experimental results on the benchmark dataset of real hazy videos and a newly collected dataset of synthetic videos show that the proposed PM-Net clearly outperforms state-of-the-art image and video dehazing methods. Code is available at https://github.com/liuye123321/PM-Net.
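The observation about phase and amplitude can be illustrated with a small NumPy sketch. This is not the PM-Net implementation: it assumes a toy haze model with spatially uniform transmission and airlight (`I = J * t + A * (1 - t)`), a synthetic grayscale frame, and hypothetical helper names (`split_amplitude_phase`, `merge_amplitude_phase`). Under that idealized assumption the haze changes only the Fourier amplitude, so the phase of the hazy frame still carries the clean structure. Real haze varies with depth, so the effect is only approximate in practice.

```python
import numpy as np

def split_amplitude_phase(image):
    """Return the Fourier amplitude and phase of a 2-D grayscale image."""
    spectrum = np.fft.fft2(image)
    return np.abs(spectrum), np.angle(spectrum)

def merge_amplitude_phase(amplitude, phase):
    """Rebuild a real image from a Fourier amplitude and phase."""
    spectrum = amplitude * np.exp(1j * phase)
    return np.fft.ifft2(spectrum).real

# Hypothetical clean frame: random texture plus a bright square.
rng = np.random.default_rng(0)
clean = 0.2 * rng.random((64, 64))
clean[16:48, 16:48] += 0.6

# Assumed toy haze model with uniform transmission t and airlight A.
transmission, airlight = 0.5, 0.9
hazy = clean * transmission + airlight * (1.0 - transmission)

amp_clean, phase_clean = split_amplitude_phase(clean)
amp_hazy, phase_hazy = split_amplitude_phase(hazy)

# Combine the clean amplitude with the phase taken from the hazy frame.
restored = merge_amplitude_phase(amp_clean, phase_hazy)

# How far the phase-swapped reconstruction is from the clean frame,
# and how much the amplitude spectrum was changed by the haze.
phase_only_error = float(np.max(np.abs(restored - clean)))
amplitude_change = float(np.max(np.abs(amp_hazy - amp_clean)))
print(phase_only_error, amplitude_change)
```

In this toy setting the reconstruction error stays at floating-point noise level while the amplitude spectrum changes substantially, which matches the intuition that phase is the more haze-robust component.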

Supplementary Material

MP4 File (MM22-fp1023.mp4, 166.73 MB)

This is the presentation for "Phase-based Memory Network for Video Dehazing".

Index Terms

Computing methodologies → Artificial intelligence → Computer vision → Computer vision tasks

Published In

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

October 2022

7537 pages

ISBN:9781450392037

DOI:10.1145/3503161

General Chairs:
João Magalhães (NOVA University of Lisbon, Portugal)
Alberto del Bimbo (University of Florence, Italy)
Shin'ichi Satoh (National Institute of Informatics, Japan)
Nicu Sebe (University of Trento, Italy)

Program Chairs:
Xavier Alameda-Pineda (Inria, Grenoble, France)
Qin Jin (Renmin University of China, China)
Vincent Oria (New Jersey Institute of Technology, USA)
Laura Toni (University College London, UK)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States


Funding Sources

The Hong Kong Polytechnic University under Project of Strategic Importance
The National Natural Science Foundation of China

Conference

MM '22: The 30th ACM International Conference on Multimedia

October 10 - 14, 2022

Lisboa, Portugal

Article Metrics

Total Citations: 11
Total Downloads: 413
Downloads (Last 12 months): 71
Downloads (Last 6 weeks): 8

Reflects downloads up to 05 Mar 2025

Cited By

Zhang G, Li C, Yan J, Zheng Y (2024). ULD-CycleGAN: An Underwater Light Field and Depth Map-Optimized CycleGAN for Underwater Image Enhancement. IEEE Journal of Oceanic Engineering 49:4 (1275-1288). Online publication date: Oct-2024.
https://doi.org/10.1109/JOE.2024.3428624
Fan J, Weng J, Wang K, Yang Y, Qian J, Li J, Yang J (2024). Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (26109-26119). Online publication date: 16-Jun-2024.
https://doi.org/10.1109/CVPR52733.2024.02467
Yang Y, Wu H, Aviles-Rivero A, Zhang Y, Qin J, Zhu L (2024). Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (25606-25616). Online publication date: 16-Jun-2024.
https://doi.org/10.1109/CVPR52733.2024.02419
Dai Y, Li J, Li Y, Lu G (2024). Multi-modal graph context extraction and consensus-aware learning for emotion recognition in conversation. Knowledge-Based Systems 298 (111954). Online publication date: Aug-2024.
https://doi.org/10.1016/j.knosys.2024.111954
Ren J, Chen H, Ye T, Wu H, Zhu L (2024). Triplane-Smoothed Video Dehazing with CLIP-Enhanced Generalization. International Journal of Computer Vision 133:1 (475-488). Online publication date: 1-Aug-2024.
https://doi.org/10.1007/s11263-024-02161-0
Liu Y, Zhu L, Wan L, Wang X (2024). Masked frequency-color fusion network for video instance-level hazy lane detection. The Visual Computer. Online publication date: 14-Oct-2024.
https://doi.org/10.1007/s00371-024-03671-1
Wu R, Zhang Z, Zhang S, Gou L, Chen H, Zhang L, Chen H, Zuo W (2024). Self-Supervised Video Desmoking for Laparoscopic Surgery. Computer Vision – ECCV 2024 (307-324). Online publication date: 3-Nov-2024.
https://doi.org/10.1007/978-3-031-73220-1_18
Li C, Yuan C, Pan H, Yang Y, Wang Z, Zhou H, Xiong H (2023). Single-Image Dehazing Based on Improved Bright Channel Prior and Dark Channel Prior. Electronics 12:2 (299). Online publication date: 6-Jan-2023.
https://doi.org/10.3390/electronics12020299
Yang Y, Aviles-Rivero A, Fu H, Liu Y, Wang W, Zhu L (2023). Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation. 2023 IEEE/CVF International Conference on Computer Vision (ICCV) (13154-13164). Online publication date: 1-Oct-2023.
https://doi.org/10.1109/ICCV51070.2023.01214
Xu J, Hu X, Zhu L, Dou Q, Dai J, Qiao Y, Heng P (2023). Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (18053-18062). Online publication date: Jun-2023.
https://doi.org/10.1109/CVPR52729.2023.01731