skip to main content
10.1145/3394171.3413664acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution

Published: 12 October 2020 Publication History

Abstract

Due to the significant development of deep learning (DL) techniques, recent advances in the super-resolution (SR) field have achieved a great performance. While seeking for better performance, the later proposed networks prone to be deeper and heavier, which limits the applications of SR algorithms in the resource-constrain devices. Some advances rely on recurrent/recursive learning to reduce the number of network parameters, however, they ignore the caused long inference time, since the more recurrences/recursions are involved, the longer inference time the network needs. To address this trade-off issue between reconstruction performance, the number of network parameters, and inference time, we propose a lightweight and fast network (WSR) to learn wavelet coefficients of the target image progressively for single image super-resolution. More specifically, the network comprises two main branches. One is used for predicting the second level low-frequency wavelet coefficients, and the other one is designed in a recurrent way for predicting the rest wavelet coefficients at the first and second levels. Finally, an inverse wavelet transformation is adopted to reconstruct the SR images from these coefficients. In addition, we propose a deformable convolution kernel (side window) to construct the side-information multi-distillation block (S-IMDB), which is the basic unit of the recurrent blocks (RBs). We train the WSR with loss constraints at wavelet and spatial domains. Comprehensive experiments demonstrate that our WSR achieves a better trade-off than most of the state-of-the-art approaches. Code is available at https://github.com/FVL2020/WSR.

Supplementary Material

MP4 File (3394171.3413664.mp4)
Video file

References

[1]
Eirikur Agustsson and Radu Timofte. 2017. Ntire 2017 Challenge on Single Image Super-Resolution: Dataset and Study. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 126--135.
[2]
Marco Bevilacqua, Aline Roumy, Christine Guillemot, and Marie Line Alberi-Morel. 2012. Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding. (2012).
[3]
Rong Chen, Yuan Xie, Xiaotong Luo, Yanyun Qu, and Cuihua Li. 2019. Joint-attention Discriminator for Accurate Super-resolution via Adversarial Training. In Proceedings of the 27th ACM International Conference on Multimedia. 711--719.
[4]
Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2014. Learning a Deep Convolutional Network for Image Super-Resolution. In Proceedings of the European Conference on Computer Vision. 184--199.
[5]
Chao Dong, Chen Change Loy, and Xiaoou Tang. 2016. Accelerating the Super-Resolution Convolutional Neural Network. In Proceedings of the European Conference on Computer Vision. 391--407.
[6]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems. 2672--2680.
[7]
Bahadir K Gunturk, Yucel Altunbasak, and Russell M Mersereau. 2004. Super-Resolution Reconstruction of Compressed Video Using Transform-Domain Statistics. IEEE Transactions on Image Processing, Vol. 13, 1 (2004), 33--43.
[8]
Tiantong Guo, Hojjat Seyed Mousavi, Tiep Huu Vu, and Vishal Monga. 2017. Deep Wavelet Prediction for Image Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 104--113.
[9]
Muhammad Haris, Gregory Shakhnarovich, and Norimichi Ukita. 2018. Deep Back-Projection Networks for Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1664--1673.
[10]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Delving Deep into Rectifiers: Surpassing Human-Level Performance on Imagenet Classification. In Proceedings of the IEEE International Conference on Computer Vision. 1026--1034.
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.
[12]
Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja. 2015. Single Image Super-Resolution from Transformed Self-Exemplars. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5197--5206.
[13]
Zheng Hui, Xinbo Gao, Yunchu Yang, and Xiumei Wang. 2019. Lightweight Image Super-Resolution with Information Multi-Distillation Network. In Proceedings of the 27th ACM International Conference on Multimedia. 2024--2032.
[14]
Zheng Hui, Xiumei Wang, and Xinbo Gao. 2018. Fast and Accurate Single Image Super-Resolution via Information Distillation Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 723--731.
[15]
Zhi Jin, Muhammad Zafar Iqbal, Dmytro Bobkov, Wenbin Zou, Xia Li, and Eckehard Steinbach. 2019. A Flexible Deep CNN Framework for Image Restoration. IEEE Transactions on Multimedia (2019).
[16]
Zhi Jin, Muhammad Zafar Iqbal, Wenbin Zou, Xia Li, and Eckehard Steinbach. 2020. Dual-stream Multi-path Recursive Residual Network for JPEG Image Compression Artifacts Reduction. IEEE Transactions on Circuits and Systems for Video Technology (2020).
[17]
Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. 2016a. Accurate Image Super-Resolution Using Very Deep Convolutional Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1646--1654.
[18]
Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. 2016b. Deeply-Recursive Convolutional Network for Image Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1637--1645.
[19]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[20]
Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, and Ming-Hsuan Yang. 2017. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 624--632.
[21]
Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, et almbox. 2017. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4681--4690.
[22]
Zhen Li, Jinglei Yang, Zheng Liu, Xiaomin Yang, Gwanggil Jeon, and Wei Wu. 2019. Feedback Network for Image Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3867--3876.
[23]
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. 2017. Enhanced Deep Residual Networks for Single Image Super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 136--144.
[24]
David Martin, Charless Fowlkes, Doron Tal, and Jitendra Malik. 2001. A Database of Human Segmented Natural Images and Its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics. In Proceedings of the IEEE International Conference on Computer Vision. 416--423.
[25]
Yusuke Matsui, Kota Ito, Yuji Aramaki, Azuma Fujimoto, Toru Ogawa, Toshihiko Yamasaki, and Kiyoharu Aizawa. 2017. Sketch-Based Manga Retrieval Using Manga109 Dataset. Multimedia Tools and Applications, Vol. 76, 20 (2017), 21811--21838.
[26]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et almbox. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems. 8024--8035.
[27]
Wenzhe Shi, Jose Caballero, Ferenc Huszár, Johannes Totz, Andrew P Aitken, Rob Bishop, Daniel Rueckert, and Zehan Wang. 2016. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1874--1883.
[28]
Wenzhe Shi, Jose Caballero, Christian Ledig, Xiahai Zhuang, Wenjia Bai, Kanwal Bhatia, Antonio M Simoes Monteiro de Marvao, Tim Dawes, Declan O'Regan, and Daniel Rueckert. 2013. Cardiac Image Super-Resolution with Global Correspondence Using Multi-Atlas Patchmatch. In International Conference on Medical Image Computing and Computer-Assisted Intervention. 9--16.
[29]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv preprint arXiv:1409.1556 (2014).
[30]
Ying Tai, Jian Yang, and Xiaoming Liu. 2017a. Image Super-Resolution via Deep Recursive Residual Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3147--3155.
[31]
Ying Tai, Jian Yang, Xiaoming Liu, and Chunyan Xu. 2017b. Memnet: A Persistent Memory Network for Image Restoration. In Proceedings of the IEEE International Conference on Computer Vision. 4539--4547.
[32]
Tong Tong, Gen Li, Xiejie Liu, and Qinquan Gao. 2017. Image Super-Resolution Using Dense Skip Connections. In Proceedings of the IEEE International Conference on Computer Vision. 4799--4807.
[33]
Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. 2018a. Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 606--615.
[34]
Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. 2018b. Esrgan: Enhanced Super-Resolution Generative Adversarial Networks. In Proceedings of the European Conference on Computer Vision Workshops. 0--0.
[35]
Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. 2004. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Transactions on Image Processing, Vol. 13, 4 (2004), 600--612.
[36]
Bin-Cheng Yang. 2019. Super Resolution Using Dual Path Connections. In Proceedings of the 27th ACM International Conference on Multimedia. 1552--1560.
[37]
Deniz Yildirim and Oug uz Güngör. 2012. A Novel Image Fusion Method Using IKONOS Satellite Images. Journal of Geodesy and Geoinformation, Vol. 1, 1 (2012), 75--83.
[38]
Jiahui Yu, Yuchen Fan, Jianchao Yang, Ning Xu, Zhaowen Wang, Xinchao Wang, and Thomas Huang. 2018. Wide Activation for Efficient and Accurate Image Super-Resolution. arXiv preprint arXiv:1808.08718 (2018).
[39]
Roman Zeyde, Michael Elad, and Matan Protter. 2010. On Single Image Scale-Up Using Sparse-Representations. In International Conference on Curves and Surfaces. 711--730.
[40]
Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. 2018. Image Super-Resolution Using Very Deep Residual Channel Attention Networks. In Proceedings of the European Conference on Computer Vision. 286--301.

Cited By

View all
  • (2025)Guided image filtering-conventional to deep models: A review and evaluation studyComputer Vision and Image Understanding10.1016/j.cviu.2025.104278252(104278)Online publication date: Feb-2025
  • (2024)An Efficient Latent Style Guided Transformer-CNN Framework for Face Super-ResolutionIEEE Transactions on Multimedia10.1109/TMM.2023.328385626(1589-1599)Online publication date: 2024
  • (2024)A Wavelet-Domain Consistency-Constrained Compressive Sensing Framework Based on Memory-Boosted Guidance FilteringIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.339809673(1-16)Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN:9781450379885
DOI:10.1145/3394171
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 October 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. fast inference
  2. image super-resolution
  3. lightweight network
  4. recurrent learning
  5. side window convolution
  6. wavelet prediction

Qualifiers

  • Research-article

Funding Sources

  • the National Natural Science Foundation of China
  • Key-Area Research and Development Program of Guangdong Province

Conference

MM '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)45
  • Downloads (Last 6 weeks)1
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Guided image filtering-conventional to deep models: A review and evaluation studyComputer Vision and Image Understanding10.1016/j.cviu.2025.104278252(104278)Online publication date: Feb-2025
  • (2024)An Efficient Latent Style Guided Transformer-CNN Framework for Face Super-ResolutionIEEE Transactions on Multimedia10.1109/TMM.2023.328385626(1589-1599)Online publication date: 2024
  • (2024)A Wavelet-Domain Consistency-Constrained Compressive Sensing Framework Based on Memory-Boosted Guidance FilteringIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.339809673(1-16)Online publication date: 2024
  • (2024)CSPN: A Category-Specific Processing Network for Low-Light Image EnhancementIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.342652734:11(11929-11941)Online publication date: Nov-2024
  • (2024)RTNN: A Neural Network-Based In-Loop Filter in VVC Using Resblock and TransformerIEEE Access10.1109/ACCESS.2024.343152712(104599-104610)Online publication date: 2024
  • (2024)Enhanced wavelet based spatiotemporal fusion networks using cross-paired remote sensing imagesISPRS Journal of Photogrammetry and Remote Sensing10.1016/j.isprsjprs.2024.04.016211(281-297)Online publication date: May-2024
  • (2023)Dynamic Frame Interpolation in Wavelet DomainIEEE Transactions on Image Processing10.1109/TIP.2023.331515132(5296-5309)Online publication date: 2023
  • (2023)Wavelet Dual-Stream Network for Brain MR Image Super-Resolution2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191590(1-9)Online publication date: 18-Jun-2023
  • (2023)WDU-Net: Wavelet-Guided Deep Unfolding Network for Image Compressed Sensing ReconstructionPattern Recognition and Computer Vision10.1007/978-981-99-8537-1_7(79-91)Online publication date: 26-Dec-2023
  • (2023)A Cross-Paired Wavelet Based Spatiotemporal Fusion Network for Remote Sensing ImagesImage and Graphics10.1007/978-3-031-46317-4_13(143-154)Online publication date: 29-Oct-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media