skip to main content
research-article

Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution

Published:25 August 2023Publication History
Skip Abstract Section

Abstract

Reference-based image super-resolution (RefSR) aims to reconstruct high-resolution (HR) images from low-resolution (LR) images by introducing HR reference images. The key step of RefSR is to transfer reference features to LR features. However, existing methods still lack an efficient transfer mechanism, resulting in blurry details in the generated image. In this article, we propose a double-layer search module and an adaptive pooling fusion module group for reference-based image super-resolution, called DLASR. Based on the re-search strategy, the double-layer search module can produce an accurate index map and score map. These two maps are used to filter out accurate reference features, which greatly increases the efficiency of feature transfer in the later stage. Through two continuous feature-enhancement steps, the adaptive pooling fusion module group can transfer more valuable reference features to the corresponding LR features. In addition, a structure reconstruction module is proposed to recover the geometric information of the images, which further improves the visual quality of the generated image. We conduct comparative experiments on a variety of datasets, and the results prove that DLASR achieves significant improvements over other state-of-the-art methods, in terms of quantitative accuracy and qualitative visual effect. The code is available at https://github.com/clttyou/DLASR.

REFERENCES

  1. [1] Aly Hussein A. and Dubois Eric. 2005. Image up-sampling using total-variation regularization with a new observation model. IEEE Trans. Image Process. 14, 10 (2005), 16471659.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Boominathan Vivek, Mitra Kaushik, and Veeraraghavan Ashok. 2014. Improving resolution and depth-of-field of light field cameras using a hybrid imaging system. In Proceedings of the 2014 IEEE International Conference on Computational Photography (ICCP). IEEE, 110.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Dong Chao, Loy Chen Change, He Kaiming, and Tang Xiaoou. 2014. Learning a deep convolutional network for image super-resolution. In Proceedings of the European Conference on Computer Vision. Springer, 184199.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Dong Chao, Loy Chen Change, He Kaiming, and Tang Xiaoou. 2015. Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38, 2 (2015), 295307.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Farajzadeh Nacer and Rezaei Negin S.. 2014. Vehicle logo recognition using image matching and textural features. In Proceedings of the Scientific Cooperations International Workshops on Electrical and Computer Engineering Subfields. 8286.Google ScholarGoogle Scholar
  6. [6] Fattal Raanan. 2007. Image upsampling via imposed edge statistics. In ACM SIGGRAPH 2007 Papers. 95102.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Gatys Leon A., Ecker Alexander S., and Bethge Matthias. 2015. Texture synthesis using convolutional neural networks. arXiv preprint arXiv:1505.07376 (2015).Google ScholarGoogle Scholar
  8. [8] Goodfellow Ian J., Pouget-Abadie Jean, Mirza Mehdi, Xu Bing, Warde-Farley David, Ozair Sherjil, Courville Aaron, and Bengio Yoshua. 2014. Generative adversarial networks. arXiv preprint arXiv:1406.2661 (2014).Google ScholarGoogle Scholar
  9. [9] Guo Yong, Chen Jian, Wang Jingdong, Chen Qi, Cao Jiezhang, Deng Zeshuai, Xu Yanwu, and Tan Mingkui. 2020. Closed-loop matters: Dual regression networks for single image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 54075416.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Haralick Robert M., Shanmugam Karthikeyan, and Dinstein Its’ Hak. 1973. Textural features for image classification. IEEE Trans. Syst Man.Cybe6 (1973), 610621.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770778.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] He Zhenliang, Zuo Wangmeng, Kan Meina, Shan Shiguang, and Chen Xilin. 2019. ATTGAN: Facial attribute editing by only changing what you want. IEEE Trans. Image Process. 28, 11 (2019), 54645478.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Hore Alain and Ziou Djemel. 2010. Image quality metrics: PSNR vs. SSIM. In Proceedings of the 2010 20th International Conference on Pattern Recognition. IEEE, 23662369.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Huang Gao, Liu Zhuang, Maaten Laurens Van Der, and Weinberger Kilian Q.. 2017. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 47004708.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Huang Jia Bin, Singh Abhishek, and Ahuja Narendra. 2015. Single image super-resolution from transformed self-exemplars. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 51975206.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Jin Xin, Xu Jianfeng, Tasaka Kazuyuki, and Chen Zhibo. 2021. Multi-task learning-based all-in-one collaboration framework for degraded image super-resolution. ACM Trans. Multimedia Comput. Commun. 17, 1 (2021), 121.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Kingma Diederik P. and Ba Jimmy. 2014. ADAM: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  18. [18] Ledig Christian, Theis Lucas, Huszár Ferenc, Caballero Jose, Cunningham Andrew, Acosta Alejandro, Aitken Andrew, Tejani Alykhan, Totz Johannes, Wang Zehan, et al. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 46814690.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Gatys Leon A., Ecker Alexander S., and Bethge Matthias. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 24142423.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Li Mengyan, Zhang Zhaoyu, Xie Guochen, and Yu Jun. 2020. A deep learning approach for face hallucination guided by facial boundary responses. ACM Trans. Multimedia Comput. Commun. 16, 1 (2020), 123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Li Xin and Orchard Michael T.. 2001. New edge-directed interpolation. IEEE Trans. Image Process. 10, 10 (2001), 15211527.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Li Xianguo, Sun Yemei, Yang Yanli, and Miao Changyun. 2019. Symmetrical residual connections for single image super-resolution. ACM Trans. Multimedia Comput. Commun. 15, 1 (2019), 110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Lim Bee, Son Sanghyun, Kim Heewon, Nah Seungjun, and Lee Kyoung Mu. 2017. Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 136144.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Lin Min, Chen Qiang, and Yan Shuicheng. 2013. Network in network. arXiv preprint arXiv:1312.4400 (2013).Google ScholarGoogle Scholar
  25. [25] Liu Qun, Basu Saikat, Ganguly Sangram, Mukhopadhyay Supratik, DiBiano Robert, Karki Manohar, and Nemani Ramakrishna. 2020. Deepsat v2: Feature augmented convolutional neural nets for satellite image classification. Remote Sensing Letters 11, 2 (2020), 156165.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Liu Xiangbin, He Jiesheng, Song Liping, Liu Shuai, and Srivastava Gautam. 2021. Medical image classification based on an adaptive size deep learning model. ACM Trans. Multimedia Comput. Commun. 17, 3s (2021), 118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Lowe David G.. 1999. Object recognition from local scale-invariant features. In Proceedings of the 7th IEEE International Conference on Computer Vision, Vol. 2. IEEE, 11501157.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Lugmayr Andreas, Danelljan Martin, Gool Luc Van, and Timofte Radu. 2020. SRFLOW: Learning the super-resolution space with normalizing flow. In Proceedings of the European Conference on Computer Vision. Springer, 715732.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Ma Xingjun, Niu Yuhao, Gu Lin, Wang Yisen, Zhao Yitian, Bailey James, and Lu Feng. 2021. Understanding adversarial attacks on deep learning based medical image analysis systems. Pattern Recognition 110 (2021), 107332.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Matsui Yusuke, Ito Kota, Aramaki Yuji, Fujimoto Azuma, Ogawa Toru, Yamasaki Toshihiko, and Aizawa Kiyoharu. 2017. Sketch-based manga retrieval using manga109 dataset. Multimedia Tools and Applications 76, 20 (2017), 2181121838.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Park Sung Cheol, Park Min Kyu, and Kang Moon Gi. 2003. Super-resolution image reconstruction: A technical overview. IEEE Signal Processing Magazine 20, 3 (2003), 2136.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Sajjadi Mehdi S. M., Scholkopf Bernhard, and Hirsch Michael. 2017. Enhancenet: Single image super-resolution through automated texture synthesis. In Proceedings of the IEEE International Conference on Computer Vision. 44914500.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Shi Wenzhe, Caballero Jose, Huszár Ferenc, Totz Johannes, Aitken Andrew P., Bishop Rob, Rueckert Daniel, and Wang Zehan. 2016. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 18741883.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Simonyan Karen and Zisserman Andrew. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).Google ScholarGoogle Scholar
  35. [35] Sun Jian, Xu Zongben, and Shum Heung-Yeung. 2008. Image super-resolution using gradient profile prior. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 18.Google ScholarGoogle Scholar
  36. [36] Sun Jian, Xu Zongben, and Shum Heung-Yeung. 2010. Gradient profile prior and its applications in image super-resolution and enhancement. IEEE Trans. Image Process. 20, 6 (2010), 15291542.Google ScholarGoogle Scholar
  37. [37] Wang Xintao, Yu Ke, Wu Shixiang, Gu Jinjin, Liu Yihao, Dong Chao, Qiao Yu, and Loy Chen Change. 2018. ESRGAN: Enhanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops.Google ScholarGoogle Scholar
  38. [38] Yan Qing, Xu Yi, Yang Xiaokang, and Nguyen Truong Q.. 2015. Single image superresolution based on gradient profile sharpness. IEEE Trans. Image Process. 24, 10 (2015), 31873202.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Yang Fuzhi, Yang Huan, Fu Jianlong, Lu Hongtao, and Guo Baining. 2020. Learning texture transformer network for image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 57915800.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Yang Wenhan, Feng Jiashi, Yang Jianchao, Zhao Fang, Liu Jiaying, Guo Zongming, and Yan Shuicheng. 2017. Deep edge guided recurrent residual learning for image super-resolution. IEEE Trans. Image Process. 26, 12 (2017), 58955907.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Yue Huanjing, Sun Xiaoyan, Yang Jingyu, and Wu Feng. 2013. Landmark image super-resolution by retrieving web images. IEEE Trans. Image Process. 22, 12 (2013), 48654878.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Zeyde Roman, Elad Michael, and Protter Matan. 2010. On single image scale-up using sparse-representations. In Proceedings of the International Conference on Curves and Surfaces. Springer, 711730.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Zhang Dongyang, Shao Jie, and Shen Heng Tao. 2020. Kernel attention network for single image super-resolution. ACM Trans. Multimedia Comput. Commun. 16, 3 (2020), 115.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Zhang Lei and Wu Xiaolin. 2006. An edge-guided image interpolation algorithm via directional filtering and data fusion. IEEE Trans. Image Process. 15, 8 (2006), 22262238.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Zhang Wenlong, Liu Yihao, Dong Chao, and Qiao Yu. 2019. Ranksrgan: Generative adversarial networks with ranker for image super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 30963105.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Zhang Yulun, Li Kunpeng, Li Kai, Wang Lichen, Zhong Bineng, and Fu Yun. 2018. Image super-resolution using very deep residual channel attention networks. In Proceedings of the European Conference on Computer Vision (ECCV). 286301.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Zhang Yulun, Tian Yapeng, Kong Yu, Zhong Bineng, and Fu Yun. 2018. Residual dense network for image super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 24722481.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Zhang Zhifei, Wang Zhaowen, Lin Zhe, and Qi Hairong. 2019. Image super-resolution by neural texture transfer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 79827991.Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Zhao Mandan, Wu Gaochang, Li Yipeng, Hao Xiangyang, Fang Lu, and Liu Yebin. 2018. Cross-scale reference-based light field super-resolution. IEEE Trans. Comput. Imaging 4, 3 (2018), 406418.Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Zheng Haitian, Ji Mengqi, Han Lei, Xu Ziwei, Wang Haoqian, Liu Yebin, and Fang Lu. 2017. Learning cross-scale correspondence and patch-based synthesis for reference-based super-resolution. In Proceedings of the BMVC.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Zheng Haitian, Ji Mengqi, Wang Haoqian, Liu Yebin, and Fang Lu. 2018. Crossnet: An end-to-end reference-based super resolution network using cross-scale warping. In Proceedings of the European Conference on Computer Vision (ECCV). 88104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Zhong Zhihang, Gao Ye, Zheng Yinqiang, and Zheng Bo. 2020. Efficient spatio-temporal recurrent neural network for video deblurring. In Proceedings of the European Conference on Computer Vision. Springer, 191207.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Zhu Xiangyuan, Guo Kehua, Fang Hui, Chen Liang, Ren Sheng, and Hu Bin. 2021. Cross view capture for stereo image super-resolution. IEEE Trans. Multimedia (2021).Google ScholarGoogle Scholar
  54. [54] Zhu Xiangyuan, Guo Kehua, Ren Sheng, Hu Bin, Hu Min, and Fang Hui. 2021. Lightweight image super-resolution with expectation-maximization attention mechanism. IEEE Trans. Circuits Syst. Video Techn. (2021).Google ScholarGoogle Scholar
  55. [55] Zhu Yu, Zhang Yanning, Bonev Boyan, and Yuille Alan L.. 2015. Modeling deformable gradient compositions for single-image super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 54175425.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 1
      January 2024
      639 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3613542
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 August 2023
      • Online AM: 21 June 2023
      • Accepted: 13 June 2023
      • Revised: 7 May 2023
      • Received: 7 October 2022
      Published in tomm Volume 20, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)327
      • Downloads (Last 6 weeks)23

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text