Improving deep learning-based image super-resolution with residual learning and perceptual loss using SRGAN model

Abbas, Rehman; Gu, Naijie

doi:10.1007/s00500-023-09126-4


Application of soft computing
Published: 07 September 2023

Volume 27, pages 16041–16057, (2023)
Soft Computing

Rehman Abbas¹ & Naijie Gu¹


Abstract

This study introduces an approach to the challenges of deep learning-based image super-resolution (SR). The proposed super-resolution generative adversarial network (SRGAN) model integrates residual learning into a deep recursive neural network design in order to improve the quality of the generated images. To achieve this improvement, several loss functions are applied progressively, including a structural similarity index (SSIM) loss and a mean squared error (MSE) loss, both grounded in the perceptual loss paradigm. These loss functions are designed to preserve the structural integrity of the generated images, which is a critical requirement for dependable image super-resolution. The proposed method was evaluated rigorously on benchmark datasets that are widely used in the field. Measured in terms of image quality and structural similarity, the model's performance demonstrates its effectiveness and its potential to advance current work on image super-resolution. The model achieves an accuracy of 99.04% and a loss of 3.19%, with an SSIM score of 0.97, a PSNR score of 34.6, and an MSE score of 0.012.
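The abstract names MSE, PSNR, and SSIM as the quality measures and describes a loss that combines MSE and SSIM terms. The following is a minimal sketch of those quantities in plain Python, not the authors' implementation. It assumes pixel values scaled to [0, 1], computes SSIM over the whole signal as a single window (practical implementations average over local windows), and uses a hypothetical helper `combined_loss` with an assumed weight `alpha`; the abstract does not state how the terms are weighted.

```python
import math
from statistics import fmean


def mse(x, y):
    """Mean squared error between two equally sized pixel sequences."""
    return fmean((a - b) ** 2 for a, b in zip(x, y))


def psnr(x, y, max_val=1.0):
    """Peak signal-to-noise ratio in dB; infinite when the inputs are identical."""
    err = mse(x, y)
    return math.inf if err == 0 else 10.0 * math.log10(max_val ** 2 / err)


def global_ssim(x, y, max_val=1.0, k1=0.01, k2=0.03):
    """SSIM over the whole signal treated as one window (a simplification).

    x and y must be sequences (e.g. lists), since they are traversed several times.
    """
    c1, c2 = (k1 * max_val) ** 2, (k2 * max_val) ** 2  # stabilising constants
    mx, my = fmean(x), fmean(y)
    vx = fmean((a - mx) ** 2 for a in x)
    vy = fmean((b - my) ** 2 for b in y)
    cov = fmean((a - mx) * (b - my) for a, b in zip(x, y))
    numerator = (2 * mx * my + c1) * (2 * cov + c2)
    denominator = (mx ** 2 + my ** 2 + c1) * (vx + vy + c2)
    return numerator / denominator


def combined_loss(sr, hr, alpha=0.5):
    """Hypothetical weighted sum of MSE and (1 - SSIM); alpha is an assumed weight."""
    return alpha * mse(sr, hr) + (1.0 - alpha) * (1.0 - global_ssim(sr, hr))


if __name__ == "__main__":
    hr = [0.0, 0.25, 0.5, 0.75, 1.0]   # toy "high-resolution" pixel values
    sr = [0.05, 0.2, 0.55, 0.7, 0.95]  # toy "super-resolved" estimate
    print("MSE :", mse(sr, hr))
    print("PSNR:", psnr(sr, hr))
    print("SSIM:", global_ssim(sr, hr))
    print("loss:", combined_loss(sr, hr))
```

In a training setting the same quantities would be computed on framework tensors so that the loss is differentiable; the list-based version here is only meant to make the definitions concrete.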


Availability of data and materials

Not applicable.


Acknowledgements

We thank the National Natural Science Foundation of China (U20A20229) for its generous financial support. We also thank the Chinese Academy of Sciences (CAS) and The World Academy of Sciences (TWAS) for their valuable support, which was instrumental in the successful execution of our research.

Funding

No funds, grants, or other support was received.

Author information

Authors and Affiliations

School of Computer Science and Technology, University of Science and Technology of China (USTC), Hefei, 230026, Anhui, China
Rehman Abbas & Naijie Gu


Corresponding author

Correspondence to Rehman Abbas.

Ethics declarations

Conflict of interest

The authors have no financial or proprietary interests in any material discussed in this article. The authors declare that they have no conflict of interest.

Ethical approval

Not applicable.

Informed consent

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article

Cite this article

Abbas, R., Gu, N. Improving deep learning-based image super-resolution with residual learning and perceptual loss using SRGAN model. Soft Comput 27, 16041–16057 (2023). https://doi.org/10.1007/s00500-023-09126-4


Accepted: 19 August 2023
Published: 07 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00500-023-09126-4
