Abstract
This paper proposes a wavelet domain-based method for multispectral image super-resolution. The stationary wavelet transform is proposed to decompose the multispectral image into directional wavelet components and for each wavelet component, a joint dictionary learning algorithm is proposed. Using sparse and redundant representations, the proposed approach helps capture intrinsic multispectral features using wavelet domain learning utilizing the up-sampling property of (SWT). The proposed method can learn and recover those image features more accurately. In order to validate the proposed method, we conducted comprehensive experiments. Moreover, we present a comparison of our proposed method with state-of-the-art algorithms over PSNR and SSIM evaluation parameters. The results of the experiments indicate that the proposed method outperforms state-of-the-art methods.







Similar content being viewed by others
Data availability
Data will be available on request.
References
Kaji, S., Kida, S.: Overview of image-to-image translation by use of deep neural networks: denoising, super-resolution, modality conversion, and reconstruction in medical imaging. Radiological physics and technology 12(3), 235–248 (2019)
Han, Q., Yin, Q., Zheng, X., Chen, Z.: Remote sensing image building detection method based on mask r-cnn. Complex & Intelligent Systems 8(3), 1847–1855 (2022)
Lillesand, T., Kiefer, R.W., Chipman, J.: Remote sensing and image interpretation (John Wiley & Sons, 2015)
Du, H., Shi, H., Zeng, D., Zhang, X.P., Mei, T.: The elements of end-to-end deep face recognition: A survey of recent advances. ACM Computing Surveys (CSUR) 54(10s), 1–42 (2022)
Wang, Y., Bashir, S.M.A., Khan, M., Ullah, Q., Wang, R., Song, Y., Guo, Z., Niu, Y.: Remote sensing image super-resolution and object detection: Benchmark and state of the art. Expert Systems with Applications p. 116793 (2022)
Patel, V., Mistree, K.: A review on different image interpolation techniques for image enhancement. International Journal of Emerging Technology and Advanced Engineering 3(12), 129–133 (2013)
Keys, R.: Cubic convolution interpolation for digital image processing. IEEE transactions on acoustics, speech, and signal processing 29(6), 1153–1160 (1981)
Gao, F Wang, Y., Yang, Z., Ma, Y., Zhang, Q.: Single image super-resolution based on multi-scale dense attention network. Soft Computing pp. 1–12 (2022)
Dong, C., Loy, C.C., He, K., Tang, X.: in European conference on computer vision (Springer), pp. 184–199(2014)
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence 38(2), 295–307 (2015)
Nazzal, M., Ozkaramanli, H.: Wavelet domain dictionary learning-based single image superresolution. Signal, Image and Video Processing 9(7), 1491–1501 (2015)
Ahmed, J., Baloch, G.L., Ozkaramanli, H.: in 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (IEEE), pp. 1–5 (2017)
Ayas, S., Ekinci, M.: Single image super resolution based on sparse representation using discrete wavelet transform. Multimedia Tools and Applications 77(13), 16685–16698 (2018)
Dai, S., Han, M., Xu, W., Wu, Y., Gong, Y.: in 2007 IEEE Conference on Computer Vision and Pattern Recognition (IEEE), pp. 1–8(2007)
Sun, J., Xu, Z., Shum, H.Y.: in 2008 IEEE Conference on Computer Vision and Pattern Recognition (IEEE), pp. 1–8(2008)
Zhang, X., Wu, X.: Image interpolation by adaptive 2-d autoregressive modeling and soft-decision estimation. IEEE transactions on image processing 17(6), 887–896 (2008)
Yang, J., Lin, Z., Cohen, S.: in Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1059–1066 (2013)
Hawe, S., Kleinsteuber, M., Diepold, K.: Analysis operator learning and its application to image reconstruction. IEEE Transactions on Image Processing 22(6), 2138–2150 (2013)
Mallat, S., Yu, G.: Super-resolution with sparse mixing estimators. IEEE transactions on image processing 19(11), 2889–2900 (2010)
Wei, X., Dragotti, P.L.: Fresh-fri-based single-image super-resolution algorithm. IEEE Transactions on Image Processing 25(8), 3723–3735 (2016)
Ahmed, J., Baloch, G.L., Ozkaramanli, H.: in 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (IEEE), pp. 1–5 (2017)
Kang, X., Duan, P., Xu, R.: Single image super-resolution based on mapping-vector clustering and nonlinear pixel-reconstruction. Signal Processing: Image Communication 100, 116,501 (2022)
Yang, J., Wright, J., Huang, T.S., Ma, Y.: Image super-resolution via sparse representation. IEEE transactions on image processing 19(11), 2861–2873 (2010)
Candès, E.J., Romberg, J., Tao, T.: Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on information theory 52(2), 489–509 (2006)
Yang, J., Wang, Z., Lin, Z., Cohen, S., Huang, T.: Coupled dictionary training for image super-resolution. IEEE transactions on image processing 21(8), 3467–3478 (2012)
Zeyde, R., Elad, M., Protter, M.: in International conference on curves and surfaces (Springer), pp. 711–730 (2010)
Wang, S., Zhang, L., Liang, Y., Pan, Q.: in 2012 IEEE Conference on Computer Vision and Pattern Recognition (IEEE), pp. 2216–2223 (2012)
Guo, Y., Chen, J., Wang, J., Chen, Q., Cao, J., Deng, Z., Xu, Y., Tan, M.: in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, (Computer Vision Foundation / IEEE, 2020), pp. 5406–5415 (2020)
Marivani, I., Tsiligianni, E., Cornelis, B., Deligiannis, N.: Multimodal deep unfolding for guided image super-resolution. IEEE Transactions on Image Processing 29, 8443–8456 (2020)
Tao, G., Ji, X., Wang, W., Chen, S., Lin, C., Cao, Y., Lu, T., Luo, D., Tai, Y.: in Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, ed. by M. Ranzato, A. Beygelzimer, Y.N. Dauphin, P. Liang, J.W. Vaughan , pp. 22,643–22,654 (2021)
Guo, T., Seyed Mousavi, H., Huu Vu, T., Monga, V.: in Proceedings of the IEEE conference on computer vision and pattern recognition workshops , pp. 104–113 (2017)
Kim, J., Lee, J.K., Lee, K.M.: in Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1646–1654 (2016)
Tomasi, C., Manduchi, R.: in Sixth international conference on computer vision (IEEE Cat. No. 98CH36271) (IEEE), pp. 839–846 (1998)
Kopf, J., Cohen, M.F., Lischinski, D., Uyttendaele, M.: Joint bilateral upsampling. ACM Transactions on Graphics (ToG) 26(3), 96–es (2007)
He, K., Sun, J., Tang, X.: in European conference on computer vision (Springer), pp. 1–14 (2010)
He, K., Sun, J., Tang, X.: Guided image filtering. IEEE transactions on pattern analysis and machine intelligence 35(6), 1397–1409 (2012)
Shen, X., Yan, Q., Xu, L., Ma, L., Jia, J.: Multispectral joint image restoration via optimizing a scale map. IEEE transactions on pattern analysis and machine intelligence 37(12), 2518–2530 (2015)
Ham, B., Cho, M., Ponce, J.: Robust guided image filtering using nonconvex potentials. IEEE transactions on pattern analysis and machine intelligence 40(1), 192–207 (2017)
Ayas, S., Ekinci, M.: Single image super resolution based on sparse representation using discrete wavelet transform. Multimedia Tools and Applications 77(13), 16685–16698 (2018)
Kumar, N., Verma, R., Sethi, A.: Convolutional neural networks for wavelet domain super resolution. Pattern Recognition Letters 90, 65–71 (2017)
Bae, W., Yoo, J., Chul Ye, J.: in Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 145–153 (2017)
Wu, T., Li, W., Jia, S., Dong, Y., Zeng, T.: Deep multi-level wavelet-cnn denoiser prior for restoring blurred image with cauchy noise. IEEE Signal Processing Letters 27, 1635–1639 (2020)
Nazzal, M., Ozkaramanli, H.: Wavelet domain dictionary learning-based single image superresolution. Signal, Image and Video Processing 9(7), 1491–1501 (2015)
Tropp, J.A.: Greed is good: Algorithmic results for sparse approximation. IEEE Transactions on Information theory 50(10), 2231–2242 (2004)
Xu, J., Qi, C., Chang, Z.: in 2014 IEEE International Conference on Image Processing (ICIP) (IEEE), pp. 3910–3914 (2014)
Pati, Y.C., Rezaiifar, R., Krishnaprasad, P.S.: in Proceedings of 27th Asilomar conference on signals, systems and computers (IEEE), pp. 40–44 (1993)
Tibshirani, R.: Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological) 58(1), 267–288 (1996)
Chen, S.S., Donoho, D.L., Saunders, M.A.: Atomic decomposition by basis pursuit. SIAM review 43(1), 129–159 (2001)
Keys, R.: Cubic convolution interpolation for digital image processing. IEEE transactions on acoustics, speech, and signal processing 29(6), 1153–1160 (1981)
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence 38(2), 295–307 (2015)
Zhang, X., Gao, P., Liu, S., Zhao, K., Li, G., Yin, L., Chen, C.W.: Accurate and efficient image super-resolution via global-local adjusting dense network. IEEE Transactions on Multimedia 23, 1924–1937 (2020)
Huang, W., Liao, X., Zhu, L., Wei, M., Wang, Q.: Single-image super-resolution neural network via hybrid multi-scale features. Mathematics 10(4), 653 (2022)
Song, P., Deng, X., Mota, J.F., Deligiannis, N., Dragotti, P.L., Rodrigues, M.R.: Multimodal image super-resolution via joint sparse representations induced by coupled dictionaries. IEEE Transactions on Computational Imaging 6, 57–72 (2019)
Yasuma, F., Mitsunaga, T., Iso, D., Nayar, S.K.: Generalized assorted pixel camera: postcapture control of resolution, dynamic range, and spectrum. IEEE transactions on image processing 19(9), 2241–2253 (2010)
Author information
Authors and Affiliations
Contributions
A. Substantial contributions to the conceptual design of the work, Implementations, the acquisition, analysis, or interpretation of data for the work. B. Review the whole paper and include some techniques and suggestions regarding implementation parts. C. Revising it critically for important intellectual content. D. Accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. E. Helped with further experimental evaluation, technical suggestions, revision process, refining, and finalizing the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ahmed, A., Kun, S., Ahmed, J. et al. Multimodal image enhancement using convolutional sparse coding. Multimedia Systems 29, 2099–2110 (2023). https://doi.org/10.1007/s00530-023-01074-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-023-01074-1