DCTNet: deep shrinkage denoising via DCT filterbanks

Karaoglu, Hasan Huseyin; Eksioglu, Ender Mete

doi:10.1007/s11760-023-02593-0

DCTNet: deep shrinkage denoising via DCT filterbanks

Original Paper
Published: 02 June 2023

Volume 17, pages 3665–3676, (2023)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Hasan Huseyin Karaoglu¹ &
Ender Mete Eksioglu¹

367 Accesses
1 Citation
Explore all metrics

Abstract

Shrinkage algorithms are well-studied, simple yet efficient transform domain denoisers. Two factors greatly affect their performance, namely the types of signal transform and shrinkage function which are used. The purpose of this study is to develop novel deep learning-based variants for transform domain shrinkage approaches. In particular, the Discrete Cosine Transform (DCT) will be considered as the sparsifying transform utilized in conjunction with deep neural networks. There has been comparatively few studies for the amalgamation of the DCT and deep learning compared to other transforms such as Discrete Wavelet Transform (DWT). Main reason for this is the fact that both global and block treatments of the DCT do not provide feature maps (that is subband images) suitable for processing by deep convolutional neural networks (CNNs). On the other hand, researchers have regularly modeled learnable shrinkage functions that are tuned to satisfy properties such as symmetry and monotonicity while restricting denoiser’s performance. In this paper, we propose a novel DCT-based deep denoising algorithm which consists of three blocks: an original DCT block, a deep shrinkage block, and an inverse DCT block. DCT blocks use 2D DCT basis kernels as mapping filters. The resulting transform is called DCT filterbanks (DCT FB) transform. Proposed DCT FB blocks facilitate the effective production of DCT subband images suitable for processing by CNNs. Instead of analytic shrinkage step, the shrinkage operation is parameterized with deep learning layers and is called as shrinkage block. The proposed DCT domain deep shrinkage network, termed as DCTNet, is trained in a supervised manner and provides an effective and improved hybrid of classical patchwise shrinkage algorithms with deep learning. Our experimental results indicate that the proposed method surpasses model-based and deep CNN-based denoisers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review of convolutional neural networks in computer vision

Article Open access 23 March 2024

A survey of the recent architectures of deep convolutional neural networks

Article 21 April 2020

Deep learning models for digital image processing: a review

Article 07 January 2024

Availability of data and materials

The simulations utilize the BSDS500 image dataset, publicly available online at [https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html].

References

Raphan, M., Simoncelli, E.P.: Optimal denoising in redundant representations. IEEE Trans. Image Process. 17(8), 1342–1352 (2008)
Article MathSciNet Google Scholar
Hel-Or, Y., Ben-Artzi, G.: The role of redundant bases and shrinkage functions in image denoising. IEEE Trans. Image Process. 30, 3778–3792 (2021)
Article MathSciNet Google Scholar
Liu, P., Zhang, H., Zhang, K., Lin, L., Zuo, W.: Multi-level Wavelet-CNN for Image Restoration. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 773–782 (2018)
Guo, T., Mousavi, H.S., Monga, V.: Adaptive transform domain image super-resolution via orthogonally regularized deep networks. IEEE Trans. Image Process. 28(9), 4685–4700 (2019)
Article MathSciNet MATH Google Scholar
Herbreteau, S., Kervrann, C.: DCT2net: An Interpretable Shallow CNN for Image Denoising. IEEE Transactions on Image Processing (2022)
Donoho, D.L., Johnstone, J.M.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3), 425–455 (1994)
Article MathSciNet MATH Google Scholar
Coifman, R.R., Donoho, D.L.: In: Antoniadis, A., Oppenheim, G. (eds.) Translation-Invariant De-Noising, pp. 125–150. Springer, New York, NY (1995)
Elad, M.: Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing vol. 2. Springer
Guleryuz, O.G.: Nonlinear approximation based image recovery using adaptive sparse reconstructions and iterated denoising - part I: theory. IEEE Trans. Image Process. 15(3), 539–554 (2006)
Article Google Scholar
Yu, G., Sapiro, G.: DCT image denoising: a simple and effective image denoising algorithm. Image Proc. Line 1, 292–296 (2011)
Article Google Scholar
Pierazzo, N., Morel, J.-M., Facciolo, G.: Multi-scale DCT denoising. Image Proc. Line 7, 288–308 (2017)
Article Google Scholar
Hel-Or, Y., Shaked, D.: A discriminative approach for wavelet denoising. IEEE Trans. Image Process. 17(4), 443–457 (2008)
Article MathSciNet Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet Classification with Deep Convolutional Neural Networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2018)
Article Google Scholar
Chen, Y., Pock, T.: Trainable nonlinear reaction diffusion: a flexible framework for fast and effective image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1256–1272 (2016)
Article Google Scholar
Zhang, K., Zuo, W., Chen, Y., Meng, D., Zhang, L.: Beyond a Gaussian Denoiser: residual learning of deep CNN for image denoising. IEEE Trans. Image Process. 26(7), 3142–3155 (2017)
Article MathSciNet MATH Google Scholar
Zhang, K., Zuo, W., Zhang, L.: FFDNet: Toward a Fast and Flexible Solution for CNN-Based Image Denoising. IEEE Trans. Image Process. 27(9), 4608–4622 (2018)
Article MathSciNet Google Scholar
Zhang, K., Zuo, W., Gu, S., Zhang, L.: Learning Deep CNN Denoiser Prior for Image Restoration. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3929–3938 (2017)
Tai, Y., Yang, J., Liu, X., Xu, C.: MemNet: A Persistent Memory Network for Image Restoration. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4539–4547 (2017)
Lefkimmiatis, S.: Non-local Color Image Denoising with Convolutional Neural Networks. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (2017)
Ulicny, M., Krylov, V.A., Dahyot, R.: Harmonic Convolutional Networks based on Discrete Cosine Transform. Pattern Recogn. 129, 108707 (2022)
Article Google Scholar
Xu, K., Qin, M., Sun, F., Wang, Y., Chen, Y.-K., Ren, F.: Learning in the Frequency Domain. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1740–1749 (2020)
Pan, H., Badawi, D., Cetin, A.E.: Fast Walsh-Hadamard Transform and Smooth-Thresholding Based Binary Layers in Deep Neural Networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 4650–4659 (2021)
Pan, H., Badawi, D., Cetin, A.E.: Block Walsh-hadamard transform-based binary layers in deep neural networks. ACM Trans. Embedded Comput. Syst. 21(6), 1–25 (2022)
Article Google Scholar
Pan, H., Zhu, X., Atici, S., Cetin, A.E.: DCT Perceptron Layer: A Transform Domain Approach for Convolution Layer. arXiv preprint arXiv:2211.08577 (2022)
Pan, H., Badawi, D., Chen, C., Watts, A., Koyuncu, E., Cetin, A.E.: Deep Neural Network With Walsh-Hadamard Transform Layer for Ember Detection During a Wildfire. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 257–266 (2022)
Guo, T., Seyed Mousavi, H., Huu Vu, T., Monga, V.: Deep Wavelet Prediction for Image Super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 104–113 (2017)
Huang, H., He, R., Sun, Z., Tan, T.: Wavelet-SRNet: A Wavelet-based CNN for Multi-scale Face Super Resolution. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1689–1697 (2017)
Gunturk, B.K., Li, X.: Image Restoration: Fundamentals and Advances, 1st edn. CRC Press Inc, Boca Raton, FL, USA (2017)
MATH Google Scholar
Young, S.I., Zhe, W., Taubman, D., Girod, B.: Transform quantization for CNN compression. IEEE Trans. Pattern Anal. Mach. Intell. 44(9), 5700–5714 (2021)
Google Scholar
Wang, Y., Xu, C., Xu, C., Tao, D.: Packing convolutional neural networks in the frequency domain. IEEE Trans. Pattern Anal. Mach. Intell. 41(10), 2495–2510 (2018)
Article Google Scholar
Oppenheim, A.V., Buck, J.R., Schafer, R.W.: Discrete-Time Signal Processing. Prentice Hall, Upper Saddle River, NJ (2001)
Google Scholar
Malvar, H.S.: Signal Processing with Lapped Transforms. Artech House Inc, USA (1992)
MATH Google Scholar
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice Hall, Upper Saddle River, N.J. (2008)
Google Scholar
Strang, G.: The discrete cosine transform. SIAM Review 41(1), 135–147 (1999)
Article MathSciNet MATH Google Scholar
Elad, M.: Why simple shrinkage is still relevant for redundant representations? IEEE Trans. Inf. Theory 52(12), 5559–5569 (2006)
Article MathSciNet MATH Google Scholar
Afonso, M.V., Bioucas-Dias, J.M., Figueiredo, M.A.T.: Fast image recovery using variable splitting and constrained optimization. IEEE Trans. Image Process. 19(9), 2345–2356 (2010)
Article MathSciNet MATH Google Scholar
Selesnick, I.W., Figueiredo, M.A.: Signal Restoration with Overcomplete Wavelet Transforms: Comparison of Analysis and Synthesis Priors. In: Wavelets XIII, vol. 7446, pp. 107–121 (2009). spie
Dabov, K., Foi, A., Katkovnik, V., Egiazarian, K.: Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans. Image Process. 16(8), 2080–2095 (2007)
Article MathSciNet Google Scholar
Gu, S., Zhang, L., Zuo, W., Feng, X.: Weighted Nuclear Norm Minimization with Application to Image Denoising. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2862–2869 (2014)
Martin, D., Fowlkes, C., Tal, D., Malik, J.: A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics. In: Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, vol. 2, pp. 416–423 (2001)
Burger, H.C., Schuler, C.J., Harmeling, S.: Image Denoising: Can Plain Neural Networks Compete with BM3D? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2392–2399 (2012). IEEE
Vedaldi, A., Lenc, K.: MatConvNet: Convolutional Neural Networks for Matlab. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 689–692 (2015)

Download references

Funding

This work was supported by ITU BAP (Istanbul Technical University Research Fund) under project number 42027 (MDK-2019-42027).

Author information

Authors and Affiliations

Electronics and Communication Engineering Department, Istanbul Technical University, Maslak, Istanbul, 34469, Turkey
Hasan Huseyin Karaoglu & Ender Mete Eksioglu

Authors

Hasan Huseyin Karaoglu
View author publications
You can also search for this author in PubMed Google Scholar
Ender Mete Eksioglu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Both authors contributed to the study conception and design. Hasan H. Karaoglu realized material preparation, data collection, analysis, and simulations. Hasan H. Karaoglu wrote the original manuscript draft. Ender M. Eksioglu carried out editing, supervision, and project administration. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Hasan Huseyin Karaoglu.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Karaoglu, H.H., Eksioglu, E.M. DCTNet: deep shrinkage denoising via DCT filterbanks. SIViP 17, 3665–3676 (2023). https://doi.org/10.1007/s11760-023-02593-0

Download citation

Received: 02 February 2023
Revised: 07 April 2023
Accepted: 08 April 2023
Published: 02 June 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s11760-023-02593-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DCTNet: deep shrinkage denoising via DCT filterbanks

Abstract

Access this article

Similar content being viewed by others

A review of convolutional neural networks in computer vision

A survey of the recent architectures of deep convolutional neural networks

Deep learning models for digital image processing: a review

Availability of data and materials

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

DCTNet: deep shrinkage denoising via DCT filterbanks

Abstract

Access this article

Similar content being viewed by others

A review of convolutional neural networks in computer vision

A survey of the recent architectures of deep convolutional neural networks

Deep learning models for digital image processing: a review

Availability of data and materials

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation