Skip to main content
Log in

DCTNet: deep shrinkage denoising via DCT filterbanks

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Shrinkage algorithms are well-studied, simple yet efficient transform domain denoisers. Two factors greatly affect their performance, namely the types of signal transform and shrinkage function which are used. The purpose of this study is to develop novel deep learning-based variants for transform domain shrinkage approaches. In particular, the Discrete Cosine Transform (DCT) will be considered as the sparsifying transform utilized in conjunction with deep neural networks. There has been comparatively few studies for the amalgamation of the DCT and deep learning compared to other transforms such as Discrete Wavelet Transform (DWT). Main reason for this is the fact that both global and block treatments of the DCT do not provide feature maps (that is subband images) suitable for processing by deep convolutional neural networks (CNNs). On the other hand, researchers have regularly modeled learnable shrinkage functions that are tuned to satisfy properties such as symmetry and monotonicity while restricting denoiser’s performance. In this paper, we propose a novel DCT-based deep denoising algorithm which consists of three blocks: an original DCT block, a deep shrinkage block, and an inverse DCT block. DCT blocks use 2D DCT basis kernels as mapping filters. The resulting transform is called DCT filterbanks (DCT FB) transform. Proposed DCT FB blocks facilitate the effective production of DCT subband images suitable for processing by CNNs. Instead of analytic shrinkage step, the shrinkage operation is parameterized with deep learning layers and is called as shrinkage block. The proposed DCT domain deep shrinkage network, termed as DCTNet, is trained in a supervised manner and provides an effective and improved hybrid of classical patchwise shrinkage algorithms with deep learning. Our experimental results indicate that the proposed method surpasses model-based and deep CNN-based denoisers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Availability of data and materials

The simulations utilize the BSDS500 image dataset, publicly available online at [https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html].

References

  1. Raphan, M., Simoncelli, E.P.: Optimal denoising in redundant representations. IEEE Trans. Image Process. 17(8), 1342–1352 (2008)

    Article  MathSciNet  Google Scholar 

  2. Hel-Or, Y., Ben-Artzi, G.: The role of redundant bases and shrinkage functions in image denoising. IEEE Trans. Image Process. 30, 3778–3792 (2021)

    Article  MathSciNet  Google Scholar 

  3. Liu, P., Zhang, H., Zhang, K., Lin, L., Zuo, W.: Multi-level Wavelet-CNN for Image Restoration. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 773–782 (2018)

  4. Guo, T., Mousavi, H.S., Monga, V.: Adaptive transform domain image super-resolution via orthogonally regularized deep networks. IEEE Trans. Image Process. 28(9), 4685–4700 (2019)

    Article  MathSciNet  MATH  Google Scholar 

  5. Herbreteau, S., Kervrann, C.: DCT2net: An Interpretable Shallow CNN for Image Denoising. IEEE Transactions on Image Processing (2022)

  6. Donoho, D.L., Johnstone, J.M.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3), 425–455 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  7. Coifman, R.R., Donoho, D.L.: In: Antoniadis, A., Oppenheim, G. (eds.) Translation-Invariant De-Noising, pp. 125–150. Springer, New York, NY (1995)

  8. Elad, M.: Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing vol. 2. Springer

  9. Guleryuz, O.G.: Nonlinear approximation based image recovery using adaptive sparse reconstructions and iterated denoising - part I: theory. IEEE Trans. Image Process. 15(3), 539–554 (2006)

    Article  Google Scholar 

  10. Yu, G., Sapiro, G.: DCT image denoising: a simple and effective image denoising algorithm. Image Proc. Line 1, 292–296 (2011)

    Article  Google Scholar 

  11. Pierazzo, N., Morel, J.-M., Facciolo, G.: Multi-scale DCT denoising. Image Proc. Line 7, 288–308 (2017)

    Article  Google Scholar 

  12. Hel-Or, Y., Shaked, D.: A discriminative approach for wavelet denoising. IEEE Trans. Image Process. 17(4), 443–457 (2008)

    Article  MathSciNet  Google Scholar 

  13. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet Classification with Deep Convolutional Neural Networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

  14. Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2018)

    Article  Google Scholar 

  15. Chen, Y., Pock, T.: Trainable nonlinear reaction diffusion: a flexible framework for fast and effective image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1256–1272 (2016)

    Article  Google Scholar 

  16. Zhang, K., Zuo, W., Chen, Y., Meng, D., Zhang, L.: Beyond a Gaussian Denoiser: residual learning of deep CNN for image denoising. IEEE Trans. Image Process. 26(7), 3142–3155 (2017)

    Article  MathSciNet  MATH  Google Scholar 

  17. Zhang, K., Zuo, W., Zhang, L.: FFDNet: Toward a Fast and Flexible Solution for CNN-Based Image Denoising. IEEE Trans. Image Process. 27(9), 4608–4622 (2018)

    Article  MathSciNet  Google Scholar 

  18. Zhang, K., Zuo, W., Gu, S., Zhang, L.: Learning Deep CNN Denoiser Prior for Image Restoration. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3929–3938 (2017)

  19. Tai, Y., Yang, J., Liu, X., Xu, C.: MemNet: A Persistent Memory Network for Image Restoration. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4539–4547 (2017)

  20. Lefkimmiatis, S.: Non-local Color Image Denoising with Convolutional Neural Networks. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (2017)

  21. Ulicny, M., Krylov, V.A., Dahyot, R.: Harmonic Convolutional Networks based on Discrete Cosine Transform. Pattern Recogn. 129, 108707 (2022)

    Article  Google Scholar 

  22. Xu, K., Qin, M., Sun, F., Wang, Y., Chen, Y.-K., Ren, F.: Learning in the Frequency Domain. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1740–1749 (2020)

  23. Pan, H., Badawi, D., Cetin, A.E.: Fast Walsh-Hadamard Transform and Smooth-Thresholding Based Binary Layers in Deep Neural Networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 4650–4659 (2021)

  24. Pan, H., Badawi, D., Cetin, A.E.: Block Walsh-hadamard transform-based binary layers in deep neural networks. ACM Trans. Embedded Comput. Syst. 21(6), 1–25 (2022)

    Article  Google Scholar 

  25. Pan, H., Zhu, X., Atici, S., Cetin, A.E.: DCT Perceptron Layer: A Transform Domain Approach for Convolution Layer. arXiv preprint arXiv:2211.08577 (2022)

  26. Pan, H., Badawi, D., Chen, C., Watts, A., Koyuncu, E., Cetin, A.E.: Deep Neural Network With Walsh-Hadamard Transform Layer for Ember Detection During a Wildfire. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 257–266 (2022)

  27. Guo, T., Seyed Mousavi, H., Huu Vu, T., Monga, V.: Deep Wavelet Prediction for Image Super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 104–113 (2017)

  28. Huang, H., He, R., Sun, Z., Tan, T.: Wavelet-SRNet: A Wavelet-based CNN for Multi-scale Face Super Resolution. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1689–1697 (2017)

  29. Gunturk, B.K., Li, X.: Image Restoration: Fundamentals and Advances, 1st edn. CRC Press Inc, Boca Raton, FL, USA (2017)

    MATH  Google Scholar 

  30. Young, S.I., Zhe, W., Taubman, D., Girod, B.: Transform quantization for CNN compression. IEEE Trans. Pattern Anal. Mach. Intell. 44(9), 5700–5714 (2021)

    Google Scholar 

  31. Wang, Y., Xu, C., Xu, C., Tao, D.: Packing convolutional neural networks in the frequency domain. IEEE Trans. Pattern Anal. Mach. Intell. 41(10), 2495–2510 (2018)

    Article  Google Scholar 

  32. Oppenheim, A.V., Buck, J.R., Schafer, R.W.: Discrete-Time Signal Processing. Prentice Hall, Upper Saddle River, NJ (2001)

    Google Scholar 

  33. Malvar, H.S.: Signal Processing with Lapped Transforms. Artech House Inc, USA (1992)

    MATH  Google Scholar 

  34. Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice Hall, Upper Saddle River, N.J. (2008)

    Google Scholar 

  35. Strang, G.: The discrete cosine transform. SIAM Review 41(1), 135–147 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  36. Elad, M.: Why simple shrinkage is still relevant for redundant representations? IEEE Trans. Inf. Theory 52(12), 5559–5569 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  37. Afonso, M.V., Bioucas-Dias, J.M., Figueiredo, M.A.T.: Fast image recovery using variable splitting and constrained optimization. IEEE Trans. Image Process. 19(9), 2345–2356 (2010)

    Article  MathSciNet  MATH  Google Scholar 

  38. Selesnick, I.W., Figueiredo, M.A.: Signal Restoration with Overcomplete Wavelet Transforms: Comparison of Analysis and Synthesis Priors. In: Wavelets XIII, vol. 7446, pp. 107–121 (2009). spie

  39. Dabov, K., Foi, A., Katkovnik, V., Egiazarian, K.: Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans. Image Process. 16(8), 2080–2095 (2007)

    Article  MathSciNet  Google Scholar 

  40. Gu, S., Zhang, L., Zuo, W., Feng, X.: Weighted Nuclear Norm Minimization with Application to Image Denoising. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2862–2869 (2014)

  41. Martin, D., Fowlkes, C., Tal, D., Malik, J.: A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics. In: Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, vol. 2, pp. 416–423 (2001)

  42. Burger, H.C., Schuler, C.J., Harmeling, S.: Image Denoising: Can Plain Neural Networks Compete with BM3D? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2392–2399 (2012). IEEE

  43. Vedaldi, A., Lenc, K.: MatConvNet: Convolutional Neural Networks for Matlab. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 689–692 (2015)

Download references

Funding

This work was supported by ITU BAP (Istanbul Technical University Research Fund) under project number 42027 (MDK-2019-42027).

Author information

Authors and Affiliations

Authors

Contributions

Both authors contributed to the study conception and design. Hasan H. Karaoglu realized material preparation, data collection, analysis, and simulations. Hasan H. Karaoglu wrote the original manuscript draft. Ender M. Eksioglu carried out editing, supervision, and project administration. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Hasan Huseyin Karaoglu.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Karaoglu, H.H., Eksioglu, E.M. DCTNet: deep shrinkage denoising via DCT filterbanks. SIViP 17, 3665–3676 (2023). https://doi.org/10.1007/s11760-023-02593-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-023-02593-0

Keywords

Navigation