skip to main content
research-article

Principal Component Approximation Network for Image Compression

Published:11 January 2024Publication History
Skip Abstract Section

Abstract

In this work, we propose a novel principal component approximation network (PCANet) for image compression. The proposed network is based on the assumption that a set of images can be decomposed into several shared feature matrices, and an image can be reconstructed by the weighted sum of these matrices. The proposed PCANet is specifically devised to learn and approximate these feature matrices and weight vectors, which are used to encode images for compression. Unlike previous deep learning-based methods, a distinctive aspect of our approach is its consideration of network size in the bit-rate computation. Despite this inclusion, our proposed method yields promising results. Through extensive experiments conducted on standard datasets, we demonstrate the effectiveness of our approach in comparison to state-of-the-art techniques. To the best of our knowledge, this is the first machine learning approach that includes the size of networks during bitrate computation in image compression.

REFERENCES

  1. [1] Agustsson Eirikur, Mentzer Fabian, Tschannen Michael, Cavigelli Lukas, Timofte Radu, Benini Luca, and Gool Luc V.. 2017. Soft-to-hard vector quantization for end-to-end learning compressible representations. Adv. Neural Inf. Process. Syst. 30 (2017).Google ScholarGoogle Scholar
  2. [2] Agustsson Eirikur, Tschannen Michael, Mentzer Fabian, Timofte Radu, and Gool Luc Van. 2019. Generative Adversarial Networks for Extreme Learned Image Compression. (Aug.2019). DOI:Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Ahanonu Eze, Marcellin Michael, and Bilgin Ali. 2018. Lossless image compression using reversible integer wavelet transforms and convolutional neural networks. In Proceedings of the Data Compression Conference. 395395. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Ahmed Nasir, Natarajan T., and Rao Kamisetty R.. 1974. Discrete cosine transform. IEEE Trans. Comput. 100, 1 (1974), 9093.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Amirpour Hadi, Pinheiro Antonio, Pereira Manuela, Lopes Fernando J. P., and Ghanbari Mohammad. 2022. Efficient light field image compression with enhanced random access. ACM Trans. Multim. Comput., Commun. Applic. 18, 2 (Mar.2022), 44:1–44:18. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Andrews H. C. and Pratt W. K.. 1968. Fourier transform coding of images. In Proceedings of the Hawaii International Conference: System Sciences. 677679.Google ScholarGoogle Scholar
  7. [7] Baig Mohammad Haris, Koltun Vladlen, and Torresani Lorenzo. 2017. Learning to inpaint for image compression. In Advances in Neural Information Processing Systems, Guyon I., Luxburg U. Von, Bengio S., Wallach H., Fergus R., Vishwanathan S., and Garnett R. (Eds.), Vol. 30. Curran Associates, Inc.Retrieved from https://proceedings.neurips.cc/paper/2017/file/013a006f03dbc5392effeb8f18fda755-Paper.pdfGoogle ScholarGoogle Scholar
  8. [8] Ballé Johannes, Laparra Valero, and Simoncelli Eero P.. 2016. End-to-end optimized image compression. In Proceedings of the 5th International Conference on Learning Representations.Google ScholarGoogle Scholar
  9. [9] Ballé Johannes, Minnen David, Singh Saurabh, Hwang Sung Jin, and Johnston Nick. 2018. Variational image compression with a scale hyperprior. In Proceedings of the 6th International Conference on Learning Representations.Google ScholarGoogle Scholar
  10. [10] Bjontegaard Gisle. 2001. Calculation of average PSNR differences between RD-curves. ITU SG16 Doc. VCEG-M33 (2001).Google ScholarGoogle Scholar
  11. [11] Chen Tong, Liu Haojie, Ma Zhan, Shen Qiu, Cao Xun, and Wang Yao. 2021. End-to-end learnt image compression via non-local attention optimization and improved context modeling. IEEE Trans. Image Process. 30 (2021), 31793191. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Chen Tong and Ma Zhan. 2020. Variable bitrate image compression with quality scaling factors. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’20). 21632167. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Cheng Zhengxue, Sun Heming, Takeuchi Masaru, and Katto Jiro. 2019. Learning image and video compression through spatial-temporal energy compaction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Cheng Zhengxue, Sun Heming, Takeuchi Masaru, and Katto Jiro. 2020. Learned image compression with discretized Gaussian mixture likelihoods and attention modules. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Christopoulos Charilaos, Skodras Athanassios, and Ebrahimi Touradj. 2000. The JPEG2000 still image coding system: An overview. IEEE Trans. Consum. Electron. 46, 4 (2000), 11031127.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] al. Y. Wang et2014. CDnet 2014: An expanded change detection benchmark dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’14). 393400. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Gao Ge, You Pei, Pan Rong, Han Shunyuan, Zhang Yuanyuan, Dai Yuchao, and Lee Hojae. 2021. Neural image compression via attentional multi-scale back projection and frequency decomposition. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV’21). 1467714686.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Gao Wei, Sun Shangkun, Zheng Huiming, Wu Yuyang, Ye Hua, and Zhang Yongchi. 2023. OpenDMC: An open-source library and performance evaluation for deep-learning-based multi-frame compression. In Proceedings of the ACM International Conference on Multimedia (ACM MM’23).Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Gregor Karol, Besse Frederic, Rezende Danilo Jimenez, Danihelka Ivo, and Wierstra Daan. 2016. Towards conceptual compression. In Advances in Neural Information Processing Systems, Lee D., Sugiyama M., Luxburg U., Guyon I., and Garnett R. (Eds.), Vol. 29. Curran Associates, Inc.Google ScholarGoogle Scholar
  20. [20] Gregor Karol, Danihelka Ivo, Graves Alex, Rezende Danilo, and Wierstra Daan. 2015. DRAW: A recurrent neural network for image generation. In Proceedings of the 32nd International Conference on Machine Learning (Proceedings of Machine Learning Research), Bach Francis and Blei David (Eds.), Vol. 37. PMLR, 14621471.Google ScholarGoogle Scholar
  21. [21] He Dailan, Yang Ziming, Peng Weikun, Ma Rui, Qin Hongwei, and Wang Yan. 2022. ELIC: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 57185727.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Hu Yueyu, Yang Wenhan, Ma Zhan, and Liu Jiaying. 2022. Learning end-to-end lossy image compression: A benchmark. IEEE Trans. Pattern Anal. Mach. Intell. 44, 8 (2022), 41944211. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Huffman David A.. 1952. A Method for the Construction of Minimum-Redundancy Codes. Proc. IRE 40, 9 (Sept.1952), 10981101. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Johnston Nick, Vincent Damien, Minnen David, Covell Michele, Singh Saurabh, Chinen Troy, Hwang Sung Jin, Shor Joel, and Toderici George. 2018. Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Kim Jun-Hyuk, Heo Byeongho, and Lee Jong-Seok. 2022. Joint global and local hierarchical priors for learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 59926001.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Lee Jooyoung, Cho Seunghyun, and Beack Seung-Kwon. 2019. Context-adaptive entropy model for end-to-end optimized image compression. In Proceedings of the 7th International Conference on Learning Representations.Google ScholarGoogle Scholar
  27. [27] Lee Jae-Han, Jeon Seungmin, Choi Kwang Pyo, Park Youngo, and Kim Chang-Su. 2022. DPICT: Deep progressive image compression using trit-planes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 1611316122.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Lei Jianjun, Liu Xiangrui, Peng Bo, Jin Dengchao, Li Wanqing, and Gu Jingxiao. 2022. Deep stereo image compression via bi-directional coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 1966919678.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Li Mu, Zuo Wangmeng, Gu Shuhang, Zhao Debin, and Zhang David. 2018. Learning convolutional networks for content-weighted image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Lin Tsung-Yi, Maire Michael, Belongie Serge, Hays James, Perona Pietro, Ramanan Deva, Dollár Piotr, and Zitnick C. Lawrence. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision. Springer, 740755.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep learning face attributes in the wild. In Proceedings of International Conference on Computer Vision (ICCV’15).Google ScholarGoogle Scholar
  32. [32] Ma Siwei, Zhang Xinfeng, Jia Chuanmin, Zhao Zhenghui, Wang Shiqi, and Wang Shanshe. 2019. Image and video compression with neural networks: A review. IEEE Trans. Circ. Syst. Vid. Technol. 30, 6 (2019), 16831698.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Mentzer Fabian, Agustsson Eirikur, Tschannen Michael, Timofte Radu, and Gool Luc Van. 2018. Conditional probability models for deep image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Minnen David, Ballé Johannes, and Toderici George D.. 2018. Joint autoregressive and hierarchical priors for learned image compression. In Advances in Neural Information Processing Systems, Bengio S., Wallach H., Larochelle H., Grauman K., Cesa-Bianchi N., and Garnett R. (Eds.), Vol. 31. Curran Associates, Inc.Retrieved from https://proceedings.neurips.cc/paper/2018/file/53edebc543333dfbf7c5933af792c9c4-Paper.pdfGoogle ScholarGoogle Scholar
  35. [35] Muckley Matthew, Juravsky Jordan, Severo Daniel, Singh Mannat, Duval Quentin, and Ullrich Karen. 2021. Neural Compression. Retrieved from https://github.com/facebookresearch/NeuralCompre ssionGoogle ScholarGoogle Scholar
  36. [36] Patel Yash, Appalaraju Srikar, and Manmatha R.. 2021. Saliency driven perceptual image compression. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 227236.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Pratt William K., Kane Julius, and Andrews Harry C.. 1969. Hadamard transform image coding. Proc. IEEE 57, 1 (1969), 5868.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Rhee Hochang, Jang Yeong Il, Kim Seyun, and Cho Nam Ik. 2022. LC-FDNet: Learned lossless image compression with frequency decomposition network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 60336042.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Rippel Oren and Bourdev Lubomir. 2017. Real-time adaptive image compression. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research), Precup Doina and Teh Yee Whye (Eds.), Vol. 70. PMLR, 29222930.Google ScholarGoogle Scholar
  40. [40] Shen Jialie, Deng Robert H., Cheng Zhiyong, Nie Liqiang, and Yan Shuicheng. 2015. On robust image spam filtering via comprehensive visual modeling. Pattern Recog. 48, 10 (Oct.2015), 32273238. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Shen Jialie and Robertson Neil. 2021. BBAS: Towards large scale effective ensemble adversarial attacks against deep neural network learning. Inf. Sci. 569 (Aug.2021), 469478. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Song Myungseo, Choi Jinyoung, and Han Bohyung. 2021. Variable-rate deep image compression through spatially-adaptive feature transform. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV’21). 23802389.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Theis Lucas, Shi Wenzhe, Cunningham Andrew, and Huszár Ferenc. 2017. Lossy image compression with compressive autoencoders. In Proceedings of the 5th International Conference on Learning Representations.Google ScholarGoogle Scholar
  44. [44] Toderici George, Vincent Damien, Johnston Nick, Hwang Sung Jin, Minnen David, Shor Joel, and Covell Michele. 2017. Full resolution image compression with recurrent neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Wallace G. K.. 1992. The JPEG still picture compression standard. IEEE Trans. Consum. Electron. 38, 1 (Feb.1992), xviii–xxxiv. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Wang Dezhao, Yang Wenhan, Hu Yueyu, and Liu Jiaying. 2022. Neural data-dependent transform for learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 1737917388.Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Wang Zhou, Simoncelli Eero P., and Bovik Alan C.. 2003. Multiscale structural similarity for image quality assessment. In Proceedings of the 37th Asilomar Conference on Signals, Systems & Computers, Vol. 2. IEEE, 13981402.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Witten Ian H., Neal Radford M., and Cleary John G.. 1987. Arithmetic coding for data compression. Commun. ACM 30, 6 (June1987), 520540. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Wödlinger Matthias, Kotera Jan, Xu Jan, and Sablatnig Robert. 2022. SASIC: Stereo image compression with latent shifts and stereo attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 661670.Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Wu Lirong, Huang Kejie, and Shen Haibin. 2020. A GAN-based tunable image compression system. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV’20). 23232331. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Zhou Lei, Cai Chunlei, Gao Yue, Su Sanbao, and Wu Junmin. 2018. Variational autoencoder for low bit-rate image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 26172620.Google ScholarGoogle Scholar
  52. [52] Zhu Xiaosu, Song Jingkuan, Gao Lianli, Zheng Feng, and Shen Heng Tao. 2022. Unified multivariate gaussian mixture for efficient neural image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 1761217621.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Zou Renjie, Song Chunfeng, and Zhang Zhaoxiang. 2022. The devil is in the details: Window-based attention for image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 1749217501.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Principal Component Approximation Network for Image Compression

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 5
      May 2024
      650 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3613634
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 January 2024
      • Online AM: 13 December 2023
      • Accepted: 10 December 2023
      • Revised: 9 November 2023
      • Received: 14 August 2023
      Published in tomm Volume 20, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)235
      • Downloads (Last 6 weeks)26

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text