skip to main content
research-article

Principal Component Approximation Network for Image Compression

Published: 11 January 2024 Publication History

Abstract

In this work, we propose a novel principal component approximation network (PCANet) for image compression. The proposed network is based on the assumption that a set of images can be decomposed into several shared feature matrices, and an image can be reconstructed by the weighted sum of these matrices. The proposed PCANet is specifically devised to learn and approximate these feature matrices and weight vectors, which are used to encode images for compression. Unlike previous deep learning-based methods, a distinctive aspect of our approach is its consideration of network size in the bit-rate computation. Despite this inclusion, our proposed method yields promising results. Through extensive experiments conducted on standard datasets, we demonstrate the effectiveness of our approach in comparison to state-of-the-art techniques. To the best of our knowledge, this is the first machine learning approach that includes the size of networks during bitrate computation in image compression.

References

[1]
Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, and Luc V. Gool. 2017. Soft-to-hard vector quantization for end-to-end learning compressible representations. Adv. Neural Inf. Process. Syst. 30 (2017).
[2]
Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, and Luc Van Gool. 2019. Generative Adversarial Networks for Extreme Learned Image Compression. (Aug.2019). DOI:
[3]
Eze Ahanonu, Michael Marcellin, and Ali Bilgin. 2018. Lossless image compression using reversible integer wavelet transforms and convolutional neural networks. In Proceedings of the Data Compression Conference. 395–395. DOI:
[4]
Nasir Ahmed, T. Natarajan, and Kamisetty R. Rao. 1974. Discrete cosine transform. IEEE Trans. Comput. 100, 1 (1974), 90–93.
[5]
Hadi Amirpour, Antonio Pinheiro, Manuela Pereira, Fernando J. P. Lopes, and Mohammad Ghanbari. 2022. Efficient light field image compression with enhanced random access. ACM Trans. Multim. Comput., Commun. Applic. 18, 2 (Mar.2022), 44:1–44:18. DOI:
[6]
H. C. Andrews and W. K. Pratt. 1968. Fourier transform coding of images. In Proceedings of the Hawaii International Conference: System Sciences. 677–679.
[7]
Mohammad Haris Baig, Vladlen Koltun, and Lorenzo Torresani. 2017. Learning to inpaint for image compression. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc.Retrieved from https://proceedings.neurips.cc/paper/2017/file/013a006f03dbc5392effeb8f18fda755-Paper.pdf
[8]
Johannes Ballé, Valero Laparra, and Eero P. Simoncelli. 2016. End-to-end optimized image compression. In Proceedings of the 5th International Conference on Learning Representations.
[9]
Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, and Nick Johnston. 2018. Variational image compression with a scale hyperprior. In Proceedings of the 6th International Conference on Learning Representations.
[10]
Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. ITU SG16 Doc. VCEG-M33 (2001).
[11]
Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, and Yao Wang. 2021. End-to-end learnt image compression via non-local attention optimization and improved context modeling. IEEE Trans. Image Process. 30 (2021), 3179–3191. DOI:
[12]
Tong Chen and Zhan Ma. 2020. Variable bitrate image compression with quality scaling factors. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’20). 2163–2167. DOI:
[13]
Zhengxue Cheng, Heming Sun, Masaru Takeuchi, and Jiro Katto. 2019. Learning image and video compression through spatial-temporal energy compaction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).
[14]
Zhengxue Cheng, Heming Sun, Masaru Takeuchi, and Jiro Katto. 2020. Learned image compression with discretized Gaussian mixture likelihoods and attention modules. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[15]
Charilaos Christopoulos, Athanassios Skodras, and Touradj Ebrahimi. 2000. The JPEG2000 still image coding system: An overview. IEEE Trans. Consum. Electron. 46, 4 (2000), 1103–1127.
[16]
Y. Wang et al.2014. CDnet 2014: An expanded change detection benchmark dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’14). 393–400. DOI:
[17]
Ge Gao, Pei You, Rong Pan, Shunyuan Han, Yuanyuan Zhang, Yuchao Dai, and Hojae Lee. 2021. Neural image compression via attentional multi-scale back projection and frequency decomposition. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV’21). 14677–14686.
[18]
Wei Gao, Shangkun Sun, Huiming Zheng, Yuyang Wu, Hua Ye, and Yongchi Zhang. 2023. OpenDMC: An open-source library and performance evaluation for deep-learning-based multi-frame compression. In Proceedings of the ACM International Conference on Multimedia (ACM MM’23).
[19]
Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, and Daan Wierstra. 2016. Towards conceptual compression. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc.
[20]
Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Rezende, and Daan Wierstra. 2015. DRAW: A recurrent neural network for image generation. In Proceedings of the 32nd International Conference on Machine Learning (Proceedings of Machine Learning Research), Francis Bach and David Blei (Eds.), Vol. 37. PMLR, 1462–1471.
[21]
Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, and Yan Wang. 2022. ELIC: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 5718–5727.
[22]
Yueyu Hu, Wenhan Yang, Zhan Ma, and Jiaying Liu. 2022. Learning end-to-end lossy image compression: A benchmark. IEEE Trans. Pattern Anal. Mach. Intell. 44, 8 (2022), 4194–4211. DOI:
[23]
David A. Huffman. 1952. A Method for the Construction of Minimum-Redundancy Codes. Proc. IRE 40, 9 (Sept.1952), 1098–1101. DOI:
[24]
Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, and George Toderici. 2018. Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[25]
Jun-Hyuk Kim, Byeongho Heo, and Jong-Seok Lee. 2022. Joint global and local hierarchical priors for learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 5992–6001.
[26]
Jooyoung Lee, Seunghyun Cho, and Seung-Kwon Beack. 2019. Context-adaptive entropy model for end-to-end optimized image compression. In Proceedings of the 7th International Conference on Learning Representations.
[27]
Jae-Han Lee, Seungmin Jeon, Kwang Pyo Choi, Youngo Park, and Chang-Su Kim. 2022. DPICT: Deep progressive image compression using trit-planes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 16113–16122.
[28]
Jianjun Lei, Xiangrui Liu, Bo Peng, Dengchao Jin, Wanqing Li, and Jingxiao Gu. 2022. Deep stereo image compression via bi-directional coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 19669–19678.
[29]
Mu Li, Wangmeng Zuo, Shuhang Gu, Debin Zhao, and David Zhang. 2018. Learning convolutional networks for content-weighted image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[30]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision. Springer, 740–755.
[31]
Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep learning face attributes in the wild. In Proceedings of International Conference on Computer Vision (ICCV’15).
[32]
Siwei Ma, Xinfeng Zhang, Chuanmin Jia, Zhenghui Zhao, Shiqi Wang, and Shanshe Wang. 2019. Image and video compression with neural networks: A review. IEEE Trans. Circ. Syst. Vid. Technol. 30, 6 (2019), 1683–1698.
[33]
Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, and Luc Van Gool. 2018. Conditional probability models for deep image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[34]
David Minnen, Johannes Ballé, and George D. Toderici. 2018. Joint autoregressive and hierarchical priors for learned image compression. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc.Retrieved from https://proceedings.neurips.cc/paper/2018/file/53edebc543333dfbf7c5933af792c9c4-Paper.pdf
[35]
Matthew Muckley, Jordan Juravsky, Daniel Severo, Mannat Singh, Quentin Duval, and Karen Ullrich. 2021. Neural Compression. Retrieved from https://github.com/facebookresearch/NeuralCompre ssion
[36]
Yash Patel, Srikar Appalaraju, and R. Manmatha. 2021. Saliency driven perceptual image compression. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 227–236.
[37]
William K. Pratt, Julius Kane, and Harry C. Andrews. 1969. Hadamard transform image coding. Proc. IEEE 57, 1 (1969), 58–68.
[38]
Hochang Rhee, Yeong Il Jang, Seyun Kim, and Nam Ik Cho. 2022. LC-FDNet: Learned lossless image compression with frequency decomposition network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 6033–6042.
[39]
Oren Rippel and Lubomir Bourdev. 2017. Real-time adaptive image compression. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research), Doina Precup and Yee Whye Teh (Eds.), Vol. 70. PMLR, 2922–2930.
[40]
Jialie Shen, Robert H. Deng, Zhiyong Cheng, Liqiang Nie, and Shuicheng Yan. 2015. On robust image spam filtering via comprehensive visual modeling. Pattern Recog. 48, 10 (Oct.2015), 3227–3238. DOI:
[41]
Jialie Shen and Neil Robertson. 2021. BBAS: Towards large scale effective ensemble adversarial attacks against deep neural network learning. Inf. Sci. 569 (Aug.2021), 469–478. DOI:
[42]
Myungseo Song, Jinyoung Choi, and Bohyung Han. 2021. Variable-rate deep image compression through spatially-adaptive feature transform. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV’21). 2380–2389.
[43]
Lucas Theis, Wenzhe Shi, Andrew Cunningham, and Ferenc Huszár. 2017. Lossy image compression with compressive autoencoders. In Proceedings of the 5th International Conference on Learning Representations.
[44]
George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, and Michele Covell. 2017. Full resolution image compression with recurrent neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[45]
G. K. Wallace. 1992. The JPEG still picture compression standard. IEEE Trans. Consum. Electron. 38, 1 (Feb.1992), xviii–xxxiv. DOI:
[46]
Dezhao Wang, Wenhan Yang, Yueyu Hu, and Jiaying Liu. 2022. Neural data-dependent transform for learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 17379–17388.
[47]
Zhou Wang, Eero P. Simoncelli, and Alan C. Bovik. 2003. Multiscale structural similarity for image quality assessment. In Proceedings of the 37th Asilomar Conference on Signals, Systems & Computers, Vol. 2. IEEE, 1398–1402.
[48]
Ian H. Witten, Radford M. Neal, and John G. Cleary. 1987. Arithmetic coding for data compression. Commun. ACM 30, 6 (June1987), 520–540. DOI:
[49]
Matthias Wödlinger, Jan Kotera, Jan Xu, and Robert Sablatnig. 2022. SASIC: Stereo image compression with latent shifts and stereo attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 661–670.
[50]
Lirong Wu, Kejie Huang, and Haibin Shen. 2020. A GAN-based tunable image compression system. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV’20). 2323–2331. DOI:
[51]
Lei Zhou, Chunlei Cai, Yue Gao, Sanbao Su, and Junmin Wu. 2018. Variational autoencoder for low bit-rate image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2617–2620.
[52]
Xiaosu Zhu, Jingkuan Song, Lianli Gao, Feng Zheng, and Heng Tao Shen. 2022. Unified multivariate gaussian mixture for efficient neural image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 17612–17621.
[53]
Renjie Zou, Chunfeng Song, and Zhaoxiang Zhang. 2022. The devil is in the details: Window-based attention for image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’22). 17492–17501.

Cited By

View all
  • (2024)Multifaceted Approaches for Facial Image Compression: A Review on State of Art Techniques and Applications2024 5th International Conference on Innovative Trends in Information Technology (ICITIIT)10.1109/ICITIIT61487.2024.10580030(1-6)Online publication date: 15-Mar-2024
  • (2024)Quaternion-based 2D-DOST and Stacked Principal Component Analysis Network for Multimodal Face RecognitionApplied Soft Computing10.1016/j.asoc.2024.112154(112154)Online publication date: Aug-2024

Index Terms

  1. Principal Component Approximation Network for Image Compression

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 5
    May 2024
    650 pages
    EISSN:1551-6865
    DOI:10.1145/3613634
    • Editor:
    • Abdulmotaleb El Saddik
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 January 2024
    Online AM: 13 December 2023
    Accepted: 10 December 2023
    Revised: 09 November 2023
    Received: 14 August 2023
    Published in TOMM Volume 20, Issue 5

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Image compression
    2. neural network
    3. decomposition

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)197
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Multifaceted Approaches for Facial Image Compression: A Review on State of Art Techniques and Applications2024 5th International Conference on Innovative Trends in Information Technology (ICITIIT)10.1109/ICITIIT61487.2024.10580030(1-6)Online publication date: 15-Mar-2024
    • (2024)Quaternion-based 2D-DOST and Stacked Principal Component Analysis Network for Multimodal Face RecognitionApplied Soft Computing10.1016/j.asoc.2024.112154(112154)Online publication date: Aug-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media