Abstract
This paper presents a scheme and its Field Programmable Gate Array (FPGA) implementation for a system based on combining the bi-dimensional discrete wavelet transformation (2D-DWT) and vector quantization (VQ) for image compression. The 2D-DWT works in a non-separable fashion using a parallel filter structure with distributed control to compute two resolution levels. The wavelet coefficients of the higher frequency sub-bands are vector quantized using multi-resolution codebook and those of the lower frequency sub-band at level two are scalar quantized and entropy encoded. VQ is carried out by self organizing feature map (SOFM) neural nets working at the recall phase. Codebooks are quickly generated off-line using the same nets functioning at the training phase. The complete system, including the 2D-DWT, the multi-resolution codebook VQ, and the statistical encoder, was implemented on a Xilinx Virtex 4 FPGA and is capable of performing real-time compression for digital video when dealing with grayscale 512 × 512 pixels images. It offers high compression quality (PSNR values around 35 dB) and acceptable compression rate values (0.62 bpp).
Similar content being viewed by others
References
Nasrabadi, N.M., King, R.A.: Image coding using vector quantization: a review. IEEE Trans. Commun. 36(8), 957–971 (1988)
Gersho, A., Gray, R.M.: Vector Quantization and Signal Compression. Kluwer, Norwood (1992)
Lewis, A.S., Knowles, G.: Image compression using the 2-D wavelet transform. IEEE Trans. Image Process. 1(2), 244–250 (1992)
Antonini, M., Barlaud, M., Mathieu, P., Daubechies, I.: Image coding using wavelet transform. IEEE Trans. Image Process. 1(2), 205–220 (1992)
Paek, S.K., Kim, L.S.: A real-time wavelet vector quantization algorithm and its VLSI architecture. IEEE Trans. Circuits Syst. Video Technol. 10(3), 475–489 (2000)
Kim, K.B., Kim, S., Kim, G.H.: Vector quantizer of medical image using wavelet transform and enhanced SOM algorithm. Neural Comput. Appl. 15(3), 245–251 (2006)
Aliaga, R.J., Martinez, J.D., Gadea, R., Sebastia, A., Benlloch, J.M., Sanchez, F., Pavon, N., Lerche, C.H.: Corrected position estimation in PET detector modules with multi-anode PMTs using neural networks. IEEE Trans. Nucl. Sci. 53(3 Part 1), 776–783 (2006)
Jones, W., Reed, J., Everman, J., Young, J., Seese, R.: Next generation PET data acquisition architectures. IEEE Trans. Nucl. Sci. 44(3), 1202–1207 (1997)
Jones, W., Reed, J., Everman, J., Luk, P., Gremillion, T., Castleberry, B., Breeding, E., Musr, J., Overbay, M., Schmand, M., Powers, R., Moyers, J., Baker, K., Bendriem, B.: First time measurement of transaxial resolution for a new high-sensitivity PET prototype panel detectors. IEEE Nucl. Sci. Symp. 2, 694–698 (2002)
Mallat, S.G.: Multifrequency channel decompositions of images and wavelet models. IEEE Trans. Acoust. Speech Signal Process 37(12), 2091–2110 (1989)
Colom, R., Gadea, R., Sebastia, A.: A novel FPGA architecture of a 2-D wavelet transform. J. VLSI Signal Process. 2, 273–284 (2006)
Kohonen, T.: Self-Organizing Maps. Series in Information Sciences (1995), 2nd edn, vol 31. Springer, Heidelberg (1997)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ramirez-Agundis, A., Gadea-Girones, R., Colom-Palero, R. et al. A wavelet-VQ system for real-time video compression. J Real-Time Image Proc 2, 271–280 (2007). https://doi.org/10.1007/s11554-007-0064-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-007-0064-7