Skip to main content
Log in

A High Throughput Processor Chip for Transform and Quantization Coding in H.264/AVC

  • Published:
Journal of Signal Processing Systems Aims and scope Submit manuscript

Abstract

This paper presents an ASIC processor chip for real-time implementation of the computing of the complete process of forward transform, quantization, inverse transform, dequantization, and reconstruction of a 16 × 16 macroblock in full compliance with the H.264/AVC video coding standard. This processor is capable of processing 4 × 4 blocks without interruption, with a parallelism in the data-path of 16 data/cycle, in a pipeline architecture with the twofold aim of achieving high operation frequency and high throughput. To implement the four 4 × 4 transforms and two 2 × 2 transforms required in the H.264/AVC coding system, two configurable multitransform direct 2-D architectures are used, one for forward and another for inverse. Moreover, a reduction in hardware is achieved by reformulating of quantization and dequantization equations and appropriately adjusting the datapath bus widths. A prototype of this processor chip was fabricated in the HCMOS9 STMicroelectronics 130 nm standard cell technology. The latency for 16 × 16 macroblocks is 26 clock cycles in normal mode and 42 in Intra 16 × 16 mode with a maximum operating frequency of 280 MHz and a throughput of 4,480 Mpixels/s. As a result, our processor chip is able to support the UHDTV 7680 × 4320@60 Hz (3 G sample/s) format requirement.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10

Similar content being viewed by others

References

  1. Wiegand, T., Sullivan, G. J., Bjontegaard, G., & Luthra, A. (2003). Overview of the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems For Video Technology, 13(7), 560–576.

    Article  Google Scholar 

  2. Joint Video Team of ITU-T and ISO/IEC (2004). “Draft text of H.264/AVC fidelity range extensions amendment”, Doc. JVT-L047.

  3. Sühring, K. H.264/AVC Software Coordination (http://iphome.hhi.de/suehring/tml/), Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Image Processing Research Department, Berlin, Germany.

  4. Malvar, H. S., Hallapuro, A., Karczewicz, M., & Kerofsky, L. (2003). Low-complexity transform and quantization in H.264/AVC. IEEE Trans. on Circuits and Systems for Video Technology, 13(7), 598–603.

    Article  Google Scholar 

  5. Gordon, S., Marpe, D., Wiegand, T. (2004). “Simplified use of 8 × 8 transforms”, JVT Doc. JVT-K028, Munich, March.

  6. Wang, T.C., Huang, Y.W., Fang, H.C., Chen, L.G. (2003). “Parallel 4 × 4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264,” in Proceedings of IEEE International Symposium on Circuits and Systems, pp. 800−803.

  7. Huang, Y. W., Hsieh, B. Y., Chen, T. C., & Chen, L. G. (2005). Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra frame coder. IEEE Transactions on Circuits and Systems for Video Technology, 15(3), 378–401.

    Article  Google Scholar 

  8. Liu, L., Qiu, L., Rong, M., Jiang, L. (2004). “A 2-D forward/inverse transform processor of H.264 based on highly-parallel architecture,” 4th IEEE International Workshop System-on-Chip for Real-Time Applications, pp. 158−161.

  9. Lee, J.J., Park, S., and Eum, N. (2008). “Design of application specific processor for H.264 inverse transform and quantization,” International SoC Design Conference, pp. II.57–II.60.

  10. Fan, C. P. (2006). Fast 2-dimensional 4 × 4 forward integer transform implementation for H.264/AVC. IEEE Transactions on Circuits and Systems II: Express Briefs, 53(3), 174–177.

    Article  Google Scholar 

  11. Porto, R., Porto, M., Bampi, S., Agostini, L. (2007). “High throughput architecture for forward transforms module of H.264/AVC video coding standard,” IEEE International Conference on Electronics, Circuits and Systems, pp. 150−153.

  12. Sampaio, F., Dornelles, R., Palomino, D., Siedler, G., Correa, M., and Agostini, L. (2010). “A multitransform architecture for the H.264/AVC standard and its design space exploration,” 16th IEEE International Conference on Electronics, Circuits, and Systems (ICECS), pp. 711–714.

  13. Shi, B., Zheng, W., Li, D., Zhang, M. (2007). “Fast algorithm and architecture design for H.264/AVC multiple transforms,” IEEE International Conference on Multimedia and Expo, pp. 2086−2089.

  14. Wei, C., Hui, H., Jinmei, L., Jiarong, T., Hao, M. (2008). “A novel dynamic reconfigurable VLSI architecture for H.264 transforms,” IEEE Asia Pacific Conference on Circuits and Systems, pp. 1810−1813.

  15. Chen, K. H., Guo, J. I., & Wang, J. S. (2006). A high-performance direct 2-D transform coding IP design for MPEG-4 AVC/H.264. IEEE Transactions on Circuits and Systems for Video Technology, 16(4), 472–483.

    Article  Google Scholar 

  16. Li, Y., He, Y., & Mei, S. (2008). A Highly Parallel Joint VLSI Architecture for Transforms in H.264/AVC. Journal of Signal Processing Systems, 50, 19–32.

    Article  Google Scholar 

  17. Chen, K.H., Guo, J.I., Chao, K.C., Wang, J.S., and Chu, Y.S., (2005). “A high-performance low power direct 2-D transform coding IP design for MPEG-4 AVC/H.264 with a switching power suppression technique,” International Symposium on VLSI Technology Systems and Applications, pp. 291−294.

  18. Cheng, Z.Y., Chen, C.H., Liu, B.D., Yang, J.F. (2004). “High throughput 2-D transform architectures for H.264 advanced video coders,” IEEE Asia-Pacific Conference on Circuits and Systems, pp. 1141–1144.

  19. Ngo, N.T., Do, T.T.T., Le, T.M., Kadam, Y.S., Bermak, A. (2008). “ASIP-controlled inverse integer transform for H.264/AVC compression,” 19th IEEE/IFIP International Symposium on Rapid System Prototyping, pp. 158−164.

  20. Raja, G., Khan, S., Mirza, M.J. (2005). “VLSI Architecture & Implementation of H.264 Integer Transform, ”17th International Conference on Microelectronics, pp. 218−223.

  21. Lin, H. Y., Chao, Y. C., Chen, C. H., Liu, B. D., & Yang, J. F. (2005). Combined 2-D transform and Quantization architectures for H.264 video coders. IEEE International Symposium on Circuits and Systems, 2, 1802–1805.

    Google Scholar 

  22. Husemann, R., Majolo, M., Susin, A., Roesler, V., and Valdeni de Lima, J. (2010). “New integrated architecture for H.264 Transform and Quantization hardware implementation,” 53rd IEEE International Midwest Symposium on Circuits and Systems (MWSCAS), pp. 379–382.

  23. Keshaveni, N., Ramachandran, S., and Gurumurthy, K.S. (2009). “Design and Implementation of Integer Transform and Quantization Processor for H.264 Encoder on FPGA,” International Conference on Advances in Computing, Control, & Telecommunication Technologies, pp. 646–649.

  24. Kordasiewicz, R., & Shirani, S. (2007). On hardware implementations of DCT and quantization blocks for H.264/AVC. Journal of VLSI Signal Processing, 47, 93–102.

    Article  Google Scholar 

  25. Chungan, P., Dunshan, Y., Xixin, C., Shimin Sheng, S. (2007). “A new high throughput VLSI architecture for H.264 transform and quantization,” 7th International Conference ASICON ’07, pp. 950−953.

  26. Owaida, M., Koziri, M., Katsavounidis, I., Stamoulis, G. (2009). “A high performance and low power hardware architecture for the transform & quantization stages in H.264,” IEEE International Conference on Multimedia and Expo, pp. 1102−1105.

  27. Lin, Y., Zeng, X., Chen, J., Zhang, Z. (2006). “A low cost design for optimized IDCT and de-quantization of H.264,” 8th International Conference on Solid-State and Integrated Circuit Technology, pp. 1893−1895.

  28. Ho, D.J., Zhang, C.C. (2004). “Hardware Architecture for an Integrated Inverse Transform and Quantization of the H.264 Decoder,” Proceedings of 10th International Symposium on Integrated Circuits, Devices & Systems.

  29. Chao, Y. C., Wei, S. T., Liu, B. D., & Yang, J. F. (2009). Combined CAVLC decoder, inverse quantizer, and transform kernel in compact H.264/AVC decoder. IEEE Transactions on Circuits and Systems for Video Technology, 19(1), 53–62.

    Article  Google Scholar 

  30. Tasdizen, O., Hamzaoglu, I. (2005). “A High Performance and Low Cost Hardware Architecture for H.264 Transform and quantization algorithms,” Antalya, Turkey: 13th European Signal Processing Conference, pp. 4−8.

  31. Logashanmugam, E., & Ramachandran, R. (2008). An efficient hardware architecture for H.264 transform and quantization algorithms. International Journal of Computer Science and Network Security, 8(6), 167–173.

    Google Scholar 

  32. Lee, S., Cho, K. (2006). “Implementation of an AMBA-Compliant IP for H.264 transform and quantization,” Asia Pacific Conference on Circuits and Systems, pp. 1071−1076.

  33. Song, S., Seo, C., Kim, K. (2008). “A unified transform unit for H.264,” International SoC Design Conference, pp. II-130−II-133.

Download references

Acknowledgment

We wish to acknowledge the financial support of the Spanish Ministry of Education and Science through TEC2006-12438/TCM received to finance this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miguel A. Manzano.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Manzano, M.A., Michell, J.A. & Ruiz, G.A. A High Throughput Processor Chip for Transform and Quantization Coding in H.264/AVC. J Sign Process Syst 70, 59–73 (2013). https://doi.org/10.1007/s11265-012-0660-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11265-012-0660-z

Keywords

Navigation