Abstract
The H.264/AVC Fractional Motion Estimation (FME) with rate-distortion constrained mode decision can improve the rate-distortion efficiency by 2–6 dB in peak signal-to-noise ratio. However, it comes with considerable computation complexity. Acceleration by dedicated hardware is a must for real-time applications. The main difficulty for FME hardware implementation is parallel processing under the constraint of the sequential flow and data dependency. We analyze seven inter-correlative loops extracted from FME procedure and provide decomposing methodologies to obtain efficient projection in hardware implementation. Two techniques, 4×4 block decomposition and efficiently vertical scheduling, are proposed to reuse data among the variable block size and to improve the hardware utilization. Besides, advanced architectures are designed to efficiently integrate the 6-taps 2D finite impulse response, residue generation, and 4×4 Hadamard transform into a fully pipelined architecture. This design is finally implemented and integrated into an H.264/AVC single chip encoder that supports realtime encoding of 720×480 30fps video with four reference frames at 81 MHz operation frequency with 405 K logic gates (41.9% area of the encoder).

















Similar content being viewed by others
References
Joint Video Team (2003). Draft ITU-T recommendation and final draft international standard of joint video specification, ITU-T Recommendation H.264 and ISO/IEC 14496-10 AVC (May).
Wiegand, T., Sullivan, G. J., Bjøntegaard, G., & Luthra, A. (2003). Overview of the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology (CSVT), 13(7), 560–576 (July).
Ostermann, J., Bormans, J., List, P., Marpe, D., Narroschke, M., Pereira, F., et al. (2004). Video coding with H.264/AVC: Tools, performance, and complexity. IEEE Magazine on Circuits and Systems Magazine, 4, 7–28.
ISO (1999). Information technology - coding of audio-visual objects - Part 2: Visual. ISO/IEC 14496-2.
ISO (1996). Information technology - generic coding of moving pictures and associated audio information: Video. ISO/IEC 13818-2 and ITU-T Rec. H.262.
Choi, W.-I., Jeon, B., & Jeong, J. (2003). Fast motion estimation with modified diamond search for variable motion block sizes. In Proceedings of IEEE international conference on image processing (ICIP’03), (pp. 371–374).
Huang, Y.-W., Wang, T.-C., Hsieh, B.-Y., & Chen, L.-G. (2003). Hardware architecture design for variable block size motion estimation in MPEG-4 AVC/JVT/ITU-T H.264. In Proceedings of IEEE international symposium on circuits and systems (ISCAS’03), (pp. 796–799).
Lee, J.-H., & Lee, N.-S. (2004). Variable block size motion estimation algorithm and its hardware architecture for H.264. In Proceedings of IEEE international symposium on circuits and systems (ISCAS’04) (pp. 740–743).
Yap, S. Y., & McCanny, J. V. (2004). A VLSI architecture for variable block size video motion estimation. IEEE Transactions on Circuits and Systems II (CASII), 51, 384–389.
Chen, T.-C., Fang, H.-C., Lian, C.-J., Tsai, C.-H., Huang, Y.-W., Chen, T.-W., et al. (2006). Algorithm analysis and architecture design for HDTV applications. IEEE Circuits and Devices Magazine, 22, 22–31.
Wiegand, T., Zhang, X., & Girod, B. (1999). Long-term memory motion-compensated prediction. IEEE Transactions on Circuits and Systems for Video Technology (CSVT), 9, 70–84 (February).
Sullivan, G. J., & Wiegand, T. (1998). Rate-distortion optimization for video compression. IEEE Signal Processing Magazine, 15(6), 74–90 (November).
Wiegand, T., Schwarz, H., Joch, A., Kossentini, F., & Sullivan, G. J. (2003). Rate-constrained coder control and comparison of video coding standards. IEEE Transactions on Circuits and Systems for Video Technology (CSVT), 13(7), 688–703 (July).
Chao, W.-M., Chen, T.-C., Chang, Y.-C., Hsu, C.-W., & Chen, L.-G. (2003). Computationally controllable integer, half, and quarter-pel motion estimator for MPEG-4 advanced simple profile. In Proceedings of 2003 international symposium on circuits and systems (ISCAS’03) (pp. II788–II791).
Miyama, M., Miyakoshi, J., Kuroda, Y., Imamura, K., Hashimoto, H., & Yoshimoto, M. (2004). A sub-mW MPEG-4 motion estimation processor core for mobile video application. IEEE Journal of Solid-State Circuits, 39, 1562–1570.
Su, Y., & Sun, M.-T. (2004). Fast multiple reference frame motion estimation for H.264. In Proceedings of IEEE international conference on multimedia and expo (ICME’04).
Joint Video Team Reference Software JM8.5 (2004). http://bs.hhi.de/~suehring/tml/download/ (September).
Chang, H.-C., Chen, L.-G., Hsu, M.-Y., & Chang, Y.-C. (2000). Performance analysis and architecture evaluation of MPEG-4 video codec system. In Proceedings of IEEE international symposium on circuits and systems (ISCAS’00), 2, 449–452 (May).
Huang, Y.-W., Chen, T.-C., Tsai, C.-H., Chen, C.-Y., Chen, T.-W., Chen, C.-S., et al. (2005). A 1.3TOPS H.264/AVC Single-Chip Encoder for HDTV Applications. In Proceedings of IEEE international solid-state circuits conference (ISSCC’05) (pp. 128–130).
Chen, T.-C., Huang, Y.-W., & Chen, L.-G. (2004). Analysis and design of macroblock pipelining for H.264/AVC VLSI architecture. In Proceedings of 2004 international symposium on circuits and systems (ISCAS’04) (pp. II273–II276).
Wang, T.-C., Huang, Y.-W. Fang, H.-C., & Chen, L.-G. (2003). Parallel 4x4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264. In Proceedings of IEEE international symposium on circuits and systems (ISCAS’03) (pp. 800–803).
Chen, T.-C., Huang, Y.-W., & Chen, L.-G. (2004). Fully utilized and reusable architecture for fractional motion estimation of H.264/AVC. In Proceedings of IEEE ICASSP (pp. V–9–V–12) (May).
Yang, C., Goto, S., & Ikenaga, T. (2006). High performance VLSI architecture of fractional motion estimation in H.264 for HDTV. In Proc. IEEE ISCAS (pp. 2605–2608).
Chen, T.-C., Chen, Y.-H., Tsai, C.-Y., & Chen, L.-G. (2006). Low power and power aware fractional motion estimation of H.264/AVC for mobile applications. In Proceedings of IEEE international symposium on circuits and systems (ISCAS’06).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, YH., Chen, TC., Chien, SY. et al. VLSI Architecture Design of Fractional Motion Estimation for H.264/AVC. J Sign Process Syst Sign Image Video Technol 53, 335–347 (2008). https://doi.org/10.1007/s11265-008-0213-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-008-0213-7