Abstract
In conventional motion compensated temporal filtering based wavelet coding scheme, where the group of picture structure and low-pass frame position are fixed, variations in motion activities of video sequences are not considered. In this paper, we propose an adaptive group of picture structure selection scheme, which the group of picture size and low-pass frame position are selected based on mutual information. Furthermore, the temporal decomposition process is determined adaptively according to the selected group of picture structure. A large amount of experimental work is carried out to compare the compression performance of proposed method with the conventional motion compensated temporal filtering encoding scheme and adaptive group of picture structure in standard scalable video coding model. The proposed low-pass frame selection can improve the compression quality by about 0.3–0.5 dB comparing to the conventional scheme in video sequences with high motion activities. In the scenes with un-even variation of motion activities, e.g. frequent shot cuts, the proposed adaptive group of picture size can achieve a better compression capability than conventional scheme. When comparing to adaptive group of picture in standard scalable video coding model, the proposed group of picture structure scheme can lead to about 0.2~0.8 dB improvements in sequences with high motion activities or shot cut.







Similar content being viewed by others
References
Andreopoulos Y, Munteanu A, Barbarien J, Van der Schaar M, Cornelis J, Schelkens P (2004) In-band motion compensated temporal filtering. Signal Process Image Commun 19:653–673. doi:10.1016/j.image.2004.05.007
Butz T, Thiran JP (2001) Shot boundary detection with mutual information. IEEE Int Conf Image Process 3:421–424
Cerneková Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circuits Syst Video Technol 16:82–91. doi:10.1109/TCSVT.2005.856896
Chen P Software package of MC-EZBC wavelet coder is publicly available at ftp://ftp.cipr.rpi.edu/personal/chen.
Chen P (2003) Fully scalable subband/wavelet coding. Doctoral Thesis, Rensselaer Polytechnic Institute Troy, New York
Chen P, Woods JW (2004) Bidirectional MC-EZBC With Lifting Implementation. IEEE Trans Circuits Syst Video Technol 14:982–993
Chen C-Y, Huang C-T, Chen Y-H, Chien S-Y, Chen L-G (2006) System analysis of VLSI architecture for 5/3 and 1/3 motion-compensated temporal filtering. IEEE Trans Image Process 54:4004–4014
Cheng W, Liu Y, Xu D (2003) Shot boundary detection based on the knowledge of information theory. IEEE Int Conf Neural Netw Signal Process 2:1237–1241. doi:10.1109/ICNNSP.2003.1281094
Choi S-J, Woods JW (1999) Motion-compensated 3-D subband coding of video. IEEE Trans Image Process 8:155–167. doi:10.1109/83.743851
Dubios E, Sabri S (1984) Noise Reduction in Image Sequences Using Motion-Compensated Temporal Filtering. IEEE Trans Commun COM32(7):826–831
Eeckhaut H, Harald D, Benjamin S, Mark C, & Dirk S (2005) A hardware-friendly wavelet entropy codec for scalable video, IEEE Design, Automation and Test in Europe Conference and Exhibition (DATE’05), vol. 3, pp. 14–19
Hsiang S-T, Woods JW (2001) Embedded video coding using invertible motion compensated 3-D subband/wavelet filter bank. Signal Process Image Commun. 16:705–724. doi:10.1016/S0923-5965(01)00002-9
Lee J, Dickinson BW (1994) Temporally adaptive motion interpolation exploiting temporal masking in visual perception. IEEE Trans Image Process. 3:513–526. doi:10.1109/83.334989
Lee J, Shin I, Park H (2006) Adaptive intra-frame assignment and bit-rate estimation for variable GOP length in H.264. IEEE Trans Circuits Syst Video Technol 16:1271–1279. doi:10.1109/TCSVT.2006.881856
Leonardi R, Ohm J-R (2006) Wavelet Video Coding–an Overview. MPEG Workgroup Video Subgroup, ISO/IEC JTC1/SC29/WG11 W7824, Bangkok, Thailand
Li X (2004) Scalable video compression via overcomplete motion compensated wavelet coding. Signal Process Image Commun 19:637–651. doi:10.1016/j.image.2004.05.006
Luo L, Li J, Li S, Zhuang Z, & Zhang Y-Q (2001) Motion compensated lifting wavelet and its application in video coding, IEEE Int. Conf. Multimedia and Expo ICME, pp. 365–368
Ohm J-R (1994) Three-dimensional subband coding with motion compensation. IEEE Trans Image Process 3:559–571. doi:10.1109/83.334985
Ohm J-R (2005) Advances in Scalable Video Coding, Proceedings of the IEEE, vol. 93, Issue 1, pp. 42–56
Park GH, Park MW, Jeong S, Kim K, Hong J (2005) Improve SVC coding efficiency by adaptive GOP structure (SVC CE2). Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6) JVT-O018, Korea
Park MW, Park GH, Jeong S, Suh D-Y, & Kim K (2007) Adaptive GOP Structure for Joint Scalable Video Coding, IEICE Trans. Communication, vol. E 90-B(2).
Park GH, Park MW, Jeong S, Cha J, Kim K, Hong J (2005) Adaptive GOP structure for SVC. ISO/IEC/JTC1/SC29/WG11/MPEG/ M11563, Hong Kong
Pesquet-Popescu B, & Bottreau V (2001) Three-dimensional lifting schemes for motion-compensated video compression, IEEE Int. Conf. Acoustics, Speech, and Signal Processing, pp. 1793–1796
Secker A, Taubman D (2001) Motion-compensated highly-scalable video compression using an adaptive 3D wavelet transform based on lifting. IEEE Int. Conf. Image Process. 2:1029–1032
Song H, Kim J, Jay Kuo C-C (1999) Real-time encoding frame rate control for H.263+ video over the internet. Signal Process Image Commun. 15:127–148. doi:10.1016/S0923-5965(99)00027-2
Tillier C, Pesquet-Popescu B, van der Schaar M (2006) 3-Band motion-compensated temporal structures for scalable video coding. IEEE Trans Image Process. 15:2545–2557. doi:10.1109/TIP.2006.877411
Turaga DS, van der Schaar M, Andreopoulos Y, Munteanu A, Schelkens P (2005) Unconstrained motion compensated temporal filtering (UMCTF) for efficient and flexible interframe wavelet video coding. Signal Process Image Commun. 20:1–19. doi:10.1016/j.image.2004.08.006
Wang L (2000) Rate control for MPEG video coding. Signal Process Image Commun. 15:493–511. doi:10.1016/S0923-5965(99)00009-0
Wang Y, Cui S, Fowler JE (2006) 3-D Video coding with redundant-wavelet multihypothesis. IEEE Trans Circuits Syst Video Technol 16:166–177. doi:10.1109/TCSVT.2005.861940
Wang Y-L, Wang J-X, Lai Y-W, & Su AWY (2005) Dynamic GOP structure determination for real-time MEPG-4 advanced simple profile video encoder. IEEE Int Conf Multimedia and Expo pp. 293–296
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, ZG., Peng, YH. & Yang, Y. An adaptive GOP structure selection for haar-like MCTF encoding based on mutual information. Multimed Tools Appl 43, 25–43 (2009). https://doi.org/10.1007/s11042-008-0255-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-008-0255-7