Abstract
In video transcoding, accuracy and efficiency of macroblock mode decision are critical issues at the re-encoder side due to the changes in frame size, frame rate, and bit rate. In this paper, a fast macroblock mode decision scheme based on support vector machines is proposed for H.264/AVC baseline profile video transcoder. Features including motion vectors, residual data, pre-encoded macroblock modes, and quantization parameters are extracted from incoming bitstream in both of training stage and classification stage. Feature extraction methods are investigated for spatial resolution transcoder, temporal resolution transcoder, and bit-rate transcoder. After off-line training and simplification of support vectors, the obtained support vector machine classifier can determine macroblock mode in the re-encoder accurately. Extensive experiments are carried out on different types of transcoders and results show that the proposed method can save about 80% in computational complexity compared to full mode search algorithm implemented in the latest H.264/AVC reference software (JM17.1), while maximum peak signal-to-noise ratio is degraded by 0.2–1.1 dB depending on different sequences and bit rate.















Similar content being viewed by others
References
Ahmad, I., Wei, X., Sun, Y., Zhang, Y.-Q.: Video transcoding: an overview of various techniques and research issues. IEEE Trans. Multimed. 7(5), 793–804 (2005)
Xin, J., Lin, C.-W., Sun, M.-T.: Digital video transcoding. Proc. IEEE 93(1), 84–97 (2005)
Kim, J.-H., Kim, B.-G.: Fast block mode decision algorithm in H.264/AVC video coding. J. Vis. Commun. Image. Represent. 19(3), 175–183 (2008)
Richardson, I.E.G.: H.264 and MPEG-4 video compression: video coding for next-generation multimedia. Wiley, England (2003)
Kuo, T.-Y., Chan, C.-H.: Fast variable block size motion estimation for H.264 using likelihood and correlation of motion Field. IEEE. Transact. Circ. Syst. Video Technol. 16(10), 1185–1195 (2006)
Tsai, A.-C., Paul, A., Wang, J.-C., Wang, J.-F.: Intensity gradient technique for efficient intra-prediction in H.264/AVC. IEEE. Transact. Circ. Syst. Video Technol. 18(5), 694–698 (2008)
Xin, J., Vetro, A., Sun, H., Su, Y.: Efficient MPEG-2 to H.264/AVC transcoding of intra-coded video. EURASIP J. Adv. Signal Process., 12 (2007)
Lefol, D., Bull, D., Canagarajah, N., Redmill, D.: An efficient complexity-scalable video transcoder with mode refinement. Signal. Process. Image. Commun. 22, 421–433 (2007)
Kalva, H., Petljanski, B.: Exploiting the directional features in MPEG-2 for H.264 intra transcoding. IEEE Trans. Consumer Electron. 52(2), 706–711 (2006)
Bialkowski, J., Barkowsky, M., Kaup, A.: Fast video transcoding from H.263 to H.264/MPEG-4 AVC. Multime´d. Tools App. 35, 127–146 (2007)
Lee, Y.-K., Lee, S.-S., Lee, Y.-L.: MPEG-4 to H.264 transcoding with frame rate reduction. Multime´d. Tools App. 35, 147–162 (2007)
Li, C.-H., Wang, C.-N., Chiang, T.: A fast downsizing video transcoder based on H.264/AVC standard. In: 5th Pacific Rim Conference on Multimedia (PCM 2004), Lecture Notes in Computer Science vol. 3331. Springer, Japan, pp. 215–223 (2004)
Lefol, D., Bull, D.: Mode refinement algorithm for H.264 inter frame requantization, IEEE International Conference on Image Processing (ICIP 2006), pp. 845–848, Atlanta, GA, USA, Oct. (2006)
De Cock, J., Notebasert, S., Lambert, P., De Schrijver, D., Van de Walle, R.: Requantization transcoding in pixel and frequency domain for intra 16x16 in H.264/AVC, Advanced Concepts for Intelligent Vision Systems (ACIVS 2006), LNCS vol. 4179, pp. 533–544, Belgium, Sept. (2006)
Notebasert, S., De Cock, J., De Wolf, K., Van de Walle, R.: Requantization transcoding of H.264/AVC bitstreams for intra 4 × 4 prediction modes. In: 2006 Pacific-Rim Conference on Multimedia (PCM 2006), Hangzhou, China. LNCS vol. 4261, pp. 808–817, Nov. (2006)
Liu, Z.-G., Wang, Y.: Intra Mode Selection in Downsizing Video Transcoder Based on H.264. Int. J. Imaging Syst. Technol. 19(4), 340–349 (2009)
Lamy-Bergot, C., Renan, E., Gadat, B., Lavaux, D.: Data supervision for adaptively transcoded video surveillance over wireless links. In: IEEE 9th International Conference on Intelligent Transport Systems Telecommunications (ITST 2009), France, pp. 415–419 LILLE, Oct. 2009
Fernández-Escribano, G., Bialkowski, J., Gámez, J.A., Kalva, H., Cuenca, P., Orozco-Barbosa, L., Kaup, A.: Low-complexity heterogeneous video transcoding using data mining. IEEE Trans. Multime´d. 10(2), 286–299 (2008)
Jing, X., Siu, W.-C., Chau, L.-P., Constantinides, A.G.: Efficient inter mode decision for H.263 to H.264 video transcoding using support vector machines. IEEE International Symposium on Circuits and Systems (ISCAS 2009), pp. 2349–2352, Taipei May 2009
Liu, Y., Zheng, Y.F.: Soft SVM and its application in video-object extraction. IEEE Trans. Signal Process. 55(7), 3272–3282 (2007)
Yuan, J., Wang, H., Xiao, L., Zheng, W., Jianmin, L., Lin, F., Zhang, B.: A formal study of shot boundary detection. IEEE Trans. Circuits Syst. Video Technol. 17(2), 168–186 (2007)
Zhou, J., Gao, D., Zhang, D.: Moving vehicle detection for automatic traffic monitoring. IEEE Trans. Veh. Technol. 56(1), 51–59 (2007)
Shanableh, T., Ghanbari, M.: Heterogeneous video transcoding to lower spatio-temporal resolutions and different encoding formats. IEEE Trans. Multime´d. 2(2), 101–110 (2000)
Björk, N., Christopoulos, C.: Transcoder architectures for video coding. IEEE Trans. Consumer Electron. 44(1), 88–98 (1998)
Youn, J., Sun, M.-T., Lin, C.-W.: Motion vector refinement for high-performance transcoding. IEEE Trans. Multimed. 1(1), 30–40 (1999)
Chen, M.-J., Chu, M.-C., Pan, C.-W.: Efficient motion-estimation algorithm for reduced frame-rate video transcoder. IEEE Lett. Circuits Syst. Video Technol. 12(4), 269–275 (2002)
Lee, W.J., Ho, W.J.: Adaptive frame-skipping for video transcoding, IEEE International Conference on Image Processing (ICIP 2003) 1:165–168 (2003)
Downs, T., Gates, K.E., Masters, A.: Exact simplification of support vector solutions. J. Mach. Learn. Res. 2, 293–297 (2001)
Nguyen, D.D., Ho, T.B.: A bottom-up method for simplifying support vector solutions. IEEE Trans. Neural Netw. 17, 792–796 (2006)
Wu, T.-F., Lin, C.-J., Weng, R.C.: Probability estimates for multi-class classification by pairwise coupling. J. Mach. Learn. Res. 5, 975–1005 (2004)
Tao, Q., Wu, G.-W., Wang, Fei.-Yue., Wang, J.: Posterior probability support vector machines for unbalanced data. IEEE Trans. Neural Netw. 16(6), 1561–1573 (2005)
Gönen, M., Tanugur, A.G., Alpaydın, E.: Multiclass posterior probability support vector machines. IEEE Trans. Neural Netw. 19(1), 130–139 (2008)
Chang, C.-C., Lin, C.-J. LIBSVM: a Library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/
Acknowledgement
This work is supported by the National Natural Science Foundation of China (No. 61103118, No. 61073162), and the Shandong Natural Science Foundation of China (No. ZR2011FQ021).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Thomas Haenselmann.
Rights and permissions
About this article
Cite this article
Liu, ZG., Yang, Y. & Ji, XH. Fast macroblock mode decision for H.264/AVC baseline profile video transcoder based on support vector machines. Multimedia Systems 18, 359–372 (2012). https://doi.org/10.1007/s00530-011-0256-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-011-0256-7