Abstract
Video content is growing at an explosive rate nowadays. How to consume them efficiently is an important research point for years. Although the widely investigated video summarization solution can generate the main content of a video, it cannot ensure the coherence and apprehensibility of the original video. In this paper, we present a new framework called video shrinking to remove the video’s redundant information while keeping the integrality of the video content. Firstly, speech detection is performed to extract Candidate Deletion Shots (CDS), which have the property of low speech-ratio. Then, by combining the attention analysis and continuity analysis, CDS are refined to obtain the final temporal shrinking output. Subsequently, we further shrink the video spatially to adapt for the small screens of mobile devices. Experimental results demonstrate the effectiveness and efficiency of the proposed method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhu, G.Y., Huang, Q.M., Xu, C.S., Xing, L.Y., Gao, W., Yao, H.X.: Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video. IEEE Transactions on Multimedia 9(6), 1167–1182 (2007)
Qiu, X.K., Jiang, S.Q., Huang, Q.M., Liu, H.Y.: Spatial-temporal video browsing for mobile environment based on visual attention. In: 2009 IEEE International Conference on Multimedia and Expo., New York (2009)
Simakov, D., Caspi, Y., Shechtman, E., Irani, M.: Summarizing visual data using bidirectional similarity. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, Anchorage, pp. 1–8 (2008)
Liu, F., Gleicher, M.: Video retargeting: automating pan and scan. In: 14th annual ACM international conference on Multimedia, Santa Barbara, pp. 241–250 (2006)
Liu, C.X., Liu, H.Y., Jiang, S.Q., Huang, Q.M., Zheng, Y.J., Zhang, W.G.: JDL at Trecvid 2006 Shot Boundary Detection. In: TRECVID 2006 Workshop (2006)
Wold, E., Blum, T., Keislar, D., Wheaten, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia 3(3), 27–36 (1996)
Lu, L., Zhang, H.J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Transactions on Speech and Audio Processing 10(7), 504–516 (2002)
Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in kernel methods: support vector learning, pp. 185–208. MIT Press, Cambridge (1998)
Liu, H.Y., Jiang, S.Q., Huang, Q.M., Xu, C.S.: A generic virtual content insertion system based on visual attention analysis. In: 16th annual ACM international conference on Multimedia, Vancouver, pp. 379–388 (2008)
Li, Z., Wei, Q., Wang, Y.J., Yang, S.Q., Zhang, H.J.: Video shot grouping using best-first model merging. In: SPIE conference on Storage and Retrieval for Media Database, San Jose, pp. 262–296 (2001)
Cheng, W.H., Wang, C.W., Wu, J.L.: Video adaptation for small display based on content recomposition. IEEE Transactions on Circuits and Systems for Video Technology 17(1), 43–58 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, Q., Liu, H., Jiang, S., Huang, Q., Gong, Y. (2009). Video Shrinking by Auditory and Visual Cues. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_69
Download citation
DOI: https://doi.org/10.1007/978-3-642-10467-1_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer ScienceComputer Science (R0)