Video Shrinking by Auditory and Visual Cues

Xu, Qianqian; Liu, Huiying; Jiang, Shuqiang; Huang, Qingming; Gong, Yu

doi:10.1007/978-3-642-10467-1_69

Qianqian Xu²²,
Huiying Liu^22,23,24,
Shuqiang Jiang^23,24,
Qingming Huang^22,23,24 &
…
Yu Gong^22,23,24

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5879))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1098 Accesses

Abstract

Video content is growing at an explosive rate nowadays. How to consume them efficiently is an important research point for years. Although the widely investigated video summarization solution can generate the main content of a video, it cannot ensure the coherence and apprehensibility of the original video. In this paper, we present a new framework called video shrinking to remove the video’s redundant information while keeping the integrality of the video content. Firstly, speech detection is performed to extract Candidate Deletion Shots (CDS), which have the property of low speech-ratio. Then, by combining the attention analysis and continuity analysis, CDS are refined to obtain the final temporal shrinking output. Subsequently, we further shrink the video spatially to adapt for the small screens of mobile devices. Experimental results demonstrate the effectiveness and efficiency of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Attention-Based Audio-Visual Fusion for Video Summarization

Video Clip Growth: A General Algorithm for Multi-view Video Summarization

Saliency Driven Video Motion Magnification

References

Zhu, G.Y., Huang, Q.M., Xu, C.S., Xing, L.Y., Gao, W., Yao, H.X.: Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video. IEEE Transactions on Multimedia 9(6), 1167–1182 (2007)
Article Google Scholar
Qiu, X.K., Jiang, S.Q., Huang, Q.M., Liu, H.Y.: Spatial-temporal video browsing for mobile environment based on visual attention. In: 2009 IEEE International Conference on Multimedia and Expo., New York (2009)
Google Scholar
Simakov, D., Caspi, Y., Shechtman, E., Irani, M.: Summarizing visual data using bidirectional similarity. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, Anchorage, pp. 1–8 (2008)
Google Scholar
Liu, F., Gleicher, M.: Video retargeting: automating pan and scan. In: 14th annual ACM international conference on Multimedia, Santa Barbara, pp. 241–250 (2006)
Google Scholar
Liu, C.X., Liu, H.Y., Jiang, S.Q., Huang, Q.M., Zheng, Y.J., Zhang, W.G.: JDL at Trecvid 2006 Shot Boundary Detection. In: TRECVID 2006 Workshop (2006)
Google Scholar
Wold, E., Blum, T., Keislar, D., Wheaten, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia 3(3), 27–36 (1996)
Article Google Scholar
Lu, L., Zhang, H.J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Transactions on Speech and Audio Processing 10(7), 504–516 (2002)
Article Google Scholar
Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in kernel methods: support vector learning, pp. 185–208. MIT Press, Cambridge (1998)
Google Scholar
Liu, H.Y., Jiang, S.Q., Huang, Q.M., Xu, C.S.: A generic virtual content insertion system based on visual attention analysis. In: 16th annual ACM international conference on Multimedia, Vancouver, pp. 379–388 (2008)
Google Scholar
Li, Z., Wei, Q., Wang, Y.J., Yang, S.Q., Zhang, H.J.: Video shot grouping using best-first model merging. In: SPIE conference on Storage and Retrieval for Media Database, San Jose, pp. 262–296 (2001)
Google Scholar
Cheng, W.H., Wang, C.W., Wu, J.L.: Video adaptation for small display based on content recomposition. IEEE Transactions on Circuits and Systems for Video Technology 17(1), 43–58 (2007)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Graduate University of Chinese Academy of Sciences, Beijing, 100049, China
Qianqian Xu, Huiying Liu, Qingming Huang & Yu Gong
Key Lab of Intell. Info. Process., Chinese Academy of Sciences, Beijing, 100190, China
Huiying Liu, Shuqiang Jiang, Qingming Huang & Yu Gong
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Huiying Liu, Shuqiang Jiang, Qingming Huang & Yu Gong

Authors

Qianqian Xu
View author publications
You can also search for this author in PubMed Google Scholar
Huiying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuqiang Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Qingming Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Gong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Naresuan University, 65000, Phisanulok, Thailand
Paisarn Muneesawang
Microsoft Research Asia, 100109, Beijing, China
Feng Wu
Tokyo Institute of Technology, 226-8503, Yokohama, Japan
Itsuo Kumazawa
Mahanakorn University of Technology, 10530, Bankok, Thailand
Athikom Roeksabutr
Institute of Information Science, Academia Sinica, Taipei, Taiwan
Mark Liao
Chinese University of Hong Kong, Shatin, N.T., Hong Kong,
Xiaoou Tang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, Q., Liu, H., Jiang, S., Huang, Q., Gong, Y. (2009). Video Shrinking by Auditory and Visual Cues. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_69

Download citation

DOI: https://doi.org/10.1007/978-3-642-10467-1_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics