Skip to main content

Video Shrinking by Auditory and Visual Cues

  • Conference paper
Advances in Multimedia Information Processing - PCM 2009 (PCM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5879))

Included in the following conference series:

  • 1098 Accesses

Abstract

Video content is growing at an explosive rate nowadays. How to consume them efficiently is an important research point for years. Although the widely investigated video summarization solution can generate the main content of a video, it cannot ensure the coherence and apprehensibility of the original video. In this paper, we present a new framework called video shrinking to remove the video’s redundant information while keeping the integrality of the video content. Firstly, speech detection is performed to extract Candidate Deletion Shots (CDS), which have the property of low speech-ratio. Then, by combining the attention analysis and continuity analysis, CDS are refined to obtain the final temporal shrinking output. Subsequently, we further shrink the video spatially to adapt for the small screens of mobile devices. Experimental results demonstrate the effectiveness and efficiency of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Zhu, G.Y., Huang, Q.M., Xu, C.S., Xing, L.Y., Gao, W., Yao, H.X.: Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video. IEEE Transactions on Multimedia 9(6), 1167–1182 (2007)

    Article  Google Scholar 

  2. Qiu, X.K., Jiang, S.Q., Huang, Q.M., Liu, H.Y.: Spatial-temporal video browsing for mobile environment based on visual attention. In: 2009 IEEE International Conference on Multimedia and Expo., New York (2009)

    Google Scholar 

  3. Simakov, D., Caspi, Y., Shechtman, E., Irani, M.: Summarizing visual data using bidirectional similarity. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, Anchorage, pp. 1–8 (2008)

    Google Scholar 

  4. Liu, F., Gleicher, M.: Video retargeting: automating pan and scan. In: 14th annual ACM international conference on Multimedia, Santa Barbara, pp. 241–250 (2006)

    Google Scholar 

  5. Liu, C.X., Liu, H.Y., Jiang, S.Q., Huang, Q.M., Zheng, Y.J., Zhang, W.G.: JDL at Trecvid 2006 Shot Boundary Detection. In: TRECVID 2006 Workshop (2006)

    Google Scholar 

  6. Wold, E., Blum, T., Keislar, D., Wheaten, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia 3(3), 27–36 (1996)

    Article  Google Scholar 

  7. Lu, L., Zhang, H.J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Transactions on Speech and Audio Processing 10(7), 504–516 (2002)

    Article  Google Scholar 

  8. Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in kernel methods: support vector learning, pp. 185–208. MIT Press, Cambridge (1998)

    Google Scholar 

  9. Liu, H.Y., Jiang, S.Q., Huang, Q.M., Xu, C.S.: A generic virtual content insertion system based on visual attention analysis. In: 16th annual ACM international conference on Multimedia, Vancouver, pp. 379–388 (2008)

    Google Scholar 

  10. Li, Z., Wei, Q., Wang, Y.J., Yang, S.Q., Zhang, H.J.: Video shot grouping using best-first model merging. In: SPIE conference on Storage and Retrieval for Media Database, San Jose, pp. 262–296 (2001)

    Google Scholar 

  11. Cheng, W.H., Wang, C.W., Wu, J.L.: Video adaptation for small display based on content recomposition. IEEE Transactions on Circuits and Systems for Video Technology 17(1), 43–58 (2007)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, Q., Liu, H., Jiang, S., Huang, Q., Gong, Y. (2009). Video Shrinking by Auditory and Visual Cues. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_69

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10467-1_69

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10466-4

  • Online ISBN: 978-3-642-10467-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics