Skip to main content

Advertisement

Log in

A framework for automatic static and dynamic video thumbnail extraction

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Video thumbnails enable users to see quick snapshots of video collections. To display the video thumbnails, the first frame or a frame selected by using simple low level features in each video clip has been set to the default thumbnail for the sake of computational efficiency and implementation simplicity. However, such methods often fail to represent the gist of the clip. To overcome this limitation, we present a new framework for both static and dynamic video thumbnail extraction. First, we formulate energy functions using the features which incorporate mid-level information to obtain superior thumbnailing. Since it is considered that frames whose layouts are similar to others in the clip are relevant in video thumbnail extraction, scene layouts are also considered in computing overall energy. For dynamic thumbnail generation, a time slot is determined by finding the duration showing the minimum energy. Experimental results show that the proposed method achieves comparable performance on a variety of challenging videos, and the subjective evaluation demonstrates the effectiveness of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Almeida J, Leite NJ, Torres RdS (2012) Vison: Video summarization for online applications. Pattern Recogn Lett 33(4):397–409

    Article  Google Scholar 

  2. Al-Hajri A, Fong M, Miller G, Fels S (2014) Fast forward with your vcr: Visualizing single-video viewing statistics for navigation and sharing. In: Proceedings of the 2014 Graphics Interface Conference, pp 123–128

  3. Benini S, Migliorati P, Leonardi R (2007) Hidden markov models for video skim generation. In: Eighth International Workshop on Image Analysis for Multimedia Interactive Services. IEEE, pp 6–6

  4. Choi J, Jung C, Lee J, Kim C (2014) Determining the existence of objects in an image and its application to image thumbnailing. IEEE Signal Process Lett 21 (8):957–961

    Article  Google Scholar 

  5. Christel MG (2006) Evaluation and user studies with respect to video summarization and browsing. In: Electronic Imaging 2006. International Society for Optics and Photonics, pp 60730M–60730M

  6. Cotsaces C, Nikolaidis N, Pitas I (2006) Video shot detection and condensed representation: a review. IEEE Signal Process Mag 23(2):28–37

    Article  Google Scholar 

  7. Craggs B, Scott MK, Alexander J (2014) ThumbReels: query sensitive web video previews based on temporal, crowdsourced, semantic tagging. In: Proceedings the 32nd annual ACM Conference on Human Factors in Computing Systems, pp 1217–1220

  8. Gao Y, Zhang T, Xiao J (2009) Thematic video thumbnail selection. In: Proceedings of IEEE International Conference on Image Processing (ICIP). IEEE, pp 4333–4336

  9. Gong Y, Liu X (2000) Generating optimal video summaries. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), vol 3. IEEE, pp 1559–1562

  10. Jiang J, Zhang X-P (2010) A novel video thumbnail extraction method using spatiotemporal vector quantization , Proc. of the 3rd International Workshop on Automated Information Extraction in Media Production. ACM, pp. 9–14

  11. Jiang J, Zhang X-P (2011) Video thumbnail extraction using video time density function and independent component analysis mixture model. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1417–1420

  12. Lee K-J., Lee W-J., Jeong J-C. (2014) An enhanced error compensation method for thumbnail generation in H. 264/AVC bitstreams. In: Proceedings of the 4th IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC), pp 236–239

  13. Li H, Ngan KN (2007) Unsupervized video segmentation with low depth of field. IEEE Trans Circuits Syst Video Technol 17(12):1742–1751

    Article  Google Scholar 

  14. Liu C, Huang Q, Jiang S (2011) Query sensitive dynamic web video thumbnail generation. In: Proceedings of IEEE International Conference on Image Processing (ICIP). IEEE, pp 2449–2452

  15. Ma Y-F, Hua X-S, Lu L, Zhang H-J (2005) A generic framework of user attention model and its application in video summarization. IEEE Trans Multimed 7 (5):907–919

    Article  Google Scholar 

  16. Ma Y-F, Lu L, Zhang H-J, Li M (2002) A user attention model for video summarization. In: Proceedings of the tenth ACM international conference on Multimedia. ACM, pp 533–542

  17. Money AG, Agius H (2008) Video summarisation: A conceptual framework and survey of the state of the art. J Vis Commun Image Represent 19(2):121–143

    Article  Google Scholar 

  18. Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE Trans Circuits Syst Video Technol 15(2):296–305

    Article  Google Scholar 

  19. Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987

    Article  MATH  Google Scholar 

  20. Sundaram S, Velisavljevic V, Qin Y (2011) Hotflashes: Thumbnailing videos of social gatherings by detecting camera flash illuminated frames. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–4

  21. Truong BT, Venkatesh S (2007) Video abstraction: A systematic review and classification. ACM Trans Multimed Comput Commun Appl (TOMCCAP) 3(1):3

    Article  Google Scholar 

  22. Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154

    Article  Google Scholar 

  23. Wang M, Hong R, Li G, Zha Z-J, Yan S, Chua T-S (2012) Event driven web video summarization by tag localization and key-shot identification. IEEE Trans Multimed 14(4):975–985

    Article  Google Scholar 

  24. Wang T, Mei T, Hua X-S, Liu X-L, Zhou H-Q (2007) Video collage: A novel presentation of video sequence. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1479–1482

  25. Wang Y.-S, Liu F, Hsu P-S, Lee T-Y (2013) Spatially and temporally optimized video stabilization. IEEE Trans Vis Comput Graph:1

  26. Yong S-P., Deng JD, Purvis MK (2013) Wildlife video key-frame extraction based on novelty detection in semantic context. Multimed Tools Appl 62(2):359–376

    Article  Google Scholar 

  27. Zhang W, Liu C, Huang Q, Jiang S, Gao W (2012) A novel framework for web video thumbnail generation. In: Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), IEEE, pp 343–346

  28. Zhang W, Liu C, Wang Z, Li G, Huang Q, Gao W (2013) Web video thumbnail recommendation with content-aware analysis and query-sensitive matching. Multimed Tools Appl:1–25

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Changick Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choi, J., Kim, C. A framework for automatic static and dynamic video thumbnail extraction. Multimed Tools Appl 75, 15975–15991 (2016). https://doi.org/10.1007/s11042-015-2909-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2909-6

Keywords

Navigation