A content-based approach for detecting highlights in action movies

Yeh, Mei-Chen; Tsai, Yen-Wei; Hsu, Hao-Chen

doi:10.1007/s00530-015-0457-6

A content-based approach for detecting highlights in action movies

Regular Paper
Published: 21 March 2015

Volume 22, pages 287–295, (2016)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Mei-Chen Yeh¹,
Yen-Wei Tsai¹ &
Hao-Chen Hsu¹

512 Accesses
3 Citations
Explore all metrics

Abstract

Although detecting highlights in films is a trivial task for humans, previous studies have not determined whether a computer can be equipped with this capability. In this paper, we present a content-based system that automatically detects highlight scenes and predicts highlight scores in action movies. In particular, high-level image attributes and an early event detection approach are applied. Dissimilar to current learning-based approaches that model the relationship between the whole highlight and corresponding audiovisual features, the proposed system studies the temporal changes of a set of general features from a nonhighlight to a highlight scene. The experimental results indicate that achieving the highlight detection task is technically feasible. It also provides critical insights into understanding the feasibility of solving this challenging problem. For example, both audio and visual features are crucial and the filming style can be captured using high-level image attributes, which further improve the overall detection performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Adams, B., Dorai, C., Venkatesh, S.: Study of shot length and motion as contributing factors to movie tempo. In: ACM International Conference on Multimedia (2000)
Chênes, C., Chanel, G., Soleymani, M., Pun, T.: Highlight detection in movie scenes through inter-users physiological linkage. In: Ramzan, N., (Ed.) Social Media Retrieval. Computer Communications and Networks, pp. 217–237. Springer, London (2013)
Gross, J.J., Levenson, R.W.: Emotion elicitation using films. Cogn. Emot. 9(1), 87–108 (1995)
Article Google Scholar
Hamann, S.: Cognitive and neural mechanisms of emotional memory. Trends Cogn Sci 5(9), 394–400 (2001)
Article Google Scholar
Hanjalic, A.: Adaptive extraction of highlights from a sport video based on excitement modeling. IEEE Trans Multimed 7(6), 1114–1122 (2005)
Article Google Scholar
Irie, G., Satou, T., Kojima, A., Yamasaki, T., Aizawa, K.: Automatic trailer generation. In: ACM International Conference on Multimedia (2010)
Li, Y., Lee, S.-H., Yeh, C.-H., Kuo, C.-C.J.: Techniques for movie content analysis and skimming: tutorial and overview on video abstraction techniques. IEEE Signal Process Mach 23(3), 79–89 (2006)
Google Scholar
Lin, K.-S., Lee, A., Yang, Y.-H., Lee, C.-T., Chen H.H.: Automatic highlights extraction for drama video using music emotion and human face features. In: IEEE International Workshop on Multimedia Signal Processing (2011)
Liu, A., Li, J., Zhang, Y., Tang, S., Song, Y., Yang, Z.: An innovative model of tempo and its application in action scene detection for movie analysis. In: IEEE Workshop on Applications of Computer Vision (2008)
Liu, A., Tang, S., Zhang, Y., Song, Y., Li, J., Yang, Z.: A hierarchical framework for movie content analysis: Let computers watch films like humans. In: IEEE International Conference on Computer Vision and Pattern Recognition Workshops (2008)
Liu, A., Yang, Z.: Watching, thinking, reacting: a human-centered framework for movie content analysis. Int J Digit Content Technol Appl 4(5), 23–37 (2010)
Article Google Scholar
Ma, Y.-F., Lu, L., Zhang, H.-J., Li, M.: A user attention model for video summarization. In: ACM International Conference on Multimedia (2002)
Merriam-Webster: Merriam-Webster’s collegiate dictionary, 2003. Retrieved July 17 2014 from: http://www.merriam-webster.com/dictionary/highlight
Minh, H., Torre, D.: Max-margin early event detection. In: IEEE International Conference on Computer Vision and Pattern Recognition (2012)
MPEG-7 Visual Experimentation Model (XM), Version 10.0, ISO/IEC/JTC1/SC29/WG11, Doc. N4063 (2001)
Rasheed, Z., Shah, M.: Detection and representation of scenes in videos. IEEE Trans Multimed 7(6), 1097–1105 (2005)
Article Google Scholar
Sadlier, D.A., O’Connor, N.E.: Event detection in field sports video using audio-visual features and a support vector machine. IEEE Trans Circuits Syst Video Technol 15(10), 1225–1233 (2005)
Article Google Scholar
Smeaton, A.F., Lehane, B., O’Connor, N.E., Brady, C., Craig, G.: Automatically selecting shots for action movie trailers. ACM International Workshop on Multimedia Information Retrieval (2006)
Tsochantaridis, I., Joachims, T., Hofmann, T., Altun, Y.: Large margin methods for structured and interdependent output variables. J Mach Learn Res (JMLR) 6, 1453–1484 (2005)
MathSciNet MATH Google Scholar
Wang, H.L., Cheng, L.-F.: Affective understanding in film. IEEE Trans Circuits Syst Video Technol 16(6), 689–704 (2006)
Article Google Scholar
Wang, J., Xu, C., Chng, E., Tian, Q.: Sports highlight detection from key word sequences using HMM. In: IEEE International Conference on Multimedia & Expo (2004)
Zheng, Y., Zhu, G., Jiang, S., Huang, Q., Gao, W.: Visual-aural attention modeling for talk show video highlight detection. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2008)

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan Normal University, No. 88, Sec. 4, Tingzhou Rd., Taipei, Taiwan
Mei-Chen Yeh, Yen-Wei Tsai & Hao-Chen Hsu

Authors

Mei-Chen Yeh
View author publications
You can also search for this author in PubMed Google Scholar
Yen-Wei Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Hao-Chen Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mei-Chen Yeh.

Additional information

Communicated by Y. Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yeh, MC., Tsai, YW. & Hsu, HC. A content-based approach for detecting highlights in action movies. Multimedia Systems 22, 287–295 (2016). https://doi.org/10.1007/s00530-015-0457-6

Download citation

Received: 17 July 2014
Accepted: 01 March 2015
Published: 21 March 2015
Issue Date: June 2016
DOI: https://doi.org/10.1007/s00530-015-0457-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A content-based approach for detecting highlights in action movies

Abstract

Access this article

Similar content being viewed by others

Automatic summarization of soccer highlights using audio-visual descriptors

Content-Aware Summarization of Broadcast Sports Videos: An Audio–Visual Feature Extraction Approach

Automatic highlight detection in videos of martial arts tricking

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A content-based approach for detecting highlights in action movies

Abstract

Access this article

Similar content being viewed by others

Automatic summarization of soccer highlights using audio-visual descriptors

Content-Aware Summarization of Broadcast Sports Videos: An Audio–Visual Feature Extraction Approach

Automatic highlight detection in videos of martial arts tricking

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation