Skip to main content
Log in

Extraction of Film Takes for Cinematic Analysis

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, we focus on the ‘reverse editing’ problem in movie analysis, i.e., the extraction of film takes, original camera shots that a film editor extracts and arranges to produce a finished scene. The ability to disassemble final scenes and shots into takes is essential for nonlinear browsing, content annotation and the extraction of higher order cinematic constructs from film. A two-part framework for take extraction is proposed. The first part focuses on the filtering out action-driven scenes for which take extraction is not useful. The second part focuses on extracting film takes using agglomerative hierarchical clustering methods along with different similarity metrics and group distances and demonstrates our findings with 10 movies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. B. Adams, C. Dorai, and S. Venkatesh. “Automatic extraction of expressive elements from motion pictures: Tempo,” IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 472–481, 2002.

    Google Scholar 

  2. F.E. Beaver, Dictionary of Film Terms: The Aesthetics companion to Film Analysis, New York: Twayne Publisher, 1994.

    Google Scholar 

  3. J.M. Corridoni and A.D. Bimbo, “Structured representation and automatic indexing of movie information conent,” Pattern Recognition, Vol. 31, No. 12, pp. 2027–2045, 1998.

    Google Scholar 

  4. N. Dimitrova, J. Martino, L. Agnihotri, and H. Elenbaas, “Color superhistograms for video representation,” in ICIP’99, Kobe, 1999, Vol. 3, pp. 314–318.

  5. N.D. Doulamis, A.D. Doulamis, Y.S. Avrithis, and S.D. Kollias, “Video content representation using optimal extraction of frames and scenes,” in ICIP, Vol. 1., Chicago, Illiois, 1998, pp. 875–879.

  6. M.S. Drew and J. Au, “Video keyframe production by efficient clustering of compressed chromaticity signatures,” in ACM Multimedia 2000, Los Angeles, 2000.

  7. B.S. Everitt, Cluster Analysis 3rd edition. Edward Arnold, 1993.

  8. D. Farin, W. Effelsberg, and P. de With, “Robust clustering-based video-summarization with integration of domain-knowledge,” in IEEE International Conference on Multimedia and Expo(ICME), Lausanne, 2002.

  9. A. Girgensohn and J.S. Boreczky, “Time-constrained keyframe selection technique,” Multimedia Tools and Applications, Vol. 11, No. 3, pp. 347–358, 2000.

    Google Scholar 

  10. J. Huang, S. Kumar, M. Mitra, W. Zhu, and R. Zahib, “Spatial Color Indexing and Applications,” International Journal of Computer Vision, Vol. 35, No. 3, pp. 245–268, 1999.

    Google Scholar 

  11. L. Hubert and P. Arabie, “Comparing partitions,” Journal of Classificaiton, Vol. 2, pp. 193–218, 1985.

    Google Scholar 

  12. S. Kobayashi, Colorist: A Practical Handbook for Personal and Professional Use, Kodansa International: Tokyo, 1998.

    Google Scholar 

  13. Y. Li, T. Zhang, and D. Tretter, “An overview of video abstraction techniques,” Technical Report HPL-2001-191, HP Laboratory, 2001.

  14. R. Lienhart, “Reliable transition detection in videos: A survey and practitioner’s guide,” International Journal of Image and Graphics, Vol. 1, No. 3, pp. 469–486, 2001.

    Google Scholar 

  15. T. Moriayama and M. Sakauchi, “Video summarisation based on the psychological content in the track structure,” in ACM multimedia workshops 2000, 2002, pp. 191–194.

  16. J. Nam and A.H. Tewfik, “Dynamic video sumarization and visualization,” in The 7th ACM Conference on Multimida, ACMMM’99, Vol. 2. Orlando, Florida, 1999, pp. 53–56.

  17. C.-W. Ngo, T.-C. Pong, and H.-Z. Zhang, “On Clustering and Retrieval of video shots,” in: ACM Multimedia’01, Ottawa, 2001, pp. 51–60.

  18. Y. Rui, T.S. Huang, and M.S., “Constructing table-of-content for videos,” ACM Multimedia System Journal: Special Issue in Multimedia Systems on Video Libraries, Vol. 7, No. 5, pp. 359–368, 1999.

    Google Scholar 

  19. S. Sharff, “The Elements of Cinema: Towards a CinestheticImpact,” Columbia Uni Press: New York, 1982.

    Google Scholar 

  20. E.A. Tekalp and A. Mehrotra, “Automatic soccer video analysis and summarization,” IEEE Transaction on Image Processing, 2003.

  21. B.T. Truong, C. Dorai, and S. Venkatesh, “Automatic scene extraction in motion pictures,” IEEE Transactions in Circuits and Systems for Video Technology, Vol. 13, No. 1, pp. 5–10, 2003.

    Google Scholar 

  22. B.T. Truong, S. Venkatesh, and C. Dorai, “New enhancements to cut, fade, and dissolve detection process in video segmentation,” in ACM Multimedia 2000, LA, 2000, pp. 219–227.

  23. B.T. Truong, S. Venkatesh, and C. Dorai, “Application of computational media aesthetics methodology to extracting color semantics in film,” in ACM Multiemdia 2002, France Les Pins, 2002, pp. 339–342.

  24. B.T. Truong, S. Venkatesh, and C. Dorai, “Discovering semantics from the visualization of film takes,” in Accepted for IEEE Multimedia Modelling 2004, Brisbane, Australia, 2004.

  25. E. Veneau, R. Ronfard, and P. Bouthemy, “From video shot clustering to sequence segmentation,” in ICPR’00, Vol. 4, Barcelona, 2000, pp. 254–257.

  26. G.W. Milligan and M.C. Cooper, “An examination of procedures for determining the number of clusters in a data set,” Psychometrika, Vol. 50, No. 2, pp. 159–179, 1985.

    Google Scholar 

  27. J. Wang and T.-S. Chua, “A cinematic-based framework for detecting scene boundaries in video,” The Visual Computer. To appear, 2003.

  28. M. Yeung, B.-L. Yeo, and B. Liu, “Segmentation of video by clustering and graph analysis,” Computer Vision and Image Understanding, Vol. 7, No. 1, pp. 94–109, 1998.

    Google Scholar 

  29. H. Zettl, Sight Sound Motion: Applied Media Aesthetics, 3rd edition, Wadsworth Publishing, 1999.

  30. L. Zhao, W. Qi, S. Yang, and H. Zhang, “Video shot grouping using best-first model merging,” in Proc. 13th SPIE Symposium on Electronic Imaging—Storage and Retrieval for Image and Video Databases, San Jose, 2001, pp. 262–267.

  31. D. Zhong, H. Zhang, and S.-F. Chang, “Clustering methods for video browsing and annotation,” in Storage and Retrieval for Still Image and Video Databases IV, 1996, pp. 239–246.

  32. Y. Zhuang, Y. Rui, T. Huang, and S. Mehrotra, “Adaptive key frame extraction using unsupervised clustering,” in ICIP’98, Chicago, 1998, pp. 866–870.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ba Tu Truong.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Truong, B.T., Venkatesh, S. & Dorai, C. Extraction of Film Takes for Cinematic Analysis. Multimed Tools Appl 26, 277–298 (2005). https://doi.org/10.1007/s11042-005-0892-z

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-005-0892-z

Keywords

Navigation