Detecting and Clustering Multiple Takes of One Scene

Bailer, Werner; Lee, Felix; Thallinger, Georg

doi:10.1007/978-3-540-77409-9_8

Werner Bailer¹,
Felix Lee¹ &
Georg Thallinger¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4903))

Included in the following conference series:

International Conference on Multimedia Modeling

1632 Accesses
5 Citations

Abstract

In applications such as video post-production users are confronted with large amounts of redundant unedited raw material, called rushes. Viewing and organizing this material are crucial but time consuming tasks. Typically multiple but slightly different takes of the same scene can be found in the rushes video. We propose a method for detecting and clustering takes of one scene shot from the same or very similar camera positions. It uses a variant of the LCSS algorithm to find matching subsequences in sequences of visual features extracted from the source video. Hierarchical clustering is used to group the takes of one scene. The approach is evaluated in terms of correctly assigned takes using manually annotated ground truth.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bailer, W., Lee, F., Thallinger, G.: Skimming rushes video using retake detection. In: TVS 2007. Proceedings of the TRECVID Workshop on Video Summarization, pp. 60–64. ACM Press, New York (September 2007)
Chapter Google Scholar
Bailer, W., Thallinger, G.: A framework for multimedia content abstraction and its application to rushes exploration. In: Proceedings of ACM International Conference on Image and Video Retrieval, Amsterdam, NL (July 2007)
Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines, Software (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chang, S.-F., Chen, W., Meng, H., Sundaram, H., Zhong, D.: VideoQ: an automated content based video search system using visual cues. In: MULTIMEDIA 1997: Proceedings of the fifth ACM international conference on Multimedia, pp. 313–324. ACM Press, New York (1997)
Chapter Google Scholar
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2001)
MATH Google Scholar
Covell, M., Baluja, S., Fink, M.: Advertisement detection and replacement using acoustic and visual repetition. In: IEEE Workshop on Multimedia Signal Processing, pp. 461–466 (October 2006)
Google Scholar
Delaney, B., Hoomans, B.: Preservation and Digitisation Plans: Overview and Analysis, PrestoSpace Deliverable 2.1 User Requirements Final Report (2004), http://www.prestospace.org/project/deliverables/D2-1_User_Requirements_Final_Report.pdf
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Chichester (2000)
Google Scholar
Duygulu, P., Pan, J.-Y., Forsyth, D.A.: Towards auto-documentary: tracking the evolution of news stories. In: MULTIMEDIA 2004: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 820–827. ACM Press, New York (2004)
Chapter Google Scholar
Hampapur, A., Bolle, R.M.: Comparison of distance measures for video copy detection. In: IEEE International Conference on Multimedia and Expo, pp. 737–740 (August 2001)
Google Scholar
Hampapur, A., Hyun, K., Bolle, R.M.: In: Yeung, M.M., Li, C.-S., Lienhart, R.W. (eds.) Storage and Retrieval for Media Databases 2002. Society of Photo-Optical Instrumentation Engineers (SPIE) Conference, vol. 4676, pp. 194–201 (2001)
Google Scholar
Hsu, W., Chang, S.-F.: Topic tracking across broadcast news videos with visual duplicates and semantic concepts. In: International Conference on Image Processing (ICIP) (October 2006)
Google Scholar
MPEG-7. Information Technology—Multimedia Content Description Interface: Part 3: Visual. ISO/IEC 15938-3 (2001)
Google Scholar
MPEG-7. Information Technology—Multimedia Content Description Interface: Part 8: Extraction and Use of MPEG-7 Descriptions. ISO/IEC 15938-8 (2001)
Google Scholar
Over, P., Smeaton, A.F., Kelly, P.: The TRECVID 2007 BBC rushes summarization evaluation pilot. In: TVS 2007. Proceedings of the TRECVID Workshop on Video Summarization, pp. 1–15. ACM Press, New York (September 2007)
Chapter Google Scholar
Alan, F., Smeaton, A.F., Over, P.: TRECVID 2006: Shot boundary detection task overview. In: Proceedings of the TRECVID Workshop (November 2006)
Google Scholar
Vlachos, M., Kollios, G., Gunopoulos, D.: Discovering similar multidimensional trajectories. In: ICDE 2002: Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, pp. 673–684. IEEE Computer Society, Washington DC (2002)
Chapter Google Scholar
Zhang, Z., Huang, K., Tan, T.: Comparison of similarity measures for trajectory clustering in outdoor surveillance scenes. In: ICPR 2006: Proceedings of the 18th International Conference on Pattern Recognition, pp. 1135–1138. IEEE Computer Society, Washington, DC, USA (2006)
Google Scholar
Zhu, X., Elmagarmid, A., Xue, X., Wu, L., Catlin, A.: InsightVideo: toward hierarchical video content organization for efficient browsing, summarization and retrieval. IEEE Transactions on Multimedia 7(4), 648–666 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

JOANNEUM RESEARCH Forschungsgesellschaft mbH, Institute of Information Systems & Information Management, Steyrergasse 17, 8010 Graz, Austria
Werner Bailer, Felix Lee & Georg Thallinger

Authors

Werner Bailer
View author publications
You can also search for this author in PubMed Google Scholar
Felix Lee
View author publications
You can also search for this author in PubMed Google Scholar
Georg Thallinger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Shin’ichi Satoh Frank Nack Minoru Etoh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bailer, W., Lee, F., Thallinger, G. (2008). Detecting and Clustering Multiple Takes of One Scene. In: Satoh, S., Nack, F., Etoh, M. (eds) Advances in Multimedia Modeling. MMM 2008. Lecture Notes in Computer Science, vol 4903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77409-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-540-77409-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77407-5
Online ISBN: 978-3-540-77409-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics