Skip to main content

Detecting and Clustering Multiple Takes of One Scene

  • Conference paper
Book cover Advances in Multimedia Modeling (MMM 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4903))

Included in the following conference series:

Abstract

In applications such as video post-production users are confronted with large amounts of redundant unedited raw material, called rushes. Viewing and organizing this material are crucial but time consuming tasks. Typically multiple but slightly different takes of the same scene can be found in the rushes video. We propose a method for detecting and clustering takes of one scene shot from the same or very similar camera positions. It uses a variant of the LCSS algorithm to find matching subsequences in sequences of visual features extracted from the source video. Hierarchical clustering is used to group the takes of one scene. The approach is evaluated in terms of correctly assigned takes using manually annotated ground truth.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bailer, W., Lee, F., Thallinger, G.: Skimming rushes video using retake detection. In: TVS 2007. Proceedings of the TRECVID Workshop on Video Summarization, pp. 60–64. ACM Press, New York (September 2007)

    Chapter  Google Scholar 

  2. Bailer, W., Thallinger, G.: A framework for multimedia content abstraction and its application to rushes exploration. In: Proceedings of ACM International Conference on Image and Video Retrieval, Amsterdam, NL (July 2007)

    Google Scholar 

  3. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines, Software (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  4. Chang, S.-F., Chen, W., Meng, H., Sundaram, H., Zhong, D.: VideoQ: an automated content based video search system using visual cues. In: MULTIMEDIA 1997: Proceedings of the fifth ACM international conference on Multimedia, pp. 313–324. ACM Press, New York (1997)

    Chapter  Google Scholar 

  5. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  6. Covell, M., Baluja, S., Fink, M.: Advertisement detection and replacement using acoustic and visual repetition. In: IEEE Workshop on Multimedia Signal Processing, pp. 461–466 (October 2006)

    Google Scholar 

  7. Delaney, B., Hoomans, B.: Preservation and Digitisation Plans: Overview and Analysis, PrestoSpace Deliverable 2.1 User Requirements Final Report (2004), http://www.prestospace.org/project/deliverables/D2-1_User_Requirements_Final_Report.pdf

  8. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Chichester (2000)

    Google Scholar 

  9. Duygulu, P., Pan, J.-Y., Forsyth, D.A.: Towards auto-documentary: tracking the evolution of news stories. In: MULTIMEDIA 2004: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 820–827. ACM Press, New York (2004)

    Chapter  Google Scholar 

  10. Hampapur, A., Bolle, R.M.: Comparison of distance measures for video copy detection. In: IEEE International Conference on Multimedia and Expo, pp. 737–740 (August 2001)

    Google Scholar 

  11. Hampapur, A., Hyun, K., Bolle, R.M.: In: Yeung, M.M., Li, C.-S., Lienhart, R.W. (eds.) Storage and Retrieval for Media Databases 2002. Society of Photo-Optical Instrumentation Engineers (SPIE) Conference, vol. 4676, pp. 194–201 (2001)

    Google Scholar 

  12. Hsu, W., Chang, S.-F.: Topic tracking across broadcast news videos with visual duplicates and semantic concepts. In: International Conference on Image Processing (ICIP) (October 2006)

    Google Scholar 

  13. MPEG-7. Information Technology—Multimedia Content Description Interface: Part 3: Visual. ISO/IEC 15938-3 (2001)

    Google Scholar 

  14. MPEG-7. Information Technology—Multimedia Content Description Interface: Part 8: Extraction and Use of MPEG-7 Descriptions. ISO/IEC 15938-8 (2001)

    Google Scholar 

  15. Over, P., Smeaton, A.F., Kelly, P.: The TRECVID 2007 BBC rushes summarization evaluation pilot. In: TVS 2007. Proceedings of the TRECVID Workshop on Video Summarization, pp. 1–15. ACM Press, New York (September 2007)

    Chapter  Google Scholar 

  16. Alan, F., Smeaton, A.F., Over, P.: TRECVID 2006: Shot boundary detection task overview. In: Proceedings of the TRECVID Workshop (November 2006)

    Google Scholar 

  17. Vlachos, M., Kollios, G., Gunopoulos, D.: Discovering similar multidimensional trajectories. In: ICDE 2002: Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, pp. 673–684. IEEE Computer Society, Washington DC (2002)

    Chapter  Google Scholar 

  18. Zhang, Z., Huang, K., Tan, T.: Comparison of similarity measures for trajectory clustering in outdoor surveillance scenes. In: ICPR 2006: Proceedings of the 18th International Conference on Pattern Recognition, pp. 1135–1138. IEEE Computer Society, Washington, DC, USA (2006)

    Google Scholar 

  19. Zhu, X., Elmagarmid, A., Xue, X., Wu, L., Catlin, A.: InsightVideo: toward hierarchical video content organization for efficient browsing, summarization and retrieval. IEEE Transactions on Multimedia 7(4), 648–666 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Shin’ichi Satoh Frank Nack Minoru Etoh

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bailer, W., Lee, F., Thallinger, G. (2008). Detecting and Clustering Multiple Takes of One Scene. In: Satoh, S., Nack, F., Etoh, M. (eds) Advances in Multimedia Modeling. MMM 2008. Lecture Notes in Computer Science, vol 4903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77409-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77409-9_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77407-5

  • Online ISBN: 978-3-540-77409-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics