Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries

  • Conference paper
Protocols and Systems for Interactive Distributed Multimedia (IDMS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2515))

Abstract

Two basic requirements for a digital video library to be “browsable” are precisely indexed content and informative abstracts. Such solutions are still uncommon in video search engines and generic digital video platforms, so the authors propose developing applications that address at least the creation of abstracts. Abstracts cannot be constructed without deep video content analysis, including low-level processing such as shot detection, which segments a video sequence into a series of “camera takes”. The presented shot detection method uses the concept of a Motion Factor (of frame transitions). The basic definition treats the motion factor as a very sudden peak in the difference between two successive frames. In some specific areas, the intrashot motion factor may suppress the shot-boundary motion factor. To avoid confusing the two motion factors during shot detection, the concept of a differential motion factor was implemented. The full-resolution algorithm achieves an accuracy of up to 80%, but it is very time-consuming. Shot detection accuracy was measured by counting the true and false shots detected, as well as the real shots whose boundaries were determined visually. The authors’ study of a representative number of movies (from various categories) revealed that shot detection can be accelerated up to 500 times without any significant deterioration in shot recognition accuracy. The algorithm was accelerated in a simple manner, by reducing the frame resolution (in pixels) in both dimensions.
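
The idea described in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: the abstract gives no formulas, so the mean absolute pixel difference used here as the motion factor, the first difference of successive motion factors used as the differential motion factor, the threshold value, and all function names are assumptions made for the example.

```python
def downsample(frame, step):
    """Keep every `step`-th pixel along both axes (two-dimensional reduction)."""
    return [row[::step] for row in frame[::step]]


def motion_factor(prev, curr):
    """Assumed motion factor: mean absolute difference of two grayscale frames."""
    total = sum(
        abs(a - b)
        for row_prev, row_curr in zip(prev, curr)
        for a, b in zip(row_prev, row_curr)
    )
    return total / (len(prev) * len(prev[0]))


def detect_shots(frames, threshold, step=1):
    """Return indices of frames that start a new shot.

    A boundary is reported where the motion factor rises sharply relative to
    the previous transition (assumed differential motion factor), so steady
    intrashot motion with a high but constant motion factor is not flagged.
    """
    small = [downsample(f, step) for f in frames]
    factors = [motion_factor(a, b) for a, b in zip(small, small[1:])]
    boundaries = []
    # The first transition has no predecessor, so comparison starts at index 1.
    for i in range(1, len(factors)):
        if factors[i] - factors[i - 1] > threshold:
            boundaries.append(i + 1)  # frame i + 1 opens the new shot
    return boundaries


def flat(value, size=4):
    """Hypothetical helper: a uniform size x size grayscale test frame."""
    return [[value] * size for _ in range(size)]


# Two synthetic "shots": dark frames followed by bright frames.
frames = [flat(v) for v in (10, 11, 12, 200, 201, 202)]
print(detect_shots(frames, threshold=50, step=2))  # prints [3]
```

Keeping every `step`-th pixel in both dimensions divides the number of pixels compared per frame pair by `step` squared, which is why a two-dimensional reduction shrinks the work so quickly. How far the resolution can be reduced before accuracy suffers is the empirical question the paper addresses; the sketch makes no claim about that.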

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leszczuk, M., Papir, Z. (2002). Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries. In: Boavida, F., Monteiro, E., Orvalho, J. (eds) Protocols and Systems for Interactive Distributed Multimedia. IDMS 2002. Lecture Notes in Computer Science, vol 2515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36166-9_16

  • DOI: https://doi.org/10.1007/3-540-36166-9_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00169-0

  • Online ISBN: 978-3-540-36166-4

  • eBook Packages: Springer Book Archive
