Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries

Leszczuk, Mikolaj; Papir, Zdzislaw

doi:10.1007/3-540-36166-9_16

Mikolaj Leszczuk⁶ &
Zdzislaw Papir⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2515))

Included in the following conference series:

International Workshop on Interactive Distributed Multimedia Systems and Telecommunication Services

473 Accesses
7 Citations

Abstract

Two basic requirements for a digital video library to be “browsable” are a precisely indexed content and informative abstracts. Nowadays such solutions are not common in video search engines or generic digital video platforms, therefore, the authors suggest developing some computer applications resolving the problems of at least abstracts’ creation. The abstracts cannot be constructed without a deep video content analysis, including some low level processing like a shot detection towards a video sequence segmented to a series of “camera takes”. The presented method, aimed at a shot detection, deploys a concept of a Motion Factor (of frame transitions). The basic definition considers the motion factor as a very sudden peak of difference between two successive frames. In some specific areas, the intrashot motion factor may suppress the shot-boundary motion factor. In order to avoid misrecognition of both motion factors during a shot detection process a concept of a differential motion factor was implemented. The full-resolution algorithm achieves the accuracy of up to 80%, however, it is very time-consuming. The shot detection accuracy was measured including true and false shots detected as well as real shots that were bounded visually. The authors’ research of a representative number of movies (from various categories) has revealed that the shot detection process can be accelerated up to 500 times without any significant deterioration of shot recognition accuracy. The shot detection algorithm was accelerated in a simple manner by two-dimensional reduction of a frame resolution (in pixels).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

J. R. Smith, “Digital Video Libraries and the Internet”, IEEE Communications Magazine, January 1999, pp. 92–97.
Google Scholar
B.-L. Ye, B. Liu, “Unified approach to temporal segmentation...” Proc. 2nd Int. Conf. Multimedia Computing and Systems May 1995.
Google Scholar
“Office Automation”, http://www.irisusa.com/Support, 2000.
P. Kruizinga, “The Face Recognition Home Page”, University of Groningen-http://www.cs.rug.nl/~peterkr, 2000.
“Face Recognition Technology”, Visionicshttp://www.visionics.com/, 2000.
J. R. Smith and S.-F. Chang, “Searching for Images and Videos on the World-Wide Web”, CU/CTR Technical Report 459-96-25, 1996.
Google Scholar
J. R. Smith and S.-F. Chang, “An Image and Video Search Engine for the World-Wide Web”, Proc. EI’1997, San Jose, CA, February 1997.
Google Scholar
B.-L. Yeo and B. Liu, “On the Extraction of DC Sequence...” Proc. Int. Conf. Image Processing, October 1995.
Google Scholar
M. Leszczuk, Z. Papir, “Integration of a Voice Recognition-based Indexing with Multimedia Applications”, Proc. PROMS’ 2000, Krakow, Poland, Oct. 2000, pp. 375–381.
Google Scholar
M. Leszczuk, Z. Papir, “Developing of Digital Video Libraries Indexed by a Speech...” AI’2001, Innsbruck, Austria, February 2001, pp. 107–113.
Google Scholar
M. Leszczuk, Z. Papir,“Developing Digital Video Libraries Indexed by a Speech Recognition Engine”, ICIMADE’2001, Fargo-ND, USA, June 2001.
Google Scholar
J. Mitchell, W. Pennebaker, C. Fogg, D. J. LeGall, “MPEG video compression standard”, International Thomson Publishing, New York, 1996, p. 58.
Google Scholar
M. Krunz, S. Tripathi, “Scene-based characterization of VBR MPEG-compressed video traffic”, Proc. of ACM SIGMETRICS’1997, 1997.
Google Scholar
O. Rose, “Simple and efficient models for variable bit rate MPEG video traffic”, Performance Evaluation, vol. 30, pp. 69–85, July 1997.
Google Scholar
J. Roberts, U. Mocci, J. Virtamo, “Broadband network teletraffic”, Springer-Verlag, Berlin 1996, pp. 20–25.
Google Scholar
Y. Sang-Jo, K. Seong-Dae, “Traffic Modelling and QoS Prediction for MPEG-Coded Video Services over ATM Networks Using Scene...“, Journal of High-Speed Networks”, vol. 8, no. 3, 1999, pp. 211–224.
Google Scholar
A. Mashat, M. Kara, “Performance Evaluation of a Scene-based Model...”, “System Performance Evaluation...” CRC Press, 2000, pp. 123–142.
Google Scholar
D. Heyman, A. Tabatabai, T. Lakshman, “Statistical Analysis of MPEG-2 Coded VBR Video Traffic”, sixth Int. Workshop on Packet Video, September 1994.
Google Scholar
D. P. Heyman, T. Lakshman, “Source models for VBR broadcast-video traffic”, IEEE/ACM Transactions on Networking, vol. 4, no. 1, February 1996.
Google Scholar
M. F. Scheffer, K. Wajda, J. S. Kunicki, “Fuzzy Logic Adaptive Traffic Enforcement Mechanisms for ATM Networks”, Proc. of ITC, Pretoria, South Africa, 1995.
Google Scholar
A. Winter, “Video Capture Software and Scene Detection-Scenalyzer”, http://www.scenalyzer.com/, 2001-08-02
MGI Software Corp, “Video Wave”, http://www.mgisoft.com/products/vw, 2001
IBM, “DB2 Video Extender”, http://www-4.ibm.com/software/data/db2
M. Leszczuk, P. Pacyna, Z. Papir, “Video Content Streaming Service Using IP/RSVP Protocol Stack”, IEEE Workshop on Internet Applications WIAPP’99, 7, 1999
Google Scholar
L. Sanghoon, M.S. Pattichis, A.C. Bovik, “Foveae video compression with optimal rate control” IEEE Trans. on Img. Proc. Volume: 10 Issue: 7, July 2001.
Google Scholar
Sherman Networks, “KaZaA”, http://www.kazaa.com/, 2002.

Download references

Author information

Authors and Affiliations

Department of Telecommunications, AGH University of Technology, Al. Mickiewicza 30, PL-30-059, Kraków, Poland
Mikolaj Leszczuk & Zdzislaw Papir

Authors

Mikolaj Leszczuk
View author publications
You can also search for this author in PubMed Google Scholar
Zdzislaw Papir
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Engenharia Informática, Universidade de Coimbra, Pólo II, 3030-290, Coimbra, Portugal
Fernando Boavida & Edmundo Monteiro &
Escola Superior de Educação, Instituto Politécnico de Coimbra, Praça Heróis do Ultramar, 3030, Coimbra, Portugal
João Orvalho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Leszczuk, M., Papir, Z. (2002). Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries. In: Boavida, F., Monteiro, E., Orvalho, J. (eds) Protocols and Systems for Interactive Distributed Multimedia. IDMS 2002. Lecture Notes in Computer Science, vol 2515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36166-9_16

Download citation

DOI: https://doi.org/10.1007/3-540-36166-9_16
Published: 16 December 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00169-0
Online ISBN: 978-3-540-36166-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics