Definition:Automatic annotation of video refers to the extraction of the information about video automatically, which can serve as the first step for different data access modalities such as browsing, searching, comparison, and categorization.
Advances in digital video technology and the ever increasing availability of computing resources have resulted, in the last few years, in an explosion of digital video data. Moreover, the increased availability of Internet bandwidth has defined new means of video distribution, other than physical media. The major web search engines have already started to provide specific services to index, search and retrieve videos on the Internet.
Improving of video accessibility is the true challenge. In fact, access to video data requires that video content is appropriately indexed but manually annotating or tagging video is at best a laborious and economically infeasible process. Therefore, one important subject of research has been concerned with study of...
This is a preview of subscription content, log in via an institution.
References
N. Dimitrova, H.-J. Zhang, B. Shahraray, I. Sezan, T. Huang, and A. Zakhor. “Applications of videocontent analysis and retrieval,” IEEE Multimedia Magazine, Vol. 12, No. 3, July 2002.
T. Lin and H.J. Zhang, “Automatic video scene extraction by shot grouping,” Proceedings of the 15th International Conference on Pattern Recognition. Vol. 4, September 2000, pp. 39–42, 2000.
J.S. Boreczky and L.A. Rowe, “Comparison of video shot boundary detection techniques,” Proceedings of the IS&T/SPIE Conference Storage and Retrieval for Image and Video Databases IV, Vol. SPIE 2670, 1996, pp. 170–179.
A. Dailianas, R.B. Allen, and P. England, “Comparison of automatic video segmentation algorithms,” Proceedings of the Integration Issues in Large Commercial Media Delivery Systems, Vol. SPIE 2615, October 1995, pp. 2–16.
U. Gargi, R. Kasturi, and S. H. Strayer. “Performance characterization of video-shot-change detection methods,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 10, No. 3, February 2000.
S. Pfeiffer, S. Fischer, and W. Effelsberg, “Automatic Audio Content Analysis,” Proceedings of the ACM Multimedia 96, pp. 21–30, 1996.
C.G.M. Snoek and M. Worring. “Multimodal video indexing: a review of the state-of-the-art,” Multimedia Tools and Applications, Vol. 25, No. 1, pp. 5–35, January 2005.
S.S. Intille, J.W. Davis, and A.F. Bobick, “Real Time Closed World Tracking,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 697–703, 1997.
A. Elgammal, D. Harwood, and L.S. Davis, “Non Parametric Model for Background Subtraction,” Proceedings of the 7th IEEE International Conference on Computer Vision, Kerkyra, Greece, September 1999.
T. Sato, T. Kanade, E. Hughes, and M. Smith. “Video OCR for Digital News Archives,” Proceedings of the IEEE Workshop on Content-Based Access of Image and Video Databases (CAIVD’ 98), Bombay, India, January 1998.
R. Lienhart, “Video OCR: A survey and practitioner’s guide,” In A. Rosenfeld, D. Doermann, and D. DeMenthon, Editors, Video Mining, pp. 155–183, Kluwer Academic Publishers, 2003.
L. Agnihotri, K.V. Devara, T. McGee, and N. Dimitrova, “Summarization of video programs based on closed captions,” Proceedings of the SPIE, Vol. 4315, pp. 599–607, Storage and Retrieval for Media Databases, 2001.
P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” Proceedings of the Computer Vision and Pattern Recognition (CVPR’01), 2001.
A. Hauptmann, D. Ng, R. Baron, M-Y Chen, M. Christel, S. Duygulu, C. Huang, W-H. Lin, H. Wactlar, N. Moraveji, N. Papernick, C.G.M. Snoek, G. Tzanetakis, J. Yang, R. Yan, and R. Jin, “Informedia at TRECVID 2003: Analyzing and Searching Broadcast News Video,” Proceedings of TREC 2003, Gaithersburg, MD, November 2003.
J. Assfalg, M. Bertini, C. Colombo, and A. Del Bimbo, “Semantic Annotation of Sports Videos,” IEEE Multimedia, Vol. 9 No. 2, pp. 52–60, April/June 2002.
J. Assfalg, M. Bertini, C. Colombo, A. Del Bimbo, and W. Nunziati, “Semantic annotation of soccer videos: automatic highlights identification,” Computer Vision and Image Understanding, Vol. 92, Issue 2–3, pp. 285–305, November/December 2003.
A. Ekin, A.M. Tekalp, and R. Mehrotra, “Automatic soccer video analysis and summarization,” IEEE Transactions on Image Processing, Vol. 12, No. 7, pp. 796–807, July 2003.
M.H. Yang, D.J. Kriegman, and N. Ahuja, “Detecting faces in images: A survey,” IEEE Transactions on Pattern Analysis and Machine, Vol. 24, No. 1, pp. 34–58, January 2002.
W. Zhao, R. Chellappa, P.J. Phillips, and A. Rosenfeld, “Face recognition: a literature survey,” ACM Computing Surveys, Vol. 35, No. 4, pp. 309–459, December 2003.
Z. Rasheed, Y. Sheikh, and M. Shah, “On the Use of Computable Features for Film Classification,” IEEE Transactions on Circuits and Systems for Video, Vol. 15, No. 1, pp. 52–64, January 2005.
S. Eickeler and S. Muller, “Content-based video indexing of TV broadcast news using Hidden Markov Models,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP’ 99, Vol. 6, pp. 2997–3000, March 1999.
B. Lehane, N. O'Connor, and N. Murphy, “Action Sequence Detection in Motion Pictures,” Proceedings of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technology, London, U.K., November 2004.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer Science+Business Media, Inc.
About this entry
Cite this entry
Del Bimbo, A., Bertini, M. (2006). Video Automatic Annotation. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/0-387-30038-4_241
Download citation
DOI: https://doi.org/10.1007/0-387-30038-4_241
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-24395-5
Online ISBN: 978-0-387-30038-2
eBook Packages: Computer ScienceReference Module Computer Science and Engineering