skip to main content
10.1145/1463542.1463554acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Human action analysis, annotation and modeling in video streams based on implicit user interaction

Published: 31 October 2008 Publication History

Abstract

This paper proposes an integrated framework for analyzing human actions in video streams. Despite most current approaches that are just based on automatic spatiotemporal analysis of sequences, the proposed method introduces the implicit user-in-the-loop concept for dynamically mining semantics and annotating video streams. This work sets a new and ambitious goal: to recognize, model and properly use "average user's" selections, preferences and perception, for dynamically extracting content semantics. The proposed approach is expected to add significant value to hundreds of billions of non-annotated or inadequately annotated video streams existing in the Web, file servers, databases etc. Furthermore expert annotators can gain important knowledge relevant to user preferences, selections, styles of searching and perception.

References

[1]
A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, "Content-Based Image Retrieval at the End of the Early Years," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 12, pp. 1349--1380, Dec. 2000.
[2]
Arnab Bhattacharya, Vebjorn Ljosa, Jia-Yu Pan, Mark R. Verardo, Hyungjeong Yang, Christos Faloutsos, and Ambuj K. Singh, "ViVo: Visual Vocabulary Construction for Mining Biomedical Images," Fifth IEEE International Conference on Data Mining, Houston, Texas, November 2005.
[3]
Li, J. and Wang, J. Z., "Real-time computerized annotation of pictures," In Proc. ACM Multimedia, 2006.
[4]
Joshi, D., Wang, J. Z., and Li, J., "The story picturing engine - a system for automatic text illustration," ACM Trans. Multimedia Computing, Communications and Applications, vol. 2, no. 1, p.p. 68--89, 2006.
[5]
Stevenson, K., Leung, C., "Comparative evaluation of Web image search engines for multimedia applications," IEEE International Conference on Multimedia and Expo, July 2005.
[6]
Search Engine Statistics For 2006-07," SEO Weekly Article, www.accuracast.com.
[7]
comScore's qSearch 2.0 service", comScore's Report Article, www.comscore.com
[8]
Jansen, B. J., Spink, A., and Saracevic, T., "Real life, real users, and real needs: A study and analysis of user queries on the web," Information Processing & Management, 36(2), 207--227, 2000.
[9]
Y. Deng, and B. S. Manjunath, "Unsupervised segmentation of color-texture regions in images and video", IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI '01), vol. 23, no. 8, pp. 800--810, Aug. 2001.
[10]
N. Tsapatsoulis, C. Pattichis, A. Kounoudes, C. Loizou, A. Constantinides, J. G. Taylor "Visual Attention based Region of Interest Coding for Video-telephony Applications", 5th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP'06), Patras, Greece, July 2006.
[11]
N. Tsapatsoulis, Y. Avrithis and S. Kollias, "Facial Image Indexing in Multimedia Databases," Pattern Analysis and Applications, Vol. 4, Issue 2/3, pp 93--107, 2001.
[12]
A. D. Doulamis, N. D. Doulamis and S. D. Kollias, "On Line Retrainable Neural Networks: Improving the Performance of Neural Network in Image Analysis problems," IEEE Trans. on Neural Networks, Vol. 11, No. 1, pp. 137--155, January 2000.
[13]
D. C. Park, M. A. EL-Sharkawi, and R. J. Marks II, "An Adaptively Trained Neural Network," IEEE Trans. on Neural Networks, vol. 2, pp. 334--345, 1991.
[14]
D. J. Luenberger. Linear and non Linear Programming. Addison-Wesley 1984.

Cited By

View all
  • (2016)An Investigation of Textbook-Style Highlighting for VideoProceedings of the 42nd Graphics Interface Conference10.5555/3076132.3076172(201-208)Online publication date: 1-Jun-2016
  • (2010)Implicit visual concept modeling in image / video annotationProceedings of the first ACM international workshop on Analysis and retrieval of tracked events and motion in imagery streams10.1145/1877868.1877878(33-38)Online publication date: 29-Oct-2010
  • (2009)Context information exchange and sharing in a peer-to-peer communityProceedings of the 27th ACM international conference on Design of communication10.1145/1621995.1622048(265-272)Online publication date: 5-Oct-2009

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
AREA '08: Proceedings of the 1st ACM workshop on Analysis and retrieval of events/actions and workflows in video streams
October 2008
132 pages
ISBN:9781605583181
DOI:10.1145/1463542
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 October 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. action modeling
  2. human action analysis
  3. human object detection
  4. user transparent interaction
  5. video annotation

Qualifiers

  • Research-article

Conference

MM08
Sponsor:
MM08: ACM Multimedia Conference 2008
October 31, 2008
British Columbia, Vancouver, Canada

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2016)An Investigation of Textbook-Style Highlighting for VideoProceedings of the 42nd Graphics Interface Conference10.5555/3076132.3076172(201-208)Online publication date: 1-Jun-2016
  • (2010)Implicit visual concept modeling in image / video annotationProceedings of the first ACM international workshop on Analysis and retrieval of tracked events and motion in imagery streams10.1145/1877868.1877878(33-38)Online publication date: 29-Oct-2010
  • (2009)Context information exchange and sharing in a peer-to-peer communityProceedings of the 27th ACM international conference on Design of communication10.1145/1621995.1622048(265-272)Online publication date: 5-Oct-2009

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media