DOI: 10.1145/1463542.1463554

Human action analysis, annotation and modeling in video streams based on implicit user interaction

Published: 31 October 2008


This paper proposes an integrated framework for analyzing human actions in video streams. Unlike most current approaches, which rely solely on automatic spatiotemporal analysis of sequences, the proposed method introduces the implicit user-in-the-loop concept for dynamically mining semantics and annotating video streams. This work sets a new and ambitious goal: to recognize, model, and properly use the "average user's" selections, preferences, and perception in order to dynamically extract content semantics. The proposed approach is expected to add significant value to the hundreds of billions of non-annotated or inadequately annotated video streams that exist on the Web, file servers, databases, etc. Furthermore, expert annotators can gain important knowledge about user preferences, selections, search styles, and perception.



Cited By

  • (2016) An Investigation of Textbook-Style Highlighting for Video. Proceedings of the 42nd Graphics Interface Conference, pp. 201-208. DOI: 10.5555/3076132.3076172. Online publication date: 1-Jun-2016
  • (2010) Implicit visual concept modeling in image / video annotation. Proceedings of the first ACM international workshop on Analysis and retrieval of tracked events and motion in imagery streams, pp. 33-38. DOI: 10.1145/1877868.1877878. Online publication date: 29-Oct-2010
  • (2009) Context information exchange and sharing in a peer-to-peer community. Proceedings of the 27th ACM international conference on Design of communication, pp. 265-272. DOI: 10.1145/1621995.1622048. Online publication date: 5-Oct-2009



Information & Contributors


Published In

AREA '08: Proceedings of the 1st ACM workshop on Analysis and retrieval of events/actions and workflows in video streams
October 2008
132 pages



Association for Computing Machinery

New York, NY, United States




Author Tags

  1. action modeling
  2. human action analysis
  3. human object detection
  4. user transparent interaction
  5. video annotation


  • Research-article


MM08: ACM Multimedia Conference 2008
October 31, 2008
Vancouver, British Columbia, Canada



