research-article

PACE: Prediction-based Annotation for Crowded Environments

Authors:
Federico Bartoli

University of Florence, Florence, Italy

University of Florence, Florence, Italy
View Profile

,
Giuseppe Lisanti

University of Florence, Florence, Italy

University of Florence, Florence, Italy
View Profile

,
Lorenzo Seidenari

Univerisity of Florence, Florence, Italy

Univerisity of Florence, Florence, Italy
View Profile

,
Alberto Del Bimbo

University of Florence, Florence, Italy

University of Florence, Florence, Italy
View Profile

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia RetrievalJune 2017Pages 121–124https://doi.org/10.1145/3078971.3079020

Published:06 June 2017Publication History

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval

Pages 121–124

ABSTRACT

We present a new tool we have developed to ease the annotation of crowded environments, typical of visual surveillance datasets. Our tool is developed using HTML5 and Javascript and has two back-ends. A PHP based back-end implement the persistence using a relational database and manage the dynamic creation of pages and the authentication procedure. A python based REST server implement all the computer vision facilities to assist annotators. Our tool allows collaborative annotation of person identity, group membership, location, gaze and occluded parts. PACE supports multiple cameras and if calibration is provided the geometry is used to improve computer vision based assistance. We detail the whole interface comprising an administrative view that ease the setup of the system.

References

M.R. Amer, P. Lei, and S. Todorovic. Hirf: Hierarchical random field for collective activity recognition in videos. In Proc of ECCV, 2014.Google ScholarCross Ref
Federico Bartoli, Giuseppe Lisanti, Lorenzo Seidenari, and Alberto Del Bimbo. User interest profiling using tracking-free coarse gaze estimation. 2015.Google Scholar
Federico Bartoli, Giuseppe Lisanti, Svebor Seidenari, Lorenzo Karaman, and Alberto Del Bimbo. Museumvisitors: a dataset for pedestrian and group detection, gaze estimation and behavior understanding. In Proc. of CVPR Int.'l Workshop on Group And Crowd Behavior Analysis And Understanding, 2015.Google ScholarCross Ref
Federico Bartoli, Lorenzo Seidenari, Giuseppe Lisanti, Svebor Karaman, and Alberto Del Bimbo. Watts: a web annotation tool for surveillance scenarios. In ACM Multimedia, 2015. Google ScholarDigital Library
L. Bazzani, V. Murino, and M. Cristani. Decentralized particle filter for joint individual-group tracking. In Proc. of CVPR, 2012. Google ScholarDigital Library
W. Choi and S. Savarese. A unified framework for multi-target tracking and collective activity recognition. In Proc. of ECCV, 2012. Google ScholarDigital Library
Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection. In Proc. of CVPR, 2005. Google ScholarDigital Library
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In Proc. of CVPR, 2009.Google ScholarCross Ref
Mark Everingham, Luc Van Gool, Christopher K. Williams, John Winn, and Andrew Zisserman. The pascal visual object classes (voc) challenge. Int. J. Comput. Vision, 88(2):303--338, June 2010. Google ScholarDigital Library
A. B. Godbehere, A. Matsukawa, and K. Goldberg. Visual tracking of human visitors under variable-lighting conditions for a responsive audio art installation. In 2012 American Control Conference (ACC), pages 4305--4312, June 2012.Google ScholarCross Ref
Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. Transactions of the ASME--Journal of Basic Engineering, 82(Series D):35--45, 1960.Google Scholar
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classification with deep convolutional neural networks. In Proc. of NIPS. 2012. Google ScholarDigital Library
V.Y. Mariano, J. Min, J.-H. Park, R. Kasturi, D. Mihalcik, D. Doermann, and T. Drayer. Performance evaluation of object detection algorithms. international conference on pattern recognition. In In Proc. of ICPR, 2002. Google ScholarDigital Library
S. Pellegrini, A. Ess, K. Schindler, and L. Van Gool. You'll never walk alone: Modeling social behavior for multi-target tracking. In Proc. of ICCV, 2009.Google ScholarCross Ref
B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme: a database and web-based tool for image annotation. International Journal of Computer Vision, 77:157--173, May 2008. Google ScholarDigital Library
Carl Vondrick, Donald Patterson, and Deva Ramanan. Efficiently scaling up crowdsourced video annotation. International Journal of Computer Vision, pages 1--21. Google ScholarDigital Library
Yi Yang and Deva Ramanan. Articulated pose estimation with flexible mixtures-of-parts. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1385--1392. IEEE, 2011. Google ScholarDigital Library

Index Terms

PACE: Prediction-based Annotation for Crowded Environments
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
2. Information systems
  1. World Wide Web
    1. Web interfaces

Recommendations

WATTS: a Web Annotation Tool for Surveillance Scenarios
MM '15: Proceedings of the 23rd ACM international conference on Multimedia

In this paper, we present a web based annotation tool we developed allowing creating collaboratively a detailed ground truth for datasets related to visual surveillance and behavior understanding. The system persistence is based on a relational database ...
Read More
An adaptive focus-of-attention model for video surveillance and monitoring

In current video surveillance systems, commercial pan/tilt/zoom (PTZ) cameras typically provide naive (or no) automatic scanning functionality to move a camera across its complete viewable field. However, the lack of scene-specific information ...
Read More
Distributed Interactive Video Arrays for Event Capture and Enhanced Situational Awareness

Computer vision promises to play a significant role in a wide range of homeland security applications. The objective is to apply computer vision techniques and algorithms under various environmental conditions for security, surveillance, and protection ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
June 2017
524 pages
ISBN:9781450347013
DOI:10.1145/3078971
General Chairs:
Bogdan Ionescu
University Politehnica of Bucharest, Romania
,
Nicu Sebe
University of Trento, Italy
,
Program Chairs:
Jiashi Feng
National University of Singapore, Singapore
,
Martha Larson
Radboud University & Delft University of Technology, The Netherlands
,
Rainer Lienhart
University of Augsburg, Germany
,
Cees Snoek
University of Amsterdam & Qualcomm Research Netherlands, The Netherlands
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 June 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
annotation
computer vision
surveillance
Qualifiers
- research-article
Conference

Acceptance Rates
ICMR '17 Paper Acceptance Rate33of95submissions,35%Overall Acceptance Rate254of830submissions,31%
More
Upcoming Conference
ICMR '24

Sponsor:

sigmm

International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket , Thailand
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 117
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

PACE: Prediction-based Annotation for Crowded Environments

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

WATTS: a Web Annotation Tool for Surveillance Scenarios

An adaptive focus-of-attention model for video surveillance and monitoring

Distributed Interactive Video Arrays for Event Capture and Enhanced Situational Awareness