ABSTRACT
We present a new tool we have developed to ease the annotation of crowded environments, typical of visual surveillance datasets. Our tool is developed using HTML5 and Javascript and has two back-ends. A PHP based back-end implement the persistence using a relational database and manage the dynamic creation of pages and the authentication procedure. A python based REST server implement all the computer vision facilities to assist annotators. Our tool allows collaborative annotation of person identity, group membership, location, gaze and occluded parts. PACE supports multiple cameras and if calibration is provided the geometry is used to improve computer vision based assistance. We detail the whole interface comprising an administrative view that ease the setup of the system.
- M.R. Amer, P. Lei, and S. Todorovic. Hirf: Hierarchical random field for collective activity recognition in videos. In Proc of ECCV, 2014.Google ScholarCross Ref
- Federico Bartoli, Giuseppe Lisanti, Lorenzo Seidenari, and Alberto Del Bimbo. User interest profiling using tracking-free coarse gaze estimation. 2015.Google Scholar
- Federico Bartoli, Giuseppe Lisanti, Svebor Seidenari, Lorenzo Karaman, and Alberto Del Bimbo. Museumvisitors: a dataset for pedestrian and group detection, gaze estimation and behavior understanding. In Proc. of CVPR Int.'l Workshop on Group And Crowd Behavior Analysis And Understanding, 2015.Google ScholarCross Ref
- Federico Bartoli, Lorenzo Seidenari, Giuseppe Lisanti, Svebor Karaman, and Alberto Del Bimbo. Watts: a web annotation tool for surveillance scenarios. In ACM Multimedia, 2015. Google ScholarDigital Library
- L. Bazzani, V. Murino, and M. Cristani. Decentralized particle filter for joint individual-group tracking. In Proc. of CVPR, 2012. Google ScholarDigital Library
- W. Choi and S. Savarese. A unified framework for multi-target tracking and collective activity recognition. In Proc. of ECCV, 2012. Google ScholarDigital Library
- Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection. In Proc. of CVPR, 2005. Google ScholarDigital Library
- J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In Proc. of CVPR, 2009.Google ScholarCross Ref
- Mark Everingham, Luc Van Gool, Christopher K. Williams, John Winn, and Andrew Zisserman. The pascal visual object classes (voc) challenge. Int. J. Comput. Vision, 88(2):303--338, June 2010. Google ScholarDigital Library
- A. B. Godbehere, A. Matsukawa, and K. Goldberg. Visual tracking of human visitors under variable-lighting conditions for a responsive audio art installation. In 2012 American Control Conference (ACC), pages 4305--4312, June 2012.Google ScholarCross Ref
- Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. Transactions of the ASME--Journal of Basic Engineering, 82(Series D):35--45, 1960.Google Scholar
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classification with deep convolutional neural networks. In Proc. of NIPS. 2012. Google ScholarDigital Library
- V.Y. Mariano, J. Min, J.-H. Park, R. Kasturi, D. Mihalcik, D. Doermann, and T. Drayer. Performance evaluation of object detection algorithms. international conference on pattern recognition. In In Proc. of ICPR, 2002. Google ScholarDigital Library
- S. Pellegrini, A. Ess, K. Schindler, and L. Van Gool. You'll never walk alone: Modeling social behavior for multi-target tracking. In Proc. of ICCV, 2009.Google ScholarCross Ref
- B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme: a database and web-based tool for image annotation. International Journal of Computer Vision, 77:157--173, May 2008. Google ScholarDigital Library
- Carl Vondrick, Donald Patterson, and Deva Ramanan. Efficiently scaling up crowdsourced video annotation. International Journal of Computer Vision, pages 1--21. Google ScholarDigital Library
- Yi Yang and Deva Ramanan. Articulated pose estimation with flexible mixtures-of-parts. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1385--1392. IEEE, 2011. Google ScholarDigital Library
Index Terms
- PACE: Prediction-based Annotation for Crowded Environments
Recommendations
WATTS: a Web Annotation Tool for Surveillance Scenarios
MM '15: Proceedings of the 23rd ACM international conference on MultimediaIn this paper, we present a web based annotation tool we developed allowing creating collaboratively a detailed ground truth for datasets related to visual surveillance and behavior understanding. The system persistence is based on a relational database ...
An adaptive focus-of-attention model for video surveillance and monitoring
In current video surveillance systems, commercial pan/tilt/zoom (PTZ) cameras typically provide naive (or no) automatic scanning functionality to move a camera across its complete viewable field. However, the lack of scene-specific information ...
Distributed Interactive Video Arrays for Event Capture and Enhanced Situational Awareness
Computer vision promises to play a significant role in a wide range of homeland security applications. The objective is to apply computer vision techniques and algorithms under various environmental conditions for security, surveillance, and protection ...
Comments