Abstract
A system for managing, annotating, and editing video sequences is an essential tool in research on human action recognition and on tracking people or objects. Because the annotation process is complex and expensive, crowdsourced marketplace-based tools are often used to make it cost effective. This paper presents such a tool: VATRAC, a video editor for annotating human actions and object trajectories. It supports flexible viewing of video sequences under a selected configuration of annotation layers, and the addition and editing of annotations for actions and for trajectories of entire objects or of selected object parts. Video sequences can be queried by a variety of criteria and preferences, for example searching for subsequences annotated with a given action class.
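The abstract describes annotations organized into layers (actions, object trajectories) over frame ranges, with queries that retrieve subsequences carrying a given action class. A minimal sketch of such a data model is shown below; the class and function names here are hypothetical illustrations, not VATRAC's actual API or storage format.

```python
from dataclasses import dataclass, field

@dataclass
class TrajectoryPoint:
    """One sampled position of an object (or object part) in a frame."""
    frame: int
    x: float
    y: float

@dataclass
class Annotation:
    """One annotation in a named layer, spanning a frame range."""
    layer: str                # hypothetical layer name, e.g. "actions" or "trajectories"
    label: str                # action class, or object identifier for a trajectory
    start_frame: int
    end_frame: int
    trajectory: list = field(default_factory=list)  # TrajectoryPoint list; empty for actions

def query_by_action(annotations, action_class):
    """Return (start, end) frame ranges of subsequences annotated with the action class."""
    return [(a.start_frame, a.end_frame)
            for a in annotations
            if a.layer == "actions" and a.label == action_class]

# Toy sequence: two action annotations plus one trajectory layer entry.
anns = [
    Annotation("actions", "walking", 0, 120),
    Annotation("actions", "running", 121, 300),
    Annotation("trajectories", "person-1", 0, 300),
]
print(query_by_action(anns, "running"))  # [(121, 300)]
```

Separating annotations into layers this way lets a viewer toggle each layer independently, which matches the "selected configuration of annotation layers" described above.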
Acknowledgments
This work has been supported by the National Centre for Research and Development (project UOD-DEM-1-183/001 “Intelligent video analysis system for behavior and event recognition in surveillance networks”).
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
Cite this paper
Kulbacki, M., Wereszczyński, K., Segen, J., Sachajko, M., Bąk, A. (2016). Video Editor for Annotating Human Actions and Object Trajectories. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.P. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science, vol 9622. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49390-8_44
DOI: https://doi.org/10.1007/978-3-662-49390-8_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-49389-2
Online ISBN: 978-3-662-49390-8