skip to main content
10.1145/2499788.2499795acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Exploring dense trajectory feature and encoding methods for human interaction recognition

Authors Info & Claims
Published:17 August 2013Publication History

ABSTRACT

Recently, human activity recognition has obtained increasing attention due to its wide range of potential applications. Much progress has been made to improve the performance on single actions in videos while few on collective and interactive activities. Human interaction is a more challenging task owing to multi-actors in an execution. In this paper, we utilize multi-scale dense trajectories and explore four advanced feature encoding methods on the human interaction dataset with a bag-of-features framework. Particularly, dense trajectories are described by shape, histogram of gradient orientation, histogram of flow orientation and motion boundary histogram, and all these are computed by integral images. Experimental results on the UT-Interaction dataset show that our approach outperforms state-of-the-art methods by 7-14%. Additionally, we thoroughly analyse a finding that the performance of vector quantization is on par with or even better than other sophisticated feature encoding methods by using dense trajectories in videos.

References

  1. M. Blank, L. Gorelick, E. Shechtman, M. Irani, and R. Basri. Actions as space-time shapes. In ICCV, volume 2, pages 1395--1402, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Y.-L. Boureau, F. Bach, Y. LeCun, and J. Ponce. Learning mid-level features for recognition. In CVPR, pages 2559--2566, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  3. W. Brendel and S. Todorovic. Learning spatiotemporal graphs of human activities. In ICCV, pages 778--785, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. N. Dalal, B. Triggs, and C. Schmid. Human detection using oriented histograms of flow and appearance. ECCV, pages 428--441, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pages 65--72, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  6. A. Klaser, M. Marszalek, C. Schmid, et al. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  7. H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre. Hmdb: A large video database for human motion recognition. In ICCV, pages 2556--2563, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Laptev. On space-time interest points. IJCV, 64(2):107--123, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In CVPR, pages 1--8, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  10. H. Lee, A. Battle, R. Raina, and A. Ng. Efficient sparse coding algorithms. In NIPS, volume 19, pages 801--808, 2007.Google ScholarGoogle Scholar
  11. L. Liu, L. Wang, and X. Liu. In defense of soft-assignment coding. In ICCV, pages 2486--2493, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Marszalek, I. Laptev, and C. Schmid. Actions in context. In CVPR, pages 2929--2936, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  13. M. Ryoo. Human activity prediction: Early recognition of ongoing activities from streaming videos. In ICCV, pages 1036--1043, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Ryoo, C.-C. Chen, J. Aggarwal, and A. Roy-Chowdhury. An overview of contest on semantic description of human activities (sdha) 2010. Recognizing Patterns in Signals, Speech, Images and Videos, pages 270--285, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Schuldt, I. Laptev, and B. Caputo. Recognizing human actions: A local svm approach. In ICPR, volume 3, pages 32--36, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In MM, pages 357--360. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Waltisberg, A. Yao, J. Gall, and L. Van Gool. Variations of a hough-voting action recognition system. Recognizing Patterns in Signals, Speech, Images and Videos, pages 306--312, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Wang, A. Klaser, C. Schmid, and C.-L. Liu. Action recognition by dense trajectories. In CVPR, pages 3169--3176, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Dense trajectories and motion boundary descriptors for action recognition. IJCV, Mar. 2013.Google ScholarGoogle ScholarCross RefCross Ref
  20. J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong. Locality-constrained linear coding for image classification. In CVPR, pages 3360--3367, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  21. G. Willems, T. Tuytelaars, and L. Van Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. ECCV, pages 650--663, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial pyramid matching using sparse coding for image classification. In CVPR, pages 1794--1801, 2009.Google ScholarGoogle Scholar
  23. L. Yeffet and L. Wolf. Local trinary patterns for human action recognition. In ICCV, pages 492--497, 2009.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Exploring dense trajectory feature and encoding methods for human interaction recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICIMCS '13: Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
      August 2013
      419 pages
      ISBN:9781450322522
      DOI:10.1145/2499788
      • Conference Chair:
      • Tat-Seng Chua,
      • General Chairs:
      • Ke Lu,
      • Tao Mei,
      • Xindong Wu

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 August 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      ICIMCS '13 Paper Acceptance Rate20of94submissions,21%Overall Acceptance Rate163of456submissions,36%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader