skip to main content
10.1145/3293353.3293366acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvgipConference Proceedingsconference-collections
research-article

A Temporal Activity Graph Kernel for Human Activity Classification

Published:03 May 2020Publication History

ABSTRACT

Human activities are predominantly spatio-temporal, involving spatial changes over time. Qualitative spatial relations between interacting entities are often used to describe spatial change. To derive such qualitative spatial relations, the interacting entities are approximated as a single bounding box or set of bounding boxes. A set of bounding boxes abstracting a single entity has been termed as an extended object; where each box is bounding a component. Extended object abstraction of spatial entities has been shown to be more effective for representation of human activities [10]. The temporal aspect of an activity is characterized through changing spatial relations between components of interacting extended objects over time. In this paper, we propose Temporal Activity Graph (TAG) based representation model to keep track of the sequences of relations between components of the extended objects. A kernel is designed for classification of spatio-temporal interactions in the TAG based model. The TAG kernel uses concepts of label sequence similarity and interestingness to compute similarity of a pair of TAGs. The TAG kernel is a generic solution that can be used with any kernel based method. Here, the kernel is used within a Support Vector Machine classifier. The TAG kernel based classification of activities is found on par with the state-of-the-art approaches for experiments performed on the Mind's Eye, the UT Interaction, and the SBU Kinect Interaction datasets.

References

  1. M. Ahmad and Seong-Whan Lee. 2006. HMM-based Human Action Recognition Using Multiview Image Sequences. In 18th Intl. Conf. on Pattern Recognition (ICPR). IEEE, 263--266. https://doi.org/10.1109/ICPR.2006.630Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Thomas Bittner and Maureen Donnelly. 2007. A formal theory of qualitative size and distance relations between regions. In Proc. of the 21st Annual Workshop on Qualitative Reasoning (QR 2007).Google ScholarGoogle Scholar
  3. Henri Bouma, Gertjan Burghouts, Leo de Penning, Patrick Hanckmann, Johan-Martijn ten Hove, Sanne Korzec, Maarten Kruithof, Sander Landsmeer, Coen van Leeuwen, Sebastiaan van den Broek, Arvid Halma, Richard den Hollander, and Klamer Schutte. 2013. Recognition and localization of relevant human behavior in videos. In Sensors, and Command, Control, Communications, and Intelligence (C3I) Technologies for Homeland Security and Homeland Defense XII. SPIE, 87110B-10. https://doi.org/10.1117/12.2015877Google ScholarGoogle Scholar
  4. A. G. Cohn and S. M. Hazarika. 2001. Qualitative Spatial Representation and Reasoning: An Overview. Fundamenta Informaticae 46, 1-2 (2001), 1--29.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Anthony G. Cohn, Derek R. Magee, Aphrodite Galata, David Hogg, and Shyamanta M. Hazarika. 2003. Towards an Architecture for Cognitive Vision Using Qualitative Spatio-temporal Representations and Abduction. In Spatial Cognition. Springer Berlin Heidelberg, 232--248. https://doi.org/10.1007/3-540-45004-1_14Google ScholarGoogle Scholar
  6. Anthony G. Cohn, Jochen Renz, and Muralikrishna Sridhar. 2012. Thinking Inside the Box: A Comprehensive Spatial Representation for Video Analysis. In Proc. 13th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR2012). AAAI Press, 588--592.Google ScholarGoogle Scholar
  7. Krishna S. R. Dubba, Mehul Bhatt, Frank Dylla, David C. Hogg, and Anthony G. Cohn. 2012. Interleaved Inductive-Abductive Reasoning for Learning Complex Event Models.. In ILP (Lecture Notes in Computer Science), Vol. 7207. Springer, 113--129.Google ScholarGoogle Scholar
  8. Christian Freksa. 1992. Temporal Reasoning Based on Semi-intervals. Artificial Intelligence 54, 1-2 (1992), 199--227. https://doi.org/10.1016/0004-3702(92)90090-KGoogle ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Humayun Kabir, M. Robiul Hoque, Keshav Thapa, and Sung-Hyun Yang. 2016. Two-Layer Hidden Markov Model for Human Activity Recognition in Home Environments. Intl. Journal of Distributed Sensor Networks 12, 1 (2016), 12. https://doi.org/10.1155/2016/4560365Google ScholarGoogle Scholar
  10. Shobhanjana Kalita, Arindam Karmakar, and Shyamanta M. Hazarika. 2018. Efficient extraction of spatial relations for extended objects vis-à-vis human activity recognition in video. Applied Intelligence 48, 1 (2018), 204--219. https://doi.org/10.1007/s10489-017-0970-8Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Keerthy Kusumam. 2012. Relational Learning using body parts for Human Activity Recognition in Videos. Master's thesis. University of Leeds.Google ScholarGoogle Scholar
  12. Pierre Latouche and Fabrice Rossi. 2015. Graphs in machine learning:An introduction. In 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN). 207--218.Google ScholarGoogle Scholar
  13. Vlad I. Morariu, David Harwood, and Larry S. Davis. 2013. Tracking People's Hands and Feet Using Mixed Network AND/OR Search. IEEE Trans. on Pattern Recognition and Machine Intelligence (PAMI) 35, 5 (2013), 1248--1262.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S.U. Park, J.H. Park, M.A. Al-masni, M.A. Al-antari, Md.Z. Uddin, and T.-S. Kim. 2016. A Depth Camera-based Human Activity Recognition via Deep Learning Recurrent Neural Network for Health and Social Care Services. Procedia Computer Science 100 (2016), 78--84.Google ScholarGoogle ScholarCross RefCross Ref
  15. V. Ramakrishna, T. Kanade, and Y. Sheikh. 2013. Tracking Human Pose by Tracking Symmetric Parts. In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition. 3728--3735. https://doi.org/10.1109/CVPR.2013.478Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. David A. Randell, Zhan Cui, and Anthony Cohn. 1992. A Spatial Logic Based on Regions and Connection. In KR'92. Principles of Knowledge Representation and Reasoning: Proc. of the 3rd Int. Conf., Bernhard Nebel, Charles Rich, and William Swartout (Eds.). Morgan Kaufmann, 165--176.Google ScholarGoogle Scholar
  17. M. S. Ryoo and J. K. Aggarwal. 2010. UT-Interaction Dataset, ICPR contest on Semantic Description of Human Activities (SDHA). http://cvrc.ece.utexas.edu/SDHA2010/Human_Interaction.html. (2010).Google ScholarGoogle Scholar
  18. Spiros Skiadopoulos and Manolis Koubarakis. 2005. On the consistency of cardinal directions constraints. Artificial Intelligence 163 (2005), 91--135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Hajar Sadeghi Sokeh, Stephen Gould, and Jochen J. 2013. Efficient Extraction and Representation of Spatial Information from Video Data.. In Proc. of the 23rd Int. Joint Conf. on Artificial Intelligence (IJCAI'13). AAAI Press/IJCAI, 1076--1082.Google ScholarGoogle Scholar
  20. Muralikrishna Sridhar, Anthony G. Cohn, and David C. Hogg. 2010. Relational Graph Mining for Learning Events from Video. In 5th Starting AI Researchers Symposium (STAIRS). 315--327. https://doi.org/10.3233/978-1-60750-676-8-315Google ScholarGoogle Scholar
  21. D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. 2015. Learning Spatiotemporal Features with 3D Convolutional Networks. In 2015 IEEE Intl. Conf. on Computer Vision (ICCV). 4489--4497.Google ScholarGoogle Scholar
  22. R. Wagner and M. Fischer. 1974. The String-to-String Correction Problem. J. ACM 21, 1 (1974), 168--173.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Y. Wang and G. Mori. 2011. Hidden Part Models for Human Action Recognition: Probabilistic versus Max Margin. IEEE Trans. on Pattern Analysis and Machine Intelligence 33, 7 (2011), 1310--1323. https://doi.org/10.1109/TPAMI.2010.214Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. W. Xu, Z. Miao, and X. P. Zhang. 2015. Structured feature-graph model for human activity recognition. In IEEE Intl. Conf. on Image Processing (ICIP). IEEE, 1245--1249. https://doi.org/10.1109/ICIP.2015.7350999Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Kiwon Yun, Jean Honorio, Debaleena Chattopadhyay, Tamara L. Berg, and Dimitris Samaras. 2012. Two-person Interaction Detection Using Body-Pose Features and Multiple Instance Learning. In IEEE Comp. Soc. Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW),. IEEE.Google ScholarGoogle Scholar
  26. Yimeng Zhang, Xiaoming Liu, Ming-Ching Chang, Weina Ge, and Tsuhan Chen. 2012. Spatio-Temporal Phrases for Activity Recognition. In Proc. of 12th European Conf. on Computer Vision (ECCV)-Part III. Springer Berlin Heidelberg, 707--721. https://doi.org/10.1007/978-3-642-33712-3_51Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Yibiao Zhao, Steven Holtzen, Tao Gao, and Song-Chun Zhu. 2015. Represent and Infer Human Theory of Mind for Human-Robot Interaction. In AAAI Fall Symposium Series.Google ScholarGoogle Scholar
  28. Wentao Zhu, Cuiling Lan, Junliang Xing, Wenjun Zeng, Yanghao Li, Li Shen, and Xiaohui Xie. 2016. Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks. In Proc. of the 30th AAAI Conf. on Artificial Intelligence, 2016. 3697--3704.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Temporal Activity Graph Kernel for Human Activity Classification

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ICVGIP '18: Proceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing
            December 2018
            659 pages
            ISBN:9781450366151
            DOI:10.1145/3293353

            Copyright © 2018 ACM

            Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 3 May 2020

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            Overall Acceptance Rate95of286submissions,33%
          • Article Metrics

            • Downloads (Last 12 months)1
            • Downloads (Last 6 weeks)0

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader