Locating and recognizing multiple human actions by searching for maximum score subsequences

Zhang, Hong-Bo; Li, Shao-Zi; Chen, Shu-Yuan; Su, Song-Zhi; Lin, Xian-Ming; Cao, Dong-Lin

doi:10.1007/s11760-013-0501-y

Original Paper
Published: 22 June 2013

Volume 9, pages 705–714, (2015)

Signal, Image and Video Processing

Hong-Bo Zhang^2,1,
Shao-Zi Li^2,1,
Shu-Yuan Chen³,
Song-Zhi Su^2,1,
Xian-Ming Lin^1,2 &
…
Dong-Lin Cao^1,2

311 Accesses
2 Citations

Abstract

Despite the many methods for recognizing human actions in video, few are designed for videos that contain more than one action over a given time period. Moreover, existing multiple-action recognition methods adopt a windowed sequence search strategy, which requires exhaustively trying different window lengths and is therefore computationally intensive. This work presents a frame-based strategy that searches for the maximum score subsequences corresponding to actions. In this way, the start and end times of all actions are located and their action categories are identified. In addition, contrast mutual information is proposed as a new score function to increase recognition accuracy. Experimental results indicate that the proposed method accurately locates and recognizes multiple actions in a video, and that it remains accurate on the conventional single-action classification problem.
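
The abstract does not spell out the search procedure, so the following is only a minimal sketch of the general idea, assuming each frame has already been assigned a real-valued score for one action class (positive when the frame supports the action, negative otherwise). It uses the classic linear-time maximum-sum contiguous subsequence scan (often called Kadane's algorithm), which may differ from the authors' method. The function name `max_score_subsequence` and the example scores are hypothetical.

```python
from typing import Sequence, Tuple


def max_score_subsequence(scores: Sequence[float]) -> Tuple[int, int, float]:
    """Return (start, end, total) of the contiguous run of frames with the
    highest summed score. Indices are inclusive frame positions.

    Assumption: `scores` holds one per-frame score for a single action class;
    how those scores are computed is outside this sketch.
    """
    if not scores:
        raise ValueError("scores must contain at least one frame")

    best_start, best_end, best_total = 0, 0, scores[0]
    run_start, run_total = 0, 0.0

    for i, s in enumerate(scores):
        if run_total <= 0:
            # A non-positive running sum cannot help a later run: restart here.
            run_start, run_total = i, s
        else:
            run_total += s
        if run_total > best_total:
            best_start, best_end, best_total = run_start, i, run_total

    return best_start, best_end, best_total


# Hypothetical per-frame scores for one action class.
frame_scores = [-1.0, 2.0, 3.0, -6.0, 1.0, 2.0, -3.0]
print(max_score_subsequence(frame_scores))  # (1, 2, 5.0)
```

The scan visits each frame once, so no window length has to be chosen in advance. Under these assumptions, the returned start and end indices play the role of an action's start and end frames.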

Acknowledgments

The authors would like to thank the anonymous reviewers for their valuable and insightful comments on the earlier version of this manuscript. This work was partially supported by the National Natural Science Foundation of China (No. 61202143), the Natural Science Foundation of Fujian Province (No. 2011J01367), the Xiamen University 985 Project, and the National Science Council of Taiwan (NSC-101-2221-E-155-060).

Author information

Authors and Affiliations

School of Information Science and Technology, Xiamen University, Xiamen, China
Hong-Bo Zhang, Shao-Zi Li, Song-Zhi Su, Xian-Ming Lin & Dong-Lin Cao
Fujian Key Laboratory of the Brain-like Intelligent Systems, Xiamen University, Xiamen, China
Hong-Bo Zhang, Shao-Zi Li, Song-Zhi Su, Xian-Ming Lin & Dong-Lin Cao
Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
Shu-Yuan Chen

Corresponding authors

Correspondence to Shao-Zi Li or Song-Zhi Su.

About this article

Cite this article

Zhang, HB., Li, SZ., Chen, SY. et al. Locating and recognizing multiple human actions by searching for maximum score subsequences. SIViP 9, 705–714 (2015). https://doi.org/10.1007/s11760-013-0501-y

Received: 14 October 2012
Revised: 18 May 2013
Accepted: 18 May 2013
Published: 22 June 2013
Issue Date: March 2015
DOI: https://doi.org/10.1007/s11760-013-0501-y
