Multiple Cue Integrated Action Detection

Jung, Sang-Hack; Guo, Yanlin; Sawhney, Harpreet; Kumar, Rakesh

doi:10.1007/978-3-540-75773-3_12

Sang-Hack Jung¹,
Yanlin Guo¹,
Harpreet Sawhney¹ &
…
Rakesh Kumar¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4796))

Included in the following conference series:

International Workshop on Human-Computer Interaction

1478 Accesses
2 Citations

Abstract

We present an action recognition scheme that integrates multiple modality of cues that include shape, motion and depth to recognize human gesture in the video sequences. In the proposed approach we extend classification framework that is commonly used in 2D object recognition to 3D spatio-temporal space for recognizing actions. Specifically, a boosting-based classifier is used that learns spatio-temporal features specific to target actions where features are obtained from temporal patterns of shape contour, optical flow and depth changes occuring at local body parts. The individual features exhibit different strength and sensitivity depending on many factors that include action, underlying body parts and background. In the current method, the multiple cues of different modalities are combined optimally by fisher linear discriminant to form a strong feature that preserve strength of individual cues. In the experiment, we apply the integrated action classifier on a set of target actions and evaluate its performance by comparing with single cue-based cases and present qualitative analysis of performance gain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shechtman, E., Irani, M.: Space-time behavior based correlation. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog., Washington, DC, USA, pp. 405–412. IEEE Computer Society Press, Los Alamitos (2005)
Google Scholar
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. International Conference on Computer Vision 02, 734 (2003)
Article Google Scholar
Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: International Conference on Computer Vision, Washington, DC, USA, pp. 166–173. IEEE Computer Society Press, Los Alamitos (2005)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: International Conference on Pattern Recognition, Washington, DC, USA, pp. 32–36. IEEE Computer Society Press, Los Alamitos (2004)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog., Washington, DC, USA, pp. 886–893. IEEE Computer Society Press, Los Alamitos (2005)
Google Scholar
Shet, V., Prasad, V., Elgammal, A., Yacoob, Y., Davis, L.: Multi-cue exemplar-based nonparametric model for gesture recognition. In: Indian Conference on Computer Vision, Graphics and Image Processing (2004)
Google Scholar
Sidenbladh, H.: Probabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences. PhD Thesis TRITA-NA-0114, Dept. of Numerical Analysis and Computer Science, KTH, Sweden (2001) ISBN 91-7283-169-3
Google Scholar
Giebel, J., Gavrila, D., Schnorr, C.: A bayesian framework for multi-cue 3d object tracking. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 241–252. Springer, Heidelberg (2004)
Google Scholar
Paletta, L., Paar, G.: Bayesian decision fusion for dynamic multi-cue object detection. In: Indian Conference on Computer Vision, Graphics and Image Processing (2002)
Google Scholar
Birchfield, S.: Elliptical head tracking using intensity gradients and color histograms. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog. (1998)
Google Scholar
Spengler, M., Schiele, B.: Towards robust multi-cue integration for visual tracking. IEEE Trans. Pattern Anal. Machine Intell. 13(9), 891–906 (1991)
Article Google Scholar
Shan, Y., Sawhney, H., Kumar, R.: Unsupervised learning of discriminative edge measures for vehicle matching between non-overlapping cameras. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog. (2005)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog. (2001)
Google Scholar
Shan, Y., Han, F., Sawhney, H., Kumar, R.: Learning exemplar-based categorization for the detection of multi-view multi-pose objects. In: Proc. IEEE Conf. on Comp. Vision and Patt. Recog., Washington, DC, USA, pp. 1431–1438. IEEE Computer Society Press, Los Alamitos (2006)
Google Scholar
Jung, S.H., Shan, Y., Sawhney, H., Aggarwal, M.: Action detection using approximated spatio-temporal adaboost. Technical Report, Sarnoff Corporation (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Sarnoff Corporation, 201 Washington Road, Princeton, NJ 08543, USA
Sang-Hack Jung, Yanlin Guo, Harpreet Sawhney & Rakesh Kumar

Authors

Sang-Hack Jung
View author publications
You can also search for this author in PubMed Google Scholar
Yanlin Guo
View author publications
You can also search for this author in PubMed Google Scholar
Harpreet Sawhney
View author publications
You can also search for this author in PubMed Google Scholar
Rakesh Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Michael Lew Nicu Sebe Thomas S. Huang Erwin M. Bakker

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jung, SH., Guo, Y., Sawhney, H., Kumar, R. (2007). Multiple Cue Integrated Action Detection. In: Lew, M., Sebe, N., Huang, T.S., Bakker, E.M. (eds) Human–Computer Interaction. HCI 2007. Lecture Notes in Computer Science, vol 4796. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75773-3_12

Download citation

DOI: https://doi.org/10.1007/978-3-540-75773-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75772-6
Online ISBN: 978-3-540-75773-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics