Cross-view action matching using a novel projective invariant on non-coplanar space-time points

Jia, Qi; Fan, Xin; Luo, Zhongxuan; Li, Haojie; Huyan, Kang; Li, Zezhou

doi:10.1007/s11042-015-2704-4

Cross-view action matching using a novel projective invariant on non-coplanar space-time points

Published: 10 June 2015

Volume 75, pages 11661–11682, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Qi Jia¹,
Xin Fan¹,
Zhongxuan Luo¹,
Haojie Li¹,
Kang Huyan¹ &
…
Zezhou Li²

349 Accesses
Explore all metrics

Abstract

Existing action matching methods from the geometric respect typically assume the collinearity or coplanarity for view invariance. These assumptions curb the application to uncontrolled action patterns. In this paper, a new projective invariant named characteristic number (CN) is used, which can be used to describe 3D non-coplanar points. For motion trajectories of actions, we propose the temporal CN (TCN) for individual joint point of a human body in temporal series. This view-invariant feature can characterize an action well with limited number of joints(a single one in our experiments). In addition to TCN, we are also able to define the spatial characteristic number (SCN) on several (five in our paper) joint points in the spatial domain for one frame. SCN works complementary to temporal features, when limited snapshots of an action are available. We validate both SCN and TCN on the widely used CMU Motion Capture Database (Mocap) database, KTH Multiview Football Dataset II and IXMAS dataset. The promising recognition results indicate the invariance to varying viewpoints compared with the state-of-the-art. The results on CMU and KTH database corrupted by noise show the robustness to noise.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PLAViMoP database: A new continuously assessed and collaborative 3D point-light display dataset

Article 19 April 2022

View-independent action recognition: a hybrid approach

Article 24 April 2015

Multi-view key information representation and multi-modal fusion for single-subject routine action recognition

Article 29 February 2024

References

Ahmad M, Lee SW (2006) HMM-based human action recognition using multiview image sequences. In: Proceedings of ICPR
Ashraf N, Shen Y, Cao X, Foroosh H (2013) View invariant action recognition using weighted fundamental ratios. Computer Vision and Image Understanding
Cuzzolin F (2006) Using bilinear models for view-invariant action and identity recognition. In: Proceedings of CVPR
Efros AA, Berg AC, Mori G, Malik J (2003) Recognizing action at a distance. In: Proceedings of CVPR
Farhadi A, Tabrizi MK (2008) Learning to recognize activities from the wrong view point. In: Computer vision–ECCV 2008. Springer, pp 154–166
Gondal I, Murshed M, ul Haq A (2011) On dynamic scene geometry for view-invariant action matching. In: Proceedings of CVPR
Grabner H, Bischof H (2006) On-line boosting and vision. In: 2006 IEEE computer society conference on computer vision and pattern recognition. IEEE, vol 1, pp 260–267
Huang K, Zhang Y, Tan T (2012) A discriminative model of motion and cross ratio for view-invariant action recognition. IEEE Trans Image Processing 21(4):2187–2197
Article MathSciNet Google Scholar
Iosifidis A, Tefas A, Pitas I (2012) View-invariant action recognition based on artificial neural networks. IEEE Transactions on Neural Networks and Learning Systems 23(3):412–424
Article Google Scholar
Junejo IN, Dexter E, Laptev I, Pérez P (2008) Cross-view action recognition from temporal self-similarities. Springer
Laptev I (2005) On space-time interest points. Int J Comput Vis 64(2-3):107–123
Article Google Scholar
Le QV, Zou WY, Yeung SY, Ng AY (2011) Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 3361–3368
Li R, Zickler T (2012) Discriminative virtual views for cross-view action recognition. In: Proceedings of CVPR
Luo Z, Zhou X, Gu DX (2014) From a projective invariant to some new properties of algebraic hypersurfaces. Science China Mathematics 57(11):2273–2284
Article MathSciNet MATH Google Scholar
Lv F, Nevatia R (2007) Single view human action recognition using key pose matching and viterbi path searching. In: Proceedings of CVPR
Moeslund TB, Hilton A, Kruger V (2006) A survey of advances in vision-based human motion capture and analysis. Comput Vis Image Underst 104(2):90–126
Article Google Scholar
Parameswaran V, Chellappa R (2003) View invariants for human action recognition. In: Proceedings of CVPR
Rao C, Yilmaz A, Shah M (2002) View-invariant representation and recognition of actions. Int J Comput Vis 50(2):203–226
Article MATH Google Scholar
Reddy KK, Liu J, Shah M (2009) Incremental action recognition using feature-tree. In: 2009 IEEE 12th international conference on computer vision. IEEE, pp 1010–1017
Richard H, Zisserman A (2004) Multiple view geometry in computer vision, 2nd edn. Cambridge University Press
Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: Proceedings of ICPR
Shen Y, Foroosh H (2008) View-invariant action recognition using fundamental ratios. In: Proceedings of CVPR
Srestasathiern P, Yilmaz A (2011) Planar shape representation and matching under projective transformation. Comput Vis Image Underst 115(11):1525–1535
Article Google Scholar
Vahid K, Burenius M, Azizpour H, Sullivan J (2013) Multi-view body part recognition with random forests. In: Proceedings of BMVC
Weinland D, Boyer E, Ronfard R (2007) Action recognition from arbitrary views using 3d exemplars. In: Proceedings of ICCV
Weinland D, Özuysal M, Fua P (2010) Making action recognition robust to occlusions and viewpoint changes. In: Computer vision ECCV 2010. Springer, pp 635–648
Wu X, Jia Y (2012) View-invariant action recognition using latent kernelized structural SVM. In: Proceedings of ECCV
Yan P, Khan SM, Shah M (2008) Learning 4d action feature models for arbitrary view action recognition. In: IEEE conference on computer vision and pattern recognition, 2008. CVPR 2008. IEEE, pp 1–7
Zhang Y, Huang K, Huang Y, Tan T (2009) View-invariant action recognition using cross ratios across frames. In: Proceedings of ICIP
Zhang Z, Wang C, Xiao B, Zhou W, Liu S, Shi C (2013) Cross-view action recognition via a continuous virtual path. In: Proceedings of CVPR
Zhu G, Xu C, Gao W, Huang Q (2006) Action recognition in broadcast tennis video using optical flow and support vector machine. In: Computer vision in human-computer interaction. Springer, pp 89–98

Download references

Acknowledgments

This work is partially supported by the Natural Science Foundation of China under grant Nos. 61402077, 61033012, 11171052, 61272371, 61003177 and 61328206, the program for New Century Excellent Talents (NCET-11-0048), Civil Aviation Administration of China (No. U1233110), and Fundamental Research Funds for the Central Universities.

Author information

Authors and Affiliations

Software School of Dalian University of Technology, 321 Tuqiang Street, Development Area, Dalian, China
Qi Jia, Xin Fan, Zhongxuan Luo, Haojie Li & Kang Huyan
Department of Information Science and Technology, Donghua University, 2999 North Renmin Road, Songjiang District, Shanghai, China
Zezhou Li

Authors

Qi Jia
View author publications
You can also search for this author in PubMed Google Scholar
Xin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Zhongxuan Luo
View author publications
You can also search for this author in PubMed Google Scholar
Haojie Li
View author publications
You can also search for this author in PubMed Google Scholar
Kang Huyan
View author publications
You can also search for this author in PubMed Google Scholar
Zezhou Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xin Fan.

Appendix: Proof of Theorem 1

Assume A is the matrix of a given projective transformation Φ. $\mathcal {P}^{\prime }=\{P_{i}^{\prime }\}_{i=1}^{r}$ represent the projected points of $\mathcal {P}=\{P_{i}\}_{i=1}^{r}$ with Φ, and P _i′=Φ(P _i)=k _i A P _i(P _r+1 = P ₁, k _i+1 = k ₁). $\mathcal {Q}=\left \{Q_{i}^{(j)}\right \}_{i=1,2,\ldots ,r}^{j=1,2,\ldots ,n}$ are projected to $\mathcal {Q}^{\prime }=\left \{Q_{i}^{(j)\prime }\right \}_{i=1,2,\ldots ,r}^{j=1,2,\ldots ,n}$, then

$$\begin{array}{@{}rcl@{}} Q_{i}^{(j)\prime}&=&\varPhi(Q_{i}^{(j)})= l_{i}^{(j)}AQ_{i}^{(j)} = l_{i}^{(j)}A \cdot (a_{i}^{(j)} P_{i} + b_{i}^{(j)} P_{i+1}) \\ &=&\frac{l_{i}^{(j)}a_{i}^{(j)}}{k_{i}} \cdot k_{i}AP_{i} + \frac{l_{i}^{(j)}b_{i}^{(j)}}{k_{i+1}} \cdot k_{i+1}AP_{i+1} \\ &=&\frac{l_{i}^{(j)}a_{i}^{(j)}}{k_{i}}P_{i}^{\prime} + \frac{l_{i}^{(j)}b_{i}^{(j)}}{k_{i+1}} P_{i+1}^{\prime}. \end{array} $$

Thus, the characteristic number of transformed points is given by

$$\begin{array}{@{}rcl@{}} CN(\mathcal{P}^{\prime},\mathcal{Q}^{\prime})&=& \prod\limits_{i=1}^{r} \prod\limits_{j=1}^{n} \left( \frac{k_{i+1}l_{i}^{(j)}a_{i}^{(j)}}{k_{i}l_{i}^{(j)}b_{i}^{(j)}}\right) \\ &=& \prod\limits_{i=1}^{r} \frac{k_{i+1}}{k_{i}} \left( \prod\limits_{j=1}^{n}\frac{a_{i}^{(j)}}{b_{i}^{(j)}}\right) \\ &=& \prod\limits_{i=1}^{r} \left( \prod\limits_{j=1}^{n}\frac{a_{i}^{(j)}}{b_{i}^{(j)}}\right) = CN(\mathcal{P},\mathcal{Q}), \end{array} $$

which indicates that the characteristic number is invariant under projective transformations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jia, Q., Fan, X., Luo, Z. et al. Cross-view action matching using a novel projective invariant on non-coplanar space-time points. Multimed Tools Appl 75, 11661–11682 (2016). https://doi.org/10.1007/s11042-015-2704-4

Download citation

Received: 03 January 2015
Revised: 13 May 2015
Accepted: 20 May 2015
Published: 10 June 2015
Issue Date: October 2016
DOI: https://doi.org/10.1007/s11042-015-2704-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cross-view action matching using a novel projective invariant on non-coplanar space-time points

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

PLAViMoP database: A new continuously assessed and collaborative 3D point-light display dataset

View-independent action recognition: a hybrid approach

Multi-view key information representation and multi-modal fusion for single-subject routine action recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix: Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Cross-view action matching using a novel projective invariant on non-coplanar space-time points

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

PLAViMoP database: A new continuously assessed and collaborative 3D point-light display dataset

View-independent action recognition: a hybrid approach

Multi-view key information representation and multi-modal fusion for single-subject routine action recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix: Proof of Theorem 1

Appendix: Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation