PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint

Tong, Ming; Bai, He; Yue, Xing; Bu, Haili

doi:10.1007/s00521-020-04783-0

PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint

Original Article
Published: 20 February 2020

Volume 32, pages 13759–13781, (2020)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Ming Tong¹,
He Bai¹,
Xing Yue¹ &
…
Haili Bu¹

288 Accesses
1 Citation
Explore all metrics

Abstract

Complex action recognition possesses significant academic research value, potential commercial value and broad market application prospect. For improving its performance, a local-weighted nonnegative matrix factorization with rank regularization constraint (LWNMF_RC) is firstly presented, which removes complex background and then obtains motion salient regions. Secondly, a dual-manifold regularized nonnegative matrix factorization with sparsity constraint (DMNMF_SC) is proposed, which not only considers the short-term and middle-term temporal dependencies implied in data manifold, but also mines the geometric structure hidden in feature manifold. In addition, the introduction of sparsity constraint makes features possess better discriminativeness. Thirdly, a deep DMNMF_SC method is constructed, which acquires more hierarchical and discriminative features. Finally, a long-term temporal memory model with probability transfer learning (PTL-LTM) is proposed, which accurately memorizes the long-term temporal dependency among multiple simple action segments and, meanwhile, makes full use of the probability features of rich labeled simple actions and then applies the knowledge learned from simple actions for complex action recognition. Consequently, the performance is effectively improved.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 14

NMF with local constraint and Deep NMF with temporal dependencies constraint for action recognition

Article 21 August 2018

Semantic Image Networks for Human Action Recognition

Article 22 October 2019

Probability Matrix SVM+ Learning for Complex Action Recognition

References

Chen Y, Yi Z (2019) Locality-constrained least squares regression for subspace clustering. Knowl-Based Syst 163:51–56
Article Google Scholar
Lu Y, Lai Z, Xu Y, Li X, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405
Article Google Scholar
Lu C, Feng J, Lin Z, Mei T, Yan S (2019) Subspace clustering by block diagonal representation. IEEE Trans Pattern Anal Mach Intell 41(2):487–501
Article Google Scholar
Lu C, Feng J, Chen Y, Liu W, Lin Z, Yan S (2019) Tensor robust principal component analysis with a new tensor nuclear norm. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/tpami.2019.2891760
Article Google Scholar
Alawadi S, Fernández-Delgado M, Mera D, Barro S (2018) Polynomial kernel discriminant analysis for 2D visualization of classification problems. Neural Comput Appl. https://doi.org/10.1007/s00521-017-3290-3
Article Google Scholar
Xu KK, Li HX, Liu Z (2018) ISOMAP-based spatiotemporal modeling for lithium-ion battery thermal process. IEEE Trans Ind Inf 14(2):569–577
Article Google Scholar
Lee DD, Seung HS (1999) Learning the parts of objects by nonnegative matrix factorization. Nature 401(6755):788–791
Article MATH Google Scholar
Yuan X, Han L, Qian S, Xu G, Yan H (2019) Singular value decomposition based recommendation using imputed data. Knowl-Based Syst 163:485–494
Article Google Scholar
Yi Y, Wang J, Zhou W, Zheng C, Kong J, Qiao S (2019) Non-negative matrix factorization with locality constrained adaptive graph. IEEE Trans Circuits Syst Video Technol. https://doi.org/10.1109/tcsvt.2019.2892971
Article Google Scholar
Zhu W, Yan Y, Peng Y (2018) Topological structure regularized nonnegative matrix factorization for image clustering. Neural Comput Appl. https://doi.org/10.1007/s00521-018-3572-4
Article Google Scholar
Yang S, Zhang L, He X, Yi Z (2019) Learning manifold structures with subspace segmentations. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2019.2895497
Article Google Scholar
Zhang H, Wang S, Xu X, Chow TW, Wu QJ (2018) Tree2Vector: learning a vectorial representation for tree-structured data. IEEE Trans Neural Netw Learn Syst 99:1–15
MathSciNet Google Scholar
Gao H, Nie F, Huang H (2017) Local centroids structured non-negative matrix factorization. In: Proceedings of the thirty-first AAAI conference on artificial intelligence, pp 1905–1911
Huang S, Zhao P, Ren Y, Li T, Xu Z (2019) Self-paced and soft-weighted nonnegative matrix factorization for data representation. Knowl-Based Syst 164:29–37
Article Google Scholar
Liu F, Xu X, Qiu S, Tao D (2016) Simple to complex transfer learning for action recognition. IEEE Trans Image Process 25(2):949–960
Article MathSciNet MATH Google Scholar
Zhang J, Hu H (2019) Domain learning joint with semantic adaptation for human action recognition. Pattern Recognit 90:196–209
Article Google Scholar
Li J, Wong Y, Zhao Q, Kankanhalli MS (2017) Attention transfer from web images for video recognition. In: Proceedings of the 25th ACM international conference on multimedia, pp 1–9
Luo Z, Zou Y, Hoffman J, Fei-Fei L (2017) Label efficient learning of transferable representations acrosss domains and tasks. In: Proceedings of advances in neural information processing systems (NIPS), pp 165–177
Duan L, Xu D, Tsang IWH, Luo J (2012) Visual event recognition in videos by learning from web data. IEEE Trans Pattern Anal Mach Intell 34(9):1667–1680
Article Google Scholar
Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681
Article Google Scholar
Wu F, Hu Y, Gao J, Sun Y, Yin B (2016) Ordered subspace clustering with block-diagonal priors. IEEE Trans Cybern 46(12):3209–3219
Article Google Scholar
Wang J, Tian F, Liu CH, Yu H, Wang X, Tang X (2017) Robust nonnegative matrix factorization with ordered structure constraints. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 478–485
Xiang Y, Zhang G, Gu S, Cai J (2018) Online multi-layer dictionary pair learning for visual classification. Expert Syst Appl 105:174–182
Article Google Scholar
Su B, Zhou J, Ding X, Wang H, Wu Y (2016) Hierarchical dynamic parsing and encoding for action recognition. In: Proceedings of European conference on computer vision (ECCV), pp 202–217
Trigeorgis G, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429
Article Google Scholar
Kulis B (2012) Metric learning: a survey. Found Trends Mach Learn 5(4):287–364
Article MathSciNet MATH Google Scholar
Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79
Article MathSciNet Google Scholar
Cai D, He X, Han J, Huang TS (2011) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560
Article Google Scholar
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905
Article Google Scholar
Reddy KK, Shah M (2013) Recognizing 50 human action categories of web videos. Mach Vis Appl 24(5):971–981
Article Google Scholar
Niebles JC, Chen CW, Fei-Fei L (2010) Modeling temporal structure of decomposable motion segments for activity classification. In: Proceedings of European conference on computer vision (ECCV), pp 392–405
Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: A local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36
Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253
Article Google Scholar
Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666
Article Google Scholar
Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16
Article MATH Google Scholar
Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(9):1–57
MathSciNet MATH Google Scholar
Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of the international conference on machine learning (ICML), pp 663–670
Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103
Article MATH Google Scholar
Xia G, Sun H, Feng L, Zhang G, Liu Y (2018) Human motion segmentation via robust kernel sparse subspace clustering. IEEE Trans Image Process 27(1):135–150
Article MathSciNet MATH Google Scholar
Everts I, Van Gemert JC, Gevers T (2014) Evaluation of color spatio-temporal interest points for human action recognition. IEEE Trans Image Process 23(4):1569–1580
Article MathSciNet MATH Google Scholar
Ciptadi A, Goodwin MS, Rehg JM (2014) Movement pattern histogram for action recognition and retrieval. In: Proceedings of European conference on computer vision (ECCV), pp 695–710
Narayan S, Ramakrishnan KR (2014) A cause and effect analysis of motion trajectories for modeling actions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2633–2640
Liu J, Huang Y, Peng X, Wang L (2015) Multi-view descriptor mining via codeword net for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 793–797
Chen QQ, Zhang YJ (2016) Cluster trees of improved trajectories for action recognition. Neurocomputing 173:364–372
Article Google Scholar
Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238
Article MathSciNet Google Scholar
Peng X, Wang L, Wang X, Qiao Y (2016) Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput Vis Image Underst 150:109–125
Article Google Scholar
Wang L, Qiao Y, Tang X (2016) MoFAP: a multi-level representation for action recognition. Int J Comput Vis 119(3):254–271
Article MathSciNet Google Scholar
Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114
Article Google Scholar
Wang H, Chang X, Shi L, Yang Y, Shen YD (2018) Uncertainty sampling for action recognition via maximizing expected average precision. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence (IJCAI), pp 964–970
Ni B, Moulin P, Yang X, Yan S (2015) Motion part regularization: Improving action recognition via trajectory selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3698–3706
Liu C, Wu X, Jia Y (2016) A hierarchical video description for complex activity understanding. Int J Comput Vis 118(2):240–255
Article MathSciNet Google Scholar
Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55
Article Google Scholar
Xu K, Jiang X, Sun T (2017) Two-stream dictionary learning architecture for action recognition. IEEE Trans Circuits Syst Video Technol 27(3):567–576
Article Google Scholar
Li WX, Vasconcelos N (2017) Complex activity recognition via attribute dynamics. Int J Comput Vis 122(2):334–370
Article MathSciNet Google Scholar
Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762
Article MathSciNet Google Scholar
Tang K, Fei-Fei L, Koller D (2012) Learning latent temporal structure for complex event detection. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 1250–1257
Li W, Yu Q, Divakaran A, Vasconcelos N (2013) Dynamic pooling for complex event recognition. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 2728–2735
Zheng J, Jiang Z, Chellappa R (2016) Submodular attribute selection for visual recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2242–2255
Article Google Scholar

Download references

Acknowledgements

This work was supported partially by Science and Technology Overall Innovation Project of Shaanxi Province (Grant 2013KTZB03-03-03), Shaanxi Province Key Project of Research and Development Plan (S2018-YF-ZDGY-0187) and International Cooperation Project of Shaanxi Province (S2018-YF-GHMS-0061).

Author information

Authors and Affiliations

School of Electronic Engineering, Xidian University, Xi’an, 710071, China
Ming Tong, He Bai, Xing Yue & Haili Bu

Authors

Ming Tong
View author publications
You can also search for this author in PubMed Google Scholar
He Bai
View author publications
You can also search for this author in PubMed Google Scholar
Xing Yue
View author publications
You can also search for this author in PubMed Google Scholar
Haili Bu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ming Tong.

Ethics declarations

Conflict of interest

All the authors of the manuscript declared that there are no potential conflicts of interest.

Human and animal rights

All the authors of the manuscript declared that there is no research involving human participants and/or animal.

Informed consent

All the authors of the manuscript declared that there is no material that required informed consent.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: Proof of Theorem 1

To prove Theorem 1, it is required to show the non-increasing property of the objective function in Eq. (3) under the update rules in Eqs. (15) and (16). Firstly, the objective function is proved to have non-increasing property under the update rule in Eq. (15). Then, it is demonstrated to have non-increasing property under the update rule in Eq. (16). The proof procedure will utilize the following auxiliary function, which is the same as that employed in the expectation maximization (EM) algorithm.

Definition 1

If the conditions G(x, x^(t)) ≥ F(x) and G(x, x) = F(x) are satisfied, then G(x, x^(t)) is an auxiliary function of F(x).

Lemma 1

If G(x, x^(t)) is an auxiliary function of F(x), then F(x) is non-increasing under the following update rule:

$$ x^{{\left( {t + 1} \right)}} = \mathop {\arg \hbox{min} }\limits_{x} G\left( {x,x^{\left( t \right)} } \right). $$

(51)

Proof

$ F\left( {x^{{\left( {t + 1} \right)}} } \right) \le G\left( {x^{{\left( {t + 1} \right)}} ,x^{\left( t \right)} } \right) \le G\left( {x^{\left( t \right)} ,x^{\left( t \right)} } \right) = F\left( {x^{\left( t \right)} } \right). $

Now, it will be shown in the following that the update rule for W in Eq. (3) is exactly the update rule in Eq. (15) with a proper auxiliary function.

Considering any element w_jk in W, $ F_{{w_{jk} }} $ is used to represent the part of Eq. (3), which is only related to w_jk. It is easy to check that:

$$ F^{\prime}_{{w_{jk} }} = \left( {\frac{{\partial O_{LWNMF\_RC} }}{{\partial \varvec{W}}}} \right)_{jk} = \left( { - 2\varvec{X}^{\text{T}} \varvec{XV} + 2\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + \lambda_{1}\varvec{\psi}} \right)_{jk} $$

(52)

$$ F^{\prime\prime}_{{w_{jk} }} = 2\left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} + \lambda_{1} \left( {\frac{{\partial\varvec{\psi}}}{{\partial \varvec{W}}}} \right)_{jk} . $$

(53)

Since the update rule is essentially element-wise, it is sufficient to prove that each $ F_{{w_{jk} }} $ is non-increasing under the update rule in Eq. (15).

Lemma 2

Function (54) is an auxiliary function of $ F_{{w_{jk} }} $, which is the part of O_{LWNMF_RC} and only related to w_jk.

$$ G\left( {w,w_{jk}^{\left( t \right)} } \right) = F_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right) + F^{\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right) + \frac{{\left[ {\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + \frac{1}{2}\lambda_{1}\varvec{\psi}} \right]_{jk} }}{{w_{jk}^{\left( t \right)} }}\left( {w - w_{jk}^{\left( t \right)} } \right)^{2} . $$

(54)

Proof

Since $G(w,w) = F_{w_{jk}}(w)$ is obvious, it only requires to prove that $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge F_{{w_{jk} }} \left( w \right) $. To do this, a comparison of Taylor series expansion of $ F_{{w_{jk} }} \left( w \right) $ is made with Eq. (54):

$$ F_{{w_{jk} }} \left( w \right) = F_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right) + F^{\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right) + \frac{1}{2}F^{\prime\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right)^{2} $$

(55)

and it can be found that: $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge F_{{w_{jk} }} \left( w \right) $ is equivalent to

$$ \frac{{\left[ {\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + \frac{1}{2}\lambda_{1}\varvec{\psi}} \right]_{jk} }}{{w_{jk}^{\left( t \right)} }} \ge \frac{1}{2}F^{\prime\prime}_{{w_{jk} }} = \left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{{\rm T}} \varvec{V}} \right)_{kk} + \frac{1}{2}\lambda_{1} \left( {\frac{{\partial\varvec{\psi}}}{{\partial \varvec{W}}}} \right)_{jk} . $$

(56)

Meanwhile, the following equations hold:

$$ \begin{aligned} \left( {\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V}} \right)_{jk} = \sum\limits_{l} {\left( {\varvec{X}^{\text{T}} \varvec{XW}} \right)_{jl} } \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{lk} \ge \left( {\varvec{X}^{\text{T}} \varvec{XW}} \right)_{jk} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} \\ \ge \sum\limits_{l} {\left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jl} w_{lk} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} } \ge w_{jk}^{\left( t \right)} \left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} \\ \end{aligned} $$

(57)

$$ \left(\varvec{\psi}\right)_{jk} \ge w_{jk}^{\left( t \right)} \left( {\frac{{\partial\varvec{\psi}}}{{\partial \varvec{W}}}} \right)_{jk} . $$

(58)

Therefore, Eq. (56) holds, from which $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge F_{{w_{jk} }} \left( w \right) $ holds.

Now, the objective function of Theorem 1 can be demonstrated to be non-increasing under the update rule in Eq. (15).

Proof

Substitute $ G\left( {w,w_{jk}^{\left( t \right)} } \right) $ in Eq. (54) into Eq. (51), and the following update rule is obtained:

$$ w_{jk}^{{\left( {t + 1} \right)}} = \mathop {\arg \hbox{min} }\limits_{w} G\left( {w,w_{jk}^{\left( t \right)} } \right) = w_{jk}^{\left( t \right)} \frac{{\left( {2\varvec{X}^{\text{T}} \varvec{XV}} \right)_{jk} }}{{\left( {2\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + \lambda_{1}\varvec{\psi}} \right)_{jk} }}. $$

(59)

Since Eq. (54) is an auxiliary function, $ F_{w_{jk}}$ is non-increasing under this update rule.

Subsequently, the objective function is validated to be non-increasing under the update rule in Eq. (16).

Considering any element v_jk in V, $ F_{{v_{jk} }} $ is used to denote the part of Eq. (3), which is only related to v_jk. It is easy to check that:

$$ F^{\prime}_{{v_{jk} }} = \left( {\frac{{\partial O_{LWNMF\_RC} }}{{\partial \varvec{V}}}} \right)_{jk} = \left( { - 2\varvec{X}^{\text{T}} \varvec{XW} + 2\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + 2\lambda_{1} \varvec{Z}^{\text{T}} . * \varvec{V}. * \varvec{Z}^{\text{T}} + 2\lambda_{2} \varvec{V} - 2\lambda_{2} \varvec{B}} \right)_{jk} $$

(60)

$$ F^{\prime\prime}_{{v_{jk} }} = \left( {2\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{kk} + \left( {2\lambda_{1} \varvec{Z}. * \varvec{Z}} \right)_{kj} + \left( {2\lambda_{2} \varvec{I}{\mathbf{ + }}2\lambda_{2} \left( {\varvec{B}^{ - } - \varvec{B}^{ + } } \right)} \right)_{jk} , $$

(61)

where I is an identity matrix.

Lemma 3

Function (62) is an auxiliary function for $ F_{{v_{jk} }} $, which is the part of O_{LWNMF_RC} and only related to v_jk.

$$ G\left( {v,v_{jk}^{\left( t \right)} } \right) = F_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right) + F^{\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right) + \frac{{\left[ {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + \lambda_{1} \varvec{Z}^{\text{T}} . * \varvec{V}. * \varvec{Z}^{\text{T}} + \lambda_{2} \varvec{V} + \lambda_{2} \varvec{B}^{ - } } \right]_{jk} }}{{v_{jk}^{\left( t \right)} }}\left( {v - v_{jk}^{\left( t \right)} } \right)^{2} . $$

(62)

Proof

Since $ G\left( {v,v} \right) = F_{{v_{jk} }} \left( v \right) $ is obvious, it only requires to show that $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge F_{{v_{jk} }} \left( v \right) $. To do this, a comparison of Taylor series expansion of $ F_{{v_{jk} }} \left( v \right) $ is made with Eq. (62):

$$ F_{{v_{jk} }} \left( v \right) = F_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right) + F^{\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right) + \frac{1}{2}F^{\prime\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right)^{2} $$

(63)

and it can be found that: $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge F_{{v_{jk} }} \left( v \right) $ is equivalent to

$$ \frac{{\left[ {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + \lambda_{1} \varvec{Z}^{\text{T}} . * \varvec{V}. * \varvec{Z}^{\text{T}} + \lambda_{2} \varvec{V} + \lambda_{2} \varvec{B}^{ - } } \right]_{jk} }}{{v_{jk}^{\left( t \right)} }} \ge \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{kk} + \left( {\lambda_{1} \varvec{Z}. * \varvec{Z}} \right)_{kj} + \left( {\lambda_{2} \varvec{I}{\mathbf{ + }}\lambda_{2} \left( {\varvec{B}^{ - } - \varvec{B}^{ + } } \right)} \right)_{jk} $$

(64)

Meanwhile, the following equations hold:

$$ \left( {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{jk} = \sum\limits_{l} {\left( \varvec{V} \right)_{jl} } \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{lk} \ge v_{jk}^{\left( t \right)} \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{kk} $$

(65)

$$ \lambda_{1} \left( {\varvec{Z}^{\text{T}} . * \varvec{V}. * \varvec{Z}^{\text{T}} } \right)_{jk} = \lambda_{1} \left( {\varvec{Z}^{\text{T}} . * \varvec{Z}^{\text{T}} } \right)_{jk} v_{jk}^{\left( t \right)} = \left( {\lambda_{1} \varvec{Z}. * \varvec{Z}} \right)_{kj} v_{jk}^{\left( t \right)} $$

(66)

$$ \left( {\lambda_{2} \varvec{V} + \lambda_{2} \varvec{B}^{ - } } \right)_{jk} \ge \left( {\lambda_{2} \varvec{I}{\mathbf{ + }}\lambda_{2} \varvec{B}^{ - } - \lambda_{2} \varvec{B}^{ + } } \right)_{jk} v_{jk}^{\left( t \right)} . $$

(67)

Thus, Eq. (64) holds and $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge F_{{v_{jk} }} \left( v \right) $.

Now, it can also be demonstrated that the objective function of Theorem 1 is non-increasing under the update rule in Eq. (16).

Proof

Substitute $ G\left( {v,v_{jk}^{\left( t \right)} } \right) $ in Eq. (62) into Eq. (51), and the following update rule can be obtained:

$$ v_{jk}^{{\left( {t + 1} \right)}} = \mathop {\arg \hbox{min} }\limits_{v} G\left( {v,v_{jk}^{\left( t \right)} } \right) = v_{jk}^{\left( t \right)} \frac{{\left( {\varvec{X}^{\text{T}} \varvec{XW} + \lambda_{2} \varvec{B}^{ + } } \right)_{jk} }}{{\left( {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + \lambda_{1} \varvec{Z}^{\text{T}} . * \varvec{V}. * \varvec{Z}^{\text{T}} + \lambda_{2} \varvec{V} + \lambda_{2} \varvec{B}^{ - } } \right)_{jk} }}. $$

(68)

Since Eq. (62) is an auxiliary function, and $ F_{{v_{jk} }} $ is non-increasing under this update rule. Therefore, Theorem 1 holds.

Appendix 2: Proof of Theorem 2

To prove Theorem 2, it is required to show the non-increasing property of the objective function in Eq. (19) under the update rules in Eqs. (31) and (40). Firstly, the objective function is proved to have non-increasing property under the update rule in Eq. (31). Then, it is also demonstrated to have non-increasing property under the update rule in Eq. (40). The proof will utilize the following auxiliary function, which is the same as that used in the EM algorithm.

According to Definition 1 and Lemma 1 in Appendix 1, it can also be demonstrated that the objective function of Theorem 2 has non-increasing property under the update rule in Eq. (31).

Considering any element w_jk in W, $ \tilde{J}_{{w_{jk} }} $ is utilized to denote the part of Eq. (19), which is only related to w_jk. It is easy to check that:

$$ \tilde{J}^{\prime}_{{w_{jk} }} = \left( {\frac{{\partial O_{DMNMF\_SC} }}{{\partial \varvec{W}}}} \right)_{jk} = \left( { - 2\varvec{X}^{\text{T}} \varvec{XV} + 2\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + 2\mu_{2} \varvec{X}^{\text{T}} \varvec{L}^{U} \varvec{XW}} \right)_{jk} $$

(69)

$$ \tilde{J}^{\prime\prime}_{{w_{jk} }} = 2\left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} + 2\mu_{2} \left( {{\mathbf{X}}^{\text{T}} {\mathbf{L}}^{U} {\mathbf{X}}} \right)_{jj} . $$

(70)

Lemma 4

Function (71) is an auxiliary function for $ \tilde{J}_{{w_{jk} }} $, which is the part of O_{DWNMF_SC}, and only related to w_jk.

$$ G\left( {w,w_{jk}^{\left( t \right)} } \right) = \tilde{J}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right) + \tilde{J}^{\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right) + \frac{{\left[ {{\mathbf{X}}^{\text{T}} {\mathbf{XWV}}^{\text{T}} {\mathbf{V}} + \mu_{2} {\mathbf{X}}^{\text{T}} {\mathbf{D}}^{U} {\mathbf{XW}}} \right]_{jk} }}{{w_{jk}^{\left( t \right)} }}\left( {w - w_{jk}^{\left( t \right)} } \right)^{2} . $$

(71)

Proof

Since $ G\left( {w,w} \right) = \tilde{J}_{{w_{jk} }} \left( w \right) $ is obvious, it only requires to show that $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{w_{jk} }} \left( w \right) $. To do this, a comparison of Taylor series expansion of $ \tilde{J}_{{w_{jk} }} \left( w \right) $ is made with Eq. (71):

$$ \tilde{J}_{{w_{jk} }} \left( w \right) = \tilde{J}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right) + \tilde{J}^{\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right) + \frac{1}{2}\tilde{J}^{\prime\prime}_{{w_{jk} }} \left( {w_{jk}^{\left( t \right)} } \right)\left( {w - w_{jk}^{\left( t \right)} } \right)^{2} . $$

(72)

And it can be found that: $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{w_{jk} }} \left( w \right) $ is equivalent to

$$ \frac{{\left[ {\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V} + \mu_{2} \varvec{X}^{\text{T}} \varvec{D}^{U} \varvec{XW}} \right]_{jk} }}{{w_{jk}^{\left( t \right)} }} \ge \frac{1}{2}\tilde{J}^{\prime\prime}_{{w_{jk} }} = \left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} + \mu_{2} \left( {\varvec{X}^{\text{T}} \varvec{L}^{U} \varvec{X}} \right)_{jj} . $$

(73)

Meanwhile, the following inequalities hold:

$$ \begin{aligned} \left( {\varvec{X}^{\text{T}} \varvec{XWV}^{\text{T}} \varvec{V}} \right)_{jk} & = \sum\limits_{l} {\left( {\varvec{X}^{\text{T}} \varvec{XW}} \right)_{jl} } \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{lk} \ge \left( {\varvec{X}^{\text{T}} \varvec{XW}} \right)_{jk} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} \\ & \ge \sum\limits_{l} {\left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jl} w_{lk} } \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} \ge w_{jk}^{\left( t \right)} \left( {\varvec{X}^{\text{T}} \varvec{X}} \right)_{jj} \left( {\varvec{V}^{\text{T}} \varvec{V}} \right)_{kk} \\ \end{aligned} $$

(74)

$$ \begin{aligned} \mu_{2} \left( {\varvec{X}^{{^{\text{T}} }} \varvec{D}^{U} \varvec{XW}} \right)_{jk} & = \mu_{2} \sum\limits_{l} {\left( {\varvec{X}^{\text{T}} \varvec{D}^{U} \varvec{X}} \right)_{jl} \left( \varvec{W} \right)_{lk} } \ge \mu_{2} \left( {\varvec{X}^{\text{T}} \varvec{D}^{U} \varvec{X}} \right)_{jj} w_{jk}^{\left( t \right)} \\ & \ge \mu_{2} \left[ {\varvec{X}^{\text{T}} \left( {\varvec{D}^{U} - \varvec{A}^{U} } \right)\varvec{X}} \right]_{jj} w_{jk}^{\left( t \right)} = \mu_{2} \left( {\varvec{X}^{\text{T}} \varvec{L}^{U} \varvec{X}} \right)_{jj} w_{jk}^{\left( t \right)} . \\ \end{aligned} $$

(75)

Thus, Eq. (73) holds and $ G\left( {w,w_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{w_{jk} }} \left( w \right) $.

Now, it can be demonstrated that the objective function of Theorem 2 is non-increasing under the update rule of Eq. (31).

Proof

Replace $ G\left( {w,w_{jk}^{\left( t \right)} } \right) $ in Eq. (51) by Eq. (71), and the following update rule is obtained:

$$ w_{jk}^{{\left( {t + 1} \right)}} = \mathop {\arg \hbox{min} }\limits_{w} G\left( {w,w_{jk}^{\left( t \right)} } \right) = w_{jk}^{\left( t \right)} \frac{{\left( {\varvec{X}^{\text{T}} \varvec{XV} + \mu_{2} \varvec{X}^{\text{T}} \varvec{A}^{U} \varvec{XW}} \right)_{jk} }}{{\left( {\varvec{X}^{\text{T}} \varvec{XWV}^{{\rm T}} \varvec{V} + \mu_{2} \varvec{X}^{\text{T}} \varvec{D}^{U} \varvec{XW}} \right)_{jk} }}. $$

(76)

Since Eq. (71) is an auxiliary function, $ \tilde{J}_{{w_{jk} }} $ is non-increasing under this update rule.

Subsequently, the objective function is validated to be non-increasing under the update rule in Eq. (40).

Considering any element v_jk in V, $ \tilde{J}_{{v_{jk} }} $ is used to denote the part of Eq. (19), which is only related to v_jk. It is easy to check that:

$$ \tilde{J}^{\prime}_{{v_{jk} }} = \left( {\frac{{\partial O_{DMNMF\_SC} }}{{\partial \varvec{V}}}} \right)_{jk} = \left( { - 2{\mathbf{X}}^{\text{T}} {\mathbf{XW}} + 2{\mathbf{VW}}^{\text{T}} {\mathbf{X}}^{\text{T}} {\mathbf{XW}} + 2\mu_{1} {\mathbf{L}}^{V} {\mathbf{V}} + \mu_{3} {\mathbf{MV}}} \right)_{jk} $$

(77)

$$ \tilde{J}^{\prime\prime}_{{v_{jk} }} = \left( {2{\mathbf{W}}^{\text{T}} {\mathbf{X}}^{\text{T}} {\mathbf{XW}}} \right)_{kk} + \left( {2\mu_{1} {\mathbf{L}}^{V} } \right){}_{jj} + \mu_{3} \left( {M_{jj} - v_{jk}^{2} \left( {M_{jj} } \right)^{3} } \right). $$

(78)

Lemma 5

Function (79) is an auxiliary function for $ \tilde{J}_{{v_{jk} }} $, which is the part of O_{DWNMF_SC} and only related to v_jk.

$$ G\left( {v,v_{jk}^{\left( t \right)} } \right) = \tilde{J}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right) + \tilde{J}^{\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right) + \frac{{\left[ {{\mathbf{VW}}^{\text{T}} {\mathbf{X}}^{\text{T}} {\mathbf{XW}} + \mu_{1} {\mathbf{D}}^{V} {\mathbf{V}} + \frac{{\mu_{3} }}{2}{\mathbf{MV}}} \right]_{jk} }}{{v_{jk}^{\left( t \right)} }}\left( {v - v_{jk}^{\left( t \right)} } \right)^{2} . $$

(79)

Proof

Since $ G\left( {v,v} \right) = \tilde{J}_{{v_{jk} }} \left( v \right) $ is obvious, it only requires to show that $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{v_{jk} }} \left( v \right) $. To do this, a comparison of Taylor series expansion of $ \tilde{J}_{{v_{jk} }} \left( v \right) $ is made with Eq. (79):

$$ \tilde{J}_{{v_{jk} }} \left( v \right) = \tilde{J}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right) + \tilde{J}^{\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right) + \frac{1}{2}\tilde{J}^{\prime\prime}_{{v_{jk} }} \left( {v_{jk}^{\left( t \right)} } \right)\left( {v - v_{jk}^{\left( t \right)} } \right)^{2} $$

(80)

and it can be found that: $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{v_{jk} }} \left( v \right) $ is equivalent to

$$ \frac{{\left[ {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + \mu_{1} \varvec{D}^{V} \varvec{V} + \frac{1}{2}\mu_{3} \varvec{MV}} \right]_{jk} }}{{v_{jk}^{\left( t \right)} }} \ge \frac{1}{2}\tilde{J}^{\prime\prime}_{{v_{jk} }} = \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{kk} + \mu_{1} \left( {\varvec{L}^{V} } \right){}_{jj} + \frac{{\mu_{3} }}{2}\left( {M_{jj} - v_{jk}^{2} \left( {M_{jj} } \right)^{3} } \right). $$

(81)

Meanwhile, the following inequalities hold:

$$ \left( {\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{jk} = \sum\limits_{l} {v_{jl} } \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{lk} \ge v_{jk}^{\left( t \right)} \left( {\varvec{W}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW}} \right)_{kk} $$

(82)

$$ \mu_{1} \left( {\varvec{D}^{V} \varvec{V}} \right)_{jk} = \mu_{1} \sum\limits_{l} {\left( {\varvec{L}^{V} + \varvec{A}^{V} } \right)_{jl} } v_{lk} \ge \mu_{1} \left( {\varvec{L}^{V} } \right)_{jj} v_{jk}^{\left( t \right)} $$

(83)

$$ \frac{{\mu_{3} }}{2}\left( {\varvec{MV}} \right)_{jk} = \frac{{\mu_{3} }}{2}M_{jj} v_{jk} \ge \frac{{\mu_{3} }}{2}\left( {M_{jj} - v_{jk}^{2} \left( {M_{jj} } \right)^{3} } \right)v_{jk}^{\left( t \right)} . $$

(84)

Thus, Eq. (81) holds and $ G\left( {v,v_{jk}^{\left( t \right)} } \right) \ge \tilde{J}_{{v_{jk} }} \left( v \right) $.

Now, it can also be demonstrated that the objective function of Theorem 2 is non-increasing under the update rule in Eq. (40).

Proof

Replace $ G\left( {v,v_{jk}^{\left( t \right)} } \right) $ in Eq. (51) by Eq. (79) and the following update rule can be obtained:

$$ v_{jk}^{{\left( {t + 1} \right)}} = \mathop {\arg \hbox{min} }\limits_{v} G\left( {v,v_{jk}^{\left( t \right)} } \right) = v_{jk}^{\left( t \right)} \frac{{\left( {2\varvec{X}^{\text{T}} \varvec{XW} + 2\mu_{1} \varvec{A}^{V} \varvec{V}} \right)_{jk} }}{{\left( {2\varvec{VW}^{\text{T}} \varvec{X}^{\text{T}} \varvec{XW} + 2\mu_{1} \varvec{D}^{V} \varvec{V} + \mu_{3} \varvec{MV}} \right)_{jk} }}. $$

(85)

Since Eq. (79) is an auxiliary function, $ \tilde{J}_{{v_{jk} }} $ is non-increasing under the update rule in Eq. (85). Thus, Theorem 2 holds.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tong, M., Bai, H., Yue, X. et al. PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint. Neural Comput & Applic 32, 13759–13781 (2020). https://doi.org/10.1007/s00521-020-04783-0

Download citation

Received: 20 April 2019
Accepted: 07 February 2020
Published: 20 February 2020
Issue Date: September 2020
DOI: https://doi.org/10.1007/s00521-020-04783-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint

Abstract

Access this article

Similar content being viewed by others

NMF with local constraint and Deep NMF with temporal dependencies constraint for action recognition

Semantic Image Networks for Human Action Recognition

Probability Matrix SVM+ Learning for Complex Action Recognition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Human and animal rights

Informed consent

Additional information

Publisher's Note

Appendices

Appendix 1: Proof of Theorem 1

Definition 1

Lemma 1

Proof

Lemma 2

Proof

Proof

Lemma 3

Proof

Proof

Appendix 2: Proof of Theorem 2

Lemma 4

Proof

Proof

Lemma 5

Proof

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation