Behavior Recognition Based on Two-Stream Temporal Relation-Time Pyramid Pooling Network (TTR-TPPN)

Huang, Mengxing; Li, Zhenfeng; Zhang, Yu; Li, Yuchun; Li, Xinze; Feng, Siling

doi:10.1007/978-3-030-87571-8_36

Mengxing Huang¹³,
Zhenfeng Li¹³,
Yu Zhang¹⁴,
Yuchun Li¹³,
Xinze Li¹³ &
…
Siling Feng¹³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12999))

Included in the following conference series:

International Conference on Web Information Systems and Applications

2446 Accesses

Abstract

Nowadays, intelligent surveillance has received extensive attention from academia, business, and industry. Deep learning algorithms are widely used in the field of intelligent surveillance. Recently, most deep learning models are limited to a short-term behavior recognition in the entire video. In order to better identify human behavior in the video, we combined a Two-stream network and a Temporal Relation network (TRN) and added a time pyramid pooling operation. In this way, the Two-Stream Temporal Relation-Time Pyramid Pooling Network (TTR-TPPN) can be constructed. The relational pyramid pool network integrated the frame-level features in the video into video-level features. We applied the TTR-TPPN to the Internet public standard data set UCF101 and the self-made DW20 data set. It is found through experiments that this network has a higher recognition rate than other behavior recognition methods on both data sets, and it has better performance in long-term behavior recognition. Therefore, the TTR-TPPN enables it to recognize long-time sequence behavior and improves the accuracy of human behavior recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Kupryanov, K., Gorodnichev, M.G.: Recognition of human behavior. In: 2021 Systems of Signals Generating and Processing in the Field of on Board Communications (2021)
Google Scholar
Guan, S., Zhang, Y., Tian, Z.: Research on human behavior recognition based on deep neural network. In: Proceedings of the 3rd International Conference on Mechatronics Engineering and Information Technology (ICMEIT 2019) (2019)
Google Scholar
Yu, Y.: Deep learning for image recognition. J. Jpn. Soc. Artif. Intell. 28, 962–974 (2018)
Google Scholar
Shao, Z., Cai, J., Wang, Z.: Smart monitoring cameras driven intelligent processing to big surveillance video data. IEEE Trans. Big Data 4(1), 105–116 (2018)
Article Google Scholar
Zhang, X., Luo, L., Zhao, W., Guo, Z., Yue, J.: On combining multiscale deep learning features for the classification of hyperspectral remote sensing imagery. Int. J. Remote Sens. 36(13–14), 3368–3379 (2015)
Google Scholar
Ullah, M.M., Parizi, S.N., Laptev, I.: Improving bag-of-features action recognition with non-local cues. In: Proceedings - British Machine Vision Conference, BMVC 2010, Aberystwyth, UK, 31 August–3 September 2010 (2010)
Google Scholar
Liu, L., Jiao, Y., Meng, F.: Key algorithm for human motion recognition in virtual reality video sequences based on hidden Markov model. IEEE Access 8, 159705–159717 (2020)
Article Google Scholar
Cai, W., Xia, S., Sun, R., Chen, H., Chen, W.: A micro-motion feature extraction method based on CORR-OMP. In: 2021 IEEE 4th International Conference on Electronics Technology (ICET) (2021)
Google Scholar
Koohzadi, M., Charkari, N.M.: Survey on deep learning methods in human action recognition. IET Comput. Vis. 11(8), 623–632 (2017)
Article Google Scholar
Huang, S., Huang, M., Zhang, Yu., Li, M.: Under water object detection based on convolution neural network. In: Ni, W., Wang, X., Song, W., Li, Y. (eds.) WISA 2019. LNCS, vol. 11817, pp. 47–58. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30952-7_6
Chapter Google Scholar
Chen, L., Liu, R., Zhou, D., Yang, X., Zhang, Q.: Fused behavior recognition model based on attention mechanism. Visual Comput. Ind. Biomed. Art 3(1), 1–10 (2020). https://doi.org/10.1186/s42492-020-00045-x
Article Google Scholar
Feichtenhofer, C., Pinz, A., Zisserman, A.: Convolutional two-stream network fusion for video action recognition. In: Computer Vision & Pattern Recognition (2016)
Google Scholar
Lan, Z., Yi, Z., Hauptmann, A.G.: Deep local video feature for action recognition. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2017)
Google Scholar
Shi, Y., Tian, Y., Wang, Y., Huang, T.: Sequential deep trajectory descriptor for action recognition with three-stream CNN. IEEE Trans. Multimedia 19(7), 1510–1520 (2017)
Article Google Scholar

Download references

Acknowledgements

Supported by the National Key Research and Development Program of China (Grant #: 2018YFB1404400), National Natural Science Foundation of China(Grant #: 62062030, Major Science and Technology Project of Haikou (Grant #: 2020-009), Project supported by the Education Department of Hainan Province (Grant #: Hnky2019-22).

Author information

Authors and Affiliations

State Key Laboratory of Marine Resource Utilization in South China Sea College of Information Science and Technology, Hainan University, Haikou, China
Mengxing Huang, Zhenfeng Li, Yuchun Li, Xinze Li & Siling Feng
School of Computer Science and Technology, Hainan University, Haikou, 570288, China
Yu Zhang

Authors

Mengxing Huang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenfeng Li
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yuchun Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinze Li
View author publications
You can also search for this author in PubMed Google Scholar
Siling Feng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mengxing Huang .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Chunxiao Xing
Institute of Computer Science, University of Göttingen, Goettingen, Germany
Xiaoming Fu
Tsinghua University, Beijing, China
Yong Zhang
Chinese Academy of Sciences, Beijing, China
Guigang Zhang
Renmin University of China, Beijing, China
Chaolemen Borjigin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huang, M., Li, Z., Zhang, Y., Li, Y., Li, X., Feng, S. (2021). Behavior Recognition Based on Two-Stream Temporal Relation-Time Pyramid Pooling Network (TTR-TPPN). In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_36

Download citation

DOI: https://doi.org/10.1007/978-3-030-87571-8_36
Published: 17 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87570-1
Online ISBN: 978-3-030-87571-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)

Behavior Recognition Based on Two-Stream Temporal Relation-Time Pyramid Pooling Network (TTR-TPPN)