Skip to main content

Advertisement

Log in

Video image target monitoring based on RNN-LSTM

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Traditional image object classification and detection algorithms and strategies cannot meet the problem of video image acquisition and processing. Deep learning deliberately simulates the hierarchical structure of human brain, and establishes the mapping from low-level signals to high-level semantics, so as to achieve hierarchical feature representation of data. Deep learning technology has powerful visual information processing ability, which has become the forefront technology and domestic and international research hotspots to deal with this challenge. In order to solve the problem of target space location in video surveillance system, time-consuming and other problems, in this paper, we propose the algorithm based on RNN-LSTM deep learning. At the same time, according to the principle of OpenGL perspective imaging and photogrammetry consistency, we use 3D scene simulation imaging technology, relying on the corresponding relationship between video images and simulation images we locate the target object. In the 3D virtual scene, we set up the virtual camera to simulate the imaging processing of the actual camera, and the pixel coordinates in the video image of the surveillance target are substituted into the simulation image, next, the spatial coordinates of the target are inverted by the inverse process of the virtual imaging. The experimental results show that the detection of target objects has high accuracy, which has an important reference value for outdoor target localization through video surveillance images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Al-Ayyoub M, Jararweh Y, Daraghmeh M, Althebyan Q (2015) Multi-agent based dynamic resource provisioning and monitoring for cloud computing systems infrastructure. Clust Comput 18(2):919–932

    Article  Google Scholar 

  2. Arvind KS, Manimegalai R (2017) Secure data classification using superior naive classifier in agent based mobile cloud computing. Clust Comput 20(2):1535–1542

    Article  Google Scholar 

  3. Boru D, Kliazovich D, Granelli F, Bouvry P, Zomaya AY (2015) Energy-efficient data replication in cloud computing datacenters. Clust Comput 18(1):385–402

    Article  Google Scholar 

  4. Cheng Z, Xing Y, Cheng S, Guo P (2018) Power prediction method of lithium-ion battery for unmanned vehicles based on combined constraint intelligence algorithm. Clust Comput. https://doi.org/10.1007/s10586-018-1900-1

  5. Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, Venugopalan S, Widner K, Madams T, Cuadros J, Kim R (2016) Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316(22):2402–2410

    Article  Google Scholar 

  6. Harris J, Hofer M (2016) Planning for deep learning using TPACK-based learning activity types. In Society for Information Technology & teacher education international conference (pp. 2864-2871). Association for the Advancement of Computing in Education (AACE)

  7. Hassan MF, Paramesran R, Tanaka Y et al (2018) A method using uniform yellowing pigmentation to model the color perception of the elderly people. SIViP. https://doi.org/10.1007/s11760-018-1247-3

  8. Higgins I, Matthey L, Glorot X, Pal A, Uria B, Blundell C, Mohamed S, Lerchner A (2016) Early visual concept learning with unsupervised deep learning. arXiv preprint arXiv:1606.05579

  9. Huang W, Wang H, Zhang Y, Zhang S (2017) A novel cluster computing technique based on signal clustering and analytic hierarchy model using hadoop. Clust Comput. https://doi.org/10.1007/s10586-017-1205-9

  10. Islam S, Qasim T, Yasir M, Bhatti N, Mahmood H, Zia M (2018) Single- and two-person action recognition based on silhouette shape and optical point descriptors. SIViP. https://doi.org/10.1007/s11760-017-1228-y

  11. Karami H, Ehteram M, Mousavi SF et al (2018) Optimization of energy management and conversion in the water systems based on evolutionary algorithms. Neural Comput & Applic. https://doi.org/10.1007/s00521-018-3412-6

  12. Kim Y, Kim J, Yu H (2012) Geo search:georeferenced video retrieval system. Proc of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Diego: ACM, 1:540–1543

  13. Li P, Li J, Huang Z, Gao CZ, Chen WB, Chen K (2017) Privacy-preserving outsourced classification in cloud computing. Clust Comput 21:277–286. https://doi.org/10.1007/s10586-017-0849-9

    Article  Google Scholar 

  14. Liu H, Yin J, Luo X, Zhang S (2018) Foreword to the special issue on recent advances on pattern recognition and artificial intelligence. Neural Comput & Applic 29:1–2. https://doi.org/10.1007/s00521-017-3243-x

    Article  Google Scholar 

  15. Ma X, Liu Z (2018) Predicting the oil production using the novel multivariate nonlinear model based on Arps decline model and kernel method. Neural Comput & Applic 29:579–591. https://doi.org/10.1007/s00521-016-2721-x

    Article  Google Scholar 

  16. Ma W, Zheng D, Zhang Z, Duan J, Chen B (2018) Robust proportionate adaptive filter based on maximum correntropy criterion for sparse system identification in impulsive noise environments. SIViP 12:117–124. https://doi.org/10.1007/s11760-017-1137-0

    Article  Google Scholar 

  17. Mamoshina P, Vieira A, Putin E, Zhavoronkov A (2016) Applications of deep learning in biomedicine. Mol Pharm 13(5):1445–1454

    Article  Google Scholar 

  18. Mliki H, Zaafouri R, Hammami M (2018) Human action recognition based on discriminant body regions selection. SIViP. https://doi.org/10.1007/s11760-017-1227-z

  19. Rho MJ, Kim HS, Chung K, Choi IY (2015) Factors influencing the acceptance of telemedicine for diabetes management. Clust Comput 18(1):321–331

    Article  Google Scholar 

  20. Tang LY, Chen CC, Zou T et al (2011) OntoPlant: an integrated virtual plant software package for different scale application. Proceeding of 2011 I.E. International Conference on Spatial Data Mining and Geographical Knowledge Services (ICS-DM2011) in Conjunction with Eighth Beijing International Workshop on Geographical Information Science( BJ - IWGIS 2011)Fuzhou:[s.n.], 308–314

  21. Varatharajan R, Manogaran G, Priyan MK (2017) et al, Wearable sensor devices for early detection of Alzheimer disease using dynamic time warping algorithm. Clust Comput. https://doi.org/10.1007/s10586-017-0977-2

  22. Yang XS, Deb S (2014) Cuckoo search: recent advances and applications. Neural Comput & Applic 24(1):169–174

    Article  Google Scholar 

  23. Yang Z, Yu W, Liang P, Guo H, Xia L, Zhang F, Ma Y, Ma J (2018) Deep transfer learning for military object recognition under small training set condition. Neural Comput & Applic. https://doi.org/10.1007/s00521-018-3468-3

  24. Yücelbaş C, Yücelbaş Ş, Özşen S, Tezel G, Küççüktürk S, Yosunkaya Ş (2018) Automatic detection of sleep spindles with the use of STFT, EMD and DWT methods. Neural Comput & Applic 29:17–33. https://doi.org/10.1007/s00521-016-2445-y

    Article  Google Scholar 

  25. Zhang W, Wei D (2018) Prediction for network traffic of radial basis function neural network model based on improved particle swarm optimization algorithm. Neural Comput & Applic 29:1143–1152. https://doi.org/10.1007/s00521-016-2483-5

    Article  Google Scholar 

  26. Zhang S, Wang H, Huang W, Zhang C (2018) Combining Modified LBP and Weighted SRC for Palmprint Recognition. SIViP. https://doi.org/10.1007/s11760-018-1246-4

  27. Zhang S, Wang H, Huang W, You Z (2018) Plant diseased leaf segmentation and recognition by fusion of superpixel, K-means and PHOG. Optik-International Journal for Light and Electron Optics 157:866–872

    Article  Google Scholar 

  28. Zhu Y, Mottaghi R, Kolve E, Lim JJ, Gupta A, Fei-Fei L, Farhadi A (2016) Target-driven visual navigation in indoor scenes using deep reinforcement learning. arXiv preprint arXiv:1609.05143

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhigang Chen.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, F., Chen, Z. & Wang, J. Video image target monitoring based on RNN-LSTM. Multimed Tools Appl 78, 4527–4544 (2019). https://doi.org/10.1007/s11042-018-6058-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-6058-6

Keywords

Navigation