Abstract
The aim is to better recognize human behaviors from surveillance videos. Human behavior recognition technology based on surveillance videos is researched, given the intellectual development of massive surveillance video data with full coverage. This technology builds a human behavior detection and recognition model using the new Single Shot MultiBox Detector (SSD) algorithm, which improves the recognition accuracy. The constructed model’s effectiveness is verified through comparisons with other traditional human behavior recognition algorithms via the TensorFlow framework. Results demonstrate the SSD model-based recognition algorithm’s accuracy is significantly higher than that of Direct Part Marking and Fast Convolutional Neural Network (CNN) algorithms. SSD’s average speed is 0.146 s/frame, and the average accuracy on different datasets is 82.8%. If the target is close or partially occluded, the SSD algorithm can also accurately detect the central target, and the detection efficiency is twice that of the R-CNN algorithm. The algorithm proposed has a simple structure and fast processing speed, which can solve the problems in target detection. The research results can provide a theoretical basis for the research on target detection related to human behavior recognition.
Similar content being viewed by others
References
Toelstede B (2019) Democracy interrupted: the anti-social side of intensified policing. Democr Secur 15:137–149
Vietz GJ, Walsh CJ, Fletcher TD (2016) Urban hydrogeomorphology and the urban stream syndrome: treating the symptoms and causes of geomorphic change. Prog Phys Geogr 40:480–492
Shi X, Yang C, Xie W, Liang C, Shi Z, Chen J (2018) Anti-drone system with multiple surveillance technologies: architecture, implementation, and challenges. IEEE Commun Mag 56:68–74
Kunz M, Seuss D, Hassan T, Garbas JU, Siebers M, Schmid U, Schöberl M, Lautenbacher S (2017) Problems of video-based pain detection in patients with dementia: a road map to an interdisciplinary solution. BMC Geriatr 17:1–8
Prasad DK, Prasath CK, Rajan D, Rachmawati L, Rajabaly E, Quek C (2016) Challenges in video based object detection in maritime scenario using computer vision, pp 435–440. arXiv preprint arXiv:1608.01079
Long T, Liang Z, Liu Q (2019) Advanced technology of high-resolution radar: target detection, tracking, imaging, and recognition. Sci China Inf Sci 62:40301–40309
Razakarivony S, Jurie F (2016) Vehicle detection in aerial imagery: a small target detection benchmark. J Vis Commun Image Represent 34:187–203
Dong L, Wang B, Zhao M, Xu W (2017) Robust infrared maritime target detection based on visual attention and spatiotemporal filtering. IEEE Trans Geosci Remote Sens 55:3037–3050
Tsakanikas V, Dagiuklas T (2018) Video surveillance systems-current status and future trends. Comput Electr Eng 70:736–753
Ren Y, Yang J, Zhang Q, Guo Z (2019) Multi-feature fusion with convolutional neural network for ship classification in optical images. Appl Sci 9:4209–4213
Zhang X, Yu Q, Yu H (2018) Physics inspired methods for crowd video surveillance and analysis: a survey. IEEE Access 6:66816–66830
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC. SSD: single shot multibox detector. In: Proceedings of European Conference on Computer Vision; pp 21–37
Wang C, Wang H, Hu B, Wen J, Xu J, Li X (2016) A new spectral-spatial algorithm method for hyperspectral image target detection. Guang pu xue yu guang pu fen xi = Guang pu 36:1163–1169
Zhao Z, Li X, Liu H, Xu C (2020) Improved target detection algorithm based on Libra R-CNN. IEEE Access 8:114044–114056
Fakiris E, Papatheodorou G, Geraga M, Ferentinos G (2016) An automatic target detection algorithm for swath sonar backscatter imagery, using image texture and independent component analysis. Remote Sens 8:373–382
Tannouche A, Sbai K, Rahmoune M, Agounoune R, Rahmani A, Rahmani A (2016) Real time weed detection using a boosted cascade of simple features. Int J Electr Comput Eng (2088–8708) 6:6–14
AbdelRaouf A, Higgins CA, Pridmore T, Khalil MI (2016) Arabic character recognition using a Haar cascade classifier approach (HCC). Pattern Anal Appl 19:411–426
Wei Y, Tian Q, Guo J, Huang W, Cao J (2019) Multi-vehicle detection algorithm through combining Harr and HOG features. Math Comput Simul 155:130–145
Guimarães S, Kenmochi Y, Cousty J, Patrocinio Z, Najman L (2017) Hierarchizing graph-based image segmentation algorithms relying on region dissimilarity: the case of the Felzenszwalb–Huttenlocher method. Math Morphol Theory Appl 2:55–75
Taylor LH, Wallace RM, Balaram D, Lindenmayer JM, Eckery DC, Mutonono-Watkiss B, Parravani E, Nel LH (2017) The role of dog population management in rabies elimination—a review of current approaches and future opportunities. Front Vet Sci 4:109
Khan RU, Zhang X, Kumar R (2019) Analysis of ResNet and GoogleNet models for malware detection. J Comput Virol Hacking Tech 15:29–37
Zou Z, Shi Z (2017) Random access memories: a new paradigm for target detection in high resolution aerial remote sensing images. IEEE Trans Image Process 27:1100–1111
Zhang M, Pang K, Gao C, Xin M (2020) Multi-scale aerial target detection based on densely connected inception ResNet. IEEE Access 8:84867–84878
Li S, Dou Y, Niu X, Lv Q, Wang Q (2017) A fast and memory saved GPU acceleration algorithm of convolutional neural networks for target detection. Neurocomputing 230:48–59
Nida N, Irtaza A, Javed A, Yousaf MH, Mahmood MT (2019) Melanoma lesion detection and segmentation using deep region based convolutional neural network and fuzzy C-means clustering. Int J Med Inform 124:37–48
Liu B, Luo J, Huang H (2020) Toward automatic quantification of knee osteoarthritis severity using improved faster R-CNN. Int J Comput Assist Radiol Surg 15:457–466
Li L, Yang Z, Jiao L, Liu F, Liu X (2019) High-resolution SAR change detection based on ROI and SPP net. IEEE Access 7:177009–177022
Gong G-C, Tsai A-Y (2019) Reduced daytime net growth rate of Synechococcus spp. in the East China sea in summer estimated using a dilution approach. Estuar Coast Shelf Sci 219:90–96
Dong R, Xu D, Zhao J, Jiao L, An J (2019) Sig-NMS-based faster R-CNN combining transfer learning for small target detection in VHR optical remote sensing imagery. IEEE Trans Geosci Remote Sens 57:8534–8545
Lei X, Sui Z (2019) Intelligent fault detection of high voltage line based on the faster R-CNN. Measurement 138:379–385
Qi L, Li B, Chen L, Wang W, Dong L, Jia X, Huang J, Ge C, Xue G, Wang D (2019) Ship target detection algorithm based on improved faster R-CNN. Electronics 8:959–973
Wan S, Goudos S (2020) Faster R-CNN for multi-class fruit detection using a robotic vision system. Comput Netw 168:107036
Han J, Liao Y, Zhang J, Wang S, Li S (2018) Target fusion detection of LiDAR and camera based on the improved YOLO algorithm. Mathematics 6:213–225
Wu Z, Chen X, Gao Y, Li Y (2018) Rapid target detection in high resolution remote sensing images using Yolo model. ISPAR 42:1915–1920
Wang Z, Du L, Mao J, Liu B, Yang D (2018) SAR target detection based on SSD with data augmentation and transfer learning. IEEE Geosci Remote Sens Lett 16:150–154
Chen H, Zhang L, Ma J, Zhang J (2019) Target heat-map network: an end-to-end deep network for target detection in remote sensing images. Neurocomputing 331:375–387
Falisse A, Van Rossom S, Gijsbers J, Steenbrink F, van Basten BJ, Jonkers I, van den Bogert AJ, De Groote F (2018) OpenSim versus human body model: a comparison study for the lower limbs during gait. J Appl Biomech 34:496–502
Jang S, Vitale JM, Jyung RW, Black JB (2017) Direct manipulation is better than passive viewing for learning anatomy in a three-dimensional virtual reality environment. Comput Educ 106:150–165
Fang Y, Eglen RM (2017) Three-dimensional cell cultures in drug discovery and development. SLAS Discov Adv Life Sci R&D 22:456–472
Chen Z, Jiang J, Zhou C, Fu S, Cai Z (2019) SuperBF: superpixel-based bilateral filtering algorithm and its application in feature extraction of hyperspectral images. IEEE Access 7:147796–147807
Zhao B, Gao L, Liao W, Zhang B (2017) A new kernel method for hyperspectral image feature extraction. Geo-Spat Inf Sci 20:309–318
Liu B, Yu X, Zhang P, Yu A, Fu Q, Wei X (2017) Supervised deep feature extraction for hyperspectral image classification. IEEE Trans Geosci Remote Sens 56:1909–1921
Chen H, Li H, Xu Z, Zhao Y, He T (2019) Real‐time action feature extraction via fast PCA‐Flow. Concurrency Comput Pract Experience e5507:5507–5513
Anjum A, Das M, Murthy J, Gudennavar S, Gopal R, Bubbly S (2018) Template-based classification of SDSS-GALEX point sources. J Astrophys Astron 39:61–69
Shen H, Xu M, Guez A, Li A, Ran F (2019) An accurate sleep stages classification method based on state space model. IEEE Access 7:125268–125279
Chen Y, Luo Y, Huang W, Hu D, Zheng R-Q, Cong S-Z, Meng F-K, Yang H, Lin H-J, Sun Y (2017) Machine-learning-based classification of real-time tissue elastography for hepatic fibrosis in patients with chronic hepatitis B. Comput Biol Med 89:18–23
Li W, Liu H, Wang Y, Li Z, Jia Y, Gui G (2019) Deep learning-based classification methods for remote sensing images in urban built-up areas. IEEE Access 7:36274–36284
Deng L, Zhu H, Zhou Q, Li Y (2018) Adaptive top-hat filter based on quantum genetic algorithm for infrared small target detection. Multimed Tools Appl 77:10539–10551
Körez A, Barışçı N (2020) Object detection with low capacity GPU systems using improved faster R-CNN. Appl Sci 10:83–92
Fang W, Wang L, Ren P (2019) Tinier-YOLO: a real-time object detection method for constrained environments. IEEE Access 8:1935–1944
Wang H, Yu Y, Cai Y, Chen X, Chen L, Liu Q (2019) A comparative study of state-of-the-art deep learning algorithms for vehicle detection. IEEE Intell Transp Syst Mag 11:82–95
Wang D, Tang J, Zhu W, Li H, Xin J, He D (2018) Dairy goat detection based on faster R-CNN from surveillance video. Comput Electron Agric 154:443–449
Shen L, Shi J, Dong Y, Ying S, Peng Y, Chen L, Zhang Q, An H, Zhang Y (2019) An improved deep polynomial network algorithm for transcranial sonography-based diagnosis of Parkinson’s disease. Cogn Comput 12(3):553–562
Shi J, Zheng X, Li Y, Zhang Q, Ying S (2017) Multimodal neuroimaging feature learning with multimodal stacked deep polynomial networks for diagnosis of Alzheimer’s disease. IEEE J Biomed Health Inform 22:173–183
Matthews AGG, Van Der Wilk M, Nickson T, Fujii K, Boukouvalas A, León-Villagrá P, Ghahramani Z, Hensman J (2017) GPflow: a Gaussian process library using TensorFlow. J Mach Learn Res 18:1299–1304
Brandon N, Price PS (2020) Calibrating an agent-based model of longitudinal human activity patterns using the consolidated human activity database. J Eposure Sci Environ Epidemiol 30:194–204
Mazoyer B, Mellet E, Perchey G, Zago L, Crivello F, Jobard G, Delcroix N, Vigneau M, Leroux G, Petit L (2016) BIL&GIN: a neuroimaging, cognitive, behavioral, and genetic database for the study of human brain lateralization. Neuroimage 124:1225–1231
Khan RA, Crenn A, Meyer A, Bouakaz S (2019) A novel database of children’s spontaneous facial expressions (LIRIS-CSE). Image Vis Comput 83:61–69
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pan, H., Li, Y. & Zhao, D. Recognizing human behaviors from surveillance videos using the SSD algorithm. J Supercomput 77, 6852–6870 (2021). https://doi.org/10.1007/s11227-020-03578-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-020-03578-3