Abstract
Human detection and tracking is a key aspect in surveillance system due to its importance in timely identification of person, recognition of human activity and scene analysis. Convolutional neural networks have been widely used approach in detection and tracking related tasks. In this paper, a robust framework is presented for the human detection and tracking in noisy and occluded environments with the aid of data augmentation techniques. In addition, a softmax layer and integrated loss function is used to improve the detection and classification performance of the proposed model. The primary focus is to perform the human detection task in unconstrained environments. The implemented system outperforms the state-of-the-arts methods which can be validated from the experimental results.
Similar content being viewed by others
References
An F, Liu Z (2019) Facial expression recognition algorithm based on parameter adaptive initialization of CNN and LSTM. Vis. Comput. 35:1–16. https://doi.org/10.1007/s0037 1-019-01635 -4
Brunetti A, Buongiorno D, Francesco G, Bevilacqua V (2018) Neurocomputing computer vision and deep learning techniques for pedestrian detection and tracking : A survey. Neurocomputing 300:17–33
Bubryur Kim, N Yuvaraj, KR Sri Preethaa,R Santhosh, A Sabari (2020). Enhanced pedestrian detection using optimized deep convolution neural network for smart building surveillance, Soft Computing.https://doi.org/10.1007/s00500-020-04999-1
Chahyati D, Fanany MI, Arymurthy AM (2017) Tracking people by Detection Using CNN features. Procedia Comput Sci 124:167–172
Coltuc, D; Bolon, P (1999). Strict ordering on discrete images and applications. In Proceedings of the IEEE International Conference on Image Processing, Kobe, Japan, 24–28 October 1999; pp. 150–153
Coltuc D, Bolon P, Chassery J-M (2006) Exact histogram specification. IEEE Trans Image Process 15:1143–1152
Tran Thi Dinh, Nguyen Dinh Vinh, Jeon Jae Wook (2018). Robust Pedestrian Detection via a Recursive Convolution Neural Network, SNPD 2018, June 27–29 2018, Busan
X Du, M El-Khamy, J Lee, and L Davis (2017). “Fused dnn: A deep neural network fusion approach to fast and robust pedestrian detection,” in WACV
Dundar A, Jin J, Martini B, Culurciello E (2017) Embedded streaming deep neural networks accelerator with applications. IEEE TransNeural Netw & Learning Syst 28(7):1572–1583
Farhadi, A and Redmon, J (2016). YOLO9000: better, Faster, Stronger
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal Mach Intell 32:1627–1645
Flores Calero MJ, Aldás M, Lázaro J, Gardel A, Onofa N, Quinga B (2019) Pedestrian Detection under partial occlusion by using logic inference, HOG and SVM. IEEE Latin America Transactions 17(09):1552–1559. https://doi.org/10.1109/TLA.2019.8931190
R Girshick, J Donahue, T Darrell, and J Malik (2014). “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 580–587
Gonzalez, RC; Woods, RE (2010). Digital image processing, 3rd ed.; Prentice Hall: New Jersey
Guo K, Wu S, Xu YF (2017) Face recognition using both visible light image and near-infrared image and a deep network. CAAI Trans Intell Technol 2(1):39–47
Hajizadeh, MA; Ebrahimnezhad, H (2011). Classification of age groups from facial image using histograms of oriented gradients. In Proceedings of the 7th Iranian Conference on Machine Vision and Image Processing, Iran University of Science and Technology (IUST), Tehran, Iran, 16–17 November 2011; pp. 1–5
Huang, C, Lucey, S, Ramanan, D (2017). Learning policies for adaptive tracking with deep feature cascades. In: Computer Vision Foundation
Jen, TC; Hsieh, B; Wang, SJ (2005). Image contrast enhancement based on intensity-pair distribution.In Proceedings of the IEEE International Conference on Image Processing, Genova, Italy, 11–14 September 2005; pp. 1–4
H Jeon, VD Nguyen and JW Jeon (2019). “Pedestrian Detection Based on Deep Learning,” IECON - 45th annual conference of the IEEE industrial electronics society, Lisbon, Portugal, 2019, pp. 144–151, doi: https://doi.org/10.1109/IECON.2019.8927417
Jian M, Lam K, Dong J, Shen L (2015) Visual-patch-attention-aware saliency Detection. IEEE Transactions on Cybernetics 45(8):1575–1586. https://doi.org/10.1109/TCYB.2014.2356200
Jian M, Qi Q, Dong J, Yin Y, Lam K-M (2018) Integrating QDWD with pattern distinctness and local contrast for underwater saliency detection. Journal of Visual Communication and Image Representation, Volume 53:31–41. https://doi.org/10.1016/j.jvcir.2018.03.008
Jian M, Qi Q, Yu H, Dong J, Cui C, Nie X, Zhang H, Yin Y, Lam K-M (2019) The extended marine underwater environment database and baseline evaluations. Applied Soft Computing, Volume 80:425–437. https://doi.org/10.1016/j.asoc.2019.04.025
Karaaba, M; Surinta, O; Schomaker, L; Wiering, MA (2015). Robust face recognition by computing distances from multiple histograms of oriented gradients. In Proceedings of the IEEE Symposium Series on Computational Intelligence, Cape Town International Convention Center, Cape Town, South Africa, 7–10 December 2015; pp. 203–209
Kim S, Kwak S, Ko BC (2019) Fast Pedestrian Detection in surveillance video based on soft target training of shallow random Forest. IEEE Access 7:12415–12426. https://doi.org/10.1109/ACCESS.2019.2892425
Lan X, Ma AJ, Yuen PC, Chellappa R (2015) Joint sparse representation and robust feature-level fusion for multi-cue visual tracking IEEE trans. Image Process 24(12):5826–5841
Lei, Z, Chu, R, He, R, Liao, S, Li, SZ (2007). Face recognition by discriminant analysis with Gabor tensor representation. In: International Conference on Biometrics, pp. 87–95. Springer, Berlin
Li J, Liang X, Shen S, Xu T, Yan S (2018) Scale-aware fast R-CNN for pedestrian detection. IEEE Trans Multimedia 20(4):985–996
R Lienhart and J Maydt (2002). “An extended set of haar-like features for rapid object detection,” in ICIP
M Lin, Q Chen, and S Yan (2013). Network in network. arXiv preprint arXiv:1312.4400
J Liu, X Gao, N Bao, J Tang and G Wu (2017), “Deep convolutional neural networks for pedestrian detection with skip pooling,” 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, pp. 2056–2063, doi: https://doi.org/10.1109/IJCNN.2017.7966103
Shu-an Liu, Shi Lv, Hailin Zhang, Jun Gong (2019). Pedestrian Detection Algorithm Based on the Improved SSD, The 31th Chinese Control and Decision Conference (2019 CCDC), IEEE
P Luo, X Wang, and X Tang (2013). Pedestrian parsing via deep Decompositional neural network, in proceedings of IEEE international conference on computer vision (ICCV)
Lv JJ, Cheng C, Tian GD, Zhou XD, Zhou X (2016) Landmark perturbation-based data augmentation for unconstrained face recognition. Signal Process Image Commun 47:465–475
Madbouly AMM, Mostafa M-SM, Wafy M (2015) Performance assessment of feature detector-descriptor combination. Int J ComputSciIssues 12(5):87–94
Mateus A, Ribeiro D, Miraldo P, Nascimento JC (2019) Efficient and robust Pedestrian Detection using deep learning for human-aware navigation. Robot Auton Syst 113:23–37
Pawłowski, P, Piniarski, K (2015). D ˛abrowski, A. Pedestrian detection in low resolution night vision images. In Proceedings of the IEEE Signal Processing: Algorithms, Architectures, Arrangements, and Applications, Pozna ´n, Poland, 23–25 September 2015; pp. 185–190
J Redmon, S Divvala, R Girshick, and A Farhadi (2016). “You only look once: unified, real-time object detection,” in CVPR
KN Renu Chebrolu and PN Kumar (2019). “Deep learning based Pedestrian Detection at all light conditions,” 2019 International Conference on Communication and Signal Processing (ICCSP), Chennai, pp. 0838–0842, doi: https://doi.org/10.1109/ICCSP.2019.8698101
D Ribeiro, A Mateus, JC Nascimento, and P Miraldo (2016). “A real-time pedestrian detector using deep learning for human-aware navigation,“arXiv:1607.04441
Rivera AR, Ryu B, Chae O (2012) Content-aware dark image enhancement through channel division. IEEE Trans Image Process 21:3967–3980
Yahia Fahem Said, Mohammad Barr (2019). Pedestrian Detection for Advanced Driver Assistance Systems using Deep Learning Algorithms, IJCSNS International Journal of Computer Science and Network Security, VOL.19 No.9, September 2019
Anandamurugan Selvaraj, Jeeva Selvaraj ,Sivabalakrishnan Maruthaiappan, Gokulnath Chandra Babu, Priyan Malarvizhi Kumar (2020). L1 norm based pedestrian detection using video analytics technique, An international journal of computational intelligence, 22 February 2020. https://doi.org/10.1111/coin.12292
Supreeth HSG, Patil CM (2018) Efficient multiple moving object detection and tracking using combined background subtraction and clustering. Signal Image Video Process 15:1097
Y Tian, P Luo, X Wang, and X Tang (2015). “Deep learning strong parts for pedestrian detection,” in ICCV
Wentong Wang , Lichun Wang , Xufei Ge, Jinghua Li and Baocai Yin (2020), Pedestrian Detection Based on Two-Stream UDN, Appl. Sci. , 10, 1866; doi:https://doi.org/10.3390/app10051866
Wang, X, Wang, K, Lian, S (2019). A survey on face data augmentation. arXiv :1904.11685
Wojek C, Dollar P, Schiele B, Perona P (2012) Pedestrian detection:An evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach.Intell. 34(4):743
Xinxin S, Liangnian J, Qinghua L (2019) Detection of stationary humans using time-division UWB MIMO through-wall radar. The Journal of Engineering 2019(20):6799–6802. https://doi.org/10.1049/joe.2019.0542
Chenchen Xu, Guili Wang, Songsong Yan,Jianghua Yu, Baojun Zhang, Shu Dai, Yu Li, and Lin Xu, Fast Vehicle and Pedestrian Detection Using Improved Mask R-CNN (2020). Hindawi, Mathematical Problems in Engineering,Volume 2020, Article ID 5761414, 15 pages. https://doi.org/10.1155/2020/5761414
S Yang, P Luo, CC Loy, and X Tang (2015). “From facial parts responses to face detection: A deep learning approach,” in ICCV
Z Yang and R Nevatia (2016). “A multi-scale cascade fully convolutional network face detector,” in ICPR
Zhang, L Lin, X Liang, K He (2016). Is faster r-cnn doing well for pedestrian detection? In: European Conf. Computer Vision (ECCV), pp. 443–457.
Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett 23(10):1499–1503
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Haq, E.U., Jianjun, H., Li, K. et al. Human detection and tracking with deep convolutional neural networks under the constrained of noise and occluded scenes. Multimed Tools Appl 79, 30685–30708 (2020). https://doi.org/10.1007/s11042-020-09579-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09579-x