Abstract
Online object tracking under complex environments is an important but challenging problem in computer vision, especially for illumination changing and occlusion conditions. With the emergence of commercial real-time depth cameras like Kinect, depth image-based object tracking, which is insensitive to illumination changing, gains more and more attentions. In this paper, we propose an online depth image-based object tracking method with sparse representation and object detection. In this framework, we combine tracking and detection to leverage precision and efficiency under heavy occlusion conditions. For tracking, objects are represented by sparse representations learned online with update. For detection, we apply two different strategies based on tracking-learning-detection and wider search window approaches. We evaluate our methods on both the subset of the public dataset Princeton Tracking Benchmark and our own driver face video in a simulated driving environment. The quantitative evaluations of precision and running time on these two datasets demonstrate the effectiveness and efficiency of our proposed object tracking algorithms.
Similar content being viewed by others
References
Avidan S (2004) Support vector tracking. IEEE Trans Pattern Anal Mach Intell 26(8):1064–1072
Avidan S (2007) Ensemble tracking. IEEE Trans Pattern Anal Mach Intell 29(2):261–271
Cai Q, Gallup D, Zhang C, Zhang Z (2010) 3D deformable face tracking with a commodity depth camera. In: Computer Vision-ECCV 2010, Springer, pp 229–242
Cao Y, Lu BL (2013) Neural information processing., Real-time head detection with kinect for driving fatigue detectionSpringer, Heidelberg, pp 600–607
Colombo A, Cusano C, Schettini R (2006) 3D face detection using curvature analysis. Pattern Recognit 39(3):444–455
Comaniciu D, Ramesh V, Meer P (2000) Real-time tracking of non-rigid objects using mean shift. In: IEEE conference on computer vision and pattern recognition, vol 2, pp 142–149
Comaniciu D, Ramesh V, Meer P (2003) Kernel-based object tracking. IEEE Trans Pattern Anal Mach Intell 25(5):564–577
Hale ET, Yin W, Zhang Y (2008) Fixed-point continuation for \(\backslash ell\_1\)-minimization: methodology and convergence. SIAM J Optim 19(3):1107–1130
Hu W, Li X, Zhang X, Shi X, Maybank S, Zhang Z (2011) Incremental tensor subspace learning and its applications to foreground segmentation and tracking. Int J Comput Vis 91(3):303–327
Ji Q, Yang X (2002) Real-time eye, gaze, and face pose tracking for monitoring driver vigilance. Real-Time Imaging 8(5):357–377
Ji Q, Zhu Z, Lan P (2004) Real-time nonintrusive monitoring and prediction of driver fatigue. IEEE Trans Veh Technol 53(4):1052–1068
Jia X, Lu H, Yang MH (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE conference on computer vision and pattern recognition, pp 1822–1829
Kalal Z, Mikolajczyk K, Matas J (2012) Tracking-learning-detection. IEEE Trans Pattern Anal Mach Intell 34(7):1409–1422
Martínez AM (2002) Recognizing imprecisely localized, partially occluded, and expression variant faces from a single sample per class. IEEE Trans Pattern Anal Mach Intell 24(6):748–763
Mei X, Ling H (2011) Robust visual tracking and vehicle classification via sparse representation. IEEE Trans Pattern Anal Mach Intell 33(11):2259–2272
Nummiaro K, Koller-Meier E, Van Gool L (2003) An adaptive color-based particle filter. Image Vision Comput 21(1):99–110
Paragios N, Deriche R (2000) Geodesic active contours and level sets for the detection and tracking of moving objects. IEEE Trans Pattern Anal Mach Intell 22(3):266–280
Paschos G (2001) Perceptually uniform color spaces for color texture analysis: an empirical evaluation. IEEE Trans Image Process 10(6):932–937
Pei SC, Lin CN (1995) Image normalization for pattern recognition. Image Vis Comput 13(10):711–723
Pérez P, Hue C, Vermaak J, Gangnet M (2002) Color-based probabilistic tracking. In: European conference on computer vision, Springer, pp 661–675
Ross DA, Lim J, Lin RS, Yang MH (2008) Incremental learning for robust visual tracking. Int J Comput Vis 77(1–3):125–141
Sahayadhas A, Sundaraj K, Murugappan M (2012) Detecting driver drowsiness based on sensors: a review. Sensors 12(12):16,937–16,953
Shen SC, Zheng WL, Lu BL (2014) Online object tracking based on depth image with sparse coding. In: Neural information processing, Springer, pp 234–241
Shi LC, Lu BL (2013) EEG-based vigilance estimation using extreme learning machines. Neurocomputing 102:135–143
Song S, Xiao J (2013) Tracking revisited using rgbd camera: Unified benchmark and baselines. In: IEEE international conference on computer vision, pp 233–240
Spinello L, Arras KO (2011) People detection in RGB-D data. In: IEEE/RSJ international conference on intelligent robots and systems, pp 3838–3843
Wang D, Lu H, Yang MH (2013) Online object tracking with sparse prototypes. IEEE Trans Image Process 22(1):314–325
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Wu Y, Lim J, Yang MH (2013) Online object tracking: A benchmark. In: IEEE conference on computer vision and pattern recognition, pp 2411–2418
Xia L, Chen CC, Aggarwal JK (2011) Human detection using depth information by kinect. In: IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW), pp 15–22
Yang H, Shao L, Zheng F, Wang L, Song Z (2011a) Recent advances and trends in visual tracking: A review. Neurocomputing 74(18):3823–3831
Yang M, Zhang L (2010) Gabor feature based sparse representation for face recognition with gabor occlusion dictionary. In: Computer Vision-ECCV 2010, Springer, pp 448–461
Yang M, Zhang L, Yang J, Zhang D (2011b) Robust sparse coding for face recognition. In: IEEE conference on computer vision and pattern recognition, pp 625–632
Yang T, Pan Q, Li J, Li SZ (2005) Real-time multiple objects tracking with occlusion handling in dynamic scenes. IEEE conference on computer vision and pattern recognition, vol 1, pp 970–975
Yilmaz A, Li X, Shah M (2004) Contour-based object tracking with occlusion handling in video acquired using mobile cameras. IEEE Trans Pattern Anal Mach Intell 26(11):1531–1536
Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. ACM Comput Surveys (CSUR) 38(4):13
Zhang S, Yao H, Sun X, Lu X (2013) Sparse coding based visual tracking: review and experimental comparison. Pattern Recogn 46(7):1772–1788
Acknowledgments
This work was supported in part by the grants from the National Natural Science Foundation of China (Grant No. 1272248), the National Basic Research Program of China (Grant No. 2013CB329401), and the Science and Technology Commission of Shanghai Municipality (Grant No. 3511500200).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zheng, WL., Shen, SC. & Lu, BL. Online Depth Image-Based Object Tracking with Sparse Representation and Object Detection. Neural Process Lett 45, 745–758 (2017). https://doi.org/10.1007/s11063-016-9509-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-016-9509-y