Abstract
Moving object detection is crucial for cognitive vision-based robot tasks. However, due to noise, dynamic background, variations in illumination, and high frame rate, it is a challenging task to robustly and efficiently detect moving objects in video using the clue of motion. State-of-the-art batch-based methods view a sequence of images as a whole and then model the background and foreground together with the constraints of foreground sparsity and connectivity (smoothness) in a unified framework. But the efficiency of the batch-based methods is very low. State-of-the-art incremental methods model the background by a subspace whose bases are updated frame by frame. However, such incremental methods do not make full use of the foreground sparsity and connectivity. In this paper, we develop an incremental method for detecting moving objects in video. Compared to existing methods, the proposed method not only incrementally models the subspace for background reconstruction but also takes into account the sparsity and connectivity of the foreground. The optimization of the model is very efficient. Experimental results on nine public videos demonstrate that the proposed method is much efficient than the state-of-the-art batch methods and has higher F1-score than the state-of-the-art incremental methods.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Yuen P, Gao Y, Griffiths A, Coates A, Muller J, Smith A, Walton D, Leff C, Hancock B, Shin D. Exomars rover pancam: autonomous & computational intelligence. IEEE Comput Intell Mag. 2013;8(4):52–61.
Dios PF, Chung PWH, Meng Q. Landmark-based methods for temporal alignment of human motions. IEEE Comput Intell Mag. 2014;9(2):29–37.
Gao F, Zhang Y, Wang J, Sun J, Yang E, Hussain A. Visual attention model based vehicle target detection in synthetic aperture radar images: a novel approach. Cogn Comput. 2015;7(4):434–44.
Zeng W, Wang C, Li Y. Model-based human gait recognition via deterministic learning. Cogn Comput. 2014;6(2):218–29.
Shah M, Deng J, Woodford B. Video background modeling: recent approaches, issues and our proposed techniques. Mach Vis Appl. 2014;25(5):1105–19.
Zhao J, Du C, Sun H, Liu X, Sun J. Biologically motivated model for outdoor scene classification. Cogn Comput. 2015;7(1):20–3.
Pang Y, Zhu H, Li X, Li X. Classifying discriminative features for blur detection. IEEE Trans Cybern. 2015. doi:10.1109/TCYB.2015.2472478.
Zhou X, Yang C, Zhao H, Yu W. Moving object detection by detecting contiguous outliers in the low-rank representation. IEEE Trans Pattern Anal Mach Intell. 2013;35(3):597–610.
Guo X, Wang X, Yang L, Cao X, Ma Y. Robust foreground detection using smoothness and arbitrariness constraints. In: Proceedings of European conference on computer vision, 2014.
Zhou X, Yang C, Zhao H, Yu W. Low-rank modeling and its applications in image analysis. CoRR. abs/1401.3409 (2014).
Candes E, Li X, Ma Y, Wright J. Robust principal component analysis? J ACM. 2011;58(3):1–37.
He J, Balzano L, Szlam A. Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video. In: Proceedings of IEEE international conference on computer vision and pattern recognition, 2012.
Haines T, Xiang T. Background subtraction with Dirichlet process mixture models. IEEE Trans Pattern Anal Mach Intell. 2014;36(4):670–83.
Viola P, Jones M. Robust real-time face detection. Int J Comput Vis. 2004;57(2):137–54.
Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: Proceedings of IEEE international conference on computer vision and pattern recognition, 2005.
Pang Y, Zhang K, Yuan Y, Wang K. Distributed object detection with linear SVMs. IEEE Trans Cybern. 2014;44(11):2122–33.
Lowe DG. Distinctive image features from scale-invariant keypoints. Int J Comput Vis. 2004;60(2):91–110.
Dollar P, Appel R, Belongie S, Perona P. Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell. 2014;36(8):1532–45.
Dollar P, Tu Z, Perona P, Belongie S. Integral channel features. In: Proceedings of british machine vision conference, 2009.
Isard M, Blake A. CONDENSATION—conditional density propagation for visual tracking. Int J Comput Vis. 1998;29(1):5–28.
Tu Z, Zheng A, Yang E, Luo B, Hussain A. A biologically inspired vision-based approach for detecting multiple moving objects in complex outdoor scenes. Cogn Comput. 2015;7(5):539–51.
Wang K, Xu L, Fang Y, Li J. One-against-all frame differences based hand detection for human and mobile interaction. Neurocomputing. 2013;120:185–91.
Neri A, Colonnese S, Russo G, Talone P. Automatic moving object and background separation. Signal Process. 1998;66:219–32.
Li L, Huang W, Gu I, Tian Q. Statistical modeling of complex backgrounds for foreground object detection. IEEE Trans Image Process. 2004;13(11):1459–72.
Haritaoglu I, Harwood D, Davis L. W4: real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell. 2000;22(8):809–30.
Xu J, Ithapu VK, Mukherjee L, Rehg JM, Singh V. GOSUS: Grassmannian online subspace updates with structured-sparsity. In: Proceedings of IEEE international conference on computer vision, 2013.
Yuan Y, Pang Y, Pan J, Li X. Scene segmentation based on IPCA for visual surveillance. Neurocomputing. 2009;72(10–12):2450–4.
Pang Y, Wang S, Yuan Y. Learning regularized LDA by clustering. IEEE Trans Neural Netw Learn Syst. 2014;25(12):2191–201.
Pang Y, Ji Z, Jing P, Li X. Ranking graph embedding for learning to rerank. IEEE Trans Neural Netw Learn Syst. 2013;24(8):1292–303.
Bouwmans T, Zahzah E. Robust PCA via principal component pursuit: a review for a comparative evaluation in video surveillance. Comput Vis Image Underst. 2014;122(4):22–3.
Qiu CN, Vaswani N. Missing link between recursive robust pca and recursive sparse recovery in large but correlated noise. arXiv:1106.3286, 2011.
Torre F, Black M. A framework for robust subspace learning. Int J Comput Vis. 2003;54(1):117–42.
Favaro P, Vidal R, Ravichandran A. A closed form solution to robust subspace estimation and clustering. In: Proceedings of IEEE international conference on computer vision and pattern recognition, 2011.
Vidal R, Ma Y, Sastry. Generalized principal component analysis (GPCA). IEEE Trans Patt Anal Mach Intell. 2005;27(12):1945–59.
Balzano L, Nowak R, Recht B. Online identification and tracking of subspaces from highly incomplete information. In: Proceedings of Allerton conference on communication, 2010.
Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Susstrunk S. SLIC: superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell. 2012;34(11):2274–82.
Mittal S, Meer P. Conjugate gradient on Grassmann manifolds for robust subspace estimation. Image Vis Comput. 2012;30(6–7):417–27.
Arias T, Edelman A, Smith S. The geometry of algorithms with orthogonality constraints. SIAM J Matrix Anal Appl. 1998;20(2):303–53.
Boykov Y, Veksler O, Zabih R. Fast approximate energy minimization via graph cuts. IEEE Trans Pattern Anal Mach Intell. 2001;23(11):1222–39.
Acknowledgments
This work was supported in part by the National Basic Research Program of China (973 Program) (Grant No. 2014CB340400), the National Natural Science Foundation of China (Grant Nos. 61172121, 61271412, 61472274, and 61222109, 61503274), the Key Research Program of the Chinese Academy of Sciences (Grant No. KGZD-EW-T03), and Excellent Young Scholar of the Tianjin University of Technology and Education (Grant No. RC14-46).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
Jing Pan, Xiaoli Li, Xuelong Li, and Yanwei Pang declare that they have no conflict of interest.
Informed Consent
All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2008 (5). Additional informed consent was obtained from all patients for which identifying information is included in this article.
Human and Animal Rights
This article does not contain any studies with human or animal subjects performed by the any of the authors.
Rights and permissions
About this article
Cite this article
Pan, J., Li, X., Li, X. et al. Incrementally Detecting Moving Objects in Video with Sparsity and Connectivity. Cogn Comput 8, 420–428 (2016). https://doi.org/10.1007/s12559-015-9373-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-015-9373-5