Abstract
In this article, we propose a Multi Layer Compound Markov Random Field (MLCMRF) Model to spatially segment different image frames of a given video sequence. The segmented image frames are combined with the change between the frames to detect the moving objects from a video. The proposed MLCMRF uses five Markov models in a single framework, one in spatial direction using color feature, four in temporal direction (using two color features and two edges/line fields). Hence, the proposed MLCMRF is a combination of spatial distribution of color, temporal color coherence and edge maps in the temporal frames. The use of such an edge preserving model helps in enhancing the object boundary in spatial segmentation and hence can detect moving objects with less effect of silhouette. A difference between the frames is used to generate the CDM and is subsequently updated with the previous frame video object plane (VOP) and the spatial segmentation of the consecutive frames, to detect the moving objects from the target image frames. Results of the proposed spatial segmentation approach are compared with those of the existing state-of-the-art techniques and are found to be better.
Similar content being viewed by others
Abbreviations
- (a,b):
-
Pixel position
- t :
-
tth time instant
- C D M t :
-
Change detection mask generated at time t
- y :
-
Observed video sequence
- y t :
-
Observed image frame at time t
- T h :
-
Threshold
- σ 2 :
-
Variance of each color plane
- k :
-
Variance-covariance matrix
- s :
-
Spatial site
- p :
-
A neighboring site of s
- y s t :
-
A site s of frame y t
- x :
-
Segmentation of y
- X t :
-
Markov random field at time t
- x t :
-
Realization of X t , segmented version of y t
- η s t :
-
Neighborhood of a site s in spatial direction of the tth frame
- q :
-
A neighboring site of s
- r :
-
A site in (t − 1) or (t − 2) th frame termed as temporal site
- e :
-
A site of the Laplacian edge image x t , x t − 1 or x t − 2 called as edge site
- V c (x t ):
-
Clique potential function in MRF
- V s c (x s t , x q t ):
-
Clique potential function in the spatial domain
- V t e c (x s t , x q r ):
-
Clique potential function in the temporal domain
- V t e e c (x s t , x e r ):
-
Clique potential function incorporating edge features
- α, β and γ :
-
MRF model bonding parameters
- \( \hat {x}_{t} \) :
-
Estimated label or MAP estimate
- 𝜃 :
-
Parameter vector associated with x t
- U(x t ):
-
Energy of realization x t
- N :
-
Gaussian process
- Z :
-
Partition function
- f :
-
Number of features (for RGB space it is 3)
References
Babacan SD, Pappas TN (2007) Spatiotemporal algorithm for background subtraction. In: Proceeding of IEEE international conference on acoustics, speech, and signal processing, pp 1065–1068
Barnich O, Van Droogenbroeck M (2011) ViBe: a universal background subtraction algorithm for video sequences. IEEE Trans Image Process 20(6):1709–1724
Benedek C, Sziranyi T, Kato Z, Zerubia J (2009) Detection of object motion regions in aerial image pairs with a multilayer Markovian model. IEEE Trans Image Process 18(10):2303–2315
Besag J (1986) On the statistical analysis of dirty pictures. J R Stat Soc Ser B Methodol 48(3):259–302
Bouwmans T (2012) Background subtraction for visual surveillance: a fuzzy approach. In: Pal S, Petrosino A, Maddalena L (eds) Handbook on soft computing for video surveillance, vol 5. Taylor and Francis Group, New York, pp 103–138
Bovic AL (2000) Handbook of image and video processing. Academic Press, New York
Cho JH, Kim SD (2004) Object detection using spatio-temporal thresholding in image sequences. Electron Lett 40(18):1109–1110
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619
Deng Y, Manjunath BS (2001) Unsupervised segmentation of color-texture regions in images and video. IEEE Trans Pattern Anal Mach Intell 23(8):800–810
Elgammal A, Duraiswami R, Harwood D, Davis LS (2002) Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proc IEEE 90(7):1151–1163
Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6(6):721–741
Ghosh A, Mondal A, Ghosh S (2014) Moving object detection using markov random field and distributeddifferential evolution. Appl Soft Comput 15:121–135
Gonzalez RC, Woods RE (2001) Digital image processing. Pearson Education, Singapore
Hinds RO, Pappas TN (1995) An adaptive clustering algorithm for segmentation of video sequences. In: Proceedings of international conference on acoustics, speech and signal processing, vol 4, pp 2427–2430
Huang SS, Fu LC, Hsiao PY (2007) Region-level motion-based background modeling and subtraction using MRFs. IEEE Trans Image Process 16(5):1446–1456
Hwang SW, Kim EY, Park SH, Kim HJ (2001) Object extraction and tracking using genetic algorithms. In: Proceedings of international conference on image processing, vol 2, pp 383–386
Kim EY, Park SH (2006) Automatic video segmentation using genetic algorithms. Pattern Recogn Lett 27(11):1252–1265
Kirkpatrick SC, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220(4598):671–680
Lee DS, Yeom S, Son JY, Kim SH (2010) Automatic image segmentation for concealed object detection using the expectation-maximization algorithm. Opt Express 18(10):10659–10667
Li SZ (2001) Markov random field modeling in image analysis. Springer, Japan
Lin W, Sun MT, Poovendran R, Zhang Z (2008) Activity recognition using a combination of category components and local models for video surveillance. IEEE Trans Circuits Syst Video Technol 18(8):1128–1139
Liu X, Qi C (2013) Future-data driven modeling of complex backgrounds using mixture of gaussians. Neurocomputing 119:439–453
Liu X, Zhao G, Yao J, Qi C (2015) Background subtraction based on low-rank and structured sparse decomposition. IEEE Trans Image Process 24(8):2502–2514
Luthon F, Caplier A, Lievin M (1999) Spatiotemporal MRF approach to video segmentation: application to motion detection and lip segmentation. Signal Process 76 (1):61–80
Oneata D, Revaud J, Verbeek J, Schmid C (2014) Spatio-temporal object detection proposals. In: European conference on computer vision, pp 737–752
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66
Palaniappan K, Ersoy I, Seetharaman G, Davis S, Kumar P, Rao RM, Linderman R (2011) Parallel flux tensor analysis for efficient moving object detection. In: Proceedings of the 14th international conference on information fusion, pp 535–546
Qiao Y-L, Yuan K-L, Song C-Y, Xiang X-Z (2014) Detection of moving objects with fuzzy color coherence vector. Math Probl Eng 2014, Article ID 138065:8
Rapantzikos K, Tsapatsoulis N, Avrithis Y, Kollias S (2009) Spatiotemporal saliency for video classification. Signal Process Image Commun 24(7):557–571
Rougier C, Meunier J, St-Arnaud A, Rousseau J (2011) Robust video surveillance for fall detection based on human shape deformation. IEEE Trans Circuits Syst Video Technol 21(5):611–622
Salember P, Marqués F (1999) Region based representation of image and video segmentation tools for multimedia services. IEEE Trans Circuits Syst Video Technol 9 (8):1147–1169
Sasaki GH, Hajek B (1988) The time complexity of maximum matching by simulated annealing. J ACM 35(2):387–403
Satake J, Miura J (2009) Robust stereo-based person detection and tracking for a person following robot. In: Proceedings of IEEE international conference on robotics and automation, pp 1–6
Sheikh Y, Shah M (2005) Bayesian modeling of dynamic scenes for object detection. IEEE Trans Pattern Anal Mach Intell 27(11):1778–1792
Shi Q, Wang L, Cheng L, Smola A (2008) Discriminative human segmentation and recognition using semi-Markov model. In: Proceedings of IEEE conference on computer vision and pattern recognition, pp 1–8
Stauffer C, Grimson WEL (2000) Learning patterns of activity using real-time tracking. IEEE Trans Pattern Anal Mach Intell 22(8):747–757
Stolkin R, Greig A, Hodgetts M, Gilby J (2008) An EM/E-MRF algorithm for adaptive model based tracking in extremely poor visibility. Image Vis Comput 26 (4):480–495
Su TF, Chen YL, Lai SH (2010) Over-segmentation based background modeling and foreground detection with shadow removal by using hierarchical MRFs. In: Proceedings of ACCV, pp 535–546
Teixeira LF, Cardoso JS, Corte-Real L (2007) Object segmentation using background modelling and cascaded change detection. J Multimed 2(5):55–65
Tekalp AM (1995) Digital video processing. Prentice Hall, New Jersey
Wang R, Bunyak F, Seetharaman G, Palaniappan K (2014) Static and moving object detection using flux tensor with split gaussian models. In: 2014 IEEE conference on computer vision and pattern recognition workshops (CVPRW), pp 420–424
Wu GK, Reed TR (1999) Image sequence processing using spatio temporal segmentation. IEEE Trans Circuits Syst Video Technol 9(5):798–807
Xu D, Yan S, Tao D, Zhang L, Li X, Zhang HJ (2006) Human gait recognition with matrix representation. IEEE Trans Circuits Syst Video Technol 16(7):896–903
Zhang YJ (2006) Advances in image and video segmentation. IRM Press, New York
Zhou X, Yang C, Yu W (2013) Moving object detection by detecting contiguous outliers in the low-rank representation. IEEE Trans Pattern Anal Mach Intell 35(3):597–610
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Subudhi, B.N., Ghosh, S., Nanda, P.K. et al. Moving object detection using spatio-temporal multilayer compound Markov Random Field and histogram thresholding based change detection. Multimed Tools Appl 76, 13511–13543 (2017). https://doi.org/10.1007/s11042-016-3698-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-016-3698-2