Abstract
The segmentation of moving objects become challenging when the object motion is small, the shape of object changes, and there is global background motion in unconstrained videos. In this paper, we propose a fully automatic, efficient, fast and composite framework to segment the moving object on the basis of saliency, locality, color and motion cues. First, we propose a new saliency measure to predict the potential salient regions. In the second step, we use the RANSAC homography and optical flow to compensate the background motion and get reliable motion information, called motion cues. Furthermore, the saliency information and motion cues are combined to get the initial segmented object (seeded region). A refinement is performed to remove the unwanted noisy details and expand the seeded region to the whole object. Detailed experimentation is carried out on challenging video benchmarks to evaluate the performance of the proposed method. The results show that the proposed method is faster and performs better than state-of-the-art approaches.











Similar content being viewed by others
References
Bai X, Wang J, Simons D, Sapiro G (2009) Video snapcut: robust video object cutout using localized classifiers. ACM Transactions on Graphics, 28(3)
Banica D, Agape A, Ion A, Sminchisescu C (2013) Video object segmentation by salient segment chain composition. IEEE Conference on Computer Vision Workshop
Brox T, Malik J (2010) Large displacement optical flow: descriptor matching in variational motion estimation. IEEE Trans Pattern Anal Mach Intell 33(3):500–513
Brox T, Malik J (2010) Object segmentation by long term analysis of point trajectories. In: European Conference on Computer Vision
Bugeau A, Perez P (2007) Detection and segmentation of moving objects in highly dynamic scenes. IEEE Conference on Computer Vision and Pattern Recognition, 1–8
Chiu W, Fritz M (2013) Multi-class video co-segmentation with a generative multi-video model. IEEE Conference on Computer Vision and Pattern Recognition
Chockalingam P, Pradeep N, Birchfield S (2009) Adaptive fragments-based tracking of non-rigid objects using level sets. In: BMVC, pp 1530–1537
Elgammal A, Duraiswami R, Harwood D, Davis L (2002) Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proc IEEE 90(7):1151–1163
Faktor A, Irani M (2014) Video segmentation by non-local consensus voting. Proceedings of the British Machine Vision Conference
Fischler M, Bolles R (1981) Random sample consensus: a paradigm for model fitting with application to image analysis and automated cartography. Commun ACM 24(6):381–395
Fragkiadaki K, Arbelaez P, Felsen P, Malik J (2015) Learning to segment moving objects in videos. IEEE Conference on Computer Vision and Pattern Recognition, 4083–4090
Galasso F, Nagaraja NS, Cardenas TJ, Brox T, Schiele B (2013) A unified video segmentation benchmark: Annotation, metrics and analysis. IEEE Conference on Computer Vision
Gelgon M, Bouthemy P (1997) A region-level graph labeling approach to motion-based segmentation. IEEE Conference on Computer Vision and Pattern Recognition, 514–519
Grundmann M, Kwatra V, Han M, Essa I (2010) Efficient hierarchical graph based video segmentation. IEEE Conference on Computer Vision and Pattern Recognition
Han M, Xu W, Gong Y (2006) Video object segmentation by motion-based sequential feature clustering. In: ACM Multimedia, pp 773–782
Hartley R, Zisserman A (2004) Multiple View Geometry in Computer Vision. Cambridge University Press second edition
Heikkila M, Pietikainen M (2006) A texture-based method for modeling the background and detecting moving objects. IEEE Trans Pattern Anal Mach Intelligence 28(4):657–662
I2R (2010) Efficient hierar chicalgraph-based video segmentation. IEEE Conference on Computer Vision and Pattern Recognition, 2141–2148
Jojic B, Kannan A (2003) Learning appearance and transparency manifolds of occluded objects in layers. IEEE Conference on Computer Vision and Pattern Recognition
Joulin A, Bach F, Ponce J (2012) Multi-class cosegmentation. IEEE Conference on Computer Vision and Pattern Recognition
KaewTraKulPong P, Bowden R (2001) An improved adaptive background mixture model for real-time tracking with shadow detection. Proceedings of the 2nd European Workshop Advanced Video Based Surveillance Systems 1:1–5
Khan S, Shah M (2001) Object based segmentation of video using color, motion and spatial information. IEEE Conference on Computer Vision and Pattern Recognition, 746–751
Khan A, Ullah J, Jaffar M, Chai T (2014) Color imagesegmentation: A novel spatial fuzzy genetic algorithm. SIViP, Springer 8(7):1233–1243
Lee Y, Kim J, Grauman K (2011) Key-segments for video object segmentation. IEEE Conference on Computer Vision, 1995–2002
Li L, Huang W, Gu I, Tian Q (2003) Foreground object detection from videos containing complex background. ACM International Conference on Multimedia, 2–10
Li F, Kim T, Humayun A, Tsai D, Rehg J (2013) Video segmentation by tracking many figure-ground segments. IEEE Conference on Computer Vision
Liu C (2009) Beyond pixels: Exploring new representations and applications for motion analysis. In: Doctoral Thesis. Massachusetts Institute of Technology
Liu F, Gleicher M (2009) Learning color and locality cues for moving object detection and segmentation. IEEE Conference on Computer Vision and Pattern Recognition
Liu C, Yuen J, Russell B, Torralba A (2009) Labelme video: Building a video database with human annotations. IEEE Conference on Computer Vision, 1451–1458
Ma T, Latecki LL (2012) Maximum weight cliques with mutex constraints for video object segmentation. IEEE Conference on Computer Vision and Pattern Recognition, 670–677
Mahamud S (2006) Comparing belief propagation and graph cuts for novelty detection. IEEE Conference on Computer Vision and Pattern Recognition, 1154–1159
Matsuyama T, Wada T, Habe H, Tanahashi K (2006) Background subtraction under varying illumination. Syst Comput Japan 37(4):77–88
Ochs P, Brox T (2011) Object segmentation in video: a hierarchical variational approach for turning point trajectories into dense regions. IEEE Conference on Computer Vision
Ochs T, Brox P (2012) Higher order motion models and spectral clustering. IEEE Conference on Computer Vision and Pattern Recognition
Oneata D, Revaud J, Verbeek J, Schmid C (2014) Spatio-temporal object detection proposals. In: European Conference on Computer Vision
Papazoglou A, Ferrari V (2013) Fast object segmentation in unconstrained video. IEEE International Conference on Computer Vision (ICCV), 1777–1784
Price B, Morse B, Cohen S (2009) Livecut: Learning-based interactive video segmentation by evaluation of multiple propagated cues. IEEE Conference on Computer Vision, 779–786
Rahtu E, Kannala J, Salo M, Heikki J (2010) Segmenting salient objects from images and videos. ECCV
Reddy V, Sanderson C, Lovell B (2013) Improved foreground detection via block-based classifier cascade with probabilistic decision integration. IEEE Trans Circuits Syst Video Technol 23(1): 83–93
Rodriguez DM, Ahmed J, Shah M (2011) Action mach: A spatio-temporal maximum average correlation height filter for action recognition. IEEE Conference on Computer Vision and Pattern Recognition
Rubio E, Baena R (2011) Stochastic approximation for background modelling. Int J Comput Vis Image Underst 115(6):735–749
Rubio E, Baena R, Dominguez E (2011) Foreground detection in video sequences with probabilistic self-organizing maps. Int J Neural Syst 21(3):225–246
Sheikh Y, Shah M (2005) Bayesian object detection in dynamic scenes. IEEE Conference on Computer Vision and Pattern Recognition, 74–79
Sun J, Zhang W, Tang X, Shum HY (2006) Background cut. In: Proceedings of ECCV, pp 628–641
Toyama K, Krumm J, Brumitt B, Meyers B (1999) Wallflower: Principles and practice of background maintenance. Seventh International Conference on Computer Vision, 255–261
Tsai D, Flagg M, Rehg J (2010) Motion coherent tracking with multi-label mrf optimization. In: BMVC
Wang Y, Ji Q (2005) A dynamic conditional random field model for object segmentation in image sequences. In: Proceedings of IEEE CVPR, pp 264–270
Xu C, Corso JJ (2012) Evaluation of supervoxel methods for early video processing. IEEE Conference on Computer Vision and Pattern Recognition
Xu X, Huang T (2008) A loopy belief propagation approach for robust background estimation. IEEE Conference on Computer Vision and Pattern Recognition, 1–7
Xu D, Zhang B, Fu H, Lin S (2014) Object-based multiple foreground video co-segmentation. IEEE Conference on Computer Vision and Pattern Recognition
Yin P, Criminisi A, Winn J, Essa I (2007) Tree-based classifiers for bilayer video segmentation. IEEE Conference on Computer Vision and Pattern Recognition, 1–8
Zhang D, Javed O, Shah M (2013) Video object segmentation through spatially accurate and temporally dense extraction of primary object regions. IEEE Conference on Computer Vision and Pattern Recognition, 1451–1458
Zhang G, Jia J, Xiong W, Wong T, Heng P, Bao H (2007) Moving object extraction with a hand-held camera. IEEE Conference on Computer Vision, 1–8
Acknowledgements
The authors would like to thank the anonymous reviewers for their valuable comments and suggestions that helped to improve the quality of paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ullah, J., Khan, A. & Jaffar, M.A. Motion cues and saliency based unconstrained video segmentation. Multimed Tools Appl 77, 7429–7446 (2018). https://doi.org/10.1007/s11042-017-4655-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-4655-4