Abstract
Although numerous algorithms have been proposed for video object segmentation, it is still a challenging problem to segment video object in the case of occlusion. Video object localization is a critical step for an accurate object segmentation. To obtain an initial localization, we propose a new method, Spectral Context Matching (SCM), for a coarse object location. SCM rebuild the affinity Matrix using context information as similarity constraints of features to detect the corresponding areas. Adding with color and optical flow information, the initially estimated object location is selected. For object segmentation, we utilize a spatial-temporal graphical model on the estimated object region to get an accurate segmentation. In addition, we also impose an online update mechanism to detect and handle occlusion adaptively. Experimental results on DAVIS dataset and comparison with the-state-of-the-art method show that our proposed algorithm can efficiently handle heavy occlusion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barnes, C.: Patchmatch: a fast randomized matching algorithm with application to image and video. Princeton University (2011)
Brox, T., Malik, J.: Large displacement optical flow: descriptor matching in variational motion estimation. IEEE Trans. Pattern Anal. Mach. Intell. 33(3), 500–513 (2011)
Caelles, S., Maninis, K.K., Pont-Tuset, J., Leal-Taixé, L., Cremers, D., Van Gool, L.: One-shot video object segmentation. arXiv preprint arXiv:1611.05198 (2016)
Faktor, A., Irani, M.: Video segmentation by non-local consensus voting. In: BMVC, vol. 2, p. 8 (2014)
Grundmann, M., Kwatra, V., Han, M., Essa, I.: Efficient hierarchical graph-based video segmentation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2141–2148. IEEE (2010)
Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, pp. 604–613. ACM (1998)
Korman, S., Avidan, S.: Coherency sensitive hashing. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1607–1614. IEEE (2011)
Kudo, S., Koga, H., Yokoyama, T., Watanabe, T.: Robust automatic video object segmentation with graphcut assisted by surf features. In: 2012 19th IEEE International Conference on Image Processing (ICIP), pp. 297–300. IEEE (2012)
Lee, Y.J., Kim, J., Grauman, K.: Key-segments for video object segmentation. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1995–2002. IEEE (2011)
Leordeanu, M., Hebert, M.: A spectral technique for correspondence problems using pairwise constraints. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, vol. 2, pp. 1482–1489. IEEE (2005)
Perazzi, F., Pont-Tuset, J., McWilliams, B., Van Gool, L., Gross, M., Sorkine-Hornung, A.: A benchmark dataset and evaluation methodology for video object segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 724–732 (2016)
Perazzi, F., Wang, O., Gross, M., Sorkine-Hornung, A.: Fully connected object proposals for video segmentation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3227–3234 (2015)
Taylor, B., Karasev, V., Soatto, S.: Causal video object segmentation from persistence of occlusions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4268–4276 (2015)
Tsai, Y.H., Yang, M.H., Black, M.J.: Video segmentation via object flow. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3899–3908 (2016)
Wen, L., Du, D., Lei, Z., Li, S.Z., Yang, M.H.: JOTS: joint online tracking and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2226–2234 (2015)
Yang, F., Lu, H., Yang, M.H.: Robust superpixel tracking. IEEE Trans. Image Process. 23(4), 1639–1651 (2014)
Zhou, T., Lu, Y., Di, H., Zhang, J.: Video object segmentation aggregation. In: 2016 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2016)
Acknowledgments
This work was supported by the National Natural Science Foundation of China (No. 01273273)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Shi, X., Lu, Y., Zhou, T., Lei, X. (2018). Spectral Context Matching for Video Object Segmentation Under Occlusion. In: Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X. (eds) Advances in Multimedia Information Processing – PCM 2017. PCM 2017. Lecture Notes in Computer Science(), vol 10736. Springer, Cham. https://doi.org/10.1007/978-3-319-77383-4_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-77383-4_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77382-7
Online ISBN: 978-3-319-77383-4
eBook Packages: Computer ScienceComputer Science (R0)