Skip to main content

Spectral Context Matching for Video Object Segmentation Under Occlusion

  • Conference paper
  • First Online:
Book cover Advances in Multimedia Information Processing – PCM 2017 (PCM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10736))

Included in the following conference series:

  • 2290 Accesses

Abstract

Although numerous algorithms have been proposed for video object segmentation, it is still a challenging problem to segment video object in the case of occlusion. Video object localization is a critical step for an accurate object segmentation. To obtain an initial localization, we propose a new method, Spectral Context Matching (SCM), for a coarse object location. SCM rebuild the affinity Matrix using context information as similarity constraints of features to detect the corresponding areas. Adding with color and optical flow information, the initially estimated object location is selected. For object segmentation, we utilize a spatial-temporal graphical model on the estimated object region to get an accurate segmentation. In addition, we also impose an online update mechanism to detect and handle occlusion adaptively. Experimental results on DAVIS dataset and comparison with the-state-of-the-art method show that our proposed algorithm can efficiently handle heavy occlusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barnes, C.: Patchmatch: a fast randomized matching algorithm with application to image and video. Princeton University (2011)

    Google Scholar 

  2. Brox, T., Malik, J.: Large displacement optical flow: descriptor matching in variational motion estimation. IEEE Trans. Pattern Anal. Mach. Intell. 33(3), 500–513 (2011)

    Article  Google Scholar 

  3. Caelles, S., Maninis, K.K., Pont-Tuset, J., Leal-Taixé, L., Cremers, D., Van Gool, L.: One-shot video object segmentation. arXiv preprint arXiv:1611.05198 (2016)

  4. Faktor, A., Irani, M.: Video segmentation by non-local consensus voting. In: BMVC, vol. 2, p. 8 (2014)

    Google Scholar 

  5. Grundmann, M., Kwatra, V., Han, M., Essa, I.: Efficient hierarchical graph-based video segmentation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2141–2148. IEEE (2010)

    Google Scholar 

  6. Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, pp. 604–613. ACM (1998)

    Google Scholar 

  7. Korman, S., Avidan, S.: Coherency sensitive hashing. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1607–1614. IEEE (2011)

    Google Scholar 

  8. Kudo, S., Koga, H., Yokoyama, T., Watanabe, T.: Robust automatic video object segmentation with graphcut assisted by surf features. In: 2012 19th IEEE International Conference on Image Processing (ICIP), pp. 297–300. IEEE (2012)

    Google Scholar 

  9. Lee, Y.J., Kim, J., Grauman, K.: Key-segments for video object segmentation. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1995–2002. IEEE (2011)

    Google Scholar 

  10. Leordeanu, M., Hebert, M.: A spectral technique for correspondence problems using pairwise constraints. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, vol. 2, pp. 1482–1489. IEEE (2005)

    Google Scholar 

  11. Perazzi, F., Pont-Tuset, J., McWilliams, B., Van Gool, L., Gross, M., Sorkine-Hornung, A.: A benchmark dataset and evaluation methodology for video object segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 724–732 (2016)

    Google Scholar 

  12. Perazzi, F., Wang, O., Gross, M., Sorkine-Hornung, A.: Fully connected object proposals for video segmentation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3227–3234 (2015)

    Google Scholar 

  13. Taylor, B., Karasev, V., Soatto, S.: Causal video object segmentation from persistence of occlusions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4268–4276 (2015)

    Google Scholar 

  14. Tsai, Y.H., Yang, M.H., Black, M.J.: Video segmentation via object flow. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3899–3908 (2016)

    Google Scholar 

  15. Wen, L., Du, D., Lei, Z., Li, S.Z., Yang, M.H.: JOTS: joint online tracking and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2226–2234 (2015)

    Google Scholar 

  16. Yang, F., Lu, H., Yang, M.H.: Robust superpixel tracking. IEEE Trans. Image Process. 23(4), 1639–1651 (2014)

    Article  MathSciNet  Google Scholar 

  17. Zhou, T., Lu, Y., Di, H., Zhang, J.: Video object segmentation aggregation. In: 2016 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2016)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China (No. 01273273)

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Xiaoxue Shi , Yao Lu , Tianfei Zhou or Xiaoyu Lei .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shi, X., Lu, Y., Zhou, T., Lei, X. (2018). Spectral Context Matching for Video Object Segmentation Under Occlusion. In: Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X. (eds) Advances in Multimedia Information Processing – PCM 2017. PCM 2017. Lecture Notes in Computer Science(), vol 10736. Springer, Cham. https://doi.org/10.1007/978-3-319-77383-4_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77383-4_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77382-7

  • Online ISBN: 978-3-319-77383-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics