Skip to main content

Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing -- PCM 2015 (PCM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9314))

Included in the following conference series:

Abstract

In this paper, we propose a novel interactive image segmentation method for RGB-D images using hierarchical Graph Cut. Considering the characteristics of RGB channels and depth channel in RGB-D image, we utilize Euclidean distance on RGB space and geodesic distance on 3D space to measure how likely a pixel belongs to foreground or background in color and depth respectively, and integrate the color cue and depth cue into a unified Graph Cut framework to obtain the optimal segmentation result. Moreover, to overcome the low efficiency problem of Graph Cut in handling high resolution images, we accelerate the proposed method with hierarchical strategy. The experimental results show that our method outperforms the state-of-the-art methods with high efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Li, S., Ju, R., Ren, T., Wu, G.: Saliency cuts based on adaptive triple threshoding. In: International Conference on Image Processing, pp. 1–4. IEEE (2015)

    Google Scholar 

  2. Nguyen, T.N.A., Cai, J., Zhang, J., Zheng, J.: Robust interactive image segmentation using convex active contours. IEEE Trans. Image Process. 21(8), 3734–3743 (2012)

    Article  MathSciNet  Google Scholar 

  3. Delgado-Gonzalo, R., Chenouard, N., Unser, M.: Spline-based deforming ellipsoids for interactive 3D bioimage segmentation. IEEE Trans. Image Process. 22(10), 3926–3940 (2013)

    Article  MathSciNet  Google Scholar 

  4. Cheng, M.M., Mitra, N.J., Huang, X., Torr, P.H.S., Hu, S.M.: Global contrast based salient region detection. IEEE Trans. Pattern Anal. Mach. Intell. 37(3), 569–582 (2014)

    Article  Google Scholar 

  5. Ren, T., Liu, Y., Wu, G.: Image retargeting based on global energy optimization. In: IEEE International Conference on Multimedia and Expo, pp. 406–409 (2009)

    Google Scholar 

  6. Xu, X., Geng, W., Ju, R., Yang, Y., Ren, T., Wu, G.: OBSIR: object-based stereo image retrieval. In: IEEE International Conference on Multimedia and Expo, pp. 1–6 (2014)

    Google Scholar 

  7. Greig, D., Porteous, B., Seheult, A.H.: Exact maximum a posteriori estimation for binary images. J. Roy. Stat. Soc. Ser. B (Methodol.) 51, 271–279 (1989)

    Google Scholar 

  8. Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary & region segmentation of objects in ND images. In: IEEE International Conference on Computer Vision, pp. 105–112 (2001)

    Google Scholar 

  9. Yatziv, L., Bartesaghi, A., Sapiro, G.: O(n) implementation of the fast marching algorithm. J. Comput. Phys. 212(2), 393–399 (2006)

    Article  MATH  Google Scholar 

  10. Diebold, J., Demmel, N., Hazırbaş, C., Moeller, M., Cremers, D.: Interactive multi-label segmentation of RGB-D images. In: Aujol, J.-F., Nikolova, M., Papadakis, N. (eds.) SSVM 2015. LNCS, vol. 9087, pp. 294–306. Springer, Heidelberg (2015)

    Google Scholar 

  11. Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1124–1137 (2004)

    Article  MATH  Google Scholar 

  12. Ju, R., Ge, L., Geng, W., Ren, T., Wu, G.: Depth saliency based on anisotropic center-surround difference. In: IEEE International Conference on Image Processing, pp. 1115–1119 (2014)

    Google Scholar 

  13. Peng, H., Li, B., Xiong, W., Hu, W., Ji, R.: RGBD salient object detection: a benchmark and algorithms. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part III. LNCS, vol. 8691, pp. 92–109. Springer, Heidelberg (2014)

    Google Scholar 

  14. Sang, J., Mei, T., Xu, Y.Q., Zhao, C., Xu, C., Li, S.: Interaction design for mobile visual search. IEEE Trans. Multimedia 15(7), 1665–1676 (2013)

    Article  Google Scholar 

  15. Sang, J.: User-centric social multimedia computing. Springer, Heidelberg (2014)

    Book  Google Scholar 

  16. Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)

    Article  Google Scholar 

  17. Grady, L.: Random walks for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 28(11), 1768–1783 (2006)

    Article  Google Scholar 

  18. Gulshan, V., Rother, C., Criminisi, A., Blake, A., Zisserman, A.: Geodesic star convexity for interactive image segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3129–3136 (2010)

    Google Scholar 

  19. Lombaert, H., Sun, Y., Grady, L., Xu, C.: A multilevel banded graph cuts method for fast image segmentation. In: IEEE International Conference on Computer Vision, pp. 259–265 (2005)

    Google Scholar 

  20. Vaudrey, T., Gruber, D., Wedel, A., Klappstein, J.: Space-time multi-resolution banded graph-cut for fast segmentation. In: Rigoll, G. (ed.) DAGM 2008. LNCS, vol. 5096, pp. 203–213. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  21. Kolmogorov, V., Criminisi, A., Blake, A., Cross, G., Rother, C.: Bi-layer segmentation of binocular stereo video. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 407–414 (2005)

    Google Scholar 

  22. Harville, M., Gordon, G., Woodfill, J.: Foreground segmentation using adaptive mixture models in color and depth. In: IEEE Workshop on Detection and Recognition of Events in Video, pp. 3–11 (2001)

    Google Scholar 

  23. Ahn, J.H., Kim, K., Byun, H.: Robust object segmentation using graph cut with object and background seed estimation. In: International Conference on Pattern Recognition, pp. 361–364. IEEE (2006)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the National Science Foundation of China (No.61321491, 61202320), Research Project of Excellent State Key Laboratory (No.61223003), and National Special Fund (No.2011ZX05035-004-004HZ).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gangshan Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Ge, L., Ju, R., Ren, T., Wu, G. (2015). Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24075-6_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24074-9

  • Online ISBN: 978-3-319-24075-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics