Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance

Ge, Ling; Ju, Ran; Ren, Tongwei; Wu, Gangshan

doi:10.1007/978-3-319-24075-6_12

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9314)

Included in the following conference series: Pacific Rim Conference on Multimedia

Abstract

In this paper, we propose a novel interactive segmentation method for RGB-D images based on hierarchical Graph Cut. To account for the different characteristics of the RGB channels and the depth channel of an RGB-D image, we use Euclidean distance in RGB space and geodesic distance in 3D space to measure how likely a pixel is to belong to the foreground or the background in terms of color and depth, respectively. We integrate the color cue and the depth cue into a unified Graph Cut framework to obtain the optimal segmentation result. Moreover, to overcome the low efficiency of Graph Cut on high-resolution images, we accelerate the proposed method with a hierarchical strategy. Experimental results show that our method outperforms state-of-the-art methods while remaining highly efficient.
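
As an illustration of the two distance cues named in the abstract, the following Python sketch computes a per-pixel Euclidean distance in RGB space to the nearest seed color and a geodesic distance over the depth surface. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `color_distance` and `geodesic_distance`, the 4-connected grid, the unit pixel spacing, and the use of Dijkstra's algorithm as the shortest-path solver are all assumptions, since the abstract does not specify them. The Graph Cut optimization and the hierarchical acceleration are not shown.

```python
import heapq
import math


def color_distance(image, seeds):
    """Euclidean RGB distance from each pixel to the closest seed color.

    image: 2D list of (r, g, b) tuples; seeds: list of (row, col) positions.
    (Hypothetical helper; taking the minimum over seeds is an assumption.)
    """
    seed_colors = [image[r][c] for r, c in seeds]
    return [[min(math.dist(px, s) for s in seed_colors) for px in row]
            for row in image]


def geodesic_distance(depth, seeds):
    """Shortest path length from each pixel to any seed over the depth surface.

    Each pixel is treated as the 3D point (row, col, depth). Neighbours are
    4-connected, so one step has length sqrt(1 + depth_difference**2).
    Dijkstra's algorithm is used here as a simple stand-in solver.
    """
    rows, cols = len(depth), len(depth[0])
    dist = [[math.inf] * cols for _ in range(rows)]
    heap = []
    for r, c in seeds:
        dist[r][c] = 0.0
        heap.append((0.0, r, c))
    heapq.heapify(heap)

    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r][c]:
            continue  # stale queue entry, a shorter path was already found
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                step = math.hypot(1.0, depth[nr][nc] - depth[r][c])
                nd = d + step
                if nd < dist[nr][nc]:
                    dist[nr][nc] = nd
                    heapq.heappush(heap, (nd, nr, nc))
    return dist


if __name__ == "__main__":
    # Toy data: a flat region at depth 1 next to a column at depth 5.
    depth = [[1, 1, 5],
             [1, 1, 5],
             [1, 1, 5]]
    image = [[(0, 0, 0), (3, 4, 0)]]
    print(color_distance(image, [(0, 0)]))
    print(geodesic_distance(depth, [(0, 0)])[0])
```

Running each function once with foreground seeds and once with background seeds gives two distance maps per cue. In the toy depth map, crossing the depth jump costs far more than moving within the flat region, which is the property that makes geodesic distance useful as a depth cue. How these distances are turned into Graph Cut energy terms is not specified in the abstract.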

Acknowledgments

This work is supported by the National Science Foundation of China (Nos. 61321491 and 61202320), the Research Project of Excellent State Key Laboratory (No. 61223003), and the National Special Fund (No. 2011ZX05035-004-004HZ).

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing University, Nanjing, 210023, China
Ling Ge, Ran Ju, Tongwei Ren & Gangshan Wu

Corresponding author

Correspondence to Gangshan Wu.

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology, Gwangju, Korea (Republic of)
Yo-Sung Ho
Chinese Academy of Sciences, Institute of Automation, Beijing, China
Jitao Sang
ICU, IVY Lab, KAIST, Daejeon, Korea (Republic of)
Yong Man Ro
KAIST, Daejeon, Korea (Republic of)
Junmo Kim
College of Computer Science, Zhejiang University, Hangzhou, China
Fei Wu

About this paper

Cite this paper

Ge, L., Ju, R., Ren, T., Wu, G. (2015). Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing – PCM 2015. PCM 2015. Lecture Notes in Computer Science, vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_12

DOI: https://doi.org/10.1007/978-3-319-24075-6_12
Published: 22 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24074-9
Online ISBN: 978-3-319-24075-6
eBook Packages: Computer Science, Computer Science (R0)
