Skip to main content

3-D Geometry Enhanced Superpixels for RGB-D Data

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8294))

Abstract

This paper introduces a novel 3-D geometry enhanced superpixels for RGB-D data. First, we reconstruct the 3-D geometry of the scene by projecting the depth map into 3-D coordinates. Then, a distance metric for superpixel clustering is constructed using 3-D geometry and color information. Finally, pixels are iteratively clustered into superpixels using the proposed distance metric. The proposed method is able to distinguish objects in similar colors due to the introduced 3-D geometry. The oversegmentation results on RGB-D pairs in the Middlebury datasets demonstrate that our approach shows better performance than other three state-of-the-art superpixel methods. The proposed superpixels are also evaluated in the application of segmentation, and we achieve the best segmentation results compared with three state-of-the-art segmentation methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lai, K., Bo, L., Ren, X., Fox, D.: Sparse distance learning for object recognition combining RGB and depth information. In: ICRA, pp. 4007–4013 (2011)

    Google Scholar 

  2. Herbst, E., Ren, X., Fox, D.: RGB-D object discovery via multi-scene analysis. In: Intelligent Robots and Systems, pp. 4850–4856 (2011)

    Google Scholar 

  3. Ren, X., Bo, L., Fox, D.: RGB-D scene labeling: Features and algorithms. In: CVPR, pp. 2759–2766 (2012)

    Google Scholar 

  4. Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGB-D mapping: Using depth cameras for dense 3D modeling of indoor environments. In: The 12th International Symposium on Experimental Robotics, vol. 20, pp. 22–25 (2010)

    Google Scholar 

  5. Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe, R., Kohli, P., Shotton, J., Hodges, S., Freeman, D., Davison, A., et al.: Kinectfusion: real-time 3D reconstruction and interaction using a moving depth camera. In: Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, pp. 559–568 (2011)

    Google Scholar 

  6. Koppula, H.S., Anand, A., Joachims, T., Saxena, A.: Semantic labeling of 3D point clouds for indoor scenes. In: Proceedings of the Advances in Neural Information Processing Systems (2011)

    Google Scholar 

  7. Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 746–760. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  8. Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)

    Article  Google Scholar 

  9. Moore, A.P., Prince, S., Warrell, J., Mohammed, U., Jones, G.: Superpixel lattices. In: CVPR, pp. 1–8 (2008)

    Google Scholar 

  10. Vedaldi, A., Soatto, S.: Quick shift and kernel methods for mode seeking. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol. 5305, pp. 705–718. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  11. Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Susstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence (2012)

    Google Scholar 

  12. Middlebury stereo vision page, http://vision.middlebury.edu/stereo/data/

  13. Levinshtein, A., Stere, A., Kutulakos, K.N., Fleet, D.J., Dickinson, S.J., Siddiqi, K.: Turbopixels: Fast superpixels using geometric flows. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(12), 2290–2297 (2009)

    Article  Google Scholar 

  14. Gong, X., Liu, J.: Rock detection via superpixel graph cuts. In: 19th ICIP, pp. 2149–2152 (2012)

    Google Scholar 

  15. Li, Z., Wu, X.-M., Chang, S.-F.: Segmentation using superpixels: A bipartite graph partitioning approach. In: CVPR, pp. 789–796 (2012)

    Google Scholar 

  16. Fulkerson, B., Vedaldi, A., Soatto, S.: Class segmentation and object localization with superpixel neighborhoods. In: ICCV, pp. 670–677 (2009)

    Google Scholar 

  17. Xu, Y., Liu, J., Cheng, J., Yin, F., Tan, N.M., Wong, D.W.K., Cheng, C.Y., Tham, Y.C., Wong, T.Y.: Efficient optic cup localization based on superpixel classification for glaucoma diagnosis in digital fundus images. In: ICPR, pp. 49–52 (2012)

    Google Scholar 

  18. Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient graph-based image segmentation. IJCV 59(2), 167–181 (2004)

    Article  Google Scholar 

  19. Cour, T., Benezit, F., Shi, J.: Spectral segmentation with multiscale graph decomposition. In: CVPR, vol. 2, pp. 1124–1131 (2005)

    Google Scholar 

  20. Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(5), 898–916 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, J., Gan, Z., Gui, X., Li, K., Hou, C. (2013). 3-D Geometry Enhanced Superpixels for RGB-D Data. In: Huet, B., Ngo, CW., Tang, J., Zhou, ZH., Hauptmann, A.G., Yan, S. (eds) Advances in Multimedia Information Processing – PCM 2013. PCM 2013. Lecture Notes in Computer Science, vol 8294. Springer, Cham. https://doi.org/10.1007/978-3-319-03731-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03731-8_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03730-1

  • Online ISBN: 978-3-319-03731-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics