skip to main content
10.1145/2670291.2670294acmotherconferencesArticle/Chapter ViewAbstractPublication Pagessiggraph-asiaConference Proceedingsconference-collections
research-article

Annotating RGBD images of indoor scenes

Published: 24 November 2014 Publication History

Abstract

Annotating RGBD images with high quality semantic annotations plays a crucial key to the advanced scene understanding and image manipulation. While the popularity of affordable RGBD sensors has eased the process to acquire RGBD images, annotating them, automatically or manually, is still a challenging task. State-of-the-art annotation tools focus only on 2D operations and provide at most image segmentation and object labels even with the presence of depth data. In this work, we present an interactive system to exploit both color and depth cues and facilitate annotating RGBD images with image and scene level segmentation, object labels and 3D geometry and structures. With our system, the users only have to provide few scribbles to identify object instances and specify the label and support relationships of objects, while the system performs those tedious tasks of segmenting image and estimating the 3D cuboids. We test the system on a subset of benchmark RGBD dataset and demonstrate that our system provides a convenient way to generate a baseline dataset with rich semantic annotations.

References

[1]
Felzenszwalb, P. F., and Huttenlocher, D. P. 2004. Efficient graph-based image segmentation. IJCV.
[2]
Fisher, M., Ritchie, D., Savva, M., Funkhouser, T., and Hanrahan, P. 2012. Example-based synthesis of 3D object arrangements. ACM Trans. Graph. (Proc. SIGGRAPH Asia) 31, 6, 135:1--135:11.
[3]
Guo, R., and Hoiem, D. 2013. Support surface prediction in indoor scenes. IEEE ICCV.
[4]
Gupta, A., Efros, A. A., and Hebert, M. 2010. Blocks world revisited: Image understanding using qualitative geometry and mechanics. In ECCV.
[5]
Hedau, V., Hoiem, D., and Forsyth, D. 2010. Thinking inside the box: Using appearance models and context based on room geometry. In ECCV.
[6]
Jia, Z., Gallagher, A., Saxena, A., and Chen, T. 2013. 3D-based reasoning with blocks, support, and stability. IEEE CVPR.
[7]
Jiang, H., and Xiao, J. 2013. A linear approach to matching cuboids in rgbd images. In IEEE CVPR.
[8]
Karsch, K., Hedau, V., Forsyth, D., and Hoiem, D. 2011. Rendering synthetic objects into legacy photographs. ACM Trans. Graph. (Proc. SIGGRAPH Asia) 30, 6, 157:1--157:12.
[9]
Kim, Y. M., Mitra, N. J., Yan, D.-M., and Guibas, L. 2012. Acquiring 3d indoor environments with variability and repetition. ACM Trans. Graph. (Proc. SIGGRAPH Asia) 31, 6, 138:1--138:11.
[10]
Koppula, H. S., Anand, A., Joachims, T., and Saxena, A. 2011. Semantic labeling of 3d point clouds for indoor scenes. In NIPS, 244--252.
[11]
Lin, D., Fidler, S., and Urtasun, R. 2013. Holistic scene understanding for 3D object detection with rgbd cameras. IEEE ICCV.
[12]
Nan, L., Xie, K., and Sharf, A. 2012. A search-classify approach for cluttered indoor scene understanding. ACM Trans. Graph. (Proc. SIGGRAPH Asia) 31, 6, 137:1--137:10.
[13]
Ren, X., Bo, L., and Fox, D. 2012. RGB-(D) scene labeling: Features and algorithms. In IEEE CVPR.
[14]
Russell, B. C., Torralba, A., Murphy, K. P., and Freeman, W. T. 2008. LabelMe: A database and web-based tool for image annotation. IJCV.
[15]
Shao, T., Xu, W., Zhou, K., Wang, J., Li, D., and Guo, B. 2012. An interactive approach to semantic modeling of indoor scenes with an rgbd camera. ACM Trans. Graph. (Proc. SIGGRAPH Asia) 31, 6, 136:1--136:11.
[16]
Silberman, N., Hoiem, D., Kohli, P., and Fergus, R. 2012. Indoor segmentation and support inference from RGBD images. In ECCV.
[17]
Xiao, J., Owens, A., and Torralba, A. 2013. SUN3D: A database of big spaces reconstructed using sfm and object labels. In IEEE ICCV.
[18]
Zhao, Y., and Zhu, S.-C. 2013. Scene parsing by integrating function, geometry and appearance models. IEEE CVPR.
[19]
Zheng, Y., Chen, X., Cheng, M.-M., Zhou, K., Hu, S.-M., and Mitra, N. J. 2012. Interactive images: Cuboid proxies for smart image manipulation. ACM Trans. Graph. (Proc. SIGGRAPH) 31, 4, 99:1--99:11.

Cited By

View all
  • (2019)Object-Level Segmentation of Indoor Point Clouds by the Convexity of Adjacent Object RegionsIEEE Access10.1109/ACCESS.2019.29570347(171934-171949)Online publication date: 2019

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SA '14: SIGGRAPH Asia 2014 Indoor Scene Understanding Where Graphics Meets Vision
November 2014
35 pages
ISBN:9781450332422
DOI:10.1145/2670291
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 November 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. RGBD
  2. labeling
  3. segmentation
  4. structure

Qualifiers

  • Research-article

Conference

SA'14
SA'14: SIGGRAPH Asia 2014
December 3 - 6, 2014
Shenzhen, China

Acceptance Rates

Overall Acceptance Rate 178 of 869 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Object-Level Segmentation of Indoor Point Clouds by the Convexity of Adjacent Object RegionsIEEE Access10.1109/ACCESS.2019.29570347(171934-171949)Online publication date: 2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media