abstract

SemanticPaint: interactive segmentation and learning of 3D worlds

Authors:
Stuart Golodetz

University of Oxford, UK

University of Oxford, UK
View Profile

,
Michael Sapienza

University of Oxford, UK

University of Oxford, UK
View Profile

,
Julien P. C. Valentin

University of Oxford, UK

University of Oxford, UK
View Profile

,
Vibhav Vineet

Stanford University

Stanford University
View Profile

,
Ming-Ming Cheng

Nankai University

Nankai University
View Profile

,
Victor A. Prisacariu

University of Oxford, UK

University of Oxford, UK
View Profile

,
Olaf Kähler

University of Oxford, UK

University of Oxford, UK
View Profile

,
Carl Yuheng Ren

University of Oxford, UK

University of Oxford, UK
View Profile

,
Anurag Arnab

University of Oxford, UK

University of Oxford, UK
View Profile

,
Stephen L. Hicks

University of Oxford, UK

University of Oxford, UK
View Profile

,
David W. Murray

University of Oxford, UK

University of Oxford, UK
View Profile

,
Shahram Izadi

Microsoft Research

Microsoft Research
View Profile

,
Philip H. S. Torr

University of Oxford, UK

University of Oxford, UK
View Profile

SIGGRAPH '15: ACM SIGGRAPH 2015 Emerging TechnologiesJuly 2015Article No.: 22Pages 1https://doi.org/10.1145/2782782.2792488

Published:31 July 2015Publication History

SIGGRAPH '15: ACM SIGGRAPH 2015 Emerging Technologies

Pages 1

ABSTRACT

We present a real-time, interactive system for the geometric reconstruction, object-class segmentation and learning of 3D scenes [Valentin et al. 2015]. Using our system, a user can walk into a room wearing a depth camera and a virtual reality headset, and both densely reconstruct the 3D scene [Newcombe et al. 2011; Nießner et al. 2013; Prisacariu et al. 2014]) and interactively segment the environment into object classes such as 'chair', 'floor' and 'table'. The user interacts physically with the real-world scene, touching objects and using voice commands to assign them appropriate labels. These user-generated labels are leveraged by an online random forest-based machine learning algorithm, which is used to predict labels for previously unseen parts of the scene. The predicted labels, together with those provided directly by the user, are incorporated into a dense 3D conditional random field model, over which we perform mean-field inference to filter out label inconsistencies. The entire pipeline runs in real time, and the user stays 'in the loop' throughout the process, receiving immediate feedback about the progress of the labelling and interacting with the scene as necessary to refine the predicted segmentation.

Supplemental Material

Available for Download

zip

a22-golodetz.zip (20.1 MB)

Supplemental files

References

Newcombe, R. A. et al. 2011. KinectFusion: Real-Time Dense Surface Mapping and Tracking. In ISMAR, IEEE. Google ScholarDigital Library
Niessner, M. et al. 2013. Real-time 3D Reconstruction at Scale using Voxel Hashing. ACM TOG 32, 6, 169. Google ScholarDigital Library
Prisacariu, V. A., Kähler, O. et al. 2014. A Framework for the Volumetric Integration of Depth Images. ArXiv e-prints.Google Scholar
Valentin, J. P. C. et al. 2015. SemanticPaint: Interactive 3D Labeling and Learning at your Fingertips. To appear in ACM TOG.Google Scholar

Index Terms

SemanticPaint: interactive segmentation and learning of 3D worlds
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
        Video segmentation
      2. Image and video acquisition
        3D imaging
  2. Computer graphics
    1. Animation

Recommendations

SemanticPaint: Interactive 3D Labeling and Learning at your Fingertips

We present a new interactive and online approach to 3D scene understanding. Our system, SemanticPaint, allows users to simultaneously scan their environment whilst interactively segmenting the scene simply by reaching out and touching any desired object ...
Read More
SemanticPaint: interactive segmentation and learning of 3D world
SIGGRAPH '15: ACM SIGGRAPH 2015 Talks

We present a real-time, interactive system for the geometric reconstruction, object-class segmentation and learning of 3D scenes [Valentin et al. ]. Using our system, a user can walk into a room wearing a consumer depth camera and a virtual reality ...
Read More
Transductive Multilabel Learning via Label Set Propagation

The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

SIGGRAPH '15: ACM SIGGRAPH 2015 Emerging Technologies
July 2015
27 pages
ISBN:9781450336352
DOI:10.1145/2782782

Copyright © 2015 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 31 July 2015
Check for updates
Qualifiers
- abstract
Conference

Acceptance Rates
Overall Acceptance Rate1,822of8,601submissions,21%
Upcoming Conference
SIGGRAPH '24

Sponsor:

siggraph

Special Interest Group on Computer Graphics and Interactive Techniques Conference

July 27 - August 1, 2024

Denver , CO , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 7
  Total Citations
  View Citations
- 284
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

SemanticPaint: interactive segmentation and learning of 3D worlds

SIGGRAPH '15: ACM SIGGRAPH 2015 Emerging Technologies

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

SemanticPaint: Interactive 3D Labeling and Learning at your Fingertips

SemanticPaint: interactive segmentation and learning of 3D world

Transductive Multilabel Learning via Label Set Propagation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

SemanticPaint: interactive segmentation and learning of 3D worlds

SIGGRAPH '15: ACM SIGGRAPH 2015 Emerging Technologies

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

SemanticPaint: Interactive 3D Labeling and Learning at your Fingertips

SemanticPaint: interactive segmentation and learning of 3D world

Transductive Multilabel Learning via Label Set Propagation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media