research-article

Depth Image Super-Resolution with Semantic and RGB Images Using CNN

Author:
Shuo Qin

Beihang University, Haidian District, Beijing

Beihang University, Haidian District, Beijing
View Profile

ICRAI '18: Proceedings of the 4th International Conference on Robotics and Artificial IntelligenceNovember 2018Pages 1–5https://doi.org/10.1145/3297097.3297098

Published:17 November 2018Publication History

ICRAI '18: Proceedings of the 4th International Conference on Robotics and Artificial Intelligence

Pages 1–5

ABSTRACT

Depth images acquired by consumer depth sensors, such as Kinect and ToF, usually are of low resolution and insufficient quality. This limits the application of these depth sensors. Therefore, depth map enhancement is essential to its application. Most existing depth map super-resolution methods employ an RGB image of the same scene in the depth image as a guidance to up-sample the depth map. However, due to part of edges in RGB image do not occurrence in depth image, such as texture in RGB image, most existing methods will introduce a problem of texture-copy in these areas. To address this problem, we propose an approach that introduce semantic information of RGB image. On the other hand, existing methods rely on various kinds of explicit filter construction or hand-designed objective function. It is thus difficult to understand, improve, and accelerate them in a coherent framework. In this paper we use a learning-based approach to construct a joint filter based on Convolutional Neural Networks. In contrast to existing methods that consider only the RGB guidance image, our method can suppress the texture-copy problem. We validate the effectiveness of the proposed method through extensive comparisons with state-of-the-art methods on NYU v2 dataset. Experiment results show that our method suppress the texture-copy problem.

References

Aodha, O. M., Campbell, N. D. F., Nair, A., and Brostow, G. J. 2012. Patch based synthesis for single depth image super-resolution. In: European Conference on Computer Vision. pp. 71--84. Google ScholarDigital Library
Chan, D., Buisman, H., Theobalt, C., and Thrun, S. 2008. A noise-aware filter for real-time depth up-sampling. In: The Workshop on Multi-Camera and Multi-Modal Sensor Fusion Algorithms and Applications.Google Scholar
Diebel, J. and Thrun, S. 2005. An application of markov random fields to range sensing. Advances in Neural Information Processing Systems. pp. 291--298. Google ScholarDigital Library
Dong, C., Chen, C. L., He, K., and Tang, X. 2016. Image super-resolution using deep convolutional networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 38(2), 295--307. Google ScholarDigital Library
Ferstl, D., Reinbacher, C., Ranftl, R., Ruether, M., and Bischof, H. 2014. Image guided depth up-samjpling using anisotropic total generalized variation. In: IEEE International Conference on Computer Vision. pp. 993--1000. Google ScholarDigital Library
Ferstl, D., Ruther, M., and Bischof, H. 2015. Variational depth super-resolution using example-based edge representations. In: IEEE International Conference on Computer Vision. pp. 513--521. Google ScholarDigital Library
Gregor, K. and Lecun, Y. 2010. Learning fast approximations of sparse coding. In: International Conference on International Conference on Machine Learning. pp. 399--406. Google ScholarDigital Library
Ham, B., Cho, M., and Ponce, J. 2015. Robust image filtering using joint static and dynamic guidance. In: Computer Vision and Pattern Recognition. pp. 4823--4831.Google Scholar
He, K., Sun, J., and Tang, X. 2010. Guided image filtering. In: European Conference on Computer Vision. pp. 1--14.Google Scholar
Hirschmuller, H. and Scharstein, D. 2007. Evaluation of cost functions for stereo matching. In: Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on. pp. 1--8.Google Scholar
Kopf, J., Cohen, M. F., Lischinski, D., Uyttendaele, M. 2007. Joint bilateral up-sampling. In: Acm Siggraph. p. 96. Google ScholarDigital Library
Li, Y., Huang, J. B., Ahuja, N., and Yang, M. H. 2016. Deep Joint Image Filtering. Springer International Publishing.Google Scholar
Liu, M. Y., Tuzel, O., and Taguchi, Y. 2013. Joint geodesic up-sampling of depth images. In: Computer Vision and Pattern Recognition. pp. 169--176. Google ScholarDigital Library
Mandal, S., Bhavsar, A., and Sao, A. K. 2014. Hierarchical example-based range-image super-resolution with edge-preservation. pp. 3867--3871.Google Scholar
Osendorfer, C., Soyer, H., and Smagt, P.V. D. 2014. Image super-resolution with fast approximate convolutional sparse coding. Lecture Notes in Computer Science 8836, 250--257.Google ScholarCross Ref
Park, J., Kim, H., Tai, Y.W., Brown, M. S., and Kweon, I. 2011. High quality depth map up-sampling for 3d-tof cameras. In: IEEE International Conference on Computer Vision. pp. 1623--1630. Google ScholarDigital Library
Scharstein, D. and Pal, C. 2007. Learning conditional random fields for stereo. In: Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on. pp. 1--8.Google Scholar
Silberman, N., Hoiem, D., Kohli, P., and Fergus, R. 2012. Indoor Segmentation and Support Inference from RGBD Images. Springer Berlin Heidelberg.Google Scholar
Tomasi, C. and Manduchi, R. 1998. Bilateral filtering for gray and color images iccv. Proc ICCV p. 839. Google ScholarDigital Library
Vedaldi, A. and Lenc, K. 2014. Matconvnet: Convolutional neural networks for matlab pp. 689--692. Google ScholarDigital Library
Wang, Y., Ortega, A., Tian, D., and Vetro, A. 2014. A graph-based joint bilateral approach for depth enhancement pp. 885--889.Google Scholar
Wang, Z., Liu, D., Yang, J., Han, W., and Huang, T. 2016. Deep networks for image super-resolution with sparse prior. In: IEEE International Conference on Computer Vision. pp. 370--378. Google ScholarDigital Library
Yang, Q., Yang, R., Davis, J., and Nister, D. 2007. Spatial-depth super resolution for range images. In: IEEE Conference on Computer Vision and Pattern Recognition. pp. 1--8.Google Scholar

Index Terms

Depth Image Super-Resolution with Semantic and RGB Images Using CNN
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
      2. Computer vision tasks
        Vision for robotics

Recommendations

Single Depth Map Super-resolution with Local Self-similarity
ICVIP '18: Proceedings of the 2018 2nd International Conference on Video and Image Processing

Consumer depth sensors such as time-of-flight camera or Kinect have gained significant popularity in recently. However, the captured depth maps suffer from limited spatial resolution and a variety of noise, making such depth maps difficult to be ...
Read More
Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution
MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Color-guided depth super-resolution (DSR) is an encouraging paradigm that enhances a low-resolution (LR) depth map guided by an extra high-resolution (HR) RGB image from the same scene. Existing methods usually use interpolation to upscale the depth ...
Read More
Depth map Super-Resolution based on joint dictionary learning

Although Time-of-Flight (ToF) camera can provide real-time depth information from a real scene, the resolution of depth map captured by ToF camera is rather limited compared to HD color cameras, and thus it cannot be directly used in 3D reconstruction. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICRAI '18: Proceedings of the 4th International Conference on Robotics and Artificial Intelligence
November 2018
109 pages
ISBN:9781450365840
DOI:10.1145/3297097

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 November 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Depth Map
Semantic Information
Super-Resolution
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 129
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Depth Image Super-Resolution with Semantic and RGB Images Using CNN

ICRAI '18: Proceedings of the 4th International Conference on Robotics and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

Single Depth Map Super-resolution with Local Self-similarity

Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution

Depth map Super-Resolution based on joint dictionary learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Depth Image Super-Resolution with Semantic and RGB Images Using CNN

ICRAI '18: Proceedings of the 4th International Conference on Robotics and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

Single Depth Map Super-resolution with Local Self-similarity

Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution

Depth map Super-Resolution based on joint dictionary learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media