skip to main content
10.1145/3290420.3290458acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccipConference Proceedingsconference-collections
research-article

Combining depth-estimation-based multi-spectral photometric stereo and SLAM for real-time dense 3D reconstruction

Authors Info & Claims
Published:02 November 2018Publication History

ABSTRACT

Obtaining dense 3D reconstruction with low computational cost is one of the important goals in the field of Simultaneous Localization and Mapping (SLAM). In this paper we propose a dense 3D reconstruction framework from monocular multi-spectral video sequences using jointly semi-dense SLAM and depth-estimation-based Multi-spectral Photometric Stereo approaches. Starting from multi-spectral video, we use SALM to reconstruct a semi-dense 3D shape that will be densified. Then the depth maps estimated via conditional Generative Adversarial Nets (cGAN) are fed as priors into optimization-based multi-spectral photometric stereo for dense surface normal recovery. Finally, we use camera poses for view conversion in fusion procedure where we combine the relative sparse point cloud with the dense surface normal to get a dense point cloud. Experiments show that our method can effectively obtain denser 3D reconstruction.

References

  1. Achtelik, M. and M. Achtelik, et al. (2011). Onboard IMU and monocular vision-based control for MAVs in unknown in- and outdoor environments. IEEE International Conference on Robotics and Automation.Google ScholarGoogle Scholar
  2. Akbarzadeh, A. and J. M. Frahm, et al. (2006). Towards Urban 3D Reconstruction from Video. International Symposium on 3d Data Processing, Visualization, and Transmission. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Davison (2003). Real-time simultaneous localisation and mapping with a single camera. IEEE International Conference on Computer Vision. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Klein, G. and D. Murray (2007). Parallel Tracking and Mapping for Small AR Workspaces. IEEE and ACM International Symposium on Mixed and Augmented Reality. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Newcombe, R. A. and S. J. Lovegrove, et al. (2011). DTAM: Dense tracking and mapping in real-time. Proceedings of the 2011 International Conference on Computer Vision. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Newcombe, R. A. and S. J. Lovegrove, et al. (2011). DTAM: Dense tracking and mapping in real-time. Proceedings of the 2011 International Conference on Computer Vision. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Engel, Jakob, T. Schöps, and D. Cremers. LSD-SLAM: Large-Scale Direct Monocular SLAM. Computer Vision - ECCV 2014. Springer International Publishing, 2014:834--849.Google ScholarGoogle Scholar
  8. Engel, Jakob, V. Koltun, and D. Cremers. "Direct Sparse Odometry." IEEE Transactions on Pattern Analysis & Machine Intelligence PP.99(2017):1--1.Google ScholarGoogle Scholar
  9. Wu, C. and Y. Liu, et al. (2011). "Fusing multiview and photometric stereo for 3D reconstruction under uncalibrated illumination." IEEE Transactions on Visualization & Computer Graphics 17 (8): 1082. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jiao, H. and Y. Luo, et al. (2017). Underwater multi-spectral photometric stereo reconstruction from a single RGBD image. Signal and Information Processing Association Summit and Conference.Google ScholarGoogle Scholar
  11. Kim, Jae Hak, C. Cadena, and I. Reid. "Direct semi-dense SLAM for rolling shutter cameras." IEEE International Conference on Robotics and Automation IEEE, 2016:1308--1315.Google ScholarGoogle Scholar
  12. Liu, F.; Chung, S.; Ng, A.Y. Learning depth from single monocular images. IEEE Trans. Pattern Anal.Mach. Intell. 2005, 18, 1--8.Google ScholarGoogle Scholar
  13. Drew, M. S., and L. L. Kontsevich. "Closed-form attitude determination under spectrally varying illumination." Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94. 1994 IEEE Computer Society Conference on IEEE, 1994:985--990.Google ScholarGoogle Scholar
  14. Petrov, A. P., I. S. Vergelskaya, and L. L. Kontsevich. "Petrov, A. P., I. S. Vergelskaya, and L. L. Kontsevich. "Reconstruction of shape from shading in color images." Journal of the Optical Society of America A 12.11(1994):1047--1052.Google ScholarGoogle Scholar
  15. Woodham, Robert J. "Gradient and Curvature from Photometric Stereo Including Local Condence Estimation." J.opt.soc.am.a 11.11(1994):3050--3068.Google ScholarGoogle ScholarCross RefCross Ref
  16. Chen, Chia Yen, R. Klette, and C. F. Chen. Shape from Photometric Stereo and Contours. Computer Analysis of Images and Patterns. Springer Berlin Heidelberg, 2003:377--384.Google ScholarGoogle Scholar
  17. Sabzevari, R. and A. D. Bue, et al. (2011). Structure from Motion and Photometric Stereo for Dense 3D Shape Recovery. International Conference on Image Analysis and Processing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Phillip Isola, et al. Image-to-Image Translation with Conditional Adversarial Networks. 2016: 5967--5976.Google ScholarGoogle Scholar
  19. Yu, Yang, et al. Unsupervised Representation Learning with Deep Convolutional Neural Network for Remote Sensing Images. International Conference on Image and Graphics Springer 2017; Cham:97--108.Google ScholarGoogle Scholar
  20. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. International Conference on International Conference on Machine Learning 2015; JMLR.org:448--456. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Nehab, Diego, et al. "Efficiently combining positions and normals for precise 3D geometry." Acm Transactions on Graphics 24.3(2005):536--543. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Combining depth-estimation-based multi-spectral photometric stereo and SLAM for real-time dense 3D reconstruction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICCIP '18: Proceedings of the 4th International Conference on Communication and Information Processing
      November 2018
      326 pages
      ISBN:9781450365345
      DOI:10.1145/3290420
      • Conference Chairs:
      • Jalel Ben-Othman,
      • Hui Yu,
      • Program Chairs:
      • Herwig Unger,
      • Masayuki Arai

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 2 November 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate61of301submissions,20%
    • Article Metrics

      • Downloads (Last 12 months)20
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader