skip to main content
10.1145/3458305.3463374acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

Open3DGen: open-source software for reconstructing textured 3D models from RGB-D images

Published:15 July 2021Publication History

ABSTRACT

This paper presents the first entirely open-source and cross-platform software called Open3DGen for reconstructing photorealistic textured 3D models from RGB-D images. The proposed software pipeline consists of nine main stages: 1) RGB-D acquisition; 2) 2D feature extraction; 3) camera pose estimation; 4) point cloud generation; 5) coarse mesh reconstruction; 6) optional loop closure; 7) fine mesh reconstruction; 8) UV unwrapping; and 9) texture projection. This end-to-end scheme combines multiple state-of-the-art techniques and provides an easy-to-use software package for real-time 3D model reconstruction and offline texture mapping. The main innovation lies in various Structure-from-Motion (SfM) techniques that are used with additional depth data to yield high-quality 3D models in real-time and at low cost. The functionality of Open3DGen has been validated on AMD Ryzen 3900X CPU and Nvidia GTX1080 GPU. This proof-of-concept setup attains an average processing speed of 15 fps for 720p (1280x720) RGBD input without the offline backend. Our solution is shown to provide competitive 3D mesh quality and execution performance with the state-of-the-art commercial and academic solutions.

References

  1. Agisoft Metashape. [Online]. Available: https://www.agisoft.com/.Google ScholarGoogle Scholar
  2. AliceVision Meshroom. [Online]. Available: https://alicevision.org#meshroom.Google ScholarGoogle Scholar
  3. S. Ullman, "The interpretation of structure from motion," in Proc. R. Soc. Lond., B203, Jan. 1979.Google ScholarGoogle Scholar
  4. P. Henry, M. Krainin, E. Herbst, X. Ren, and D. Fox, "RGB-D mapping: using Kinect-style depth cameras for dense 3D modeling of indoor environments," in Proc. Int. Symp. on Experimental Robot., Dec. 2010, New Delhi and Agra, India.Google ScholarGoogle Scholar
  5. D. R. dos Santos, M. A. Basso, K. Khoshelham, E. de Oliveira, N. L. Pavan, and G. Vosselman, "Mapping indoor spaces by adaptive coarse-to-fine registration of RGB-D data," IEEE Geosci. Remote Sens. Lett., vol. 13, no. 2, Feb. 2016, pp. 262--266.Google ScholarGoogle ScholarCross RefCross Ref
  6. A. Nüchter, K. Lingemann, J. Hertzberg, and H. Surmann, "6D SLAM---3D mapping outdoor environments," J. Field Robot., vol. 24, no. 8-9, Aug. 2007, pp. 699--722.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Newman, D. Cole, and K. Ho, "Outdoor SLAM using visual appearance and laser ranging," in Proc. IEEE Int. Conf. Robot. Autom., May 2006, Orlando, Florida, USA.Google ScholarGoogle Scholar
  8. D. M. Cole and P. M. Newman, "Using laser range data for 3D SLAM in outdoor environments," in Proc. IEEE Int. Conf. Robot. Autom., May 2006, Orlando, Florida, USA.Google ScholarGoogle Scholar
  9. J. Engel, T. Schöps, and D. Cremers, "LSD-SLAM: large-scale direct monocular SLAM," in Proc. European Conf. Comp. Vision, Sept. 2014, Zürich, Switzerland.Google ScholarGoogle ScholarCross RefCross Ref
  10. R. Mur-Artal, J. M. M. Montiel, and J. D. Tardós, "ORB-SLAM: a versatile and accurate monocular SLAM system," IEEE Trans. on Robotics, vol. 31, no. 5, Oct. 2015, pp. 1147--1163.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Mur-Artal and J. D. Tardós. "ORB-SLAM2: an open-source SLAM system for monocular, stereo and RGB-D cameras," IEEE Trans. on Robotics, vol. 33, no. 5, Oct. 2017, pp. 1255--1262.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Campos, R. Elvira, J. J. Gómez, J. M. M. Montiel, and J. D. Tardós, "ORB-SLAM3: an accurate open-source library for visual, visual-inertial and multi-map SLAM," arXiv preprint arXiv:2007.11898, July 2020.Google ScholarGoogle Scholar
  13. A. Rosinol, M. Abate, Y. Chang, and L. Carlone, "Kimera: an open-source library for real-time metric-semantic localization and mapping," in Proc. IEEE Int. Conf. Robot. Autom., Aug. 2020, Paris, France.Google ScholarGoogle ScholarCross RefCross Ref
  14. A. Dai, M. Niessner, M. Zollöfer, S. Izadi, and C. Theobalt, "BundleFusion: real-time globally consistent 3D reconstruction using on-the-fly surface reintegration," ACM Trans. Graph., vol. 36, no. 3, June 2017, pp. 24:1--24:18.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Nießner, M. Zollhöfer, S. Izadi, and M. Stamminger, "Real-time 3D reconstruction at scale using voxel hashing," ACM Trans. Graph., vol. 32, no. 6, Nov. 2013, pp. 169:1--169:11.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. peel 2 3D scanner. [Online]. available: https://peel-3d.com/products/peel-2?variant=33046758522903.Google ScholarGoogle Scholar
  17. Artec Eva. [Online]. Available: https://www.artec3d.com/portable-3d-scanners/artec-eva-v2.Google ScholarGoogle Scholar
  18. S. Pertuz and D. Puig, "Analysis of focus measure operators for shape-from-focus," Pattern Recognition, vol. 46, no. 5, May 2013, pp. 1415--1432.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. OpenCV: Open Computer Vision Library. [Online]. Available: https://opencv.org/.Google ScholarGoogle Scholar
  20. D. Lowe, "Distinctive image features from scale invariant keypoints," Int. J. Comput. Vis., vol. 60, no. 2, Nov. 2004, pp. 91--110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. P. F. Alcantarilla, J. Nuevo, and A. Bartoli, "Fast explicit diffusion for accelerated features in nonlinear scale spaces," in Proc. British Mach. Vis. Conf., Sept. 2013, Bristol, United Kingdom.Google ScholarGoogle Scholar
  22. E. Rublee, V. Rabaud, K. Konolige, and G. Bradski, "ORB: an efficient alternative to SIFT or SURF," in Proc. IEEE Int. Conf. Comp. Vis., Mar. 2011, Barcelona, Spain.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. O. Chum, T. Pajdla, and P. Sturm, "The geometric error for homographies," Comput. Vis. Image Understanding, vol. 97, no. 1, Jan. 2005, pp. 86--102.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. V. Lepetit, M. Moreno-Noguer, and P. Fua, "EPnP: an accurate O(n) solution to the PnP problem," Int. J. Comput. Vis., vol. 81, no. 2, Feb. 2009, pp. 155--166.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Nister, "An efficient solution to the five-point relative pose problem," IEEE Trans. Pattern Anal. Mach. Intell., vol. 26, no. 6, June 2004, pp. 756--777.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. H. Aghajan and A. Cavallaro, "Multi-Camera Networks: Principles and Applications," Academic Press, Apr. 2009, Orlando, Florida, USA.Google ScholarGoogle Scholar
  27. M. Kazhdan, M. Bolitho, and H. Hoppe, "Poisson surface reconstruction," in Proc. Eurographics Symp. on Geometry Process., June 2006, Cagliari, Sardinia, Italy.Google ScholarGoogle Scholar
  28. J. Wenzel, M. Tarini, D. Panozzo, and O. Sorkine-Hornung, "Instant field-aligned meshes," ACM Trans. Graph., vol. 34, no. 6, Oct. 2015, pp. 189:1--189:15.Google ScholarGoogle Scholar
  29. F. Dellaert, "Factor graphs and GTSAM: A hands-on introduction," Georgia Institute of Technology, Sept. 2012.Google ScholarGoogle Scholar
  30. Q. Zhou, J. Park, and V. Koltun, "Open3D: a modern library for 3D data processing," arxiv.org/abs/1801.09847, Jan. 2018.Google ScholarGoogle Scholar
  31. F. Bernardini, J. Mittleman, H. Rushmeier, C. Silva, and G. Taubin, "The ball-pivoting algorithm for surface reconstruction," IEEE Trans. Vis. Comput. Graph., vol. 5, no. 4, Nov. 1999, pp. 349--359.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Xatlas. [Online]. Available: https://github.com/jpcy/xatlas.Google ScholarGoogle Scholar
  33. EuRoC MAV Dataset. [Online]. Available: https://projects.asl.ethz.ch/datasets/doku.php?id=kmavvisualinertialdatasets.Google ScholarGoogle Scholar

Index Terms

  1. Open3DGen: open-source software for reconstructing textured 3D models from RGB-D images

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MMSys '21: Proceedings of the 12th ACM Multimedia Systems Conference
        June 2021
        254 pages
        ISBN:9781450384346
        DOI:10.1145/3458305

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 15 July 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MMSys '21 Paper Acceptance Rate18of55submissions,33%Overall Acceptance Rate176of530submissions,33%
      • Article Metrics

        • Downloads (Last 12 months)36
        • Downloads (Last 6 weeks)1

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader