research-article

NeuralRoom: Geometry-Constrained Neural Implicit Surfaces for Indoor Scene Reconstruction

Authors:

Chunxia Xiao

ACM Transactions on Graphics (TOG), Volume 41, Issue 6

Article No.: 226, Pages 1 - 15

https://doi.org/10.1145/3550454.3555514

Published: 30 November 2022

Abstract

We present NeuralRoom, a novel neural surface reconstruction method that reconstructs room-sized indoor scenes directly from a set of 2D images. Implicit neural representations have recently become a promising way to reconstruct surfaces from multiview images because of their high-quality results and simplicity. However, they usually cannot reconstruct indoor scenes well because they suffer from severe shape-radiance ambiguity. We assume that an indoor scene consists of texture-rich regions and flat, texture-less regions. In texture-rich regions, multiview stereo can obtain accurate results. In flat regions, normal estimation networks usually produce good normal estimates. Based on these observations, we use reliable geometric priors to reduce the possible spatial variation range of implicit neural surfaces and thereby alleviate shape-radiance ambiguity. Specifically, we use multiview stereo results to limit the NeuralRoom optimization space and then use reliable geometric priors to guide NeuralRoom training. NeuralRoom then produces a neural scene representation that can render images consistent with the input training images. In addition, we propose a smoothing method called perturbation-residual restrictions to improve the accuracy and completeness of flat regions; it assumes that sampling points on a local surface should have the same normal and a similar distance to the observation center. Experiments on the ScanNet dataset show that our method can reconstruct the texture-less areas of indoor scenes while maintaining the accuracy of details. We also apply NeuralRoom to more advanced multiview reconstruction algorithms and significantly improve their reconstruction quality.
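
The abstract describes the assumption behind perturbation-residual restrictions only in words: nearby points on a flat surface should share a normal and lie at a similar distance from the observation center. The NumPy sketch below illustrates that idea on toy analytic signed distance functions. It is not the authors' implementation: the sphere tracing, the L1 residuals, the function names, and the `sigma` perturbation scale are all assumptions made for illustration.

```python
import numpy as np


def plane_sdf(points):
    """Toy scene (assumption): a flat floor at z = 0."""
    return points[:, 2]


def sphere_sdf(points):
    """Toy scene (assumption): a unit sphere centred at the origin."""
    return np.linalg.norm(points, axis=1) - 1.0


def sphere_trace(sdf, origin, directions, steps=64):
    """Distance along each ray from `origin` to the surface, found by sphere tracing."""
    t = np.zeros(len(directions))
    for _ in range(steps):
        t = t + sdf(origin + t[:, None] * directions)
    return t


def sdf_normals(sdf, points, eps=1e-4):
    """Unit surface normals from central differences of the SDF."""
    grads = np.zeros_like(points, dtype=float)
    for axis in range(3):
        offset = np.zeros(3)
        offset[axis] = eps
        grads[:, axis] = (sdf(points + offset) - sdf(points - offset)) / (2 * eps)
    return grads / np.linalg.norm(grads, axis=1, keepdims=True)


def perturbation_residuals(sdf, origin, directions, sigma=0.01, seed=0):
    """Hypothetical smoothness terms: compare each ray with a slightly perturbed copy.

    Returns the mean L1 difference between the two surface normals and the mean
    absolute difference between the two distances to the observation center.
    """
    rng = np.random.default_rng(seed)
    perturbed = directions + sigma * rng.standard_normal(directions.shape)
    perturbed /= np.linalg.norm(perturbed, axis=1, keepdims=True)

    t0 = sphere_trace(sdf, origin, directions)
    t1 = sphere_trace(sdf, origin, perturbed)
    n0 = sdf_normals(sdf, origin + t0[:, None] * directions)
    n1 = sdf_normals(sdf, origin + t1[:, None] * perturbed)

    normal_residual = np.abs(n0 - n1).sum(axis=1).mean()
    distance_residual = np.abs(t0 - t1).mean()
    return normal_residual, distance_residual


if __name__ == "__main__":
    origin = np.array([0.0, 0.0, 2.0])               # observation center
    rays = np.tile([0.0, 0.0, -1.0], (8, 1))         # rays looking straight down
    print("plane :", perturbation_residuals(plane_sdf, origin, rays))
    print("sphere:", perturbation_residuals(sphere_sdf, origin, rays))
```

On the flat floor the normal residual is essentially zero, while on the curved sphere it is clearly positive, so penalizing these residuals in regions believed to be flat would push the surface toward planarity. How NeuralRoom selects those regions and weights the terms is not stated in the abstract.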

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction

Published In

ACM Transactions on Graphics Volume 41, Issue 6

December 2022

1428 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/3550454

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China
