Efficient View Clustering and Selection for City-Scale 3D Reconstruction

Orsingher, Marco; Zani, Paolo; Medici, Paolo; Bertozzi, Massimo

doi:10.1007/978-3-031-06430-2_10

Marco Orsingher^12,13,
Paolo Zani¹³,
Paolo Medici¹³ &
…
Massimo Bertozzi¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13232))

Included in the following conference series:

International Conference on Image Analysis and Processing

2098 Accesses
2 Altmetric

Abstract

Image datasets have been steadily growing in size, harming the feasibility and efficiency of large-scale 3D reconstruction methods. In this paper, a novel approach for scaling Multi-View Stereo (MVS) algorithms up to arbitrarily large collections of images is proposed. Specifically, the problem of reconstructing the 3D model of an entire city is targeted, starting from a set of videos acquired by a moving vehicle equipped with several high-resolution cameras. Initially, the presented method exploits an approximately uniform distribution of poses and geometry and builds a set of overlapping clusters. Then, an Integer Linear Programming (ILP) problem is formulated for each cluster to select an optimal subset of views that guarantees both visibility and matchability. Finally, local point clouds for each cluster are separately computed and merged. Since clustering is independent from pairwise visibility information, the proposed algorithm runs faster than existing literature and allows for a massive parallelization. Extensive testing on urban data are discussed to show the effectiveness and the scalability of this approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Towards an efficient 3D model estimation methodology for aerial and ground images

Article 11 September 2017

Accurate, dense and shading-aware multi-view stereo reconstruction using metaheuritic optimization

Article 23 November 2018

Pixelwise View Selection for Unstructured Multi-View Stereo

References

Agarwal, S., et al.: Building Rome in a day. Commun. ACM 54(10), 105–112 (2011)
Article Google Scholar
Akbarzadeh, A., et al.: Towards urban 3D reconstruction from video. In: Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT 2006), pp. 1–8. IEEE (2006)
Google Scholar
Bron, C., Kerbosch, J.: Algorithm 457: finding all cliques of an undirected graph. Commun. ACM 16(9), 575–577 (1973)
Article Google Scholar
Caesar, H., et al.: nuScenes: a multimodal dataset for autonomous driving. arXiv preprint arXiv:1903.11027 (2019)
Furukawa, Y., Curless, B., Seitz, S.M., Szeliski, R.: Towards internet-scale multi-view stereo. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1434–1441. IEEE (2010)
Google Scholar
Guizilini, V., Ambrus, R., Pillai, S., Raventos, A., Gaidon, A.: 3D packing for self-supervised monocular depth estimation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)
Google Scholar
Knapitsch, A., Park, J., Zhou, Q.Y., Koltun, V.: Tanks and temples: benchmarking large-scale scene reconstruction. ACM Trans. Graph. 36(4), 1–13 (2017)
Article Google Scholar
Ladikos, A., Ilic, S., Navab, N.: Spectral camera clustering. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, pp. 2080–2086. IEEE (2009)
Google Scholar
Mauro, M., Riemenschneider, H., Signoroni, A., Leonardi, R., Van Gool, L.: An integer linear programming model for view selection on overlapping camera clusters. In: 2014 2nd International Conference on 3D Vision, vol. 1, pp. 464–471. IEEE (2014)
Google Scholar
Mauro, M., Riemenschneider, H., Signoroni, A., Leonardi, R., Van Gool, L.: A unified framework for content-aware view selection and planning through view importance. Proc. BMVC 2014, 1–11 (2014)
Google Scholar
Mauro, M., Riemenschneider, H., Van Gool, L., Leonardi, R.: Overlapping camera clustering through dominant sets for scalable 3D reconstruction. Proc. BMVC 2013(2013), 1–11 (2013)
Google Scholar
Perron, L., Furnon, V.: Or-tools. https://developers.google.com/optimization/
Schonberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4104–4113 (2016)
Google Scholar
Seitz, S.M., Curless, B., Diebel, J., Scharstein, D., Szeliski, R.: A comparison and evaluation of multi-view stereo reconstruction algorithms. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 1, pp. 519–528. IEEE (2006)
Google Scholar
Xu, Q., Tao, W.: Planar prior assisted patchmatch multi-view stereo. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12516–12523 (2020)
Google Scholar
Zhang, R., Li, S., Fang, T., Zhu, S., Quan, L.: Joint camera clustering and surface segmentation for large-scale multi-view stereo. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2084–2092 (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Università degli Studi di Parma Dipartimento di Ingegneria e Architettura, Parma, Italy
Marco Orsingher & Massimo Bertozzi
Vislab Srl - Ambarella Inc., Parma, Italy
Marco Orsingher, Paolo Zani & Paolo Medici

Authors

Marco Orsingher
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Zani
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Medici
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Bertozzi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marco Orsingher .

Editor information

Editors and Affiliations

Boston University, Boston, MA, USA
Stan Sclaroff
National Research Council, Lecce, Italy
Cosimo Distante
National Research Council, Lecce, Italy
Marco Leo
University of Catania, Catania, Italy
Giovanni M. Farinella
Technische Universität München, Garching, Germany
Federico Tombari

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Orsingher, M., Zani, P., Medici, P., Bertozzi, M. (2022). Efficient View Clustering and Selection for City-Scale 3D Reconstruction. In: Sclaroff, S., Distante, C., Leo, M., Farinella, G.M., Tombari, F. (eds) Image Analysis and Processing – ICIAP 2022. ICIAP 2022. Lecture Notes in Computer Science, vol 13232. Springer, Cham. https://doi.org/10.1007/978-3-031-06430-2_10

Download citation

DOI: https://doi.org/10.1007/978-3-031-06430-2_10
Published: 17 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06429-6
Online ISBN: 978-3-031-06430-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Efficient View Clustering and Selection for City-Scale 3D Reconstruction