Abstract
In this paper, we propose a novel solution for multi-view reconstruction, relative pose and homography estimation using planar regions. The proposed method doesn‘t require point matches, it directly uses a pair of planar image regions and simultaneously reconstructs the normal and distance of the corresponding 3D planar surface patch, the relative pose of the cameras as well as the aligning homography between the image regions. When more than two cameras are available, then a special region-based bundle adjustment is proposed, which provides robust estimates in a multi-view camera system by constructing and solving a non-linear system of equations. The method is quantitatively evaluated on a large synthetic dataset as well as on the KITTI vision benchmark dataset.
This work was partially supported by the NKFI-6 fund through project K120366; “Integrated program for training new generation of scientists in the fields of computer science”, EFOP-3.6.3-VEKOP-16-2017-0002; the Ministry of Human Capacities, Hungary through grant 20391-3/2018/FEKUSTRAT; the Research & Development Operational Programme for the project “Modernization and Improvement of Technical Infrastructure for Research and Development of J. Selye University in the Fields of Nanotechnology and Intelligent Space”, ITMS 26210120042, co-funded by the European Regional Development Fund. The authors would like to thank Levente Hajder for the Matlab implementation of the factorization method from [20].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Musialski, P., Wonka, P., Aliaga, D.G., Wimmer, M., van Gool, L., Purgathofer, W.: A survey of urban reconstruction. In: EUROGRAPHICS 2012 State of the Art Reports, Eurographics Association, pp. 1–28 (2012)
Micusik, B., Kosecka, J.: Piecewise planar city 3D modeling from street view panoramic sequences. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2009)
Levinson, J., et al.: Towards fully autonomous driving: systems and algorithms. In: Proceedings of Intelligent Vehicles Symposium, IEEE (2011)
Lee, H.S., Kim, K.: Simultaneous traffic sign detection and boundary estimation using convolutional neural network. IEEE Trans. Intell. Transp. Syst. 19, 1652–1663 (2018)
Arcos-García, Á., Álvarez-García, J.A., Soria-Morillo, L.M.: Deep neural network for traffic sign recognition systems: an analysis of spatial transformers and stochastic optimisation methods. Neural Networks 99, 158–165 (2018)
Martinović, A., Mathias, M., Weissenberg, J., Van Gool, L.: A three-layered approach to facade parsing. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7578, pp. 416–429. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33786-4_31
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, ISBN: 0521540518 (2004)
Molnar, J., Huang, R., Kato, Z.: 3D reconstruction of planar surface patches: a direct solution. In: Jawahar, C.V., Shan, S. (eds.) ACCV 2014. LNCS, vol. 9008, pp. 286–300. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16628-5_21
Tanács, A., Majdik, A., Hajder, L., Molnár, J., Sánta, Z., Kato, Z.: Collaborative mobile 3D reconstruction of urban scenes. In: Jawahar, C.V., Shan, S. (eds.) ACCV 2014. LNCS, vol. 9010, pp. 486–501. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16634-6_36
Mikolajczyk, K., et al.: A comparison of affine region detectors. Int. J. Comput. Vision 65, 43–72 (2005)
Furukawa, Y., Ponce, J.: Accurate, dense, and robust multiview stereopsis. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1362–1376 (2010)
Tanács, A., Majdik, A., Molnár, J., Rai, A., Kato, Z.: Establishing correspondences between planar image patches. In: Proceedings of International Conference on Digital Image Computing: Techniques and Applications, Wollongong, Australia, pp. 1–7. IEEE (2014). Best Paper Award
Habbecke, M., Kobbelt, L.: Iterative multi-view plane fitting. In: VMV 2006 (2006)
Habbecke, M., Kobbelt, L.: A surface-growing approach to multi-view stereo reconstruction. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8 (2007)
Sinha, S., Steedly, D., Szeliski, R.: Piecewise planar stereo for image-based rendering. In: IEEE International Conference on Computer Vision, pp. 1881–1888 (2009)
Kowdle, A., Chang, Y.J., Gallagher, A., Chen, T.: Active learning for piecewise planar 3D reconstruction. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011, pp. 929–936. IEEE Computer Society, Washington (2011)
Hiep, V.H., Keriven, R., Labatut, P., Pons, J.P.: Towards high-resolution large-scale multi-view stereo. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1430–1437 (2009)
Fraundorfer, F., Schindler, K., Bischof, H.: Piecewise planar scene reconstruction from sparse correspondences. Image Vision Comput. 24, 395–406 (2006)
Zhou, Z., Jin, H., Ma, Y.: Robust plane-based structure from motion. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1482–1489. IEEE Computer Society, Washington (2012)
Faugeras, O., Lustman, F.: Motion and structure from motion in a piecewise planar environment. Technical Report RR-0856, INRIA, Sophia Antipolis, France (1988)
Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2004)
Frohlich, R., Tamás, L., Kato, Z.: Homography estimation between omnidirectional cameras without point correspondences. In: Buşoniu, L., Tamás, L. (eds.) Handling Uncertainty and Networked Structure in Robot Control. SSDC, vol. 42, pp. 129–151. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26327-4_6
Sturm, P.: Algorithms for plane-based pose estimation. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 706–711(2000)
Mei, C., Benhimane, S., Malis, E., Rives, P.: Efficient homography-based tracking and 3-D reconstruction for single-viewpoint sensors. IEEE Trans. Robot. 24, 1352–1364 (2008)
Caron, G., Marchand, E., Mouaddib, E.M.: Tracking planes in omnidirectional stereovision. In: International Conference on Robotics and Automation, pp. 6306–6311. IEEE (2011)
Makadia, A., Geyer, C., Daniilidis, K.: Correspondence-free structure from motion. Int. J. Comput. Vision 75, 311–327 (2007)
Saurer, O., Fraundorfer, F., Pollefeys, M.: Homography based visual odometry with known vertical direction and weak Manhattan world assumption. In: IEEE/IROS Workshop on Visual Control of Mobile Robots (ViCoMoR) (2012)
Domokos, C., Nemeth, J., Kato, Z.: Nonlinear shape registration without correspondences. IEEE Trans. Pattern Anal. Mach. Intell. 34, 943–958 (2012)
Recky, M., Leberl, F.: Window detection in complex facades. In: Proceedings of European Workshop on Visual Information Processing, pp. 220–225 (2010)
Kneip, L., Li, H., Seo, Y.: UPnP: an optimal O(n) solution to the absolute pose problem with universal applicability. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 127–142. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_9
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the KITTI vision benchmark suite. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2012)
Geiger, A., Ziegler, J., Stiller, C.: StereoScan: dense 3d reconstruction in real-time. In: Proceedings of Intelligent Vehicles Symposium, IEEE (2011)
Schonberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2016)
Schönberger, J.L., Zheng, E., Frahm, J.-M., Pollefeys, M.: Pixelwise view selection for unstructured multi-view stereo. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9907, pp. 501–518. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46487-9_31
Knapitsch, A., Park, J., Zhou, Q.Y., Koltun, V.: Tanks and temples. ACM Trans. Graph. 36, 1–13 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Frohlich, R., Kato, Z. (2019). Simultaneous Multi-view Relative Pose Estimation and 3D Reconstruction from Planar Regions. In: Carneiro, G., You, S. (eds) Computer Vision – ACCV 2018 Workshops. ACCV 2018. Lecture Notes in Computer Science(), vol 11367. Springer, Cham. https://doi.org/10.1007/978-3-030-21074-8_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-21074-8_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21073-1
Online ISBN: 978-3-030-21074-8
eBook Packages: Computer ScienceComputer Science (R0)