Simultaneous Multi-view Relative Pose Estimation and 3D Reconstruction from Planar Regions

  • Conference paper
Computer Vision – ACCV 2018 Workshops (ACCV 2018)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 11367)

Abstract

In this paper, we propose a novel solution for multi-view reconstruction, relative pose and homography estimation using planar regions. The proposed method does not require point matches; it directly uses a pair of planar image regions and simultaneously reconstructs the normal and distance of the corresponding 3D planar surface patch, the relative pose of the cameras, and the aligning homography between the image regions. When more than two cameras are available, a special region-based bundle adjustment is proposed, which provides robust estimates in a multi-view camera system by constructing and solving a non-linear system of equations. The method is quantitatively evaluated on a large synthetic dataset as well as on the KITTI vision benchmark dataset.
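To make the link between the quantities named in the abstract concrete, here is a minimal sketch of the standard plane-induced homography from multiple view geometry. It is not the authors' estimation method, only the textbook relation tying a plane's normal and distance, the relative pose, and the homography together. The function names, the intrinsic matrices `K1` and `K2`, and the conventions used (plane written as n·X = d in the first camera frame, relative pose as X2 = R·X1 + t) are assumptions made for this illustration.

```python
import numpy as np


def plane_induced_homography(K1, K2, R, t, n, d):
    """Homography taking camera-1 pixels to camera-2 pixels for points on a plane.

    Assumed conventions (illustrative, not taken from the paper):
      * the plane satisfies n . X = d in the camera-1 frame (d != 0),
      * a point X1 in camera 1 maps to X2 = R @ X1 + t in camera 2.
    Under these conventions H = K2 (R + t n^T / d) K1^{-1}, up to scale.
    """
    n = np.asarray(n, dtype=float).reshape(3, 1)
    t = np.asarray(t, dtype=float).reshape(3, 1)
    return K2 @ (R + (t @ n.T) / d) @ np.linalg.inv(K1)


def project(K, X):
    """Pinhole projection of a 3D point X (camera frame) to pixel coordinates."""
    x = K @ np.asarray(X, dtype=float)
    return x[:2] / x[2]


def apply_homography(H, x):
    """Apply a 3x3 homography to a 2D pixel and dehomogenize."""
    xh = H @ np.array([x[0], x[1], 1.0])
    return xh[:2] / xh[2]


def demo():
    """Return (pixel predicted through H, pixel from direct projection)."""
    K = np.array([[700.0, 0.0, 320.0],
                  [0.0, 700.0, 240.0],
                  [0.0, 0.0, 1.0]])
    a = np.deg2rad(5.0)  # small rotation about the y axis
    R = np.array([[np.cos(a), 0.0, np.sin(a)],
                  [0.0, 1.0, 0.0],
                  [-np.sin(a), 0.0, np.cos(a)]])
    t = np.array([0.2, 0.0, 0.05])
    n, d = np.array([0.0, 0.0, 1.0]), 5.0  # the plane Z = 5 in camera 1

    X1 = np.array([0.5, -0.3, 5.0])        # a 3D point lying on that plane
    X2 = R @ X1 + t                        # same point in the camera-2 frame

    H = plane_induced_homography(K, K, R, t, n, d)
    return apply_homography(H, project(K, X1)), project(K, X2)
```

Calling `demo()` returns two pixel positions that should agree up to floating-point error, since the chosen point lies on the plane. A point off the plane would generally not be mapped correctly by `H`, which is why a homography aligns planar regions only.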

This work was partially supported by the NKFI-6 fund through project K120366; “Integrated program for training new generation of scientists in the fields of computer science”, EFOP-3.6.3-VEKOP-16-2017-0002; the Ministry of Human Capacities, Hungary through grant 20391-3/2018/FEKUSTRAT; the Research & Development Operational Programme for the project “Modernization and Improvement of Technical Infrastructure for Research and Development of J. Selye University in the Fields of Nanotechnology and Intelligent Space”, ITMS 26210120042, co-funded by the European Regional Development Fund. The authors would like to thank Levente Hajder for the Matlab implementation of the factorization method from [20].

References

  1. Musialski, P., Wonka, P., Aliaga, D.G., Wimmer, M., van Gool, L., Purgathofer, W.: A survey of urban reconstruction. In: EUROGRAPHICS 2012 State of the Art Reports, Eurographics Association, pp. 1–28 (2012)

  2. Micusik, B., Kosecka, J.: Piecewise planar city 3D modeling from street view panoramic sequences. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2009)

  3. Levinson, J., et al.: Towards fully autonomous driving: systems and algorithms. In: Proceedings of Intelligent Vehicles Symposium, IEEE (2011)

  4. Lee, H.S., Kim, K.: Simultaneous traffic sign detection and boundary estimation using convolutional neural network. IEEE Trans. Intell. Transp. Syst. 19, 1652–1663 (2018)

  5. Arcos-García, Á., Álvarez-García, J.A., Soria-Morillo, L.M.: Deep neural network for traffic sign recognition systems: an analysis of spatial transformers and stochastic optimisation methods. Neural Networks 99, 158–165 (2018)

  6. Martinović, A., Mathias, M., Weissenberg, J., Van Gool, L.: A three-layered approach to facade parsing. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7578, pp. 416–429. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33786-4_31

  7. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, ISBN: 0521540518 (2004)

  8. Molnar, J., Huang, R., Kato, Z.: 3D reconstruction of planar surface patches: a direct solution. In: Jawahar, C.V., Shan, S. (eds.) ACCV 2014. LNCS, vol. 9008, pp. 286–300. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16628-5_21

  9. Tanács, A., Majdik, A., Hajder, L., Molnár, J., Sánta, Z., Kato, Z.: Collaborative mobile 3D reconstruction of urban scenes. In: Jawahar, C.V., Shan, S. (eds.) ACCV 2014. LNCS, vol. 9010, pp. 486–501. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16634-6_36

  10. Mikolajczyk, K., et al.: A comparison of affine region detectors. Int. J. Comput. Vision 65, 43–72 (2005)

  11. Furukawa, Y., Ponce, J.: Accurate, dense, and robust multiview stereopsis. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1362–1376 (2010)

  12. Tanács, A., Majdik, A., Molnár, J., Rai, A., Kato, Z.: Establishing correspondences between planar image patches. In: Proceedings of International Conference on Digital Image Computing: Techniques and Applications, Wollongong, Australia, pp. 1–7. IEEE (2014). Best Paper Award

  13. Habbecke, M., Kobbelt, L.: Iterative multi-view plane fitting. In: VMV 2006 (2006)

  14. Habbecke, M., Kobbelt, L.: A surface-growing approach to multi-view stereo reconstruction. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8 (2007)

  15. Sinha, S., Steedly, D., Szeliski, R.: Piecewise planar stereo for image-based rendering. In: IEEE International Conference on Computer Vision, pp. 1881–1888 (2009)

  16. Kowdle, A., Chang, Y.J., Gallagher, A., Chen, T.: Active learning for piecewise planar 3D reconstruction. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011, pp. 929–936. IEEE Computer Society, Washington (2011)

  17. Hiep, V.H., Keriven, R., Labatut, P., Pons, J.P.: Towards high-resolution large-scale multi-view stereo. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1430–1437 (2009)

  18. Fraundorfer, F., Schindler, K., Bischof, H.: Piecewise planar scene reconstruction from sparse correspondences. Image Vision Comput. 24, 395–406 (2006)

  19. Zhou, Z., Jin, H., Ma, Y.: Robust plane-based structure from motion. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1482–1489. IEEE Computer Society, Washington (2012)

  20. Faugeras, O., Lustman, F.: Motion and structure from motion in a piecewise planar environment. Technical Report RR-0856, INRIA, Sophia Antipolis, France (1988)

  21. Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2004)

  22. Frohlich, R., Tamás, L., Kato, Z.: Homography estimation between omnidirectional cameras without point correspondences. In: Buşoniu, L., Tamás, L. (eds.) Handling Uncertainty and Networked Structure in Robot Control. SSDC, vol. 42, pp. 129–151. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26327-4_6

  23. Sturm, P.: Algorithms for plane-based pose estimation. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 706–711 (2000)

  24. Mei, C., Benhimane, S., Malis, E., Rives, P.: Efficient homography-based tracking and 3-D reconstruction for single-viewpoint sensors. IEEE Trans. Robot. 24, 1352–1364 (2008)

  25. Caron, G., Marchand, E., Mouaddib, E.M.: Tracking planes in omnidirectional stereovision. In: International Conference on Robotics and Automation, pp. 6306–6311. IEEE (2011)

  26. Makadia, A., Geyer, C., Daniilidis, K.: Correspondence-free structure from motion. Int. J. Comput. Vision 75, 311–327 (2007)

  27. Saurer, O., Fraundorfer, F., Pollefeys, M.: Homography based visual odometry with known vertical direction and weak Manhattan world assumption. In: IEEE/IROS Workshop on Visual Control of Mobile Robots (ViCoMoR) (2012)

  28. Domokos, C., Nemeth, J., Kato, Z.: Nonlinear shape registration without correspondences. IEEE Trans. Pattern Anal. Mach. Intell. 34, 943–958 (2012)

  29. Recky, M., Leberl, F.: Window detection in complex facades. In: Proceedings of European Workshop on Visual Information Processing, pp. 220–225 (2010)

  30. Kneip, L., Li, H., Seo, Y.: UPnP: an optimal O(n) solution to the absolute pose problem with universal applicability. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 127–142. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_9

  31. Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the KITTI vision benchmark suite. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2012)

  32. Geiger, A., Ziegler, J., Stiller, C.: StereoScan: dense 3d reconstruction in real-time. In: Proceedings of Intelligent Vehicles Symposium, IEEE (2011)

  33. Schonberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: Proceedings of International Conference on Computer Vision and Pattern Recognition, IEEE (2016)

  34. Schönberger, J.L., Zheng, E., Frahm, J.-M., Pollefeys, M.: Pixelwise view selection for unstructured multi-view stereo. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9907, pp. 501–518. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46487-9_31

  35. Knapitsch, A., Park, J., Zhou, Q.Y., Koltun, V.: Tanks and temples. ACM Trans. Graph. 36, 1–13 (2017)

Corresponding author

Correspondence to Zoltan Kato.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Frohlich, R., Kato, Z. (2019). Simultaneous Multi-view Relative Pose Estimation and 3D Reconstruction from Planar Regions. In: Carneiro, G., You, S. (eds) Computer Vision – ACCV 2018 Workshops. ACCV 2018. Lecture Notes in Computer Science, vol 11367. Springer, Cham. https://doi.org/10.1007/978-3-030-21074-8_37

  • DOI: https://doi.org/10.1007/978-3-030-21074-8_37

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-21073-1

  • Online ISBN: 978-3-030-21074-8

  • eBook Packages: Computer Science, Computer Science (R0)
