Image-Based Camera Localization for Large and Outdoor Environments

Teng, Chin-Hung; Chen, Yu-Liang; Zhang, Xuejie

doi:10.1007/978-3-319-54427-4_11

Conference paper
First Online: 16 March 2017

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10117)

Abstract

Estimating camera position and orientation is an important step in many augmented reality (AR) applications. In this paper, we develop a system for estimating camera pose in large and outdoor environments. A large set of images of outdoor environments is collected, and the 3D structure of the scenes is recovered using a structure-from-motion technique. To improve image indexing accuracy and efficiency, a convolutional neural network (CNN) is employed to extract image features, and a set of locality-sensitive hashing (LSH) functions is used to classify the CNN features. Camera localization then proceeds in two steps: the images nearest to the query are first indexed by CNN and LSH, and a set of 2D-3D correspondences is then established from the indexed images and the recovered 3D structure. A perspective-n-point (PnP) algorithm is applied to the 2D-3D correspondences to estimate the camera pose. A series of experiments confirms the effectiveness of the proposed system: the nearest neighbors of a query image can be extracted accurately and efficiently, and the camera pose can be estimated accurately.
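The image-indexing step described in the abstract can be illustrated with a minimal sketch. It assumes the sign-random-projection LSH family, one common scheme for cosine similarity; the abstract does not say which hash family or CNN layer the authors use. All function names, the 4-dimensional toy vectors standing in for CNN features, and the fallback to a full scan on an empty bucket are hypothetical choices made for illustration only.

```python
import math
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random Gaussian hyperplanes in a dim-dimensional space."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def hash_code(vec, planes):
    """One bit per hyperplane: 1 if vec lies on its non-negative side, else 0."""
    return tuple(
        1 if sum(p * x for p, x in zip(plane, vec)) >= 0.0 else 0
        for plane in planes
    )


def build_index(features, planes):
    """Group image ids into buckets keyed by their hash code."""
    index = {}
    for image_id, vec in features.items():
        index.setdefault(hash_code(vec, planes), []).append(image_id)
    return index


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def query(vec, features, index, planes, k=2):
    """Return up to k ids from the query's bucket, ranked by cosine similarity.

    Assumption: if the bucket is empty, fall back to scanning every image.
    """
    candidates = index.get(hash_code(vec, planes)) or list(features)
    ranked = sorted(candidates, key=lambda i: cosine(vec, features[i]), reverse=True)
    return ranked[:k]


# Toy stand-ins for CNN feature vectors (real features have far more dimensions).
toy_features = {
    "img_a": [1.0, 0.0, 0.2, 0.1],
    "img_b": [0.9, 0.1, 0.25, 0.1],
    "img_c": [-1.0, 0.5, -0.3, 0.0],
}
toy_planes = make_hyperplanes(num_bits=8, dim=4, seed=1)
toy_index = build_index(toy_features, toy_planes)
print(query([1.0, 0.05, 0.2, 0.1], toy_features, toy_index, toy_planes))
```

In a full pipeline, the images returned by such a lookup would supply the 2D-3D correspondences passed to a PnP solver. Using more hash bits makes the buckets smaller and the lookup faster, at the cost of missing more true neighbors, so practical systems often query several independent hash tables.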

Acknowledgement

This work was supported in part by the Ministry of Science and Technology, Taiwan, under Grant Nos. MOST 104-2221-E-155-032 and MOST 104-3115-E-155-002.

Author information

Authors and Affiliations

Department of Information Communication and Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Chung-Li, Taiwan
Chin-Hung Teng
School of Information Science and Engineering, Yunnan University, Kunming, China
Yu-Liang Chen & Xuejie Zhang

Corresponding author

Correspondence to Chin-Hung Teng.

Editor information

Editors and Affiliations

Institute of Information Science, Academia Sinica, Taipei, Taiwan
Chu-Song Chen
Tsinghua University, Beijing, China
Jiwen Lu
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore
Kai-Kuang Ma

Cite this paper

Teng, CH., Chen, YL., Zhang, X. (2017). Image-Based Camera Localization for Large and Outdoor Environments. In: Chen, CS., Lu, J., Ma, KK. (eds) Computer Vision – ACCV 2016 Workshops. ACCV 2016. Lecture Notes in Computer Science, vol 10117. Springer, Cham. https://doi.org/10.1007/978-3-319-54427-4_11

DOI: https://doi.org/10.1007/978-3-319-54427-4_11
Published: 16 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54426-7
Online ISBN: 978-3-319-54427-4
eBook Packages: Computer Science, Computer Science (R0)
