Abstract
Image location recognition is a well known process of retrieving the precise location from the contents of the photographs. New photographs are compared to a large geocoded database and the result is both the position and orientation.
This is an inexpensive location system, since it only needs a camera, present in all modern smart phones. It has some great advantages over GPS. It give us the orientation and works under occluded environments, making this system highly attractive to a wide variety of applications.
But at a large scale, this process is easily hindered by heavy weighted database representations, expensive computational operations and visually similar environments. As a consequence, low geocoding rates, inaccurate localization and slow queries are obtained.
In the past years, a variety of solutions have been proposed to address these challenges but we are yet to adopt one of them as the image location recognition solution.
In this paper we review and compare recent state of art advances on image geocoding algorithms focusing on the scalability of such solutions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Robertson, D., Cipolla, R.: An image-based system for urban navigation. In: Proceedings of British Machine Vision Conference (BMVC 2004), vol. 1, pp. 819–828 (2004)
Zhang, W., Kosecka, J.: Image based localization in urban environments. In: Third International Symposium on 3D Data Processing, Visualization, and Transmission, pp. 33–40 (2006)
Schindler, G., Brown, M., Szeliski, R.: City-scale location recognition. In: IEEE conference on computer vision and pattern recognition (CVPR 2007), pp. 1–7 (2007)
Li, Y., Crandall, D.J., Huttenlocher, D.P.: Landmark classification in large-scale image collections. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1957–1964 (2009)
Knopp, J., Sivic, J., Pajdla, T.: Avoiding confusing features in place recognition. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 748–761. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15549-9_54
Bhattacharya, P., Gavrilova, M.: A survey of landmark recognition using the bag-of-words framework. In: Plemenos, D., Miaoulis, G. (eds.) Intelligent Computer Graphics, pp. 243–263. Springer, Berlin Heidelberg (2012)
Nistér, D., Stew, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2161–2168 (2006)
Irschara, A., Zach, C., Frahm, J.-M., Bischof, H.: From structure-from-motion point clouds to fast location recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2599–2606 (2009)
Sattler, T., Leibe, B., Kobbelt, L.: Fast image-based localization using direct 2D-to-3D matching. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 667–674 (2011)
Li, Y., Snavely, N., Huttenlocher, D., Fua, P.: Worldwide pose estimation using 3D point clouds. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 15–29. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33718-5_2
Li, Y., Snavely, N., Huttenlocher, D.P.: Location recognition using prioritized feature matching. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 791–804. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15552-9_57
Sattler, T., Leibe, B., Kobbelt, L.: Improving image-based localization by active correspondence search. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 752–765. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33718-5_54
Choudhary, S., Narayanan, P.J.: Visibility probability structure from SfM datasets and applications. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 130–143. Springer, Heidelberg (2012)
Comaniciu, D., Meer, P., Member, S.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)
Havlena, M., Hartmann, W., Schindler, K.: Optimal reduction of large image databases for location recognition. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 676–683 (2013)
Park, H.S., Wang, Y., Nurvitadhi, E., Hoe, J.C., Sheikh, Y., Chen, M.: 3D point cloud reduction using mixed-integer quadratic programming. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 229–236 (2013)
Cao, S., Snavely, N.: Minimal scene descriptions from structure from motion models. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 461–468 (2014)
Cheng, W., Lin, W., Sun, M.-T.: 3D point cloud simplification for image-based localization. In: IEEE International Conference on Multimedia & Expo Workshops (ICMEW), pp. 1–6 (2015)
Donoser, M., Schmalstieg, D.: Discriminative feature-to-point matching in image-based localization. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 516–523 (2014) doi:10.1109/CVPR.2014.73
Sattler, T., Havlena, M., Radenovic, F., Schindler, K., Pollefeys, M.: Hyperpoints and fine vocabularies for large-scale location recognition. In: The IEEE International Conference on Computer Vision (ICCV), December 2015
Mikulik, A., Perdoch, M., Chum, O., Matas, J.: Learning vocabularies over a fine quantization. Int. J. Comput. Vis. 103(1), 163–175 (2012). doi:10.1007/s11263-012-0600-1
Stewénius, H., Gunderson, S.H., Pilet, J.: Size matters: exhaustive geometric verification for image retrieval accepted for ECCV 2012. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 674–687. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33709-3_48
Havlena, M., Schindler, K.: VocMatch: efficient multiview correspondence for structure from motion. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part III. LNCS, vol. 8691, pp. 46–60. Springer, Heidelberg (2014)
Cao, S., Snavely, N.: Graph-based discriminative learning for location recognition. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 700–707 (2015)
Torii, A., Sivic, J., Pajdla, T., Okutomi, M.: Visual place recognition with repetitive structures. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 883–890 (2013). doi:10.1109/CVPR.2013.119
Arandjelović, R., Zisserman, A.: DisLocation: scalable descriptor distinctiveness for location recognition. In: Cremers, D., Reid, I., Saito, H., Yang, M.-H. (eds.) ACCV 2014. LNCS, vol. 9006, pp. 188–204. Springer, Heidelberg (2015)
Svarm, L., Enqvist, O.: Accurate localization and pose estimation for large 3D models. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 532–539 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Amorim, N., Rocha, J.G. (2016). State of Art Survey On: Large Scale Image Location Recognition. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2016. ICCSA 2016. Lecture Notes in Computer Science(), vol 9790. Springer, Cham. https://doi.org/10.1007/978-3-319-42092-9_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-42092-9_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42091-2
Online ISBN: 978-3-319-42092-9
eBook Packages: Computer ScienceComputer Science (R0)