Abstract
LBS-based applications have become trending on mobile phones. It is useful and necessary to locate users location precisely from a digital image. However, gap exists between the query and the data set in scale, viewpoint, and lighting, or the noise existed in the foreground or background, etc. It is challenging for a location recognition or retrieval system to carry out real-time service. To address this problem, we design a place recognition system and a new building data set with ground truth labels. The algorithm not only significantly improves the efficiency, but also gives satisfied accuracy. The main contributions of our work can be concluded by three points: (1) By adding a fast geometric image matching as a filter procedure before applying Random Sample Consensus (RANSAC), we substantially improve the efficiency of spatial verification and recognition accuracy. (2) We apply a camera orientation algorithm to predict a probable retrieval failure. Salient region detection is applied to remove the confusing features in combination with term frequency–inverse document frequency (tf–idf) recovery. This significantly reduces the influence of noise. (3) We establish a new building data set of Tsinghua University to verify retrieval results. Experiments are conducted on several data sets and all achieve state-of-the-art results.
Similar content being viewed by others
References
Arya, S., Mount, D., Netanyahu, N., Silverman, R., Wu, A.: An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM 45(6), 891–923 (1998)
Bowen, F., Du, E., Hu, J.: A novel graph-based invariant region descriptor for image matching. In: IEEE International Conference on Electro/Information Technology (EIT 2012), pp. 1–6. IEEE Press, New York (2012)
Calonder, M., Lepetit, V., Strecha, C., Fua, P., Brief: Binary robust independent elementary features. In: Computer Vision (ECCV 2010), pp. 778–792 (2010)
Cheng, M., Zhang, G., Mitra, N., Huang, X., Hu, S.: Global contrast based salient region detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011), pp. 409–416. IEEE Press, New York (2011)
Chum, O., Matas, J., Obdrzalek, S.: Enhancing RANSAC by generalized model optimization. In: Proc. of the ACCV, vol. 2, pp. 812–817 (2004)
Chum, O., Mikulik, A., Perdoch, M., Matas, J.: Total recall. ii. Query expansion revisited. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011), pp. 889–896. IEEE, New York (2011)
Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automatic query expansion with a generative feature model for object retrieval. In: IEEE International Conference on Computer Vision (ICCV 2007), pp. 1–8. IEEE Press, New York (2007).
Duda, R., Hart, P.: Use of the Hough transformation to detect lines and curves in pictures. Commun. ACM 15(1), 11–15 (1972)
Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Gong, M., Sun, L., Yang, S., Yang, Y.: Robust place recognition by avoiding confusing features and fast geometric re-ranking. Computational Visual Media, 210–217 (2012)
Hartley, R.: In defense of the eight-point algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 19(6), 580–593 (1997)
Huang, T., Tian, Y., Li, J., Yu, H.: Salient region detection and segmentation for general object recognition and image understanding. Sci. China Ser. F 54(12), 2461–2470 (2011)
Jacobs, N., Roman, N., Pless, R.: Toward fully automatic geo-location and geo-orientation of static outdoor cameras. In: IEEE Workshop on Applications of Computer Vision (WACV 2008), pp. 1–6. IEEE Press, New York (2008)
Jégou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Computer Vision (ECCV 2008), pp. 304–317 (2008)
Jégou, H., Douze, M., Schmid, C.: On the burstiness of visual elements. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009), pp. 1169–1176. IEEE Press, New York (2009)
Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)
Knopp, J., Sivic, J., Pajdla, T.: Avoiding confusing features in place recognition. In: Computer Vision (ECCV 2010), pp. 748–761 (2010)
Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. Int. J. Comput. Vis. 60(1), 63–86 (2004)
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168. IEEE, New York (2006)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2007), pp. 1–8. IEEE Press, New York (2007)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2008), pp. 1–8. IEEE Press, New York (2008)
Quack, T., Leibe, B., Van Gool, L.: World-scale mining of objects and events from community photo collections. In: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, pp. 47–56. ACM Press, New York (2008)
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision, vol. 2, pp. 1470–1477 (2003)
Snavely, N., Seitz, S., Szeliski, R.: Modeling the world from Internet photo collections. Int. J. Comput. Vis. 80(2), 189–210 (2008)
Tsai, S., Chen, D., Takacs, G., Chandrasekhar, V., Vedantham, R., Grzeszczuk, R., Girod, B.: Fast geometric re-ranking for image-based retrieval. In: 17th International Conference on Image Processing (ICIP 2010), pp. 1029–1032. IEEE Press, New York (2010)
Wang, Z., Sun, L., Yang, S.: Efficient relative camera orientation detection for mobile applications. In: Proceedings of the 1st International Workshop on Mobile Location-Based Service, pp. 53–62. ACM Press, New York (2011)
Zisserman, A., Arandjelovic, R.: Three things everyone should know to improve object retrieval. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2911–2918. IEEE Press, New York (2012)
Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Comput. Surv. 38(2), 6 (2006)
Acknowledgements
This work was supported by 973 Program under Grant No. 2011CB302206, NSFC under Grant Nos. 61272231/60833009, and National Significant Science and Technology Projects of China under Grant No. 2011ZX01042-001-002.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gong, M., Sun, L., Yang, S. et al. Find where you are: a new try in place recognition. Vis Comput 29, 1211–1220 (2013). https://doi.org/10.1007/s00371-013-0784-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-013-0784-6