Geographical Retagging

  • Conference paper
Advances in Multimedia Modeling

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7733)

Abstract

While geographical tags have brought novel insight into multimedia content analysis and understanding, how to improve tagging accuracy has rarely been explored. In this paper, we present a novel geographical retagging algorithm that corrects inaccurate geographical tags through automatic, photo-content-based association and refinement. We do not resort to time-consuming camera pose estimation and scene geometry recovery schemes such as structure-from-motion. Instead, our algorithm is built on a very simple neighbor statistical significance test: geographically nearby images, if near-duplicate, should follow a smoother affine transform than those farther away. This assumption is robust to noisy photo content caused by multiple factors, such as indoor/outdoor changes, occlusions, or viewing angle changes. It is also very fast compared with alternative approaches such as structure-from-motion or simultaneous localization and mapping. We demonstrate the accuracy, efficiency, and robustness of the proposed retagging algorithm in refining the geographical tags of Flickr images.
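
The abstract does not spell out the significance test, so the following is only a minimal sketch of the underlying intuition, not the authors' algorithm. It assumes that matched keypoint coordinates between a query photo and its near-duplicate neighbors are already available, fits a least-squares affine transform to each match set, and prefers the candidate location whose neighbors give the smallest mean residual. The function names (`solve3`, `affine_residual`, `retag`) and the residual-based scoring are hypothetical choices made for illustration.

```python
import math


def solve3(m, v):
    """Solve the 3x3 linear system m * x = v with Cramer's rule."""
    def det(a):
        return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
                - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
                + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))

    d = det(m)
    if abs(d) < 1e-12:
        raise ValueError("degenerate point configuration")
    solution = []
    for col in range(3):
        # Replace one column of m with v, as Cramer's rule requires.
        mc = [[v[r] if c == col else m[r][c] for c in range(3)]
              for r in range(3)]
        solution.append(det(mc) / d)
    return solution


def affine_residual(src, dst):
    """Fit dst ~ A * src + t by least squares; return the RMS residual."""
    rows = [(x, y, 1.0) for x, y in src]
    # Normal equations: (R^T R) p = R^T b, shared by both output coordinates.
    rtr = [[sum(r[i] * r[j] for r in rows) for j in range(3)]
           for i in range(3)]
    params = []
    for k in range(2):  # k = 0 fits x', k = 1 fits y'
        rtb = [sum(r[i] * d[k] for r, d in zip(rows, dst)) for i in range(3)]
        params.append(solve3(rtr, rtb))
    squared_error = 0.0
    for r, d in zip(rows, dst):
        for k in range(2):
            predicted = sum(p * c for p, c in zip(params[k], r))
            squared_error += (predicted - d[k]) ** 2
    return math.sqrt(squared_error / len(rows))


def retag(candidates):
    """Pick the candidate location whose neighbors fit an affine map best.

    candidates maps a (lat, lon) tuple to a list of (src_pts, dst_pts)
    keypoint match sets, one per near-duplicate neighbor at that location.
    """
    def mean_residual(match_sets):
        return sum(affine_residual(s, d) for s, d in match_sets) / len(match_sets)

    return min(candidates, key=lambda loc: mean_residual(candidates[loc]))
```

In this sketch a low residual stands in for a "smooth" affine transform; a fuller treatment would need a robust estimator and an actual statistical test over the neighbor set, neither of which is specified in the abstract.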

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cao, L., Gao, Y., Liu, Q., Ji, R. (2013). Geographical Retagging. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_5

  • DOI: https://doi.org/10.1007/978-3-642-35728-2_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35727-5

  • Online ISBN: 978-3-642-35728-2

  • eBook Packages: Computer Science, Computer Science (R0)
