Geographical Retagging

Cao, Liujuan; Gao, Yue; Liu, Qiong; Ji, Rongrong

doi:10.1007/978-3-642-35728-2_5

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7733)

Abstract

While geographical tags have brought new insight to multimedia content analysis and understanding, improving their accuracy has rarely been explored. In this paper, we present a novel geographical retagging algorithm that corrects inaccurate geographical tags through automatic association and refinement based on photo content. We do not resort to time-consuming camera pose estimation and scene geometry recovery schemes such as structure-from-motion. Instead, our algorithm rests on a very simple neighbor statistical significance test: geographically nearby images, if they are near duplicates, should be related by a smoother affine transform than images taken farther away. This assumption is robust to noisy photo content caused by factors such as indoor/outdoor changes, occlusions, and viewing-angle changes. The algorithm is also very fast compared with alternative approaches such as structure-from-motion or simultaneous localization and mapping. We demonstrate the accuracy, efficiency, and robustness of the proposed retagging algorithm in refining the geographical tags of Flickr images.
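
The neighbor test described in the abstract can be illustrated with a small sketch. This is not the authors' implementation, which is not given here. It assumes that keypoint matches between near-duplicate photos are already available, that "smoothness" can be approximated by the RMS residual of a least-squares affine fit, and that a tag is replaced by an inverse-residual weighted average of well-fitting neighbors' tags. The function names (`affine_residual`, `retag`), the 1 km radius, and the residual threshold are all hypothetical choices made for illustration.

```python
import math

def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(a))

def _det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def affine_residual(src, dst):
    """RMS error of the least-squares affine map taking src points to dst points."""
    # Normal equations (A^T A) w = A^T b with rows (x, y, 1), solved by Cramer's rule.
    rows = [(x, y, 1.0) for x, y in src]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    det = _det3(ata)
    if abs(det) < 1e-12:
        raise ValueError("need at least 3 non-collinear matches")
    coeffs = []
    for k in range(2):  # k = 0 fits x', k = 1 fits y'
        atb = [sum(r[i] * d[k] for r, d in zip(rows, dst)) for i in range(3)]
        w = []
        for col in range(3):
            m = [row[:] for row in ata]
            for i in range(3):
                m[i][col] = atb[i]
            w.append(_det3(m) / det)
        coeffs.append(w)
    sq = 0.0
    for r, d in zip(rows, dst):
        for k in range(2):
            pred = sum(c * v for c, v in zip(coeffs[k], r))
            sq += (pred - d[k]) ** 2
    return math.sqrt(sq / len(rows))

def retag(tag, neighbors, radius_km=1.0, max_residual=2.0):
    """Keep `tag` if nearby near-duplicates fit better than far ones, else move it.

    neighbors: list of ((lat, lon), residual) for near-duplicate candidates.
    The radius and threshold are illustrative assumptions, not values from the paper.
    """
    near = [r for g, r in neighbors if haversine_km(tag, g) <= radius_km]
    far = [r for g, r in neighbors if haversine_km(tag, g) > radius_km]
    mean = lambda xs: sum(xs) / len(xs)
    if near and (not far or mean(near) <= mean(far)):
        return tag  # the tag agrees with the smoothness assumption
    good = [(g, 1.0 / (r + 1e-6)) for g, r in neighbors if r <= max_residual]
    if not good:
        return tag  # no reliable visual evidence, so leave the tag alone
    total = sum(w for _, w in good)
    # Plain lat/lon averaging; adequate only when the good neighbors are close together.
    return (sum(g[0] * w for g, w in good) / total,
            sum(g[1] * w for g, w in good) / total)
```

In this sketch, a photo tagged in one city whose best-fitting near duplicates all carry tags from another location would be moved to the weighted centroid of those tags, while a photo whose nearby near duplicates already fit well keeps its original tag.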

Author information

Authors and Affiliations

Harbin Engineering University, Harbin, 150001, P.R. China
Liujuan Cao
National University of Singapore, 117417, Singapore
Yue Gao
Huazhong University of Science and Technology, Wuhan, 430074, China
Qiong Liu
Columbia University, NY State, 10027, United States
Rongrong Ji

Editor information

Editors and Affiliations

Microsoft Research Asia, 5 Danling Street, 100080, Beijing, China
Shipeng Li & Tao Mei
School of Electrical Engineering and Computer Science, University of Ottawa, 800 King Edward, K1N 6N5, Ottawa, ON, Canada
Abdulmotaleb El Saddik
School of Computer and Information, Hefei University of Technology, Road Tunxi 193#, 230009, Hefei, Anhui, China
Meng Wang & Richang Hong
Department of Information Engineering and Computer Science, University of Trento, Via Sommarive 14, 38100, Trento, Italy
Nicu Sebe
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, 117583, Singapore, Singapore
Shuicheng Yan
School of Computing, CLARITY: Centre for Sensor Web Technologies, Dublin City University, Glasnevin, 9, Dublin, Ireland
Cathal Gurrin

About this paper

Cite this paper

Cao, L., Gao, Y., Liu, Q., Ji, R. (2013). Geographical Retagging. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_5

DOI: https://doi.org/10.1007/978-3-642-35728-2_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35727-5
Online ISBN: 978-3-642-35728-2
eBook Packages: Computer Science, Computer Science (R0)
