Geotagged Image Recognition by Combining Three Different Kinds of Geolocation Features

Yaegashi, Keita; Yanai, Keiji

doi:10.1007/978-3-642-19309-5_28

Keita Yaegashi¹⁹ &
Keiji Yanai¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6493))

Included in the following conference series:

Asian Conference on Computer Vision

3803 Accesses
1 Citations

Abstract

Scenes and objects represented in photos have causal relationship to the places where they are taken. In this paper, we propose using geo-information such as aerial photos and location-related texts as features for geotagged image recognition and fusing them with Multiple Kernel Learning (MKL). By the experiments, we have verified the possibility for reflecting location contexts in image recognition by evaluating not only recognition rates, but feature fusion weights estimated by MKL. As a result, the mean average precision (MAP) for 28 categories increased up to 80.87% by the proposed method, compared with 77.71% by the baseline. Especially, for the categories related to location-dependent concepts, MAP was improved by 6.57 points.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Luo, J., Yu, J., Joshi, D., Hao, W.: Event recognition: Viewing the world with a third eye. In: Proc. of ACM International Conference Multimedia (2008)
Google Scholar
Joshi, D., Luo, J.: Inferring generic activities and events from image content and bags of geo-tags. In: Proc. of ACM International Conference on Image and Video Retrieval (2008)
Google Scholar
Yaegashi, K., Yanai, K.: Can geotags help image recognition? In: Proc. of Pacific-Rim Symposium on Image and Video Technology (2009)
Google Scholar
Yaegashi, K., Yanai, K.: Geotagged photo recognition using corresponding aerial photos with multiple kernel learning. In: Proc. of IAPR International Conference on Pattern Recognition (2010)
Google Scholar
Qi, G.J., Hua, X.S., Rui, Y., Tang, J., Mei, T., Zhang, H.: Correlative multi-label video annotation. In: Proc. of ACM International Conference Multimedia, pp. 17–26 (2007)
Google Scholar
Hays, J., Efros, A.A.: IM2GPS: Estimating geographic information from a single image. In: Proc. of IEEE Computer Vision and Pattern Recognition (2008)
Google Scholar
Kalogerakis, E., Vesselova, O., Hays, J., Efros, A., Hertzmann, A.: Image sequence geolocation with human travel priors. In: Proc. of IEEE International Conference on Computer Vision (2010)
Google Scholar
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Proc. of ECCV Workshop on Statistical Learning in Computer Vision, pp. 59–74 (2004)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
Article Google Scholar
Varma, M., Ray, D.: Learning the discriminative power-invariance trade-off. In: Proc. of IEEE International Conference on Computer Vision, pp. 1150–1157 (2007)
Google Scholar
Gehler, P., Nowozin, S.: On feature combination for multiclass object classification. In: Proc. of IEEE International Conference on Computer Vision (2009)
Google Scholar
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision 73, 213–238 (2007)
Article Google Scholar
Sonnenburg, S., Rätsch, G., Schäfer, C., Schölkopf, B.: Large Scale Multiple Kernel Learning. The Journal of Machine Learning Research 7, 1531–1565 (2006)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, The University of Electro-Communications, 51–5–1 Chofugaoka, Chofu-shi, Tokyo, 182–8585, Japan
Keita Yaegashi & Keiji Yanai

Authors

Keita Yaegashi
View author publications
You can also search for this author in PubMed Google Scholar
Keiji Yanai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Technion, Israel Institute of Technology, 32000, Haifa, Israel
Ron Kimmel
The University of Auckland, 37 Kohimarama Road, Mission Bay, 1071, Auckland, New Zealand
Reinhard Klette
National Institute of Informatics, 1018430, Chiyoda, Tokyo, Japan
Akihiro Sugimoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yaegashi, K., Yanai, K. (2011). Geotagged Image Recognition by Combining Three Different Kinds of Geolocation Features. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19309-5_28

Download citation

DOI: https://doi.org/10.1007/978-3-642-19309-5_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19308-8
Online ISBN: 978-3-642-19309-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics