Collaborative Multimodal Location Estimation of Consumer Media

Ekambaram, Venkatesan; Ramchandran, Kannan; Choi, Jaeyoung; Friedland, Gerald

doi:10.1007/978-3-319-09861-6_7

Collaborative Multimodal Location Estimation of Consumer Media

Venkatesan Ekambaram³,
Kannan Ramchandran³,
Jaeyoung Choi⁴ &
…
Gerald Friedland⁴

Chapter
First Online: 01 January 2014

909 Accesses

Abstract

With the emergence of Web 2.0 and with GPS devices becoming ubiquitous and pervasive in our daily life, location-based services are rapidly gaining traction in the online world. The main driving force behind these services is the enabling of a very personalized experience. Social-media websites such as Flickr, YouTube, Twitter, etc., allow queries for results originating at a certain location. Likewise, the belief is that retro-fitting archives with location information will be attractive to many businesses, and will enable newer applications. The task of estimating the geo-coordinates of a media-recording goes by different names such as “geo-tagging”, “location estimation” or “placing”. Geo-tagging multimedia content has various applications. For example, geo-location services can be provided for media captured in environments without GPS, such as photos taken indoors on mobile phones. Vacation videos and photos can be better organized and presented to the user if they have geo-location information. With the explosive growth of available multimedia content on the Internet (200 million photos are uploaded to Facebook daily), there is a dire need for efficient organization and retrieval of multimedia content, which can be enabled by geo-tagging. Geo-location information further helps develop a better semantic understanding of multimedia content. These are some of the main motivations of the MediaEval Placing task [1, 2].

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Mediaeval web site, http://www.multimediaeval.org
M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, Gareth J.F. Jones, Automatic Tagging and Geo-Tagging in Video Collections and Communities, in ACM International Conference on Multimedia Retrieval (ICMR 2011), April 2011, p. to appear
Google Scholar
G. Friedland, O. Vinyals, T. Darrell, Multimodal Location Estimation, in Proceedings of ACM Multimedia, 2010, pp. 1245–1251
Google Scholar
G. Schindler, M. Brown, R. Szeliski, City-scale location recognition, in IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1–7
Google Scholar
W. Zhang, J. Kosecka, Image based localization in urban environments, in 3D Data Processing, Visualization, and Transmission, 3rd Intl. Symposium on, 2006, pp. 33–40
Google Scholar
J. Hays, A.A. Efros, IM2GPS: estimating geographic information from a single image, in IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8
Google Scholar
J. Luo, D. Joshi, J. Yu, A. Gallagher, Geotagging in multimedia and computer vision-a survey. Multimed. Tools Appl. 51, 187–211 (2011)
Article Google Scholar
T. Rattenbury, M. Naaman, Methods for extracting place semantics from Flickr tags. ACM Trans. Web (TWEB) 3(1), 1–30 (2009)
Article Google Scholar
P. Serdyukov, V. Murdock, R. van Zwol, Placing Flickr photos on a map, in ACM SIGIR, 2009, pp. 484–491
Google Scholar
O. Van Laere, S. Schockaert, B. Dhoedt, Ghent university at the 2011 placing task, in Proceedings of MediaEval, 2011
Google Scholar
L. Cao, J. Yu, J. Luo, T. Huang, Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression, in Proceedings of the 17th ACM International Conference on Multimedia, New York, NY, USA, 2009, MM ’09, pp. 125–134, ACM
Google Scholar
A. Gallagher, D. Joshi, J. Yu, J. Luo, Geo-location inference from image content and user tags, in Proceedings of IEEE CVPR. 2009, IEEE
Google Scholar
David J. Crandall, Lars Backstrom, Daniel Huttenlocher, Jon Kleinberg, Mapping the world’s photos, in Proceedings of WWW ’09, New York, NY, USA, 2009, pp. 761–770, ACM
Google Scholar
P. Kelm, S. Schmiedeke, T. Sikora, A hierarchical, multi-modal approach for placing videos on the map using millions of flickr photographs, in Proceedings of SBNMA ’11, New York, NY, USA, 2011, pp. 15–20, ACM
Google Scholar
G. Friedland, J. Choi, H. Lei, A. Janin, Multimodal Location Estimation on Flickr Videos, in Proceedings of the 2011 ACM Workshop on Social Media, Scottsdale, Arizona, USA, 2011, pp. 23–28, ACM
Google Scholar
J. Choi, H. Lei, G. Friedland, The 2011 ICSI Video Location Estimation System, in Proceedings of MediaEval, 2011
Google Scholar
M.J. Wainwright, M.I. Jordan, Graphical models, exponential families, and variational inference. Found. Trends Mach. Learn. 1, 1–305 (2008)
Article MATH Google Scholar
Le Song, Arthur Gretton, Danny Bickson, Yucheng Low, Carlos Guestrin, Kernel belief propagation, in International Conference on Artificial Intelligence and Statistics, 2011, pp. 707–715
Google Scholar
N. Vlassis, A. Likas, A greedy em algorithm for gaussian mixture learning. Neural Process. Lett 15(1), 77–87 (2002)
Article MATH Google Scholar
A. Rae, V. Murdock, P. Serdyukov, Working Notes for the Placing Task at MediaEval 2011, in MediaEval 2011 Workshop (Pisa, Italy, September 2011)
Google Scholar

Download references

Author information

Authors and Affiliations

University of California, Berkeley, CA, USA
Venkatesan Ekambaram & Kannan Ramchandran
International Computer Science Institute, Berkeley, CA, USA
Jaeyoung Choi & Gerald Friedland

Authors

Venkatesan Ekambaram
View author publications
You can also search for this author in PubMed Google Scholar
Kannan Ramchandran
View author publications
You can also search for this author in PubMed Google Scholar
Jaeyoung Choi
View author publications
You can also search for this author in PubMed Google Scholar
Gerald Friedland
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Venkatesan Ekambaram .

Editor information

Editors and Affiliations

International Computer Science Institute, Berkeley, California, USA
Jaeyoung Choi
International Computer Science Institute, Berkeley, California, USA
Gerald Friedland

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ekambaram, V., Ramchandran, K., Choi, J., Friedland, G. (2015). Collaborative Multimodal Location Estimation of Consumer Media. In: Choi, J., Friedland, G. (eds) Multimodal Location Estimation of Videos and Images. Springer, Cham. https://doi.org/10.1007/978-3-319-09861-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-09861-6_7
Published: 05 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09860-9
Online ISBN: 978-3-319-09861-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics