skip to main content
10.1145/2509230.2509238acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

A novel fusion method for integrating multiple modalities and knowledge for multimodal location estimation

Published: 21 October 2013 Publication History

Abstract

This article describes a novel fusion approach using multiple modalities and knowledge sources that improves the accuracy of multimodal location estimation algorithms. The problem of "multimodal location estimation" or "placing" involves associating geo-locations with consumer-produced nmultimedia data like videos or photos that have not been tagged using GPS. Our algorithm effectively integrates data from the visual and textual modalities with external geographical knowledge bases by building a hierarchical model that combines data-driven and semantic methods to group visual and textual features together within geographical regions. We evaluate our algorithm on the MediaEval 2010 Placing Task dataset and show that our system significantly outperforms other state-of-the-art approaches, successfully locating about 40% of the videos to within a radius of 100 m.

References

[1]
http://translate.google.com.
[2]
http://www.geonames.org.
[3]
http://www.wikipedia.org.
[4]
http://code.google.com/apis/maps/index.html.
[5]
J. Baldridge. The OpenNLP Project. http://www.opennlp.com, 2005.
[6]
J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran. Multimodal location estimation of consumer media: Dealing with sparse training data. In Multimedia and Expo (ICME), 2012 IEEE International Conference on, pages 43--48, July.
[7]
J. Choi, A. Janin, and G. Friedland. The 2010 ICSI Video Location Estimation System. In Working Notes of the MediaEval 2010 Workshop, 2010.
[8]
D. Ferrés and H. Rodríguez. Talp at mediaeval 2010 placing task: Geographical focus detection of flickr mtextual annotations. Proceedings of MediaEval, 2010.
[9]
G. Friedland, O. Vinyals, and T. Darrell. Multimodal Location Estimation. In Proceedings of ACM Multimedia, pages 1245--1251, 2010.
[10]
J. Hays and A. A. Efros. IM2GPS: estimating geographic information from a single image. In IEEE Conference on Computer Vision and Pattern mRecognition, 2008. CVPR 2008, pages 1--8. IEEE, June 2008.
[11]
L. A. U.-L. M. G.-V. José M. Perea-Ortega, Miguel A. García-Cumbreras. In Working Notes Proceedings of the MediaEval 2010 Workshop, Pisa, Italy, 2010.
[12]
P. Kelm, S. Schmiedeke, and T. Sikora. Multi-modal, multi-resource methods for placing flickr videos on the map. In ACM International Conference on Multimedia Retrieval (ICMR), page 8, Apr. 2011.
[13]
M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. Jones. Automatic tagging and geotagging in video collections and communities. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval, ICMR '11, pages 51:1--51:8, New York, NY, USA, 2011. ACM.
[14]
T. Rattenbury and M. Naaman. Methods for extracting place semantics from flickr tags. ACM Trans. Web, 3(1):1:1--1:30, Jan. 2009.
[15]
O. Van Laere, S. Schockaert, and B. Dhoedt. Ghent University at the 2010 Placing Task. In Proceedings of MediaEval, October 2010.

Cited By

View all
  • (2022)Urban Image Geo-Localization Using Open Data on Public SpacesProceedings of the 19th International Conference on Content-based Multimedia Indexing10.1145/3549555.3549589(50-56)Online publication date: 14-Sep-2022
  • (2018)Improved image GPS location estimation by mining salient featuresImage Communication10.1016/j.image.2015.07.00738:C(141-150)Online publication date: 27-Dec-2018
  • (2016)Location Estimation Using Crowdsourced Spatial RelationsACM Transactions on Spatial Algorithms and Systems10.1145/28947452:2(1-23)Online publication date: 21-Jun-2016
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GeoMM '13: Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia
October 2013
42 pages
ISBN:9781450323918
DOI:10.1145/2509230
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. centroid-based fusion
  2. geo-tagging
  3. hierarchical segmentation
  4. multimodal location estimation

Qualifiers

  • Research-article

Conference

MM '13
Sponsor:
MM '13: ACM Multimedia Conference
October 21, 2013
Barcelona, Spain

Acceptance Rates

GeoMM '13 Paper Acceptance Rate 5 of 9 submissions, 56%;
Overall Acceptance Rate 10 of 14 submissions, 71%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Urban Image Geo-Localization Using Open Data on Public SpacesProceedings of the 19th International Conference on Content-based Multimedia Indexing10.1145/3549555.3549589(50-56)Online publication date: 14-Sep-2022
  • (2018)Improved image GPS location estimation by mining salient featuresImage Communication10.1016/j.image.2015.07.00738:C(141-150)Online publication date: 27-Dec-2018
  • (2016)Location Estimation Using Crowdsourced Spatial RelationsACM Transactions on Spatial Algorithms and Systems10.1145/28947452:2(1-23)Online publication date: 21-Jun-2016
  • (2015)Boosting Prediction of Geo-location for Web Images Through Integrating Multiple Knowledge SourcesProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749351(559-562)Online publication date: 22-Jun-2015
  • (2014)The Placing TaskProceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia10.1145/2661118.2661125(27-31)Online publication date: 7-Nov-2014
  • (2014)Social Media-based Profiling of Business LocationsProceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia10.1145/2661118.2661119(1-6)Online publication date: 7-Nov-2014
  • (2014)Georeferencing Flickr Resources Based on Multimodal FeaturesMultimodal Location Estimation of Videos and Images10.1007/978-3-319-09861-6_8(127-152)Online publication date: 5-Oct-2014
  • (2014)Application of Large-Scale Classification Techniques for Simple Location Estimation ExperimentsMultimodal Location Estimation of Videos and Images10.1007/978-3-319-09861-6_6(101-113)Online publication date: 5-Oct-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media