DOI: 10.1145/2424321.2424393
research-article

Multimedia multimodal geocoding

Published: 06 November 2012

ABSTRACT

This work was developed in the context of the Placing Task of the MediaEval 2011 initiative. The objective is to geocode (or geotag) a set of videos, i.e., to automatically assign geographical coordinates to them. This paper presents an architecture for multimodal geocoding that exploits both the visual and the textual descriptions associated with videos. It also describes our implementation of this architecture, which demonstrates its applicability. Experiments show that our multimodal approach improves results compared with relying on a single modality.
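To make the idea of multimodal geocoding concrete, the following is a minimal sketch, not the architecture proposed in the paper. It assumes a simple late-fusion scheme: each test video has a textual and a visual similarity score against every video of known location, the scores are rescaled and averaged with a weight, and the coordinates of the best match are assigned. The function names (`min_max`, `geocode`), the `text_weight` parameter, and the sample data are all hypothetical.

```python
from typing import Dict, Tuple

Coord = Tuple[float, float]  # (latitude, longitude)


def min_max(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1] so that the two modalities are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {vid: 0.0 for vid in scores}
    return {vid: (s - lo) / (hi - lo) for vid, s in scores.items()}


def geocode(
    text_sim: Dict[str, float],
    visual_sim: Dict[str, float],
    known_coords: Dict[str, Coord],
    text_weight: float = 0.5,  # hypothetical fusion weight, not from the paper
) -> Coord:
    """Return the coordinates of the known video with the best fused score."""
    t, v = min_max(text_sim), min_max(visual_sim)
    fused = {
        vid: text_weight * t.get(vid, 0.0) + (1.0 - text_weight) * v.get(vid, 0.0)
        for vid in known_coords
    }
    best = max(fused, key=fused.get)
    return known_coords[best]


# Illustrative (made-up) data: three videos with known locations and their
# similarity to one test video under each modality.
known_coords = {"a": (48.86, 2.35), "b": (-22.91, -47.06), "c": (40.71, -74.01)}
text_sim = {"a": 0.2, "b": 0.9, "c": 0.4}
visual_sim = {"a": 0.7, "b": 0.5, "c": 0.1}

fused_guess = geocode(text_sim, visual_sim, known_coords)
visual_only_guess = geocode(text_sim, visual_sim, known_coords, text_weight=0.0)
```

Setting `text_weight` to 0.0 or 1.0 reduces the sketch to a single-modality baseline, which is the kind of comparison the abstract refers to.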

Published in

SIGSPATIAL '12: Proceedings of the 20th International Conference on Advances in Geographic Information Systems
November 2012
642 pages
ISBN: 9781450316910
DOI: 10.1145/2424321

Copyright © 2012 Authors

Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall acceptance rate: 220 of 1,116 submissions, 20%
