skip to main content
10.1145/2396761.2398504acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

A probabilistic approach to mining geospatial knowledge from social annotations

Published:29 October 2012Publication History

ABSTRACT

User-generated content, such as photos and videos, is often annotated by users with free-text labels, called tags. Increasingly, such content is also georeferenced, i.e., it is associated with geographic coordinates. The implicit relationships between tags and their locations can tell us much about how people conceptualize places and relations between them. However, extracting such knowledge from social annotations presents many challenges, since annotations are often ambiguous, noisy, uncertain and spatially inhomogeneous. We introduce a probabilistic framework for modeling georeferenced annotations and a method for learning model parameters from data. The framework is flexible and general, and can be used in a variety of applications that mine geospatial knowledge from user-generated content. Specifically, we study three problems: extracting place semantics, predicting locations of photos and learning part-of relations between places. We show our method performs well compared to state-of-the-art approaches developed for the first two problems, and offers a novel solution to the problem of learning relations between places.

References

  1. E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: geotagging web content. In SIGIR. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. M. Bishop and S. S. En Ligne. Pattern recognition and machine learning, volume 4. springer New York, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In WWW, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), pages 1--38, 1977.Google ScholarGoogle ScholarCross RefCross Ref
  5. C. Gouvêa, S. Loh, L. F. F. Garcia, E. B. Fonseca, and Wendt. Discovering Location Indicators of Toponyms from News to Improve Gazetteer-Based Geo-Referencing. In Simpósio Brasileiro de Geoinformática-GEOINFO, 2008.Google ScholarGoogle Scholar
  6. J. R. Hershey and P. A. Olsen. Approximating the Kullback Leibler divergence between Gaussian mixture models. In ICASSP, volume 4. Ieee, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  7. J. C. Lagarias, J. A. Reeds, M. H. Wright, and P. E. Wright. Convergence properties of the Nelder-Mead simplex method in low dimensions. Siam journal of optimization, 9:112--147, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. R. Liddle. Information criteria for astrophysical model selection. Monthly Notices of the Royal Astronomical Society: Letters, 377(1):L74--L78, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  9. S. Openshaw. The modifiable areal unit problem. Geo Books Norwich, UK, 1983.Google ScholarGoogle Scholar
  10. A. Plangprasopchok, K. Lerman, and L. Getoor. A probabilistic approach for learning folksonomies from structured data. In WSDM, Nov. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Rattenbury and M. Naaman. Methods for extracting place semantics from Flickr tags. ACM Transactions on the Web (TWEB), 3(1):1, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Sanderson and B. Croft. Deriving concept hierarchies from text. In SIGIR, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. P. Schmitz. Inducing ontology from flickr tags. In WWW Workshop on Collaborative Web Tagging, May 2006.Google ScholarGoogle Scholar
  14. P. Serdyukov, V. Murdock, and R. Van Zwol. Placing flickr photos on a map. In SIGIR, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. W. Sinnott. Virtues of the Haversine. Sky and telescope, 68:158, 1984.Google ScholarGoogle Scholar
  16. Y. Yang and G. I. Webb. Discretization for naive-Bayes learning: managing discretization bias and variance. Machine learning, 74(1):39--74, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. A probabilistic approach to mining geospatial knowledge from social annotations

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
        October 2012
        2840 pages
        ISBN:9781450311564
        DOI:10.1145/2396761

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 29 October 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • short-paper

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader