short-paper

A probabilistic approach to mining geospatial knowledge from social annotations

Authors:
Suradej Intagorn, University of Southern California, Marina del Rey, CA, USA
Kristina Lerman, University of Southern California, Marina del Rey, CA, USA

CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
October 2012
Pages 1717–1721
https://doi.org/10.1145/2396761.2398504

Published: 29 October 2012

ABSTRACT

User-generated content, such as photos and videos, is often annotated by users with free-text labels, called tags. Increasingly, such content is also georeferenced, i.e., it is associated with geographic coordinates. The implicit relationships between tags and their locations can tell us much about how people conceptualize places and relations between them. However, extracting such knowledge from social annotations presents many challenges, since annotations are often ambiguous, noisy, uncertain and spatially inhomogeneous. We introduce a probabilistic framework for modeling georeferenced annotations and a method for learning model parameters from data. The framework is flexible and general, and can be used in a variety of applications that mine geospatial knowledge from user-generated content. Specifically, we study three problems: extracting place semantics, predicting locations of photos and learning part-of relations between places. We show our method performs well compared to state-of-the-art approaches developed for the first two problems, and offers a novel solution to the problem of learning relations between places.
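
The abstract does not state the form of the model, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes, purely for illustration, that each tag's locations follow a single axis-aligned 2D Gaussian over (latitude, longitude), fitted by maximum likelihood, and that a photo's location is predicted by scoring candidate points under the densities of all its tags. The function names, the variance floor `min_var`, and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def fit_tag_gaussians(photos, min_var=1e-6):
    """Fit one axis-aligned 2D Gaussian per tag (illustrative assumption).

    photos: iterable of (lat, lon, tags) tuples.
    Returns {tag: (mean_lat, mean_lon, var_lat, var_lon)}.
    Degrees are treated as planar coordinates, a simplification.
    """
    points = defaultdict(list)
    for lat, lon, tags in photos:
        for tag in tags:
            points[tag].append((lat, lon))

    models = {}
    for tag, pts in points.items():
        n = len(pts)
        mean_lat = sum(p[0] for p in pts) / n
        mean_lon = sum(p[1] for p in pts) / n
        # Maximum-likelihood variances, floored so single-point tags stay usable.
        var_lat = max(sum((p[0] - mean_lat) ** 2 for p in pts) / n, min_var)
        var_lon = max(sum((p[1] - mean_lon) ** 2 for p in pts) / n, min_var)
        models[tag] = (mean_lat, mean_lon, var_lat, var_lon)
    return models


def log_density(model, lat, lon):
    """Log of the axis-aligned Gaussian density at (lat, lon)."""
    mean_lat, mean_lon, var_lat, var_lon = model
    return (
        -0.5 * math.log(2 * math.pi * var_lat)
        - (lat - mean_lat) ** 2 / (2 * var_lat)
        - 0.5 * math.log(2 * math.pi * var_lon)
        - (lon - mean_lon) ** 2 / (2 * var_lon)
    )


def predict_location(models, tags):
    """Pick the tag mean that maximizes the summed log-density of all known tags.

    Returns None when none of the tags has a fitted model.
    """
    known = [t for t in tags if t in models]
    if not known:
        return None
    candidates = [(models[t][0], models[t][1]) for t in known]
    return max(
        candidates,
        key=lambda c: sum(log_density(models[t], c[0], c[1]) for t in known),
    )


# Toy data (hypothetical): two cities sharing a broader regional tag.
photos = [
    (34.05, -118.25, ["losangeles", "california"]),
    (34.10, -118.30, ["losangeles", "california"]),
    (37.77, -122.42, ["sanfrancisco", "california"]),
    (37.80, -122.40, ["sanfrancisco", "california"]),
]
models = fit_tag_gaussians(photos)
prediction = predict_location(models, ["sanfrancisco", "california"])
```

In this toy setting the broader tag ("california") ends up with a much larger variance than the city tags, which hints at how spatial spread could serve as one signal for part-of relations. A realistic model would need to handle the ambiguity, noise and spatial inhomogeneity the abstract mentions, for example with multi-component densities.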


Index Terms
1. Information systems
  1. Information systems applications
2. Mathematics of computing
  1. Probability and statistics
    1. Probabilistic reasoning algorithms


Published in
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
October 2012
2840 pages
ISBN: 9781450311564
DOI: 10.1145/2396761
General Chair: Xuewen Chen (Wayne State University, USA)
Program Chairs: Guy Lebanon (Georgia Institute of Technology), Haixun Wang (Microsoft Research Asia), Mohammed J. Zaki (Rensselaer Polytechnic Institute)
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
data mining
geo-spatial
information extraction
social network
Acceptance Rates
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%