skip to main content
10.1145/2525314.2525353acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

A probablistic model for spatio-temporal signal extraction from social media

Published:05 November 2013Publication History

ABSTRACT

It is nowadays possible to access a huge and increasing stream of social media records. Recently, such data has been used to infer about spatio-temporal phenomena by treating the records as proxy observations of the real world. However, since such observations are heavily uncertain and their spatio-temporal distribution is highly heterogeneous, extracting meaningful signals from such data is a challenging task. In this paper, we present a probabilistic model to extract spatio-temporal distributions of phenomena (called spatio-temporal signals) from social media. Our approach models spatio-temporal and semantic knowledge about real-world phenomena embedded in records on the basis of conditional probability distributions in a Bayesian network. Through this, we realize a generic and comprehensive model where knowledge and uncertainties about spatio-temporal phenomena can be described in a modular and extensible fashion. We show that existing models for the extraction of spatio-temporal phenomena distributions from social media are particular instances of our model. We quantitatively evaluate instances of our model by comparing the spatio-temporal distributions of extracted phenomena from a large Twitter data set to their real-world distributions. The results clearly show that our model allows to extract better spatio-temporal signals in terms of quality and robustness.

References

  1. S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang. World Explorer: Visualizing Aggregate Data from Unstructures Text in Geo-Referenced Collections. In JCDL 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. In Journal of Computational Science 2(1), 2011.Google ScholarGoogle Scholar
  3. A. Gallagher. The Wisdom of Social Multimedia: Using Flickr For Prediction and Forecast. In MM 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Ginsberg, M. H. Mohebbi, R. S. Patel, L. Brammer, M. S. Smolinski, and L. Brilliant. Detecting influenza epidemics using search engine query data. In Nature, 457(7232), 2009.Google ScholarGoogle ScholarCross RefCross Ref
  5. R. C. Gonzalez and R. E. Woods. Digital Image Processing. Pearson, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Hong, A. Ahmed, S. Gurumurthy, A. Smola, and K. Tsioutsiouliklis. Discovering Geographical Topics In The Twitter Stream. In WWW 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Kennedy, M. Naaman, S. Ahern, R. Nair, and T. Rattenbury. How Flickr Helps us Make Sense of the World: Context and Content in Community-Contributed Media Collections. In MM 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Kollar and N. Friedman. Probabilistic graphical models: principles and techniques. The MIT Press, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Q. Mei, C. Liu, H. Su, and C. Zhai. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In WWW 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Quack. World-scale Mining of Objects and Events from Community Photo Collections. In CIVR 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Rattenbury, N. Good, and M. Naaman. Towards Automatic Extraction of Event and Place Semantics from Flickr Tags. In SIGIR 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Sakaki. Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In WWW 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Sankaranarayanan, B. E. Teitler, H. Samet, M. D. Lieberman, and J. Sperling. TwitterStand: News in Tweets. In GIS 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. C. Sengstock and M. Gertz. Exploration and Comparison of Geographic Information Sources using Distance Statistics. In GIS 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Sengstock and M. Gertz. Latent Geographic Feature Extraction from Social Media. In GIS 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Serdyukov, V. Murdock, and R. van Zwol. Placing flickr photos on a map. In SIGIR 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Strötgen and M. Gertz. Multilingual and cross-domain temporal tagging. Language Resources and Evaluation, 47(2), 2012.Google ScholarGoogle Scholar
  18. J.-m. Xu, A. Bhargava, R. Nowak, and X. Zhu. Socioscope: Spatio-Temporal Signal Recovery from Social Media. In ECML-PKDD 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Z. Yin, L. Cao, J. Han, C. Zhai, and T. Huang. Geographical Topic Discovery and Comparison. In WWW 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. H. Zhang, M. Korayem, D. J. Crandall, and G. Lebuhn. Mining Photo-sharing Websites to Study Ecological Phenomena. In WWW 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A probablistic model for spatio-temporal signal extraction from social media

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGSPATIAL'13: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
          November 2013
          598 pages
          ISBN:9781450325219
          DOI:10.1145/2525314

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 5 November 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate220of1,116submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader