ABSTRACT
It is nowadays possible to access a huge and increasing stream of social media records. Recently, such data has been used to infer about spatio-temporal phenomena by treating the records as proxy observations of the real world. However, since such observations are heavily uncertain and their spatio-temporal distribution is highly heterogeneous, extracting meaningful signals from such data is a challenging task. In this paper, we present a probabilistic model to extract spatio-temporal distributions of phenomena (called spatio-temporal signals) from social media. Our approach models spatio-temporal and semantic knowledge about real-world phenomena embedded in records on the basis of conditional probability distributions in a Bayesian network. Through this, we realize a generic and comprehensive model where knowledge and uncertainties about spatio-temporal phenomena can be described in a modular and extensible fashion. We show that existing models for the extraction of spatio-temporal phenomena distributions from social media are particular instances of our model. We quantitatively evaluate instances of our model by comparing the spatio-temporal distributions of extracted phenomena from a large Twitter data set to their real-world distributions. The results clearly show that our model allows to extract better spatio-temporal signals in terms of quality and robustness.
- S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang. World Explorer: Visualizing Aggregate Data from Unstructures Text in Geo-Referenced Collections. In JCDL 2007. Google ScholarDigital Library
- J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. In Journal of Computational Science 2(1), 2011.Google Scholar
- A. Gallagher. The Wisdom of Social Multimedia: Using Flickr For Prediction and Forecast. In MM 2010. Google ScholarDigital Library
- J. Ginsberg, M. H. Mohebbi, R. S. Patel, L. Brammer, M. S. Smolinski, and L. Brilliant. Detecting influenza epidemics using search engine query data. In Nature, 457(7232), 2009.Google ScholarCross Ref
- R. C. Gonzalez and R. E. Woods. Digital Image Processing. Pearson, 2007. Google ScholarDigital Library
- L. Hong, A. Ahmed, S. Gurumurthy, A. Smola, and K. Tsioutsiouliklis. Discovering Geographical Topics In The Twitter Stream. In WWW 2012. Google ScholarDigital Library
- L. Kennedy, M. Naaman, S. Ahern, R. Nair, and T. Rattenbury. How Flickr Helps us Make Sense of the World: Context and Content in Community-Contributed Media Collections. In MM 2007. Google ScholarDigital Library
- D. Kollar and N. Friedman. Probabilistic graphical models: principles and techniques. The MIT Press, 2009. Google ScholarDigital Library
- Q. Mei, C. Liu, H. Su, and C. Zhai. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In WWW 2006. Google ScholarDigital Library
- T. Quack. World-scale Mining of Objects and Events from Community Photo Collections. In CIVR 2008. Google ScholarDigital Library
- T. Rattenbury, N. Good, and M. Naaman. Towards Automatic Extraction of Event and Place Semantics from Flickr Tags. In SIGIR 2007. Google ScholarDigital Library
- T. Sakaki. Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In WWW 2010. Google ScholarDigital Library
- J. Sankaranarayanan, B. E. Teitler, H. Samet, M. D. Lieberman, and J. Sperling. TwitterStand: News in Tweets. In GIS 2009. Google ScholarDigital Library
- C. Sengstock and M. Gertz. Exploration and Comparison of Geographic Information Sources using Distance Statistics. In GIS 2011. Google ScholarDigital Library
- C. Sengstock and M. Gertz. Latent Geographic Feature Extraction from Social Media. In GIS 2012. Google ScholarDigital Library
- P. Serdyukov, V. Murdock, and R. van Zwol. Placing flickr photos on a map. In SIGIR 2009. Google ScholarDigital Library
- J. Strötgen and M. Gertz. Multilingual and cross-domain temporal tagging. Language Resources and Evaluation, 47(2), 2012.Google Scholar
- J.-m. Xu, A. Bhargava, R. Nowak, and X. Zhu. Socioscope: Spatio-Temporal Signal Recovery from Social Media. In ECML-PKDD 2012. Google ScholarDigital Library
- Z. Yin, L. Cao, J. Han, C. Zhai, and T. Huang. Geographical Topic Discovery and Comparison. In WWW 2011. Google ScholarDigital Library
- H. Zhang, M. Korayem, D. J. Crandall, and G. Lebuhn. Mining Photo-sharing Websites to Study Ecological Phenomena. In WWW 2012. Google ScholarDigital Library
Index Terms
- A probablistic model for spatio-temporal signal extraction from social media
Recommendations
Uses and gratifications of social networking sites for bridging and bonding social capital
Applying uses and gratifications theory (UGT) and social capital theory, our study examined users of four social networking sites (SNSs) (Facebook, Twitter, Instagram, and Snapchat), and their influence on online bridging and bonding social capital. ...
Social capital, social media, and TV ratings
Motivated by the increasing role of social media in relating to economic outcomes, this paper examines the relationship between social networking sites SNS and television ratings drawing from the social capital theoretical framework of bonding and ...
College students social media use and communication network heterogeneity
This study examined whether and how the usage of social media can influence college students' level of network heterogeneity and how network heterogeneity is associated with levels of bridging/bonding social capital and subjective well-being. In ...
Comments