Abstract
Detection of events using voluntarily generated content in microblogs has been the objective of numerous recent studies. One essential challenge tackled in these studies is estimating the locations of events. In this paper, we review the state-of-the-art location estimation techniques used in the localization of events detected in microblogs, particularly in Twitter, which is one of the most popular microblogging platforms worldwide. We analyze these techniques with respect to the targeted event type, granularity of estimated locations, location-related features selected as sources of spatial evidence, and the method used to make aggregate decisions based on the extracted evidence. We discuss the strengths and advantages of alternative solutions to various problems related to location estimation, as well as their preconditions and limitations. We examine the most widely used evaluation methods to analyze the accuracy of estimations and present the results reported in the literature. We also discuss our findings and highlight important research challenges that may need further attention.
Similar content being viewed by others
Notes
References
Abdelhaq H, Sengstock C, Gertz M (2013) EvenTweet: online localized event detection from twitter. Proc VLDB Endow 6(12):1326–1329
Abdelhaq H, Gertz M, Armiti A (2016) Efficient online extraction of keywords for localized events in twitter. GeoInformatica. doi:10.1007/s10707-016-0258-x
Achrekar H, Gandhe A, Lazarus R, Yu SH, Liu B (2013) Online social networks flu trend tracker: a novel sensory approach to predict flu trends. In: Gabriel J, Schier J et al (eds) Biomedical engineering systems and technologies, communications in computer and information science, vol 357. Springer, Berlin Heidelberg, pp 353–368
Aggarwal CC (2013) A survey of stream clustering algorithms. In: Data clustering: algorithms and applications. CRC Press, Florida, USA, pp 231–258
Aggarwal CC, Zhai C (2012) A survey of text clustering algorithms. In: Aggarwal CC, Zhai C (eds) Mining text data. Springer, New York, pp 77–128
Ajao O, Hong J, Liu W (2015) A survey of location inference techniques on Twitter. J Inf Sci 41(6):855–864
Allan J (ed) (2002) Topic Detection and Tracking: Event-based Information Organization. Kluwer Academic Publishers
Amitay E, Har’El N, Sivan R, Soffer A (2004) Web-a-where: Geotagging web content. In: Proceedings of the 27th international ACM SIGIR conference on research and development in information retrieval, ACM, pp 273–280
Anantharam P, Barnaghi P, Thirunarayan K, Sheth A (2015) Extracting city traffic events from social streams. ACM Trans Intell Syst Technol 6(4):43:1–43:27
Ao J, Zhang P, Cao Y (2014) Estimating the locations of emergency events from Twitter streams. Proc ITQM 2014:731–739
Arulampalam MS, Maskell S, Gordon N, Clapp T (2002) A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Process 50(2):174–188
Atefeh F, Khreich W (2015) A survey of techniques for event detection in Twitter. Comput Intell 31(1):132–164
Boettcher A, Lee D (2012) EventRadar: a real-time local event detection scheme using Twitter stream. In: Proceedings of the 2012 IEEE international conference on green computing and communications, IEEE Computer Society, GREENCOM ’12, pp 358–367
Calvo T, Kolesárová A, Komorníková M, Mesiar R (2002) Aggregation operators: properties, classes and construction methods. Aggregation operators: new trends and applications. Physica-Verlag GmbH, Heidelberg, pp 3–104
Cheng T, Wicks T (2014) Event detection using Twitter: a spatio-temporal approach. PLoS ONE 9(6):1–10
Cheng Z, Caverlee J, Lee K (2010) You are where you tweet: a content-based approach to geo-locating Twitter users. In: Proceedings of the 19th ACM international conference on information and knowledge management, ACM, pp 759–768
Cordeiro M, Gama J (2016) Online social networks event detection: a survey. In: Michaelis S, Piatkowski N, Stolpe M (eds) Solving large scale learning tasks. Challenges and Algorithms. Springer, Cham, pp 1–41
Crooks A, Croitoru A, Stefanidis A, Radzikowski J (2013) # Earthquake: Twitter as a distributed sensor system. Trans GIS 17(1):124–147
De Longueville B, Smith RS, Luraschi G (2009) OMG, from here, I can see the flames!: a use case of mining location based social networks to acquire spatio-temporal data on forest fires. In: Proceedings of international workshop on location based social networks, ACM, LBSN ’09, pp 73–80
Dempster AP (1967) Upper and lower probabilities induced by a multivalued mapping. Ann Math Stat 38(2):325–339
Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, AAAI Press, pp 226–231
Feng W, Zhang C, Zhang W, Han J, Wang J, Aggarwal C, Huang J (2015) STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream. In: International conference on data engineering (ICDE), pp 1561–1572
Fox D, Hightower J, Liao L, Schulz D, Borriello G (2003) Bayesian filtering for location estimation. Pervasive Computing, IEEE 2(3):24–33
Garg M, Kumar M (2016) Review on event detection techniques in social multimedia. Online Inf Rev 40(3):347–361
Gelernter J, Mushegian N (2011) Geo-parsing messages from microtext. Trans GIS 15(6):753–773
Giridhar P, Abdelzaher T, George J, Kaplan L (2015a) On quality of event localization from social network feeds. In: Pervasive computing and communication workshops (PerCom Workshops), pp 75–80
Giridhar P, Wang S, Abdelzaher T, George J, Kaplan L, Ganti R (2015b) Joint localization of events and sources in social networks. In: Distributed computing in sensor systems (DCOSS), pp 179–188
Goodchild MF (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211–221
Hamed AA, Ayer AA, Clark EM, Irons EA, Taylor GT, Zia A (2015) Measuring climate change on Twitter using Google’s algorithm: perception and events. Int J Web Inf Syst 11(4):527–544
Han J, Kamber M, Tung AKH (2001) Spatial clustering methods in data mining: A survey. In: Miller HJ, Han J (eds) Geographic data mining and knowledge discovery, research monographs in GIS. Taylor & Francis Inc, Bristol
Hecht B, Hong L, Suh B, Chi EH (2011) Tweets from Justin Bieber’s heart: The dynamics of the location field in user profiles. In: Proceedings of CHI conference on human factors in computing systems, ACM, pp 237–246
Heravi BR, Morrison D, Khare P, Marchand-Maillet S (2014) Where is the news breaking? Towards a location-based event detection framework for journalists. In: Proceedings of the 20th international conference on multimedia modeling, Volume 8326, Springer, pp 192–204
Hill LL (2006) Georeferencing: the geographic associations of information. The MIT Press, Cambridge
Hua T, Chen F, Zhao L, Lu CT, Ramakrishnan N (2013) STED: Semi-supervised targeted-interest event detection in Twitter. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, KDD ’13, pp 1466–1469
Huang Y, Liu Z, Nguyen P (2015) Location-based event search in social texts. In: International conference on computing, networking and communications (ICNC), pp 668–672
Imran M, Castillo C, Diaz F, Vieweg S (2015) Processing social media messages in mass emergency: a survey. ACM Comput Surv 47(4):67:1–67:38
Jin P, Lin S, Zhang Q (2014) Spatiotemporal Information for the Web. Encyclopedia of social network analysis and mining. Springer, Berlin, pp 1997–2010
Kalman RE (1960) A new approach to linear filtering and prediction problems. Trans ASME J Basic Eng 82(Series D):35–45
Kleinberg J (2002) Bursty and hierarchical structure in streams. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’02, pp 91–101
Kulldorff M (1999) Spatial scan statistics: models, calculations, and applications. Scan statistics and applications. Birkhäuser Boston, Boston
Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on world wide web, ACM, WWW ’10, pp 591–600
Lee R, Sumiya K (2010) Measuring geographical regularities of crowd behaviors for Twitter-based geo-social event detection. In: Proceedings of the 2nd ACM SIGSPATIAL international workshop on location based social networks, ACM, LBSN ’10, pp 1–10
Leetaru K, Wang S, Cao G, Padmanabhan A, Shook E (2013) Mapping the global Twitter heartbeat: The geography of Twitter. First Monday 18(5). doi:10.5210/fm.v18i5.4366
Li C, Weng J, He Q, Yao Y, Datta A, Sun A, Lee BS (2012a) TwiNER: Named entity recognition in targeted Twitter stream. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, ACM, pp 721–730
Li R, Lei KH, Khadiwala R, Chang KC (2012b) TEDAS: A Twitter-based event detection and analysis system. In: IEEE international conference on data engineering (ICDE), pp 1273–1276
Lingad J, Karimi S, Yin J (2013) Location extraction from disaster-related microblogs. In: Proceedings of WWW companion, pp 1017–1020
Lu Y, Hu X, Wang F, Kumar S, Liu H, Maciejewski R (2015) Visualizing social media sentiment in disaster scenarios. In: Proceedings of the 24th international conference on world wide web, ACM, New York, NY, USA, WWW ’15 Companion, pp 1211–1215
MacEachren A, Jaiswal A, Robinson A, Pezanowski S, Savelyev A, Mitra P, Zhang X, Blanford J (2011) SensePlace2: GeoTwitter analytics support for situational awareness. In: IEEE conference on visual analytics science and technology (VAST), pp 181–190
Marcus A, Bernstein MS, Badar O, Karger DR, Madden S, Miller RC (2011) TwitInfo: Aggregating and visualizing microblogs for event exploration. In: Proceedings of the SIGCHI conference on human factors in computing systems, ACM, CHI’11, pp 227–236
Middleton S, Middleton L, Modafferi S (2014) Real-time crisis mapping of natural disasters using social media. Intell Syst IEEE 29(2):9–17
Musaev A, Wang D, Pu C (2015) LITMUS: a multi-service composition system for landslide detection. IEEE Trans Serv Comput 8(5):715–726
Nagar R, Yuan Q, Freifeld CC, Santillana M, Nojima A, Chunara R, Brownstein SJ (2014) A case study of the New York City 2012–2013 influenza season with daily geocoded Twitter data from temporal and spatiotemporal perspectives. J Med Internet Res 16(10):e236
Neill DB (2012) Fast subset scan for spatial pattern detection. J R Stat Soc: Ser B (Stat Methodol) 74(2):337–360
Nguyen HL, Woon YK, Ng WK (2014) A survey on data stream clustering and classification. Knowl Inf Syst 45(3):535–569
Ozdikis O, Senkul P, Oguztuzun H (2012) Semantic expansion of hashtags for enhanced event detection in Twitter. In: Proceedings of VLDB 2012 workshop on online social systems (WOSS)
Ozdikis O, Oguztuzun H, Karagoz P (2013) Evidential location estimation for events detected in Twitter. In: Proceedings of the 7th workshop on geographic information retrieval, ACM, GIR ’13, pp 9–16
Ozdikis O, Oğuztüzün H, Karagoz P (2016) Evidential estimation of event locations in microblogs using the Dempster–Shafer theory. Inf Process Manag 52(6):1227–1246
Padmanabhan A, Wang S, Cao G, Hwang M, Zhang Z, Gao Y, Soltani K, Liu Y (2014) FluMapper: a cyberGIS application for interactive analysis of massive location-based social media. Concurr Comput: Pract Exp 26(13):2253–2265
Panteras G, Wise S, Lu X, Croitoru A, Crooks A, Stefanidis A (2015) Triangulating social multimedia content for event localization using Flickr and Twitter. Trans GIS 19(5):694–715
Paradesi SM (2011) Geotagging tweets using their content. In: Proceedings of FLAIRS, AAAI Press
Power R, Robinson B, Colton J, Cameron M (2014) Emergency situation awareness: Twitter case studies. In: Hanachi C, Bnaben F, Charoy F (eds) Information systems for crisis response and management in mediterranean countries, lecture notes in business information processing, vol 196, Springer, pp 218–231
Rill S, Reinel D, Scheidt J, Zicari RV (2014) Politwi: early detection of emerging political topics on Twitter and the impact on concept-level sentiment analysis. Knowl-Based Syst 69:24–33
Ritter A, Clark S, Mausam, Etzioni O (2011) Named entity recognition in tweets: an experimental study. In: Proceedings of the conference on empirical methods in natural language processing, association for computational linguistics, pp 1524–1534
Roick O, Heuser S (2013) Location based social networks—definition, current state of the art and research agenda. Trans GIS 17(5):763–784
Sakai T, Tamura K (2014) Identifying bursty areas of emergency topics in geotagged tweets using density-based spatiotemporal clustering algorithm. In: 7th International workshop on computational intelligence and applications (IWCIA), pp 95–100
Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes Twitter users: Real-time event detection by social sensors. In: Proceedings of the 19th international conference on world wide web, ACM, WWW ’10, pp 851–860
Sakaki T, Okazaki M, Matsuo Y (2013) Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans Knowl Data Eng 25(4):919–931
Sankaranarayanan J, Samet H, Teitler BE, Lieberman MD, Sperling J (2009) TwitterStand: News in tweets. In: Proceedings of the 17th ACM SIGSPATIAL international conference on advances in geographic information systems, ACM, pp 42–51
Dos Santos ADP, Wives LK, Alvares LO (2012) Location-based events detection on micro-blogs. CoRR abs/1210.4008
Shafer G (1976) A mathematical theory of evidence. Princeton University Press, Princeton
Silva JA, Faria ER, Barros RC, Hruschka ER, Carvalho ACPLFd, Gama J (2013) Data stream clustering: a survey. ACM Comput Surv 46(1):13:1–13:31
Silverman BW (1986) Density estimation for statistics and data analysis. Chapman & Hall, London
Stefanidis A, Crooks A, Radzikowski J (2013) Harvesting ambient geospatial information from social media feeds. GeoJournal 78(2):319–338
Steiger E, de Albuquerque JP, Zipf A (2015) An advanced systematic literature review on spatiotemporal analyses of Twitter data. Trans GIS 19(6):809–834
Tamura K, Ichimura T (2013) Density-based spatiotemporal clustering algorithm for extracting bursty areas from georeferenced documents. In: IEEE international conference on systems, man, and cybernetics (SMC), pp 2079–2084
Tamura K, Kitakami H (2013) Detecting location-based enumerating bursts in georeferenced micro-posts. In: International conference on advanced applied informatics, pp 389–394
Teitler BE, Lieberman MD, Panozzo D, Sankaranarayanan J, Samet H, Sperling J (2008) NewsStand: A new view on news. In: Proceedings of the 16th ACM SIGSPATIAL international conference on advances in geographic information systems, ACM, pp 1–10
Unankard S, Li X, Sharaf M (2015) Emerging event detection in social networks with location sensitivity. World Wide Web 18(5):1393–1417
Vieweg S, Hughes AL, Starbird K, Palen L (2010) Microblogging during two natural hazards events: What Twitter may contribute to situational awareness. In: Proceedings of the SIGCHI conference on human factors in computing systems, ACM, CHI ’10, pp 1079–1088
Wang D, Amin MTA, Abdelzaher T, Roth D, Voss CR, Kaplan LM, Tratz S, Laoudi J, Briesch D (2014) Provenance-assisted classification in social networks. IEEE J Sel Top Signal Process 8(4):624–637
Wanner F, Stoffel A, Jäckle D, Kwon BC, Weiler A, Keim DA (2014) State-of-the-art report of visual analysis for event detection in text data streams. In: Borgo R, Maciejewski R, Viola I (eds). EuroVis - STARs, The Eurographics Association
Watanabe K, Ochi M, Okabe M, Onai R (2011) Jasmine: A real-time local-event detection system based on geolocation information propagated to microblogs. In: Proceedings of the 20th ACM international conference on information and knowledge management, ACM, CIKM ’11, pp 2541–2544
Welch G, Bishop G (1995) An introduction to the Kalman filter. Technical report, Chapel Hill, NC, USA
Weng J, Lee B (2011) Event detection in Twitter. In: Proceedings of the 5th international conference on weblogs and social media
Wu X, Kumar V, Ross Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Yu PS, Zhou ZH, Steinbach M, Hand DJ, Steinberg D (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37
Yin J, Lampert A, Cameron M, Robinson B, Power R (2012) Using social media to enhance emergency situation awareness. Intell Syst IEEE 27(6):52–59
Yuan Q, Cong G, Ma Z, Sun A, Thalmann NM (2013) Who, Where, When and What: Discover spatio-temporal topics for Twitter users. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, KDD ’13, pp 605–613
Zhang T, Ramakrishnan R, Livny M (1996) BIRCH: An efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD international conference on management of data, ACM, New York, NY, USA, SIGMOD ’96, pp 103–114
Acknowledgements
This work was financially supported by TUBITAK with the Grant Number 112E275 and ICT COST Action IC1203.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ozdikis, O., Oğuztüzün, H. & Karagoz, P. A survey on location estimation techniques for events detected in Twitter. Knowl Inf Syst 52, 291–339 (2017). https://doi.org/10.1007/s10115-016-1007-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-016-1007-z