Skip to main content

Geolocation Detection Approaches for User Discussion Analysis in Twitter

  • Conference paper
  • First Online:
HCI International 2022 - Late Breaking Papers. Interaction in New Media, Learning and Games (HCII 2022)

Abstract

In this research, the authors consider methods for identifying geodata of users of social networks within user discussions. The knowledge of user geolocation data makes it possible to analyze the spread of discussion among users of different countries. Authors do not try to determine the exact geolocation, but rather the country where the users are located. The problem of getting country-level user location data lies in the fact that a high percentage of users do not state their location correctly, either mentioning it in humorous ways or even not stating it at all. There are various methods of obtaining data about the location of users. Among them, there are text-based methods, methods based on the analysis of the context, and methods based on the topology of the user graph. In this paper, we make a special emphasis on a method that allows to reveal geodata of users who specified their geodata incorrectly or did not specify it at all. In order to test our method, we use Twitter datasets.

We propose several approaches to resolve the issues stated above. The paper highlights three approaches: the naïve approach, the naïve approach using natural language processing (NLP), and the graph approach, which is glossary-based and determines the number of outgoing connections. We have introduced two measures in order to evaluate the proposed approaches. Recall-GEO and Precision-GEO that are described throughout the paper. The accuracy of UserGraph method is finally evaluated using the metrics above.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ren, K., Zhang, S., Lin, H.: Where are you settling down: geo-locating twitter users based on tweets and social networks. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 150–161. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35341-3_13

    Chapter  Google Scholar 

  2. Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: Proceedings of the Conference on Computing Linguistics: Technical, Papers, pp. 1045–1062 (2012)

    Google Scholar 

  3. Dai, X., Karimi, S., Hachey, B., Paris, C.: An effective transition-based model for discontinuous NER. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5860–5870 (2020)

    Google Scholar 

  4. Mahmud, J., Nichols, J., Drews, C.: Where is this tweet from? inferring home locations of twitter users. In: Proceedings of the International Conference on Weblogs Social Media, pp. 511–514 (2012)

    Google Scholar 

  5. Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. J. Artif. Intell. Res. 49(1), 451–500 (2014)

    Article  Google Scholar 

  6. Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: Proceedings of the ACM Conference on Information Knowledge Management, pp. 759–768 (2010)

    Google Scholar 

  7. Backstrom, L., Kleinberg, J., Kumar, R., Novak, J.: Spatial variation in search engine queries. In: Proceedings of the Conference on World Wide Web, pp. 357–366 (2008)

    Google Scholar 

  8. Ryoo, K., Moon, S.: Inferring twitter user locations with 10 km accuracy. In: Proceedings of the World Wide Web Conference on Companion Volume, pp. 643–648 (2014)

    Google Scholar 

  9. Li, R., Wang, S., Chang, K.C.-C.: Multiple location profiling for users and relationships from social network and content. Proc. VLDB Endowment 5(11), 1603–1614 (2012). in Proc. Conf. Human Factors Comput. Syst., pp. 237–246, 2011

    Google Scholar 

  10. Rahimi, A., Vu, D., Cohn, T., Baldwin, T.: Exploiting text and network context for geolocation of social media users. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1362–1367 (2015)

    Google Scholar 

  11. Hecht, B., Hong, L., Suh, B., Chi, E.H.: Tweets from justin bieber’s heart: the dynamics of the location field in user profiles. In: CHI ‘11: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 237–246 (2011)

    Google Scholar 

  12. Mahmud, J., Nichols, J., Drews, C.: Home location identification of twitter users. ACM Trans. Intell. Syst. Technol. 5 (3), 47:1–47:21 (2014)

    Google Scholar 

  13. Wing, B.P., Baldridge, J.: Simple supervised document geolocation with geodesic grids. In: Proceedings of the Annual Meeting Association Computing Linguistics: Human Language Technology, pp. 955–964 (2011)

    Google Scholar 

  14. Wing, B., Baldridge, J.: Hierarchical discriminative classification for text-based geolocation. In: Proceedings of the Conference on Empirical Methods Natural Language Process, pp. 336–348 (2014)

    Google Scholar 

  15. Roller, S., Speriosu, M., Rallapalli, S., Wing, B., Baldridge, J.: Supervised text-based geolocation using language models on an adaptive grid. In: Proceedings of the Joint Conference on Empirical Methods Natural Language Processing Computing Natural Language Learning, pp. 1500–1510 (2012)

    Google Scholar 

  16. Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: A simple scalable neural networks based model for geolocation prediction in twitter. In: Proceedings of the Workshop Noisy User-Generated Text, pp. 235–239 (2016)

    Google Scholar 

  17. Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: Unifying text, metadata, and user network representations with a neural network for geolocation prediction. In: Proceedings of the Annual Meeting Association Computing Linguistics, pp. 1260–1272 (2017)

    Google Scholar 

  18. Rahimi, A., Cohn, T., Baldwin, T.: A neural model for user geolocation and lexical dialectology. In: Proceedings of the Annual Meeting Association Computing Linguistics, Volume 2: Short Papers, pp. 209–216 (2017)

    Google Scholar 

  19. Davis Jr., C.A., Pappa, G.L., de Oliveira, D.R.R., de L Arcanjo, F.: Inferring the location of twitter messages based on user relationships. Trans. GIS, 15(6), 735–751 (2011)

    Google Scholar 

  20. Kong, L., Liu, Z., Huang, Y.: SPOT: locating social media users based on social network context. Proc. VLDB Endowment 7(13), 1681–1684 (2014)

    Article  Google Scholar 

  21. McGee, J., Caverlee, J.A., Cheng, Z.: A geographic study of tie strength in social media. In: Proceedings of the ACM Conference on Information Knowledge Management, pp. 2333–2336, 2011

    Google Scholar 

  22. McGee, J., Caverlee, J., Cheng, Z.: Location prediction in social media based on tie strength. In: Proceedings of the ACM Conference on Information Knowledge Management, pp. 459–468 (2013)

    Google Scholar 

  23. Compton, R., Jurgens, D., Allen, D.: Geotagging one hundred million twitter accounts with total variation minimization. In: Proceedings of the IEEE International Conference on Big Data, pp. 393–401 (2014)

    Google Scholar 

  24. Jurgens, D.: That’s what friends are for: Inferring location in online social media platforms based on social relationships. In: Proceedings of the International Conference on Weblogs Social Media, pp. 273–282 (2013)

    Google Scholar 

  25. Efstathiades, H., Antoniades, D., Pallis, G., Dikaiakos, M.D.: Identification of key locations based on online social network activity. In: Proceedings of the IEEE/ACM Conference on Advanced Social Network Analysis Mining, pp. 218–225 (2015)

    Google Scholar 

  26. Blekanov, I., Bodrunova, S.S., Akhmetov, A.: Detection of hidden communities in twitter discussions of varying volumes. Future Internet. 13(11), 295–311 (2021)

    Article  Google Scholar 

  27. Bodrunova, S.S., Litvinenko, A.A., Blekanov, I.S.: Please follow us: media roles in twitter discussions in the United States, Germany, France, and Russia. Journal. Pract. Routledge 12(2), 177–203 (2018). https://doi.org/10.1080/17512786.2017.1394208

  28. Bodrunova, S.S., Blekanov, I.S.: Power laws in ad hoc conflictual discussions on twitter. In: Alexandrov, D.A., Boukhanovsky, A.V., Chugunov, A.V., Kabanov, Y., Koltsova, O. (eds.) DTGS 2018. CCIS, vol. 859, pp. 67–82. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-02846-6_6

    Chapter  Google Scholar 

  29. Bodrunova, S.S., Blekanov, I., Smoliarova, A., Litvinenko, A.: Beyond left and right: real-world political polarization in twitter discussions on inter-ethnic conflicts. Media Commun. 7(3), 119–132 (2019). https://doi.org/10.17645/mac.v7i3.1934

  30. Poulston, A., Stevenson, M., Bontcheva, K.: Hyperlocal home location identification of twitter profiles. In: Proceedings of the 28th ACM Conference on Hypertext and Social Media (HT 2017), pp. 45–54. ACM. https://doi.org/10.1145/3078714.3078719, 2017

Download references

Acknowledgements

This research has been supported in full by Russian Science Foundation, project 21–18-00454 (2021–2023).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ivan Blekanov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Blekanov, I., Maksimov, A., Nepiyushchikh, D., Bodrunova, S.S. (2022). Geolocation Detection Approaches for User Discussion Analysis in Twitter. In: Meiselwitz, G., et al. HCI International 2022 - Late Breaking Papers. Interaction in New Media, Learning and Games. HCII 2022. Lecture Notes in Computer Science, vol 13517. Springer, Cham. https://doi.org/10.1007/978-3-031-22131-6_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-22131-6_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-22130-9

  • Online ISBN: 978-3-031-22131-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics