Abstract
Ubiquitous Internet connectivity enables users to update their Online Social Network profile from any location and at any point in time. These data, often geo-tagged, can be used to provide valuable information to nearby users, both in real time and in aggregated form. However, although users publish geo-tagged information, only a small number explicitly report their base location in their Online Social Network profile. In this paper, we present a simple yet effective methodology for identifying a user’s Key locations, namely her Home and Work places. We evaluate our methodology with Twitter datasets collected from the Netherlands, the city of London, and Los Angeles County. Furthermore, we combine Twitter and LinkedIn information to construct a Work location dataset and evaluate our methodology on it. Results show that our proposed methodology not only outperforms state-of-the-art methods by at least 30% in terms of accuracy, but also reduces the detection radius to at most half that of other methods. To illustrate the applicability of our methodology and motivate further research in location-based social network analysis, we provide an initial evaluation of three such approaches, namely (1) Twitter user mobility patterns, (2) Ego network formulation, and (3) Key location tweet sentiment analysis.
Notes
https://dev.twitter.com/rest/public (Last accessed: June 2016).
Geo-tagged Microblog Corpus: http://www.ark.cs.cmu.edu/GeoText/ (Last accessed: June 2016).
Similar behavior has also been observed by Falcone et al. (2014).
Cho et al. (2011) used a 25 Km square boundary.
https://www.census.gov/prod/1/gen/95statab/app3.pdf (Last accessed: June 2016).
Organization for Economic Co-operation and Development, http://www.oecd.org/social/soc/47346594.pdf (Last accessed: June 2016).
London DataStore, http://data.london.gov.uk/ (Last accessed: June 2016).
Hedonometer, http://hedonometer.org/index.html (Last accessed: June 2015).
http://text-processing.com/demo/sentiment/ (Last accessed: June 2016).
References
Adali S, Golbeck J (2014) Predicting personality with social behavior: a comparative study. Soc Netw Anal Min 4(1):1–20
Aldrich HE, Kim PH (2007) Small worlds, infinite possibilities? How social networks affect entrepreneurial team formation and search. Strateg Entrep J 1(1–2):147–165
Backstrom L, Huttenlocher D, Kleinberg J, Lan X (2006) Group formation in large social networks: membership, growth, and evolution. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’06. ACM, New York, pp 44–54
Bird S (2006) NLTK: the natural language toolkit. In: Proceedings of the COLING/ACL on interactive presentation sessions, COLING-ACL ’06, Association for Computational Linguistics, Stroudsburg, pp 69–72
Bo H, Cook P, Baldwin T (2012) Geolocation prediction in social media data by finding location indicative words. In: Proceedings of COLING 2012: technical papers, pp 1045–1062
Borgatti SP, Mehra A, Brass DJ, Labianca G (2009) Network analysis in the social sciences. Science 323(5916):892–895
Brown C, Noulas A, Mascolo C, Blondel V (2013) A place-focused model for social networks in cities. In: 2013 international conference on social computing (SocialCom), pp 75–80
Catanzaro M, Caldarelli G, Pietronero L (2004) Social network growth with assortative mixing. Phys A Stat Mech Appl 338(1–2):119–124. Proceedings of the conference a nonlinear world: the real world, 2nd international conference on frontier science
Cho E, Myers SA, Leskovec J (2011) Friendship and mobility: user movement in location-based social networks. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’11. ACM, New York
Cici B, Markopoulou A, Frias-Martinez E, Laoutaris N (2014) Assessing the potential of ride-sharing using mobile and social data: a tale of four cities. In: Proceedings of the 2014 ACM international joint conference on pervasive and ubiquitous computing, UbiComp ’14. ACM, New York, pp 201–211
Efstathiades H, Antoniades D, Pallis G, Dikaiakos MD (2015) Identification of key locations based on online social network activity. In: Proceedings of the 2015 IEEE/ACM international conference on advances in social networks analysis and mining 2015, ASONAM ’15. ACM, New York, pp 218–225
Eisenstein J, O’Connor B, Smith NA, Xing EP (2010) A latent variable model for geographic lexical variation. In: Proceedings of the 2010 conference on empirical methods in natural language processing, EMNLP ’10. Stroudsburg
Ellison NB, Steinfield C, Lampe C (2007) The benefits of facebook friends: social capital and college students’ use of online social network sites. J Comput Med Commun 12(4):1143–1168
Falcone D, Mascolo C, Comito C, Talia D, Crowcroft J (2014) What is this place? Inferring place categories through user patterns identification in geo-tagged tweets. In: Proceedings of international conference on mobile computing, applications and services, MobiCASE
Ferrara E, Varol O, Davis C, Menczer F, Flammini A (2014) The rise of social bots. CoRR. arxiv:1407.5225
Ganti RK, Tsai Y-E, Abdelzaher TF (2008) Senseworld: towards cyber-physical social networks. In: Proceedings of the 7th international conference on information processing in sensor networks, IPSN ’08, Washington, DC. IEEE Computer Society, pp 563–564
Georgiev P, Noulas A, Mascolo C (2014) The call of the crowd: event participation in location-based social services. In: International AAAI conference on weblogs and social media (ICWSM)
Granovetter MS (1973) The strength of weak ties. Am J Sociol 78(6):1360–1380
Granovetter M, Soong R (1983) Threshold models of diffusion and collective behavior. J Math Sociol 9(3):165–179
Gross R, Acquisti A (2005) Information revelation and privacy in online social networks. In: Proceedings of the 2005 ACM workshop on privacy in the electronic society. ACM, pp 71–80
Guimerà R, Danon L, Díaz-Guilera A, Giralt F, Arenas A (2003) Self-similar community structure in a network of human interactions. Phys Rev E 68:065103
Hawelka B, Sitko I, Beinat E, Sobolevsky S, Kazakopoulos P, Ratti C (2014) Geo-located twitter as proxy for global mobility patterns. Cartogr Geogr Inform Sci 41(3):260–271
Hecht B, Hong L, Suh B, Chi EH (2011) Tweets from justin bieber’s heart: the dynamics of the location field in user profiles. In: Proceedings of the SIGCHI conference on human factors in computing systems, CHI ’11, New York. ACM, pp 237–246
Herder E, Siehndel P, Kawase R (2014) Predicting user locations and trajectories. User modeling, adaptation, and personalization. Springer, New York, pp 86–97
Hopcroft J, Lou T, Tang J (2011) Who will follow you back?: Reciprocal relationship prediction. In: Proceedings of the 20th ACM international conference on information and knowledge management, CIKM ’11, New York. ACM, pp 1137–1146
Jaiswal A, Peng W, Sun T (2013) Predicting time-sensitive user locations from social media. In: 2013 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM), pp 870–877
Jurdak R, Zhao K, Liu J, AbouJaoude M, Cameron M, Newth D (2014) Understanding Human mobility from Twitter. ArXiv e-prints
Jurgens D (2013) That’s what friends are for: inferring location in online social media platforms based on social relationships. In: International AAAI conference on weblogs and social media (ICWSM)
Katragadda S, Jin M, Raghavan V (2014) An unsupervised approach to identify location based on the content of user’s tweet history. Active media technology lecture notes in computer science, vol 8610. Springer, New York, pp 311–323
Kotzias D, Lappas T, Gunopulos D (2016) Home is where your friends are: Utilizing the social graph to locate twitter users in a city. Inform Syst 57:77–87
Kulshrestha J, Kooti F, Nikravesh A, Gummadi PK (2012) Geographic dissection of the twitter network. In: International AAAI conference on weblogs and social media (ICWSM)
Kumar R, Novak J, Tomkins A (2006) Structure and evolution of online social networks. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’06, New York. ACM, pp 611–617
Kwak H, Lee C, Park H, Moon S (2010) What is twitter, a social network or a news media? In: Proceedings of the 19th international conference on World Wide Web, WWW ’10, New York. ACM, pp 591–600
Leskovec J, Horvitz E (2008) Planetary-scale views on a large instant-messaging network. In: Proceedings of the 17th international conference on World Wide Web, WWW ’08, New York. ACM, pp 915–924
Leskovec J, Kleinberg J, Faloutsos C (2005) Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the Eleventh ACM SIGKDD international conference on knowledge discovery in data mining, KDD ’05, New York. ACM, pp 177–187
Levine SS, Kurzban R (2006) Explaining clustering in social networks: towards an evolutionary theory of cascading benefits. Manag Decis Econ 27(2–3):173–187
Liben-Nowell D, Kleinberg J (2003) The link prediction problem for social networks. In: Proceedings of the Twelfth international conference on information and knowledge management, CIKM ’03, New York. ACM, pp 556–559
Li G, Hu J, Feng J, Tan K-L (2014) Effective location identification from microblogs. In: IEEE 30th international conference on data engineering (ICDE), 2014, pp 880–891
Liu H, Zhou Y, Zhang Y (2015) Estimating users’ home and work locations leveraging large-scale crowd-sourced smartphone data. IEEE Commun Mag 53(3):71–79
Li R, Wang S, Deng H, Wang R, Chang KC-C (2012) Towards social user profiling: Unified and discriminative influence model for inferring home locations. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, New York. ACM, pp 1023–1031
Mahmud J, Nichols J, Drews C (2014) Home location identification of twitter users. ACM Trans Intell Syst Technol 5(3):47
Mcauley J, Leskovec J (2014) Discovering social circles in ego networks. ACM Trans Knowl Discov Data 8(1):4:1–4:28
Milgram S (1967) The small world problem. Psychol Today 2(1):60–67
Mocanu D, Baronchelli A, Perra N, Gonçalves B, Zhang Q, Vespignani A (2013) The twitter of babel: mapping world languages through microblogging platforms. PLoS One 8(4):e61981
Morstatter F, Pfeffer J, Liu H, Carley K (2013) Is the sample good enough? Comparing data from twitter’s streaming api with twitter’s firehose. In: International AAAI conference on weblogs and social media (ICWSM)
Myers SA, Sharma A, Gupta P, Lin J (2014) Information network or social network? The structure of the twitter follow graph. In: Proceedings of the 23rd international conference on world wide web, WWW ’14 Companion, New York. ACM, pp 493–498
Narr S, Hulfenhaus M, Albayrak S (2012) Language-independent twitter sentiment analysis. In: Knowledge discovery and machine learning (KDML), LWA, pp 12–14
Noulas A, Scellato S, Mascolo C, Pontil M (2011) An empirical study of geographic user activity patterns in foursquare. In: Proceedings of the 5th international AAAI conference on weblogs and social media. pp 570–573
Perc M (2014) The matthew effect in empirical data. J R Soc Interface 11(98):20140378
Ryoo K, Moon S (2014) Inferring twitter user locations with 10 km accuracy. In: Proceedings of the companion publication of the 23rd international conference on world wide web companion, WWW Companion ’14, pp 643–648
Sadilek A, Kautz H, Bigham JP (2012) Finding your friends and following them to where you are. In: Proceedings of the Fifth ACM international conference on web search and data mining, WSDM ’12, New York, NY, USA. ACM, pp 723–732
Yang C, Harkreader R, Gu G (2013) Empirical evaluation and new design for fighting evolving twitter spammers. IEEE Trans Inform Forensics Secur 8(8):1280–1293
Yuan Q, Cong G, Ma Z, Sun A, Thalmann NM (2013) Who, where, when and what: discover spatio-temporal topics for twitter users. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’13, New York
Yu Y, Wang X (2015) World cup 2014 in the twitter world: a big data analysis of sentiments in U.S. sports fans’ tweets. Comput Hum Behav 48:392–400
Zhang D, Huang J, Li Y, Zhang F, Xu C, He T (2014) Exploring human mobility with multi-source data at extremely large metropolitan scales. In: Proceedings of the 20th annual international conference on mobile computing and networking, MobiCom ’14, New York. ACM, pp 201–212
Acknowledgments
This work was partially supported by the iSocial EU Marie Curie ITN project (FP7-PEOPLE-2012-ITN).
Cite this article
Efstathiades, H., Antoniades, D., Pallis, G. et al. Users key locations in online social networks: identification and applications. Soc. Netw. Anal. Min. 6, 66 (2016). https://doi.org/10.1007/s13278-016-0376-3
DOI: https://doi.org/10.1007/s13278-016-0376-3