skip to main content
review-article

User Identity Linkage across Online Social Networks: A Review

Published:22 March 2017Publication History
Skip Abstract Section

Abstract

The increasing popularity and diversity of social media sites has encouraged more and more people to participate on multiple online social networks to enjoy their services. Each user may create a user identity, which can includes profile, content, or network information, to represent his or her unique public figure in every social network. Thus, a fundamental question arises -- can we link user identities across online social networks? User identity linkage across online social networks is an emerging task in social media and has attracted increasing attention in recent years. Advancements in user identity linkage could potentially impact various domains such as recommendation and link prediction. Due to the unique characteristics of social network data, this problem faces tremendous challenges. To tackle these challenges, recent approaches generally consist of (1) extracting features and (2) constructing predictive models from a variety of perspectives. In this paper, we review key achievements of user identity linkage across online social networks including stateof- the-art algorithms, evaluation metrics, and representative datasets. We also discuss related research areas, open problems, and future research directions for user identity linkage across online social networks.

References

  1. Mohammad Al Hasan, Vineet Chaoji, Saeed Salem, and Mohammed Zaki. Link prediction using supervised learning. In SDM06: workshop on link analysis, counter-terrorism and security, 2006.Google ScholarGoogle Scholar
  2. Mohammad Al Hasan and Mohammed J Zaki. A survey of link prediction in social networks. In Social network data analytics. 2011.Google ScholarGoogle Scholar
  3. Lars Backstrom, Cynthia Dwork, and Jon Kleinberg. Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography. In WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Albert-László Barabási and Réka Albert. Emergence of scaling in random networks. science, 1999.Google ScholarGoogle Scholar
  5. Sergey Bartunov, Anton Korshunov, Seung-Taek Park, Wonho Ryu, and Hyungdong Lee. Joint link-attribute user identity resolution in online social networks. In ACM (SNA-KDD), 2012.Google ScholarGoogle Scholar
  6. Mohsen Bayati, Margot Gerritsen, David F Gleich, Amin Saberi, and Ying Wang. Algorithms for large, sparse network alignment problems. In ICDM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Omar Benjelloun, Hector Garcia-Molina, David Menestrina, Qi Su, Steven Euijong Whang, and Jennifer Widom. Swoosh: a generic approach to entity resolution. VLDB, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Nacéra Bennacer, Coriane Nana Jipmo, Antonio Penta, and Gianluca Quercini. Matching user profiles across social networks. In International Conference on Advanced Information Systems Engineering. Springer, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  9. Mikhail Bilenko and Raymond J Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. David Guy Brizan and Abdullah Uz Tansel. A. survey of entity resolution and record linkage methodologies. Communications of the IIMA, 2015.Google ScholarGoogle Scholar
  11. Francesco Buccafurri, Gianluca Lax, Antonino Nocera, and Domenico Ursino. Discovering links among social networks. In ECML/PKDD, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  12. Iván Cantador, Ignacio Fernández-Tobás, Shlomo Berkovsky, and Paolo Cremonesi. Cross-domain recommender systems. In Recommender Systems Handbook. 2015.Google ScholarGoogle Scholar
  13. Francesca Carmagnola and Federica Cena. User identification for cross-system personalisation. Information Sciences, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Deepayan Chakrabarti, Yiping Zhan, and Christos Faloutsos. R-mat: A recursive model for graph mining. In SDM, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  15. Peter Christen. Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection. Springer Science & Business Media, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. William Cohen, Pradeep Ravikumar, and Stephen Fienberg. A comparison of string metrics for matching names and records. 2003.Google ScholarGoogle Scholar
  17. Donatello Conte, Pasquale Foggia, Carlo Sansone, and Mario Vento. Thirty years of graph matching in pattern recognition. International journal of pattern recognition and artificial intelligence, 2004.Google ScholarGoogle Scholar
  18. Zhengyu Deng, Jitao Sang, and Changsheng Xu. Personalized video recommendation based on crossplatform user modeling. In ICME, 2013.Google ScholarGoogle Scholar
  19. Mohamed G. Elfeky, Vassilios S. Verykios, and Ahmed K Elmagarmid. Tailor: A record linkage toolbox. In ICDE, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  20. P. ERDdS and A. R&WI. On random graphs i. Publ. Math. Debrecen, 6:290--297, 1959.Google ScholarGoogle Scholar
  21. Ivan P. Fellegi and Alan B. Sunter. A theory for record linkage. Journal of the American Statistical Association, 1969.Google ScholarGoogle Scholar
  22. Lise Getoor and Ashwin Machanavajjhala. Entity resolution: theory, practice & open challenges. VLDB, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Oana Goga, Howard Lei, Sree Hari Krishnan Parthasarathi, Gerald Friedland, Robin Sommer, and Renata Teixeira. Exploiting innocuous activity for correlating users across sites. In WWW, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Oana Goga, Patrick Loiseau, Robin Sommer, Renata Teixeira, and Krishna P Gummadi. On the reliability of profile matching across large online social networks. In KDD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Oana Goga, Daniele Perito, Howard Lei, Renata Teixeira, and Robin Sommer. Large-scale correlation of accounts across social networks. 2013.Google ScholarGoogle Scholar
  26. David J. Hand and Robert J. Till. A simple generalisation of the area under the roc curve for multiple class classification problems. Machine learning, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Tereza Iofciu, Peter Fankhauser, Fabian Abel, and Kerstin Bischoff. Identifying users across social tagging systems. In ICWSM, 2011.Google ScholarGoogle Scholar
  28. Paridhi Jain and Ponnurangam Kumaraguru. Finding nemo: searching and resolving identities of users across online social networks. arXiv preprint arXiv:1212.6147, 2012.Google ScholarGoogle Scholar
  29. Gunnar W Klau. A new graph-based method for pairwise global network alignment. BMC bioinformatics, 2009.Google ScholarGoogle Scholar
  30. Xiangnan Kong, Jiawei Zhang, and Philip S Yu. Inferring anchor links across multiple heterogeneous social networks. In CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Hanna Köpcke and Erhard Rahm. Frameworks for entity matching: A comparison. Data & Knowledge Engineering, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Nitish Korula and Silvio Lattanzi. An efficient reconciliation algorithm for social networks. VLDB, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Shamanth Kumar, Reza Zafarani, and Huan Liu. Understanding user migration patterns in social media. In AAAI, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Sebastian Labitzke, Irina Taranu, and Hannes Hartenstein. What your friends tell others about you: Low cost linkability of social network profiles. 2011.Google ScholarGoogle Scholar
  35. Silvio Lattanzi and D Sivakumar. Affiliation networks. In STOC, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Chung-Yi Li and Shou-De Lin. Matching users and items across domains to improve the recommendation quality. In KDD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. David Liben-Nowell and Jon Kleinberg. The linkprediction problem for social networks. Journal of the American society for information science and technology, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Jing Liu, Fan Zhang, Xinying Song, Young-In Song, Chin-Yew Lin, and Hsiao-Wuen Hon. What's in a name?: an unsupervised approach to link users across communities. In WSDM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Li Liu, Cheung K. William, Xin Li, and Lejian Liao. Aligning users across social networks using network embedding. In IJCAI, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Siyuan Liu, ShuhuiWang, Feida Zhu, Jinbo Zhang, and Ramayya Krishnan. Hydra: Large-scale social identity linkage via heterogeneous behavior modeling. In SIGMOD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Chun-Ta Lu, Sihong Xie, Weixiang Shao, Lifang He, and Philip S Yu. Item recommendation for emerging online businesses. 2016.Google ScholarGoogle Scholar
  42. Anshu Malhotra, Luam Totti, Wagner Meira Jr, Ponnurangam Kumaraguru, and Virgilio Almeida. Studying user footprints in different online social networks. In ASONAM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Tong Man, Huawei Shen, Shenghua Liu, Xiaolong Jin, and Xueqi Cheng. Predict anchor links across social networks via an embedding approach. In IJCAI, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Lydia Manikonda, Venkata Vamsikrishna Meduri, and Subbarao Kambhampati. Tweeting the mind and instagramming the heart: Exploring differentiated content sharing on social media. arXiv preprint arXiv:1603.02718, 2016.Google ScholarGoogle Scholar
  45. Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm. Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In ICDE, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Marti Motoyama and George Varghese. I seek you: searching and matching individuals in social networks. In Proceedings of the eleventh international workshop on Web information and data management, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Xin Mu, Feida Zhu, Zhi-Hua Zhou, Ee-Peng Lim, Jing Xiao, and Jianzong Wang. User identity linkage by latent user space modeling. In KDD, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Arvind Narayanan and Vitaly Shmatikov. Deanonymizing social networks. In ISSP, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Arvind Narayanan and Vitaly Shmatikov. Myths and fallacies of personally identifiable information. Communications of the ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Yuanping Nie, Yan Jia, Shudong Li, Xiang Zhu, Aiping Li, and Bin Zhou. Identifying users across social networks based on dynamic core interests. Neurocomputing, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Olga Peled, Michael Fire, Lior Rokach, and Yuval Elovici. Entity matching in online social networks. In SocialCom. IEEE, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Daniele Perito, Claude Castelluccia, Mohamed Ali Kaafar, and Pere Manils. How unique and traceable are usernames? In International Symposium on Privacy Enhancing Technologies Symposium. Springer, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Christopher Riederer, Yunsung Kim, Augustin Chaintreau, Nitish Korula, and Silvio Lattanzi. Linking users across domains with location data: Theory and validation. In WWW, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Yilin Shen and Hongxia Jin. Controllable information sharing for user accounts linkage across multiple online social networks. In CIKM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Rohit Singh, Jinbo Xu, and Bonnie Berger. Global alignment of multiple protein interaction networks with application to functional orthology detection. Proceedings of the National Academy of Sciences, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  56. Shulong Tan, Ziyu Guan, Deng Cai, Xuzhen Qin, Jiajun Bu, and Chun Chen. Mapping users across networks by manifold alignment on hypergraph. In AAAI, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. Line: Large-scale information network embedding. In WWW, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Jiliang Tang, Yi Chang, and Huan Liu. Mining social media with social theories: a survey. ACM SIGKDD Explorations Newsletter, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Andreas Thor and Erhard Rahm. Moma-a mappingbased object matching system. In CIDR, 2007.Google ScholarGoogle Scholar
  60. Jan Vosecky, Dan Hong, and Vincent Y Shen. User identification across multiple social networks. In 2009 First International Conference on Networked Digital Technologies. IEEE, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  61. Duncan J Watts and Steven H Strogatz. Collective dynamics of small-worldnetworks. nature, 1998.Google ScholarGoogle Scholar
  62. Ming Yan, Jitao Sang, Tao Mei, and Changsheng Xu. Friend transfer: cold-start friend recommendation with cross-platform transfer learning of social knowledge. In ICME, 2013.Google ScholarGoogle Scholar
  63. Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, and Ming-Ting Sun. Modeling and predicting personal information dissemination behavior. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD '05, pages 479--488, New York, NY, USA, 2005. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Reza Zafarani and Huan Liu. Connecting corresponding identities across communities. ICWSM, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  65. Reza Zafarani and Huan Liu. Connecting users across social media sites: a behavioral-modeling approach. In KDD, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. Pei Sun and Sanjay Chawla. On local spatial outliers. In Data Mining, 2004. ICDM'04. Fourth IEEE International Conference on, pages 209--216. IEEE, 2004Reza Zafarani and Huan Liu. Finding friends on a new site using minimum information. In SDM, 2014.. Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. Reza Zafarani and Huan Liu. Users joining multiple sites: Distributions and patterns. In ICWSM. Citeseer, 2014.Google ScholarGoogle Scholar
  68. Reza Zafarani and Huan Liu. Users joining multiple sites: Friendship and popularity variations across sites. Information Fusion, 28:83--89, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. Reza Zafarani, Lei Tang, and Huan Liu. User identification across social media. TKDD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. Haochen Zhang, Min-Yen Kan, Yiqun Liu, and Shaoping Ma. Online social network profile linkage. In Asia Information Retrieval Symposium, pages 197--208. Springer, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  71. Haochen Zhang, Minyen Kan, Yiqun Liu, and Shaoping Ma. Online social network profile linkage based on cost-sensitive feature acquisition. In Chinese National Conference on Social Media Processing, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  72. Jiawei Zhang and Philip S Yu. Pct: partial coalignment of social networks. In WWW, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  73. Jiawei Zhang, Philip S Yu, and Zhi-Hua Zhou. Metapath based multi-network collective link prediction. In KDD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. Jiawei Zhang and Philip Yu S. Multiple anonymized social networks alignment. In ICDM, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Si Zhang and Hanghang Tong. Final: Fast attributed network alignment. In KDD. ACM, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. Yutao Zhang, Jie Tang, Zhilin Yang, Jian Pei, and Philip S Yu. Cosnet: connecting heterogeneous social networks with local and global consistency. In KDD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Yuxiang Zhang, Lulu Wang, Xiaoli Li, and Chunjing Xiao. Social identity link across incomplete social information sources using anchor link expansion. In PAKDD, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  78. Xiaoping Zhou, Xun Liang, Haiyan Zhang, and Yuefeng Ma. Cross-platform identification of anonymous identical users in multiple social media networks. TKDE, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Xiaojin Zhu and Andrew B Goldberg. Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  80. Reza Zafarani, Mohammad Ali Abbasi, and Huan Liu. Social media mining: an introduction. Cambridge University Press, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. User Identity Linkage across Online Social Networks: A Review
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGKDD Explorations Newsletter
      ACM SIGKDD Explorations Newsletter  Volume 18, Issue 2
      December 2016
      29 pages
      ISSN:1931-0145
      EISSN:1931-0153
      DOI:10.1145/3068777
      Issue’s Table of Contents

      Copyright © 2017 Authors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 March 2017

      Check for updates

      Qualifiers

      • review-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader