Abstract
The increasing popularity and diversity of social media sites has encouraged more and more people to participate on multiple online social networks to enjoy their services. Each user may create a user identity, which can includes profile, content, or network information, to represent his or her unique public figure in every social network. Thus, a fundamental question arises -- can we link user identities across online social networks? User identity linkage across online social networks is an emerging task in social media and has attracted increasing attention in recent years. Advancements in user identity linkage could potentially impact various domains such as recommendation and link prediction. Due to the unique characteristics of social network data, this problem faces tremendous challenges. To tackle these challenges, recent approaches generally consist of (1) extracting features and (2) constructing predictive models from a variety of perspectives. In this paper, we review key achievements of user identity linkage across online social networks including stateof- the-art algorithms, evaluation metrics, and representative datasets. We also discuss related research areas, open problems, and future research directions for user identity linkage across online social networks.
- Mohammad Al Hasan, Vineet Chaoji, Saeed Salem, and Mohammed Zaki. Link prediction using supervised learning. In SDM06: workshop on link analysis, counter-terrorism and security, 2006.Google Scholar
- Mohammad Al Hasan and Mohammed J Zaki. A survey of link prediction in social networks. In Social network data analytics. 2011.Google Scholar
- Lars Backstrom, Cynthia Dwork, and Jon Kleinberg. Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography. In WWW, 2007. Google ScholarDigital Library
- Albert-László Barabási and Réka Albert. Emergence of scaling in random networks. science, 1999.Google Scholar
- Sergey Bartunov, Anton Korshunov, Seung-Taek Park, Wonho Ryu, and Hyungdong Lee. Joint link-attribute user identity resolution in online social networks. In ACM (SNA-KDD), 2012.Google Scholar
- Mohsen Bayati, Margot Gerritsen, David F Gleich, Amin Saberi, and Ying Wang. Algorithms for large, sparse network alignment problems. In ICDM, 2009. Google ScholarDigital Library
- Omar Benjelloun, Hector Garcia-Molina, David Menestrina, Qi Su, Steven Euijong Whang, and Jennifer Widom. Swoosh: a generic approach to entity resolution. VLDB, 2009. Google ScholarDigital Library
- Nacéra Bennacer, Coriane Nana Jipmo, Antonio Penta, and Gianluca Quercini. Matching user profiles across social networks. In International Conference on Advanced Information Systems Engineering. Springer, 2014.Google ScholarCross Ref
- Mikhail Bilenko and Raymond J Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, 2003. Google ScholarDigital Library
- David Guy Brizan and Abdullah Uz Tansel. A. survey of entity resolution and record linkage methodologies. Communications of the IIMA, 2015.Google Scholar
- Francesco Buccafurri, Gianluca Lax, Antonino Nocera, and Domenico Ursino. Discovering links among social networks. In ECML/PKDD, 2012.Google ScholarCross Ref
- Iván Cantador, Ignacio Fernández-Tobás, Shlomo Berkovsky, and Paolo Cremonesi. Cross-domain recommender systems. In Recommender Systems Handbook. 2015.Google Scholar
- Francesca Carmagnola and Federica Cena. User identification for cross-system personalisation. Information Sciences, 2009. Google ScholarDigital Library
- Deepayan Chakrabarti, Yiping Zhan, and Christos Faloutsos. R-mat: A recursive model for graph mining. In SDM, 2004.Google ScholarCross Ref
- Peter Christen. Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection. Springer Science & Business Media, 2012. Google ScholarDigital Library
- William Cohen, Pradeep Ravikumar, and Stephen Fienberg. A comparison of string metrics for matching names and records. 2003.Google Scholar
- Donatello Conte, Pasquale Foggia, Carlo Sansone, and Mario Vento. Thirty years of graph matching in pattern recognition. International journal of pattern recognition and artificial intelligence, 2004.Google Scholar
- Zhengyu Deng, Jitao Sang, and Changsheng Xu. Personalized video recommendation based on crossplatform user modeling. In ICME, 2013.Google Scholar
- Mohamed G. Elfeky, Vassilios S. Verykios, and Ahmed K Elmagarmid. Tailor: A record linkage toolbox. In ICDE, 2002.Google ScholarCross Ref
- P. ERDdS and A. R&WI. On random graphs i. Publ. Math. Debrecen, 6:290--297, 1959.Google Scholar
- Ivan P. Fellegi and Alan B. Sunter. A theory for record linkage. Journal of the American Statistical Association, 1969.Google Scholar
- Lise Getoor and Ashwin Machanavajjhala. Entity resolution: theory, practice & open challenges. VLDB, 2012. Google ScholarDigital Library
- Oana Goga, Howard Lei, Sree Hari Krishnan Parthasarathi, Gerald Friedland, Robin Sommer, and Renata Teixeira. Exploiting innocuous activity for correlating users across sites. In WWW, 2013. Google ScholarDigital Library
- Oana Goga, Patrick Loiseau, Robin Sommer, Renata Teixeira, and Krishna P Gummadi. On the reliability of profile matching across large online social networks. In KDD, 2015. Google ScholarDigital Library
- Oana Goga, Daniele Perito, Howard Lei, Renata Teixeira, and Robin Sommer. Large-scale correlation of accounts across social networks. 2013.Google Scholar
- David J. Hand and Robert J. Till. A simple generalisation of the area under the roc curve for multiple class classification problems. Machine learning, 2001. Google ScholarDigital Library
- Tereza Iofciu, Peter Fankhauser, Fabian Abel, and Kerstin Bischoff. Identifying users across social tagging systems. In ICWSM, 2011.Google Scholar
- Paridhi Jain and Ponnurangam Kumaraguru. Finding nemo: searching and resolving identities of users across online social networks. arXiv preprint arXiv:1212.6147, 2012.Google Scholar
- Gunnar W Klau. A new graph-based method for pairwise global network alignment. BMC bioinformatics, 2009.Google Scholar
- Xiangnan Kong, Jiawei Zhang, and Philip S Yu. Inferring anchor links across multiple heterogeneous social networks. In CIKM, 2013. Google ScholarDigital Library
- Hanna Köpcke and Erhard Rahm. Frameworks for entity matching: A comparison. Data & Knowledge Engineering, 2010. Google ScholarDigital Library
- Nitish Korula and Silvio Lattanzi. An efficient reconciliation algorithm for social networks. VLDB, 2014. Google ScholarDigital Library
- Shamanth Kumar, Reza Zafarani, and Huan Liu. Understanding user migration patterns in social media. In AAAI, 2011. Google ScholarDigital Library
- Sebastian Labitzke, Irina Taranu, and Hannes Hartenstein. What your friends tell others about you: Low cost linkability of social network profiles. 2011.Google Scholar
- Silvio Lattanzi and D Sivakumar. Affiliation networks. In STOC, 2009. Google ScholarDigital Library
- Chung-Yi Li and Shou-De Lin. Matching users and items across domains to improve the recommendation quality. In KDD, 2014. Google ScholarDigital Library
- David Liben-Nowell and Jon Kleinberg. The linkprediction problem for social networks. Journal of the American society for information science and technology, 2007. Google ScholarDigital Library
- Jing Liu, Fan Zhang, Xinying Song, Young-In Song, Chin-Yew Lin, and Hsiao-Wuen Hon. What's in a name?: an unsupervised approach to link users across communities. In WSDM, 2013. Google ScholarDigital Library
- Li Liu, Cheung K. William, Xin Li, and Lejian Liao. Aligning users across social networks using network embedding. In IJCAI, 2016. Google ScholarDigital Library
- Siyuan Liu, ShuhuiWang, Feida Zhu, Jinbo Zhang, and Ramayya Krishnan. Hydra: Large-scale social identity linkage via heterogeneous behavior modeling. In SIGMOD, 2014. Google ScholarDigital Library
- Chun-Ta Lu, Sihong Xie, Weixiang Shao, Lifang He, and Philip S Yu. Item recommendation for emerging online businesses. 2016.Google Scholar
- Anshu Malhotra, Luam Totti, Wagner Meira Jr, Ponnurangam Kumaraguru, and Virgilio Almeida. Studying user footprints in different online social networks. In ASONAM, 2012. Google ScholarDigital Library
- Tong Man, Huawei Shen, Shenghua Liu, Xiaolong Jin, and Xueqi Cheng. Predict anchor links across social networks via an embedding approach. In IJCAI, 2016. Google ScholarDigital Library
- Lydia Manikonda, Venkata Vamsikrishna Meduri, and Subbarao Kambhampati. Tweeting the mind and instagramming the heart: Exploring differentiated content sharing on social media. arXiv preprint arXiv:1603.02718, 2016.Google Scholar
- Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm. Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In ICDE, 2002. Google ScholarDigital Library
- Marti Motoyama and George Varghese. I seek you: searching and matching individuals in social networks. In Proceedings of the eleventh international workshop on Web information and data management, 2009. Google ScholarDigital Library
- Xin Mu, Feida Zhu, Zhi-Hua Zhou, Ee-Peng Lim, Jing Xiao, and Jianzong Wang. User identity linkage by latent user space modeling. In KDD, 2016. Google ScholarDigital Library
- Arvind Narayanan and Vitaly Shmatikov. Deanonymizing social networks. In ISSP, 2009. Google ScholarDigital Library
- Arvind Narayanan and Vitaly Shmatikov. Myths and fallacies of personally identifiable information. Communications of the ACM, 2010. Google ScholarDigital Library
- Yuanping Nie, Yan Jia, Shudong Li, Xiang Zhu, Aiping Li, and Bin Zhou. Identifying users across social networks based on dynamic core interests. Neurocomputing, 2016. Google ScholarDigital Library
- Olga Peled, Michael Fire, Lior Rokach, and Yuval Elovici. Entity matching in online social networks. In SocialCom. IEEE, 2013. Google ScholarDigital Library
- Daniele Perito, Claude Castelluccia, Mohamed Ali Kaafar, and Pere Manils. How unique and traceable are usernames? In International Symposium on Privacy Enhancing Technologies Symposium. Springer, 2011. Google ScholarDigital Library
- Christopher Riederer, Yunsung Kim, Augustin Chaintreau, Nitish Korula, and Silvio Lattanzi. Linking users across domains with location data: Theory and validation. In WWW, 2016. Google ScholarDigital Library
- Yilin Shen and Hongxia Jin. Controllable information sharing for user accounts linkage across multiple online social networks. In CIKM, 2014. Google ScholarDigital Library
- Rohit Singh, Jinbo Xu, and Bonnie Berger. Global alignment of multiple protein interaction networks with application to functional orthology detection. Proceedings of the National Academy of Sciences, 2008.Google ScholarCross Ref
- Shulong Tan, Ziyu Guan, Deng Cai, Xuzhen Qin, Jiajun Bu, and Chun Chen. Mapping users across networks by manifold alignment on hypergraph. In AAAI, 2014. Google ScholarDigital Library
- Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. Line: Large-scale information network embedding. In WWW, 2015. Google ScholarDigital Library
- Jiliang Tang, Yi Chang, and Huan Liu. Mining social media with social theories: a survey. ACM SIGKDD Explorations Newsletter, 2014. Google ScholarDigital Library
- Andreas Thor and Erhard Rahm. Moma-a mappingbased object matching system. In CIDR, 2007.Google Scholar
- Jan Vosecky, Dan Hong, and Vincent Y Shen. User identification across multiple social networks. In 2009 First International Conference on Networked Digital Technologies. IEEE, 2009.Google ScholarCross Ref
- Duncan J Watts and Steven H Strogatz. Collective dynamics of small-worldnetworks. nature, 1998.Google Scholar
- Ming Yan, Jitao Sang, Tao Mei, and Changsheng Xu. Friend transfer: cold-start friend recommendation with cross-platform transfer learning of social knowledge. In ICME, 2013.Google Scholar
- Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, and Ming-Ting Sun. Modeling and predicting personal information dissemination behavior. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD '05, pages 479--488, New York, NY, USA, 2005. ACM. Google ScholarDigital Library
- Reza Zafarani and Huan Liu. Connecting corresponding identities across communities. ICWSM, 2009.Google ScholarCross Ref
- Reza Zafarani and Huan Liu. Connecting users across social media sites: a behavioral-modeling approach. In KDD, 2013. Google ScholarDigital Library
- Pei Sun and Sanjay Chawla. On local spatial outliers. In Data Mining, 2004. ICDM'04. Fourth IEEE International Conference on, pages 209--216. IEEE, 2004Reza Zafarani and Huan Liu. Finding friends on a new site using minimum information. In SDM, 2014.. Google ScholarDigital Library
- Reza Zafarani and Huan Liu. Users joining multiple sites: Distributions and patterns. In ICWSM. Citeseer, 2014.Google Scholar
- Reza Zafarani and Huan Liu. Users joining multiple sites: Friendship and popularity variations across sites. Information Fusion, 28:83--89, 2016. Google ScholarDigital Library
- Reza Zafarani, Lei Tang, and Huan Liu. User identification across social media. TKDD, 2015. Google ScholarDigital Library
- Haochen Zhang, Min-Yen Kan, Yiqun Liu, and Shaoping Ma. Online social network profile linkage. In Asia Information Retrieval Symposium, pages 197--208. Springer, 2014.Google ScholarCross Ref
- Haochen Zhang, Minyen Kan, Yiqun Liu, and Shaoping Ma. Online social network profile linkage based on cost-sensitive feature acquisition. In Chinese National Conference on Social Media Processing, 2014.Google ScholarCross Ref
- Jiawei Zhang and Philip S Yu. Pct: partial coalignment of social networks. In WWW, 2016. Google ScholarDigital Library
- Jiawei Zhang, Philip S Yu, and Zhi-Hua Zhou. Metapath based multi-network collective link prediction. In KDD, 2014. Google ScholarDigital Library
- Jiawei Zhang and Philip Yu S. Multiple anonymized social networks alignment. In ICDM, 2015. Google ScholarDigital Library
- Si Zhang and Hanghang Tong. Final: Fast attributed network alignment. In KDD. ACM, 2016. Google ScholarDigital Library
- Yutao Zhang, Jie Tang, Zhilin Yang, Jian Pei, and Philip S Yu. Cosnet: connecting heterogeneous social networks with local and global consistency. In KDD, 2015. Google ScholarDigital Library
- Yuxiang Zhang, Lulu Wang, Xiaoli Li, and Chunjing Xiao. Social identity link across incomplete social information sources using anchor link expansion. In PAKDD, 2016.Google ScholarCross Ref
- Xiaoping Zhou, Xun Liang, Haiyan Zhang, and Yuefeng Ma. Cross-platform identification of anonymous identical users in multiple social media networks. TKDE, 2016. Google ScholarDigital Library
- Xiaojin Zhu and Andrew B Goldberg. Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning, 2009. Google ScholarDigital Library
- Reza Zafarani, Mohammad Ali Abbasi, and Huan Liu. Social media mining: an introduction. Cambridge University Press, 2014. Google ScholarDigital Library
Index Terms
- User Identity Linkage across Online Social Networks: A Review
Recommendations
Hyperbolic User Identity Linkage across Social Networks
GLOBECOM 2020 - 2020 IEEE Global Communications ConferenceWith the growing prosperity and diversity of social networks, more and more users participate in multiple social networks to enjoy their diverse services. Users can create different user identities in different social networks, but most networks are ...
Identifying User Identity across Social Network Sites based on Overlapping Relationship and Social Interaction
ChineseCSCW '17: Proceedings of the 12th Chinese Conference on Computer Supported Cooperative Work and Social ComputingMost people have identities on multiple social network sites (SNSs) simultaneously to meet their diverse needs for social media use. User identity identification across SNSs has been a significant research focus in recent years as it is important for ...
Managing identity across social networks
CERIAS '11: Proceedings of the 12th Annual Information Security SymposiumGoal: Gain an in-depth understanding of online identity management among heavy social media users -- people who use Facebook, Twitter and LinkedIn weekly.
Theoretically grounded in social psychology and symbolic interactionism, the project inquires how ...
Comments