Similarity-Based User Identification Across Social Networks

Zamani, Katerina; Paliouras, Georgios; Vogiatzis, Dimitrios

doi:10.1007/978-3-319-24261-3_14

Katerina Zamani^16,17,
Georgios Paliouras¹⁷ &
Dimitrios Vogiatzis^17,18

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9370))

Included in the following conference series:

International Workshop on Similarity-Based Pattern Recognition

2195 Accesses

Abstract

In this paper we study the identifiability of users across social networks, with a trainable combination of different similarity metrics. This application is becoming particularly interesting as the number and variety of social networks increase and the presence of individuals in multiple networks is becoming commonplace. Motivated by the need to verify information that appears in social networks, as addressed by the research project REVEAL, the presence of individuals in different networks provides an interesting opportunity: we can use information from one network to verify information that appears in another. In order to achieve this, we need to identify users across networks. We approach this problem by a combination of similarity measures that take into account the users’ affiliation, location, professional interests and past experience, as stated in the different networks. We experimented with a variety of combination approaches, ranging from simple averaging to trained hybrid models. Our experiments show that, under certain conditions, identification is possible with sufficiently high accuracy to support the goal of verification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

LIAISON: reconciLIAtion of Individuals Profiles Across SOcial Networks

Tuser3: A Profile Matching Based Algorithm Across Three Heterogeneous Social Networks

Identifying multiple social network accounts belonging to the same users

Article 11 March 2021

Notes

1.
http://revealproject.eu/.

References

Goga, O., Perito, D., Lei, H., Teixeira, R., Sommer, R.: Large-scale correlation of accounts across social networks. Technical report (2013)
Google Scholar
Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: Adamic, L.A., Baeza-Yates, R.A., Counts, S. (eds.) ICWS. The AAAI Press (2011)
Google Scholar
Goga, O., Lei, H., Parthasarathi, S., Friedland, G., Sommer, R., Teixeira, R.: On exploiting innocuous user activity for correlating accounts across social network sites. Technical report, ICSI Technical Reports University of Berkeley (2012)
Google Scholar
Hall, M., Frank, E.: Combining Naive Bayes and decision tables. In: FLAIRS Conference, vol. 2118, pp. 318–319 (2008)
Google Scholar
Egele, M., et al.: COMPA: detecting compromised accounts in social networks. In: NDSS (2013)
Google Scholar
Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: a survey. IEEE Trans. Knowl. Data Eng. 19, 1–16 (2007)
Article Google Scholar
Chen, Y., Zhao, J., Hu, X., Zhang, X., Li, Z., Chua, T.S.: From interest to function: location estimation in social media. In: AAAI (2013)
Google Scholar
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
MATH Google Scholar
Moreau, E., Yvon, F., Capp, O.: Robust similarity measures for named entities matching. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 593–600. Association for Computational Linguistics (2008)
Google Scholar
Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for matching names and records. In: KDD Workshop on Data Cleaning and Object Consolidation, vol. 3, pp. 73–78 (2003)
Google Scholar
Malhotra, A., Totti, L., Meira, Jr., W., Kumaraguru, P., Almeida, V.: Studying user footprints in different online social networks. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining, ASONAM, pp. 1065–1070. IEEE Computer Society (2012)
Google Scholar
Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: First International Conference on Networked Digital Technologies, NDT 2009, pp. 360–365. IEEE (2009)
Google Scholar
Machine Learning Group at the University of Waikato. http://www.cs.waikato.ac.nz/ml/index.html
GeoNames Ontology. http://www.geonames.org/
Simmetrics Library. https://github.com/Simmetrics/simmetrics
SecondString Library. https://github.com/TeamCohen/secondstring
Reveal Project: Social Media Verification. http://revealproject.eu/

Download references

Acknowledgments

This work was partially supported by the research project REVEAL (REVEALing hidden concepts in Social Media), which is funded by the European Commission, under the FP7 programme (contract number 610928).

Author information

Authors and Affiliations

Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece
Katerina Zamani
Institute of Informatics and Telecommunications, National Centre for Scientific Research, “Demokritos”, Aghia Paraskevi, Greece
Katerina Zamani, Georgios Paliouras & Dimitrios Vogiatzis
The American College of Greece, Athens, Greece
Dimitrios Vogiatzis

Authors

Katerina Zamani
View author publications
You can also search for this author in PubMed Google Scholar
Georgios Paliouras
View author publications
You can also search for this author in PubMed Google Scholar
Dimitrios Vogiatzis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Katerina Zamani .

Editor information

Editors and Affiliations

University of Copenhagen, Copenhagen, Denmark
Aasa Feragen
DAIS, Università Ca' Foscari Venezia, Venezia Mestre, Italy
Marcello Pelillo
Delft University of Technology, Delft, Zuid-Holland, The Netherlands
Marco Loog

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zamani, K., Paliouras, G., Vogiatzis, D. (2015). Similarity-Based User Identification Across Social Networks. In: Feragen, A., Pelillo, M., Loog, M. (eds) Similarity-Based Pattern Recognition. SIMBAD 2015. Lecture Notes in Computer Science(), vol 9370. Springer, Cham. https://doi.org/10.1007/978-3-319-24261-3_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-24261-3_14
Published: 25 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24260-6
Online ISBN: 978-3-319-24261-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics