A Solution to Tweet-Based User Identification Across Online Social Networks

  • Conference paper
Advanced Data Mining and Applications (ADMA 2017)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10604)

Abstract

User identification helps build better user profiles and benefits many applications, and it has attracted considerable scholarly attention. Existing approaches with good performance rely mainly on rich online data. However, due to privacy settings, such rich data is costly or even difficult to obtain. Besides, some profile attributes are not required to be unique and are easily faked by users for different purposes, which makes the existing schemes quite fragile. Users often publish their activities publicly on different social networks, which provides a way to overcome this problem. We aim to address user identification based only on users’ tweets. We first formulate the tweet-based user identification problem and propose a tweet-based user identification model. We then present a solution based on supervised machine learning. It consists of three key steps: first, we propose several algorithms to measure the spatial, temporal and content similarity of two tweets; second, we extract spatial, temporal and content features to exploit information redundancies; third, we employ a machine learning method for user identification. Experiments show that the proposed solution performs well, with F1 values reaching 89.79%, 86.78% and 86.24% on three ground truth datasets, respectively. This work shows that user identification is possible with online data that is easily accessible and not easily impersonated.
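The three steps can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the similarity algorithms, so the haversine-based spatial score, the exponential time decay, the Jaccard word overlap, the `Tweet` fields, the scale parameters and the max/mean aggregation are all assumptions chosen for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Tweet:
    """Hypothetical tweet record; field names are assumptions."""
    lat: float        # latitude in degrees
    lon: float        # longitude in degrees
    timestamp: float  # seconds since the epoch
    text: str


def spatial_similarity(a: Tweet, b: Tweet, scale_km: float = 10.0) -> float:
    """Assumed measure: exponential decay of the haversine distance."""
    radius_km = 6371.0
    p1, p2 = math.radians(a.lat), math.radians(b.lat)
    dphi = p2 - p1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    # Clamp to guard against rounding slightly above 1 for antipodal points.
    dist_km = 2 * radius_km * math.asin(min(1.0, math.sqrt(h)))
    return math.exp(-dist_km / scale_km)


def temporal_similarity(a: Tweet, b: Tweet, scale_s: float = 3600.0) -> float:
    """Assumed measure: exponential decay of the posting-time gap."""
    return math.exp(-abs(a.timestamp - b.timestamp) / scale_s)


def content_similarity(a: Tweet, b: Tweet) -> float:
    """Assumed measure: Jaccard overlap of lower-cased word sets."""
    wa, wb = set(a.text.lower().split()), set(b.text.lower().split())
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def pair_features(tweets_a: list, tweets_b: list) -> list:
    """Feature vector for one candidate account pair.

    For each similarity measure, take the max and the mean over all
    cross-network tweet pairs (an assumed aggregation).
    """
    features = []
    for sim in (spatial_similarity, temporal_similarity, content_similarity):
        scores = [sim(a, b) for a in tweets_a for b in tweets_b]
        features.append(max(scores, default=0.0))
        features.append(sum(scores) / len(scores) if scores else 0.0)
    return features
```

In this sketch, each candidate account pair yields a six-element feature vector. For the third step, such vectors, labelled as same user or different users, could be passed to any supervised classifier.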

Notes

  1. http://scikit-learn.org/stable/

Author information

Corresponding author

Correspondence to Yongjun Li.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Li, Y., Zhang, Z., Peng, Y. (2017). A Solution to Tweet-Based User Identification Across Online Social Networks. In: Cong, G., Peng, WC., Zhang, W., Li, C., Sun, A. (eds) Advanced Data Mining and Applications. ADMA 2017. Lecture Notes in Computer Science, vol 10604. Springer, Cham. https://doi.org/10.1007/978-3-319-69179-4_18

  • DOI: https://doi.org/10.1007/978-3-319-69179-4_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-69178-7

  • Online ISBN: 978-3-319-69179-4

  • eBook Packages: Computer Science, Computer Science (R0)
