Abstract
User identification can help us build better users’ profiles and benefit many applications. It has attracted many scholars’ attention. The existing works with good performance are mainly based on the rich online data. However, due to the privacy settings, it is costless or even difficult to obtain the rich data. Besides some profile attributes do not require exclusivity and are easily faked by users for different purposes. This makes the existing schemes are quite fragile. Users often publicly publish their activities on different social networks. This provides a way to overcome the above problem. We aim to address the user identification only based on users’ tweets. We first formulate the user identification based on tweets and propose a tweet-based user identification model. Then a supervised machine learning based solution is presented. It consists of three key steps: first, we propose several algorithms to measure the spatial similarity, temporal similarity and content similarity of two tweets; second, we extract the spatial, temporal and content features to exploit information redundancies; Afterwards, we employ the machine learning method for user identification. The experiment shows that the proposed solution can provide excellent performance with F1 values reaching 89.79%, 86.78% and 86.24% on three ground truth datasets, respectively. This work shows the possibility of user identification with easily accessible and not easily impersonated online data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Global Social Media Ranking (2017). https://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/
Zhou, X.P., Liang, X., Zhang, H.Y., et al.: Cross-platform identification of anonymous identical users in multiple social media networks. IEEE Trans. Knowl. Data Eng. 28(2), 411–424 (2016)
Liu, J., Zhang, F., Song, X.Y., et al.: What’s in a name?: an unsupervised approach to link users across communities. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, pp. 495–504 (2013)
Zafarani, R., Liu, H.: Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 41–49 (2013)
Iofciu, T., Fankhauser, P., Abel, F., et al.: Identifying users across social tagging systems. In: Proceedings of 5th International AAAI Conference on Weblogs and Social Media, pp. 522–525 (2011)
Motoyama, M., Varghese, G.: I seek you: searching and matching individuals in social networks. In: Proceedings of 7th International Workshop on Web Information and Data Management, pp. 67–75 (2009)
Abel, F., Herder, E., Houben, G.-J., et al.: Cross-system user modeling and personalization on the social web. User Model. User Adapt. Interact. 23(2), 169–209 (2013)
Raad, E., Dipanda, A., Chbeir, R.: User profile matching in social networks. In: Proceedings of 16th International Conference on Network-Based Information Systems, pp. 297–304 (2010)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: Proceedings of 1st International Conferences on Networked Digital Technologies, pp. 360–365 (2009)
Jain, P., Kumaraguru, P., Joshi, A.: @ i seek ‘fb. me’: identifying users across multiple online social networks. In: Proceedings of the 22nd International Conference on World Wide Web Companion, pp. 1259–1268 (2013)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across social networks using the web profile and friend network. Int. J. Web Appl. 2(1), 23–34 (2010)
Buccafurri, F., Lax, G., Nocera, A., Ursino, D.: Discovering links among social networks. In: Flach, P.A., Bie, T., Cristianini, N. (eds.) ECML PKDD 2012. LNCS (LNAI), vol. 7524, pp. 467–482. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33486-3_30
Tan, S., Guan, Z.Y., Cai, D., et al.: Mapping users across networks by manifold alignment on hypergraph. In: Proceedings of 28th AAAI Conference on Artificial Intelligence, pp. 159–165 (2014)
You, G.-W., Hwang, S.-W., Nie, Z.Q., et al.: SocialSearch: enhancing entity search with social network matching. In: Proceedings of the 14th International Conference on Extending Database Technology, pp. 515–519 (2011)
Goga, O.: Matching user accounts across online social networks: methods and applications. Ph.D. Dissertation, Universite Pierre etmarie curie – Pairs 6, Franch (2014)
Vesdapunt, N., Hector, G.-M.: Identifying users in social networks with limited information. In: Proceedings of the IEEE 31st International Conference on Data Engineering, pp. 627–638 (2015)
Huang, S.R., Zhang, J., Lu, S.Y., et al.: Social friend recommendation based on network correlation and feature co-clustering. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 315–322 (2015)
Zafarani, R., Tang, L., Liu, H.: User identification across social media. ACM Trans. Knowl. Discov. Data 10(2), 1–30 (2015)
Shmatikov, V., Narayanan, A.: De-anonymizing social networks. In: Proceedings of IEEE Symposium on Security and Privacy, pp. 173–187 (2009)
Bartunov, S., Korshunov, A., Park, S.-T., et al.: Joint link-attribute user identity resolution in online social networks. In: Proceedings of 6th SNA-KDD Workshop (2012)
Korula, N., Lattanzi, S.: An efficient reconciliation algorithm for social networks. Proc. VLDB Endow. 7(5), 377–388 (2013)
Kong, X.N., Zhang, J.W., Yu, P.-S.: Inferring anchor links across multiple heterogeneous social networks. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 179–188 (2013)
Goga, O., Lei, H., Hari, S., et al.: Exploiting innocuous activity for correlating users across sites. In Proceedings of the 22nd International Conference on World Wide Web, pp. 447–458 (2013)
Sajadmanesh, S., Rabiee, H.R., Khodadadi, A.: Predicting anchor links between heterogeneous social networks. In: Proceedings of 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 158–163 (2016)
Zhang, J.W., Kong, X.N., Yu, P.-S.: Predicting social links for new users across aligned heterogeneous social networks. In: Proceedings of IEEE 13th International Conference on Data Mining, pp. 1289–1294 (2013)
Jain, P., Kumaraguru, P.: Finding nemo: searching and resolving identities of users across online social networks. arXiv preprint 2012. arxiv:1212.6147
Perito, D., Castelluccia, C., Kaafar, M., et al.: How unique and traceable are usernames? In: Proceedings of 11th International Conference on Privacy Enhancing Technologies, pp. 1–17 (2011)
Jiang, X., Wei, S.K., Zhao, R.Z., et al.: Camera fingerprint: a new perspective for identifying user’s identity. arXiv preprint arxiv: 1610.07728 (2016)
Liu, D., Wu, Q.Y., Han, W.H.: User identification across multiple websites based on usern-ame features. Chin. J. Comput. 38(10), 2028–2040 (2015)
Bennacer, N., Nana Jipmo, C., Penta, A., Quercini, G.: Matching user profiles across social networks. In: Jarke, M., Mylopoulos, J., Quix, C., Rolland, C., Manolopoulos, Y., Mouratidis, H., Horkoff, J. (eds.) CAiSE 2014. LNCS, vol. 8484, pp. 424–438. Springer, Cham (2014). doi:10.1007/978-3-319-07881-6_29
Malhotra, A., Totti, L., Meira, W., et al.: Studying user footprints in different online social networks. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining, pp. 1065–1070 (2012)
Almishari, M., Tsudik, G.: Exploring linkability of user reviews. In: Foresti, S., Yung, M., Martinelli, F. (eds.) ESORICS 2012. LNCS, vol. 7459, pp. 307–324. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33167-1_18
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Li, Y., Zhang, Z., Peng, Y. (2017). A Solution to Tweet-Based User Identification Across Online Social Networks. In: Cong, G., Peng, WC., Zhang, W., Li, C., Sun, A. (eds) Advanced Data Mining and Applications. ADMA 2017. Lecture Notes in Computer Science(), vol 10604. Springer, Cham. https://doi.org/10.1007/978-3-319-69179-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-69179-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69178-7
Online ISBN: 978-3-319-69179-4
eBook Packages: Computer ScienceComputer Science (R0)