Abstract
Fake accounts that are run by fake identities are causing many problems within the Sudanese online community on Facebook. While research is focused on automatic and semi-automatic accounts, human fake accounts are often neglected. A general characterization of these accounts needs to be considered based on the cultural context they exist within. This research interviewed 250 Sudanese persons on Facebook who fell victim of eight of these accounts. Data was manually harvested for both confirmed fake accounts and confirmed real accounts. The dataset included 231 instances which was imbalanced and skewed toward the real accounts. Over-sampling with SMOTE was applied to treat the over-fitting problem of the machine learning models. Supervised classification algorithms achieved an accuracy of up to 89.7% and an AUC of 0.96 in detecting fake accounts. Furthermore, human-based methods, such as profile image verification and history, were identified by the interviewees as ways to assert the legitimacy of an account.






Similar content being viewed by others
Notes
https://www.youtube.com/watch?v=MDEiRoxQcoM, accessed August 28th, 2019.
https://www.sudanakhbar.com/356560, accessed August 28th, 2019.
https://www.alnilin.com/12977511.htm, accessed August 28th, 2019.
https://goo.gl/uLA4pT, accessed August 28th, 2019.
References
El Azab A, Idrees AM, Mahmoud MA, Hefny H (2016) Fake account detection in Twitter based on minimum weighted feature set. Int J Comput Electr Autom Control Inf Eng 10(1):13–18
Sample C, McAlaney J, Bakdash J, Thackray H (2018) “A cultural exploration of the social media manipulators,” in European conference on information warfare and security, ECCWS, 2018-June, 432–440
Gurajala S, White JS, Hudson B, Voter BR, Matthews JN (2016) Profile characteristics of fake Twitter accounts. Big Data Soc 3(2):205395171667423
Romanov A, Semenov A, Mazhelis O, Veijalainen J (2017) “Detection of fake profiles in social media: literature review,” in WEBIST 2017 - proceedings of the 13th international conference on web information systems and technologies, 363–369
Tuna T et al (2016) User characterization for online social networks. Social Network Analysis and Mining 6:1
Romanov A, Semenov A, Veijalainen J (2017) “Revealing fake profiles in social networks by longitudinal data analysis,” in WEBIST 2017 - proceedings of the 13th international conference on web information systems and technologies, 51–58
Mohammadrezaei M, Shiri ME, Rahmani AM (2018) “Identifying fake accounts on social networks based on graph analysis and classification algorithms,” Secur Commun Networks, vol. 2018
Patel K, Agrahari S, Srivastava S (2020) Survey on fake profile detection on social sites by using machine learning algorithm. In: ICRITO 2020 - IEEE 8th international conference on reliability, infocom technologies and optimization (trends and future directions), pp 1236–1240
Roy PK, Chahar S (2021) “Fake profile detection on social networking websites: a comprehensive review,” 4581, c, 1–15
Breuer A, Eilat R, Weinsberg U (2020) “Friend or faux: graph-based early detection of fake accounts on social networks,” arXiv
Wang AH (2010) “Don’t follow me - spam detection in Twitter,” in SECRYPT 2010 - proceedings of the international conference on security and cryptography, 142–151
Fire M, Kagan D, Elyashar A, Elovici Y (2014) Friend or foe? Fake profile identification in online social networks. Soc Netw Anal Min 4(1):1–23
Wani MA, Agarwal N, Jabin S, Hussain SZ (2019) “Analyzing real and fake users in Facebook network based on emotions,” in 2019 11th international conference on communication systems and networks, COMSNETS 2019, 110–117
Ramalingam D, Chinnaiah V (2018) Fake profile detection techniques in large-scale online social networks: a comprehensive review. Comput Electr Eng 65(3):165–177
Myo M, Swe M, Nyein P, Myo N (2018) “Fake accounts classification on Twitter,” 03, 06, 141–146
Gupta A, Kaushal R (2017) “Towards detecting fake user accounts in Facebook,” in ISEA Asia security and privacy conference 2017, ISEASP 2017
Xuan Y, Chen Y, Li H, Hui P, Shi L (2016) “LBSNShield: malicious account detection in location-based social networks,” in Proceedings of the ACM conference on computer supported cooperative work, CSCW, 26-Februar, 437–440
Van Der Walt E, Eloff JHP (2017) “Identity deception detection on social media platforms,” in ICISSP 2017 - proceedings of the 3rd international conference on information systems security and privacy, 2017-Janua, 573–578
Sen S (2015) “An overview of data mining and marketing,” 254–259
Chawla NCN (2005) “Data mining for imbalanced datasets: an overview,” Data min knowl discov handbook, 853–867
Wani SY, Kirmani MM, Ansarulla SI (2016) Prediction of fake profiles on Facebook using supervised machine learning techniques-a theoretical model. Int J Comput Sci Inf Technol 7(4):1735–1738
Johansson U, Sönströd C, Norinder U, Boström H, Löfström T (2010) Using feature selection with bagging and rule extraction in drug discovery. Smart Innov Syst Technol 4:413–422
He H, Ma Y (2013) Imbalanced learning: foundations, algorithms, and applications
Branco P, Torgo L, Ribeiro R (2015) “A survey of predictive modelling under imbalanced distributions"
Kong J, Rios T, Kowalczyk W, Menzel S, Bäck T (2020) “On the performance of oversampling techniques for class imbalance problems,” in Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 12085 LNAI, pp. 84–96
van der Walt E, Eloff JHP (2018) “Are attributes on social media platforms usable for assisting in the automatic detection of identity deception?,” in Proceedings of the twelfth international symposium on human aspects of information security & assurance (HAISA 2018), 57–66
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author declares no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Elhussein, M. Is it Sarrah Rahamah? A supervised classification model to detect fake identities on Facebook within the Sudanese community. Pers Ubiquit Comput 27, 107–118 (2023). https://doi.org/10.1007/s00779-022-01664-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00779-022-01664-2