Skip to main content

Advertisement

Log in

Is it Sarrah Rahamah? A supervised classification model to detect fake identities on Facebook within the Sudanese community

  • Original Article
  • Published:
Personal and Ubiquitous Computing Aims and scope Submit manuscript

Abstract

Fake accounts that are run by fake identities are causing many problems within the Sudanese online community on Facebook. While research is focused on automatic and semi-automatic accounts, human fake accounts are often neglected. A general characterization of these accounts needs to be considered based on the cultural context they exist within. This research interviewed 250 Sudanese persons on Facebook who fell victim of eight of these accounts. Data was manually harvested for both confirmed fake accounts and confirmed real accounts. The dataset included 231 instances which was imbalanced and skewed toward the real accounts. Over-sampling with SMOTE was applied to treat the over-fitting problem of the machine learning models. Supervised classification algorithms achieved an accuracy of up to 89.7% and an AUC of 0.96 in detecting fake accounts. Furthermore, human-based methods, such as profile image verification and history, were identified by the interviewees as ways to assert the legitimacy of an account.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://www.youtube.com/watch?v=MDEiRoxQcoM, accessed August 28th, 2019.

  2. https://www.sudanakhbar.com/356560, accessed August 28th, 2019.

  3. https://www.alnilin.com/12977511.htm, accessed August 28th, 2019.

  4. https://goo.gl/uLA4pT, accessed August 28th, 2019.

References

  1. El Azab A, Idrees AM, Mahmoud MA, Hefny H (2016) Fake account detection in Twitter based on minimum weighted feature set. Int J Comput Electr Autom Control Inf Eng 10(1):13–18

    Google Scholar 

  2. Sample C, McAlaney J, Bakdash J, Thackray H (2018) “A cultural exploration of the social media manipulators,” in European conference on information warfare and security, ECCWS, 2018-June, 432–440

  3. Gurajala S, White JS, Hudson B, Voter BR, Matthews JN (2016) Profile characteristics of fake Twitter accounts. Big Data Soc 3(2):205395171667423

    Article  Google Scholar 

  4. Romanov A, Semenov A, Mazhelis O, Veijalainen J (2017) “Detection of fake profiles in social media: literature review,” in WEBIST 2017 - proceedings of the 13th international conference on web information systems and technologies, 363–369

  5. Tuna T et al (2016) User characterization for online social networks. Social Network Analysis and Mining 6:1

  6. Romanov A, Semenov A, Veijalainen J (2017) “Revealing fake profiles in social networks by longitudinal data analysis,” in WEBIST 2017 - proceedings of the 13th international conference on web information systems and technologies, 51–58

  7. Mohammadrezaei M, Shiri ME, Rahmani AM (2018) “Identifying fake accounts on social networks based on graph analysis and classification algorithms,” Secur Commun Networks, vol. 2018

  8. Patel K, Agrahari S, Srivastava S (2020) Survey on fake profile detection on social sites by using machine learning algorithm. In: ICRITO 2020 - IEEE 8th international conference on reliability, infocom technologies and optimization (trends and future directions), pp 1236–1240

  9. Roy PK, Chahar S (2021) “Fake profile detection on social networking websites: a comprehensive review,” 4581, c, 1–15

  10. Breuer A, Eilat R, Weinsberg U (2020) “Friend or faux: graph-based early detection of fake accounts on social networks,” arXiv

  11. Wang AH (2010) “Don’t follow me - spam detection in Twitter,” in SECRYPT 2010 - proceedings of the international conference on security and cryptography, 142–151

  12. Fire M, Kagan D, Elyashar A, Elovici Y (2014) Friend or foe? Fake profile identification in online social networks. Soc Netw Anal Min 4(1):1–23

    Article  Google Scholar 

  13. Wani MA, Agarwal N, Jabin S, Hussain SZ (2019) “Analyzing real and fake users in Facebook network based on emotions,” in 2019 11th international conference on communication systems and networks, COMSNETS 2019, 110–117

  14. Ramalingam D, Chinnaiah V (2018) Fake profile detection techniques in large-scale online social networks: a comprehensive review. Comput Electr Eng 65(3):165–177

    Article  Google Scholar 

  15. Myo M, Swe M, Nyein P, Myo N (2018) “Fake accounts classification on Twitter,” 03, 06, 141–146

  16. Gupta A, Kaushal R (2017) “Towards detecting fake user accounts in Facebook,” in ISEA Asia security and privacy conference 2017, ISEASP 2017

  17. Xuan Y, Chen Y, Li H, Hui P, Shi L (2016) “LBSNShield: malicious account detection in location-based social networks,” in Proceedings of the ACM conference on computer supported cooperative work, CSCW, 26-Februar, 437–440

  18. Van Der Walt E, Eloff JHP (2017) “Identity deception detection on social media platforms,” in ICISSP 2017 - proceedings of the 3rd international conference on information systems security and privacy, 2017-Janua, 573–578

  19. Sen S (2015) “An overview of data mining and marketing,” 254–259

  20. Chawla NCN (2005) “Data mining for imbalanced datasets: an overview,” Data min knowl discov handbook, 853–867

  21. Wani SY, Kirmani MM, Ansarulla SI (2016) Prediction of fake profiles on Facebook using supervised machine learning techniques-a theoretical model. Int J Comput Sci Inf Technol 7(4):1735–1738

    Google Scholar 

  22. Johansson U, Sönströd C, Norinder U, Boström H, Löfström T (2010) Using feature selection with bagging and rule extraction in drug discovery. Smart Innov Syst Technol 4:413–422

    Article  Google Scholar 

  23. He H, Ma Y (2013) Imbalanced learning: foundations, algorithms, and applications

  24. Branco P, Torgo L, Ribeiro R (2015) “A survey of predictive modelling under imbalanced distributions"

  25. Kong J, Rios T, Kowalczyk W, Menzel S, Bäck T (2020) “On the performance of oversampling techniques for class imbalance problems,” in Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 12085 LNAI, pp. 84–96

  26. van der Walt E, Eloff JHP (2018) “Are attributes on social media platforms usable for assisting in the automatic detection of identity deception?,” in Proceedings of the twelfth international symposium on human aspects of information security & assurance (HAISA 2018), 57–66

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mariam Elhussein.

Ethics declarations

Conflict of interest

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Elhussein, M. Is it Sarrah Rahamah? A supervised classification model to detect fake identities on Facebook within the Sudanese community. Pers Ubiquit Comput 27, 107–118 (2023). https://doi.org/10.1007/s00779-022-01664-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00779-022-01664-2

Keywords

Navigation