Skip to main content

Click Ad Fraud Detection Using XGBoost Gradient Boosting Algorithm

  • Conference paper
  • First Online:
Computing Science, Communication and Security (COMS2 2021)

Abstract

The growth of the online advertising industry has created new business opportunities on the Internet. Companies and advertisers are turning to digital ad platforms like never before to compete for the attention of their audience. In this environment, actions such as clicking an ad result in financial transactions among advertisers, advertising networks and publishers. Since these new opportunities have financial impact, fraudsters have been trying to gain illegal advantages and profit through them. Mitigating the negative effects of illegal traffic is extremely important to the success of any marketing endeavor. Today, false clicks that waste budgets and don’t generate any meaningful value or revenue are costing advertisers billions of dollars. These are the biggest challenge PPC (Pay Per Click) marketers face, although there are efforts made by the advertisers to block fake traffic, they still try to find leading security strategy to identify click fraud. This paper analyzes the click fraud mechanism, focusing on its detection and methods of solution used in recent cases, we try and explain various fundamentals related to online advertising. The objective of this research is to propose solution for click ad fraud present in online advertising using the XGBoost Gradient Boosting algorithm and this model provides the accuracy of 96% with a set of hyperparameters along with features that can be implemented on datasets related to click frauds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Lin, Y. https://www.oberlo.in/blog/internet-statistics. Accessed 25 Nov 2020

  2. Keelery, S. https://www.statista.com/statistics/1040905/india-opinion-timespent-online-than-tv/. Accessed 25 Nov 2020

  3. Handley, L. https://www.cnbc.com/2017/12/04/global-advertising-spend-2020-online-and-offline-ad-spend-to-be-equal.html. Accessed 25 Nov 2020

  4. Braccialini, C. https://blog.hubspot.com/marketing/online-advertising. Accessed 25 Nov 2020

  5. Globsyn Business School Online, Digital Marketing Course (2020)

    Google Scholar 

  6. Kshetri, N., Voas, J.: Online advertising fraud. Computer 52(1), 58–61 (2019). https://doi.org/10.1109/MC.2018.2887322

    Article  Google Scholar 

  7. SEJ search engine journal. https://www.searchenginejournal.com/clickcease-howmuchppc-fraud-costing-your-business/357328/. Accessed 25 Nov 2020

  8. IPQuality score. https://www.ipqualityscore.com/articles/view/17/what-is-payper-click-ppc-fraud. Accessed 25 Nov 2020

  9. Serrano, T. https://thriveagency.com/news/what-is-click-fraud-and-how-do-youprevent-it/. Accessed 25 Nov 2020

  10. Lynch, O. https://www.cheq.ai/what-is-click-fraud. Accessed 25 Nov 2020

  11. Digital element. https://www.digitalelement.com/identify-proxiesfight-clickfraud-and-wasted-impressions/. Accessed 25 Nov 2020

  12. Whats new publishing. https://whatsnewinpublishing.com/publishers-shadowtraffic-problem-why-your-traffic-numbers-are-off-by-20/. Accessed 25 Nov 2020

  13. Thejas, G.S.: Deep learning-based model to fight against ad click fraud, pp. 176–181 (2019). https://doi.org/10.1145/3299815.3314453

  14. Nagaraja, S., Shah, R.: Clicktok: click fraud detection using traffic analysis, pp. 105–116 (2019). https://doi.org/10.1145/3317549.3323407

  15. Mouawi, R., Elhajj, I.H., Chehab, A., et al.: Crowdsourcing for click fraud detection. EURASIP J. Info. Secur. 2019, 11 (2019). https://doi.org/10.1186/s13635-019-0095-1

  16. Zhang, X., Liu, X., Guo, H.: A click fraud detection scheme based on cost sensitive BPNN and ABC in mobile advertising, pp. 1360–1365 (2018). https://doi.org/10.1109/CompComm.2018.8780941

  17. Almahmoud, S., Hammo, B., Al-Shboul, B.: Exploring non-human traffic in online digital advertisements: analysis and prediction (2019). https://doi.org/10.1007/978-3-030-28374-257

  18. Gabryel, M.: Data analysis algorithm for click fraud recognition (2018). https://doi.org/10.1007/978-3-319-99972-236

  19. Minastireanu, E., Mesnita, G.: Light GBM machine learning algorithm to online click fraud detection. J. Inf. Assur. Cybersecur. (2019). https://doi.org/10.5171/2019.263928

  20. Fallah, I.M., Zarifzadeh, S.: Practical detection of click spams using efficient classification-based algorithms. Int. J. Inf. Commun. Technol. Res. 10, 63–71 (2018)

    Google Scholar 

  21. Kaggle.com, TalkingData AdTracking fraud detection challenge (2018). https://www.kaggle.com/c/talkingdata-adtracking-fraud-detection. Accessed 25 Nov 2020

  22. Great learning academy, data visualization in python course (2019)

    Google Scholar 

  23. Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system, CoRR, abs/1603.02754 (2016)

    Google Scholar 

  24. Kaggle.com, Jose, Advertising dataset by Jose Portilla and Pierian Data (2018). https://www.kaggle.com/fayomi/advertising. Accessed 25 Nov 2020

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gohil, N.P., Meniya, A.D. (2021). Click Ad Fraud Detection Using XGBoost Gradient Boosting Algorithm. In: Chaubey, N., Parikh, S., Amin, K. (eds) Computing Science, Communication and Security. COMS2 2021. Communications in Computer and Information Science, vol 1416. Springer, Cham. https://doi.org/10.1007/978-3-030-76776-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-76776-1_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-76775-4

  • Online ISBN: 978-3-030-76776-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics