Skip to main content

A Click Fraud Detection Scheme Based on Cost-Sensitive CNN and Feature Matrix

  • Conference paper
  • First Online:
Big Data and Security (ICBDS 2019)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1210))

Included in the following conference series:

Abstract

Around 30% revenue has been wasted due to different types of advertising fraud. In view of the continuous emergence of fraud and the improvement of fraudulent means, a click fraud detection of cost-sensitive convolutional neural network (CSCNN) based on feature matrix is proposed. In order to capture the pattern of click fraud, priori probability is introduced and a new feature set is derived from the knowledge of entropy. Then, using cost-based sampling method and threshold-moving cost sensitive learning algorithm to solve unbalanced data training problems. Features are transformed into feature matrix by different time windows to fit the input of a convolutional neural network. Finally, considering the convolution kernel size, the number of convolution kernels and the pooling size of different network structures in this work, effective CSCNN structure was chosen. Experiments on real-word data show that this method can complete click fraud detection effectively, and can complete multi classification without increasing the complexity of the model. A new effective method was provided on click fraud detection in mobile advertisement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Haider, C.M.R., Iqbal, A., Rahman, A.H., et al.: An ensemble learning based approach for impression fraud detection in mobile advertising. J. Netw. Comput. Appl. 112, 126–141 (2018)

    Article  Google Scholar 

  2. Yang, Y.-H., Huang, H.-H., Shen, Q.-N., et al.: Research on intrusion detection based on incremental GHSOM. Chin. J. Comput. 5, 1216–1224 (2014). (in Chinese)

    Google Scholar 

  3. He, H., Gimpel, K., Lin, J.: Multi-perspective sentence similarity modeling with convolutional neural networks. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP2015), Lisbon, Portugal, pp. 1576–1586 (2015)

    Google Scholar 

  4. Perera, K.S., Neupane, B., Faisal, M.A., Aung, Z., Woon, W.L.: A novel ensemble learning-based approach for click fraud detection in mobile advertising. In: Prasath, R., Kathirvalavakumar, T. (eds.) MIKE 2013. LNCS (LNAI), vol. 8284, pp. 370–382. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-03844-5_38

    Chapter  Google Scholar 

  5. Kitts, B., et al.: Click fraud detection: adversarial pattern recognition over 5 years at microsoft. In: Abou-Nasr, M., Lessmann, S., Stahlbock, R., Weiss, G.M. (eds.) Real World Data Mining Applications. AIS, vol. 17, pp. 181–201. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-07812-0_10

    Chapter  Google Scholar 

  6. Crussell, J., Stevens, R., Chen, H.: Madfraud: investigating ad fraud in android applications. In: Proceedings of the 12th Annual International Conference on Mobile Systems, Applications, and Services, MobiSys 2014, Bretton Woods, NH, USA, pp. 123–134 (2014)

    Google Scholar 

  7. Oentaryo, R., Lim, E.P., Finegold, M., et al.: Detecting click fraud in online advertising: a data mining approach. J. Mach. Learn. Res. 15(1), 99–140 (2014)

    MathSciNet  Google Scholar 

  8. King, M.A., Abrahams, A.S., Ragsdale, C.T.: Ensemble learning methods for pay-per-click campaign management. Expert Syst. Appl. 42(10), 4818–4829 (2015)

    Article  Google Scholar 

  9. Zhang, X., Liu, X.-J., Li, B., et al.: Application of SVM ensemble method to click fraud detection. J. Chin. Comput. Syst. 39(5), 951–956 (2018). (in Chinese)

    Google Scholar 

  10. Kim, Y.: Convolutional Neural Networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Doha, Qatar, pp. 1474–1480 (2014)

    Google Scholar 

  11. Wang, H.-Y., Dong, M.-W.: Latent group recommendation based on dynamic probabilistic matrix factorization model integrated with CNN. J. Comput. Res. Dev. 54(8), 1853–1863 (2017). (in Chinese)

    Google Scholar 

  12. Jiao, L., Yang, S.-Y., Liu, F., et al.: Seventy years beyond neural networks: retrospect and prospect. Chin. J. Comput. 39(8), 1697–1716 (2016). (in Chinese)

    Google Scholar 

  13. Li, H., Zhang, L., Zhou, X., et al.: Cost-sensitive sequential three-way decision modeling using a deep neural network. Int. J. Approx. Reason. 85(C), 68–78 (2017)

    Article  MathSciNet  MATH  Google Scholar 

  14. Arar, F.M., Ayan, K.: Software defect prediction using cost-sensitive neural network. Appl. Soft Comput. 33, 263–277 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xinyu Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, X., Zhang, X., Miao, Q. (2020). A Click Fraud Detection Scheme Based on Cost-Sensitive CNN and Feature Matrix. In: Tian, Y., Ma, T., Khan, M. (eds) Big Data and Security. ICBDS 2019. Communications in Computer and Information Science, vol 1210. Springer, Singapore. https://doi.org/10.1007/978-981-15-7530-3_6

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-7530-3_6

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-7529-7

  • Online ISBN: 978-981-15-7530-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics