Detection of Advertising Users Based on K-SMOTE and Ensemble Learning

Qiu, Zihan; Zhou, Zekai; Long, Yongxu; Ji, Chang; Li, Jianguo; Tang, Yong

doi:10.1007/978-3-031-23741-6_12

Zihan Qiu¹²,
Zekai Zhou¹²,
Yongxu Long¹²,
Chang Ji¹²,
Jianguo Li¹² &
…
Yong Tang¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13795))

Included in the following conference series:

International Conference on Human Centered Computing

Abstract

Aiming at the problem of the unbalanced advertising user data of social networks leading to unsatisfactory prediction results, we propose a prediction model for advertising users based on the combination among K-Means, synthetic minority oversampling Technique (SMOTE), and Ensemble Learning. On the basis of the real user data provided by Scholat, we analyzed the data and extracted many key features from it to draw a portrait of advertising users. Our algorithm first clusters the minority class, and then processes the continuous and discrete features of each sample separately through the improved SMOTE to synthesize new minority samples, and finally constructs an integrated classifier using the ensemble learning. This method effectively avoids the problems of blurred positive and negative class boundaries caused by SMOTE and the inability of SMOTE to process discrete features. Meanwhile, ensemble learning enables the classifier to get more reasonable results and reduce overall errors. The experimental results show that our method improves the quality of the generated minority class samples and significantly improves the prediction performance of advertising users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Oversampling Method Based Covariance Matrix Estimation in High-Dimensional Imbalanced Classification

Imbalanced Classification: Challenges and Approaches to Handle

ISMOTE: A More Accurate Alternative for SMOTE

Article Open access 04 October 2024

Notes

References

Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16(1), 321–357 (2002)
Article MATH Google Scholar
Xixian, P., Qinghua, Z., Xuan, L.: Research on behavior characteristics and classification of micro-blog. Inf. Sci. 033(001), 69–75 (2015)
Google Scholar
Meng, X., Xu, L., Wang, S.: Spam analysis and detection of social network based on sina weibo. Sci. Technol. 000(015), 125–127 (2014)
Google Scholar
Stringhini, G., Kruegel, C., Vigna, G.: Detecting spammers on social networks. In: Twenty-Sixth Annual Computer Security Applications Conference, ACSAC 2010, Austin, Texas, USA, 6–10 December 2010 (2010)
Google Scholar
Benevenuto, F., Rodrigues, T., Almeida, V., Almeida, J.M., Gonçalves, M.: Detecting spammers and content promoters in online video social networks. In: IEEE (2009)
Google Scholar
Hui, H., Wang, W.Y., Mao, B.H.: Borderline-smote: A new over-sampling method in imbalanced data sets learning. In: Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I (2005)
Google Scholar
Sánchez, A.I., Morales, E.F., Gonzalez, J.A.: Synthetic oversampling of instances using clustering. Int. J. Artif. Intell. Tools 22(02), 1350008 (2013). https://doi.org/10.1142/S0218213013500085
Article Google Scholar
Barua, S., Islam, M.M., Yao, X., Murase, K.: MWMOTE--majority weighted minority oversampling technique for imbalanced data set learning. IEEE Trans. Knowl. Data Eng. 26(2), 405–425 (2014). https://doi.org/10.1109/TKDE.2012.232
Article Google Scholar
Douzas, G., Bacao, F., Last, F.: Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf. Sci. 465, 1–20 (2018). https://doi.org/10.1016/j.ins.2018.06.056
Article Google Scholar
Ruan, Q., Qingfeng, W., Wang, Y., Liu, X., Miao, F.: Effective learning model of user classification based on ensemble learning algorithms. Computing 101(6), 531–545 (2018). https://doi.org/10.1007/s00607-018-0688-4
Article MathSciNet Google Scholar

Download references

Acknowledgements

We thank the anonymous reviewers for their insightful comments. This work was supported by National Natural Science Foundation of China under grant number U1811263, by National Natural Science Foundation of China under grant number 6177221.

Author information

Authors and Affiliations

South China Normal University, Guangzhou, 510630, Guangdong, China
Zihan Qiu, Zekai Zhou, Yongxu Long, Chang Ji, Jianguo Li & Yong Tang

Authors

Zihan Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Zekai Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yongxu Long
View author publications
You can also search for this author in PubMed Google Scholar
Chang Ji
View author publications
You can also search for this author in PubMed Google Scholar
Jianguo Li
View author publications
You can also search for this author in PubMed Google Scholar
Yong Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianguo Li .

Editor information

Editors and Affiliations

Wuhan University of Technology, Wuhan, China
Qiaohong Zu
South China Normal University, Guangzhou, China
Yong Tang
University of Kragujevac, Kragujevac, Serbia
Vladimir Mladenovic
Huawei Technologies Co., Ltd., Oak Way, UK
Aisha Naseer
University of Birmingham, Birmingham, UK
Jizheng Wan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qiu, Z., Zhou, Z., Long, Y., Ji, C., Li, J., Tang, Y. (2022). Detection of Advertising Users Based on K-SMOTE and Ensemble Learning. In: Zu, Q., Tang, Y., Mladenovic, V., Naseer, A., Wan, J. (eds) Human Centered Computing. HCC 2021. Lecture Notes in Computer Science, vol 13795. Springer, Cham. https://doi.org/10.1007/978-3-031-23741-6_12

Download citation

DOI: https://doi.org/10.1007/978-3-031-23741-6_12
Published: 01 January 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23740-9
Online ISBN: 978-3-031-23741-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Detection of Advertising Users Based on K-SMOTE and Ensemble Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Oversampling Method Based Covariance Matrix Estimation in High-Dimensional Imbalanced Classification

Imbalanced Classification: Challenges and Approaches to Handle

ISMOTE: A More Accurate Alternative for SMOTE

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Detection of Advertising Users Based on K-SMOTE and Ensemble Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Oversampling Method Based Covariance Matrix Estimation in High-Dimensional Imbalanced Classification

Imbalanced Classification: Challenges and Approaches to Handle

ISMOTE: A More Accurate Alternative for SMOTE

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation