Abstract
In online advertisement industry, it is important to predict potentially profitable users who will click target ads (i.e., Behavioral targeting). The task selects the potential users that are likely to click the ads by analyzing user’s clicking/web browsing information and displaying the most relevant ads to them. This paper proposes four multiple criteria mathematical programming models for advertisement clicking problems. First two are multi-criteria linear regression (MCLR) and kernel-based multiple criteria regression (KMCR) algorithms for click-through rate (CTR) prediction. The second two are multi-criteria linear programming (MCLP) and kernel-based multiple criteria programming (KMCP) algorithms, which are used to predict ads clicking events, such as identifying clicked ads in a set of ads. Using the experimental datasets from KDD Cup 2012, the paper first conducts a comparison of the proposed MCLR and KMCR with the methods of support vector regression (SVR) and logistic regression (LR), which shows that both MCLR and KMCR are good alternatives. Then the paper further studies the performance between the proposed MCLP and KMCP algorithms with known algorithms, including support vector machines (SVM), LR, radial basis function network (RBFN), k-nearest neighbor algorithm (KNN) and Naïve Bayes (NB) in both prediction and selection processes. The studies show that the MCLP and KMCP models have better performance stability and can be used to effectively handle behavioral targeting application for online advertisement problems.
Similar content being viewed by others
References
Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines. ACM Trans. Int. Syst. Technol. 2 27:1-27:27, software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm (2011)
Chen, Y., Pavlov, D., John, F.: Large scale behavioral targeting. Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 209-218 (2009)
Chen, Y., Pavlov, D., Canny, J.F.: Behavioral targeting: the art of scaling up simple algorithms. ACM Trans. Knowl. Disc. Data 4(4), Article 17 (2010)
Cheng, H., Cantú-Paz, E.: Personalized click prediction in sponsored search. In Proceedings of the 3rd ACM international conference on Web search and data mining, pp. 351–360 (2010)
Grant,M., Boyd, S.: CVX: Matlab software for disciplined convex programming. version 1.21 (2011)
Guo, Q., Agichtein, E.: Ready to buy or just browsing?: detecting web searcher goals from interaction data. In Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pp. 130–137 (2010)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. SIGKDD Explor. Newsl. 11(1), 10–18 (2009)
Kou, G., et al.: Multiple criteria mathematical programming for multi-class classification and application in network intrusion detection. Inf. Sci. 179(4), 371–381 (2009)
Kwak, W., Shi, Y., Cheh, J.J.: Firm bankruptcy prediction using multiple criteria linear programming data mining approach. Adv. Invest. Anal. Portf. Manag. 2, 27–49 (2006)
Kwak, W., et al.: Bankruptcy prediction for Japanese firms: using multiple criteria linear programming data mining approach. Int. J. Bus. Int. Data Min. 1(4), 401–416 (2006)
Li, J., Zhang, P.: Efficient behavioral targeting Using SVM Ensemble Indexing. Proceedings of IEEE International Conference on Data Mining (ICDM), pp. 409-418 (2012)
Niu, L., Zhao, X., Shi, Y.: A simple regularized multiple criteria linear programs for binary classification. Proc. Comput. Sci. 18, 1690–1699 (2013)
Peng, Y., et al.: A multi-criteria convex quadratic programming model for credit data analysis. Decis. Support. Syst. 44(4), 1016–1030 (2008)
Richardson, M., Dominowska, E., Ragno, R.: Predicting clicks: estimating the click-through rate for new ads. In Proceedings of the 16th international conference on World Wide Web, 521–530 (2007)
Richardson, M., Dominowska, E., Ragno, R.: Predicting clicks: estimating the click-through rate for new ads. International World Wide Web Conference (WWW 2007), pp. 521-530 (2007)
Schlosser, A.E., Shavitt, S., Kanfer, A.: Survey of Internet users’ attitudes toward Internet advertising. J. Interact. Mark. 13(3), 34–54 (1999)
Shi, Y.: Multiple criteria optimization based data mining methods and applications: a systematic survey. Knowl. Inf. Syst. 24(3), 369–391 (2010)
Shi, Y., Peng, Y., Xu, W.: Data mining via multiple criteria linear programming: applications in credit card portfolio management. Int. J. Inf. Technol. Decis. Mak. 1(1), 131–151 (2002)
Shi, Y., Tian, Y., Kou, G., Peng, Y., Li, J.: Optimization Based Data Mining: Theory and Applications, doi:10.1007/978-0-85729-504-0 (2011)
Shi, Y., et al.: A multiple-criteria quadratic programming approach to network intrusion detection. Data Min. Knowl. Manag. Lect. Notes Comput. Sci. 3327, 145–153 (2005)
Shi, Y. et al.: Data mining in credit card portfolio management: a multiple criteria decision making approach. Lecture notes in economics and mathematical systems, pp. 427–436 (2001)
Vapnik, V.N.: The nature of statistical learning theory (Second Edition). Springer, New York (2000)
Yu, P.L.: A class of solutions for group decision problems. Manag. Sci. 19(8), 936–946 (1973)
Zhang, D.L., Shi, Y., Tian, Y.J., Zhu, M.H.: A class of classification and regression methods by multi objective programming. Front. Comput. Sci. China 2(3), 192–204 (2009)
Zhang, Z., Shi, Y., Gao, G.: A rough set-based multiple criteria linear programming approach for the medical diagnosis and prognosis. Expert Syst. Appl. 36(5), 8932–8937 (2009)
Zhao, X., Deng, W., Shi, Y.: Feature selection with attributes clustering by maximal information coefficient. In Proceedings of 1st International Conference on Information Technology and Quantitative Management, volume 17, pp. 70–79 (2013)
Zhao, X., Shi, Y., Lingfeng, N.: Kernel-based simple regularized multiple criteria linear program for binary classification and regression. Int. Data Anal. 19(3) (2015). (To be appear)
Zheng, J., et al.: Classification of HIV-I mediated neuronal dendritic and synaptic damage using multiple criteria linear programming. Neuroinformatics 2(3), 303–326 (2004)
Acknowledgments
The authors thank the anonymous reviewers for helping to improve this paper. This work was partially supported by the National Nature Science Foundation of China (Grant No.70921061, 71331005), the CAS/SAFEA International Partnership Program for Creative Research Teams, International (Region) Joint Research Project (No.71110107026).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, J., Shi, Y., Wang, F. et al. Advertisement clicking prediction by using multiple criteria mathematical programming. World Wide Web 19, 707–724 (2016). https://doi.org/10.1007/s11280-015-0353-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-015-0353-1