Abstract
Classification is an important data mining technique for classifying the items according to a series of associated features. Even so, most of them are not stable in performance, and they may get high classification accuracy rate in some datasets but poor in other issues. To solve this problem, in this paper, an integrated algorithm is proposed to keep balance between the classification accuracy rate and stability. The proposed algorithm integrates the K-Nearest Neighbor (KNN), Naive Bayes (NB), Regression Tree (RT), Random Forest (RF), Bagging, and Discriminant Analysis Classifier (DAC) using forecasting probability strategies. Specifically, the majority voting strategy and weighted voting strategy are presented using the forecasting probability obtained from the classification algorithms. To demonstrate the effectiveness of the proposed algorithm, numerous experiments are conducted by applying the classification algorithms to real mobile APP statistics. Results indicate that it can get a comprehensive and stable classification accuracy rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liu, J.W.: Using big data database to construct new Gfuzzy text mining and decision algorithm for targeting and classifying customers. Comput. Ind. Eng. 128, 1088–1095 (2018)
Liu, Q., Liu, C.: A novel locally linear KNN method with applications to visual recognition. IEEE Trans. Neural Netw. Learn. Syst. 28(9), 2010–2021 (2016)
Sahu, S.K., Kumar, P., Singh, A.P.: Modified K-NN algorithm for classification problems with improved accuracy. Int. J. Inf. Technol. 10(1), 65–70 (2017)
Wolfson, J., Bandyopadhyayy, S., et al.: A naive Bayes machine learning approach to risk prediction using censored, time-to-event data. Stat. Med. 34(21), 2941–2957 (2015)
Hu, C., Steingrimsson, J.A.: Personalized risk prediction in clinical oncology research: applications and practical issues using survival trees and random forests. J. Biopharm. Stat. 1–17 (2017)
Arar, Ö.F., Ayan, K.: A feature dependent naive Bayes approach and its application to the software defect prediction problem. Appl. Soft Comput. 59, 197–209 (2017)
Cano, G., Garcia-Rodriguez, J., Garcia-Garcia, A., Perez-Sanchez, H., Benediktsson, J.A., Thapa, A., et al.: Automatic selection of molecular descriptors using random forest: application to drug discovery. Expert Syst. Appl. 72, 151–159 (2017)
Yoonseok, S.: Application of boosting regression trees to preliminary cost estimation in building construction projects. Comput. Intell. Neurosci. 1–9 (2015)
Yang, R.M., Zhang, G.L., et al.: Comparison of boosted regression tree and random forest models for mapping topsoil organic carbon concentration in an alpine ecosystem. Ecol. Indic. 60, 870–878 (2016)
Prabhakar Karthikeyan, S., Jacob Raglend, I., Sathish Kumar, K., Kumar Sahoo, S., Priya Esther, B.: Application of SVM as classifier in estimating market power under deregulated electricity market. In: Kamalakannan, C., Suresh, L.P., Dash, S.S., Panigrahi, B.K. (eds.) Power Electronics and Renewable Energy Systems. LNEE, vol. 326, pp. 1309–1317. Springer, New Delhi (2015). https://doi.org/10.1007/978-81-322-2119-7_127
Zareapoor, M., Shamsolmoali, P.: Application of credit card fraud detection: based on bagging ensemble classifier. Procedia Comput. Sci. 48, 679–685 (2015)
Gyamfi, K.S., Brusey, J., Hunt, A., Gaura, E.: Linear classifier design under heteroscedasticity in linear discriminant analysis. Expert Syst. Appl. 79, 44–52 (2017)
Nugrahaeni, R.A., Mutijarsa, K.: Comparative analysis of machine learning KNN, SVM, and random forests algorithm for facial expression classification. In: Technology of Information & Communication, pp. 163–168 (2017)
Wang, W., Li, Y., Wang, X., Liu, J., Zhang, X.: Detecting android malicious apps and categorizing benign apps with ensemble of classifiers. Futur. Gener. Comput. Syst. 78, 987–994 (2017)
Zhou, Q.: A comparative study of various supervised learning approaches to selective omission in a road network. Cartogr. J. 54, 1–11 (2016)
Gopinath, B., Gupt, D.B.R.: Majority voting based classification of thyroid carcinoma. Procedia Comput. Sci. 2(2), 265–271 (2010)
Zhu, X., Song, Q., Jia, Z.: A weighted voting-based associative classification algorithm. Comput. J. 53(6), 786–801 (2010)
Acknowledgement
This work is partially supported by The Natural Science Foundation of Guangdong Province (2018A030310575), and Research Foundation of Shenzhen University (85303/00000155).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Cao, J., Wang, H., Pang, M. (2019). An Integrated Classification Algorithm Using Forecasting Probability Strategies for Mobile App Statistics. In: Huang, DS., Huang, ZK., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2019. Lecture Notes in Computer Science(), vol 11645. Springer, Cham. https://doi.org/10.1007/978-3-030-26766-7_57
Download citation
DOI: https://doi.org/10.1007/978-3-030-26766-7_57
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26765-0
Online ISBN: 978-3-030-26766-7
eBook Packages: Computer ScienceComputer Science (R0)