Skip to main content

An Integrated Classification Algorithm Using Forecasting Probability Strategies for Mobile App Statistics

  • Conference paper
  • First Online:
Intelligent Computing Methodologies (ICIC 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11645))

Included in the following conference series:

  • 1734 Accesses

Abstract

Classification is an important data mining technique for classifying the items according to a series of associated features. Even so, most of them are not stable in performance, and they may get high classification accuracy rate in some datasets but poor in other issues. To solve this problem, in this paper, an integrated algorithm is proposed to keep balance between the classification accuracy rate and stability. The proposed algorithm integrates the K-Nearest Neighbor (KNN), Naive Bayes (NB), Regression Tree (RT), Random Forest (RF), Bagging, and Discriminant Analysis Classifier (DAC) using forecasting probability strategies. Specifically, the majority voting strategy and weighted voting strategy are presented using the forecasting probability obtained from the classification algorithms. To demonstrate the effectiveness of the proposed algorithm, numerous experiments are conducted by applying the classification algorithms to real mobile APP statistics. Results indicate that it can get a comprehensive and stable classification accuracy rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Liu, J.W.: Using big data database to construct new Gfuzzy text mining and decision algorithm for targeting and classifying customers. Comput. Ind. Eng. 128, 1088–1095 (2018)

    Article  Google Scholar 

  2. Liu, Q., Liu, C.: A novel locally linear KNN method with applications to visual recognition. IEEE Trans. Neural Netw. Learn. Syst. 28(9), 2010–2021 (2016)

    MathSciNet  Google Scholar 

  3. Sahu, S.K., Kumar, P., Singh, A.P.: Modified K-NN algorithm for classification problems with improved accuracy. Int. J. Inf. Technol. 10(1), 65–70 (2017)

    Google Scholar 

  4. Wolfson, J., Bandyopadhyayy, S., et al.: A naive Bayes machine learning approach to risk prediction using censored, time-to-event data. Stat. Med. 34(21), 2941–2957 (2015)

    Article  MathSciNet  Google Scholar 

  5. Hu, C., Steingrimsson, J.A.: Personalized risk prediction in clinical oncology research: applications and practical issues using survival trees and random forests. J. Biopharm. Stat. 1–17 (2017)

    Google Scholar 

  6. Arar, Ö.F., Ayan, K.: A feature dependent naive Bayes approach and its application to the software defect prediction problem. Appl. Soft Comput. 59, 197–209 (2017)

    Article  Google Scholar 

  7. Cano, G., Garcia-Rodriguez, J., Garcia-Garcia, A., Perez-Sanchez, H., Benediktsson, J.A., Thapa, A., et al.: Automatic selection of molecular descriptors using random forest: application to drug discovery. Expert Syst. Appl. 72, 151–159 (2017)

    Article  Google Scholar 

  8. Yoonseok, S.: Application of boosting regression trees to preliminary cost estimation in building construction projects. Comput. Intell. Neurosci. 1–9 (2015)

    Google Scholar 

  9. Yang, R.M., Zhang, G.L., et al.: Comparison of boosted regression tree and random forest models for mapping topsoil organic carbon concentration in an alpine ecosystem. Ecol. Indic. 60, 870–878 (2016)

    Article  Google Scholar 

  10. Prabhakar Karthikeyan, S., Jacob Raglend, I., Sathish Kumar, K., Kumar Sahoo, S., Priya Esther, B.: Application of SVM as classifier in estimating market power under deregulated electricity market. In: Kamalakannan, C., Suresh, L.P., Dash, S.S., Panigrahi, B.K. (eds.) Power Electronics and Renewable Energy Systems. LNEE, vol. 326, pp. 1309–1317. Springer, New Delhi (2015). https://doi.org/10.1007/978-81-322-2119-7_127

    Chapter  Google Scholar 

  11. Zareapoor, M., Shamsolmoali, P.: Application of credit card fraud detection: based on bagging ensemble classifier. Procedia Comput. Sci. 48, 679–685 (2015)

    Article  Google Scholar 

  12. Gyamfi, K.S., Brusey, J., Hunt, A., Gaura, E.: Linear classifier design under heteroscedasticity in linear discriminant analysis. Expert Syst. Appl. 79, 44–52 (2017)

    Article  Google Scholar 

  13. Nugrahaeni, R.A., Mutijarsa, K.: Comparative analysis of machine learning KNN, SVM, and random forests algorithm for facial expression classification. In: Technology of Information & Communication, pp. 163–168 (2017)

    Google Scholar 

  14. Wang, W., Li, Y., Wang, X., Liu, J., Zhang, X.: Detecting android malicious apps and categorizing benign apps with ensemble of classifiers. Futur. Gener. Comput. Syst. 78, 987–994 (2017)

    Article  Google Scholar 

  15. Zhou, Q.: A comparative study of various supervised learning approaches to selective omission in a road network. Cartogr. J. 54, 1–11 (2016)

    Google Scholar 

  16. Gopinath, B., Gupt, D.B.R.: Majority voting based classification of thyroid carcinoma. Procedia Comput. Sci. 2(2), 265–271 (2010)

    Article  Google Scholar 

  17. Zhu, X., Song, Q., Jia, Z.: A weighted voting-based associative classification algorithm. Comput. J. 53(6), 786–801 (2010)

    Article  Google Scholar 

Download references

Acknowledgement

This work is partially supported by The Natural Science Foundation of Guangdong Province (2018A030310575), and Research Foundation of Shenzhen University (85303/00000155).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hong Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cao, J., Wang, H., Pang, M. (2019). An Integrated Classification Algorithm Using Forecasting Probability Strategies for Mobile App Statistics. In: Huang, DS., Huang, ZK., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2019. Lecture Notes in Computer Science(), vol 11645. Springer, Cham. https://doi.org/10.1007/978-3-030-26766-7_57

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26766-7_57

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26765-0

  • Online ISBN: 978-3-030-26766-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics