A Novel Compression Algorithm for Hardware-Oriented Gradient Boosting Decision Tree Classification Model

Wang, Xiao; Li, Yafei; Li, Yinqi; Mei, Kuizhi

doi:10.1007/978-3-030-26766-7_35

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11645))

Included in the following conference series:

International Conference on Intelligent Computing

1983 Accesses

Abstract

Gradient boosting decision tree is a widely used machine learning algorithm. As the big data era comes, this algorithm has been applied to the multimedia fields. However, this algorithm suffers a lot when it comes to the mobility of multimedia application. In this paper, we propose a compression algorithm, GBDT Compression, to enhance the storage adaptability of a gradient boosting decision tree classification model. GBDT Compression introduces a new rule to pruning, addressing and encoding for a well trained gradient boosting decision tree model. By conduct GBDT Compression on small data sets, we show that this algorithm considerably reduces the space cost of original model up to dozens of times. Furthermore, as data increases and the original model grows, the compression rate also enhances. Which is meaningful for mobile multimedia application with limited memory.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Fast Gradient Boosting Decision Trees with Bit-Level Data Structures

Boosting over Non-deterministic ZDDs

Bitpaths: Compressing Datasets Without Decreasing Predictive Performance

References

Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)
Article MathSciNet Google Scholar
Tian, D., He, G., Wu, J., Chen, H., Jiang, Y.: An accurate eye pupil localization approach based on adaptive gradient boosting decision tree. In: Visual Communications and Image Processing, pp. 1–4 (2016)
Google Scholar
Moon, J., Kim, S.: An approach on a combination of higher-order statistics and higher-order differential energy operator for detecting pathological voice with machine learning. In: 2018 International Conference on Information and Communication Technology Convergence (ICTC), pp. 46–51, October 2018
Google Scholar
Prasad, A.G., Sanjana, S., Bhat, S.M., Harish, B.S.: Sentiment analysis for sarcasm detection on streaming short text data. In: 2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA), pp. 1–5, October 2017
Google Scholar
Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)
Google Scholar
Keck, T.: FastBDT: a speed-optimized and cache-friendly implementation of stochastic gradient-boosted decision trees for multivariate classification (2016)
Google Scholar
Zhang, H., Si, S., Hsieh, C.J.: GPU-acceleration for large-scale tree boosting (2017)
Google Scholar
Wang, Y., Feng, D., Li, D., Chen, X., Zhao, Y., Niu, X.: A mobile recommendation system based on logistic regression and gradient boosting decision trees. In: International Joint Conference on Neural Networks, pp. 1896–1902 (2016)
Google Scholar
Han, S., Mao, H., Dally, W.J.: Deep compression: compressing deep neural networks with pruning, trained quantization and Huffman coding. Fiber 56(4), 3–7 (2015)
Google Scholar
Liu, Z., Li, J., Shen, Z., Huang, G., Yan, S., Zhang, C.: Learning efficient convolutional networks through network slimming, pp. 2755–2763 (2017)
Google Scholar
Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5 Mb model size (2016)
Google Scholar
Ullrich, K., Meeds, E., Welling, M.: Soft weight-sharing for neural network compression (2016)
Google Scholar
Gysel, P.: Ristretto: hardware-oriented approximation of convolutional neural networks (2016)
Google Scholar
Oneto, L., Parra, X., Anguita, D., Ghio, A., Reyes-Ortiz, J.L.: A public domain dataset for human activity recognition using smartphones. In: 21st European Symposium on Artificial Neural Networks (2013)
Google Scholar
Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Hum. Genet. 7(2), 179–188 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Xi’an Jiaotong University, No. 28, West Xianning Road, Xi’an, Shaanxi, China
Xiao Wang, Yafei Li, Yinqi Li & Kuizhi Mei

Authors

Xiao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yafei Li
View author publications
You can also search for this author in PubMed Google Scholar
Yinqi Li
View author publications
You can also search for this author in PubMed Google Scholar
Kuizhi Mei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kuizhi Mei .

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
Nanchang Institute of Technology, Nanchang, China
Zhi-Kai Huang
Liverpool John Moores University, Liverpool, UK
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, X., Li, Y., Li, Y., Mei, K. (2019). A Novel Compression Algorithm for Hardware-Oriented Gradient Boosting Decision Tree Classification Model. In: Huang, DS., Huang, ZK., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2019. Lecture Notes in Computer Science(), vol 11645. Springer, Cham. https://doi.org/10.1007/978-3-030-26766-7_35

Download citation

DOI: https://doi.org/10.1007/978-3-030-26766-7_35
Published: 24 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26765-0
Online ISBN: 978-3-030-26766-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics