Skip to main content

Combining Multiple Factors of LightGBM and XGBoost Algorithms to Predict the Morbidity of Double-High Disease

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1059))

Abstract

In recent years, cardiovascular and cerebrovascular diseases have seriously jeopardized people’s health. Double-high (hyperlipidemia and hypertension) is one of the main causes of cardiovascular and cerebrovascular diseases in clinical practice. To diagnose cardiovascular and cerebrovascular diseases early, a reliable prediction system should be developed to assist doctors. In this paper, the different elements and evaluate the weight of these elements on double-high diseases are analyzed by machine learning method. The LightGBM algorithm and XGBoost algorithm were employed to construct the prediction models, respectively. Significantly, the proposed model was trained by real physical examination data and five meaningful and useful biochemical indicators were selected to encoding the raw physical examination data to numerical vector. The selected features are systolic blood pressure, diastolic blood pressure, serum triglyceride, serum high-density lipoprotein and serum low-density lipoprotein. The mean square error (MSE) after calculating the logarithm of the predicted value and the true value was introduced to assess the prediction model. Results show that this model can effectively predict cardiovascular and cerebrovascular diseases in advance.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

REFERENCES

  1. Sarnak, M.J.: Cardiovascular complications in chronic kidney disease. Am. J. Kidney Dis. 41, 11–17 (2003)

    Article  Google Scholar 

  2. Charytan, D.M.: Introduction: cardiovascular disease in chronic kidney disease. In: Seminars in Nephrology, vol. 38, no. 6, p. 541. Elsevier (2018)

    Google Scholar 

  3. Tang, W.H.W., Kitai, T., Hazen, S.L.: Gut microbiota in cardiovascular health and disease. Circ. Res. 120(7), 1183–1196 (2017)

    Article  Google Scholar 

  4. Maqbool, M., Cooper, M.E., Jandeleit-Dahm, K.A.M.: Cardiovascular disease and diabetic kidney disease. In: Seminars in Nephrology. vol. 38, no. 3, pp. 217–232. WB Saunders, Philadelphia (2018)

    Article  Google Scholar 

  5. Gupta, P., Gan, A.T.L., Man, R.E.K., et al.: Risk of incident cardiovascular disease and cardiovascular risk factors in first and second-generation Indians: the Singapore Indian eye study. Sci. Rep. 8(1), 14805 (2018)

    Article  Google Scholar 

  6. Njie, G.J., Finnie, R.K.C, Acharya, SD, et al.: Peer reviewed: reducing medication costs to prevent cardiovascular disease: a community guide systematic review. Prev. Chronic Dis. 12 (2015)

    Google Scholar 

  7. Onuegbu, A.J., Olisekodiaka, J.M., Udo, J.U., et al.: Evaluation of high-sensitivity C-reactive protein and serum lipid profile in southeastern Nigerian women with pre-eclampsia. Med. Princ. Pract. 24(3), 276–279 (2015)

    Article  Google Scholar 

  8. Ke, G., Meng, Q., Finley, T., et al.: LightGBM: a highly efficient gradient boosting decision tree. In: Advances in Neural Information Processing Systems, pp. 3146–3154 (2017)

    Google Scholar 

  9. Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)

    Google Scholar 

  10. https://github.com/Microsoft/LightGBM

  11. https://github.com/Microsoft/LightGBM/blob/master/docs/Parameters-Tuning.rst

  12. Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)

    Article  MathSciNet  Google Scholar 

  13. https://github.com/fxsjy/jieba

  14. Hsu, C.W., Lin, C.J.: A comparsion of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13(2), 415–425 (2002)

    Article  Google Scholar 

Download references

Acknowledgments

This research was funded by the National Natural Science Foundation of China (Nos. 61772227,61702214), the Development Project of Jilin Province of China (Nos 20170101006JC, 20170203002GX, 20190201293JC). This work was also supported by Jilin Provincial Key Laboratory of Big Date Intelligent Computing (No. 20180622002JC).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lin Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Song, Y. et al. (2019). Combining Multiple Factors of LightGBM and XGBoost Algorithms to Predict the Morbidity of Double-High Disease. In: Mao, R., Wang, H., Xie, X., Lu, Z. (eds) Data Science. ICPCSEE 2019. Communications in Computer and Information Science, vol 1059. Springer, Singapore. https://doi.org/10.1007/978-981-15-0121-0_50

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-0121-0_50

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-0120-3

  • Online ISBN: 978-981-15-0121-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics