skip to main content
10.1145/3523286.3524582acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbicConference Proceedingsconference-collections
research-article

A Hypertension Risk Prediction Model Based on Improve Random Forest

Published: 31 May 2022 Publication History

Abstract

In order to reduce the serious consequences of chronic diseases, this paper proposes a hypertension risk prediction model based on improved random forest, which provides an effective technical means for early warning of hypertension. The original data set with unbalanced samples is processed by the synthetic minority oversampling technique (SMOTE) to form a balanced data set. Then improve the random forest algorithm based on similarity optimization and deep optimization, and finally establish a prediction model. It is compared with the four machine learning algorithms of linear regression (LR), artificial neural network (ANN), support vector machine (SVM) and CatBoost. ROC curve and AUC are used as the evaluation indicators of the model.
The experimental results show that the prediction accuracy of the model based on the improved random forest algorithm is higher, with an AUC value of 0.8697, which is better than the other four algorithms. The improved random forest algorithm has certain feasibility in hypertension risk prediction. This method has a better effect in predicting the risk of hypertension, which is better than other traditional methods, can provide more accurate judgments, and provide better results for early warning and prevention of hypertension.

References

[1]
Guidelines for Rational Use of Hypertension (Second Edition) [J]. Frontier Journal of Chinese Medicine (Electronic Edition),2017,9(07):28-126.
[2]
Revision Committee of "Guidelines for Prevention and Treatment of Hypertension in China". Revised Edition of China Guidelines for Prevention and Treatment of Hypertension in 2018 [J]. Prevention and treatment of cardiovascular and cerebrovascular diseases, 2019, 19(1): 1-44.
[3]
S. Padmanabhan. “Prospects for genetic risk prediction in hypertension”, Hypertension, 2013,61(5), pp.961-963.
[4]
Aburto, N. J. “Effect of increased potassium intake on cardiovascular risk factors and disease: systematic review and meta-analyses.” BMJ,2013,346(apr033), pp. f1378-f1378.
[5]
Liu Shaoyang. Research on blood pressure prediction model of hypertensive patients based on deep learning[D]. Qingdao University of Science and Technology, 2020.
[6]
Zhuoyuan ZHENG, Ye LI and Yunpeng CAI. “The Logistic Regression Analysis on Risk Factors of Hypertension among Peasants in East China & Its Results Validating”. International Journal of Computer Science Issues, 2013.
[7]
Lafreniere D, Zulkernine F, Barber D, “Using machine learning to predict hypertension from a clinical dataset”. Computational Intelligence. 2017.
[8]
H.Zhao, Z. Ma and Y. Sun, "A Hypertension Risk Prediction Model Based on BP Neural Network," 2019 International Conference on Networking and Network Applications (NaNA), 2019, pp. 464-469.
[9]
Wu JH, Wei W, Zhang L, Risk assessment of hypertension in steel workers based on LVQ and Fisher-SVM deep excavation [J]. IEEE Access, 2019, 7: 23109-23119.
[10]
LIU Juan, ZHAO Huanhuan, MA Zuchang, Blood pressure prediction method based on CatBoost algorithm and fuzzy classification system in young people [J]. Beijing Biomedical Engineering, 2020, 39(6): 601-608.
[11]
Ansah JP, Inn RLH, Ahmad S. An evaluation of the impact of aggressive hypertension, diabetes and smoking cessation management on CVD outcomes at the population level: a dynamic simulation analysis [J]. BMC Public Health, 2019, 19 (1): 1105.
[12]
Ming Chen, Xudong Zhao. Fatty Liver Disease Prediction Based on Multi-Layer Random Forest Model. In: CSAI '18, Proceedings. Shenzhen, China, pp.364-368, December 2018.
[13]
H. Lan and Y. Pan, "A Crowdsourcing Quality Prediction Model Based on Random Forests," 2019 IEEE/ACIS 18th International Conference on Computer and Information Science (ICIS), 2019, pp. 315-319.
[14]
Zhao W, Xu M, Jia X, A Classification Method for Imbalanced Data Based on SMOTE and Fuzzy Rough Nearest Neighbor Algorithm[M]//Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. Springer International Publishing, 2015: 340-351.
[15]
Abdi L, Hashemi S. To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques[J]. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(1): 238-251.

Cited By

View all
  • (2024)Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical ReviewIEEE Access10.1109/ACCESS.2024.340674812(80698-80730)Online publication date: 2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
BIC '22: Proceedings of the 2022 2nd International Conference on Bioinformatics and Intelligent Computing
January 2022
551 pages
ISBN:9781450395755
DOI:10.1145/3523286
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. decision tree optimization
  2. hypertension
  3. improve random forest
  4. risk prediction
  5. synthetic minority oversampling technique (SMOTE)

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

BIC 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical ReviewIEEE Access10.1109/ACCESS.2024.340674812(80698-80730)Online publication date: 2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media