Authors:
Gulshan Sihag
1
;
Pankaj Yadav
2
;
Veronique Delcroix
1
;
Vivek Vijay
2
;
Xavier Siebert
3
;
Sandeep Kumar Yadav
2
and
François Puisieux
4
Affiliations:
1
Univ. Polytechnique Hauts-de-France, CNRS, UMR 8201 - LAMIH, F-59313 Valenciennes, France
;
2
Department of Mathematics, Indian Institute of Technology, Jodhpur, India
;
3
Univ. de Mons, Faculté Polytechnique, Département de Mathématique et Recherche Opérationnelle, Belgium
;
4
Départment de Gérontologie, Hôpital Universitaire de Lille, 59037 Lille cedex, France
Keyword(s):
Classification, Imbalanced Data, SMOTE, Fall Prevention, Risk Factors for Falls.
Abstract:
Prevention of falls requires providing a small number of recommendations based on the risk factors present for a person. This article deals with the evaluation of 12 modifiable risk factors for fall, based on a selection of 45 variables from a real data set. The results of four classifiers (Logistic Regression, Random Forest, Artificial Neural Networks, and Bayesian Networks) are compared when using the initial imbalanced data set, and after using the balancing method SMOTE. We have compared the results using four different measures to evaluate their performance (balanced accuracy, area under the Receiver Operating Characteristic (ROC) curve F1-score, and F2-score). The results show that there is a significant improvement for all the classifiers when classifying each target risk factor using the data after balancing with SMOTE.