Keywords

1 Introduction

The general trend towards an ageing population in our western society is still unbroken, and life expectancy is even rising [1]. It poses enormous future challenges for the medical profession in general and particularly for the daily practice of family doctors (general practitioners, GP’s).

Aging is often associated with the burden of chronic diseases [2]. However, some older patients with chronic diseases are more vulnerable than others in the development of negative health outcomes, including hospitalization, falls, disability, low physical and mental functioning, dependency on others and most of all to pre-term mortality [3]. The clinical signs of this vulnerable state have been identified by the gerontologists and include the following features: decreased muscle mass and strength, general weakness, slow gait speed, impaired balance and generally low activity [4]. It is termed frailty and explained by the reduced reserve capacity of various physiological systems, due to the aging process and accumulation of chronic medical conditions [5]. There are numerous diagnostic criteria for assessing frailty divided into the two principal groups: the tests focusing on physical functions of frailt and a more comprehensive approach, based on the Cumulative Deficit Model of Frailty (the Frailty Index), which accounts for a broad range of medical, cognitive, psychological, and functional deficits [6].

Frailty is a new concept, and precise theory is still missing. There are several assessment tools (or predictive models), but it may not be the best solution. One of the most widely used tests for assessing physical frailty is the Fried Phenotypic Model of Frailty [4]. It is based on the assessment of the small number of measurable components, including slow walking speed, low grip strength (measured by the hand grip dynamometer), self-reported exhaustion, unintended weight loss, and low physical activity. The credibility of this instrument relies on the fact that it has been derived from data of the large epidemiologic study. In line with the proposed evolutive and dynamic nature of frailty, this instrument also includes a prefrailty state [4, 5]. Gradation is based on disorder counting, so that 1–2 disorders indicate prefrailty and 3–5 indicate frailty [4].

In general, determinants of frailty may be divided into three groups: (1) chronic diseases and conditions associated with disability, such as falls and walking difficulties; (2) markers of physiological systems disturbation; and (3) behavioural and societal factors, such as nutrition, low socioeconomic status and low education [7]. Some chronic diseases were found in epidemiological studies as being more often associated with frailty, than some others, including diabetes, cardiovascular disease (CVD), malignant disease, chronic obstructive pulmonary disease (COPD), and chronic renal failure [2, 8, 9].

In the fundamental study on frailty, Fried et al. have shown that the risks for the development of prefrailty and frailty increase in parallel with the number of chronic diseases and the number of pathophysiology disorders, described with measures of anaemia, inflammation, micronutrient deficits, metabolic regulatory factors, body composition and neuromuscular function [5]. This study has provided a rationale for non-linearity in the development of frailty and heterogeneity of older people concerning prefrailty and frailty.

In this study, we have attempted to go a step forward and to show how prefrail and frail older people are distributed within the “naturally formed groups”. These clusters are defined by the set of parameters which indicate significant pathophysiology disorders associated with frailty. We used only easily available data from General Practice (GP) electronic health records (eHRs) and patient self-reports. To identify the most important chronic diseases and geriatric disabling conditions that are associated with particular clusters, we have assessed their differences among the clusters.

Clustering methods are still rarely used in medical research. However, they have proved to be appropriate for grouping patients with chronic medical conditions and comorbidities in situations with overlapping between patients usually. The solving of such problems is like copying with the system’s complexity. These methods allow insights into the “natural” grouping of patients, that is, in a case when theoretical assumptions of the way of their grouping (classification) are low. The main point which we have taken into account when deciding to use the clustering or the classification methods, was “the quantity” of theory, or how much the postulated hypothesis is convincing. The clustering methods allow a higher degree of uncertainty than the classification methods or take a larger, still unknown context. Also, they may be used when we wish to reconcile the old hypothesis to provide a new paradigm. In contrast to this, the classification methods need a strongly grounded theory and are, consequently, more predictive, than the clustering methods.

We assume that by using this methodology approach, it would be possible to identify clinical patterns that in some local population are mostly associated with frailty. The final aim is to implement the efficacious strategy for recognizing older people, GP patients, who are at increased risk for negative health outcomes. We believe that this study will add value to the requirements that screening of older people on frailty become the standard procedure in PC [10]. In general, using predictive machine learning enables the discovery of potential risk factors which provides the family doctor with information on a probable patient outcome and to react promptly and to avert likely adverse events in advance [11].

1.1 Related Work

To our knowledge, studies with similar approaches to our have not been published to date. The only study that we could find was based on using single measures of physical performance and cognitive function impairment tests, to identify clusters [12]. A similar paper, by ourselves, has been prepared on the smaller sample and with a larger set of parameters, is now under review in the journal devoted to analytical, clinical research, but is not in conflict with this conference paper.

Bertini et al. proposed two predictive frailty models for subjects older than 65 years old by exploiting information from 12 socio-clinical databases available in the Municipality of Bologna [13]. The authors take into account many diagnoses and functional conditions that may be impaired (decreased). They also noted that the frailty has not yet emerged as a well-defined clinical or social concept. Clegg et al. tried to develop and validate an electronic frailty index (eFI) using routinely available primary care electronic health record data [14]. For this purpose, they used anonymized data from a total of 931 541 patients aged 65–95. The eFI enables identification of older people who are fit, and those with mild, moderate, and severe frailty.

Both presented works represent a relatively tricky and lengthy method for daily usage. By clustering methods, we can narrow this high heterogeneity of patients (based on different combinations of many diseases and disorders) and limit it to only several subgroups. These subgroups are clusters defined according to a less number of the main features. Also, the results of some large-scale studies, including older people with multiple chronic medical conditions, may not be applied directly to the local situation, because of the large variability between populations. Thus, small datasets are appropriate for use, when there is a need that the results of research inform local healthcare providers.

2 Data Analysis

A retrospective analytical study was conducted during 2018, in the General Practice (GP) setting, in the town of Osijek, eastern Croatia, during a six-month follow-up. Only GP attendees, old 60 years and more, and not those on home care, were included in the study. Patients were assessed at their regular encounters and were recruited if they gave their written informed consent. Exclusion criteria included: acute medical conditions, worsening of chronic conditions, dementia and active chemo- or biological therapies.

Our dataset contained 261 records characterized by 10 numerical variables (89 males and 172 females). We present these variables as average and standard deviation (SD) and as the median and interquartile range (Table 1).

Table 1. Descriptive statistics of the numerical variables.

The quality of the data matters more than the selected model or the chosen algorithm; therefore, data-pre-processing is the most important step for all machine learning tasks.

It is rare in the practice that the number of clusters is known at the beginning of the experiments. One possibility of how to identify the most suitable number of clusters (k value in the case of the K-Means algorithm) is Elbow method [15, 16]. This method provides a graphical visualization and uses the percentage of variance explained as a function of the number of clusters.

The idea is to run this method on the dataset for a range of values of k (for example k from 1 to 10), and for each value of k calculate the total within-clusters sum of squared errors. On the graph, the first clusters will add much information (explain a lot of variances), but at some point, the marginal gain will drop, giving an angle in the graph [15]. This point represents the expected number of clusters. The Elbow method also has some limitation, such as the elbow is no always unambiguously identified. In this case, we can use the Average silhouette method calculating how well each object lies within its cluster. The optimal number of clusters k is the one that maximizes the average silhouette over a range of possible values for k [17]. We calculated the maximal values for the 2 or 3 possible clusters (Fig. 1). Based on the data characteristics, we finally choose the 3 as k-value.

Fig. 1.
figure 1

The results of Average silhouette method.

For clustering, we selected the K-Means algorithm as a very popular technique to partitioning data sets with numerical attributes [18]. It is an unsupervised learning algorithm constructing a partition of a data of n objects into a set of k clusters. The k-value has to be specified at the beginning. Each cluster is represented by its centre. Next, we were looking for differences among them in other features (expressed by the categorical variables – gender, diagnoses, etc.)

We applied the K-Means algorithm to pre-processed data:

  • The dataset contained only numerical variables like age, bmi, waist Circumference (wc), mid-arm circumference (mac), fasting glucose (glu_f), cholesterol (chol), low-density lipoprotein (ldl), glomerular filtration rate (gfr), hemoglobin (hb) and hematocrit (htc).

  • All variables were normalized based on the z-score standardization (transforms all the variables to a mean value of 0 and a standard deviation of 1).

3 Results

We constructed 3 clusters visualized in Fig. 2, distinguished by the signs. The Principal Component Analysis generates the plot, and an eclipse is drawn around each cluster (but not represented a boundary). The data points are plotting according to the first two principal components coordinate.

Fig. 2.
figure 2

The plot of 3 constructed clusters.

The centres of the clusters are characterized by the relevant value of each input variable, see Table 2.

Table 2. The clusters centres.

We evaluated the differences between clusters. We started with the proportion of prefrail and frail vs. non-frail patients in our dataset. Cluster 2 contains the highest number of non-frail patients (51). On the other hand, the numbers of prefrail or frail patients in the first and second clusters are relatively similar. The highest number of prefrail patients characterizes the third cluster. The gender radio of cluster1 is relatively balanced (1:1); the next two have a different one 2.5:1.

The Kruskal-Wallis test by rank is a non-parametric alternative to the one-way ANOVA test when we have more than two groups [19, 20]. As the p-value is less than the significance level 0.05, we can conclude that there are significant differences between the clusters (Table 3). For this purpose, we used the following list of variables:

Table 3. Differences between clusters - Kruskal-Wallis test by rank.
  • The number of diagnoses of chronic diseases (som com).

  • Selected diagnoses of chronic diseases: diabetes mellitus (dm), chronic obstructive pulmonary disease (copd) or asthma, cardio-vascular disease (cvd), including coronary heart disease (chd) or cardio-vascular disease or peripheral arterial disease (pad), malignant disease (malig), osteoporosis (osteop), low back pain (low back), osteoarthritis (oa).

  • Geriatric syndromes other than frailty: urogenital disease or urinary incontinence (urogenit incont), visus impairment (visus), hearing loss (hear), falls (with or without bone fracture) (fall nf, f), walk difficulties (walk).

For selected variables, we investigate the difference among the clusters in graphical form (Fig. 3). The percentages were calculated on the whole 261 records.

Fig. 3.
figure 3figure 3

Particular chronic medical conditions among clusters.

The proportion of chronic diseases and various impairments also differs among the clusters (Figs. 4 and 5).

Fig. 4.
figure 4

The number of diagnoses of chronic diseases among clusters.

Fig. 5.
figure 5

The number of various impairments like urogenital or incontinence, falls, hear, walk and chronic pain among clusters.

We used the Kruskal-Wallis test also for an evaluation of the difference between the three target categories of the patients: prefrail, frail, and non-frail (Table 4).

Table 4. The difference between prefrail, frail, and non-frail patients.

4 Discussion

We used a small set of laboratory and anthropometric measures, to describe the participants’ health status, and the clustering procedure, to identify major clinical phenotypes within the group of older PC patients from the local community (Tables 1 and 2). Surprisingly, the values of the collected parameters have shown a large degree of diversity (Table 1). On the other hand, this characteristic is in line with non-linear age-related dysregulation of multiple body systems, which stays is in the background of the clinical expression of multimorbidity [3, 5].

The three clusters have been identified, each containing 66, 92, and 103 patients (Table 2). They overlap to some degree with each other, which is usual in people with multiple comorbidities and accounts for their exceptional heterogeneity (Fig. 2) [2, 3].

These characteristics, in combination with heterogeneity and overlapping, are emphasized by the distribution patterns of prefrail and frail patients in the clusters. Namely, in all three clusters, there are also non-frail, prefrail, and frail patients, but presented with different proportions, their patterns depending on characteristics of patients in the clusters.

Analysis of the clusters has shown that patients in the clusters No. 1 and No. 2 are of a lower age than those in the cluster No. 3 (67.5, 71.1 and 74.2 years, respectively), and characterized with high values of bmi and wc measures (30.95 vs 34.64 and 100.15 vs 112.96), which according to the knowledge indicate obesity, in the case of the cluster No. 2, even extreme obesity (Table 2) [21].

These two clusters have similar proportions of prefrail and frail vs. non-frail patients. This ratio is somewhat highe in cluster No. 1. Despite these similarities, patients in these two clusters also show some important differences, such as significantly higher proportions of patients diagnosed with diabetes and CVD, in the cluster No. 1, which according to the evidence are two conditions strongly associated with both, obesity and frailty (Table 3) (Fig. 3) [22, 23]. This fact can explain a higher proportion of patients in the cluster No. 1 who have reported subjective difficulties in a walk, indicating a higher level of physical disability of patients in this cluster, compared to the cluster No. 2 (Table 3) (Fig. 3). Some recently published papers highlight the role of measuring gait speed in the screening of patients with CVD who are in particular vulnerability for negative outcomes of hospitalization and surgery [24].

The differences between the clusters No. 1 and 2 may also be associated with differences in participation of males vs. females. The participation of the males in the cluster No. 1 is relatively higher, than in cluster No. 2, and compared to the whole sample. Regarding these results, it is known that women, in general, gain obesity, and subsequently diabetes and CVD, later than men in the life course, yet in the age after menopause [25]. This fact may explain our results that women, who make a prevalent part of patients in cluster No. 2, are characterized by extreme obesity (Table 2). But they are still not dominated with diabetes, as it is the case with obese patients in cluster No. 1 (Fig. 3). Extreme obesity has been recognized as a possible cause of prefrailty and frailty, although this fact is in contradiction to what is, in general, considered under the term of frailty, including weakness, muscle loss and unintended weight loss [4, 26].

The cluster No. 3 contains the oldest patients (the prevalent age 74.2 years), who, distinctly from patients in the other two clusters, have reduced renal function (as indicated with the parameter gfr = 69.02 vs. 96.27 and 105.75) (Table 2). According to the evidence, frailty is strongly associated with progressive renal impairment, which in this study is highlighted with the highest proportion of prefrail and frail vs non-frail patients, found in cluster No. 3 (a ratio of about 2.6 vs 1.4 and 1.3), compared to the other two clusters [27].

Other characteristics of patients in this cluster, which are markedly different from those in other two clusters, include the lack of obesity (indicated with values of the parameters bmi = 27.03 and wc = 92.13) and reduced muscle mass (indicated with the parameter mac = 28.87 vs. 32.46 and 34.14) [28].

A clinical phenotype that relies on the description with these parameters can be characterized as muscle wasting and frailty, reflecting the both, highly developed levels of frailty and significantly reduced renal function [4, 29]. Other characteristics of patients in this cluster, including lower (than in other two clusters) values of the parameter hb, indicating anemia, and the parameter htc, indicating decreased blood viscosity, as well as moderate (not elevated) values of the parameters fglu and chol, indicating the metabolic status, fasting glucose and total serum cholesterol, are in line with this proposed clinical phenotype [29, 30].

When comorbidities associated with particular clusters were analyzed, a tendency for disorders accumulation has been recognized in cluster No. 3, compared to the other two clusters. It is indicated with the highest proportions, in this cluster, of patients having 3 or more diagnoses of chronic diseases (frequencies 45:46:63) (Fig. 4). This tendency is even more emphasized when integrated geriatric conditions are considered, including walking difficulties, falls, hearing loss, urinary incontinence and chronic pain (frequencies 47:52:84) (Fig. 5). Of particular disorders, the most prevalent disorders in cluster No. 3 were osteoporosis, falls, and hearing loss with significant differences and osteoarthritis and urinary incontinence as non-significant (Table 3) (Fig. 3). The findings indicating functional disorders accumulation in people of older age, with high levels of frailty expression, can find their support in the growing body of evidence [31]. What is emphasized in this study is the key role of renal function impairment in the development of higher levels of frailty expression. This message is in line with the concept of unsuccessful aging, the course of aging that has been turned towards the development of renal function decline, multimorbidity, functional deficits, and frailty [29].

The non-linearity in frailty development, on distribution patterns of chronic medical conditions in the clusters, is visible in Tables 3 and 4. Significant differences among clusters were found in diagnoses of diabetes and CVD (Table 3). When prefrail and frail vs non-frail patients were considered, significant differences were found in diagnoses of osteoporosis and osteo-arthritis. Both conditions are known to be directly associated with frailty and difficulties in a walk (Table 4). The patient group consisted of only prefrail and frail patients, in a great part resembles characteristics of the cluster No. 3, where osteoporosis, osteo-arthritis and other physical dysfunctions have shown a tendency to accumulate (Figs. 3 and 5).

5 Conclusions and Future Outlook

For medical research, we usually select a standard, already proved a machine learning method, to ensure the reproducibility of the results. What is innovative in this paper is a research approach, that is, a use of a combination of ML and statistical and graphical methods, for solving a complex medical problem. This research approach requires a significant input of a medical researcher in designing research. The clusters can be considered as a first level analytical method, the results of which can inform more consistent future models.

There is a problem in medical research of small dataset conditions, such as the need for research within one health institution or when the size of the dataset is constrained by the complexity and a high cost of large-scale experiments. To meet these constraints, many theoreticians in ML methods, try to adapt these methods to be accurate and appropriate for use in small datasets [32].

By using a set of simple parameters from the general practitioners both electronic health records and patient self-reports, together clustering method, they could discover new insights into the main clinical phenotypes of the group of older patients from the local community and their associated rates of prefrail and frail patients. It is an important problem-solving task, characterized by non-linearity and high complexity, which requires a data-driven analytical approach. The setting of the general practitioner creates an ideal place for conducting such research. It provides access to a huge amount and variety of medical data in combination with his broad implicit knowledge of the GP. Moreover, this study showed again that data-quality matters most.

There is an urgent need for closer collaboration between medical experts, particularly general practitioners, and machine learning experts. If the GP’s want to use the full capability of machine learning, there is a need soon to include it to the daily routine workflows of the GP’s. These calls for simple to use Human-AI Interaction, fostering to understand the data within the context of a medical problem and to support decision making under the constraint of increasing workload and time pressure. Consequently, such methods must be trustworthy, and this requires explainability on demand. To reach such a level of explainable medicine it needs much future research in explainability [33] and causability [34].