Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2023 Feb 3;13:1981. doi: 10.1038/s41598-023-27568-6

Machine learning to improve frequent emergency department use prediction: a retrospective cohort study

Yohann M Chiu 1,2,3,, Josiane Courteau 3, Isabelle Dufour 4, Alain Vanasse 2,3, Catherine Hudon 2,3
PMCID: PMC9898278  PMID: 36737625

Abstract

Frequent emergency department use is associated with many adverse events, such as increased risk for hospitalization and mortality. Frequent users have complex needs and associated factors are commonly evaluated using logistic regression. However, other machine learning models, especially those exploiting the potential of large databases, have been less explored. This study aims at comparing the performance of logistic regression to four machine learning models for predicting frequent emergency department use in an adult population with chronic diseases, in the province of Quebec (Canada). This is a retrospective population-based study using medical and administrative databases from the Régie de l’assurance maladie du Québec. Two definitions were used for frequent emergency department use (outcome to predict): having at least three and five visits during a year period. Independent variables included sociodemographic characteristics, healthcare service use, and chronic diseases. We compared the performance of logistic regression with gradient boosting machine, naïve Bayes, neural networks, and random forests (binary and continuous outcome) using Area under the ROC curve, sensibility, specificity, positive predictive value, and negative predictive value. Out of 451,775 ED users, 43,151 (9.5%) and 13,676 (3.0%) were frequent users with at least three and five visits per year, respectively. Random forests with a binary outcome had the lowest performances (ROC curve: 53.8 [95% confidence interval 53.5–54.0] and 51.4 [95% confidence interval 51.1–51.8] for frequent users 3 and 5, respectively) while the other models had superior and overall similar performance. The most important variable in prediction was the number of emergency department visits in the previous year. No model outperformed the others. Innovations in algorithms may slightly refine current predictions, but access to other variables may be more helpful in the case of frequent emergency department use prediction.

Subject terms: Health services, Public health, Epidemiology

Introduction

Although definitions may vary, individuals who visit emergency department (ED) at least three times per year are considered as “frequent users”13. Frequent ED users often display heterogeneous profiles—a combination of mental health disorders, physical comorbidities, and low socioeconomic status1,4,5—leading to complex needs that are not adequately dealt with in an ED6. A significant proportion of frequent ED users have numerous chronic diseases, such as coronary artery disease or chronic obstructive pulmonary disease4,7. Those conditions could be managed in primary care, preventing acute deteriorations that lead to ED use8. Since frequent ED use for complex needs may occur because those needs have not been adequately addressed in a primary care context, this type of ED use is considered suboptimal. As an indicator of unmet needs, it is associated with negative outcomes for patients (e.g., higher hospital admissions or mortality rates9,10). Furthermore, ED costs are generally higher than those in a primary care setting, resulting in a socioeconomical burden for the health system7,11,12. In the province of Quebec (Canada), frequent ED users with chronic diseases represent 9.2% of all the ED users but account for 28.8% of all ED visits13. Furthermore, a recent Canadian census shows that chronic condition prevalence will increase as the population age; the burden on the healthcare system is then likely to increase14.

Targeted interventions such as case management have been shown to help reduce ED visits and ED costs, while improving patient satisfaction and clinical outcomes2,15,16. In this context, being able to accurately predict frequent ED use is relevant to target users who may really benefit from it. Much work has been done with statistical models in this direction. In particular, logistic regression (LR) is a standard and widely used statistical model17. However, with the constant improvement of quantity and quality of measurements (electronic health records), statistical models, and computer capacity, modern machine learning (ML) models are becoming more and more popular. Previous studies have predicted frequent ED use for a specific issue18,19 or in a local hospital20 successfully using ML models other than LR. Yet, no study has been conducted comparing predictive power of ML models in a general population and considering chronic diseases.

This study aims at comparing the performance of a logistic regression to four ML models for frequent emergency department use in an adult population with chronic diseases, in the province of Quebec (Canada).

Methods

All methods in this study were carried out in accordance with the TRIPOD guidelines for model development and validation (see the Supplementary material Table S1)21.

Study design and data sources

This is a population-based retrospective cohort study. We used medico-administrative databases from the health insurance board of the province of Quebec (Régie de l’assurance maladie du Québec), which manages health insurance plan for Quebec citizens. The following files were used:

  1. The patient demographic register, which contains information about the sex, date of birth, date of death (if applicable), and place of residence of the patient;

  2. The physician reimbursement claim register, which contains information about medical services provided by a fee-for-service physician in Quebec: date of service, place of service (emergency, medical clinic, etc.), physician specialty, diagnosis (International Classification of Diseases, ninth revision or ICD-9), and the medical act procedure performed by the physician;

  3. The hospital register, which contains information about the reasons for hospitalization (main diagnosis and up to 25 secondary diagnoses coded in ICD-10), dates of admission and release from hospital, and all medical procedures performed during the hospitalization.

Selection of participants

The study population included all adults (18 years and older) living in the province of Quebec, with at least one ED visit during the inclusion period, i.e., between the 1st of January 2012 and the 31st of December 2013, diagnosed with at least one chronic condition, and without dementia. Patients with dementia may have special needs compared to cognitively intact patients and were thus not included. In this study, the diseases considered were those from the Canadian Institute for Health Information (see the Supplementary material Table S2): asthma, chronic obstructive pulmonary disease (COPD), congestive heart failure (CHF), coronary artery disease (CAD), diabetes, epilepsy, and high blood pressure (HBP)22. Those specific conditions, also known as ambulatory care sensitive conditions, are a set of chronic diseases for which timely intervention in primary care could reduce the risk of hospitalization or the occurrence of acute episodes for those diseases2325. The index date was randomly assigned as one ED visit among all ED visits occurring during the inclusion period26. The index date is then used as a “starting point” for measuring patient characteristics, such as ED use, age, or diagnoses.

There were two exclusion criteria (Fig. 1). First, patients living in remote areas were excluded (6.8%). Remote areas were defined as municipalities with fewer than 10,000 inhabitants with weak or no metropolitan influence zone (the percentage of resident employed labour force who commute to work in urban areas is less than 5%). This exclusion ensured that remote residents who tend to use ED as an alternative to walk-in clinics (as there are fewer primary care alternatives27,28) were not included. However, patients living in municipalities with fewer than 10,000 inhabitants with high or moderate metropolitan influence were included. Secondly, patients who died during the year after their index date (8.3%) were excluded as they can require specialized healthcare, such as patients at the end of life29,30. Besides, that exclusion helped reducing immortal time bias31.

Figure 1.

Figure 1

Flowchart of the study cohorts.

Outcome and independent variables

Frequent ED use was investigated using two different definitions: (1) having at least three visits (“frequent users 3”) and (2) having at least five visits (“frequent users 5”) during the year following the index date (as mentioned in the previous subsection, the index date is an assigned ED visit between 2012 and 2013). Those definitions were chosen amongst the most common ones in order to compare performance in two populations that were different, yet still considered frequent users.

Independent variables (or predictors) considered at the index date were sex, age, residential area (metropolitan: ≥ 100,000; small town: 10,000–100,000; rural: < 10,000 with high or moderate metropolitan influence), material and social deprivation indices32, public prescription drug insurance plan status (PPDIP, see below for the different statuses), having been hospitalized in the two years before the index date, the number of previous ED visits during the year before the index date (PV), and the combined comorbidity index of Charlson (CCI33). The following diagnoses were considered: chronic disease (one diagnosis for each condition, i.e. asthma, COPD, CHF, CAD, diabetes, epilepsy, and HBP), chronic non-cancer pain (CNCP)34, injury, common mental disorders (CMD)35, serious mental disorders (SMD)35, alcohol abuse, and drug abuse. Each condition was identified using the reported diagnoses in the hospital register (one diagnosis) or in the physician reimbursement claim register (at least two diagnoses), during a two-year period before the index date.

Regarding PPDIP status, the Quebec province has four different statuses: “regular recipient of PPDIP”, “admissible to PPDIP and age ≥ 65 years with guaranteed income supplement” (GIS), “not admissible to PPDIP” (individuals with a private insurance plan), or “admissible to PPDIP and being a recipient of last-resort financial assistance” (LRFA)36.

There were less than 5% missing observations, mainly for material and social deprivation indices, and those observations were kept.

Statistical analysis

Frequent ED use prediction is a case of supervised learning, meaning that there are explicit labelled classes (i.e., frequent user or not). Along with logistic regression (LR), four ML predictive models amongst the most efficient for predicting a binary outcome38 were evaluated:

  1. Gradient boosting machines (GBM) build an ensemble of successive decision trees; each tree is a weak learner that improves on the previous one using the residuals39. Tuning parameters were the learning rate and the trees depth.

  2. Naïve Bayes (NB) model is based on Bayes’ theorem and uses a priori probabilities40. The tuning parameter was the Laplace smoothing for probabilities.

  3. Neural networks (NN) feed data through interconnected hidden layers of “neurons”, which apply mathematical operations to the inputs (the independent variables)41. Tuning parameters were the number of neurons and the weight decay.

  4. Random forests (RF) apply sequential splits to the data such that the separation is maximized in regards to a homogeneity criterion (i.e., the Gini index), resulting in a tree-like structure40. RF were evaluated with a binary (RF1) and a continuous outcome (RF2). Tuning parameters were the number of trees and the homogeneity criterion used.

The cohort was randomly divided in a training set (80% of the cohort) for building models and a testing set (remaining 20%) for evaluating performance18,42. This procedure is common in order to minimize overfitting, a sensitive issue when dealing with ML algorithms43. Area under the ROC curve (AUC), sensibility (SEN), specificity (SPE), positive predictive value (PPV), and negative predictive value (NPV) were computed to compare performances. AUC 95% confidence intervals were also computed using DeLong’s method44. The same reasoning was adopted as in Grinspan et al.18, the predictability of a model was judged on its AUC, based on 5 categories: poor (0.50–0.59), fair (0.60–0.69), good (0.70–0.79), very good (0.80–0.89) and excellent (0.90–1.0)18. The best cut-off thresholds were selected using Youden’s statistic 45 in order to compute sensitivity, specificity, positive predictive value, and negative predictive value. All tuning parameters were optimized by searching for the maximum AUC, but only the results with the selected parameters are presented here for clarity and brevity purposes.

Results from ML models (except LR) are not as directly interpretable as those from regression models, which straightforwardly assess the effect of predictive variables on the outcome with quantities such as odd ratios. However, ML framework allows for the evaluation of variable importance in a prediction model (also called feature importance). It was computed as the mean decrease in the Gini index in the case of GBM and RF, as the combinations of the absolute values of the weights for NN, and as the absolute value of the t-statistic for LR43. While it is not possible to compare variable importance directly from one model to another due to the models being different in nature, variable importance is still useful as an interpretable and relative quantity about the contribution of each predictor. In our models, all the variables are categorical and GBM, LR, and NN compute variable importance relative to a baseline category while RF computes an overall variable importance. Of note, there is no available variable importance measure when using the NB algorithm.

Sensitivity analyses were conducted on a population of frequent users with at least four visits and with a 50/50 training and testing sets.

Statistical significance level was set at α = 0.05 and differences in descriptive statistics were evaluated using chi-square tests. All analyses were performed with statistical software programs SAS (version 9.4) and R (version 4.2 with packages e1071, nnet, ranger, and xgboost).

Ethics approval and consent to participate

The research ethics board of the Centre intégré universitaire de santé et de services sociaux de l’Estrie – Centre hospitalier universitaire de Sherbrooke (number MP-31–2017-1571 – Urgences-CPSA) approved this study. The need for informed consent was waived by the aforementioned research ethics board due to the retrospective nature of the study.

Results

Characteristics of participants

Out of 451,775 ED users, 43,151 (9.5%) and 13,676 (3.0%) were frequent users 3 and frequent users 5, respectively (Table 1). For both definitions, differences between frequent users and non-frequent users were statistically significant except for the residential area variable.

Table 1.

Descriptive statistics for the different populations.

Variable Total (%) Non-frequent users 3 (%) Frequent users 3 (%) Non-frequent users 5 (%) Frequent users 5 (%)
Total 451,775 (100) 408,624 (100) 43,151 (100) 438,099 (100) 13,676 (100)
Female 234,320 (51.9) 210,904 (51.6) 23,416 (54.3) 226,968 (51.8) 7352 (53.8)
Age
18–34 23,723 (5.3) 21,179 (5.2) 2,544 (5.9) 22,775 (5.2) 948 (6.9)
35–54 83,393 (18.5) 76,291 (18.7) 7,102 (16.5) 80,977 (18.5) 2416 (17.7)
55–64 99,136 (21.9) 91,411 (22.4) 7,725 (17.9) 96,618 (22.1) 2518 (18.4)
65–74 116,323 (25.7) 105,977 (25.9) 10,346 (24.0) 113,198 (25.8) 3125 (22.9)
75–84 93,091 (20.6) 82,660 (20.2) 10,431 (24.2) 89,887 (20.5) 3204 (23.4)
 ≥ 85 36,109 (8.0) 31,106 (7.6) 5,003 (11.6) 34,644 (7.9) 1465 (10.7)
PPDIP admissibility
Regular 170,044 (37.6) 155,489 (38.1) 14,555 (33.7) 165,854 (37.9) 4190 (30.6)
GIS 118,313 (26.2) 103,725 (25.4) 14,588 (33.8) 113,618 (25.9) 4695 (34.3)
Not admissible 129,608 (28.7) 121,314 (29.7) 8294 (19.2) 127,254 (29.0) 2354 (17.2)
LRFA 33,810 (7.5) 28,096 (6.9) 5714 (13.2) 31,373 (7.2) 2437 (17.8)
Residential area
Metropolitan 302,097 (66.9) 274,825 (67.3) 27,272 (63.2) 293,601 (67.0) 8496 (62.1)
Small town 67,685 (15.0) 60,378 (14.8) 7307 (16.9) 65,331 (14.9) 2354 (17.2)
Rural 81,993 (18.1) 73,421 (18.0) 8572 (19.9) 79,167 (18.1) 2826 (20.7)
Previous ED visits
 ≤ 1 354,465 (78.5) 334,708 (81.9) 19,757 (45.8) 350,197 (79.9) 4268 (31.2)
2–3 71,406 (15.8) 58,738 (14.4) 12,668 (29.4) 67,507 (15.4) 3899 (28.5)
4 11,282 (2.5) 7834 (1.9) 3448 (8.0) 9888 (2.3) 1394 (10.2)
5 5959 (1.3) 3653 (0.9) 2306 (5.3) 4926 (1.1) 1033 (7.6)
 ≥ 6 8663 (1.9) 3691 (0.9) 4972 (11.5) 5581 (1.3) 3082 (22.5)
Previous hospitalization 191,862 (42.5) 165,229 (40.4) 26,633 (61.7) 182,364 (41.6) 9498 (69.5)
Material deprivation
Missing 15,928 (3.5) 13,976 (3.4) 1952 (4.5) 15,242 (3.5) 686 (5.0)
1 70,303 (15.6) 64,773 (15.9) 5530 (12.8) 68,731 (15.7) 1572 (11.5)
2 82,729 (18.3) 75,577 (18.5) 7152 (16.6) 80,578 (18.4) 2151 (15.7)
3 87,736 (19.4) 79,719 (19.5) 8017 (18.6) 85,238 (19.5) 2498 (18.8)
4 96,514 (21.4) 86,900 (21.3) 9614 (22.3) 93,399 (21.3) 3115 (22.8)
5 98,565 (21.8) 87,679 (21.5) 10,886 (25.2) 94,911 (21.7) 3654 (26.7)
Social deprivation
Missing 15,928 (3.5) 13,976 (3.4) 1952 (4.5) 15,242 (3.5) 686 (5.0)
1 73,218 (16.2) 67,198 (16.4) 6020 (14.0) 71,435 (16.3) 1783 (13.0)
2 77,968 (17.3) 71,444 (17.5) 6524 (15.1) 75,997 (17.3) 1971 (14.4)
3 87,542 (19.4) 79,417 (19.4) 8125 (18.8) 85,127 (19.4) 2415 (17.7)
4 93,164 (20.6) 84,148 (20.6) 9016 (20.9) 90,239 (20.6) 2925 (21.4)
5 103,955 (23.0) 92,441 (22.6) 11,514 (26.7) 100,059 (22.8) 3896 (28.5)
Comorbidity index
0 277,798 (61.5) 259,504 (63.5) 18,294 (42.4) 272,919 (62.3) 4879 (35.7)
1–2 98,228 (21.7) 86,976 (21.3) 11,252 (26.1) 94,558 (21.6) 3670 (26.8)
3–4 34,395 (7.6) 28,527 (7.0) 5868 (13.6) 32,248 (7.4) 2147 (15.7)
 ≥ 5 41,354 (9.2) 33,617 (8.2) 7737 (17.9) 38,374 (8.8) 2980 (21.8)
Asthma 47,514 (10.5) 41,438 (10.1) 6076 (14.1) 45,210 (10.3) 2304 (16.8)
COPD 62,975 (13.9) 51,842 (12.7) 11,133 (25.8) 58,694 (13.4) 4281 (31.3)
CHF 27,945 (6.2) 22,281 (5.5) 5664 (13.1) 25,793 (5.9) 2152 (15.7)
CAD 113,141 (25.0) 98,234 (24.0) 14,907 (34.5) 108,062 (24.7) 5079 (37.1)
Diabetes 151,951 (33.6) 135,269 (33.1) 16,682 (38.7) 146,325 (33.4) 5626 (41.1)
Epilepsy 11,538 (2.6) 9857 (2.4) 1681 (3.9) 10,864 (2.5) 674 (4.9)
HBP 245,449 (54.3) 220,607 (54.0) 24,842 (57.6) 237,388 (54.2) 8061 (58.9)
Alcohol abuse 10,678 (2.4) 8173 (2.0) 2505 (5.8) 9444 (2.2) 1234 (9.0)
CNCP 75,263 (16.7) 65,660 (16.1) 9603 (22.3) 71,859 (16.4) 3404 (24.9)
CMD 102,540 (22.7) 87,533 (21.4) 15,007 (34.8) 96,711 (22.1) 5829 (42.6)
Drug abuse 6908 (1.5) 4854 (1.2) 2054 (4.8) 5827 (1.3) 1081 (7.9)
Injury 160,577 (35.5) 140,479 (34.4) 20,098 (46.6) 153,400 (35) 7177 (52.5)
SMD 15,778 (3.5) 12,465 (3.1) 3313 (7.7) 14,281 (3.3) 1497 (10.9)

Percentages in brackets are relative to the column.

PPDIP public prescription drug insurance plan, GIS ≥ 65 years with guaranteed income supplement, LRFA recipients of last resort financial assistance, COPD chronic obstructive pulmonary disease, CHF congestive heart failure, CAD coronary artery disease, HBP high blood pressure, CNCP chronic non-cancer pain, CMD common mental disorders, SMD serious mental disorders.

Main results

Multiple combinations of explicative variables were evaluated. The following variables were selected for their clinical interpretation and explicative power: age, public prescription drug insurance plan status, Charlson comorbidity index, number of previous ED visits, chronic obstructive pulmonary disease, injury, serious mental disorders, common mental disorders, chronic non-cancer pain, alcohol, and drugs. No missing values were observed in the variables selected for prediction.

Model performances are shown in Tables 2, 3, for frequent users 3 and 5 respectively. In both cases, RF1 (binary outcome) had poor performances regarding AUC and SEN, followed by NB (poor or fair). On the other hand, RF1 had the highest SPE and PPV. GBM, LR, NN, and RF2 had similar good performances (or very good in the case of GBM, LR, and RF2 for frequent users 5). Performances improved as the threshold for frequent use was increased from three to five visits, except for RF1. Overall, SPE (NPV) was higher than SEN (PPV).

Table 2.

Model performances for frequent users 3.

Model AUC SEN SPE PPV NPV
LR 74.8 (74.3–75.4) 60.0 78.0 22.3 94.9
GBM 74.9 (74.3–75.5) 64.0 74.0 20.7 95.1
NB 59.6 (59.1–60.0) 23.2 95.9 37.6 92.2
NN 74.4 (73.8–75.0) 58.5 78.8 22.6 94.7
RF 1 53.8 (53.5–54.0) 8.1 99.4 60.2 91.1
RF 2 74.7 (74.1–75.3) 61.8 76.2 21.5 95.0

GBM gradient boosting machine, LR logistic regression, NB naïve bayes, NN neural network, RF random forests, AUC area under the curve, SEN sensitivity, SPE specificity, PPV positive predicted value, NPV negative predicted value.

Table 3.

Model performances for frequent users 5.

Model AUC SEN SPE PPV NPV
LR 81.3 (80.4–82.2) 67.7 81.3 10.1 98.8
GBM 81.4 (80.4–82.3) 69.9 78.9 9.3 98.8
NB 61.2 (60.4–62.0) 24.8 97.6 24.5 97.7
NN 78.4 (77.3–79.4) 63.2 82.1 9.9 98.6
RF 1 51.4 (51.1–51.8) 2.9 99.9 51.3 97.1
RF 2 81.1 (80.2–82.0) 72.5 75.6 8.4 98.9

GBM gradient boosting machine, LR logistic regression, NB naïve bayes, NN neural network, RF random forests (1: binary outcome, 2: continuous outcome). AUC area under the curve, SEN sensitivity, SPE specificity, PPV positive predicted value, NPV negative predicted value.

Variable importance results are shown in Tables 4, 5. Those measures are relative, meaning that it is only possible to compare importance between variables in the same model (e.g., variable importance between LR and GBM are not comparable). However, the ranking of independent variables in each model can still be compared for all models, along with the relative magnitude. All models reported the number of previous ED visits as the most important variable for prediction. The magnitude by which it was superior to the other variables varied considerably. CCI and PPDIP were also important, but to a lesser extent (for instance, their importance was respectively 6 and 12 times less than PV for RF2 in the case of frequent users 5). Among chronic diseases, COPD was the most important.

Table 4.

Variable importance for the predictive models (frequent users 5).

Variable LR GBM NN RF1 RF2
Age
18–34 Ref Ref Ref 284.6 134.2
35–54 5.2 6.8E−4 4.3
55–64 6.2 9.0E−4 4.0
65–74 6.7 1.4E−3 3.6
75–84 4.3 6.5E−4 4.7
 ≥ 85 2.4 3.4E−3 4.2
PPDIP
Regular Ref Ref Ref 220.7 105.6
GIS 9.8 1.6E−2 4.9
Not admissible 5.4 1.1E−2 4.3
LRFA 11.3 1.8E−2 5.2
CCI
0 Ref Ref Ref 281.4 134.7
1–2 12.9 1.1E−2 3.5
3–4 13.9 1.4E−2 3.2
 ≥ 5 16.4 2.3E−2 4.7
PV
 ≤ 1 Ref Ref Ref 1845.8 916.7
2–3 49.7 0.1 2.9
4 54.0 0.1 2.8
5 53.3 0.1 5.3
 ≥ 6 94.8 0.4 4.9
COPD 16.9 0.1 4.3 142.7 70.2
Injury 2.9 2.4E−3 4.2 102.2 47.9
SMD 4.5 2.4E−3 5.5 93.9 45.1
CMD 8.9 3.3E−2 3.9 93.7 45.7
CNCP 8.6 5.5E−3 3.7 102.1 47.9
Alcohol 5.2 3.5E−3 5.2 90.6 43.2
Drugs 6.8 1.3E−2 4.5 123.9 61.4

The higher the value, the higher the importance (relative to a model).

LR logistic regression, GBM gradient boosting machine, NN neural network, RF random forests (1: binary outcome, 2: continuous outcome).

Table 5.

Variable importance for the predictive models (frequent users 3).

Variable LR GBM NN RF1 RF2
Age
18–34 Ref Ref Ref 562.0 271.1
35–54 7.0 5.1E−4 3.5
55–64 9.0 1.1E−3 2.7
65–74 7.5 4.4E−4 3.6
75–84 2.5 4.0E−3 2.8
 ≥ 85 3.2 8.5E−3 3.7
PPDIP
Regular Ref Ref Ref 465.6 226.3
GIS 13.2 7.5E−3 4.2
Not admissible 8.5 1.1E−2 3.9
LRFA 17.0 1.4E−2 4.7
CCI
0 Ref Ref Ref 882.1 439.6
1–2 19.8 1.7E−2 4.1
3–4 21.3 1.3E−2 5.0
 ≥ 5 24.7 1.6E−2 4.3
PV
 ≤ 1 Ref Ref Ref 5255.2 2621.2
2–3 74.3 0.2 3.7
4 66.2 0.1 5.3
5 61.7 0.1 6.0
 ≥ 6 96.4 0.3 5.3
COPD 24.4 2.6E−2 5.1 475.9 234.7
Injury 5.1 2.2E−3 3.7 205.3 98.3
SMD 5.4 2.2E−3 4.8 182.0 88.5
CMD 12.0 1.8E−2 2.9 266.9 130.8
CNCP 11.6 5.7E−3 3.3 180.1 86.7
Alcohol 4.3 2.0E−3 5.3 164.6 79.7
Drugs 8.1 2.9E−3 4.8 228.4 111.2

The higher the value, the higher the importance (relative to a model).

LR logistic regression, GBM gradient boosting machine, NN neural network, RF random forests (1: binary outcome, 2: continuous outcome).

No significant changes were observed in the interpretation of results during sensitivity analyses.

Limitations

Both quantity and quality of data are imperative in a ML context. In this study, we had access to an exhaustive medico-administrative database which included hospital and physician data, but it did not include patient reported outcomes (e.g., perceived health, included in the Canadian Community Health Survey46). Those latter could improve the predictive power of models in future work. For instance, studies using national health surveys and telephone interviews found that fair or poor health status and dissatisfaction with treatment outcome were significantly associated with frequent ED use47,48.

Our study focused on frequent ED users with chronic diseases; though results should only be generalized to this population, chronic diseases are common in the frequent ED user population. Better understanding of a population of ED users with chronic diseases is relevant for other healthcare aspects as chronic diseases are linked not only to frequent ED use, but also to hospitalisations, functioning, and deaths24,49,50.

Discussion

This paper aimed at comparing four ML prediction models (gradient boosting machine, naïve Bayes, neural networks, and random forests) with logistic regression, for frequent ED use in a population with chronic diseases. Those ML models have been successfully used to predict related issues, such as ED revisits, in hospital mortality, or hospital admissions at ED triage42,51,52. Accurate ML models may help for early identification of frequent ED use, thus improving targeted interventions such as case management2,15,16. To this end, case-finding tools are appropriate, such as CONECT-6 which was derived from LR models53.

Model performance

In our study, no model clearly outperformed the others. Other studies on frequent ED use that applied ML reached a similar conclusion18,20,54, though they either focused on a specific chronic disease such as asthma or epilepsy or used hospital only data. In fact, a recent systematic review aiming at comparing performances of LR with ML models (among which figured the ones used in this study) for clinical prediction of a binary outcome showed that there is currently no clear performance benefit of ML models55. However, this review included only studies that used clinical data. Other studies that focus on ED related issues (e.g. risk of emergency hospital admission, risk for sepsis, heart failure readmission) found improved predictions with ML56,57, although this is not a general rule58. Quantity of variables (58 to 121 variables56) or very discriminative variables57 explained those improved predictions, amongst others. In our models, increasing the threshold for frequent ED use (thus reducing the number of frequent ED users) gave slightly better performances for all models. A higher threshold increased the homogeneity of the characteristics of frequent users, thus facilitating prediction of their ED use, a result that has already been observed54. Other studies also compared LR to ML models using administrative claims data59,60 and found similar performances, though they did not focus on ED-related outcomes.

In medical studies, the signal-to-noise ratio is often low, i.e., the amount of information contained in the database that is useful for the prediction61, which may explain in part the modest improvements (if any) of ML models. The type of available variables may also affect performance. For instance, in a study about uncontrolled diabetes prediction, LR was outperformed by NN or GBM62. The authors used data from administrative claims and from US census, in which they had access to social determinants, such as food insecurity or recreational park access. It is possible to tune more precisely ML models to overcome those limitations. In our study, this fine-tuning would have amounted to evaluate model parameters over broader spaces. As an example, NN is known for its ability to model complex and nonlinear relationships by combining multiple hidden layers41, which is limited for a traditional LR. Broader ranges could also be evaluated; GBM has shown good performances and helped refine clinical tools when allowed to learn slowly57. However, this fine-tuning comes with a high computation cost and an added complexity. This latter drawback may result in overfitting issues and limited generalization.

Our models had higher sensitivity than positive predictive value, apart from RF1. This means that most models accounted for a fair portion of frequent ED users, but the number of false positives was significant. This contrasts with another study on frequent ED use among children with asthma20. The authors also applied ML models (LR and RF amongst others) and found higher predictive positive values4870 than sensitivity1627. However, their threshold choice was guided by a maximization of the AUC rather than by a statistical criterion. Besides, specificity and negative predicted values were high in our study. This is a known issue when dealing with imbalanced health datasets (frequent users 3 and 5 represented 9.5% and 3.0% of the cohort respectively)63,64. Algorithms learns mostly from the majority class, which introduces a bias towards non-frequent users. There are possible adjustments such as under-sampling from the majority class or over-sampling from the minority class, but those may not be recommended procedures as they distort prevalence55. Learning from highly imbalanced datasets is an active research area65 and may affect prediction for frequent ED use in the future, especially if combined with multiple different models66.

Variable importance

The models developed in our study are also interesting from a clinical point of view (i.e., risk stratification by variable for frequent ED use). In our study, CCI, PPDIP, and the previous number of ED visits were important, though the latter was the most important variable by a large margin. This result is supported by other studies on frequent ED use conducted with LR26,67,68, but also with ML models18,20. In fact, this variable is usually so important that Brennan et al.68 stated that “targeting patients with the most extreme number of ED visits may be the best and most practical option for targeted interventions”, thus allowing for optimal resource allocation. Hudon et al. (2020) also found that a LR including this variable and having a previous hospitalization performed almost as well as models with more variables such as comorbidities, sociodemographic status, and public prescription drug insurance status26. Even when predicting other ED related-outcomes, the previous number of visits is relevant: Rahimian et al.56 predicted emergency hospital admission (after an ED visit) using RF and GBM and found that it was the most important variable. They also found that other variables are excellent predictors of emergency hospital admission, such as laboratory test results (e.g., cholesterol ratio, haemoglobin, platelets).

Conclusions

Frequent ED use is a major issue in primary and emergency care, and ML models are becoming increasingly popular in medicine and healthcare in general. They are rapidly evolving, offering new opportunities, and while there has been substantial theoretical progress with ML models, the small improvements do not show a clear superiority over simpler models. Those latter still display reasonable performances55,69. In our study, LR was as successful in predicting frequent ED use as other ML models, while the number of ED visits was the most important variable. Access to other variables may be more helpful for refining prediction in the case of frequent ED use, such as patient-reported outcomes or clinical notes. Those types of data have been successfully used with machine learning models in a context of primary care, although not for ED use prediction70,71. Future work also includes considering complex non-linear interactions, where ML models outperform traditional ones72.

Supplementary Information

Abbreviations

AUC

Area under the curve

CAD

Coronary artery disease

CCI

Charlson comorbidity index

CHF

Congestive heart failure

CMD

Common mental disorders

CNCP

Chronic noncancer pain

COPD

Chronic obstructive pulmonary disorder

ED

Emergency department

HBP

High blood pressure

GBM

Gradient boosting machine

LR

Logistic regression

ML

Machine learning

NB

Naïve bayes

NN

Neural network

NPV

Negative predicted value

PPDIP

Public prescription drug insurance plan

RF

Random forests

SEN

Sensitivity

SMD

Serious mental disorders

SPE

Specificity

PPV

Positive predicted value

Author contributions

C.H. and A.V. acquired the funding. Y.M.C., J.C., and C.H. were involved in data analysis. Y.M.C. and C.H. wrote the original draft. Y.M.C., A.V., J.C., I.D., and C.H. contributed to the study conceptualization, interpreted, and validated the results, and reviewed and edited the submitted manuscript. All data used in this study were fully anonymized and no individual detail was used, therefore, consent for publication is not applicable.

Funding

This study was supported by the Fonds de recherche du Québec – Santé, the Fonds de recherche du Québec – Nature et technologie, the Québec SPOR SUPPORT Unit, and the Centre de recherche du Centre hospitalier de l’université de Sherbrooke.

Data availability

The datasets analysed during the current study are not publicly available due to provincial laws about privacy. Although we acknowledge the importance of data availability for reproducible research, our research team is bound by legal reasons to not divulge any part of the data. The Commission de l’accès à l’information du Québec is the provincial organisation that reviews research projects and allows researchers to access health databases. It is also responsible for ensuring their privacy as those databases contain sensitive patient information and it does not legally allow for making any part of them public. Therefore, we are not able to make any part of our data publicly available.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-023-27568-6.

References

  • 1.Krieg C, Hudon C, Chouinard MC, Dufour I. Individual predictors of frequent emergency department use: A scoping review. BMC Health Serv. Res. 2016;16(1):1–10. doi: 10.1186/s12913-016-1852-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kumar GS, Klein R. Effectiveness of case management strategies in reducing emergency department visits in frequent user patient populations: A systematic review. J. Emerg. Med. 2013;44(3):717–729. doi: 10.1016/j.jemermed.2012.08.035. [DOI] [PubMed] [Google Scholar]
  • 3.Soril LJ, Leggett LE, Lorenzetti DL, Noseworthy TW, Clement FM. Characteristics of frequent users of the emergency department in the general adult population: A systematic review of international healthcare systems. Health Policy. 2016;120(5):452–461. doi: 10.1016/j.healthpol.2016.02.006. [DOI] [PubMed] [Google Scholar]
  • 4.Giannouchos TV, Kum HC, Foster MJ, Ohsfeldt RL. Characteristics and predictors of adult frequent emergency department users in the United States: A systematic literature review. J. Eval. Clin. Pract. 2019;25(3):420–433. doi: 10.1111/jep.13137. [DOI] [PubMed] [Google Scholar]
  • 5.Dufour I, Chiu Y, Courteau J, Chouinard MC, Dubuc N, Hudon C. Frequent emergency department use by older adults with ambulatory care sensitive conditions: A population-based cohort study. Geriatr. Gerontol. Int. 2020;20(4):317–323. doi: 10.1111/ggi.13875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Cunningham A, Mautner D, Ku B, Scott K, LaNoue M. Frequent emergency department visitors are frequent primary care visitors and report unmet primary care needs. J. Eval. Clin. Pract. 2017;23(3):567–573. doi: 10.1111/jep.12672. [DOI] [PubMed] [Google Scholar]
  • 7.Billings J, Raven MC. Dispelling an urban legend: frequent emergency department users have substantial burden of disease. Health Aff. (Millwood) 2013;32(12):2099–2108. doi: 10.1377/hlthaff.2012.1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Atzema CL, Maclagan LC. The transition of care between emergency department and primary care: A scoping study. Acad. Emerg. Med. 2017;24(2):201–215. doi: 10.1111/acem.13125. [DOI] [PubMed] [Google Scholar]
  • 9.Sun BC, Burstin HR, Brennan TA. Predictors and outcomes of frequent emergency department users. Acad. Emerg. Med. 2003;10(4):320–328. doi: 10.1111/j.1553-2712.2003.tb01344.x. [DOI] [PubMed] [Google Scholar]
  • 10.Ellis G, Marshall T, Ritchie C. Comprehensive geriatric assessment in the emergency department. Clin. Interv. Aging. 2014;9:2033–2044. doi: 10.2147/CIA.S29662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mitchell MS, Leon CLK, Byrne TH, Lin WC, Bharel M. Cost of health care utilization among homeless frequent emergency department users. Psychol. Serv. 2017;14(2):193–202. doi: 10.1037/ser0000113. [DOI] [PubMed] [Google Scholar]
  • 12.LaCalle E, Rabin E. Frequent users of emergency departments: the myths, the data, and the policy implications. Ann. Emerg. Med. 2010;56(1):42–48. doi: 10.1016/j.annemergmed.2010.01.032. [DOI] [PubMed] [Google Scholar]
  • 13.Institut canadien d’information sur la santé. SNISA — Nombre de Visites au Service d’Urgence et Durée du Séjour par Province et Territoire, 2018–2019. ICIS (2019).
  • 14.Statistics Canada. Population Projections for Canada (2018 to 2068), Provinces and Territories (2018 to 2043) (2019).
  • 15.Hudon C, Chouinard MC, Pluye P, El Sherif R, Bush PL, Rihoux B, et al. Characteristics of case management in primary care associated with positive outcomes for frequent users of health care: A systematic review. Ann. Fam. Med. 2019;17(5):448–458. doi: 10.1370/afm.2419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sutherland D, Hayter M. Structured review: Evaluating the effectiveness of nurse case managers in improving health outcomes in three major chronic diseases. J. Clin. Nurs. 2009;18(21):2978–2992. doi: 10.1111/j.1365-2702.2009.02900.x. [DOI] [PubMed] [Google Scholar]
  • 17.Chiu Y, Racine-Hemmings F, Dufour I, Vanasse A, Chouinard M-C, Bisson M, et al. Statistical tools used for analyses of frequent users of emergency department: A scoping review. BMJ Open. 2019;9(5):e027750. doi: 10.1136/bmjopen-2018-027750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Grinspan ZM, Shapiro JS, Abramson EL, Hooker G, Kaushal R, Kern LM. Predicting frequent ED use by people with epilepsy with health information exchange data. Neurology. 2015;85(12):1031–1038. doi: 10.1212/WNL.0000000000001944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Patel SJ, Chamberlain DB, Chamberlain JM. A machine learning approach to predicting need for hospitalization for pediatric asthma exacerbation at the time of emergency department triage. Acad. Emerg. Med. 2018;25(12):1463–1470. doi: 10.1111/acem.13655. [DOI] [PubMed] [Google Scholar]
  • 20.Das LT, Abramson EL, Stone AE, Kondrich JE, Kern LM, Grinspan ZM. Predicting frequent emergency department visits among children with asthma using EHR data. Pediatr. Pulmonol. 2017;52(7):880–890. doi: 10.1002/ppul.23735. [DOI] [PubMed] [Google Scholar]
  • 21.Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. BMC Med. 2015;13(1):1. doi: 10.1186/s12916-014-0241-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Canadian Institute for Health Information. Ambulatory Care Sensitive Conditions 2019 [Available from: http://indicatorlibrary.cihi.ca/display/HSPIL/Ambulatory+Care+Sensitive+Conditions.
  • 23.Gibson OR, Segal L, McDermott RA. A systematic review of evidence on the association between hospitalisation for chronic disease related ambulatory care sensitive conditions and primary health care resourcing. BMC Health Serv. Res. 2013;13(1):336. doi: 10.1186/1472-6963-13-336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sanmartin C, Khan S, l’équipe de Recherche de l’Initiative sur les Données Longitudinales Administratives et sur la Santé. Hospitalisations Pour des Conditions Propices aux Soins Ambulatoires (CPSA) : Les Facteurs qui Importent (2011).
  • 25.Hsieh VC, Hsieh ML, Chiang JH, Chien A, Hsieh MS. Emergency department visits and disease burden attributable to ambulatory care sensitive conditions in elderly adults. Sci. Rep. 2019;9(1):3811. doi: 10.1038/s41598-019-40206-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Hudon C, Courteau J, Chiu YM, Chouinard MC, Dubois MF, Dubuc N, et al. Risk of frequent ED utilization among an ambulatory care sensitive condition population: a population-based cohort study. Med. Care. 2020;58(3):248–256. doi: 10.1097/MLR.0000000000001270. [DOI] [PubMed] [Google Scholar]
  • 27.Rechel B, Dzakula A, Duran A, Fattore G, Edwards N, Grignon M, et al. Hospitals in rural or remote areas: An exploratory review of policies in 8 high-income countries. Health Policy. 2016;120(7):758–769. doi: 10.1016/j.healthpol.2016.05.011. [DOI] [PubMed] [Google Scholar]
  • 28.Haggerty JL, Roberge D, Pineault R, Larouche D, Touati N. Features of primary healthcare clinics associated with patients’ utilization of emergency rooms: Urban–rural differences. Healthc Policy. 2007;3(2):72. [PMC free article] [PubMed] [Google Scholar]
  • 29.Rosenwax LK, McNamara BA, Murray K, McCabe RJ, Aoun SM, Currow DC. Hospital and emergency department use in the last year of life: A baseline for future modifications to end-of-life care. Med. J. Aust. 2011;194(11):570–573. doi: 10.5694/j.1326-5377.2011.tb03106.x. [DOI] [PubMed] [Google Scholar]
  • 30.Barbera L, Taylor C, Dudgeon D. Why do patients with cancer visit the emergency department near the end of life? Can. Med. Assoc. J. 2010;182(6):563–568. doi: 10.1503/cmaj.091187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Lévesque LE, Hanley JA, Kezouh A, Suissa S. Problem of immortal time bias in cohort studies: Example using statins for preventing progression of diabetes. BMJ. 2010;340:b5087. doi: 10.1136/bmj.b5087. [DOI] [PubMed] [Google Scholar]
  • 32.Pampalon R, Hamel D, Gamache P. The Quebec Index of Material and Social Deprivation: Methodological Follow-up, 1991 Through 2006: Institut National de Santé Publique du Québec (2011).
  • 33.Simard M, Sirois C, Candas B. Validation of the combined comorbidity index of charlson and elixhauser to predict 30-day mortality across ICD-9 and ICD-10. Med. Care. 2018;56(5):441–447. doi: 10.1097/MLR.0000000000000905. [DOI] [PubMed] [Google Scholar]
  • 34.Lacasse A, Ware MA, Dorais M, Lanctôt H, Choinière M. Is the Quebec provincial administrative database a valid source for research on chronic non-cancer pain? Pharmacoepidemiol. Drug Saf. 2015;24(9):980–990. doi: 10.1002/pds.3820. [DOI] [PubMed] [Google Scholar]
  • 35.Gaulin M, Simard M, Candas B, Lesage A, Sirois C. Combined impacts of multimorbidity and mental disorders on frequent emergency department visits: A retrospective cohort study in Quebec, Canada. CMAJ. 2019;191(26):E724–E732. doi: 10.1503/cmaj.181712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Éducaloi. The Public Drug Insurance Plan Québec2020 [Available from: https://educaloi.qc.ca/en/capsules/the-public-drug-insurance-plan/.
  • 37.Huang JA, Weng RH, Lai CS, Hu JS. Exploring medical utilization patterns of emergency department users. J. Formos. Med. Assoc. 2008;107(2):119–128. doi: 10.1016/S0929-6646(08)60125-4. [DOI] [PubMed] [Google Scholar]
  • 38.Alanazi HO, Abdullah AH, Qureshi KN. A critical review for developing accurate and dynamic predictive models using machine learning methods in medicine and health care. J. Med. Syst. 2017;41(4):69. doi: 10.1007/s10916-017-0715-6. [DOI] [PubMed] [Google Scholar]
  • 39.Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 1189–1232 (2001).
  • 40.James G, Witten D, Hastie T, Tibshirani R. Introduction to Statistical Learning with Applications in R. Springer; 2013. [Google Scholar]
  • 41.Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer; 2009. [Google Scholar]
  • 42.Hong WS, Haimovich AD, Taylor RA. Predicting hospital admission at emergency department triage using machine learning. PLoS ONE. 2018;13(7):e0201016. doi: 10.1371/journal.pone.0201016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kuhn M, Johnson K. Applied Predictive Modeling. Springer; 2013. [Google Scholar]
  • 44.Sun X, Xu W. Fast implementation of DeLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Process. Lett. 2014;21(11):1389–1393. [Google Scholar]
  • 45.Le CT. A solution for the most basic optimization problem associated with an ROC curve. Stat. Methods Med. Res. 2006;15(6):571–584. doi: 10.1177/0962280206070637. [DOI] [PubMed] [Google Scholar]
  • 46.Statistics Canada. Canadian Community Health Survey - Annual Component (CCHS) 2020 [Available from: https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=assembleDESurv&DECId=113674&RepClass=591&Id=1263799&DFId=180541.
  • 47.Vinton DT, Capp R, Rooks SP, Abbott JT, Ginde AA. Frequent users of US emergency departments: Characteristics and opportunities for intervention. Emerg. Med. J. 2014;31(7):526–532. doi: 10.1136/emermed-2013-202407. [DOI] [PubMed] [Google Scholar]
  • 48.Huang JA, Tsai WC, Chen YC, Hu WH, Yang DY. Factors associated with frequent use of emergency services in a medical center. J. Formos. Med. Assoc. 2003;102(4):222–228. [PubMed] [Google Scholar]
  • 49.Rizzuto D, Melis RJF, Angleman S, Qiu C, Marengoni A. Effect of chronic diseases and multimorbidity on survival and functioning in elderly adults. J. Am. Geriatr. Soc. 2017;65(5):1056–1060. doi: 10.1111/jgs.14868. [DOI] [PubMed] [Google Scholar]
  • 50.Statistics Canada. Table 13-10-0800-01 Deaths and mortality rate (age standardization using 2011 population), by selected grouped causes (2020) [Available from: 10.25318/1310080001-eng.
  • 51.Hao S, Jin B, Shin AY, Zhao Y, Zhu C, Li Z, et al. Risk prediction of emergency department revisit 30 days post discharge: A prospective study. PLoS ONE. 2014;9(11):e112944. doi: 10.1371/journal.pone.0112944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Taylor RA, Pare JR, Venkatesh AK, Mowafi H, Melnick ER, Fleischman W, et al. Prediction of in-hospital mortality in emergency department patients with sepsis: A local big data-driven, Machine Learning Approach. Acad. Emerg. Med. 2016;23(3):269–278. doi: 10.1111/acem.12876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hudon C, Bisson M, Dubois M-F, Chiu Y, Chouinard M-C, Dubuc N, et al. CONECT-6: A case-finding tool to identify patients with complex health needs. BMC Health Serv. Res. 2021;21(1):1–9. doi: 10.1186/s12913-021-06154-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pereira M, Singh V, Hon CP, Greg McKelvey T, Sushmita S, De Cock M, editors. Predicting Future Frequent Users of Emergency Departments in California State. Association for Computing Machinery, Inc; 2016. [Google Scholar]
  • 55.Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 2019;110:12–22. doi: 10.1016/j.jclinepi.2019.02.004. [DOI] [PubMed] [Google Scholar]
  • 56.Rahimian F, Salimi-Khorshidi G, Payberah AH, Tran J, Ayala Solares R, Raimondi F, et al. Predicting the risk of emergency admission with machine learning: Development and validation using linked electronic health records. PLoS Med. 2018;15(11):e1002695. doi: 10.1371/journal.pmed.1002695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Delahanty RJ, Alvarez J, Flynn LM, Sherwin RL, Jones SS. Development and evaluation of a machine learning model for the early identification of patients at risk for sepsis. Ann. Emerg. Med. 2019;73(4):334–344. doi: 10.1016/j.annemergmed.2018.11.036. [DOI] [PubMed] [Google Scholar]
  • 58.Frizzell JD, Liang L, Schulte PJ, Yancy CW, Heidenreich PA, Hernandez AF, et al. Prediction of 30-day all-cause readmissions in patients hospitalized for heart failure: Comparison of machine learning and other statistical approaches. JAMA Cardiol. 2017;2(2):204–209. doi: 10.1001/jamacardio.2016.3956. [DOI] [PubMed] [Google Scholar]
  • 59.Desai RJ, Wang SV, Vaduganathan M, Evers T, Schneeweiss S. Comparison of machine learning methods with traditional models for use of administrative claims with electronic medical records to predict heart failure outcomes. JAMA Netw. Open. 2020;3(1):e1918962-e. doi: 10.1001/jamanetworkopen.2019.18962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.MacKay EJ, Stubna MD, Chivers C, Draugelis ME, Hanson WJ, Desai ND, et al. Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations. PLoS ONE. 2021;16(6):e0252585. doi: 10.1371/journal.pone.0252585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Ennis M, Hinton G, Naylor D, Revow M, Tibshirani R. A comparison of statistical learning methods on the Gusto database. Stat. Med. 1998;17(21):2501–2508. doi: 10.1002/(sici)1097-0258(19981115)17:21<2501::aid-sim938>3.0.co;2-m. [DOI] [PubMed] [Google Scholar]
  • 62.Basu S, Narayanaswamy R. a prediction model for uncontrolled type 2 diabetes mellitus incorporating area-level social determinants of health. Med. Care. 2019;57(8):592–600. doi: 10.1097/MLR.0000000000001147. [DOI] [PubMed] [Google Scholar]
  • 63.Liu, Y. -Q., Wang, C., Zhang, L. (eds) Decision tree based predictive models for breast cancer survivability on imbalanced data. In 2009 3rd International Conference on Bioinformatics and Biomedical Engineering (IEEE, 2009).
  • 64.Dubey R, Zhou J, Wang Y, Thompson PM, Ye J. Alzheimer’s disease neuroimaging I. Analysis of sampling techniques for imbalanced data: An n = 648 ADNI study. Neuroimage. 2014;87:220–241. doi: 10.1016/j.neuroimage.2013.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Klement, W., Wilk, S., Michalowski, W., Matwin, S. (eds) Classifying severely imbalanced data. In Canadian Conference on Artificial Intelligence (Springer, 2011).
  • 66.Huang, F., Wang, S., Chan, C. –C. (eds) Predicting disease by using data mining based on healthcare information system. In 2012 IEEE International Conference on Granular Computing (IEEE, 2012).
  • 67.Okuyemi KS, Frey B. Describing and predicting frequent users of an emergency department. J. Assoc. Acad. Minor. Phys. 2001;12(1–2):119–123. [PubMed] [Google Scholar]
  • 68.Brennan J, Chan T, Hsia R, Vilke G, Killeen J, Castillo E. Predicting frequent use of emergency department resources. Ann. Emerg. Med. 2014;4(64):S118–S119. [Google Scholar]
  • 69.Hand DJ. Classifier technology and the illusion of progress. Stat. Sci. 2006;21:1–14. [Google Scholar]
  • 70.Verma, D., Bach, K., Mork. P. J. (eds) Application of machine learning methods on patient reported outcome measurements for predicting outcomes: A literature review. In Informatics (MDPI, 2021).
  • 71.Hylan TR, Von Korff M, Saunders K, Masters E, Palmer RE, Carrell D, et al. Automated prediction of risk for problem opioid use in a primary care setting. J. Pain. 2015;16(4):380–387. doi: 10.1016/j.jpain.2015.01.011. [DOI] [PubMed] [Google Scholar]
  • 72.Orfanoudaki A, Chesley E, Cadisch C, Stein B, Nouh A, Alberts MJ, et al. Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score. PLoS ONE. 2020;15(5):e0232414. doi: 10.1371/journal.pone.0232414. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets analysed during the current study are not publicly available due to provincial laws about privacy. Although we acknowledge the importance of data availability for reproducible research, our research team is bound by legal reasons to not divulge any part of the data. The Commission de l’accès à l’information du Québec is the provincial organisation that reviews research projects and allows researchers to access health databases. It is also responsible for ensuring their privacy as those databases contain sensitive patient information and it does not legally allow for making any part of them public. Therefore, we are not able to make any part of our data publicly available.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES