Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Oct 22.
Published in final edited form as: J Crit Care. 2021 Jan 20;62:283–288. doi: 10.1016/j.jcrc.2021.01.003

Including urinary output to define AKI enhances the performance of machine learning models to predict AKI at admission

Emma Schwager a,1, Stephanie Lanius a,1, Erina Ghosh a, Larry Eshelman a, Kalyan S Pasupathy b,c, Erin F Barreto c,d, Kianoush Kashani e,f,*
PMCID: PMC8534813  NIHMSID: NIHMS1746417  PMID: 33508763

Abstract

Purpose:

Acute kidney injury (AKI) is a prevalent and detrimental condition in intensive care unit patients. Most AKI predictive models only predict creatinine-triggered AKI (AKICr) and might underperform when predicting urine-output-triggered AKI (AKIUO). We aimed to describe how admission AKICr prediction models perform in all AKI patients.

Materials and methods:

Three types of models were trained: 1) pAKIany, predicting AKI based on creatinine or urine output, 2) pAKIUO, predicting AKI based only on urine output, and 3) pAKICr, predicting AKI based only on creatinine. We compared model performance and predictive features.

Results:

The pAKIany models had the best overall performance (AUROC 0.673–0.716) and the most consistent performance across three patient cohorts grouped by type of AKI trigger (min AUROC of 0.636). The pAKICr models had fair performance in predicting AKICr (AUROCs 0.702–0.748) but poor performance predicting AKIUO (AUROCs 0.581–0.695). The predictive features for the pAKICr models and pAKIUO models were distinct, while top features for the pAKIany models were consistently a combination of those for the pAKICr and pAKIUO models.

Conclusion:

Ignoring urine output in the outcome during model training resulted in models that are unlikely to predict AKIUO adequately and may miss a substantial proportion of patients in practice.

Keywords: Acute kidney injury, Clinical decision support system

1. Introduction

Acute kidney injury (AKI) commonly affects critically ill patients and adversely impacts patient outcomes such as mortality, ICU and hospital length of stay, hospital cost, and post-discharge quality of life [1,2]. Based on the Kidney Disease Improving Global Outcomes (KDIGO) guidelines, AKI is classified into three stages based on decreases in urine output (oliguria/anuria) or increases in serum creatinine levels [3]. AKI at higher stages is associated with worse outcomes. Since AKI is often initially asymptomatic, its diagnosis is commonly delayed or missed entirely [4]. Tremendous research efforts focusing on predicting AKI using data from hospital or ICU stay have been implemented to avoid missed opportunities for appropriate preventive care [59]. The majority of previous models do not include urine-output-triggered AKI (AKIUO) as an outcome, focusing on the more-readily available measured creatinine to define AKI. This is a startling omission since AKIUO is more prevalent and still associated with higher mortality, particularly at higher stages [10,11].

In this study, we aimed to evaluate the impact of using either or both criteria of the AKI definition in the development and performance of prediction models [12]. Using multiple machine learning techniques, we built models to predict three groups of AKI patients: 1) pAKIany to predict AKIany (triggered by either creatinine or urine output), 2) pAKICr to predict AKICr (creatinine-triggered AKI), and 3) pAKIUO to predict AKIUO (urine output-triggered AKI). We compared the performance and important features of these models. Additionally, we conducted sensitivity analyses to assess how well the pAKICr models performed in identifying patients whose AKI was detected by a decline in urine output, an increase in serum creatinine, or both.

2. Materials and methods

2.1. Study population

We screened all adult intensive care unit (ICU) admissions to the Mayo Clinic Hospital (Rochester, MN) from January 1, 2005, to December 31, 2017. All adult, non-pregnant subjects who provided research authorization were included. We combined admissions from a single patient <10 h apart into a single encounter. Encounters that 1) received renal replacement therapy before or during ICU stay, 2) had a baseline creatinine >5 mg/dL, 3) had insufficient creatinine measurements or urine chartings (no measurements of creatinine or fewer than two non-zero urine output measurements), or 4) had community-acquired AKI were excluded.

This study was reviewed and approved by the Mayo Clinic institutional review board (IRB# 07–001380) and the Philips Research Internal Committee for Biomedical Experiments. The need for informed consent was waived due to the minimal risk of this retrospective study.

2.2. Data extraction and definitions

Data extracted from the ICU database included both predictor features and outcome variables. Predictor variables consisted of patient demographics and comorbidities, history of nephrotoxic medication exposure within the six months before the index hospital admission, admission and diagnosis information, and baseline creatinine when available (Appendix Table 2). The outcome variables included hourly urine outputs, serum creatinine, and weight measurements over the length of ICU stay. Appendix A provides definitions of various machine learning terms [13].

We standardized continuous features such as age and height using Z-score standardization. Missing values were imputed using mean-imputation, though few values were missing (Appendix Table 3). Nephrotoxic drug history was converted to counts of drugs prescribed within 15 pre-defined categories; drugs not mentioned in the patient history were assumed not to have been prescribed (see Appendix B.2). We considered primary diagnoses (defined by ICD-9/10 codes) when they were prevalent in >0.01% of all patients and were either admission diagnoses or reflected comorbid conditions. Further details are provided in Appendix B.

AKI stages throughout ICU stay were calculated using an existing electronic implementation of the KDIGO criteria [14]. Baseline creatinine was defined as the average of all serum creatinine measurements during the 180 to 7 days before the index hospital admission or estimated using the MDRD equation back-calculation [3,15,16], assuming an eGFR of 60 mL/min per 1.73 m2 [14]. Each encounter was assigned three distinct maximum stages, which were assessed throughout the ICU stay (Fig. 1). There were three distinct labels based on the maximum stages: 1) creatinine-triggered AKI stage > 0 (AKICr), 2) urine-output-triggered AKI stage > 0 (AKIUO), and 3) overall AKI stage by either criterion > 0 (AKIany). See Appendix C for additional details.

Fig. 1.

Fig. 1.

Overview of model training. The original 131,873 ICU admissions are combined into encounters and filtered (see Section Data Extraction and Definitions). Predictive features are extracted for each encounter; then, the AKI staging by urine output and by creatinine is calculated over the entire ICU stay for each encounter. The AKI staging is simplified into three labels based on whether the stage was ever above 0: a urine-output-triggered AKI label (AKIUO), a creatinine-triggered AKI label (AKICr), and an AKIany label (triggered by either urine output or creatinine). Lighter colors indicate stage 0 (No AKI), and darker colors stages greater than 0. These labels are then combined with the predictive features to build three separate models (pAKICr, pAKIUO, and pAKIany) using each of four architectures (neural network, gradient boosting, logistic regression, and random forest).

2.3. Model training and evaluation

The data were split randomly into a training cohort (80% of the samples) and a testing cohort (20%) (non-stratified). The train-test split was constant across all models so that no test data were ever used during model training. All models were fit in Python version 3.7.3. Four architectures were employed: gradient boosting, logistic regression, random forest (all implemented in scikit-learn version 0.20.3 [17]), and a PyTorch neural network (torch version 1.2.0 [18]). Each architecture was used to build three models, one to predict each of the three labels, for a total of twelve models. The full details for fitting these architectures are outlined in Appendix D.

Overall model performance was quantified using the area under the receiver operating characteristic (AUROC) and area under the precision-recall curve (AUPRC) and evaluated for detecting AKIany or detecting three disjoint subsets of AKIany patients (AKIUO only, AKICr only, both AKICr and AKIUO; see Appendix E). To quantify false positive/negative rates, thresholds were also selected for each model (Appendix F).

Feature importance was evaluated using SHAP (SHapley Additive exPlanations) values [19], calculated using the python package shap, version 0.30.0.

3. Results

3.1. Cohort characteristics

Among 131,873 screened ICU admissions, 98,472 encounters met all eligibility criteria for the final analyses (Fig. 1). The incidence of AKI at any time during ICU stay was 40%. Patients with AKI were older and had higher baseline creatinine levels, a higher mortality rate, and longer ICU stays (Table 1). Of patients with AKI, most (61%) had a maximum stage one (Appendix Table 4). As expected, mortality rates (both in-hospital and ICU) and ICU length of stay increased with increasing AKI stage. Hospital mortality was 6.4% in stage 1 and 14.5% in stage 3. Median ICU length of stay was 2.0 days in stage 1 and 4.0 days in stage 3. Patients with AKI stage 3 tended to be slightly younger, were less likely to have baseline creatinine measured, and had lower Charlson Comorbidity Index scores (Appendix Table 4). However, patients with stage 3 had generally more severe illness, evidenced by being more likely to have a previous ICU stay (Appendix Table 4); and more likely to be admitted with kidney dysfunction, sepsis, and respiratory failure (Appendix Table 5).

Table 1.

Baseline characteristics of patients with and without AKI. A summary of selected demographic and outcome variables in encounters with and without AKI (by either criterion) in our cohort.

Feature No AKI Any AKI p-value
ICU stays# 59,165 (60.1%) 39,307 (39.9%)
Age (years)* 61.96 (17.12) 65.43 (16.3) <0.001
Length of ICU stay (hours)+ 25.83 (23.5) 60.55 (85.62) <0.001
Charlson Comorbidity Index* 4.02 (2.43) 4.67 (2.47) <0.001
Readmit# 3686 (6.2%) 3098 (7.9%) <0.001
Gender (female)# 24,735 (41.8%) 16,372 (41.7%) 0.63
Race (black)# 806 (1.4%) 567 (1.4%) 0.292
Baseline creatinine (mg/dL)* 1.04 (0.43) 1.13 (0.52) <0.001
Death in Hospital# 1585 (2.7%) 3588 (9.1%) <0.001
Death in ICU# 583 (1.0%) 1961 (5.0%) <0.001
Baseline creatinine, when available 33,741 (57.0%) 23,717 (60.3%) <0.001
#

indicates features described by “count (%),” and tested using Fisher’s exact test;

+

indicates features described by “median (IQR)” and tested using a Wilcoxon rank-sum test;

*

indicates features described by “mean (SD)” and tested using a two-sample t-test with unequal variances.

P-values test the difference between AKI and non-AKI encounters. Across all metrics except the proportion of black encounters and the proportion of female encounters, AKI patients are significantly different from non-AKI patients.

In comparison with AKICr, AKIUO was substantially more common. Among all AKI patients, 46% never triggered the creatinine criterion, and only 28% had AKI triggered by both AKI criteria (Appendix Table 6). Patients who triggered both criteria tended to have a substantially longer length of stay and higher mortality rate than patients with AKIUO only or AKICr only (Appendix Table 6). The staging by urine output had a substantial effect on mortality, independent of staging by creatinine. Among patients with AKICr stage 3, those with AKIUO stage 0 had a mortality rate of 3%, while those with AKIUO stage 3 had a mortality rate of 28% (p-value <0.001) (Appendix Table 7).

3.2. Model performance

We evaluated the performance of all models on the AKIany label using the samples from the test set. Of all the architectures evaluated, the neural network architecture had the best performance measured by both AUROC and AUPRC (Fig. 2). For each architecture, pAKIany models had the best performance in predicting AKIany both by AUROC and AUPRC. The pAKICr models had slightly lower performance than pAKIany models (a drop in AUC ranging from 0.02–0.03).

Fig. 2.

Fig. 2.

Model performance predicting any AKI. The AUROC (top) and AUPRC (bottom) values were calculated for each of the twelve models when they were used to predict any AKI in the held-out test set.

The pAKICr models had much higher false-positive rates and slightly lower false-negative rates than pAKIUO or pAKIany models (Appendix Fig. 1). Positive and negative predictive values were relatively consistent independent of the AKI label, though both tended to be slightly higher for pAKIany models (Appendix Fig. 1). The proportion of missed AKI patients who only had AKIUO was 63.7%–73.9%, 56.3%–60.5%, and 47.3%–51.8% for the pAKICr, pAKIany, and pAKIUO models, respectively (Appendix Fig. 2).

The performance of the models on subsets of patients grouped by AKI defining criteria also revealed performance differences. In identifying patients with only AKIUO, AUROCs of pAKICr models were 0.571 to 0.6 (Fig. 3). This is while the pAKICr models performed with higher AUROCs in identifying patients with only AKICr (increased AUROCs 0.113–0.148). The pAKIany models AUROCs in detecting patients with only AKIUO was just 0.03–0.05 lower than identifying patients with only AKICr. For patients who reached the AKI definition by both criteria, all models uniformly had the highest AUROCs.

Fig. 3.

Fig. 3.

Model performance predicting disjoint subsets of AKI patients. The AUROC (top) and prevalence-corrected AUPRC (bottom) values were calculated for each of the twelve models when distinguishing each of three cohorts and controls. See Section Model Training and Evaluation and Appendix E for details.

3.3. Feature importance

We evaluated feature importance on the best-performing overall model architecture, the neural network (Fig. 3), though similar results were obtained for the other architectures (Appendix Figs. 35) The most important features were quite different for the pAKICr (Fig. 4A) and pAKIUO (Fig. 4B) models. When predicting AKIUO, demographics such as weight, body mass index (BMI), age, and gender were all highly influential. Higher weights and BMI values, as well as greater age or female gender, were predictive of developing AKIUO during ICU stay. By contrast, for the pAKICr model, comorbidities, including chronic kidney disease (CKD), coagulation disorders, congestive heart failure (CHF), and hypertension, were among the most important features. The presence of these comorbidities, as well as higher values for the Charlson Comorbidity Index, were predictive of developing AKICr during ICU stay.

Fig. 4.

Fig. 4.

Feature importance for the neural network models evaluated by SHAP values. The SHAP values for all encounters in the held-out test set (see Section Model Training and Evaluation) for the top 10 features for the pAKICr model (A), pAKIUO model (B), and pAKIany model (C). The mean absolute SHAP value determines the feature ranking across all encounters for that feature. The x-axis is the SHAP value, measuring the impact of a particular feature on the prediction for a particular encounter: negative values indicate a decreased risk of AKI, while positive values indicate an increased risk. The colors indicate the feature value, shifting from bright blue (the minimum value for a feature) to bright red (the maximum value for a feature). Binary features, which have two values, thus have only bright blue and bright red points in this plot.

The most important features of the pAKIany model were a combination of the critical features for the pAKIUO and pAKICr models (Fig. 4C). The top feature for predicting AKIany was weight, which was also the most predictive feature for AKIUO. Features among the top ten for both AKIUO and AKICr were also quite predictive of AKIany, including the presence of coagulation disorders, cardiac dysrhythmias or congestive heart failure, and not being admitted in the neurosurgery unit.

4. Discussion and conclusions

We found that regardless of model architecture used, including the urine output in the model training resulted in improved performance, particularly in detecting patients with AKIUO only. While pAKIany models did exhibit higher performance, the gains were modest. Further, the negative predictive values of pAKIany were similar regardless of the criteria used for the AKI definition.

While serum creatinine levels and urine output are both indicators of kidney function, they provide slightly different information about kidney physiology. Serum creatinine level is a function of glomerular clearance of creatinine with a supposedly constant rate of production [20]. This is while urine output is a function of glomerular filtration and tubular processing. Therefore, it is not surprising to note differences between factors that can predict an increase in serum creatinine versus a decrease in urine output when machine learning tools identify the variables without supervision. In agreement with previous studies [10], we found that AKIUO was more prevalent than AKICr. AKIUO often existed in the absence of AKICr, and both were associated with high mortality rates. Both AKIUO and AKICr were associated with higher mortality rates and longer ICU lengths of stay (Appendix Table 6). The high prevalence of AKI patients who experience only a decline in urine output raises the possibility that training pAKICr models might lead to missing a substantial portion of AKI patients. Dissecting the AKI patients missed by each model revealed essential differences. The pAKIany models performed considerably more consistently in predicting AKIUO and AKICr. Interestingly, all models typically missed more AKIUO, suggesting prediction and detection of AKIUO is more difficult at ICU admission than AKICr.

We demonstrated that the predictive features of AKICr and AKIUO are different, highlighting the variabilities between these two criteria when identifying patients with AKI. The presence of comorbidities, such as CKD, coagulation disorders, hypertension, and CHF, was important for predicting AKICr. The presence of CKD, in particular, might be important. Generally, CKD patients have a higher availability of measured baseline serum creatinine. Estimating baseline creatinine by back-calculating from the MDRD equation would likely result in lower baseline creatinine among CKD patients. Conversely, demographics such as weight, BMI, and age were the most important for predicting AKIUO. Weight might be important because urine output is scaled by weight when determining the AKI stage for the label. AKI is defined by an increase in serum creatinine level and a weight-adjusted decline in urine output [12]. It appears that the size of organs in different individuals is based on their height. For instance, lung size should be assessed based on the ideal body weight, which is calculated using height [21]. While serum creatinine production in larger individuals is higher, as their kidney sizes are bigger as well, the blood serum creatinine levels among individuals are comparable, yet to evaluate the estimated GFR adjustments for race, age, and gender need to be considered [20]. In the urine output case, larger individuals generate more osmoles each day (average 5–10 mosmol/kg/day with American diet); therefore, given comparable urinary concentrations, they need to generate more urine to excrete all generated osmoles. Thus, the urine output criterion of AKI is adjusted based on weight (i.e., ml/kg/h) [10,2225]. The predictive features of AKIany were a combination of the top features of pAKICr and pAKIUO. Higher weights and BMI values were highly predictive for AKIUO and AKIany, while coagulation disorders and CHF were highly predictive for AKICr and AKIany. Although CKD was a significant predictor of AKICr, it was not as important in predicting AKIany or AKIUO. Interestingly, admission in the neurosurgery unit was associated with a decreased AKI risk regardless of AKI definition, likely reflecting that such patients are mostly younger and healthier, with a consequently lower risk of AKI.

Similar to retrospective studies, our research is not immune to limitations, including limited ability to remove community-acquired AKI patients and the single-center nature of the dataset used to train the models. While we excluded all patients with known community-acquired AKI, we had no access to urine output measurements before ICU admission. Further, measured creatinine levels before ICU admission were only available in a subset of patients. Thus, we could only exclude a subset of patients who had community-acquired AKICr. This may result in selection bias in our findings. Having mentioned this, a large proportion (90%) of AKIUO patients had normal urine output and creatinine measurements before the development of AKI. Obviously, there may be some differences among those with previously measured serum creatinine with those who never had their serum creatinine assessed. The differences could be in their comorbidity profile (e.g., individuals with other comorbidities with potentially lower kidney function reserve are more likely to have measured serum creatinine) or healthcare access. While assessing these differences is not in the scope of this manuscript, it should be considered for prospective studies. Although our models were trained using a large data set over multiple years, the data did come from a single-center, which may impact the generalizability of our results. As the dataset used for this analysis is one of the few datasets with hourly urine output information, the validation of our results in other datasets was not feasible. By contrast, the strengths of our study include a large dataset of ICU patients, availability of urine output data, using several architectures to develop higher-performing models and internal validation of our models.

In conclusion, we demonstrated that models trained to predict ICU-acquired AKICr at ICU admission are ill-suited to predict AKIUO. This work highlights the importance of accounting for AKIUO during model training.

Supplementary Material

Supplementary appendix

Sources of funding

This project was supported in part by the National Institute of Allergy and Infectious Diseases, United States; National Institutes of Health, United States, under Award Number K23AI143882 (PI; EFB)

Footnotes

Declaration of Competing Interest

Emma Schwager, Stephanie Lanius, Erina Ghosh, and Larry Eshelman are employees of Philips Research North America. All other authors have disclosed that they do not have any conflicts of interest.

Appendix A. Supplementary data

Supplementary data to this article can be found online at https://doi.org/10.1016/j.jcrc.2021.01.003.

References

  • [1].Chertow GM, et al. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J Am Soc Nephrol 2005;16(11):3365–70. [DOI] [PubMed] [Google Scholar]
  • [2].Silver SA, et al. Cost of acute kidney injury in hospitalized patients. J Hosp Med 2017; 12(2):70–6. [DOI] [PubMed] [Google Scholar]
  • [3].Kidney Disease Improving Global Outcomes (KDIGO) Clinical Practice Guideline for Acute Kidney Injury. Kidney Int Sppl 2012;2(1):1–138. [Google Scholar]
  • [4].Siew ED, Davenport A. The growth of acute kidney injury: a rising tide or just closer attention to detail? Kidney Int 2015;87(1):46–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Flechet M, et al. AKIpredictor, an online prognostic calculator for acute kidney injury in adult critically ill patients: development, validation and comparison to serum neutrophil gelatinase-associated lipocalin. Intensive Care Med 2017;43(6):764–73. [DOI] [PubMed] [Google Scholar]
  • [6].Malhotra R, et al. A risk prediction score for acute kidney injury in the intensive care unit. Nephrol Dial Transplant 2017;32(5):814–22. [DOI] [PubMed] [Google Scholar]
  • [7].Mohamadlou H, et al. Prediction of acute kidney injury with a machine learning algorithm using electronic health record data. Can J Kidney Health Dis 2018;5 2054358118776326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Koyner JL, et al. The development of a machine learning inpatient acute kidney injury prediction model. Crit Care Med 2018;46(7):1070–7. [DOI] [PubMed] [Google Scholar]
  • [9].Chiofolo C, et al. Automated continuous acute kidney injury prediction and surveillance: a random forest model. Mayo Clin Proc 2019;94(5):783–92. [DOI] [PubMed] [Google Scholar]
  • [10].Kellum JA, et al. Classifying AKI by urine output versus serum creatinine level. J Am Soc Nephrol 2015;26(9):2231–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Vincent J-L, et al. The clinical relevance of oliguria in the critically ill patient: analysis of a large observational database. Crit Care 2020;24(1):171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Section 2: AKI Definition, Kidney Int Suppl 2012;2(1):19–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Liu Y, et al. How to read articles that use machine learning: users’ guides to the medical literature. Jama 2019;322(18):1806–16. [DOI] [PubMed] [Google Scholar]
  • [14].Ahmed A, et al. Development and validation of electronic surveillance tool for acute kidney injury: a retrospective analysis. J Crit Care 2015;30(5):988–93. [DOI] [PubMed] [Google Scholar]
  • [15].Hoste EA, et al. RIFLE criteria for acute kidney injury are associated with hospital mortality in critically ill patients: a cohort analysis. Crit Care 2006;10(3):R73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Levey AS, et al. Using standardized serum creatinine values in the modification of diet in renal disease study equation for estimating glomerular filtration rate. Ann Intern Med 2006;145(4):247–54. [DOI] [PubMed] [Google Scholar]
  • [17].Pedregosa F, et al. Scikit-learn: machine learning in Python. J Mach Learn Res 2011; 12:2825–30. [Google Scholar]
  • [18].Paszke A, et al. Automatic differentiation in PyTorch, in NIPS 2017 Workshop Autodiff Decision Program 2017. 31st Conference on Neural Information Processing Systems (NIPS 2017). USA: Long Beach, CA; 2017. [Google Scholar]
  • [19].Lundberg S, Erion G, Lee S. Consistent Individualized Feature Attribution for Tree Ensembles. Cornell University; 2019. [Google Scholar]
  • [20].Kashani K, Rosner MH, Ostermann M. Creatinine: from physiology to clinical application. Eur J Intern Med 2020;72:9–14. [DOI] [PubMed] [Google Scholar]
  • [21].Mattingley JS, et al. Sizing the lung of mechanically ventilated patients. Crit Care 2011;15(1):R60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Thongprayoon C, et al. Actual versus ideal body weight for acute kidney injury diagnosis and classification in critically ill patients. BMC Nephrol 2014;15(1):176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Macedo E, et al. Defining urine output criterion for acute kidney injury in critically ill patients. Nephrol Dial Transplant 2011;26(2):509–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Solomon AW, et al. Urine output on an intensive care unit: case-control study. Bmj 2010;341:c6761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Leedahl DD, et al. Derivation of urine output thresholds that identify a very high risk of AKI in patients with septic shock. Clin J Am Soc Nephrol 2014;9(7):1168–74. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary appendix

RESOURCES