Abstract
Accurate predictions of COVID-19 epidemic dynamics may enable timely organizational interventions in high-risk regions. We exploited the interconnection of the Fresenius Medical Care (FMC) European dialysis clinic network to develop a sentinel surveillance system for outbreak prediction. We developed an artificial intelligence-based model considering the information related to all clinics belonging to the European Nephrocare Network. The prediction tool provides risk scores of the occurrence of a COVID-19 outbreak in each dialysis center within a 2-week forecasting horizon. The model input variables include information related to the epidemic status and trends in clinical practice patterns of the target clinic, regional epidemic metrics, and the distance-weighted risk estimates of adjacent dialysis units. On the validation dates, there were 30 (5.09%), 39 (6.52%), and 218 (36.03%) clinics with two or more patients with COVID-19 infection during the 2-week prediction window. The performance of the model was suitable in all testing windows: AUC = 0.77, 0.80, and 0.81, respectively. The occurrence of new cases in a clinic propagates distance-weighted risk estimates to proximal dialysis units. Our machine learning sentinel surveillance system may allow for a prompt risk assessment and timely response to COVID-19 surges throughout networked European clinics.
Keywords: SARS-CoV-2, COVID-19, sentinel surveillance system, outbreak prediction, machine learning, artificial intelligence
1. Introduction
Due to its unique characteristics, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) pandemic has posed unprecedented challenges to clinics providing life-saving services to patients suffering from chronic illnesses, including chronic kidney disease (CKD). In fact, non-specific clinical manifestations of Coronavirus disease 2019 (COVID-19) [1] as well as the viral transmission from asymptomatic or pre-symptomatic individuals [2,3,4] make the early recognition of newly infected cases extremely difficult. Moreover, the occurrence of superspreading events (SSEV), during which few individuals are able to infect many people [5], hampers infection control measures [6,7].
Social distancing, preventive quarantine, and the isolation of infected subjects still represents the most effective means to reduce the risk of SARS-CoV-2 human-to-human transmission [8,9]. However, patients with end-stage kidney disease (ESKD) need to undergo in-center dialysis three times per week for 4 h per session, which makes physical distancing more difficult to achieve due to repeated, prolonged interactions with other patients and healthcare staff [10,11,12,13]. Unfortunately, ESKD individuals also show a higher risk of complications following SARS-CoV-2 infection due to weakened immune response [14,15,16,17] and to the occurrence of many of the risk factors commonly associated with development of severe COVID-19 [18,19], including older age and comorbidities [20,21]. Moreover, because of compromised host immunity, a vaccine may not exhibit the same efficacy on hemodialysis patients as it does in immunocompetent individuals [13].
Therefore, the reduction of the contagion risk within dialysis clinics while preserving clinical operations is a key challenge for healthcare systems during this pandemic. To help anticipate local epidemic dynamics and adjust non-pharmacological interventions to the changing background of infection risk, we sought to develop an advanced sentinel surveillance system supported by a machine learning (ML) prediction model, where the occurrence of COVID-19 cases in a clinic propagates distance-weighted risk estimates to adjacent dialysis units. The present study describes the derivation and validation of the prediction model, as well as the strategies adopted to monitor its performance throughout the pandemic period.
2. Materials and Methods
2.1. Design and Setting
All dialysis clinics belonging to the Fresenius Medical Care (FMC) European Nephrocare Network confer clinical data to a centralized data-repository, namely the European Clinical Database (EuCliD®, Fresenius Medical Care, Deutschland GmbH, Vaiano Cremasco, Italy) [22,23]. Since April 2020, all SARS-CoV-2 infections (suspected and confirmed cases as well as initial symptoms), diagnostic procedures, and clinical endpoints are reported in the treatment incident report (TIR) module in EuCLiD®. We used aggregated data abstracted from the TIR, open source data describing epidemic dynamics in European countries, as well as aggregated data on biochemical assays prescriptions and results to estimate outbreak risk in dialysis clinics belonging to the FMC European Nephrocare Network.
2.2. Outcome Variable
The model forecasts the risk of a COVID-19 outbreak in each dialysis clinic in a 2-week horizon. Clinic outbreak is defined as the occurrence of two or more COVID-19-confirmed cases in a given clinic. Therefore, for each clinic registered in the Nephrocare network, the model estimates the probability of COVID-19 outbreak (2 or more PCR confirmed cases within a 2-week horizon) as a function of a vector of input variables. Study design is represented in Figure 1.
For illustrative purposes, we established 3 risk categories: (1) low (L), when outbreak risk is less than or equal to 1.5%; (2) medium (M), risk greater than 1.5% and less than or equal to 12.5%; (3) high (H), if risk is greater than 12.5%. For this purpose, the action threshold defining the low risk class has been chosen to select a subpopulation of clinics where the risk of outbreak is very small so that non-pharmacological interventions to prevent the spread of COVID-19 can be temporarily and partially mitigated. In this context, a costly error would be to assign to the Low Risk class a clinic which will experience an outbreak in the following two weeks. Such threshold would be useful when a sufficiently large share of clinics (i.e., P(Class = L)) could be found, so that P(Class = L|Outbreak = No) is high and P(Outbreak = Yes|Class = L) is, conversely, very small. On the other hand of the spectrum, we selected a more specific action threshold, which defines a High-Risk Class of clinics. In this risk group, additional non-pharmacological intervention should be initiated including, for example, the formal testing of temperature and thorough physical examination administered to each patient before entering the clinic or even periodical screening test (i.e., once-weekly). Since the intervention would require intensive resources, may be constraint by procurement difficulties, and would unduly overburden patients with unnecessary testing, the High Risk threshold should ideally define a group where P(Outbreak = Yes|Class = H) is high and both P(Class = H|Outbreak = No) and P(Outbreak = Yes|Class ≠ H) are low. It is important to remark that the choice and number of the action thresholds depends on the intended use of the risk score, the set of interventions available to the organization, the price cost of each intervention, and ultimately by the value function ranking the desirability/undesirability of different health outcomes. Therefore, the thresholds presented in this paper should not be considered generalizable per se: different institutions may choose different thresholds (or no thresholds at all) depending on the availability, cost, and expected outcomes of COVID-19-related interventions (i.e., email alerts to medical directors, shipments of medical equipment such as face masks or diagnostics kits, delivery of health education modules, PCR screening, etc.,). Therefore, the problem is not diagnostic in nature, yet reduces to optimal ranking (and longitudinal stability of such ranking of risk) in order to efficiently allocate limited resources and minimize risk for the patients throughout a continuously changing epidemic landscape.
2.3. Input Variables
The model is computed using aggregated data provided by all the dialysis centers (min: 545; max: 611) located in one of the 23 countries of the FMC European Nephrocare Network. The final model incorporates 74 variables belonging to one of the following categories (Appendix A):
Open Source Data [24];
Epidemic status in the clinical country/region (prefix: RG): 15 parameters;
- Aggregated Data abstracted from EuCLiD®:
- Epidemic status in the target clinic (prefix: CL): 5 variables;
- Distance-weighted information of the adjacent clinics (prefix: CLS); 5 variables. Adjacent clinics were defined as the 3 centers with shorter distance in terms of both latitude and longitude to the target clinic. Measures of the adjacent clinics, including cases and trends, were computed as the average value weighted for the inverse of the distance to the target clinic;
- Other parameters related to the target clinic (prefix: CL): 49 parameters.
As detailed in Appendix A, each variable can be calculated/collected over different timeframes of the ascertainment period, i.e., the last 7 days (d), previous 7 d, last 14 d, previous 14 d, and previous 28 d.
2.4. Statistical Analysis
2.4.1. Model Derivation
We used XG Boost, a scalable ML system for tree boosting [25]. We used the available open source package [26] for Python, Version 3.7.4 (Python Software Foundation, Delaware, DE, United States) [27].
The first release of the model was trained using data related to 1st April 2020 (training dataset index date), while the second and the third versions were derived using data related to 15th July 2020 and 1st November 2020, respectively. We considered all the clinics delivering services to at least one patient on the index date as well as over the week before index date.
2.4.2. Model Accuracy and Feature Importance
Prediction accuracy of each release was tested every first and fifteenth day (validation dataset index dates). Therefore, development and validation datasets can include the same set of clinics/patients every two weeks.
To evaluate model performance, we measured the area under the curve (AUC) of the receiver operating characteristics (ROC) curve in the testing datasets [28] using Python, Version 3.7.4 (Python Software Foundation, Delaware, DE, United States) [27]. The AUC provides an aggregate measure of performance as the ROC curve plots the true positive rate (TPR) against the false positive rate (FPR) at all classification thresholds. Model discrimination ability over time was monitored by visual inspection of AUC trends. For illustrative purposes, we also reported the classification performance in terms of P(Outbreak|Class) (i.e., probability of outbreak (Yes/No) given the assigned risk class (L/M/H)) and P(Class|Outbreak) (i.e., probability of the assigned risk class given the outbreak) for the two action-thresholds chosen (0.015 and 0.125). In order to calculate P(Outbreak|Class) and P(Class|Outbreak) we artificially treated our problem as a binary decision for each threshold. We computed average probability values across the whole study period.
Feature importance was computed using the SHapley Additive exPlanations (SHAP) method [29]. This analysis enables intuitive model explainability via an accurate and efficient estimation of the contribution to risk of each input variable.
2.4.3. Descriptive Statistics
For both the training and validation datasets, we analyzed the number of active clinics, frequency and incidence of a COVID-19 outbreak, the distribution of clinics in each prediction level of risk (low, medium, high), as well as the relative risk compared to clinics in low-risk groups with Python, Version 3.7.4.
3. Results
3.1. Dialysis Clinic Characteristics
Model version 1, 2, and 3 were trained using a dataset related to 1st April 2020, 15th July 2020, and 1st November 2020, respectively. On these dates, active clinics were 589, 597, 603, while 34 (5.77%), 44 (7.37%), and 233 (38.64%) clinics had two or more patients with COVID-19 infection in the fortnight after the index date.
The surveillance system stratifies clinics by their risk of new local outbreak within two weeks. To facilitate the interpretation of the results, we established three risk categories: (1) Low, when outbreak risk is less than or equal to 1.5%; (2) Medium, risk greater than 1.5% and less than or equal to 12.5%; (3) High, if risk is greater than 12.5%. Risk thresholds depend both on the incidence of pandemic and on the ability of any given clinic to implement containment measures. Figure 2 reports the share of active dialysis clinics in different risk classes at each testing date.
The actual outbreak incidence in the dialysis clinics during the validation period is reported in Figure 3.
3.2. Model Performance
All versions of the model showed a good performance over the validation period. Figure 3 shows trends in AUC values of the three model versions over a 1-year observation period. Variability in prediction accuracy decreased as retraining was applied: version 1’s average AUC was 0.73 (95% CI 0.55–0.91), AUC of version 2 was 0.75 (95% CI 0.65–0.86), while version 3 had a more stable performance with an average AUC of 0.79 (0.74–0.85). The ROC-AUC diagram for the three model versions have been reported in Figure 4.
In order to demonstrate the potential use of the model, we geographically mapped the risk on a few exemplary dates, i.e., the 2 August 2020, 4 October 2020, 1 November 2020, and 3 January 2020 (Figure 5). The graphical representation visually highlights clinic clusters according to the risk of a COVID-19 outbreak occurrence within 2 weeks (Figure 5, left panels, colored circles denote the low, medium, and high-risk categories). There was substantial correlation between the predicted risk (Figure 5, left panels) and the actual outcome (Figure 5, right panels) on all of the validation dates.
Table 1 and Table 2 report the classification performance in terms of P(Outbreak|Class) (i.e., probability of outbreak (Yes/No) given the assigned risk class (L/M/H)) and P(Class|Outbreak) (i.e., probability of the assigned risk class given the outbreak) for the two action-thresholds chosen (0.015 and 0.125). In order to calculate P(Outbreak|Class) and P(Class|Outbreak), we artificially treated our problem as a binary decision for each threshold. We computed average probability values across the whole study period.
Table 1.
Low Risk Group. P(Class = L) = 0.648 | |||
---|---|---|---|
P(Class = L|Outbreak = Yes) | P(Class ≠ L|Outbreak = Yes) | P(Class = L|Outbreak = No) | P(Class ≠ L|Outbreak = No) |
0.23 | 0.77 | 0.73 | 0.27 |
P(Outbreak = Yes|Class = L) | P(Outbreak = No|Class = L) | P(Outbreak = Yes|Class ≠ L) | P(Outbreak = No|Class ≠ L) |
0.06 | 0.94 | 0.37 | 0.63 |
Table 2.
High Risk Group P(Class = H) = 0.197 | |||
---|---|---|---|
P(Class = H|Outbreak = Yes) | P(Class ≠ H|Outbreak = Yes) | P(Class = H|Outbreak = No) | P(Class ≠ H|Outbreak = No) |
0.51 | 0.49 | 0.14 | 0.86 |
P(Outbreak = Yes|Class = H) | P(Outbreak = No|Class = H) | P(Outbreak = Yes|Class ≠ H) | P(Outbreak = No|Class ≠ H) |
0.40 | 0.60 | 0.09 | 0.91 |
Overall, the risk score was strongly associated with the likelihood of COVID-19 outbreak, as demonstrated by the relative risk of outcome occurrence in the three risk classes over the study period (Table 3).
Table 3.
Risk Class | RR |
---|---|
L | −ref |
M | 3.45 |
H | 5.95 |
3.3. Model Feature Importance
Feature analysis investigated the impact of each variable on model output (Figure 6). Although there are some differences among the model versions, overall, the most important variables are related to the epidemic dynamics in the clinic in the period immediately preceding the index date for risk evaluation. Regional data on the number of COVID-19 cases and deaths were likewise ranked high. The number of COVID-19 cases in adjacent clinics resulted in the top predictor list of all three model versions. Of note, variables routinely measured in clinical practice, including changes in CRP and blood white cell count over the observation period, were also strongly associated with outbreak risk.
4. Discussion
The present study describes the development and validation of a novel sentinel surveillance system allowing for the prompt risk assessment of a COVID-19 outbreak in a large European network of dialysis clinics over a 2-week forecasting horizon. The model had a stable accuracy over time and was able to consistently discriminate outbreak risk in dialysis units across all European countries at every stage of the current pandemic, i.e., during epidemic growth and decay phases. The design of our ML prediction model enables administrators and developers to quickly retrain this tool in case the visual inspection of AUC values over time suggests a trend toward a decrease in its discrimination ability.
Nosocomial transmission has greatly contributed to an increase in the global burden of COVID-19 pandemic by extremely affecting the capacity of the health system, not only to provide medical support to patients, but also to protect healthcare professionals [30,31]. Dialysis centers are particularly vulnerable to outbreak development [11,12,32] in that mitigation strategies are not entirely feasible due to the necessity of in-person encounters to provide a life-saving treatment such as hemodialysis [11]. Considering the peculiar frailty of ESKD patients, all scientific nephrology societies have provided guidance on COVID-19 transmission prevention in dialysis facilities [33,34,35]. In this regard, surveillance and early contagion detection are essential to reduce the risk of local outbreaks developing into epidemics.
Clinics of the FMC European Nephrocare Network have implemented multiple non-pharmacological interventions to limit viral spreading among the CKD community, including stringent hygiene procedures, social distancing, and the identification and isolation of suspected cases. In addition, dialysis facilities have established recording pathways to report any infection event in the EuCliD® TIR System. Such data are used to monitor the effectiveness of non-pharmacological intervention and to detect high-risk patients needing special attention [36,37,38].
One important feature of our modeling strategy entailed the combined use of open source and clinical data collected in standard clinical practice. In fact, we exploited the interconnection of the European Nephrocare clinics to augment background epidemic data with a surveillance system based on incident reports and practice pattern variation at each dialysis unit. Information about local epidemic status in a given clinic was then propagated through distance-weighting metrics to the surrounding facilities. An ML method was used to integrate all information into a summary score metric. Remarkably, variables related to the epidemic dynamics in the clinic and to the regional epidemic status, as well as to the risk proxies propagated from adjacent clinics, were all important predictors of outbreak occurrence. Such an approach is particularly relevant because it enabled us to capture local disease spread beyond the registry data compiled for the general population, which does not capture the heterogeneity of viral transmission in a setting where frequent and multiple human interactions necessarily occur. Indeed, as the basic reproduction index (R0) is a function of both the transmissibility of a disease and the contact patterns that underlie transmission [39], the regional/provincial R0 cannot be translated in dialysis facilities in that ESKD patients’ biological and socio-behavioral factors significantly differ from those of the general population [40]. The occurrence of SSEVs further complicates the picture, making generalizations of regional epidemic trends that are not entirely appropriate for the reliable prediction of viral spreading in healthcare settings [41,42].
The interconnection of the FMC network allows for the collection and subsequent central integration of a bulk of information provided by facilities distributed throughout European countries. This particular setting offers the advantage to perform the real-time monitoring of sentinel sensors that are likely to provide timely and accurate indications of epidemic activity [43], while considering the heterogeneity underlying transmission dynamics. Sentinel surveillance in outpatient settings was previously shown to provide a robust approach to oversee SARS-CoV-2 spreading [44]. In general, the monitoring of community transmission in nodes distributed across different regions was reported to ensure efficient disease detection in networked populations [45]. It is important to highlight that the analytic strategy adopted in this study is general and can be applied to any epidemic communicable disease, as all naturally occurring, clustering units where social promiscuity, density, and duration of interactions are substantially different compared to the general population. Henceforth, this method may be applied to social contexts with a high risk of outbreak generation, including schools, hospitals, and workplaces from which the provided infection data are promptly captured and conferred to a central database, even in aggregated form. Monitoring of the pandemic situation within the network allows for the timely implementation of infection control procedures in the adjacent networked unit and efficiently anticipates resource needs.
Finally, variable importance analysis has indicated that trends in clinical practice patterns are among the top predictors. This observation indicates that the tracking of physicians’ prescription behavior can provide valuable information to assess epidemic dynamics also during explosive growth, when surveillance and laboratory resources are limited and COVID-19 cases may be recorded with some delay due to the emergency situation [46].
5. Conclusions
Our sentinel surveillance system allows for a prompt risk assessment and timely response to the challenges posed by the COVID-19 epidemic throughout FMC European dialysis clinics. This tool can have significant implications for public health practice in that it represents a robust strategy to assess the level of community transmission of COVID-19 and to guide the selection and implementation of mitigation measures. The same framework can be applied in other networked settings, such as healthcare facilities or schools to improve early detection and forecasting of SARS-CoV-2 transmission. Finally, the implementation of our surveillance system can guide preparedness efforts for future pandemics.
Appendix A
Table A1.
Category | Variable | Reference Time |
---|---|---|
Epidemic Status in the Country/Region (prefix: RG) | ||
cumulative cases | previous 7 days and last 7 days | |
number of hospitalized | previous 7 days and last 7 days | |
number of ICU patients | previous 7 days and last 7 days | |
cumulative fatalities | previous 7 days and last 7 days | |
cumulative recovered | previous 7 days and last 7 days | |
trend of cumulative cases | last 7 days/previous 7 days | |
trend of hospitalized patients | last 7 days/previous 7 days | |
trend of ICU patients | last 7 days/previous 7 days | |
trend of cumulative recovered in the last week | last 7 days/previous 7 days | |
trend of cumulative fatalities | last 7 days/previous 7 days | |
epidemic status in the clinic (prefix: CL) | ||
number of suspected COVID-19 cases | previous 14 days, previous 7 days, and last 7 days | |
change in suspected cases | last 7 days–previous 7 days | |
change in suspected cases | last 14 days–previous 14 days | |
distance-weighted information of the adjacent clinics (prefix: CL) | ||
number of COVID-19 suspected cases in the closest clinics | previous 14d, previous 7 days, and last 7 days | |
change in COVID-19 suspected cases in the closest clinics | last 7 days–previous 7 days | |
change in COVID-19 suspected cases in the closest clinics | last 14 days–previous 14 days | |
other parameters related to the clinic (prefix: CL) | ||
change in the number of treated patients | last 28 days–last 14 days | |
change in the number of treatments | last 28 days–last 14 days | |
change in the weekly dialysis frequency per clinic | last 28 days–last 14 days | |
change in the weekly dialysis frequency per patient | last 28 days–last 14 days | |
change in the number of treatments with pre/post-BT | last 28 days–last 14 days | |
change in the number of treatments with pre/post-BT > 37 °C | last 28 days–last 14 days | |
change in the percentage of treatments with pre/post-BT > 37 °C | last 28 days–last 14 days | |
change in the mean value of pre/post-dialysis BT | last 28 days–last 14 days | |
change in the number of treatments with pre-dyalisis diastolic BP | last 28 days–last 14 days | |
change in the mean value of pre-dialysis diastolic BP | last 28 days–last 14 days | |
change in the number of treatments with dialysis time | last 28 days–last 14 days | |
change in the mean value of dialysis time | last 28 days–last 14 days | |
change in the number of treatments with IDWG | last 28 days–last 14 days | |
change in the mean value of IDWG | last 28 days–last 14 days | |
change in the number of treatments with O2 sat | last 28 days–last 14 days | |
change in the mean value of O2 sat | last 28 days–last 14 days | |
change in the number of patients with lab tests | last 28 days–last 14 days | |
change in the number of lab tests | last 28 days–last 14 days | |
change in the number of lab tests with Albumin | last 28 days–last 14 days | |
change in the mean value of Albumin | last 28 days–last 14 days | |
change in the number of lab tests with lymphocytes | last 28 days–last 14 days | |
change in the mean value of lymphocytes | last 28 days–last 14 days | |
change in the number of lab tests with monocytes | last 28 days–last 14 days | |
change in the mean value of monocytes | last 28 days–last 14 days | |
change in the number of lab tests with neutrophils | last 28 days–last 14 days | |
change in the mean value of neutrophils | last 28 days–last 14 days | |
change in the number of lab tests with platelets | last 28 days–last 14 days | |
change in the mean value of platelets | last 28 days–last 14 days | |
change in the number of lab tests with PDW | last 28 days–last 14 days | |
change in the mean value of PDW | last 28 days–last 14 days | |
change in the number of lab tests with leukocytes | last 28 days–last 14 days | |
change in the mean value of leukocytes | last 28 days–last 14 days | |
change in the number of lab tests with D-dimer | last 28 days–last 14 days | |
change in the mean value of D-dimer | last 28 days–last 14 days | |
change in the number of lab tests with CRP | last 28 days–last 14 days | |
change in the mean value of CRP | last 28 days–last 14 days | |
change in the number of lab tests with IL-6 | last 28 days–last 14 days | |
change in the mean value of IL-6 | last 28 days–last 14 days | |
change in the number of lab tests with ANP | last 28 days–last 14 days | |
change in the mean value of ANP | last 28 days–last 14 days | |
change in the number of lab tests with BNP | last 28 days–last 14 days | |
change in the mean value of BNP | last 28 days–last 14 days | |
change in the number of lab tests with Ferritin | last 28 days–last 14 days | |
change in the mean value of Ferritin | last 28 days–last 14 days | |
Number of patients with at least one hospitalization | last 14 days | |
Number of hospitalizations | last 14 days |
Author Contributions
Conceptualization: F.B. and L.N.; data curation: M.G. and F.M.-M.; formal analysis: P.C., M.G. and F.M.-M.; supervision: S.S.; validation: F.B., L.N. and S.S.; writing—original draft: P.C. and C.L.; writing—review and editing: F.B., P.C., C.L. and L.N. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Fresenius Medical Care Deutschland GmbH.
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki. IRB approval was not necessary because all input data for modeling were aggregated statistics concerning COVID-19 infection distribution and practice patterns across dialysis centers (i.e., cumulative number of infections in countries, number of prescribed laboratory tests in dialysis centers, number of new cases in dialysis centers). No patients’ personal information has been used for the present study.
Informed Consent Statement
No patients’ personal information has been used for the present study since all input data for modeling were aggregated statistics concerning COVID-19 infection distribution and practice patterns across dialysis centers (i.e., cumulative number of infections in countries, number of prescribed laboratory tests in dialysis centers, number of new cases in dialysis centers).
Data Availability Statement
Open source datasets adopted for the study have been referenced throughout the manuscript. Restrictions apply to the availability of these data. Data was obtained from Fresenius Medical Care and may be available for specific, well-motivated requests, from the corresponding author with the permission of Fresenius Medical Care.
Conflicts of Interest
C.L. received consultancy fees from Fresenius Medical Care Deutschland GmbH. All remaining authors are full time employees of Fresenius Medical Care Deutschland GmbH.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Li J., Huang D.Q., Zou B., Yang H., Hui W.Z., Rui F., Yee N.T.S., Liu C., Nerurkar S.N., Kai J.C.Y., et al. Epidemiology of COVID-19: A systematic review and meta-analysis of clinical characteristics, risk factors, and outcomes. J. Med. Virol. 2021;93:1449–1458. doi: 10.1002/jmv.26424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Muller C.P. Do asymptomatic carriers of SARS-COV-2 transmit the virus? Lancet Reg. Health. 2021;4:100082. doi: 10.1016/j.lanepe.2021.100082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.He X., Lau E.H.Y., Wu P., Deng X., Wang J., Hao X., Lau Y.C., Wong J.Y., Guan Y., Tan X., et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med. 2020;26:672–675. doi: 10.1038/s41591-020-0869-5. [DOI] [PubMed] [Google Scholar]
- 4.Byambasuren O., Cardona M., Bell K., Clark J., McLaws M.-L., Glasziou P. Estimating the extent of asymptomatic COVID-19 and its potential for community transmission: Systematic review and meta-analysis. JAMMI Off. J. Assoc. Med. Microbiol. Infect. Dis. Can. 2020;5:223–234. doi: 10.3138/jammi-2020-0030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Galvani A.P., May R.M. Epidemiology: Dimensions of superspreading. Nature. 2005;438:293–295. doi: 10.1038/438293a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lemieux J., Siddle K.J., Shaw B.M., Loreth C., Schaffner S., Gladden-Young A., Adams G., Fink T., Tomkins-Tinch C.H., Krasilnikova L.A., et al. Phylogenetic analysis of SARS-CoV-2 in the Boston area highlights the role of recurrent importation and superspreading events. medRxiv. 2020:PMC7457619. doi: 10.1101/2020.08.23.20178236. [DOI] [Google Scholar]
- 7.Lewis D. Superspreading drives the COVID pandemic-and could help to tame it. Nature. 2021;590:544–546. doi: 10.1038/d41586-021-00460-x. [DOI] [PubMed] [Google Scholar]
- 8.Chu D.K., Akl E.A., Duda S., Solo K., Yaacoub S., Schünemann H.J., El-harakeh A., Bognanni A., Lotfi T., Loeb M., et al. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: A systematic review and meta-analysis. Lancet. 2020;395:1973–1987. doi: 10.1016/S0140-6736(20)31142-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Regmi K., Lwin C.M. Impact of non-pharmaceutical interventions for reducing transmission of COVID-19: A systematic review and meta-analysis protocol. BMJ Open. 2020;10:e041383. doi: 10.1136/bmjopen-2020-041383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Apata I.W., Cobb J., Navarrete J., Burkart J., Plantinga L., Lea J.P. COVID-19 infection control measures and outcomes in urban dialysis centers in predominantly African American communities. BMC Nephrol. 2021;22:81. doi: 10.1186/s12882-021-02281-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Weiner D.E., Watnick S.G. Hemodialysis and COVID-19: An Achilles’ Heel in the Pandemic Health Care Response in the United States. Kidney Med. 2020;2:227–230. doi: 10.1016/j.xkme.2020.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Meijers B., Messa P., Ronco C. Safeguarding the Maintenance Hemodialysis Patient Population during the Coronavirus Disease 19 Pandemic. Blood Purif. 2020;49:259–264. doi: 10.1159/000507537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Li S.Y., Tang Y.S., Chan Y.J., Tarng D.C. Impact of the COVID-19 pandemic on the management of patients with end-stage renal disease. J. Chin. Med. Assoc. 2020;83:628–633. doi: 10.1097/JCMA.0000000000000356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lamarche C., Iliuta I.A., Kitzler T. Infectious Disease Risk in Dialysis Patients: A Transdisciplinary Approach. Can. J. Kidney Health Dis. 2019;6:2054358119839080. doi: 10.1177/2054358119839080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Clarke C.L., Prendecki M., Dhutia A., Gan J., Edwards C., Prout V., Lightstone L., Parker E., Marchesin F., Griffith M., et al. Longevity of SARS-CoV-2 immune responses in hemodialysis patients and protection against reinfection. Kidney Int. 2021;99:1470–1477. doi: 10.1016/j.kint.2021.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lisowska K.A., Pindel M., Pietruczuk K., Kuźmiuk-Glembin I., Storoniak H., Dębska-Ślizień A., Witkowski J.M. The influence of a single hemodialysis procedure on human T lymphocytes. Sci. Rep. 2019;9:5041. doi: 10.1038/s41598-019-41619-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Betjes M.G.H. Immune cell dysfunction and inflammation in end-stage renal disease. Nat. Rev. Nephrol. 2013;9:255–265. doi: 10.1038/nrneph.2013.44. [DOI] [PubMed] [Google Scholar]
- 18.Williamson E.J., Walker A.J., Bhaskaran K., Bacon S., Bates C., Morton C.E., Curtis H.J., Mehrkar A., Evans D., Inglesby P., et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. 2020;584:430–436. doi: 10.1038/s41586-020-2521-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Thakur B., Dubey P., Benitez J., Torres J.P., Reddy S., Shokar N., Aung K., Mukherjee D., Dwivedi A.K. A systematic review and meta-analysis of geographic differences in comorbidities and associated severity and mortality among individuals with COVID-19. Sci. Rep. 2021;11:8562. doi: 10.1038/s41598-021-88130-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wu Z., McGoogan J.M. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA. 2020;323:1239–1242. doi: 10.1001/jama.2020.2648. [DOI] [PubMed] [Google Scholar]
- 21.Jager K.J., Kramer A., Chesnaye N.C., Couchoud C., Sánchez-Álvarez J.E., Garneata L., Collart F., Hemmelder M.H., Ambühl P., Kerschbaum J., et al. Results from the ERA-EDTA Registry indicate a high mortality due to COVID-19 in dialysis patients and kidney transplant recipients across Europe. Kidney Int. 2020;98:1540–1548. doi: 10.1016/j.kint.2020.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Steil H., Amato C., Carioni C., Kirchgessner J., Marcelli D., Mitteregger A., Moscardo V., Orlandini G., Gatti E. EuCliD®—A Medical Registry. Methods Inf. Med. 2004;43:83–88. [PubMed] [Google Scholar]
- 23.Merello Godino J.I., Rentero R., Orlandini G., Marcelli D., Ronco C. Results from EuCliD® (European Clinical Dialysis Database): Impact of shifting treatment modality. Int. J. Artif. Organs. 2002;25:1049–1060. doi: 10.1177/039139880202501103. [DOI] [PubMed] [Google Scholar]
- 24.European Commission Joint Research Centre—ISPRA-Space, Security and Migration Directorate (JRC) [(accessed on 12 July 2020)]. Available online: https://covid-statistics.jrc.ec.europa.eu/
- 25.Chen T., Guestrin C. XGBoost: A Scalable Tree Boosting System; Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco, CA, USA. 13–17 August 2016. [Google Scholar]
- 26. [(accessed on 21 December 2019)]. Available online: https://github.com/dmlc/xgboost.
- 27.Van Rossum G. Python Tutorial. CreateSpace Independent Publishing Platform; 1995. (online publisher) [Google Scholar]
- 28.Ling C.X., Huang J., Zhang H. Advances in Artificial Intelligence. Canadian AI 2003. Volume 2671. Springer; Berlin/Heidelberg, Germany: 2003. AUC: A better measure than accuracy in comparing learning algorithms; pp. 329–341. Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence) [Google Scholar]
- 29.Lundberg S.M., Lee S.-I. Unified Approach to Interpreting Model Predictions; Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017); Long Beach, CA, USA. 4–9 December 2017; 2017. [Google Scholar]
- 30.Abbas M., Robalo Nunes T., Martischang R., Zingg W., Iten A., Pittet D., Harbarth S. Nosocomial transmission and outbreaks of coronavirus disease 2019: The need to protect both patients and healthcare workers. Antimicrob. Resist. Infect. Control. 2021;10:7. doi: 10.1186/s13756-020-00875-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Leeds C. COVID 19: Health care workers, risks, protection and transmission. Lancet Reg. Health. 2021;1:100022. doi: 10.1016/j.lanepe.2020.100022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.La Milia V., Bacchini G., Bigi M.C., Casartelli D., Cavalli A., Corti M., Crepaldi M., Limardo M., Longhi S., Manzoni C., et al. COVID-19 Outbreak in a Large Hemodialysis Center in Lombardy, Italy. Kidney Int. Rep. 2020;5:1095–1099. doi: 10.1016/j.ekir.2020.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kliger A.S., Silberzweig J. Mitigating risk of COVID-19 in dialysis facilities. Clin. J. Am. Soc. Nephrol. 2020;15:707–709. doi: 10.2215/CJN.03340320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Basile C., Combe C., Pizzarelli F., Covic A., Davenport A., Kanbay M., Kirmizis D., Schneditz D., Van Der Sande F., Mitra S. Recommendations for the prevention, mitigation and containment of the emerging SARS-CoV-2 (COVID-19) pandemic in haemodialysis centres. Nephrol. Dial. Transplant. 2020;35:737–741. doi: 10.1093/ndt/gfaa069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kliger A.S., Cozzolino M., Jha V., Harbert G., Ikizler T.A. Managing the COVID-19 pandemic: International comparisons in dialysis patients. Kidney Int. 2020;98:12–16. doi: 10.1016/j.kint.2020.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Marcelli D. EuCliD (European Clinical Database): A database comparing different realities. J. Nephrol. 2001;14:S94–S100. [PubMed] [Google Scholar]
- 37.Richards N., Ayala J.A., Cesare S., Chazot C., Di Benedetto A., Gassia J.P., Merello J.I., Rentero R., Scatizzi L., Marcelli D. Assessment of quality guidelines implementation using a continuous quality improvement programme. Blood Purif. 2007;25:221–228. doi: 10.1159/000101026. [DOI] [PubMed] [Google Scholar]
- 38.Marcelli D., Moscardó V., Steil H., Day M., Kirchgessner J., Mitteregger A., Orlandini G., Gatti E. Data management and quality assurance for dialysis network. Contrib. Nephrol. 2002;137:293–299. doi: 10.1159/000060251. [DOI] [PubMed] [Google Scholar]
- 39.Delamater P.L., Street E.J., Leslie T.F., Yang Y.T., Jacobsen K.H. Complexity of the basic reproduction number (R0) Emerg. Infect. Dis. 2019;25:1–4. doi: 10.3201/eid2501.171901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Corbett R.W., Blakey S., Nitsch D., Loucaidou M., McLean A., Duncan N., Ashby D.R., Appelbe M., Brown E., Cairns T., et al. Epidemiology of COVID-19 in an urban dialysis center. J. Am. Soc. Nephrol. 2020;31:1815–1823. doi: 10.1681/ASN.2020040534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Meyers L.A., Pourbohloul B., Newman M.E.J., Skowronski D.M., Brunham R.C. Network theory and SARS: Predicting outbreak diversity. J. Theor. Biol. 2005;232:71–81. doi: 10.1016/j.jtbi.2004.07.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Frieden T.R., Lee C.T. Identifying and interrupting superspreading events-implications for control of severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 2020;26:1061–1066. doi: 10.3201/eid2606.200495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Herrera J.L., Srinivasan R., Brownstein J.S., Galvani A.P., Meyers L.A. Disease Surveillance on Complex Social Networks. PLoS Comput. Biol. 2016;12:e1004928. doi: 10.1371/journal.pcbi.1004928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Zwald M.L., Lin W., Sondermeyer Cooksey G.L., Weiss C., Suarez A., Fischer M., Bonin B.J., Jain S., Langley G.E., Park B.J., et al. Rapid Sentinel Surveillance for COVID-19—Santa Clara County, California, March 2020. Morb. Mortal. Wkly. Rep. 2020;69:419–421. doi: 10.15585/mmwr.mm6914e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Colman E., Holme P., Sayama H., Gershenson C. Efficient sentinel surveillance strategies for preventing epidemics on networks. PLoS Comput. Biol. 2019;15:e1007517. doi: 10.1371/journal.pcbi.1007517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Monaghan C., Larkin J., Chaudhuri S., Han H., Jiao Y., Bermudez K., Weinhandl E., Dahne-Steuber I., Belmonte K., Neri L., et al. Artificial Intelligence for COVID-19 Risk Classification in Kidney Disease: Can Technology Unmask an Unseen Disease? medRxiv. 2020 doi: 10.1101/2020.06.15.20131680. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Open source datasets adopted for the study have been referenced throughout the manuscript. Restrictions apply to the availability of these data. Data was obtained from Fresenius Medical Care and may be available for specific, well-motivated requests, from the corresponding author with the permission of Fresenius Medical Care.