Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

Alexander Henzi; Gian-Reto Kleger; Matthias P Hilty; Pedro D Wendel Garcia; Johanna F Ziegel; on behalf of RISC-19-ICU Investigators for Switzerland

doi:10.1371/journal.pone.0247265

. 2021 Feb 19;16(2):e0247265. doi: 10.1371/journal.pone.0247265

Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

Alexander Henzi ¹, Gian-Reto Kleger ², Matthias P Hilty ^3,⁴, Pedro D Wendel Garcia ^3,⁴, Johanna F Ziegel ^1,^*; on behalf of RISC-19-ICU Investigators for Switzerland^¶

Editor: Martina Crivellari⁵

PMCID: PMC7894868 PMID: 33606773

Abstract

Rationale

The COVID-19 pandemic induces considerable strain on intensive care unit resources.

Objectives

We aim to provide early predictions of individual patients’ intensive care unit length of stay, which might improve resource allocation and patient care during the on-going pandemic.

Methods

We developed a new semiparametric distributional index model depending on covariates which are available within 24h after intensive care unit admission. The model was trained on a large cohort of acute respiratory distress syndrome patients out of the Minimal Dataset of the Swiss Society of Intensive Care Medicine. Then, we predict individual length of stay of patients in the RISC-19-ICU registry.

Measurements

The RISC-19-ICU Investigators for Switzerland collected data of 557 critically ill patients with COVID-19.

Main results

The model gives probabilistically and marginally calibrated predictions which are more informative than the empirical length of stay distribution of the training data. However, marginal calibration was worse after approximately 20 days in the whole cohort and in different subgroups. Long staying COVID-19 patients have shorter length of stay than regular acute respiratory distress syndrome patients. We found differences in LoS with respect to age categories and gender but not in regions of Switzerland with different stress of intensive care unit resources.

Conclusion

A new probabilistic model permits calibrated and informative probabilistic prediction of LoS of individual patients with COVID-19. Long staying patients could be discovered early. The model may be the basis to simulate stochastic models for bed occupation in intensive care units under different casemix scenarios.

1 Introduction

During the COVID-19 pandemic, governments worldwide imposed severe restrictions on public life in order to limit the spread of the SARS-CoV-2 virus. A critical point in the decision making process was the limitation of beds in intensive care units (ICU) in order to adequately treat all severe cases of COVID-19. Many countries increased the number of ICU beds substantially at the onset of the crisis. A critical issue with severe COVID-19 disease is the frequent need for prolonged ICU treatment. For informed decision making it is important to quantitatively assess how long the patients are expected to be in an ICU.

At the example of Switzerland, we propose a prediction method for the individual length of stay (LoS) of patients in ICUs, and apply it to COVID-19 patients. The predictions are given for each patient based on covariates available within 24 hours after ICU admission. The method generates probabilistic predictions, that is, for each patient that enters the ICU, we provide a predictive cumulative distribution function (CDF) that comprehensively quantifies the uncertainty of the LoS at the time point of prediction. In particular, the predictive CDF allows to give prediction intervals with nay desired coverage probability. More precisely, the predictive CDF is an estimate of the conditional distribution of the LoS of the patient given covariates, which include age, gender, Simplified Acute Physiology Score (SAPS II) [1], and Nine Equivalents of nursing Manpower use Score (NEMS) (first shift) [2]. Fig 1 shows some predictive CDFs for randomly selected COVID-19 patients black, and true LoS as vertical lines. For each possible value t of the LoS, the value of the predictive CDF, F(t), gives the probability that the patient stays at most t days in the ICU. Conversely, 1 − F(t) gives the probability that the patient stays longer than t days in the ICU. For example, patient 1 had an LoS of 20 days. The predicted probability that the patient stays at most 20 days was 0.91, and the probability for a stay of at least 10 days was 0.26 (or 0.74 for at most 10 days). Patient 4 stays longest in the ICU. This is in agreement with the predictive CDFs, since for all possible t, the probability of staying longer than t is highest for patient 4. The waves in the curves are explained by the fact that patients have a higher possibility to leave the ICU at certain times of the day, and a lower at others.

Fig 1 — Four patients were drawn at random. The four wavy lines represent their predictive CDFs for the LoS based on covariates that are available at most 24 hours after ICU admission, that is, for each value t of the LoS on the horizontal axis, the curve gives the probability that the respective patient stays at most t days in the ICU. The vertical dashed lines represent the actually observed values of the LoS of the patients, which are unknown at the time of prediction. The larger the increase of the CDF on a given interval on the horizontal axis, the higher the probability of observing an LoS in this interval. For example, the predicted probability for the LoS of patient 1 being between 0 and 5 days is 0.47, whereas this probability is 0.40 for patient 2, 0.35 for patient 3, and only 0.19 for patient 4. The CDF of patient 4 lies substantially below the CDFs of the other patients which is in agreement with patient 4 having the longest realized LoS.

Probabilistic predictions allow to assess the uncertainty of the LoS comprehensively. Therefore they are preferable to forecasts for the mean or median LoS only. Their usefulness is illustrated by the following examples. The probabilistic LoS predictions allow to derive probabilistic forecasts for the number of patients who are still at the ICU at a certain day in future. This may be useful for planning purposes. For a single patient admitted today with predictive LoS distribution F, the probability that the patient is still at the ICU after t days equals 1 − F(t). From the probabilities for single patients, one may compute (with statistical software) the probability that any given number of patients is still at the ICU after t days. This allows to answer questions like ‘How likely is it that there are at least two free beds in five days?’ or ‘What is the smallest number of patients we expect to stay until next week with a high probability (say, 90%)?’. The LoS forecasts, and so also the answers to such questions, take into account the individual characteristics of the patients currently at the ICU. The probabilistic LoS predictions also allow to give alerts for patients that are likely to stay unusually long in the LOS. For example, fix a threshold of x days, say x = 25, and give an alert if the probability that the patient stays longer than x days exceeds, say, 90%. That is, if 1 − F(x) > 90%, where F is the predictive LoS distribution of a specific patient.

For planning of normal ward and intermediate care unit to ICU patient flows, such information is key to allow optimized resource allocation. On a larger scale, one could plan regional patient allocations to multiple hospitals based on such algorithms. The current health care crisis has emphasized the importance of patient flow logistics, and informative predictions of LoS are essential for this purpose.

It is documented in the literature that the prediction of the LoS at the patient level is difficult, and none of the available prediction models is providing satisfactory forecasts [3] with a possible exception being the complex models presented in [4, 5] for the purpose of benchmarking. Furthermore, the focus has almost exclusively been on only point predictions for the mean LoS, which is not ideal given that the LoS distribution is heavily skewed.

Recently, methodological progress has been made by Ziegel’s group [6]: Based on data in the format of the Minimal Dataset of the Swiss Society of Intensive Care Medicine (MDSi), it is possible to give skillful and calibrated probabilistic predictions for the LoS of patients in ICUs 24h after their admission. In particular, the predictions for the probability of exceedance of the LOS over a certain threshold is shown to be reliable. The proposed method is semi-parametric, which makes it highly adaptive to the shape of the conditional LoS distributions. However, it requires large training data sets. The currently available data on COVID-19 patients in Swiss ICUs is (fortunately) not sufficient. Therefore, we suggest to borrow strength from the MDSi in order to predict the conditional LoS of COVID-19 patients.

The LoS of a patient in an ICU does not only depend on their physical condition but also on the characteristics and policies of the ICU. Even within a small country such as Switzerland such differences can be observed [6]. We restrict the analysis in this paper to Switzerland but the methodology can be adapted to other countries given sufficient data is available. We use the prediction method for the LoS to analyze the characteristics of the LoS of COVID-19 patients with respect to age differences, and gender differences. Since some parts of Switzerland were hit harder by the pandemic than others, we also use the predictions to analyse regional differences in the LoS.

2 Patients and methods

2.1 RISC-19-ICU and MDSi

Risk Stratification in COVID-19 patients in the Intensive Care Unit (RISC-19-ICU) registry, is a collaborative effort with the participation of a majority of the Swiss ICUs to provide a basis for decision support during the ongoing public health crisis, endorsed by the Swiss Society of Intensive Care Medicine (https://www.risc-19-icu.net/) [7, 8]. ICU data were reported on a daily basis, including near real-time data on LoS. The registry was deemed exempt from the need for additional ethics approval and patient informed consent by the ethics committee of the University of Zurich (KEK 2020-00322, ClinicalTrials.gov Identifier: NCT04357275). Fully anonymized datasets, in regard to Swiss law, were collected using a secure REDCap infrastructure provided by the Swiss Society of Intensive Care Medicine.

557 critically ill patients with COVID-19 that have been admitted to an ICU in Switzerland have entered the registry as of the snapshot date, June15, 2020, 481 of which have already been dismissed from the ICU or have died, that is for 86.36% of the patients the LoS is available. There are 18 patients for which one or more of the covariates are not available. Overall, covariates and LoS observations are available for 473 patients, and we call these the COVID-19 dataset. Censoring is a non-trivial problem in the COVID-19 dataset and we address this issue in detail in Section A of S1 Appendix.

The Minimal Dataset of the Swiss Society of Intensive Care Medicine (MDSi) has been introduced in 2005 and contains fully anonymized key data of the entire number of ICU patients in certified Swiss ICU’s (https://www.sgi-ssmi.ch/de/datensatz.html). In addition to demographic data, the MDSi includes SAPS II as initial illness severity score and NEMS per nursing shift as a workload score.

Because almost any patient with severe COVID-19 disease presents chiefly like acute respiratory distress syndrome (ARDS), the training data consists of all patients in the MDSi with the diagnosis of ARDS which were admitted to Swiss ICUs in the years 2012 to 2018. Of the 2411 admissions, 856 were excluded because they satisfy one or more of the following criteria: missing or implausible values for SAPS II or NEMS (135), age younger than 16 (5), admitted with burns as initial diagnosis (3) or undergoing transplant operations 24 hours before or after ICU admission (8), readmissions (132), and patients admitted from ICUs or transferred to other ICUs (580). The exclusion of patients transferred from or to ICUs is because their LoS is incomplete and therefore not suitable for prediction. For the LoS predictions, admissions are standardized to a common admission time at midnight, in order to recover patterns in the ICU discharge times [6]. As a consequence, 99 patients had to be excluded because they did not stay in the ICU at least until midnight of the admission date. After exclusions, the training dataset consists of 1555 observations.

Concerning the covariates that are available for prediction, the possibilities are limited to covariates that are available in the COVID-19 dataset and the training data in the same format. Clear choices are the gender and age of patients. Furthermore, we have included SAPS II and the NEMS of the first ICU shift as covariates since they are informative for the LoS [9–11].

2.2 Statistical methods

Distributional Index Models (DIMs) have been introduced in [6]. They are semi-parametric models for distributional regression building on isotonic distributional regression (IDR) introduced in [12, 13]. A distributional regression model allows to estimate the full conditional distribution of the LoS given covariates. For the DIM used in this paper, we use a parametric model for a real-valued index function α, the DIM index, that depends on gender g, age a, SAPS II s, and NEMS m, that is

\begin{matrix} α (g, a, s, m) = β_{0} + β_{1} 1 {g = male} + {cr}_{1} (a) + {cr}_{2} (s) + {cr}_{3} (m), \end{matrix}

where β₀ is the intercept, β₁ the coefficient for gender, and cr₁, cr₂, cr₃ are penalized cubic regression splines for the continuous variables age, SAPS II and NEMS; see the documentation of the mgcv package for details about the penalization. The model is fitted on the transformed LoS log(LoS+ 1). The log-transformation decreases the skewness of the data, while the addition of the constant 1 makes the resulting distribution more symmetric [6].

Furthermore, we assume that for the probability of the LoS Y of a randomly selected patient with covariates (G, A, S, M) = (g, a, s, m) it holds that

\begin{matrix} P (Y \leq y | (G, A, S, M) = (g, a, s, m)) = F_{α (g, a, s, m)} (y), for all y \in R \end{matrix}

(1)

with a family ${(F_{v})}_{v \in R}$ of stochastically ordered CDFs, that is F_v(y)≤F_w(y) for all $y \in R$ if v ≥ w.

We randomly split the training data in two and estimate α by $\hat{α}$ on the first half. Given $\hat{α}$ , we use the second half of the training data to estimate F_v using IDR. In order to make the estimation procedure less dependent on the splitting of the training data, we use repeat this procedure 100 times and average the resulting estimated distributions to obtain our final estimate ${\hat{F}}_{\hat{α}}$ .

There are dependencies between the covariates age, SAPS II and NEMS but we argue that it is still useful to include all of them in the model. The variable age is contained in SAPS II as a discretized effect with 6 levels. Age enters the model as a cubic regression spline with sufficiently high dimension, manually removing the age variable from SAPS II would essentially correspond to a basis transformation of the model and not affect the prediction results. The information provided by the NEMS is not redundant to SAPS II. NEMS is a crucial variable for COVID-19 patients since it contains information on the ventilation status, therapy with cardiovascular drugs and renal replacement treatment, which are not in the SAPS II. More precisely than the SAPS II, the NEMS reflects the actual therapeutic intensity a patients needs, and it is therefore likely to be one of the earliest markers for LoS.

Probabilistic predictions should be calibrated and sharp [14]. We assess probabilistic calibration by Probability Integral Transform (PIT) histograms, and use Pearson’s chi-square test with 10 bins to test for uniformity. Marginal calibration is checked by comparing average predicted CDFs to empirical CDFs (ECDFs). Sharpness is assessed using the Continuous Ranked Probability Score (CRPS) and predictive power is compared with a Diebold-Mariano test based on the CRPS, see Section B of S1 Appendix.

The implementation is done in R 4.0 [15] using the packages mgcv [16] for the estimation of the index function, and isodistrreg for isotonic distributional regression [12]. Sample data and code are provided in the supplement S1 Code of this article.

3 Results

3.1 General

Summary statistics for the COVID-19 dataset and the training data are given in Table 1. The figures are correct for the June 15, 2020, snapshot. The proportion of men in the COVID-19 dataset is higher than in the training data set. The age structure of both datasets is similar with COVID-19 patients being slightly younger on average. COVID-19 patients generally have a higher NEMS in the first shift. The median and mean SAPS II is similar in both datasets.

Table 1. Summary statistics of COVID-19 dataset and training data.

Variable	Data	Q25	Median	Mean	Q75	P-value
Age	training	55.0	67.0	63.8	75.0	4.04 ⋅ 10⁻³
Age	COVID-19	55.0	63.0	63.0	72.0	4.04 ⋅ 10⁻³
LoS	training	4.5	9.1	12.4	15.8	5.79 ⋅ 10⁻⁵
LoS	COVID-19	5.0	12.0	13.9	19.0	5.79 ⋅ 10⁻⁵
NEMS	training	18.0	27.0	28.6	34.0	<1.0 ⋅ 10⁻¹⁶
NEMS	COVID-19	32.0	32.0	33.2	39.0	<1.0 ⋅ 10⁻¹⁶
SAPS II	training	35.0	46.0	48.5	59.0	1.39 ⋅ 10⁻¹
SAPS II	COVID-19	29.0	50.0	44.9	58.0	1.39 ⋅ 10⁻¹
Gender	training	Male: 61.9%		Female: 38.1%		1.66 ⋅ 10⁻⁸
Gender	COVID-19	Male: 75.9%		Female: 24.1%		1.66 ⋅ 10⁻⁸

Open in a new tab

P-values are for two-sided Wilcoxon’s rank sum test for continuous variables and Fisher’s exact test for gender.

Fig 2 provides a quantitative comparison of the LoS in the COVID-19 dataset and the training data. Panel (a) shows that the probability $P (Y \geq y)$ of the LoS exceeding a fixed threshold y is larger for COVID-19 patients than in the training data up to about y = 30 days, and afterwards the relationship is reversed.

This observation does not exclude the possibility that given the covariates (G, A, S, M) for an individual patient, the conditional distribution of the LOS can be predicted well using the training data. Panel(b) of Fig 2 shows that the individual predictions are reasonable and are marginally calibrated up to about 25 days. The tail of the average forecast distribution is heavier than the tail of the empirical distribution of the COVID-19 dataset, meaning that very long LoS are less likely in the COVID-19 dataset.

The DIM predictions for the LoS of the COVID-19 patients have an average CRPS of 5.29 compared to 5.69, which is the average CRPS when predicting the LoS of the COVID-19 patients with the ECDF of the training data, that is, for all patients, independently of the covariates, the LoS is predicted by using the distribution of all the LoS values in the training data. This difference is highly significant with a p-value of less than 5 ⋅ 10⁻⁴. This shows that the DIM predictions are significantly more informative than the ECDF forecast. The DIM predictions show better calibration than the ECDF predictions, see S3 Fig in S1 Appendix. Uniformity of the PIT is rejected for the ECDF forecasts (p-value <10⁻⁴). For the DIM forecasts, uniformity of the PIT is not rejected (p-value: 0.384).

3.2 Age differences

Fig 3(a) gives the empirical CDFs of COVID-19 patients grouped by age. Young patients, less than 40 years, and very old patients, greater than 80 years have much shorter LoS than patients between 40 and 80. Patients between 40 and 65 tend to have a shorter LoS than patients between 65 and 80 except in cases of long LoS beyond 30 days. In Fig 3(c) the empirical CDFs are compared to the predictions based on the training data. The predictions for patients younger than 40 seem reasonable but their quality is hard to judge given the small sample size of this group in the COVID-19 dataset. For patients older than 80, the predicted LoS is longer than observed, but again, a definite statement should not be made due to small sample size. For patients between 40 and 65, marginal calibration is good until about 18 days. For higher thresholds, a longer LoS is predicted than observed. For patients between 65 and 80 years, the predictions give too much weight to LoS shorter than 25 days, and substantially overestimate the LoS beyond 25 days. Fig 3(b) shows that the training data leads to predictions of shorter LoS for patients younger than 40 and older than 80. In contrast to the COVID-19 data, the predicted LoS for patients between 65 and 80 is shorter than for patients between 40 and 65.

3.3 Gender differences

Fig 4(a) shows the empirical CDF of COVID-19 patients grouped by gender. Female patients show a slightly shorter LoS. The deviations of the predicted LoS from the observed LoS for male and female patients is displayed in Fig 4(c). Qualitatively the differences are similar with a slightly worse agreement of predictions and observations for female patients. The average predictive distributions for male and female patients are displayed in Fig 4(b). The predictions show a clear difference depending on gender with the same order as the COVID-19 data in that the LoS for women tends to be shorter than the one for men. However, the difference in average predicted LoS CDF is larger than the difference in ECDF based on the COVID-19 data.

In order to gain some insight on the reasons for this effect, we checked if there is a significant difference in the LoS distribution of men and women in the training data. This is not the case. Furthermore, a comparison of the distribution of the DIM index computed for the men and women in the COVID-19 dataset shows that, indeed, the index values for women tend to be smaller than those for men, which explains the differences between the CDFs in Fig 4(b). In summary, it appears that a female patient with COVID-19 is likely to stay longer in the ICU than a similar female patient in the training data, whereas this effect is less pronounced for men.

3.4 Regional differences

We have split the COVID-19 dataset according to the location of the ICU within Switzerland. Region NE consisting of Northern and Eastern Switzerland and Region WT consisting of Western Switzerland and Ticino. Region WT was hit earlier and more severely by the COVID-19 crisis than Region NE. While ICU capacity limits were never reached in Region NE, ICU occupation was possibly critical in Region WT.

The LOS distribution of COVID-19 patients is similar in both regions. The null hypothesis of equal LoS distribution in both regions cannot be rejected (two-sample Kolmogorov-Smirnov test p-value: 0.510, Wilcoxon rank sum test p-value: 0.607), see also S4 Fig in S1 Appendix. Comparing the regional LoS distributions to the DIM forecasts for the regions, we obtain that both regions show the same pattern: Good marginal calibration until about 25 days and then shorter LoS of the COVID-19 patients in comparison to the DIM predictions, see S5 Fig in S1 Appendix. The differences in the predictions for both regions are small, see S6 Fig in S1 Appendix.

4 Discussion

We have applied a new semi-parametric model, a DIM, for probabilistic predictions for the LoS of COVID-19 patients in Swiss ICUs. The model is trained with data from the MDSi, namely with data of patients with ARDS. Validation of the model using the COVID-19 dataset shows that the predictions are probabilistically calibrated, marginally calibrated (except for the tail of the distribution), and significantly more informative then an ECDF forecast based on the training data.

COVID-19 patients younger than 40 and older than 80 years tend to have a shorter stay in the ICU than the patient groups between 40–65 and 65–80 years. Predictions for patients older than 80 were longer than observed which could be an indicator of early treatment withdrawal in very old patients with severe COVID-19 disease. In the age groups 65-80 years, forecasts were shorter in the early phase than observations. This could be explained by prolonged recovery times compared with ARDS in elderly patients. The forecasts in both age groups (40–65 and 65–80 years) were longer after 25 to 30 days. In those patient groups, withdrawal of treatment is often executed after 20-30 days because of medical futility. The analysis of the LoS with respect to age suggests that there are differences between ARDS (training data) and COVID-19 in the sense that in terms of LoS COVID-19 patients might rather behave like slightly older ARDS patients keeping the other covariates fixed.

The difference between the LoS distribution of female and male COVID-19 patients is smaller than the difference between the predicted LoS distributions based on the training data, that is, non-COVID-19 patients with ARDS. For male patients the predictions agree better with the empirical distribution of observed LoS of the COVID-19 patients than for female patients. In terms of LoS, male COVID-19 patients behave more similarly to patients in the training data than female COVID-19 patients, making “longer than expected” LoS more likely for female than for male patients.

Despite the fact that the Western Switzerland and Ticino (Region WT) were hit earlier, and potentially less prepared for the COVID-19 crisis than Northern and Eastern Switzerland (Region NE), we do not see an impact on the LoS of COVID-19 patients.

There are somepossible shortcomings of our study. First, the training dataset is not on COVID-19 patients. Despite severe COVID-19 pneumonia behaving similar to ARDS, there are some important differences [17]. Furthermore, multiple organ involvement is frequent in severe COVID-19 disease [18, 19]. There have been discussions how and if classical ARDS and ARDS secondary to COVID-19 (C-ARDS) are different. Initially, substantial differences were postulated [20–22] but more recently consensus is growing that C-ARDS is most probably similar to classical ARDS in treatment intensity and therapeutic approach [23]. In view of this, the historical training data is as well chosen as historical data can be. Furthermore, the NEMS evaluates how severe or nursing intensive a patient is, independently of the diagnosis. Therefore, using is as a covariate in prediction is likely to mitigate confounders between training data and COVID-19 dataset. Second, a limitation is imposed by the use of MDSi as training dataset because the analysis is then constrained to the relatively few variables contained in MDSi. Clearly, there are further relevant predictors for COVID-19 patients. However, most of them concern mortality and not LoS, for example, coagulation status. These values are available in the RISC-19-ICU registry but not in the MDSi training data. Furthermore, we believe that a successful model for probabilistic predictions of LoS should rely on values that are routinely recorded and available early after hospitalization such as SAPS II and NEMS. Since they are compound variables, they are informative for the LoS. If training data sets with more covariates are available, the DIM model we propose in Section 2.2 could be adapted to variables specific to COVID-19 patients. This may lead to an increase in predictive skill. Third, there is possibly a bias towards a longer predicted LoS because of the data sampling process. We have assessed whether the patients with missing LoS value in the RISC-19-ICU registry have a substantially different distribution of covariate values than the patients with valid LoS value. This is not the case which is an indication that many of them, rather than having a censored LoS, have indeed not been updated. We have also repeated all of our analyses on the COVID-19 dataset restricted to patients with admission date before April 5, 2020. Here, the update and the censoring problem should be less. Qualitatively, we obtained the same results as the ones reported here. Nevertheless, it should be kept in mind that some of the very long LoS are likely to be censored in either case. Fourth, LoS is often not only dependent on epidemiological and physiologic variables but additionally on ICU resources, therapeutic restriction policies [24] and withdrawal strategies (https://www.samw.ch/de/Ethik/Themen-A-bis-Z/Intensivmedizin.html). Our forecasts predict a longer LoS compared with the observed LOS overall and in almost any patient subgroups after 25 days. This may be due to an earlier withdrawal of the intensive therapy compared to ARDS, especially in shortage of ICU resources. However we did not find any significant difference in LoS distribution between two regions of Switzerland with diverse ICU strain.

5 Conclusion

A new semiparametric model permits calibrated and informative probabilistic prediction of LoS of individual patients with severe COVID-19 in ICUs, given covariate information. These predictions would allow to simulate stochastic models for bed occupation in ICUs under different scenarios for the case mix. These scenarios could be different projections for the rate at which COVID-19 patients and other patients arrive in the ICUs.

Supporting information

S1 Appendix. Additional information about probabilistic forecasting, censoring of LoS in the COVID-19 dataset, and supplementary figures.

(PDF)

Click here for additional data file.^{(488.6KB, pdf)}

S1 Data. Minimal dataset to replicate the results of this study.

(ZIP)

Click here for additional data file.^{(432.5KB, zip)}

S1 Code. Sample data and code to illustrate the computation and usage of probabilistic length of stay forecasts.

(ZIP)

Click here for additional data file.^{(15.5KB, zip)}

Acknowledgments

The authors are grateful to the Swiss Society of Intensive Care Medicine for providing access to the MDSi.

RISC-19-ICU Investigators of Switzerland: RISC-19-ICU registry, University of Zurich (Hilty Matthias P, MD; Wendel Garcia Pedro D, MSc; Schüpbach Reto A, MD; Thierry Fumeaux, MD; Jonathan Montomoli, MD, PhD; Philippe Guerci, MD); Klinik für Operative Intensivmedizin, Kantonsspital Aarau, Aarau (Rolf Ensner, MD); Intensivstation, Kantonsspital Schaffhausen, Schaffhausen (Nadine Gehring, MD); Institut fuer Anesthaesie und Intensivmedizin, Zuger Kantonsspital AG, Baar (Peter Schott, MD; Severin Urech, MD); Department Intensivmedizin, Universitaetsspital Basel, Basel (Martin Siegemund, MD; Nuria Zellweger); Intensivmedizin, St. Claraspital, Basel (Adriana Lambert, MD; Lukas Merki, MD); Department of Intensive Care Medicine, University Hospital Bern, Inselspital, Bern (Marie-Madlen Jeitziner, RN, PhD; Beatrice Jenni-Moser, RN, MSc); Department Intensive Care Medicine, Spitalzentrum Biel, Biel (Marcus Laube, MD); Interdisziplinäre Intensivstation, Spital Bülach, Bülach (Bernd Yuen, MD; Thomas Hillermann, MD); Intensivstation, Regionalspital Emmental AG, Burgdorf (Petra Salomon, MD; Iris Drvaric, MD); Intensivmedizin, Kantonsspital Graubünden, Chur (Frank Hillgaertner, MD; Marianne Sieber); Institut fuer Anaesthesie und Intensivmedizin, Spital Thurgau, Frauenfeld (Alexander Dullenkopf, MD; Lina Petersen, MD); Soins Intensifs, Hopital cantonal de Fribourg, Fribourg (Hatem Ksouri, MD, PhD; Govind Oliver Sridharan, MD); Division of Intensive Care, University Hospitals of Geneva, Geneva (Sara Cereghetti, MD; Filippo Boroli, MD; Jerome Pugin, MD, PhD); Division of Neonatal and Pediatric Intensive Care, University Hospitals of Geneva, Geneva (Serge Grazioli, MD; Peter C. Rimensberger, MD); Intensivstation, Spital Grabs, Grabs (Christian Bürkle, MD); Institut für Anaesthesiologie Intensivmedizin & Rettungsmedizin, See-Spital Horgen & Kilchberg, Horgen (Julien Marrel, MD; Mirko Brenni, MD); Soins Intensifs, Hirslanden Clinique Cecil, Lausanne (Isabelle Fleisch, MD; Jerome Lavanchy, MD); Anaesthesie und Intensivmedizin, Kantonsspital Baselland, Liestal (Anja Baltussen Weber, MD; Peter Gerecke, MD; Andreas Christ, MD); Dipartimento Area Critica, Clinica Luganese Moncucco, Lugano (Romano Mauri, MD; Samuele Ceruti, MD); Interdisziplinaere Intensivstation, Spital Maennedorf AG, Maennedorf (Katharina Marquardt, MD; Karim Shaikh, MD); Institut fuer Anaesthesie und Intensivmedizin, Spital Thurgau, Münsterlingen (Thomas Neff, MD; Tobias Hübner, MD); Intensivmedizin, Schweizer Paraplegikerzentrum Nottwil, Nottwil (Hermann Redecker, MD); Soins intensifs, Groupement Hospitalier de l’Ouest Lémanique, Hôpital de Nyon, Nyon (Thierry Fumeaux, MD; Mallory Moret-Bochatay, MD); Intensivmedizin & Intermediate Care, Kantonsspital Olten, Olten (Michael Studhalter, MD); Intensivmedizin, Spital Oberengadin, Samedan (Michael Stephan, MD; Jan Brem, MD); Anaesthesie Intensivmedizin Schmerzmedizin, Spital Schwyz, Schwyz (Daniela Selz, MD; Didier Naon, MD); Medizinische Intensivstation, Kantonsspital St. Gallen, St. Gallen (Gian-Reto Kleger, MD); Departement of Anesthesiology and Intensive Care Medicine, Kantonsspital St. Gallen, St. Gallen (Miodrag Filipovic, MD; Urs Pietsch, MD); Paediatric Intensive Care Unit, Children’s Hospital of Eastern Switzerland, St. Gallen (Bjarte Rogdo, MD; Andre Birkenmaier, MD); Departement for intensive care medicine, Kantonsspital Nidwalden, Stans (Anette Ristic, MD; Michael Sepulcri, MD); Intensivstation, Spital Simmental-Thun-Saanenland AG, Thun (Antje Heise, MD); Klinik für Anaesthesie und Intensivmedizin, Spitalzentrum Oberwallis, Visp (Friederike Meyer zu Bentrup, MD, MBA); Service d’Anesthesiologie, EHNV, Yverdon- les-Bains (Marilene Franchitti Laurent, MD; Jean-Christophe Laurent, MD); Institute of Intensive Care Medicine, University Hospital Zurich, Zurich (Philipp Bühler, MD; Silvio Brugger, MD, PhD; Jan Bartussek, PhD; Martina Maibach, PhD; Annelies Zinkernagel, MD, PhD, Dorothea Heuberger, PhD; Srikanth Mairpady Shambat, PhD); Interdisziplinaere Intensivstation, Stadtspital Triemli, Zurich (Patricia Fodor, MD; Pascal Locher, MD; Giovanni Camen, MD); Abteilung für Anaesthesiologie und Intensivmedizin, Hirslanden Klinik Im Park, Zürich (Tomislav Gaspert, MD; Marija Jovic, MD); Institut für Anaesthesiologie und Intensivmedizin, Klinik Hirslanden, Zurich (Christoph Haberthuer, MD; Roger F. Lussman, MD).

Data Availability

Any intensive care unit or other centre treating critically ill COVID-19 patients is invited to join the RISC-19-ICU registry at https://www.risc-19-icu.net. While the registry protocol prevents the deposition of the full registry dataset in a third-party repository, analyses on the full dataset may be requested by any collaborating centre after approval of the study protocol by the registry board. Reproducibility of the results in the present study is ensured by providing underlying code and an adapted dataset to exemplarily test the later. The registry protocol and data dictionary is publicly accessible at https://www.risc-19-icu.net.

Funding Statement

The author(s) received no specific funding for this work.

References

1. Le Gall JR, Lemeshow S, Saulnier F. A New Simplified Acute Physiology Score (SAPS II) Based on a European/North American Multicenter Study. JAMA. 1993;270:2957–2963. [DOI] [PubMed] [Google Scholar]
2. Miranda DR, Nap R, de Rijk A, Schaufeli W, Iapichino G, members of the TISS Working Group. Nursing activities score. Crit Care Med. 2003;31:374–382. 10.1097/01.CCM.0000045567.78801.CC [DOI] [PubMed] [Google Scholar]
3. Verburg IWM, Atashi A, Eslami S, Holman R, Abu-Hanna A, de Jonge E, et al. Which models can I use to predict adult ICU length of stay? A systematic review. Crit Care Med. 2017;45:e222–e231. 10.1097/CCM.0000000000002054 [DOI] [PubMed] [Google Scholar]
4. Zimmerman JE, Kramer AA, McNair DS, Malila FM, Shaffer VL. Intensive care unit length of stay: Benchmarking based on Acute Physiology and Chronic Health Evaluation (APACHE) IV. Crit Care Med. 2006;34:2517–2529. 10.1097/01.CCM.0000240233.01711.D9 [DOI] [PubMed] [Google Scholar]
5. Vasilevskis EE, Kuzniewicz MW, Cason BA, Lane RK, Dean ML, Clay T, et al. Mortality probability model III and acute simplified physiology score II: Assessing their value in predicting length of stay and comparison to APACHE IV. Chest. 2009;136:89–101. 10.1378/chest.08-2591 [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Henzi A, Kleger GR, Ziegel JF. Distributional (Single) Index Models. Preprint. 2020;arXiv:2006.09219. [Google Scholar]
7. Wendel Garcia PD, Fumeaux T, Guerci P, Heuberger DM, Montomoli J, Roche-Campo F, et al. Prognostic factors associated with mortality risk and disease progression in 639 critically ill patients with COVID-19 in Europe: Initial report of the international RISC-19-ICU prospective observational cohort. EClinicalMedicine. 2020; p. 100449 10.1016/j.eclinm.2020.100449 [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Hilty MP, Wendel Garcia PD. hobbes8080/risc-19-icu: registry data transformation v1.0. Zenodo Data Repository. 2020. 10.5281/zenodo.3757064 [DOI] [Google Scholar]
9. Rothen HU, Stricker K, Einfalt J, Bauer P, Metnitz PG, Moreno RP, et al. Variability in outcome and resource use in intensive care units. Intensive Care Med. 2007;33(8):1329–1336. 10.1007/s00134-007-0690-3 [DOI] [PubMed] [Google Scholar]
10. Granholm A, Christiansen CF, Christensen S, Perner A, Mueller MH. Performance of SAPS II according to ICU length of stay: A Danish nationwide cohort study. Acta Anaesthesiol Scand. 2019;63(9):1200–1209. 10.1111/aas.13415 [DOI] [PubMed] [Google Scholar]
11.Kleger GR. Die Aufenthaltsdauer kritisch kranker Patienten auf einer Intensivstation: Probabilistische Prädiktionsmodelle. Master’s Thesis, University of Bern; 2018.
12.Henzi A, Ziegel JF, Gneiting T. Isotonic distributional regression. Preprint. 2019;arXiv:1909.03725.
13. Mösching A, Dümbgen L. Monotone least squares and isotonic quantiles. Electron J Stat. 2020;14:24–49. 10.1214/19-EJS1659 [DOI] [Google Scholar]
14. Gneiting T, Balabdaoui F, Raftery AE. Probabilistic forecasts, calibration and sharpness. J R Stat Soc Series B Stat Methodol. 2007;69:243–268. 10.1111/j.1467-9868.2007.00587.x [DOI] [Google Scholar]
15.R Core Team. R: A Language and Environment for Statistical Computing; 2020. Available from: https://www.R-project.org/.
16. Wood SN. Generalized Additive Models: An Introduction with R. 2nd ed Chapman and Hall/CRC; 2017. [Google Scholar]
17. Marini JJ, Gattinoni L. Management of COVID-19 Respiratory Distress. JAMA. 2020;323:2329–2330. 10.1001/jama.2020.6825 [DOI] [PubMed] [Google Scholar]
18. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Yang X, Yu Y, Xu J, Shu H, Xia J, Liu H, et al. Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a single-centered, retrospective, observational study. Lancet Respir Med. 2020;8(5):475–481. 10.1016/S2213-2600(20)30079-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Grasselli G, Tonetti T, Filippini C, Slutsky AS, Pesenti A, Ranieri VM. Pathophysiology of COVID-19-associated acute respiratory distress syndrome—Authors’ reply. Lancet Respir Med. 2021;9(1):e5–e6. 10.1016/S2213-2600(20)30525-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Marini JJ, Gattinoni L. Time Course of Evolving Ventilator-Induced Lung Injury: The “Shrinking Baby Lung”. Crit Care Med. 2020;48(8):1203–1209. 10.1097/CCM.0000000000004416 [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Chiumello D, Busana M, Coppola S, Romitti F, Formenti P, Bonifazi M, et al. Physiological and quantitative CT-scan characterization of COVID-19 and typical ARDS: a matched cohort study. Intensive Care Med. 2020;46:2187–2196. 10.1007/s00134-020-06281-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Trahtemberg U, Slutsky AS, Villar J. What have we learned ventilating COVID-19 patients? Intensive Care Med. 2020;46:2458–2460. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Vincent JL, Creteur J. Ethical aspects of the COVID-19 crisis: How to deal with an overwhelming shortage of acute beds. Eur Heart J Acute Cardiovasc Care. 2020;9(3):248–252. 10.1177/2048872620922788 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0247265.r001

Decision Letter 0

Martina Crivellari

11 Dec 2020

PONE-D-20-30127

Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

PLOS ONE

Dear Dr. Ziegel,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The paper is based on an interesting topic, the statistical analysis has been performed appropriately, but there many concerns authors should correct and some concepts clarified to make the manuscript suitable for publication. This paper could be very difficult to be interpreted by a physician without mathematical experience. Authors should simplify the interpretations of results.

There are no conflicts beteween reviews.
Please answer to all the questions moved by reviewers.

Please submit your revised manuscript by Jan 11th. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Martina Crivellari

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your ethics statement in the manuscript and in the online submission form, please provide additional information about the patient records/samples used in your retrospective study.

Specifically, please ensure that you have discussed whether all data/samples were fully anonymized before you accessed them.

If patients provided informed written consent to have data/samples from their medical records used in research, please include this information.

3. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: I Don't Know

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This paper attempts to provide a way to estimate LoS for individual patients, based on characteristics available within the first 24 hours of admission. I think that this is a very valuable contribution, as it can help with health care planning and provide more accurate estimates for when hospital capacity may be exceeded.

Although I think this is a nice paper in principal, I don't think enough information has been provided for me to adequately review the methods and results. As it stands I don't think the methods are clear, and the interpretation of the results is difficult to follow. I have tried to highlight below areas that I think could be made clearer:

Line 13, you talk about probabilistic predictions, but even with your definition it is not clear what this means. How is the uncertainty of the LoS quantified? This ties in with Figure 1, which I also don't think is clear. There is very little information provided in the figure legend or text about figure 1, and it is difficult to interpret. As I understand, each vertical line indicates how long someone actually stays in hospital, so patient 4 stayed in for 40 days? And then patient 3 was released from ICU on day ±1, but had a very low possibility of doing so (±0.05)? I think more of an explanation is the figure legend and text is required to adequately explain this figure, maybe even just proving a small example as I have done above would help. I think it's a nice figure which if adequately explained in the legend and text would help to clarify the aim/ methods of the paper. At the moment I don't think your overall aim is clear.

It is also not made clear in the introduction how having the CDF would be used. Who are you expecting to use your results, and how? It says in the abstract that individuals with long LoS could be discovered early, indicating that maybe this is for hospital planning purposes? Also, going back to figure 1, patient 4 has the longest LoS, but it is not clear precisely what indicates that patient 4 is going to have the longest LoS from the Forecast CDF. So difficult to see how the CDF is going to indicate long los.

Methods:

I understand that you are not able to release your data, however, there is no reason that the code could not be released, along with a brief description of the datasets you have available (could even consider creating some dummy datasets). This would allow others to understand your methods more clearly, and be able to repeat your analysis.

From your methods, it's not adequately explained what the difference is between ECDF and CDF. In addition, in the results you discuss ECDF, and show plots comparing them to CDF, but I think providing a brief summary at the beginning of the results would aid interpretation.

Results:

You talk about how this is done on an individual level, and yet in Figures 2-4 your provide figures for one overall CDF. So is this a CDF for the whole dataset? How have you combined them?

Line 138 you talk about panel c of figure 2, but there is no panel c.

Overall, I think this is an interesting idea and concept, but in it's current format I don't think their methods are reproducible and their results are not easy to follow. I think they need to be clear who is there target audience, is it for mathematical modellers or people with a more clinical background? Clinicians or anyone with a non-technical background would struggle to know what to do with this information. However, I do think it is highly relevant, so I hope that the authors are able to revise their manuscript to make things clearer. I think being able to predict who is going to spend a long time on ICU is of great value. Best of luck with the submission.

Reviewer #2: In this paper the authors developed a new semi parametric distributional index model that should provide a probabilistic prediction of ICU length of stay 24 hours after admission for COVID-19 patients.

The model is based on 4 covariates: age, gender, SAPS II and NEMS.

According to the authors these parameters were the only possible choice.

I wonder if and how covariates dependance affects the model. Particularly:

- age is included in SAPS II

- both SAPS II and NEMS are expression of the severity of patients status, is this model performing better than a simpler model including just SAPS II?

On the opposite, I wonder if including more specific variables, such as the coagulation status would provide better predictions.

For the quality and soundness of the statistical analysis, I do not have the skills to judge, and an expert’s opinion is needed.

Minor comments:

* Minor language revision is needed, as the manuscript contains several typos

Reviewer #3: The authors proposed a new semi-parametric model to predict individual ICU LOS even though prediction of LOS at the patient level is difficult and none of the available models were reliable. The model was fully demonstrated in two of method papers showing that it provides more information than classical models.

In Figure 1, predicted CDFs of LOS and the corresponding true LOS were depicted. In reality, patient 3 had the shortest LOS and patients 1 and 2 had longer LOS. However, the forecasted CDF for those three patients were very close to each other. It looks like the proposed model doesn’t have the ability to provide satisfactory forecasts at the patient level.

The authors mentioned LOS of a patient in ICU also depends on the characteristics and policies of the ICU. The COVID-19 pandemic placed a significant burden on healthcare systems. It induced unprecedented strain on ICU resources. It brought systematic error of prediction by using patients who were diagnosed of ARDS in 6 concurrent years as the training cohort. Because both measured and unmeasured confounders wound be unbalanced between training and COVID-19 cohorts.

It may be helpful to present comparisons of other measured confounders and patients’ dispositions between cohorts.

Patients who were admitted to ICU had two dispositions, discharged or dead. It looks like the authors included both discharged and dead patients in analysis. Was patients’ disposition considered differently in model? If not, can the authors clarify?

In line 153, the authors reported patients who were greater than 80 years of age have much shorter LOS. Is it because in-ICU mortality rate was higher for this age-subgroup?

The authors used cubic spline for continuous variables age, SAPS II, and NEMS in regression. Can the authors justify their decision to use cubic spline and how the assumptions of such a model were considered? Did the authors conducted tests for curvature and tests for significance of each curve?

Figure 1: The color and pattern of lines confused me. Why did you use green lines for patients 1, 2, and 3 and red lines for patient 4?

The paragraph of censoring of LOS in Appendix should be reported as a limitation in the discussion section.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Feb 19;16(2):e0247265. doi: 10.1371/journal.pone.0247265.r002

Author response to Decision Letter 0

12 Jan 2021

Detailed responses to all comments of the editor and the reviewers are provided in the pdf-file "response_Covid19_2020.12.17.pdf".

Attachment

Submitted filename: response_Covid19_2021.01.06.pdf

Click here for additional data file.^{(134.2KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0247265.r003

Decision Letter 1

Martina Crivellari

29 Jan 2021

PONE-D-20-30127R1

Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

PLOS ONE

Dear Dr. Ziegel,

ACADEMIC EDITOR: Most of comments have been addressed. I've asked a check to the statistician, see conclusions below. Please modify the manuscript as suggested to make it definitely suitable for publication.

Please submit your revised manuscript by Feb 15th. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Martina Crivellari

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #3: Yes

**********

6. Review Comments to the Author

Reviewer #3: The authors have addressed most of my comments and concerns.

The reviewer still worry about prediction accuracy. In figure 1, patient 3 had the shortest LoS but the forecast CDF curve was in the middle. The order of realized LoS from the shortest to the longest were (patient 3 < patient 2 < patient 1 < patient 4). But the predicted probabilities were in different order.

In the legend of figure 1, you said if a patient left on day t, the predictive CDF would jump from 0 to 1 at t. However, the CDFs in figure 1 didn’t do so. Patient left on day 1, but the corresponding CDF didn’t jump to 1.

The reviewer suggested to draw a 2-D scatter plot to indicate prediction accuracy. The x-axis is the observed values of the LoS and the y-axis is the probability that the respective patient would discharged on the realized day. Each dot represents each patient. For example, patient 1 was discharged/dead on day 20 and the probability that he/she would discharged on day 20 was about 80%. So the dot of patient 1 should be located at (x = 20, y = 80%).

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: No

PLoS One. 2021 Feb 19;16(2):e0247265. doi: 10.1371/journal.pone.0247265.r004

Author response to Decision Letter 1

1 Feb 2021

Please see the attached file "Response to Reviewers".

Attachment

Submitted filename: response_Covid19_2021.02.01.pdf

Click here for additional data file.^{(99KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0247265.r005

Decision Letter 2

Martina Crivellari

4 Feb 2021

Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

PONE-D-20-30127R2

Dear Dr. Ziegel,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Martina Crivellari

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

PLoS One. doi: 10.1371/journal.pone.0247265.r006

Acceptance letter

Martina Crivellari

10 Feb 2021

PONE-D-20-30127R2

Probabilistic analysis of COVID-19 patients' individual length of stay in Swiss intensive care units

Dear Dr. Ziegel:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Martina Crivellari

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix. Additional information about probabilistic forecasting, censoring of LoS in the COVID-19 dataset, and supplementary figures.

(PDF)

Click here for additional data file.^{(488.6KB, pdf)}

S1 Data. Minimal dataset to replicate the results of this study.

(ZIP)

Click here for additional data file.^{(432.5KB, zip)}

S1 Code. Sample data and code to illustrate the computation and usage of probabilistic length of stay forecasts.

(ZIP)

Click here for additional data file.^{(15.5KB, zip)}

Attachment

Submitted filename: response_Covid19_2021.01.06.pdf

Click here for additional data file.^{(134.2KB, pdf)}

Attachment

Submitted filename: response_Covid19_2021.02.01.pdf

Click here for additional data file.^{(99KB, pdf)}

Data Availability Statement

[pone.0247265.ref001] 1. Le Gall JR, Lemeshow S, Saulnier F. A New Simplified Acute Physiology Score (SAPS II) Based on a European/North American Multicenter Study. JAMA. 1993;270:2957–2963. [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref002] 2. Miranda DR, Nap R, de Rijk A, Schaufeli W, Iapichino G, members of the TISS Working Group. Nursing activities score. Crit Care Med. 2003;31:374–382. 10.1097/01.CCM.0000045567.78801.CC [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref003] 3. Verburg IWM, Atashi A, Eslami S, Holman R, Abu-Hanna A, de Jonge E, et al. Which models can I use to predict adult ICU length of stay? A systematic review. Crit Care Med. 2017;45:e222–e231. 10.1097/CCM.0000000000002054 [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref004] 4. Zimmerman JE, Kramer AA, McNair DS, Malila FM, Shaffer VL. Intensive care unit length of stay: Benchmarking based on Acute Physiology and Chronic Health Evaluation (APACHE) IV. Crit Care Med. 2006;34:2517–2529. 10.1097/01.CCM.0000240233.01711.D9 [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref005] 5. Vasilevskis EE, Kuzniewicz MW, Cason BA, Lane RK, Dean ML, Clay T, et al. Mortality probability model III and acute simplified physiology score II: Assessing their value in predicting length of stay and comparison to APACHE IV. Chest. 2009;136:89–101. 10.1378/chest.08-2591 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref006] 6. Henzi A, Kleger GR, Ziegel JF. Distributional (Single) Index Models. Preprint. 2020;arXiv:2006.09219. [Google Scholar]

[pone.0247265.ref007] 7. Wendel Garcia PD, Fumeaux T, Guerci P, Heuberger DM, Montomoli J, Roche-Campo F, et al. Prognostic factors associated with mortality risk and disease progression in 639 critically ill patients with COVID-19 in Europe: Initial report of the international RISC-19-ICU prospective observational cohort. EClinicalMedicine. 2020; p. 100449 10.1016/j.eclinm.2020.100449 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref008] 8. Hilty MP, Wendel Garcia PD. hobbes8080/risc-19-icu: registry data transformation v1.0. Zenodo Data Repository. 2020. 10.5281/zenodo.3757064 [DOI] [Google Scholar]

[pone.0247265.ref009] 9. Rothen HU, Stricker K, Einfalt J, Bauer P, Metnitz PG, Moreno RP, et al. Variability in outcome and resource use in intensive care units. Intensive Care Med. 2007;33(8):1329–1336. 10.1007/s00134-007-0690-3 [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref010] 10. Granholm A, Christiansen CF, Christensen S, Perner A, Mueller MH. Performance of SAPS II according to ICU length of stay: A Danish nationwide cohort study. Acta Anaesthesiol Scand. 2019;63(9):1200–1209. 10.1111/aas.13415 [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref011] 11.Kleger GR. Die Aufenthaltsdauer kritisch kranker Patienten auf einer Intensivstation: Probabilistische Prädiktionsmodelle. Master’s Thesis, University of Bern; 2018.

[pone.0247265.ref012] 12.Henzi A, Ziegel JF, Gneiting T. Isotonic distributional regression. Preprint. 2019;arXiv:1909.03725.

[pone.0247265.ref013] 13. Mösching A, Dümbgen L. Monotone least squares and isotonic quantiles. Electron J Stat. 2020;14:24–49. 10.1214/19-EJS1659 [DOI] [Google Scholar]

[pone.0247265.ref014] 14. Gneiting T, Balabdaoui F, Raftery AE. Probabilistic forecasts, calibration and sharpness. J R Stat Soc Series B Stat Methodol. 2007;69:243–268. 10.1111/j.1467-9868.2007.00587.x [DOI] [Google Scholar]

[pone.0247265.ref015] 15.R Core Team. R: A Language and Environment for Statistical Computing; 2020. Available from: https://www.R-project.org/.

[pone.0247265.ref016] 16. Wood SN. Generalized Additive Models: An Introduction with R. 2nd ed Chapman and Hall/CRC; 2017. [Google Scholar]

[pone.0247265.ref017] 17. Marini JJ, Gattinoni L. Management of COVID-19 Respiratory Distress. JAMA. 2020;323:2329–2330. 10.1001/jama.2020.6825 [DOI] [PubMed] [Google Scholar]

[pone.0247265.ref018] 18. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref019] 19. Yang X, Yu Y, Xu J, Shu H, Xia J, Liu H, et al. Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a single-centered, retrospective, observational study. Lancet Respir Med. 2020;8(5):475–481. 10.1016/S2213-2600(20)30079-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref020] 20. Grasselli G, Tonetti T, Filippini C, Slutsky AS, Pesenti A, Ranieri VM. Pathophysiology of COVID-19-associated acute respiratory distress syndrome—Authors’ reply. Lancet Respir Med. 2021;9(1):e5–e6. 10.1016/S2213-2600(20)30525-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref021] 21. Marini JJ, Gattinoni L. Time Course of Evolving Ventilator-Induced Lung Injury: The “Shrinking Baby Lung”. Crit Care Med. 2020;48(8):1203–1209. 10.1097/CCM.0000000000004416 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref022] 22. Chiumello D, Busana M, Coppola S, Romitti F, Formenti P, Bonifazi M, et al. Physiological and quantitative CT-scan characterization of COVID-19 and typical ARDS: a matched cohort study. Intensive Care Med. 2020;46:2187–2196. 10.1007/s00134-020-06281-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref023] 23. Trahtemberg U, Slutsky AS, Villar J. What have we learned ventilating COVID-19 patients? Intensive Care Med. 2020;46:2458–2460. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0247265.ref024] 24. Vincent JL, Creteur J. Ethical aspects of the COVID-19 crisis: How to deal with an overwhelming shortage of acute beds. Eur Heart J Acute Cardiovasc Care. 2020;9(3):248–252. 10.1177/2048872620922788 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Probabilistic analysis of COVID-19 patients’ individual length of stay in Swiss intensive care units

Alexander Henzi

Gian-Reto Kleger

Matthias P Hilty

Pedro D Wendel Garcia

Johanna F Ziegel

Roles

Abstract

Rationale

Objectives

Methods

Measurements

Main results

Conclusion

1 Introduction

Fig 1. Predictive CDFs for the LoS of some COVID-19 patients with corresponding realizations as a vertical line.

2 Patients and methods

2.1 RISC-19-ICU and MDSi

2.2 Statistical methods

3 Results

3.1 General

Table 1. Summary statistics of COVID-19 dataset and training data.

Fig 2.

3.2 Age differences

Fig 3.

3.3 Gender differences

Fig 4.

3.4 Regional differences

4 Discussion

5 Conclusion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Martina Crivellari

Roles

Author response to Decision Letter 0

Decision Letter 1

Martina Crivellari

Roles

Author response to Decision Letter 1

Decision Letter 2

Martina Crivellari

Roles

Acceptance letter

Martina Crivellari

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases