Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 6;16(10):e0245103. doi: 10.1371/journal.pone.0245103

Disease progression of 213 patients hospitalized with Covid-19 in the Czech Republic in March–October 2020: An exploratory analysis

Martin Modrák 1,*, Paul-Christian Bürkner 2, Tomáš Sieger 3, Tomáš Slisz 4,5, Martina Vašáková 4,5, Grigorij Mesežnikov 6, Luis Fernando Casas-Mendez 6, Jaromír Vajter 6, Jan Táborský 7, Viktor Kubricht 8, Daniel Suk 9, Jan Horejsek 9, Martin Jedlička 10, Adriana Mifková 10, Adam Jaroš 11, Miroslav Kubiska 12, Jana Váchalová 12, Robin Šín 12, Markéta Veverková 13, Zbyšek Pospíšil 14, Julie Vohryzková 15, Rebeka Pokrievková 16, Kristián Hrušák 15, Kristína Christozova 15, Vianey Leos-Barajas 17, Karel Fišer 18, Tomáš Hyánek 11
Editor: Aleksandar R Zivkovic19
PMCID: PMC8494367  PMID: 34613965

Abstract

We collected a multi-centric retrospective dataset of patients (N = 213) who were admitted to ten hospitals in Czech Republic and tested positive for SARS-CoV-2 during the early phases of the pandemic in March—October 2020. The dataset contains baseline patient characteristics, breathing support required, pharmacological treatment received and multiple markers on daily resolution. Patients in the dataset were treated with hydroxychloroquine (N = 108), azithromycin (N = 72), favipiravir (N = 9), convalescent plasma (N = 7), dexamethasone (N = 4) and remdesivir (N = 3), often in combination. To explore association between treatments and patient outcomes we performed multiverse analysis, observing how the conclusions change between defensible choices of statistical model, predictors included in the model and other analytical degrees of freedom. Weak evidence to constrain the potential efficacy of azithromycin and favipiravir can be extracted from the data. Additionally, we performed external validation of several proposed prognostic models for Covid-19 severity showing that they mostly perform unsatisfactorily on our dataset.

Introduction

The Covid-19 pandemic caused by severe acute respiratory syndrome coronavirus (SARS-CoV-2) has, as of June 2021, led to over 172 million cases and over 3.7 million deaths. The present study was designed and conducted during March—October 2020, when Czech Republic experienced a relatively mild first wave of the pandemic due to early and strict lockdowns. Low numbers of cases continued throughout the summer but during September and October, after most of the data collection for this study concluded, the number of cases was raising again. On October 1st 2020, Czech Republic had accumulated 74283 total confirmed Covid-19 cases and 704 confirmed Covid-19 related deaths [1].

At the time the study was conducted, the proposed treatments included antivirals approved for other indications (chloroquine, hydroxychloroquine, lopinavir/ritonavir, remdesivir, favipiravir, umifenovir), azithromycin, corticosteroids, immunoglobulins, tocilizumab and convalescent plasma [2, 3]. Notably, the anti-malarial and anti-rheumatic drug hydroxychloroquine and the macrolide antibiotic azithromycin showed promise in early data and were broadly available and thus were frequently used in the early stages of the pandemic. Remdesivir, previously designed and approved for Ebola, SARS and MERS, also reported good initial results. However, during spring and summer 2020 remdesivir was available in Czech Republic only in limited amounts via compassionate use programme. The RECOVERY trial reported positive results of coroticosteriod dexamethasone for severe cases in June 2020 [4], but at this point the number of Covid-19 patients hospitalized in Czech Republic was low and dexamethasone thus did not see wider use until later in the pandemic.

Our understanding of the efficacy of Covid-19 treatments has improved substantially since the present study was conducted. As of April 2021, the pharmacological treatments that were deemed to be beneficial for at least one outcome in a systematic review of randomized trials were the corticosteroid dexamethasone (mortality, mechanical ventilation), colchicine (mortality, length of hospital stay), the antiviral remdesivir (mechanical ventilation), Janus kinase inhibitors (mechanical ventilation, duration of ventilation), IL-6 inhibitors (mechanical ventilation, length of hospital stay), the antiviral favipiravir (length of hospital stay, resolution of symptoms) and the anti-androgen proxalutimide (admission to hospital). Hydroxychloroquine, interferon beta, lopinavir-ritonavir, azithromycin, vitamin C, vitamin D, anticoagulants and ACE inhibitors were considered to not be better than standard of care and lopinvair-ritonavir showed evidence of harm, although most of the conclusions were considered to be of low certainty [5]. Interestingly, in observational studies, hydroxychloroquine was often found to be associated with better outcomes [68]. No benefit was also observed in a meta-analysis of randomized trials of convalescent plasma treatment [9].

High IL-6, D-dimer values were observed to be associated with worse outcome and increased disease severity [10]. Large study of electronic health records [11] showed an increase in C-reactive protein in early disease and increase of D-dimer and white blood cell count in later stages of the disease.

An ongoing challenge in evaluating Covid-19 treatments is that the analysis and interpretation of the data is often inappropriate or misleading, most notably interpreting lack of evidence due to small sample size as evidence of no effect [12, 13].

Additionally, many methods for predicting disease severity of Covid-19 were published, but the methods are at high risk of bias and lack external validation [14].

The present study aims to describe the outcomes and disease course of hospitalized patients with mild to severe clinical presentation in a multicentric Czech cohort during the early stages of the pandemic, explore the association between the outcomes and pharmacological interventions and to provide external validation to previously published prognostic models for Covid-19 severity.

Methods

Patients and study design

A convenience sample of patients from 10 sites was collected. The study sites span the whole spectrum of sizes from large university hospitals in major cities with multiple dedicated Covid-19 wards (Thomayer University Hospital in Prague, Motol University Hospital in Prague, Kralovské Vinohrady University Hospital in Prague, General University Hospital in Prague, University Hospital in Pilsen) through major regional/specialized hospitals (Na Homolce Hospital in Prague, Military Hospital Olomouc) as well as smaller hospitals caring for just several Covid-19 patients at a time (AGEL Hospital Nový Jičín, Hořovice Hospital, Třebíč Hospital). The sites were chosen based on availability and willingness of the personnel to participate in data collection. None of the study sites was exclusively dedicated to treating Covid-19 patients. For each site, the dataset contains all patients hospitalized in the participating wards over the data collection period. The data collection started at the onset of the Covid-19 pandemic in March 2020 (except for one site where some older records were inaccessible), but the end date for collection differs between sites due to time constraints of the participating physicians. Three sites included total of 23 patients that could be considered part of “second wave” (admitted after September 1st). Last patient included in the dataset was admitted on October 12th. See S1 Fig for per-site data collection periods. Patients over the age of 18 were included if they had PCR-confirmed infection of SARS-CoV-2 and were not participating in a clinical trial of any Covid-19 pharmacotherapy.

Not all patients developed pneumonia or other symptoms of Covid-19. All patients received the standard of care which could include supplemental oxygen and ventilation and antibiotics for bacterial superinfections, as determined by the attending physician. Some patients were not indicated for all treatment modalities (especially mechanical ventilation) based on decision of the attending physician and underlying patient condition. We note that the participating sites were not homogeneous in either patient population or treatment protocols. The choice of pharmacological treatment was based on the decision of the attending clinician and its availability.

The study was approved by the Ethical committees of General University Hospital, Hospital Nový Jíčín, Motol University Hospital, Thomayer Hospital, University Hospital Vinohrady, Military Hospital Olomouc, Na Homolce Hospital, University Hospital in Pilsen, Hořovice Hospital, Jihlava Hospital, all data were collected in fully anonymized form. Data was collected between June and October 2020 for patients that were treated between March and October 2020.

Data collection

We collected data on comorbidities and information about disease progression on daily resolution including breathing support required, oxygen flow rate, experimental anti-Covid-19 and antimicrobial drugs taken and several laboratory markers (PCR positivity for SARS-CoV-2, C-reactive protein, D-dimer, Interleukin 6, Ferritin, lymphocyte count). Full protocol for data collection is attached in S1 File and the data collection table in S2 File. Due to very low number of patients using extra-corporeal membrane oxygenation (N = 1) or non-invasive positive pressure ventilation (N = 6) in our sample, we merged those categories with mechanical ventilation.

Statistical analysis

The character of the convenience sample does not allow for a proper assessment of the association between treatments and patient outcomes, because the treatments had not been assigned to patients at random but were only observed retrospectively. This can be partially remedied by adjusting for patient characteristics in the analysis, but such adjustments will always be imperfect and the analysis needs to be treated as exploratory and interpreted cautiously.

Since many details of analysis may influence the conclusions made, we performed multiverse analysis [15] and report results for all the hypothesis tested across multiple different models using both frequentist and Bayesian paradigms. For each model class we worked with several possible sets of adjustments. All analyses were performed in the R language [16], visualization and data cleaning was run with the tidyverse package [17].

First class of models are frequentist survival and multistate models under the proportional hazards assumption as implemented in the coxph function from the survival package [18]. We primarily use a model with competing risks for death and discharge from hospital (see Fig 1a).

Fig 1. States used in the competing risk model (a) and in the two hidden Markov model variants (b,c).

Fig 1

AA = Ambient air, Oxygen = Nasal oxygen, Ventilated = any form of ventilation (non-invasive positive-pressure ventilation, mechanical ventilation and extra-corporeal membrane oxygenation). In all models the ‘Death’ and ‘Discharged’ states are terminal. In the second hidden Markov model (c), the ‘Improving’ and ‘Worsening’ variants of each non-terminal state are not observable—only the breathing support is observed and improving/worsening is inferred from progression of the disease.

Second class of models are Bayesian hidden Markov models (HMM) of disease progression implemented via a custom extension to the brms package [19]. The parametrization of the HMM is inspired by Williams et al. [20]: the actual process of disease is assumed to be continuous and allow only for transitions between neighboring states (as shown in Fig 1b and 1c). The total probability of transition between any two states over the period of a day is then computed as the total probability of transition across all possible paths. This class of models does not satisfy the proportional hazards assumption, instead, it is assumed the process has the Markov property—i.e., that the (potentially unobserved) state and the covariates at a given day contain all the information to determine probabilities of the states on the next day. We use two versions of such models, one working solely with the observed breathing support and one assuming a hidden improving/worsening distinction. All of the hidden Markov models take into account whether best supportive care was initiated and a patient was thus not indicated to progress to more intensive treatment modalities.

Finally, we used a set of Bayesian regression models implemented with the brms package [19]. Those included overall survival, state at day 7 or 28 as either binary or categorical outcome and a Bayesian version of the Cox proportional-hazards model.

Except for age, sex and comorbidities, all covariates are treated as time-varying, e.g., the effect of taking a drug is only included for the days after the drug was taken. More details on the exact model formulations can be found in the supplementary statistical analysis in S3 File.

Evaluating prognostic models

We searched the living systematic review of Covid-19 prognostic models [14] for those that could be applied to our dataset (i.e., where we have gathered all the input features). We primarily focused on the Area Under Receiver Operating Characteristic Curve (AUC), and its bootstrapped 95% confidence intervals which we computed using the pROC package [21]. When there were multiple reasonable ways to evaluate the outcome or a predictor in our dataset, we computed and reported all of those options. We used two simple scores with age or the decade of age as the sole predictor to have a baseline to compare the scores against.

Complete code for all analyses is available at https://github.com/cas-bioinf/covid19retrospective/.

Results

Baseline characteristics

In total, we were able to gather data for 213 patients, see Table 1 for the overall characteristics of the patient sample and several subgroups we used in the analysis, including treatments taken. Counts of all treatment combinations are shown in S2 and S3 Figs shows outcomes by study site, demonstrating quite large hospital-specific differences. The dataset includes 19 patients already reported in a study of inflammatory signatures of Covid-19 [22].

Table 1. Patient characteristics for the overall sample and treatment subgroups.

Note that the favipiravir subgroup is not exclusive with either the HCQ or No HCQ group.

All HCQ No HCQ Favipiravir
N 213 108 105 9
Distinct sites 10 10 10 1
Male 105 (49%) 53 (49%) 52 (50%) 4 (44%)
Age (mean, IQR) 69 (58–80) 67 (56–80) 70 (64–82) 59 (51–68)
Admitted for Covid 172 (81%) 96 (89%) 76 (72%) 8 (89%)
Took hydroxychloroquine 108 (51%) 108 (100%) 0 (0%) 8 (89%)
Took azithromycin 72 (34%) 63 (58%) 9 (9%) 8 (89%)
Took dexamethasone 4 (2%) 0 (0%) 4 (4%) 0 (0%)
Took favipiravir 9 (4%) 8 (7%) 1 (1%) 9 (100%)
Took remdesivir 3 (1%) 0 (0%) 3 (3%) 0 (0%)
Convalescent plasma 7 (3%) 6 (6%) 1 (1%) 3 (33%)
Ischemic Heart Disease 43 (20%) 15 (14%) 28 (27%) 1 (11%)
Takes antihypertensives 141 (66%) 66 (61%) 75 (71%) 4 (44%)
Heart Failure 34 (16%) 15 (14%) 19 (18%) 0 (0%)
COPD 21 (10%) 9 (8%) 12 (11%) 1 (11%)
Asthma 18 (8%) 9 (8%) 9 (9%) 0 (0%)
Other lung disease 14 (7%) 7 (6%) 7 (7%) 0 (0%)
Diabetes 51 (24%) 18 (17%) 33 (31%) 3 (33%)
Renal Disease 43 (20%) 19 (18%) 24 (23%) 2 (22%)
Liver Disease 14 (7%) 8 (7%) 6 (6%) 1 (11%)
Smoking 28 (13%) 15 (14%) 13 (12%) 1 (11%)
BMI (mean, IQR) 28 (24–31) 28 (24–30) 28 (24–31) 32 (27–35)
Best supportive care 58 (27%) 22 (20%) 36 (34%) 0 (0%)
Death 42 (20%) 15 (14%) 27 (26%) 0 (0%)
Discharged 122 (57%) 76 (70%) 46 (44%) 8 (89%)

IQR = interquartile range, COPD = Chronic obstructive pulmonary disease, BMI = body-mass index, Best supportive care = patient was not indicated to undergo more intensive treatment modality.

Disease progression

In Fig 2 we show the overall disease progression for all patients and in Fig 3 we show the time-course of a subset of the markers we have measured. The data show some interesting patterns: patients with low Interleukin-6 or D-dimer values are overrepresented among patients with better outcomes, most patients had high CRP upon admission and for many the CRP levels stayed elevated over the whole hospitalization. However, the limited nature of the data does not allow for any statistically robust conclusions. We also see that the marker levels were not substantially stratified by study site. Those patterns should however be interpreted with care due to systematic missingness of the data—in particular, patients that fared worse were probably more likely to have the markers measured. However, we believe this kind of patient-level view is useful to appreciate the extent of both between-patient and within-patient variability.

Fig 2. Disease progression for all patients included in the study as determined by breathing support required.

Fig 2

Each vertical strip is a single patient, the ordering on the horizontal axis is by disease severity. Ventilated = any form of ventilation (non-invasive positive-pressure ventilation, mechanical ventilation and extra-corporeal membrane oxygenation).

Fig 3. Values of selected markers over the course of the disease.

Fig 3

Each line represents a patient, stratified by the worst breathing support required over the hospitalization. Color indicates study sites. The vertical scale is logarithmic. Ventilated = any form of ventilation (non-invasive positive-pressure ventilation, mechanical ventilation and extra-corporeal membrane oxygenation), CRP = C-reactive protein [mg/l], D-dimer [ng/ml DDU], Ly = lymphocyte count [109/l], IL-6 = Interleukin 6 [ng/l].

The between-patient variability is notable even across outcomes—when ordering the patients by the highest CRP levels experienced throughout the hospital stay, the top 20% of patients that breathed ambient air for the whole hospitalization experienced higher levels than the bottom 20% of patients that required ventilation or died. This overlap is even larger when comparing only against the patients that died and D-dimer, Interleukin-6 and lymphocyte count also show a notably larger overlap than CRP (S4 Fig).

Association between patients’ characteristics and treatments

As noted above, the nature of the convenience sample did not enforce random assignment of treatments to patients. In fact, patients with worse baseline characteristics, which lead to worse outcomes, were less likely to receive hydroxychloroquine (see Fig 4). This clearly creates a bias towards a positive effect of hydroxychloroquine on the outcome (and potentially for other treatments as well—most were used in combination with hydroxychloroquine), which, however, could be false.

Fig 4. The choice of treatment with hydroxychloroquine seemed to be associated with the status of patients upon hospitalization.

Fig 4

Comorbidities were associated with both worse outcome (black) and lower chance of treatment with hydroxychloroquine (red). Dots and lines represent the estimates and the 95% confidence intervals of the log odds ratio of the respective outcome. HCQ: hydroxychloroquine, IHD: ischemic heart disease, HD: hypertension drugs, HF: heart failure history, COPD: chronic obstructive pulmonary disease, LungD: other lung disease, Dia: diabetes, RenalD: renal disease, LiverD: liver disease, HighCr: creatinin above 115 for males or above 97 for females, HighInr: Prothrombin time (Quick test) as International Normalized Ratio above 1.2, LowAlb: albumin in serum/plasma below 36 g/l.

Taken quantitatively, the comorbidities known upon hospitalization were informative with respect to the future hydroxychloroquine treatment: the score representing the cumulative presence of ischemic heart disease, hypertension drugs, former heart failure, COPD, other lung diseases, renal disease, or high creatinine was associated with a lower chance of taking hydroxychloroquine over the course of the hospitalization (the chance was only 79.9%, 95% confidence interval (65.3, 97)%, Chi-square test in the logistic regression model, χ2 = 5.18, df = 1, P = 0.023).

Association between treatments and outcomes

Here, we focus on hydroxychloroquine and azithromycin as those are the only treatments with larger sample size. We also investigate favipiravir as it is less well reported in the literature. Hydroxychloroquine was dosed almost exclusively in a 5-day regime starting with a loading dose of 800mg on the first day and followed by 400mg. Majority of patients complemented hydroxychloroquine with azithromycin while azithromycin was rarely used alone (see Table 1). Azithromycin was most frequently dosed 250 or 500mg/day, but doses ranging from 100mg/day to 1500 mg/day were observed. Favipiravir was used only at one site with a loading dose of 3600mg on the first day, followed by at most 9 days with a 1600mg dose. All but one of the patients receiving favipiravir also received hydroxychloroquine. Treatment was initiated mostly within two days of admission (see S5 Fig).

The results of the multiverse analysis for association between hydroxychloroquine, azithromycin and favipiravir usage and death is shown in Fig 5—here, we only show models that were not found to have immediate problems representing the data well or computational issues (see S3 File for details). Results for all models we tested are reported in S6S8 Figs, with additional details in S3 File. The results do not change noticeably when only patients from the first wave are included (S6S8 Figs).

Fig 5. Estimates of model coefficients for association between treatments and main outcomes.

Fig 5

Each row represents a model—Categorical 7/28 = Bayesian categorical regression for state at day 7/28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma.

Most models report that using hydroxychloroquine is associated with lower risk of death. We must however bear in mind the potential bias noted in the previous section. Also, we see that for the HMM models, as we add adjustments the credible intervals do not widen but instead shift towards zero. This is a weak indication that further adjustments could drive the effect towards zero. We did not attempt to model additional adjustments as the models became computationally unstable. The case of hydroxychloroquine serves as a “control group” for our other results—since randomized trials give us high confidence that hydroxychloroquine does not substantially reduce mortality, we can be quite certain the associations we observe for hydroxychloroquine are just a measure of bias in the data. Additionally, our models either cannot determine the sign of association between azithromycin and risk of death or even show an increase in risk of death. This serves as a weak evidence that a substantial improvement in mortality from azithromycin is unlikely.

Most models exclude very strong association between increased risk of death and using favipiravir, but our data are necessarily quite limited, which is reflected in the very wide uncertainties around estimates. We also cannot put strict bounds on the association between favipiravir and length of hospitalization.

We also examined the association between treatments and length of hospital stay for all the patients that survived. Almost all models cannot discern the sign of the association for all treatments examined (S6S8 Figs). Similarly, we studied the association between D-dimer and Interleukin 6 and outcomes, with unconclusive results as well (S9 Fig).

Published prognostic models are not better than using age as the sole predictor of outcome

Following Wynants et al. [14] we found five prediction models we were able to recompute: Li et al. report the ACP index [23] combining CRP and age to form 3 grades, Chen & Liu [24] report a continuous score using age, CRP, D-dimer and lymphocyte count, Shi et al. [25] use age, sex and hypertension to form 4 grades, Caramelo et al. use age, sex, hypertension, diabetes, cardiac disease, chronic respiratory disease and cancer to form a continuous score [26], Bello-Chavolla et al. [27] use age, diabetes, obesity, pneumonia, chronic kidney disease, COPD and immunosuppression to build a score ranging from -6 to 22. For the latter two scores we had to impute some of the predictors as they had no immediate equivalent in our dataset. The outcomes present in the studies were: 12-day mortality, 30-day mortality and mortality without any further details, here we report results for both 12-day and 30-day mortality. Full details on the scores and how we used our dataset to compute them is given in the S3 File.

All prognostic models we tested performed similarly to or notably worse than using age as the only predictor and also worse than originally reported (Fig 6). Additionally, some publications did not provide enough detail to unambiguously reconstruct how the score and/or outcome was assessed. We thus concur with Wynants et al. [14] that reported prediction scores are at high risk of bias and need additional careful evaluation.

Fig 6. Performance of tested prediction scores as measured by AUC.

Fig 6

AUC = 1 means perfect prediction while AUC ≤ 0.5 means that the score is worse than random guess and a better prediction would be obtained by reversing the score (marked by thick blue line). The line ranges represent the bootstrapped 95% confidence intervals. Red dots show results computed in present study—model variants (horizontal axis) vary in the outcome measured (12-day or 30-day mortality, severe disease) and potentially on how ambiguities in score computation were resolved, although this rarely makes a big difference—see S3 File for details. Cyan triangles show AUC as reported by the original authors or recomputed based on their published data. When the confidence interval or the AUC of the original study is not shown, it means that the value was not reported by the authors and not enough information to recompute it was given.

Discussion

Our data show the extent of between-patient variability in progression of the disease in terms of both length of hospital stay, duration of various types of breathing support and basic markers. A direct comparison with other studies is hard to perform as almost always only summaries of measurements are reported.

For multiple candidate Covid-19 treatments, observational data repeatedly contradicted results of randomized controlled trials (contrast e.g. Catteau et al. [6] to the RECOVERY trial [28] for hydroxychloroquine and Liu et al. [29] to Agarwal et al. [30] for convalescent plasma). Our results for hydroxychloroquine also fit into this pattern. This should make us wary about over-interpreting the results of this study for azithromycin and favipiravir, although some higher-quality evidence that suggests clinical benefit of favipiravir has been reported [5].

The current (April 2021) Covid-19 treatment guidelines in Czech Republic recommend monoclonal antibodies and in some cases convalescent plasma or favipiravir as early treatment for high-risk patients with mild or no symptoms. For more severe cases dexamethason and anticoagulants are recommended while remdesivir is recommended only for patients that have severe disease but do not require mechanical ventilation [31]. This is similar to recommendations from the National Institute of Health in the USA who additionally recommend tocilizumab in some cases while not recommending convalescent plasma and favipiravir [32]. We do not believe our results should directly inform clinical decision, though we see some potential for inclusion of our results in future meta-analyses.

Regarding methodology, there are multiple approaches that are—at least in principle—capable of deriving causal conclusions from observational data, most notably the DAG framework [33, 34] and target trial framework [35, 36]. In all approaches—and also in some randomized designs—it is common that substantial uncertainty about the best statistical model for the task at hand remains and cannot be eliminated. Nevertheless, most published papers present results only from a single statistical model. We believe that this uncertainty about model choice should not be ignored, rather we should embrace the uncertainty and employ a multiverse analysis or other forms of robustness checks to explore how our conclusions would differ had different assumptions been adopted. In this work we tried to show how such an analysis could be performed and reported in practice. We note that modelling choices that are often made semi-arbitrarily, e.g., logistic regression for survival at 28 days vs. a Cox proportional hazards model for time to event, did in our case lead to substantially different results.

The hidden-Markov models (HMMs) used in this study are of some interest because they fitted the data well and allowed for inclusion of a larger number of predictors than the Cox proportional hazards model without making the posterior uncertainty unreasonably large. We believe this is because such models make better use of the available detailed data. Additionally, HMMs can be used even when the outcomes are observed only indirectly or noisily—as in the original study that motivated our models which concerns the progression of Alzheimer’s disease [20]. Noisily observed outcomes can pose problems for the proportional hazards model and require some special care [37, 38]. We should however note that the Markov property assumed in HMMs is likely to be a reasonable approximation in fewer settings than the proportional hazards assumption of the Cox model.

Common problems with prognostic models in medicine are small sample sizes used to develop the models, weak or problematic statistical methods and lack of external validation on independent datasets [39]. Those problems are prevalent also in prediction models for Covid-19 [14]. We used our dataset to validate several models and observed very poor performance for four out of the five models tested. In fact, the simple assumption that older patients tend to have worse outcomes provides better or similar results to all of the models we were able to implement. This is despite all of the scores including age as a predictor. There seem to be two causes—three of the models dichotomize the age into just two groups which is known to lose information [40, 41]. Of the other two models Chen & Liu [24] use stepwise variable selection which is known to be a problematic approch [42]. The resulting model puts largest relative weight on laboratory markers and deemphasizes age. Caramelo et al. [26] take the decade of age as a very strong predictor and perform the best on our data. Still their results are not distinguishable from just using age. Our findings are almost the same as in a similar but larger validation study using 22 models and 411 patients from the United Kingdom where no tested model provided better prediction for mortality than age alone [43].

Conclusions

We provide very weak observational evidence against a substantial beneficiary effect of using azithromycin (both with or without hydroxychloroquine) and against substantial negative effect of using favipiravir in hospitalized Covid-19 patients. We also observed better outcomes associated with taking hydroxychloroquine, which is likely linked to substantial confounding by indication. Where our results contradict randomized trials, the most likely explanation is systematic bias in our dataset.

A lesson from our analysis is that the assessment of treatment efficacy from observational data is sensitive to modelling assumptions while it is usually almost impossible to determine which of the models is more likely to reflect reality (if any). We believe that using multiverse analysis is an appropriate way to explore data in such contexts as it lets us be transparent about this sensitivity. We further believe that using hidden Markov models is a promising complement to the standard Cox proportional hazards analysis when detailed information on disease progression is available, particularly because it lets us impose additional structure on the model and thus make inferences with more disease states than would be possible to handle in the Cox framework, making better use of the available data.

Additionally, our experience indicates that a substantial fraction of published prognostic models will perform much worse on new patients than on the datasets they were built for and that external validation is crucial. We suggest that comparing the prognostic models against simple baselines (e.g., decade of age as the single predictor) should be a first step in validation. Furthermore, some of the published scores lack enough information to let others implement the score in the same way.

Supporting information

S1 Fig. Data collection timeline.

Data collection periods at individual sites, showing the range of admission dates of patients included in the study. Note that we cannot provide additional information to link the sites here with data shown elsewhere as that would increase the risk of deanonymization of the patients.

(TIFF)

S2 Fig. Treatment combinations.

Upset plot of treatment combinations—each vertical bar displays the number of patients that received the combination indicated by filled dots in the matrix. Horizontal bars show the total number of patients receiveing the given treatment.

(TIFF)

S3 Fig. Outcomes per site.

Number of patients and outcomes at the individual sites. The numbers above bars are the exact counts. Hospitalized = still hospitalized at the end of data collection at the site or transferred to other site and lost to followup. Sites are anonymized to preserve patient privacy.

(TIFF)

S4 Fig. Markers and outcomes.

Density plots of worst marker values per patient, stratified by worst condition experienced by the patient. For each patient that had a given marker measured, the worst value was taken. Additionally the patients are classified by the worst condition (regardless of the timing relative to the worst marker levels). For each set of patients and marker an empirical density plot of the worst marker values is shown.

(TIFF)

S5 Fig. Treatment onset.

Histogram of timing of first treatment relative to admission into one of the study sites. Two patients initiated treatment before admission, which is shown as the negative numbers.

(TIFF)

S6 Fig. Association of HCQ with outcomes.

Estimates of model coefficients for association between hydroxychloroquine and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

(TIFF)

S7 Fig. Association of azithromycin with outcomes.

Estimates of model coefficients for association between azithromycin and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

(TIFF)

S8 Fig. Association of favipiravir with outcomes.

Estimates of model coefficients for association between favipiravir and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

(TIFF)

S9 Fig. Association of markers with outcomes.

Estimates of model coefficients (log hazard ratios) for association between markers and death. The “Suspicious” section shows models that were found to not fit the data well or have computational issues, “Problematic” section shows models that were completely broken—see supplementary statistical analysis for details. Each row represents a model—Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-markov model as in Fig 1b with predictors for rate groups, JM = Bayesian joint longitudinal and time-to-event model. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. Additionally the factors the model adjusted for are listed—Site = the study site, Supportive = best supportive care initiated, HCQ = took Hydroxychloroquine. We show posterior mean and 95% credible interval.

(TIFF)

S1 File. Data collection protocol.

(PDF)

S2 File. MS Excel table used for data collection.

(XLSX)

S3 File. Supplementary statistical analysis.

Contains details on all statistical models and procedures used.

(PDF)

Data Availability

The data that support the findings of this study are available on request from the corresponding author or the secretariat of the Institute of Microbiology of the Czech Academy of Sciences (contact via mbu@biomed.cas.cz) for researchers who meet the criteria for access to confidential data. The data are not publicly available due to privacy restrictions imposed by the Ethical committee of General University Hospital in Prague and the GDPR regulation of the European Union. We will be happy to arrange to run any analytical code locally and share the results, provided the code and the results do not leak personal information.

Funding Statement

MM was supported by ELIXIR CZ research infrastructure project (Ministry of Youth, Education and Sports of the Czech Republic, Grant No: LM2018131, https://www.msmt.cz/) including access to computing and storage facilities. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.COVID-19 v ČR: Otevřené datové sady a sady ke stažení [Internet]. Ministry of Health of the Czech Republic; Available: https://onemocneni-aktualne.mzcr.cz
  • 2. Sanders JM, Monogue ML, Jodlowski TZ, Cutrell JB. Pharmacologic Treatments for Coronavirus Disease 2019 (COVID-19): A Review. JAMA. 2020;323: 1824–1836. doi: 10.1001/jama.2020.6019 [DOI] [PubMed] [Google Scholar]
  • 3. Tobaiqy M, Qashqary M, Al-Dahery S, Mujallad A, Hershan AA, Kamal MA, et al. Therapeutic management of patients with COVID-19: a systematic review. Infection Prevention in Practice. 2020;2: 100061. doi: 10.1016/j.infpip.2020.100061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Horby P, Lim WS, Emberson J, Mafham M, Bell J, Linsell L, et al. Effect of Dexamethasone in Hospitalized Patients with COVID-19—Preliminary Report [Internet]. Infectious Diseases (except HIV/AIDS); 2020. Jun. Available: http://medrxiv.org/lookup/doi/10.1101/2020.06.22.20137273 [Google Scholar]
  • 5. Siemieniuk RA, Bartoszko JJ, Ge L, Zeraatkar D, Izcovich A, Kum E, et al. Drug treatments for covid-19: living systematic review and network meta-analysis. BMJ. 2020;370. doi: 10.1136/bmj.m2980 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Catteau L, Dauby N, Montourcy M, Bottieau E, Hautekiet J, Goetghebeur E, et al. Low-dose hydroxychloroquine therapy and mortality in hospitalised patients with COVID-19: a nationwide observational study of 8075 participants. International Journal of Antimicrobial Agents. 2020; 106144. doi: 10.1016/j.ijantimicag.2020.106144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Del Valle DM, Kim-Schulze S, Huang H-H, Beckmann ND, Nirenberg S, Wang B, et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nature Medicine. 2020; 1–8. doi: 10.1038/s41591-020-1051-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Castelnuovo AD, Costanzo S, Antinori A, Berselli N, Blandi L, Bruno R, et al. Use of hydroxychloroquine in hospitalised COVID-19 patients is associated with reduced mortality: Findings from the observational multicentre Italian CORIST study. European Journal of Internal Medicine. 2020; doi: 10.1016/j.ejim.2020.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Janiaud P, Axfors C, Schmitt AM, Gloy V, Ebrahimi F, Hepprich M, et al. Association of Convalescent Plasma Treatment With Clinical Outcomes in Patients With COVID-19: A Systematic Review and Meta-analysis. JAMA. 2021;325: 1185. doi: 10.1001/jama.2021.2747 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Maeda T, Obata R, Rizk DO D, Kuno T. The association of interleukin-6 value, interleukin inhibitors, and outcomes of patients with COVID-19 in New York City. J Med Virol. 2020; jmv.26365. doi: 10/gg6m3s [DOI] [PubMed] [Google Scholar]
  • 11. Brat GA, Weber GM, Gehlenborg N, Avillach P, Palmer NP, Chiovato L, et al. International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium. npj Digital Medicine. Nature Publishing Group; 2020;3: 1–9. doi: 10.1038/s41746-020-00308-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Hood K, Dahly D, Wilkinson J. Statistical review of Efficacy and safety of lopinavir/ritonavir or arbidol in adult patients with mild/moderate COVID-19: an exploratory randomized controlled trial. 2020; doi: 10.5281/zenodo.3779933 [DOI] [Google Scholar]
  • 13. Hood K, Goulao B, Dahly D, Yap C. Statistical review of Remdesivir in adults with severe COVID-19: a randomised, double-blind, placebo-controlled, multicentre trial [Internet]. Zenodo; 2020. May. Available: https://zenodo.org/record/3819778#.X1yBlotS-Uk [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Wynants L, Calster BV, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ. 2020;369. doi: 10.1136/bmj.m1328 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Steegen S, Tuerlinckx F, Gelman A, Vanpaemel W. Increasing Transparency Through a Multiverse Analysis. Perspect Psychol Sci. 2016;11: 702–712. doi: 10.1177/1745691616658637 [DOI] [PubMed] [Google Scholar]
  • 16.R Core Team. R: A Language and Environment for Statistical Computing [Internet]. 2020. Available: https://www.R-project.org/
  • 17. Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, et al. Welcome to the Tidyverse. JOSS. 2019;4: 1686. doi: 10.21105/joss.01686 [DOI] [Google Scholar]
  • 18. Therneau TM, Grambsch PM. Modeling survival data: extending the Cox model. New York: Springer; 2000. [Google Scholar]
  • 19. Bürkner P-C. Advanced Bayesian Multilevel Modeling with the R Package brms. The R Journal. 2018;10: 395. doi: 10.32614/RJ-2018-017 [DOI] [Google Scholar]
  • 20. Williams JP, Storlie CB, Therneau TM, J CR Jr, Hannig J. A Bayesian Approach to Multistate Hidden Markov Models: Application to Dementia Progression. Journal of the American Statistical Association. 2020;115: 16–31. doi: 10.1080/01621459.2019.1594831 [DOI] [Google Scholar]
  • 21. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12: 77. doi: 10.1186/1471-2105-12-77 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Parackova Z, Zentsova I, Bloomfield M, Vrabcova P, Smetanova J, Klocperk A, et al. Disharmonic Inflammatory Signatures in COVID-19: Augmented Neutrophils’ but Impaired Monocytes’ and Dendritic Cells’ Responsiveness. Cells. Multidisciplinary Digital Publishing Institute; 2020;9: 2206. doi: 10.3390/cells9102206 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Lu J, Hu S, Fan R, Liu Z, Yin X, Wang Q, et al. ACP risk grade: a simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China. medRxiv. 2020; doi: 10.1101/2020.02.20.20025510 [DOI] [Google Scholar]
  • 24. Chen X, Liu Z. Early prediction of mortality risk among severe COVID-19 patients using machine learning. medRxiv. 2020; doi: 10.1101/2020.04.13.20064329 [DOI] [Google Scholar]
  • 25. Shi Y, Yu X, Zhao H, Wang H, Zhao R, Sheng J. Host susceptibility to severe COVID-19 and establishment of a host risk score: findings of 487 cases outside Wuhan. Critical Care. 2020;24: 108. doi: 10.1186/s13054-020-2833-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Caramelo F, Ferreira N, Oliveiros B. Estimation of risk factors for COVID-19 mortality—preliminary results. medRxiv. 2020; doi: 10.1101/2020.02.24.20027268 [DOI] [Google Scholar]
  • 27. Bello-Chavolla OY, Bahena-López JP, Antonio-Villa NE, Vargas-Vázquez A, González-Díaz A, Márquez-Salinas A, et al. Predicting Mortality Due to SARS-CoV-2: A Mechanistic Score Relating Obesity and Diabetes to COVID-19 Outcomes in Mexico. J Clin Endocrinol Metab. 2020;105: 2752–2761. doi: 10.1210/clinem/dgaa346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Horby P, Mafham M, Linsell L, Bell JL, Staplin N, Emberson JR, et al. Effect of Hydroxychloroquine in Hospitalized Patients with COVID-19: Preliminary results from a multi-centre, randomized, controlled trial. medRxiv. 2020; doi: 10.1101/2020.07.15.20151852 [DOI] [Google Scholar]
  • 29. Liu STH, Lin H-M, Baine I, Wajnberg A, Gumprecht JP, Rahman F, et al. Convalescent plasma treatment of severe COVID-19: a propensity score–matched control study. Nat Med. 2020;26: 1708–1713. doi: 10.1038/s41591-020-1088-9 [DOI] [PubMed] [Google Scholar]
  • 30. Agarwal A, Mukherjee A, Kumar G, Chatterjee P, Bhatnagar T, Malhotra P, et al. Convalescent plasma in the management of moderate COVID-19 in India: An open-label parallel-arm phase II multicentre randomized controlled trial (PLACID Trial). medRxiv. 2020; doi: 10.1101/2020.09.03.20187252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Štefan M, Chrdle A, Husa P, Beneš J, Dlouhý P. Covid-19: diagnostika a léčba [Internet]. Společnost infekčního lékařství ČLS JEP; 2021. Available: https://www.infekce.cz/Covid2019/DPcovid-19_SIL_0421.pdf
  • 32.COVID-19 Treatment Guidelines Panel. Coronavirus Disease 2019 (COVID-19)Treatment Guidelines [Internet]. National Institutes of Health; Available: https://www.covid19treatmentguidelines.nih.gov/ [PubMed]
  • 33. Greenland S, Pearl J, Robins JM. Causal Diagrams for Epidemiologic Research: Epidemiology. 1999;10: 37–48. doi: 10.1097/00001648-199901000-00008 [DOI] [PubMed] [Google Scholar]
  • 34. Arah OA. Analyzing Selection Bias for Credible Causal Inference: When in Doubt, DAG It Out. Epidemiology. 2019;30: 517–520. doi: 10.1097/EDE.0000000000001033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Lodi S, Phillips A, Lundgren J, Logan R, Sharma S, Cole SR, et al. Effect Estimates in Randomized Trials and Observational Studies: Comparing Apples With Apples. American Journal of Epidemiology. 2019;188: 1569–1577. doi: 10.1093/aje/kwz100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Hernán MA, Sauer BC, Hernández-Díaz S, Platt R, Shrier I. Specifying a target trial prevents immortal time bias and other self-inflicted injuries in observational analyses. Journal of Clinical Epidemiology. 2016;79: 70–75. doi: 10.1016/j.jclinepi.2016.04.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Meier AS, Richardson BA, Hughes JP. Discrete Proportional Hazards Models for Mismeasured Outcomes. Biometrics. 2003;59: 947–954. doi: 10.1111/j.0006-341X.2003.00109.x [DOI] [PubMed] [Google Scholar]
  • 38. Chen Y, Lawrence J, Hung HMJ, Stockbridge N. Methods for Employing Information About Uncertainty of Ascertainment of Events in Clinical Trials. Ther Innov Regul Sci. 2021;55: 197–211. doi: 10.1007/s43441-020-00206-3 [DOI] [PubMed] [Google Scholar]
  • 39. Steyerberg EW, Moons KGM, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis Research Strategy (PROGRESS) 3: Prognostic Model Research. PLoS Med. 2013;10: e1001381. doi: 10.1371/journal.pmed.1001381 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Altman DG, Royston P. The cost of dichotomising continuous variables. BMJ. 2006;332: 1080.1. doi: 10.1136/bmj.332.7549.1080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Thoresen M. Spurious interaction as a result of categorization. BMC Med Res Methodol. 2019;19: 28. doi: 10.1186/s12874-019-0667-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Smith G. Step away from stepwise. J Big Data. 2018;5: 32. doi: 10.1186/s40537-018-0143-6 [DOI] [Google Scholar]
  • 43. Gupta RK, Marks M, Samuels THA, Luintel A, Rampling T, Chowdhury H, et al. Systematic evaluation and external validation of 22 prognostic models among hospitalised adults with COVID-19: an observational cohort study. Eur Respir J. 2020;56: 2003498. doi: 10.1183/13993003.03498-2020 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Aleksandar R Zivkovic

3 Jun 2021

PONE-D-20-40463

Detailed disease progression of 213 patients hospitalized with Covid-19 in the Czech Republic: An exploratory analysis

PLOS ONE

Dear Dr. Modrak,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 16 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Aleksandar R. Zivkovic

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In the ethics statement in the manuscript and in the online submission form, please provide additional information about the patient records used in your retrospective study, including:

a) whether all data were fully anonymized before you accessed them;

b) the date range (month and year) during which patients' medical records were accessed;

c) the date range (month and year) during which patients whose medical records were selected for this study sought treatment.

If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. Please ensure you have thoroughly discussed any potential limitations of this study within the Discussion section, including the potential impact of confounding factors.

4. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer #1: The authors have provided an overview of COVID-19 hospitalized patients in the Czech Republic. The study is well-written in a proper English language and fashion, also, with quite dynamic scientific flow. The study cohort is rather small. However, there are few major concerns that should be clarified by the authors.

1. The "Introduction section" contains misleading information - Remdesivir is no longer available in only limited amounts of patients. Remdesivir is currently one of the front-line therapies.

2. The authors should explain why are the patients treated mostly with hydroxychloroquine despite only low clinical benefit (mentioned in the Introduction).

3. Table 1 should contain the total number of deaths. Also, the number of hydroxychloroquine treated patients is missing.

4. Given the presented data, it is clear that by the time of this study, the majority of patients was treated with either hydroxychloroquine or azithromycin. This has fundamentally changed during the past few months. The majority of patients is receiving dexamthasone and remdesivir, while in this study, the proportions are only 2%, and 1%, respectively.

This might imply two things:

either the included institutions have no access to first-line therapies or the included institutions are not following the treatment guidelines. Does this really reflect the treatment of COVID-19 patients in the Czech Republic in 2021? If so,

that is very disturbing.

At this point I believe that the authors have several options. They might collect a new up-to-date cohort and correlate the treatment strategies and outcomes of now and then. That would significantly improve the quality of the manuscript. In that case the manuscript title could reffer to "disease progression of patients hospitalized in the Czech R."

Other option is to highlight that these data are on the efficacy of hydroxychloroquine in the treatment of COVID-19 which was applied at the very beginning of the COVID pandemic. In such case, the manuscript title must change. Also the Introduction section should provide more data on hydroxychloroquine and the reasons for its application and also the study cohort should be presented as "remdesivir-naive" or dexamethasone-naive" which could be actually beneficial for the observation of hydroxychloroquine treatment of patients. Since Hydroxychloroquine is being used in a wide range of autoimmune diseases, the immune background of COVID-19 should be included. This study, however, cannot be titled as it currently stands. The title and the abstract must change.

5. The authors should at least discuss the current first-line treatments of COVID-19 in the Czech Republic and reffer to pertinent guidelines.

Reviewer #2: This study aimed at significant COVID-related health problems.

However, this study requires extensive revisions.

1. The organization of the text is complicated. The structure of this paper is far from a standard scientific article.

2. The sampling methods are unclear. The authors should clearly define sampling methods, research centers included in this study, characteristics of hospitals - single wards? or all hospitals were dedicated to COVID-19? Moreover, the timeline of the patients' enrolment of participants should be clearly defined.

3. More information about the COVID-19 pandemic in the Czech Republic is needed. Moreover, the Authors should present data on the organization of COVID-19 dedicated treatment.

4. Conclusions seem to be not supported by the results.

Reviewer #3: I like the general idea of this manuscript and find it interesting to the readers of Plos One. However, the conclusion section is very short and does not contain any references. I suggest rewriting the conclusion section and discussing your results in detail with reference to similar studies

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 6;16(10):e0245103. doi: 10.1371/journal.pone.0245103.r002

Author response to Decision Letter 0


2 Jul 2021

The response to reviewers was uploaded as a separate file, but since the system requires us to fill this free text, we also put a copy here.

====== Reviewer #1 ======

Comment: The authors have provided an overview of COVID-19 hospitalized patients in the Czech Republic. The study is well-written in a proper English language and fashion, also, with quite dynamic scientific flow. The study cohort is rather small. However, there are few major concerns that should be clarified by the authors.

Response: Thanks for the kind words. We believe a potential source of confusion is that the paper has unfortunately spent substantial time between submission to PLoS One and being sent out for review. The data were collected in the early phases of the pandemic (March - October 2020) and we submitted the manuscript in December 2020. Due to this delay (which is not the reviewers fault) the information has necessarily become outdated. We rewrote the Introduction section to clarify the timeline and make a better distinction between the state of knowledge at the time the patients were hospitalized and current state of knowledge.

Comment: 1. The "Introduction section" contains misleading information - Remdesivir is no longer available in only limited amounts of patients. Remdesivir is currently one of the front-line therapies.

Response: We updated the introduction with up-to-date information on Covid treatments and contrasted them with what was available upon data collection.

Comment: 2. The authors should explain why are the patients treated mostly with hydroxychloroquine despite only low clinical benefit (mentioned in the Introduction).

Response: We clarified the timeline and discuss what information was available at the time for which we collected data.

Comment: 3. Table 1 should contain the total number of deaths. Also, the number of hydroxychloroquine treated patients is missing.

Response: We changed “Deceased” to “Death” in Table 1 to avoid any confusion. We also duplicated the information on hydroxychloroquine treatment to be shown in both the columns of Table 1 as well as in rows to keep all of the drug information in a consistent format.

Comment: 4. Given the presented data, it is clear that by the time of this study, the majority of patients was treated with either hydroxychloroquine or azithromycin. This has fundamentally changed during the past few months. The majority of patients is receiving dexamthasone and remdesivir, while in this study, the proportions are only 2%, and 1%, respectively.

This might imply two things:

either the included institutions have no access to first-line therapies or the included institutions are not following the treatment guidelines. Does this really reflect the treatment of COVID-19 patients in the Czech Republic in 2021? If so,

that is very disturbing.

Response: The data reflect the state in Czech Republic at the early stages of the pandemic when the choice of treatments was different. We clarified this in the Introduction section.

Comment: At this point I believe that the authors have several options. They might collect a new up-to-date cohort and correlate the treatment strategies and outcomes of now and then. That would significantly improve the quality of the manuscript. In that case the manuscript title could reffer to "disease progression of patients hospitalized in the Czech R."

Other option is to highlight that these data are on the efficacy of hydroxychloroquine in the treatment of COVID-19 which was applied at the very beginning of the COVID pandemic. In such case, the manuscript title must change.

Response: Unfortunately collecting new data is not feasible. We changed the title to reflect that the data correspond to early phases of the pandemic. As we indicated in the paper, we believe that given the progress made in understanding the efficacy of hydroxychloroquine in the past year, our results on hydroxychloroquine are not of primary interest (although we report them as this was the initial primary aim of the study). We believe the currently more relevant results are our evaluation of the published prognostic models and our data on favipiravir, which despite the low number of patients are of some interest as favipiravir is comparatively underreported in the literature (we were already contacted for inclusion of the data in meta analyses).

Comment: Also the Introduction section should provide more data on hydroxychloroquine and the reasons for its application and also the study cohort should be presented as "remdesivir-naive" or dexamethasone-naive" which could be actually beneficial for the observation of hydroxychloroquine treatment of patients. Since Hydroxychloroquine is being used in a wide range of autoimmune diseases, the immune background of COVID-19 should be included. This study, however, cannot be titled as it currently stands. The title and the abstract must change.

Response: We clarified the treatment choices and timeline and made it clear in title and abstract that the data are from early phases of the pandemic when remdesivir and dexamethasone were not routinely used.

Comment: 5. The authors should at least discuss the current first-line treatments of COVID-19 in the Czech Republic and reffer to pertinent guidelines.

Response: References to the current guidelines were added into the Discussion section.

====== Reviewer #2 ======

Comment: 1. The organization of the text is complicated. The structure of this paper is far from a standard scientific article.

Response: We agree that since we discuss multiple results derived from the same dataset, the structure of the paper is more complex as each of the results needs to be discussed in methods, results and conclusions. We are however unsure how this could be improved without removing content from the paper. We made minor changes to headings to more closely match similar works already published in PLoS One (e.g. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0249346) and added a Discussion section.

Comment: 2. The sampling methods are unclear. The authors should clearly define sampling methods, research centers included in this study, characteristics of hospitals - single wards? or all hospitals were dedicated to COVID-19? Moreover, the timeline of the patients' enrolment of participants should be clearly defined.

Response: We clarified the sampling methods and timeline in the “Patients and study design” subsection. We note that the full timeline for each site was given in Supplementary Figure 1.

Comment: 3. More information about the COVID-19 pandemic in the Czech Republic is needed. Moreover, the Authors should present data on the organization of COVID-19 dedicated treatment.

Response: Information on the pandemic in Czech Republic was added in the Introduction section.

Comment: 4. Conclusions seem to be not supported by the results.

Response: We are not sure which conclusions the reviewer assumes to be unsupported. We added a Discussion section to make the connection between Results and Conclusion sections more explicit.

====== Reviewer #3 ======

Comment:I like the general idea of this manuscript and find it interesting to the readers of Plos One. However, the conclusion section is very short and does not contain any references. I suggest rewriting the conclusion section and discussing your results in detail with reference to similar studies

Response: Thanks for the kind words. We added a Discussion section that discusses the results in more details and adds references.

Attachment

Submitted filename: Response to reviewers - PLoS One.pdf

Decision Letter 1

Aleksandar R Zivkovic

16 Jul 2021

Disease progression of 213 patients hospitalized with Covid-19 in the Czech Republic in March - October 2020: An exploratory analysis

PONE-D-20-40463R1

Dear Dr. Modrak,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Aleksandar R. Zivkovic

Academic Editor

PLOS ONE

Acceptance letter

Aleksandar R Zivkovic

7 Sep 2021

PONE-D-20-40463R1

Disease progression of 213 patients hospitalized with Covid-19 in the Czech Republic in March - October 2020: An exploratory analysis

Dear Dr. Modrák:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Aleksandar R. Zivkovic

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Data collection timeline.

    Data collection periods at individual sites, showing the range of admission dates of patients included in the study. Note that we cannot provide additional information to link the sites here with data shown elsewhere as that would increase the risk of deanonymization of the patients.

    (TIFF)

    S2 Fig. Treatment combinations.

    Upset plot of treatment combinations—each vertical bar displays the number of patients that received the combination indicated by filled dots in the matrix. Horizontal bars show the total number of patients receiveing the given treatment.

    (TIFF)

    S3 Fig. Outcomes per site.

    Number of patients and outcomes at the individual sites. The numbers above bars are the exact counts. Hospitalized = still hospitalized at the end of data collection at the site or transferred to other site and lost to followup. Sites are anonymized to preserve patient privacy.

    (TIFF)

    S4 Fig. Markers and outcomes.

    Density plots of worst marker values per patient, stratified by worst condition experienced by the patient. For each patient that had a given marker measured, the worst value was taken. Additionally the patients are classified by the worst condition (regardless of the timing relative to the worst marker levels). For each set of patients and marker an empirical density plot of the worst marker values is shown.

    (TIFF)

    S5 Fig. Treatment onset.

    Histogram of timing of first treatment relative to admission into one of the study sites. Two patients initiated treatment before admission, which is shown as the negative numbers.

    (TIFF)

    S6 Fig. Association of HCQ with outcomes.

    Estimates of model coefficients for association between hydroxychloroquine and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

    (TIFF)

    S7 Fig. Association of azithromycin with outcomes.

    Estimates of model coefficients for association between azithromycin and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

    (TIFF)

    S8 Fig. Association of favipiravir with outcomes.

    Estimates of model coefficients for association between favipiravir and main outcomes. The “Suspicious” section shows models that were found to not fit the data well or have computational issues—see supplementary statistical analysis for details. Each row represents a model—Categorical All/7/28 = Bayesian categorical regression for state at last observed day/day 7/day 28, Binary All/7/28 = Bayesian logistic regression for state at last observed day/day 7/day 28, Bayes Cox = Bayesian version of the Cox proportional hazards model with a binary outcome, Cox (single) = frequentist Cox model with a binary outcome, Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-Markov model as in Fig 1b with predictors for rate groups, HMM B = Bayesian hidden-markov model as in Fig 1b with predictors for individual rates, HMM C = Bayesian hidden-Markov model as in Fig 1c. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. The estimands are either log odds-ratio (Categorical, HMM) or log hazard ratio (Cox variants) or log ratio of mean duration of hospitalization (HMM duration). In all cases coefficient <0 means better patient outcome in the treatment group. Vertical lines indicate zero (blue) and substantial increase or decrease with odds or hazard ratio of 3:2 or 2:3 (green). Additionally the factors the model adjusted for are listed—Site = the study site, admitted = Admitted for Covid-19, Supportive = best supportive care initiated, Comorb. = total number of comorbidities, AZ = took azithromycin, HCQ = took hydroxychloroquine, FPV = took favipiravir, C. plasma = received convalescent plasma, first wave = only patients admitted before September 1st were included.

    (TIFF)

    S9 Fig. Association of markers with outcomes.

    Estimates of model coefficients (log hazard ratios) for association between markers and death. The “Suspicious” section shows models that were found to not fit the data well or have computational issues, “Problematic” section shows models that were completely broken—see supplementary statistical analysis for details. Each row represents a model—Cox (competing) = frequentist Cox model using competing risks (as in Fig 1a), HMM A = Bayesian hidden-markov model as in Fig 1b with predictors for rate groups, JM = Bayesian joint longitudinal and time-to-event model. For frequentist models, we show maximum likelihood estimate and 95% confidence interval, for Bayesian models we show posterior mean and 95% credible interval. Additionally the factors the model adjusted for are listed—Site = the study site, Supportive = best supportive care initiated, HCQ = took Hydroxychloroquine. We show posterior mean and 95% credible interval.

    (TIFF)

    S1 File. Data collection protocol.

    (PDF)

    S2 File. MS Excel table used for data collection.

    (XLSX)

    S3 File. Supplementary statistical analysis.

    Contains details on all statistical models and procedures used.

    (PDF)

    Attachment

    Submitted filename: Response to reviewers - PLoS One.pdf

    Data Availability Statement

    The data that support the findings of this study are available on request from the corresponding author or the secretariat of the Institute of Microbiology of the Czech Academy of Sciences (contact via mbu@biomed.cas.cz) for researchers who meet the criteria for access to confidential data. The data are not publicly available due to privacy restrictions imposed by the Ethical committee of General University Hospital in Prague and the GDPR regulation of the European Union. We will be happy to arrange to run any analytical code locally and share the results, provided the code and the results do not leak personal information.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES