Abstract
Background
We have an incomplete understanding of COVID-19 characteristics at hospital presentation and whether underlying subphenotypes are associated with clinical outcomes and therapeutic responses.
Methods
For this cross-sectional study, we extracted electronic health data from adults hospitalized between 1 March and 30 August 2020 with a PCR-confirmed diagnosis of COVID-19 at five New York City Hospitals. We obtained clinical and laboratory data from the first 24 h of the patient’s hospitalization. Treatment with tocilizumab and convalescent plasma was assessed over hospitalization. The primary outcome was mortality; secondary outcomes included intubation, intensive care unit (ICU) admission and length of stay (LOS). First, we employed latent class analysis (LCA) to identify COVID-19 subphenotypes on admission without consideration of outcomes and assigned each patient to a subphenotype. We then performed robust Poisson regression to examine associations between COVID-19 subphenotype assignment and outcome. We explored whether the COVID-19 subphenotypes had a differential response to tocilizumab and convalescent plasma therapies.
Results
A total of 4620 patients were included. LCA identified six subphenotypes, which were distinct by level of inflammation, clinical and laboratory derangements and ranged from a hypoinflammatory subphenotype with the fewest derangements to a hyperinflammatory with multiorgan dysfunction subphenotypes. Multivariable regression analyses found differences in risk for mortality, intubation, ICU admission and LOS, as compared to the hypoinflammatory subphenotype. For example, in multivariable analyses the moderate inflammation with fever subphenotype had 3.29 times the risk of mortality (95% CI 2.05, 5.28), while the hyperinflammatory with multiorgan failure subphenotype had 17.87 times the risk of mortality (95% CI 11.56, 27.63), as compared to the hypoinflammatory subphenotype. Exploratory analyses suggested that subphenotypes may differential respond to convalescent plasma or tocilizumab therapy.
Conclusion
COVID-19 subphenotype at hospital admission may predict risk for mortality, ICU admission and intubation and differential response to treatment.
KEY MESSAGE
This cross-sectional study of COVID patients admitted to the Mount Sinai Health System, identified six distinct COVID subphenotypes on admission. Subphenotypes correlated with ICU admission, intubation, mortality and differential response to treatment.
Keywords: SARS CoV-2 infection, COVID-19, subphenotypes, mortality
Introduction
The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS CoV2) pandemic has resulted in significant morbidity and mortality [1,2]. In the United States, there were an estimated 83.1 million total infections from March through December 2020 [3]. Of those infected, the disease burden ranged from asymptomatic to critical illness, and the mortality among hospitalized patients in the US was estimated to be 21% [4]. A number of studies have identified early clinical markers of severe coronavirus disease 2019 (COVID-19), such as fever, hyperglycemia, and elevated inflammatory markers that may be associated with worse outcomes [5–9]. Recent studies have shifted focus to COVID-19 heterogeneity [10,11]. We posit that COVID-19 heterogeneity at the time of hospital admission represents underlying subphenotypes with different natural histories, clinical and biological characteristics, outcomes and, possibly, responses to treatment [12–14]. Better characterizations of COVID-19 subphenotypes and their associations with outcomes could inform treatment options. Furthermore, secondary analyses of completed trials using COVID-19 subphenotypes may explain variable responses to therapeutics and improve targeted therapy.
It is now understood that syndromes of critical illness, such as acute respiratory disease syndrome (ARDS) and sepsis, both seen in severe COVID-19, are not singular in presentation but rather are composed of multiple underlying subphenotypes with differing associated morbidity and mortality risk. Secondary analyses of randomized controlled trials of ARDS employed latent class analysis and consistently identified hyperinflammatory and hypoinflammatory subphenotypes where the hyperinflammatory subphenotype was associated with higher risk of mortality [13,15–17]. These subphenotypes may differentially respond to therapy, although evidence is mixed [13,15,16]. Within the COVID-19 framework, published studies have evaluated whether individual measures such as oxygen saturation, creatinine, D-dimer or C-reactive protein (CRP) predict disease severity. Machine learning approaches have been applied to understand risk for COVID-19 mortality and critical illness however these approaches are limited by data missingness and sample size requirements [18–24]. More recent studies have examined COVID-19 subphenotypes at the time of ICU admission when the disease is advanced and successful interventions may be limited [25,26]. Moreover, randomized controlled trials of therapeutics recruit COVID-19 patients broadly and do not enrich for subphenotypes that may be more likely to respond to that therapeutic [27–29]. More research is needed to identify subphenotypes of disease severity on hospital presentation and assist with clinical risk stratification and treatment algorithms [30,31].
To address this knowledge gap, we conducted a retrospective analysis to identify COVID-19 subphenotypes on admission, examine whether identified subphenotypes were associated with COVID-19 outcomes, and explore differential response to therapeutics. Specifically, we leveraged electronic health record data from inpatient COVID-19 encounters within five New York City (NYC) hospitals during the Spring through Summer 2020 surge. Our primary COVID-19 clinical outcome was mortality; secondary outcomes included intensive care unit (ICU) admission, intubation and length of stay (LOS).
Methods
Study participants
We extracted electronic health data from all persons hospitalized with a PCR-confirmed diagnosis of COVID-19 at five Mount Sinai Hospital System Hospitals including Mount Sinai Brooklyn, Mount Sinai Queens, the Mount Sinai Hospital, Mount Sinai Morningside and Mount Sinai West. Specifically, we included patients who were aged 18 years or older, admitted between 1 March 2020 and 30 August 2020, and had a positive SARS Cov2 PCR nasal swab within 7 days of admission.
Ethics
The study was approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai (20-00547).
Electronic health record data
Electronic health record data were collected from the first COVID-19 encounter through 30 August 2020 thereby capturing complete hospital admissions and outcomes over the NYC Spring 2020 COVID-19 surge. Data were obtained from the Mount Sinai Data Warehouse COVID-19 Data Sets, which obtained data from Mount Sinai’s Caboodle, Clarity and Enterprise Data Warehouse databases. The Mount Sinai Hospitals are located in the NYC boroughs of Manhattan [Mount Sinai Hospital (1141-bed), Mount Sinai Morningside (495-bed), and Mount Sinai West (514-bed)], Brooklyn [Mount Sinai Brooklyn (212-bed)] and Queens [Mount Sinai Queens (235-bed)].
For each patient encounter, we extracted date of admission, hospital, sex, age, self-reported race/ethnicity, insurance provider and date of SARS-CoV-2 PCR test. For patients with multiple encounters, data collected at the first encounter that met inclusion criteria were used in analyses. Information on medical comorbidities were extracted from the electronic health record using international classification of disease 10 (ICD 10) codes. Comorbidities were grouped into organ-specific categories. Persons with history of asthma, chronic obstructive pulmonary disease (COPD) and/or obstructive sleep apnea (OSA) were categorized as having pulmonary disease. Persons with history of hypertension (HTN), coronary artery disease (CAD), congestive heart failure (CHF) and/or myocardial infarction (MI) were categorized as having cardiovascular diseases. Persons were also categorized as having a history of cancer or obesity (as measured by ICD 10 code, or a calculated BMI >30). The number of organ-specific comorbidities were then summed.
We obtained clinical and laboratory data from the first 24 h of the patient’s first hospitalization that met inclusion criteria. Only variables with data available from at least 60% of participants or variables with a strong biological basis based on prior studies [erythrocyte sedimentation rate (ESR), interleukin 6 (IL-6), interleukin 1 beta (IL-1B)] were included. For variables with repeated observations, we identified the worst value recorded within 24 h of admission. Clinical variables included lowest oxygen saturation, lowest systolic and diastolic blood pressure, highest heart rate, and highest temperature within the first 24 h. Laboratory variables examined included inflammatory markers [C-reactive protein (CRP), ESR, IL-6, IL-1B, lactate dehydrogenase (LDH), procalcitonin, ferritin]; hematologic markers [white blood cell (WBC), hemoglobin, platelets, d-dimer, fibrinogen, prothrombin time (PT), partial thromboplastin time (PTT)); cardiac markers (troponin, brain natriuretic peptide (BNP)]; and renal and hepatic markers (alanine transaminase (ALT), aspartate aminotransferase (AST), albumin, total bilirubin, sodium, potassium, calcium, bicarbonate, blood urea nitrogen (BUN), creatinine, anion gap, glucose. Laboratory values above the laboratory-defined limit of detection were assigned the value at the limit of detection.
Patient outcomes were assessed across all hospital encounters. The primary outcome was mortality; secondary outcomes included intubation and admission to the intensive care unit (ICU). Specifically, patients listed as ‘expired’ or ‘deceased’ as per Epic discharge disposition were classified as deceased. Electronic health record mortality data also included health-record linked post-discharge deaths. Patients assigned to an ICU bed at any point of any hospitalization were classified as having an ICU admission. Patients recorded as having a surgical intubation, non-surgical airway intubation or endotracheal intubation during any encounter were classified as being intubated. Amongst survivors, we also determined length of stay (LOS) of the initial hospitalization. Data regarding COVID-19 therapeutics, including tocilizumab and convalescent plasma, administered while hospitalized were also collected across all hospital encounters.
Covariates
We extracted the hospital in which patients were hospitalized (categorical variable), self-identified race/ethnicity (categorical variable: White, Black, Hispanic, Asian, Other), insurance provider (categorical variable: Private/Medicare, Medicaid/Emergency Medicaid, Other) and onset time, defined as the time in days from first SARS-Cov-2 PCR positive admission in the Mount Sinai Health System to the individual patient’s admission.
Statistical analysis
A two-step approach was undertaken in this analysis. First, we employed latent class analysis (LCA) to identify COVID-19 subphenotypes on admission without consideration of outcomes. The distribution and completeness of clinical and laboratory data was examined. As LCA allows for missingness, no data imputations were performed. Clinical and laboratory variables were placed into quintiles for LCA. We fit LCA models ranging from 2 to 10 subphenotype classes and then determined the best fitting model (i.e. the number of subphenotypes). Criteria for number of subphenotypes included: (1) consistent Akakie information criteria (cAIC) and adjusted Bayesian information criteria (aBIC), where lower values suggest better fit; (2) entropy, where higher values suggest better class separation; (3) likelihood ratio; and (4) number of participants per subphenotype, where models with adequate sample size in each class are optimal. Once the best fitting model and number of subphenotypes was identified, participants were assigned to the subphenotype for which they had the highest probability of correct assignment. These subphenotype assignments were then used as the independent variable for subsequent regression analyses.
Given that our COVID-19 outcomes of interest were common (occurring in more than 10% of the cohort), we employed bivariate and multivariable Poisson regression models with robust error variance to examine associations between COVID-19 subphenotype assignment and risk of mortality, ICU admission, and intubation, considered separately [32,33] using the R package sandwich [34,35]. Amongst survivors, we employed bivariate and multivariable generalized linear regression to examine associations between COVID-19 subphenotype and length of stay. Multivariable models adjusted for onset time, hospital, self-identified race/ethnicity and insurance provider. Finally, we explored whether the COVID-19 subphenotypes had a differential response to tocilizumaband convalescent plasma therapies through introduction of an interaction term and in treatment-stratified models.
Analyses were conducted in R software version 4.1.2. Latent class analyses were performed using the poLCA package [36,37].
Results
A total of 4620 patients were admitted to the Mount Sinai Health System during the study period and met inclusion criteria (patient characteristics, Table 1). The largest percentage of patients in the cohort were admitted to Mount Sinai Hospital (34%, n = 1550), followed by Mount Sinai Queens (19%, n = 883), Mount Sinai Brooklyn (18%, n = 850) and Mount Sinai Morningside (18%, n = 838). Patients median age was 67 (IQR 55–78) years and 57% (n = 2629) were male. Twenty-eight percent (n = 1272), 27% (n = 1258), and 5% (n = 223) of patients self-identified as Hispanic, Black or Asian, respectively. Categories of co-morbidities were summed as described above, and the majority of the patients had zero (37%, n = 1715) or one (40%, n = 1,831) organ-specific comorbidity. Overall 29% (n = 1342) of patients died, 16% (n = 714) were intubated, and 21% (n = 972) were admitted to the ICU.
Table 1.
Number | Per cent | |
---|---|---|
Age, years (median (range), IQR) | 67 (18–110) | 55, 78 |
Male | 2629 | 56.9 |
Race/ethnicity (self-reported) | ||
Hispanic | 1272 | 27.5 |
Black | 1258 | 27.2 |
White | 1094 | 23.7 |
Asian | 223 | 4.8 |
Other | 635 | 13.7 |
Missing | 138 | 3.0 |
Insurance | ||
Medicaid | 1076 | 23.2 |
Medicare/Private Insurance | 3415 | 73.9 |
Other | 97 | 2.1 |
Missing | 32 | 0.7 |
Hospital | ||
MS Hospital | 1550 | 33.5 |
MS Queens | 883 | 19.1 |
MS Brooklyn | 850 | 18.4 |
MS Morningside | 838 | 18.1 |
MS West/Not specified | 495 | 10.8 |
Missing | 4 | |
Number of comorbiditiesb | ||
0 | 1715 | 37.1 |
1 | 1831 | 39.6 |
2 | 814 | 17.6 |
3 | 238 | 5.2 |
4 | 22 | 0.5 |
Time from onset of pandemica, days (median (range), IQR) | 28 (0–173) | 20, 38 |
Outcomes | ||
Mortality | 1342 | 29.0 |
Intubation | 714 | 15.5 |
ICU admission | 972 | 21.0 |
Length of stay (N = 3278 survivors), days (median (range), IQR) | 6.2 (0.01–103) | 3.1, 10.9 |
aTime in days from first SARS-Cov-2 PCR positive admission in the Mount Sinai Health System to the individual patient’s admission.
bComorbidities included history of cancer, obesity, pulmonary disease (asthma, chronic obstructive pulmonary disease (COPD) and/or obstructive sleep apnea (OSA)), and cardiovascular disease (hypertension (HTN), coronary artery disease (CAD), congestive heart failure (CHF) and/or myocardial infarction (MI)).
Latent class analysis
Overall, clinical and laboratory data completeness was high (Supplemental Table S1). Supplemental Figure S1 displays the LCA model fits, specifically cAIC, aBIC, entropy and likelihood ratio for models with 2–10 subphenotypes. Using these variables, we determined that the optimal fit was six subphenotypes.
The six subphenotypes appeared clinically distinct (Figure 1, Supplemental Table S1). We summarize these subphenotypes as: an elderly (median age 75.5 years, IQR 65, 85), hyperinflammatory with multiorgan dysfunction subphenotype (n = 821); a younger (median age 59 years, IQR 49, 66), febrile with moderate inflammation subphenotype (n = 766); an elderly (median age 77 years, IQR 67, 84), moderate inflammation and coagulopathic subphenotype (n = 953); a younger (median age 51 years, IQR 35, 69), predominantly female (64%) hypoinflammatory subphenotype (n = 673); a predominantly male (82.5%) hyperinflammatory with liver dysfunction subphenotype (n = 790); and a hyperinflammatory with renal dysfunction subphenotype (n = 617). The hypoinflammatory subphenotype had the fewest clinical and laboratory abnormalities thus was the referent class in subsequent analyses.
Associations between COVID-19 subphenotype and mortality
Bivariate models suggest that, as compared to the hypoinflammatory subphenotype, all subphenotypes had increased risk of mortality (Supplemental Table S2). In multivariable models adjusting for onset time, hospital, self-identified race/ethnicity and insurance provider, all subphenotypes had increased risk of mortality as compared to the hypoinflammatory subphenotype (moderate inflammation and febrile RR 3.29, 95% CI 2.05, 5.28; hyperinflammation with liver dysfunction RR 6.85, 95% CI 4.37, 10.73; moderate inflammation and coagulopathic RR 8.09, 95% CI 5.19, 12.61; hyperinflammatory with renal dysfunction RR 8.78, 95% CI 5.61, 13.75; hyperinflammatory with multiorgan dysfunction RR 17.87, 95% CI 11.56, 27.63, Figure 2(A), complete model output Supplemental Table S2).
Associations between COVID-19 subphenotype and intubation
In bivariate and multivariate models, all subphenotypes had increased risk of intubation as compared to the hypoinflammatory subphenotype (multivariable model: moderate inflammation and febrile RR 3.44, 95% CI 2.07, 5.74; hyperinflammation with liver dysfunction RR 7.49, 95% CI 4.60, 12.20; moderate inflammation and coagulopathic RR 4.72, 95% CI 2.87, 7.77; hyperinflammatory with renal dysfunction RR 3.30, 95% CI 1.95, 5.60; hyperinflammatory with multiorgan dysfunction RR 10.88, 95% CI 6.72, 17.62, Figure 2(B), complete model output Supplemental Table S3).
Associations between COVID-19 subphenotype and ICU admission
In bivariate and multivariate models, all subphenotypes had increased risk of admission to the ICU as compared to the hypoinflammatory subphenotype (multivariable model: moderate inflammation and febrile RR 2.00, 95% CI 1.50, 2.67; hyperinflammation with liver dysfunction RR 3.31, 95% CI 2.52, 4.34; moderate inflammation and coagulopathic RR 2.25, 95% CI 1.71, 2.98; hyperinflammatory with renal dysfunction RR 1.84, 95% CI 1.35, 2.51; hyperinflammatory with multiorgan dysfunction RR 3.94, 95% CI 3.02, 5.15, Figure 2(C), complete model output Supplemental Table S4)
Associations between COVID-19 subphenotype and length of stay
Amongst survivors (n = 3278), the median length of stay was 6.2 days (IQR 3.1, 10.9). Length of stay in days was longer in all subphenotypes as compared to the hypoinflammatory subphenotype in unadjusted models (multivariable model: moderate inflammation and febrile β = 2.95 days, 95% CI 1.88, 4.01; hyperinflammation with liver dysfunction β = 4.91 days, 95% CI 3.80, 6.02; moderate inflammation and coagulopathic β = 4.87 days, 95% CI 3.81, 5.93; hyperinflammatory with renal dysfunction β = 5.15 days, 95% CI 3.95, 6.34; hyperinflammatory with multiorgan dysfunction β = 10.19 days, 95% CI 8.79, 11.58, Figure 2(D), complete model output Supplemental Table S5).
Exploratory analyses of COVID-19 therapies
Exploratory analyses to examine effect modification of the association between convalescent plasma (CP) and mortality by COVID-19 subphenotype suggested a differential effect by subphenotype. Within the cohort, n = 93 patients received CP. As compared to the hypoinflammatory subphenotype, the other subphenotypes demonstrated increased risk of dying in those who did not receive CP as compared to those who did [moderate inflammation with fever: received CP (n = 10) RR 0.85, 95% CI 0.22, 3.31 vs no CP (n = 82) RR 4.05, 95% CI 2.49, 6.59, p-int 0.06; hyperinflammatory with liver dysfunction: received CP (n = 18) RR 1.43, 95% CI 0.39, 5.24 vs no CP (n = 179) RR 8.59, 95% CI 5.42, 13.62, p-int 0.04; moderate inflammation with coagulopathy: received CP (n = 25) RR 1.64, 95% CI 0.46, 5.92 vs no CP (n = 253) RR 10.07, 95% CI 6.39, 15.87, p-int 0.03; hyperinflammatory with renal dysfunction: received CP (n = 11) RR 1.31, 95% CI 0.34, 5.00 vs no CP (n = 186) RR 11.29, 95% CI 7.14, 17.86, p-int <0.01; hyperinflammatory with multiorgan dysfunction: received CP (n = 27) RR 2.76, 95% CI 0.78, 9.76 vs no CP (n = 530) RR 23.96, 95% CI 15.34, 37.41, p-int <0.01, Figure 3].
Within the cohort, n = 118 patients received tocilizumab. The associated risks for mortality in the hyperinflammatory with renal dysfunction and hyperinflammatory with multiorgan dysfunction, as compared to the hypoinflammatory subphenotype were higher amongst persons who did not received tocilizumab as compared to those who did (hyperinflammatory with renal dysfunction: received tocilizumab (n = 6) RR 1.29, 95% CI 0.23, 7.11 vs no tocilizumab (n = 191) RR 10.61, 95% CI 6.78, 16.60, p-int 0.08; hyperinflammatory with multiorgan dysfunction: received tocilizumab (n = 34) RR 2.00, 95% CI 0.40, 10.03 vs no tocilizumab (n = 523) RR 22.75, 95% CI 14.74, 35.13, p-int 0.03) (Figure 4).
Discussion
Utilizing a cohort of 4620 COVID-19 patients during the Spring to Summer 2020 NYC surge, these data suggest that there are six distinct COVID-19 subphenotypes at the time of hospital admission. These subphenotypes have varying clinical courses, with differences in associated risk for mortality, intubation, ICU admission and LOS. Further, despite limited treatment options in the early pandemic, this work provides insight that COVID-19 subphenotypes may differentially respond to therapeutics, suggesting that further characterization of COVID-19 subphenotypes may be critical for future COVID-19 therapeutic trials and, ultimately, to guide therapy. This result expands on the currently published literature in two significant ways. It uses data at time of hospital admission, rather than trajectories while hospitalized or ICU admission, to provide the earliest possible timepoint to identify distinct subphenotypes [14,25,26]. Additionally, much of current literature focuses on strictly identifying subphenotypes [10,24,38]. This study goes beyond identification to begin to explore the role subphenotypes play in response to potential treatments.
Our admission dataset includes a number of serologic markers (IL-6, IL-1B, ferritin, LDH, ESR, CRP and procalcitonin) with variability across the cohort. The identified six subphenotypes differed predominantly in inflammatory profiles on admission, as defined by these serologic inflammatory markers. For example, median ferritin level on admission was 724 ng/ml (IQR 326, 1653) and varied amongst subphenotypes. The median ferritin in the hypoinflammatory subphenotype was 148.0 (IQR 56.8, 261.5) as compared to the hyperinflammatory subphenotypes (hyperinflammation with liver dysfunction median 1341.0 (IQR 770.2, 2465.2), hyperinflammation with renal dysfunction median 1004.0 (IQR 430.0, 2376.5) and hyperinflammation with multiorgan dysfunction subphenotype median 1441.5 (IQR 685.0, 2756.0)). We noted similarly separation of subphenotypes by IL-6, CRP and LDH in particular. While there is no single marker (serologic or otherwise) that has been identified to predict disease severity, associated organ involvement or outcome, multiple studies have demonstrated the role of IL-6, IL-8, CRP, LDH, procalcitonin and ferritin, in identifying patients at higher risk of poor outcomes [10,39–41]. Interestingly, the inflammatory markers are not uniformly elevated suggesting nuances in the inflammatory cascade and/or host response that need further investigation.
Organ dysfunction occurred predominantly in subphenotypes with the largest inflammatory derangements. For example, subphenotypes with hyperinflammation had multiorgan dysfunction or renal or liver failure. Notably, the liver failure group were predominantly younger (median age 61, IQR 52, 68) men which is not a group previously considered high risk [42]. Given the cross-sectional view of clinical and laboratory measures at admission we cannot temporally identify whether the inflammation directly led to the organ dysfunction or whether the inflammatory markers were elevated due to reduced renal or hepatic clearance [43]. It is interesting to note, however, that the two subphenotypes with moderately elevated inflammatory markers on average did not have renal or hepatic dysfunction. These data suggest that specific inflammatory markers in conjunction with markers of renal and hepatic injury may be used to identify patients at risk of clinical deterioration.
While the mortality risk observed for the most elderly group with multiorgan derangements is not surprising, it is important to note that this approach allowed us to identify subphenotypes that were at increased risk for mortality but may not previously have been considered to be at higher risk. For example, we identified a younger group (median age 59 years, IQR 49, 66) with moderate inflammation and febrile subphenotype that had 3.3 times the risk of death (RR 3.29, 95% CI 2.05 5.28), 3.4 times the risk of intubation (RR 3.44, 95% CI 2.07, 5.74) and 2 times the risk of ICU admission (RR 2.00, 95% CI 1.50, 2.67) in the multivariable model as compared to the hypoinflammatory subphenotype. On average, members of this group were previously healthy with no (N = 279, 36%) or one (N = 332, 43%) organ system comorbidity. It is critical that future studies focus on these populations to better understand the pathophysiologic mechanisms of increased risk, enable early identification and initiate appropriate treatments.
Further, these data suggest that identification of these subphenotypes at the time of hospital admission may be helpful in designing future COVID-19 therapeutic trials, guiding secondary analyses of existing COVID-19 randomized controlled treatment studies and, ultimately, in generating a patient-centered treatment algorithm. This work builds on prior analyses by Calfee et al. suggesting that ARDS subphenotypes differentially respond to therapeutic interventions in ARDS including fluid management strategies and use of statins [13,15,16].
Specifically, we find that the hyperinflammatory subphenotype with multiorgan dysfunction may differentially respond to tocilizumab and convalescent plasma therapies. Emerging evidence supports this concept and suggests that some treatments may be more efficacious in certain populations [39,44]. Current literature on the efficacy of tocilizumab and convalescent plasma have demonstrated conflicting results. Five large randomized trials examining the efficacy of tocilizumab, which enrolled patients with varying degrees of respiratory failure, found a mortality benefit in only two studies [45–49]. A recent meta-analysis of IL-6 antagonists, including tocilizumab, however did demonstrate a mortality benefit [50]. Similarly, studies of convalescent plasma have not shown mortality benefit but do suggest that early administration to mildly ill patients or with high titer plasma, may provide some benefit [51–55]. Our data support secondary analyses of completed randomized controlled trials to better understand if COVID-19 subphenotypes differentially respond to therapeutics. These analyses will be critical to inform patient-centered treatment algorithms.
Our study has several strengths. By leveraging the Mount Sinai electronic medical record data repository, we were able to evaluate a large, diverse sample of patients from multiple hospitals in New York City. We employed a data-driven method to analyze the heterogeneous population of COVID-19 patients and were able to identify underlying subphenotypes and demonstrate associations with mortality, ICU admission, intubation risk, and length of stay, key COVID-19 endpoints. Our exploratory analyses suggest that these subphenotypes may have a differential response to therapeutic treatments, which suggests that completed COVID-19 RCTs may benefit from secondary analyses of their datasets even if no effect in the overall cohort was found. Our employment of admission data to generate subphenotypes provides a platform for early identification of subphenotypes, which if replicated in other studies, suggests the potential for selecting therapeutic options based on identified differences early in the hospitalization.
We also acknowledge limitations. There is variability in duration of illness prior to presentation to the hospital that is not captured by this dataset. Patients may have presented at different phases of illness, which we know was true in New York City during the first wave of the pandemic as factors including patient volume and limited resources impacted decisions on when to present to the hospital or be admitted. Hospital capacity and available resources may have also impacted aggressiveness of treatment (e.g. palliative care) which we are unable to capture by electronic health records. We adjust for time since the onset of the pandemic to address this. An evaluation of the stability of the subphenotypes over time would lend additional evidence that the subphenotypes are distinct classes, regardless of duration of illness at presentation [56,57]. Exploratory analyses of associations with treatments are limited by potential biases as these data are retrospective. Access to these treatments was limited in Spring 2020. Certain treatments were not available at every hospital site and were restricted to patients with more severe illness rather than distributed in a randomized fashion, introducing selection bias. Criteria for use changed over time as more studies became available about these treatments, limiting generalizability of these analyses. Additionally, these data reflect the original strain of COVID, and were collected prior to the development of vaccines, which has impacted the clinical presentations and potentially altered the clinical subphenotypes.
In conclusion, these data find six distinct, clinically relevant, COVID-19 subphenotypes present on admission, which are associated with risk for mortality, intubation, ICU admission and LOS and suggest differential responses to tocilizumab and convalescent plasma. Future studies should validate these subphenotypes in other populations and health systems. Post hoc analyses of randomized control trials of tocilizumab, convalescent plasma and other therapeutics are warranted.
Supplementary Material
Acknowledgements
This work was supported in part through the computational and data resources and staff expertise provided by Scientific Computing at the Icahn School of Medicine at Mount Sinai.
Funding Statement
Research reported in this publication was supported by the National Institute of Minority Health and Health Disparities under Grant no. R01MD013310. A. G. L. was supported by the National Health, Lung and Blood Institute K23HL135349. The funders had no role in the conduct or publication of this work.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Data availability statement
The deidentified data presented in this study is available upon request from the corresponding author.
References
- 1.Bilinski A, Emanuel EJ.. COVID-19 and excess all-cause mortality in the US and 18 comparison countries. JAMA. 2020;324(20):2100–2102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wu JT, Leung K, Leung GM.. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;395(10225):689–697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Centers for Disease Control and Prevention [CDC] . Estimated Disease Burden of COVID-19. 2020; [cited 2021 April 20]; Available from: https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/burden.html
- 4.Noor FM, Islam MM.. Prevalence and associated risk factors of mortality among COVID-19 patients: a meta-analysis. J Community Health. 2020;45(6):1270–1282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wang D, Hu B, Hu C, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China. JAMA. 2020;323(11):1061–1069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chen N, Zhou M, Dong X, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395(10223):507–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Carrasco-Sánchez FJ, López-Carmona MD, Martínez-Marcos FJ, et al. Admission hyperglycaemia as a predictor of mortality in patients hospitalized with COVID-19 regardless of diabetes status: data from the Spanish SEMI-COVID-19 registry. Ann Med. 2021;53(1):103–116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cummings MJ, Baldwin MR, Abrams D, et al. Epidemiology, clinical course, and outcomes of critically ill adults with COVID-19 in New York City: a prospective cohort study. Lancet. 2020;395(10239):1763–1770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Guo Y-R, Cao Q-D, Hong Z-S, et al. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak – an update on the status. Mil Med Res. 2020;7(1):11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang X, Jehi L, Ji X, et al. Phenotypes and subphenotypes of patients with coronavirus disease 2019: a latent class modeling analysis. Chest. 2021;159(6):2191–2204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Legrand M, Phillips RV, Malenica I, et al. Differences in clinical deterioration among three sub-phenotypes of COVID-19 patients at the time of first positive test: results from a clustering analysis. Intensive Care Med. 2021;47(1):113–115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wenzel SE. Asthma phenotypes: the evolution from clinical to molecular approaches. Nat Med. 2012;18(5):716–725. [DOI] [PubMed] [Google Scholar]
- 13.Calfee CS, Delucchi K, Parsons PE, et al. Subphenotypes in acute respiratory distress syndrome: latent class analysis of data from two randomised controlled trials. Lancet Respir Med. 2014;2(8):611–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Sinha P, Furfaro D, Cummings MJ, et al. Latent class analysis reveals COVID-19-related acute respiratory distress syndrome subgroups with differential responses to corticosteroids. Am J Respir Crit Care Med. 2021;204(11):1274–1285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Calfee CS, Delucchi KL, Sinha P, et al. Acute respiratory distress syndrome subphenotypes and differential response to simvastatin: secondary analysis of a randomised controlled trial. Lancet Respir Med. 2018;6(9):691–698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Famous KR, Delucchi K, Ware LB, et al. Acute respiratory distress syndrome subphenotypes respond differently to randomized fluid management strategy. Am J Respir Crit Care Med. 2017;195(3):331–338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sinha P, Delucchi KL, Thompson BT, et al. Latent class analysis of ARDS subphenotypes: a secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med. 2018;44(11):1859–1869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Xie J, Hungerford D, Chen H, et al. Development and external validation of a prognostic multivariable model on admission for hospitalized patients with COVID-19. 2020. medRxiv. [Google Scholar]
- 19.Vaid A, Somani S, Russak AJ, et al. Machine learning to predict mortality and critical events in a cohort of patients with COVID-19 in New York city: model development and validation. J Med Internet Res. 2020;22(11):e24018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ma X, Ng M, Xu S, et al. Development and validation of prognosis model of mortality risk in patients with COVID-19. Epidemiol Infect. 2020;148:e168–e168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Knight SR, Ho A, Pius R, et al. Risk stratification of patients admitted to hospital with Covid-19 using the ISARIC WHO clinical characterisation protocol: development and validation of the 4C mortality score. BMJ. 2020;370:m3339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang S, Guo M, Duan L, et al. Development and validation of a risk factor-based system to predict short-term survival in adult hospitalized patients with COVID-19: a multicenter, retrospective, cohort study. Crit Care. 2020;24(1):438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Su C, Zhang Y, Flory JH, et al. Clinical subphenotypes in COVID-19: derivation, validation, prediction, temporal patterns, and interaction with social determinants of health. NPJ Digit Med. 2021;4(1):110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ranard BL, Megjhani M, Terilli K, et al. Identification of endotypes of hospitalized COVID-19 patients. Front Med. 2021;8:770343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Vasquez CR, Gupta S, Miano TA, et al. Identification of distinct clinical subphenotypes in critically ill patients with COVID-19. Chest. 2021;160(3):929–943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Su C, Xu Z, Hoffman K, et al. Identifying organ dysfunction trajectory-based subphenotypes in critically ill patients with COVID-19. Sci Rep. 2021;11(1):15872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Prescott HC, Calfee CS, Thompson BT, et al. Toward smarter lumping and smarter splitting: rethinking strategies for sepsis and acute respiratory distress syndrome clinical trial design. Am J Respir Crit Care Med. 2016;194(2):147–155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Maley JH, Thompson BT.. Embracing the heterogeneity of ARDS. Chest. 2019;155(3):453–455. [DOI] [PubMed] [Google Scholar]
- 29.Amati F, Dela Cruz CS.. One size does not fit all: moving towards a personalized approach for steroids in COVID-19. Chest. 2021;159(5):1693–1695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hajifathalian K, Sharaiha RZ, Kumar S, et al. Development and external validation of a prediction risk model for short-term mortality among hospitalized U.S. COVID-19 patients: a proposal for the COVID-AID risk tool. PLoS One. 2020;15(9):e0239536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Petrilli CM, Jones SA, Yang J, et al. Factors associated with hospital admission and critical illness among 5279 people with coronavirus disease 2019 in New York City: prospective cohort study. BMJ. 2020;369:m1966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Chen W, Qian L, Shi J, et al. Comparing performance between log-binomial and robust Poisson regression models for estimating risk ratios under model misspecification. BMC Med Res Methodol. 2018;18(1):63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zou G. A modified Poisson regression approach to prospective studies with binary data. Am J Epidemiol. 2004;159(7):702–706. [DOI] [PubMed] [Google Scholar]
- 34.Zeileis A, Köll S, Graham N.. Various versatile variances: an object-oriented implementation of clustered covariances in R. J Stat Soft. 2020;95(1):1–36. [Google Scholar]
- 35.Zeileis A. Object-oriented computation of sandwich estimators. J Stat Soft. 2006;16(9):1–16. [Google Scholar]
- 36.Linzer DA, Lewis JB.. poLCA: an R package for polytomous variable latent class analysis. J Stat Soft. 2011;42(10):1–29. [Google Scholar]
- 37.Linzer D Jeffrey. poLCA: polytomous Variable Latent Class Analysis. 2013. R package version 1.4. [Google Scholar]
- 38.Lusczek ER, Ingraham NE, Karam BS, et al. Characterizing COVID-19 clinical phenotypes and associated comorbidities and complication profiles. PLoS One. 2021;16(3):e0248956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Chen H, Xie J, Su N, et al. Corticosteroid therapy is associated with improved outcome in critically ill patients with COVID-19 with hyperinflammatory phenotype. Chest. 2021;159(5):1793–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Del Valle DM, Kim-Schulze S, Huang H-H, et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nat Med. 2020;26(10):1636–1643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hariyanto TI, Japar KV, Kwenandar F, et al. Inflammatory and hematologic markers as predictors of severe outcomes in COVID-19 infection: a systematic review and meta-analysis. Am J Emerg Med. 2021;41:110–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wu J, Song S, Cao H-C, et al. Liver diseases in COVID-19: etiology, treatment and prognosis. World J Gastroenterol. 2020;26(19):2286–2293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Heilmann E, Gregoriano C, Wirz Y, et al. Association of kidney function with effectiveness of procalcitonin-guided antibiotic treatment: a patient-level meta-analysis from randomized controlled trials. Clin Chem Lab Med. 2021;59(2):441–453. [DOI] [PubMed] [Google Scholar]
- 44.Mariette X, Hermine O, Tharaux P-L, et al. Effectiveness of tocilizumab in patients hospitalized with COVID-19: a follow-up of the CORIMUNO-TOCI-1 randomized clinical trial. JAMA Intern Med. 2021;181(9):1241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Interleukin-6 receptor antagonists in critically ill patients with covid-19. New Eng J Med. 2021;385:1147–1149. [DOI] [PubMed] [Google Scholar]
- 46.Rosas IO, Bräu N, Waters M, et al. Tocilizumab in hospitalized patients with severe Covid-19 pneumonia. N Engl J Med. 2021;384(16):1503–1516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Salama C, Han J, Yau L, et al. Tocilizumab in patients hospitalized with covid-19 pneumonia. N Engl J Med. 2021;384(1):20–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Salvarani C, Dolci G, Massari M, et al. Effect of tocilizumab vs standard care on clinical worsening in patients hospitalized with COVID-19 pneumonia: a randomized clinical trial. JAMA Intern Med. 2021;181(1):24–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hermine O, Mariette X, Tharaux P-L, et al. Effect of tocilizumab vs usual care in adults hospitalized with COVID-19 and moderate or severe pneumonia: a randomized clinical trial. JAMA Intern Med. 2021;181(1):32–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Group, T.W.R.E.A.f.C.-T.W. Association between administration of IL-6 antagonists and mortality among patients hospitalized for COVID-19: a meta-analysis. JAMA. 2021;326(6):499–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Simonovich VA, Burgos Pratx LD, Scibona P, et al. A randomized trial of convalescent plasma in covid-19 severe pneumonia. N Engl J Med. 2021;384(7):619–629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Agarwal A, Mukherjee A, Kumar G, PLACID Trial Collaborators, et al. Convalescent plasma in the management of moderate Covid-19 in adults in India: open label phase II multicentre randomised controlled trial (PLACID trial). BMJ. 2020;371:m3939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Joyner MJ, Carter RE, Senefeld JW, et al. Convalescent plasma antibody levels and the risk of death from covid-19. N Engl J Med. 2021;384(11):1015–1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Libster R, Pérez Marc G, Wappner D, et al. Early high-titer plasma therapy to prevent severe covid-19 in older adults. N Engl J Med. 2021;384(7):610–618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Horby PW, Estcourt L, Peto L, et al. Convalescent plasma in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial. medRxiv; 2021. p. 2021.03.09.21252736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Su C, Xu Z, Hoffman K, et al. Identifying organ dysfunction trajectory-based subphenotypes in critically ill patients with COVID-19. medRxiv; 2020. p. 2020.07.16.20155382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Delucchi K, Famous KR, Ware LB, et al. Stability of ARDS subphenotypes over time in two randomised controlled trials. Thorax. 2018;73(5):439–445. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The deidentified data presented in this study is available upon request from the corresponding author.