Skip to main content
iScience logoLink to iScience
. 2022 Jun 17;25(7):104612. doi: 10.1016/j.isci.2022.104612

Integrative metabolomic and proteomic signatures define clinical outcomes in severe COVID-19

Mustafa Buyukozkan 1,2,11, Sergio Alvarez-Mulett 3,11, Alexandra C Racanelli 3,11, Frank Schmidt 4, Richa Batra 1,2, Katherine L Hoffman 5, Hina Sarwath 4, Rudolf Engelke 4, Luis Gomez-Escobar 3, Will Simmons 5, Elisa Benedetti 1,2, Kelsey Chetnik 1,2, Guoan Zhang 6, Edward Schenck 3, Karsten Suhre 7, Justin J Choi 8, Zhen Zhao 9, Sabrina Racine-Brzostek 9, He S Yang 9, Mary E Choi 10, Augustine MK Choi 3, Soo Jung Cho 3,, Jan Krumsiek 1,2,12,∗∗
PMCID: PMC9212983  PMID: 35756895

Summary

The coronavirus disease-19 (COVID-19) pandemic has ravaged global healthcare with previously unseen levels of morbidity and mortality. In this study, we performed large-scale integrative multi-omics analyses of serum obtained from COVID-19 patients with the goal of uncovering novel pathogenic complexities of this disease and identifying molecular signatures that predict clinical outcomes. We assembled a network of protein-metabolite interactions through targeted metabolomic and proteomic profiling in 330 COVID-19 patients compared to 97 non-COVID, hospitalized controls. Our network identified distinct protein-metabolite cross talk related to immune modulation, energy and nucleotide metabolism, vascular homeostasis, and collagen catabolism. Additionally, our data linked multiple proteins and metabolites to clinical indices associated with long-term mortality and morbidity. Finally, we developed a novel composite outcome measure for COVID-19 disease severity based on metabolomics data. The model predicts severe disease with a concordance index of around 0.69, and shows high predictive power of 0.83–0.93 in two independent datasets.

Subject areas: Biological sciences, Clinical finding, Human metabolism, Medicine, Physiology

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • COVID-19 patients show serum changes in various pathways

  • Metabolomic/proteomic cross-talk defines pathways of disease

  • COVID-19 disease severity correlates with various markers in blood

  • COVID-19 disease severity can be accurately predicted by a small set of metabolites


Biological sciences; Clinical finding; Human metabolism; Medicine; Physiology

Introduction

The novel coronavirus disease 2019 (COVID-19) has a broad spectrum of clinical features that range from asymptomatic disease to acute respiratory distress syndrome (ARDS) (Guan et al., 2020; Arons et al., 2020). COVID-19 ARDS can lead to refractory hypoxia, mechanical ventilation, prolonged intensive care unit (ICU) stay and increased mortality (Zhou et al., 2020). Previous studies have shown a high incidence of concomitant organ failure in COVID-19, including acute kidney injury (AKI) (Hirsch et al., 2020), acute liver injury (Zhang et al., 2020), thromboembolic events (Bilaloglu et al., 2020; Tang et al., 2020) and secondary infections contributing to a fatal outcome (Elezkurtaj et al., 2021).

Massive investigative efforts by multiple scientific groups have used proteomic and metabolomic approaches to begin to unravel disease mechanisms relevant to SARS-CoV-2 infection such as inflammation, coagulation, and metabolism (Lucas et al., 2020). However, how COVID-19 specific protein-metabolite interactions relate to the severity of disease and clinical outcomes remains poorly understood. Key study limitations have included relatively small sample sizes, absence of protein-metabolite network analysis and the focus on dichotomous outcome measures such as death and survival. These limitations have been difficult to overcome and restrict our understanding of COVID-19 pathogenesis.

Here, we report the largest study to integrate targeted metabolomic and proteomic analyses of serum samples obtained from hospitalized COVID-19 patients during SARS-CoV-2 infection compared to patients admitted during the same time period with symptoms related to COVID-19 and negative RT-PCR for SARS-CoV-2 as controls (Figure 1). Through this work, we uncovered COVID-19-specific metabolite and protein profiles, identified novel protein-metabolite modules, and defined the molecular signatures of several clinical indices (CRP, ferritin, platelet count, AKI, and death). Additionally, we developed the first clinical composite outcome prediction model in COVID-19, where the input of discrete metabolic profiles significantly improves our clinical insight into a broad range of outcomes known to plague many survivors of severe COVID-19.

Figure 1.

Figure 1

Study outline. Study population composed by COVID-19 patients and controls

Serum samples obtained from clinically indicated specimens during the first 72 h of admission. Metadata obtained from electronic medical records. We obtained metabolomic and proteomic profiles, after which we integrated our findings into a single network describing the associations between differentially expressed proteins and metabolites in COVID-19. We then correlated the metabolites and proteins with baseline characteristics, laboratory parameters and clinical events. Finally, we developed a novel metabolite-based prediction model for a composite outcome measure comprised of key clinical parameters including death, mechanical ventilation, initiation of dialysis, supplemental oxygen requirement, development of acute kidney injury, and length of hospital stay.

Results

Study cohort and dataset

Our cohort was composed of 330 patients with confirmed SARS-CoV-2 RT-PCR, and 97 non-COVID-19 controls with negative RT-PCR results who were hospitalized at the NewYork-Presbyterian Hospital/Weill Cornell Medical Center between March and April 2020. Serum samples were obtained within the first 3 days of admission. The majority of COVID-19 patients had samples drawn at two or three different time points resulting in a total of 582 serum samples from the 330 COVID-19 patients, whereas all 97 controls had one sample drawn. Metabolomics was measured for all available samples. Proteomics was measured for fewer samples (n = 189), also across different time points for some patients. Notably, there were only minor time effects across the three days (Figure S1), and the repeated samples were thus treated as replicates using a linear mixed effect model, see STAR Methods. A detailed description of the clinical and demographic characteristics of the cohort can be found in Table 1, Tables S1 and S2. Of note, we excluded samples collected after intubation because we found that the clinical act of intubation significantly alters a patient’s metabolic profile (Table S3).

Table 1.

Demographics and clinical characteristics of the cohort

Characteristic Negative, N = 97a COVID-19, N = 330a p valueb
Demographics
 Age 64 (51, 77) 64 (52, 75) 0.8
 Sex 0.017
  Female 48 (49%) 117 (35%)
  Male 49 (51%) 213 (65%)
 Race <0.001
  Asian 10 (10%) 24 (7.3%)
  Black 15 (15%) 40 (12%)
  Not specified 16 (16%) 48 (15%)
  Other 6 (6.2%) 71 (22%)
  White 50 (52%) 147 (45%)
 BMI 24.7 (21.7, 28.5) 27.4 (23.7, 31.0) <0.001

Risk Factors
 CAD 23 (24%) 55 (17%) 0.2
 DM 19 (20%) 89 (27%) 0.2
 HTN 55 (57%) 182 (55%) 0.9
 CKD/ESRD 17 (18%) 17 (5.2%) <0.001
 Active cancer 40 (41%) 30 (9.1%) <0.001
 Immunosuppressed state 29 (30%) 14 (4.2%) <0.001

Laboratories
 Lymphocyte count 1.03 (0.50, 1.60) 0.80 (0.60, 1.10) 0.081
 Platelet count 202 (132, 275) 191 (145, 252) 0.9
 D-dimer 242 (150, 555) 386 (242, 667) 0.2
 Alanine aminotransferase 22 (14, 40) 35 (23, 54) <0.001
 Aspartate aminotransferase 25 (19, 38) 40 (28, 61) <0.001
 Ferritin 727 (147, 1167) 769 (358, 1449) 0.4
 C-reactive protein 3 (1, 7) 11 (6, 19) 0.006
 Procalcitonin 0.43 (0.13, 1.06) 0.17 (0.08, 0.42) 0.007

Clinical Outcomes
 Acute Kidney Injury (AKI) stage <0.001
  0 89 (92%) 214 (65%)
  1 4 (4.1%) 43 (13%)
  2 0 (0%) 16 (4.8%)
  3 4 (4.1%) 42 (13%)
 Kidney replacement therapy 7 (7.2%) 26 (7.9%) >0.9
 Acute respiratory failure requiring intubation 11 (11%) 103 (31%) <0.001
 Thromboembolic event 6 (6.2%) 22 (6.7%) >0.9
 In-hospital mortality 13 (13%) 56 (17%) 0.6

Bold values indiciate p<0.05.

a

Statistics presented: median (IQR); n (%).

b

Statistical tests performed: Wilcoxon rank-sum test; chi-square test of independence; Fisher’s test with simulated p value; Fisher’s exact test.

Metabolic profiles were assessed for all samples using liquid chromatography coupled with mass spectrometry (LC/MS). After quality control and data preprocessing, 125 metabolites were available for comparative analysis. Targeted proteomic profiling was performed on a subset of 227 samples (173 from COVID-19 patients and 54 controls) using the Olink inflammation, cardiovascular II and cardiovascular III panels, which cover 266 unique protein biomarkers. These panels were selected, because it has previously been shown that inflammation and cardiovascular pathways are essential during COVID-19 pathogenesis (Wiersinga et al., 2020).

Metabolomic and proteomic changes associated with COVID-19

Differential metabolomic analysis identified significant changes in abundance of 70 out of the 125 analyzed metabolites between COVID-19 and controls at a false discovery rate (FDR) of 0.05 (Figure 2A). The top three differentially expressed metabolites were involved in amino acid metabolism: N-acetyl-L-aspartic acid (p value = 9.19E-18), N-acetyl-aspartyl-glutamic acid (p value = 5.30E-15), and argininosuccinic acid (p value = 6.35E-12) (Figure 2B). KEGG pathway mapping of the differentially expressed metabolites revealed an involvement of various metabolic pathways (Figure 2C). These pathways included arginine and proline metabolism, glycine and serine metabolism, alanine metabolism, methionine metabolism, sphingolipid metabolism, gluconeogenesis, and the TCA cycle pathway, demonstrating involvement of the broader categories of amino acid, lipid and energy metabolism in COVID-19 pathogenesis. Our results confirm prior reports that reported altered phenylalanine and tryptophan metabolism in severe COVID-19 patients compared to non-COVID-19 patients (Wu et al., 2021; Thomas et al., 2020; Barberis et al., 2020). In addition, our data has shown that multiple metabolites involved in sphingolipid metabolism are significantly increased in COVID-19 patients, pointing toward other potential targets for COVID-19 treatment (Törnquist et al., 2021).

Figure 2.

Figure 2

Metabolomics and Proteomics changes associated with COVID-19

(A) Volcano plot showing differentially expressed metabolites between COVID-19 patients and controls at an adjusted p value <0.05. In red, upregulated metabolites in COVID-19 patients. In blue, upregulated metabolites in the control group.

(B) Top three differentially expressed metabolites in COVID-19 patients vs. controls based on adjusted p values. Y axis shows log2 fold changes in relation to the mean of the control group.

(C) Number of significantly regulated molecules in KEGG pathways, top 15 shown.

(D) Volcano plot showing the differentially expressed proteins between COVID-19 patients and controls at an adjusted p value <0.05.

(E) Top three differentially expressed proteins in COVID-19 patients vs. controls based on adjusted p values. Y axis shows log2 fold changes in relation to the mean of the control group.

(F) Number of significantly regulated molecules in KEGG pathways, top 15 shown.

Comparative proteomic analysis identified significant changes in the expression of 48 out of the 266 analyzed proteins between COVID-19 and controls at an FDR of 0.05 (Figure 2D). The top three differentially expressed proteins were C-X-C motif chemokine ligand 10 (CXCL10) (p value = 5.09E-13), galectin 9 (Gal-9) (p value = 5.49E-10), and monocyte chemoattractant protein 3 (MCP-3) (p value = 2.85E-09) (Figure 2E). KEGG pathway mapping revealed that these proteins participated in various protein pathways (Figure 2F), including the interleukin 17 (IL-17), tumoral necrosis factor (TNF) and JAK-STAT signaling pathways. Detailed results of the differential analysis and pathway mappings can be found in Table S4. Of note, the utility of the JAK inhibitors baricitinib or ruxolitinib has already been demonstrated in a clinical trial for selected patients with severe or critical COVID-19 patients (Kalil et al., 2020; Iastrebner et al., 2021). Based on our data, clinical trials further targeting IL-17 and TNF signaling in COVID-19 may lead to additional therapeutic approaches for treating COVID-19.

Global principal component analysis (PCA) on the metabolomics and proteomics data revealed no clear separation of COVID-19 and control groups (Figure S2). This is an effect that we have commonly observed in previous studies of blood data (Roberts et al., 2021; Valdés et al., 2022; Meoni et al., 2021; Overmyer et al., 2021; Di et al., 2020; Gisby et al., 2021), where omics profiles only separated groups in a specific (single molecules and pathways) rather than a global fashion.

Protein-metabolite networks identify potential mediators of COVID-19 pathology

To obtain further mechanistic insight into the biology of COVID-19, we developed a comprehensive, data-driven network for the integrative analysis of our multi-omics dataset. We first generated a Gaussian graphical model (GGM) of correlated metabolites and proteins from our COVID-19 cohort (Data S1). GGMs are correlation-based network models that we have previously demonstrated to accurately reconstruct biological pathways from blood-based omics data (Krumsiek et al., 2011, 2012; Benedetti et al., 2017). A minimum spanning tree-based algorithm was then used to identify a focused subnetwork that connects the most significantly correlated metabolites and proteins from the original network (Figure 3). The subnetwork included 13 proteins from the Olink inflammatory panel, 32 proteins from the Olink cardiovascular panels II and III, and 70 metabolites. From this subnetwork, we selected four network modules to query the interplay between metabolism, inflammation, and vascular dysfunction in COVID-19 patients.

Figure 3.

Figure 3

Protein-metabolite networks in COVID-19

Gaussian graphical model (GGM) representing the significant partial correlations between all the measured metabolites and proteins, generated by a minimum spanning tree (MST) based reduction combined with a shortest paths-based approach. The network includes all differentially expressed metabolites and proteins between COVID-19 patients and controls. Squares indicate proteins and green circles indicate metabolites. Inflammatory proteins are colored in purple, whereas vascular injury proteins are colored in red. Shadows represent molecule modules.

Inflammation-related network modules

Module 1

Our network identified hexanoylcarnitine as a key metabolite associated with inflammatory cytokines in COVID-19 illness. The module contained IFN-gamma, CXCL10 and CXCL11 that were all upregulated in COVID-19 and associated with hexanoylcarnitine. IFN-gamma, CXCL10 and CXCL11 are proinflammatory cytokines which regulate T cell immunity (Tokunaga et al., 2018), whereas hexanoylcarnitine is a medium-chain fatty acid conjugate that plays a critical role in energy metabolism and mitochondrial fatty acid β-oxidation (Figure 3) (Stanley et al., 1992). Levels of other carnitine species (L-carnitine and caprylic acid) were also elevated in COVID-19 patients, suggesting a role for inflammation, dysregulated fatty acid β-oxidation and mitochondrial dysfunction in COVID-19 pathogenesis (Wajner and Amaral, 2015; Thomas et al., 2020; León et al., 2021; Soliman et al., 2020).

Module 2

In a second inflammation-related module, cytosine was another key metabolite linked with inflammatory cytokines during COVID-19. The module consisted of a group of macrophage-derived cytokines, including MCP-2, MCP-3 and GRN, which were all upregulated in COVID-19 and positively associated with cytosine (Figure 3). This finding is consistent with the hyperinflammatory state of COVID-19 infection in which cytokine storms are often observed (McGonagle et al., 2021). Cytosine, a pyrimidine-class nucleotide that is an essential metabolite for cell proliferation and survival, is commonly upregulated in the host response during viral infection and is furthermore an important mediator of viral replication (Hoelzer et al., 2008). Prior reports have shown cytosine levels to be elevated in COVID-19 patients (Liu et al., 2019), which was corroborated in our own cohort. Taken together, cytosine may be a key metabolite linking viral replication to SARS-CoV-2 induced inflammation.

Vascular-related network modules

Module 3

Although COVID-19 presents itself mainly as a respiratory disease, autopsy reports have additionally described significant vascular injury because of endothelial cell damage, microcirculatory thrombi, and impaired cellular junction integrity (Varga et al., 2020). Our network uncovered the coordination of MERTK, RAGE and thrombomodulin (TM) with the metabolite 4-hydroxyproline in COVID-19 (Figure 3). MERTK, RAGE and TM are proteins key to maintaining features of endothelial homeostasis, and hydroxyproline is a major component of collagen (Xu et al., 2019). MERTK, a member of the TAM family of receptor tyrosine kinases (RTKs), and RAGE, a pro-coagulant and inflammatory molecule were upregulated, whereas TM was downregulated, suggesting induction of a pro-coagulant and inflammatory vascular state in COVID-19. Notably, the coordination of these proteins centered around alterations in levels of hydroxyproline, where downregulation of TM directly correlated with hydroxyproline levels. A link between TM and hydroxyproline has been described, where administration of TM had anti-fibrotic effects on lung and kidney murine models (Takeshita et al., 2020). Taken together, these data link the pro-coagulant and fibrotic state of COVID-19 through thrombomodulin and hydroxyproline.

Module 4

In a second vascular-related module, we found that cathepsin D (CTSD), a lysosomal protease known to disrupt endothelial cell junctions and increase vasopermeability, was associated with a group of glycolytic metabolites including D-glucose, glycerol-3-phosphate, and lactic acid (Monickaraj et al., 2018). Notably, increased vascular permeability and glycolysis are known features of a pro-angiogenic state (Leung and Shi, 2021). Although dysregulated angiogenesis occurs in COVID-19 patients (Ackermann et al., 2020), how these events are coordinated remains poorly understood. Here, we report the concomitant upregulation of cathepsin D and intermediate products of glycolysis (D-glucose and lactic acid) in our COVID-19 cohort, which position this enzyme as a potential driver of the COVID-19 phenotype. Of note, elevated plasma activity of cathepsin D has been found in patients with type 2 diabetes, suggesting a link between abnormal vasculature and the dysregulated glucose metabolism seen in our higher risk diabetic COVID-19 patients (Chen et al., 2020). Taken together these data suggest that the disruption of vascular and glucometabolic homeostasis in COVID-19 is mediated by cathepsin D.

Serum metabolites and proteins associate with clinical indices in COVID-19

Serum metabolites and proteins within the COVID-19 patient group were assessed for correlation with relevant clinical indices including: i) demographics (sex, age, and BMI), ii) concurrent comorbidity (hypertension, preexisting kidney disease, diabetes mellitus [DM], severity of illness [SOFA]), iii) laboratory markers of inflammation (C-reactive protein [CRP], d-dimer, lymphocyte count, platelet count, ferritin), and iv) future clinical events (ARDS, death) (Figures 4A and 4D). Preexisting kidney disease was found to be associated with the highest number of both metabolites and proteins. Other clinical parameters including ARDS, death, BMI, gender, age, SOFA score, hypertension, DM, and d-dimer levels were associated with numerous metabolites but only few proteins. Conversely, platelet count and ferritin level were associated with a large number of proteins but relatively few metabolites. Detailed association results are provided in Tables S5 and S6.

Figure 4.

Figure 4

Associations of molecules with clinical indices in COVID-19

(A) Lollipop plot representing the number of altered metabolites for each analyzed parameter (∗Diabetes mellitus, ∗∗C-reactive protein, ∗∗∗Acute respiratory distress syndrome, represented by whether patients were intubated).

(B) Heatmap indicating the number of differentially expressed metabolites associated with each clinical parameter and its significance. We included four clinical parameter categories: demographics, comorbidities, clinical events, and laboratory parameters.

(C) Volcano plot showing detailed metabolic changes correlated with death, kidney disease and C-reactive protein.

(D) Lollipop plot representing the number of altered proteins for each analyzed parameter.

(E) Heatmap indicating the number of differentially expressed proteins on each clinical parameter and its significance, using the same four categories described above. Proteins are marked according to Olink panels.

(F) Volcano plot showing detailed protein changes correlated with death, platelet count and ferritin levels.

Hierarchical clustering of clinical indices by their correlation with metabolites revealed two broad clusters: one predominantly related to laboratory markers of inflammation, and a second largely related to non-laboratory indices (Figure 4B). Volcano plots of three representative indices (death, preexisting kidney disease and CRP) are shown in Figure 4C. Seventy-five metabolites were significantly associated with death, 96 with preexisting kidney disease and 37 with CRP level. 69 metabolites were associated with both death and kidney disease and 20 were associated with death, kidney disease and CRP levels. Key metabolites associated with each of the three selected clinical indices are depicted in Figure 4C. Remarkably, hexanoylcarnitine and cytosine, which we earlier showed to be upregulated in COVID-19, were among the 20 metabolites associated with all three of these clinical indices. This finding supports the potential utility of hexanoylcarnitine and cytosine not only as biomarkers for COVID-19 but also as predictors of disease severity.

Clustering of clinical indices according to their correlation with proteins also revealed two broad clusters (Figure 4E) that were notably similar to the metabolite-derived clusters, except that age and kidney disease clustered together with laboratory markers of inflammation. Volcano plots of three representative indices (death, ferritin level and platelet count) are depicted in Figure 4F. Seventeen proteins correlated with death, 26 with platelet count and 29 with ferritin. Six proteins were associated with both death and ferritin level, 9 with death and platelet count, and 5 (ANGPT1, PDGF subunit A, PDGF subunit B, LAP TGF-beta-1, CXCL5) with death, ferritin level and platelet count (Figure 4F). ANGPT1, PDGF subunit A and PDGF subunit B are markers of endothelial injury and platelet dysfunction, whereas LAP, TGF-beta-1, and CXCL5 are markers of inflammation. The fact that these proteins correlated with known clinical indices of COVID-19 mortality and morbidity corroborates the importance of vascular injury and inflammation in COVID-19 pathogenesis, which we observed in our network analysis above.

Metabolomics signature predicts clinical outcomes in COVID-19

We devised a novel hierarchical composite outcome measure for COVID-19 severity which incorporates a series of clinical events, ranked in order of severity, that characterize both acute COVID-19 and some of the sequelae of COVID-19 seen in post-acute COVID-19 syndrome (PACS) (Nalbandian et al., 2021): in-hospital mortality, mechanical ventilation (MV) at discharge, kidney replacement therapy (KRT) at discharge, prolonged organ failure support (mechanical ventilation and/or kidney replacement therapy for more than 2 weeks), supplemental oxygen requirement, acute kidney injury and length of hospital stay (Figure 5A). Our measure represents each patient on a continuous spectrum of severity to provide more information than a dichotomous classification such as mortality. Details on the design of this score and patient numbers in each group can be found in Figure S3.

Figure 5.

Figure 5

Metabolomics signature predicts clinical outcomes of COVID-19

(A) Scheme of the hierarchical composite outcome ranging from in-hospital mortality to length of hospital stay and disposition (MV: Mechanical ventilation, KRT: Kidney replacement therapy).

(B) Dot plot comparing the predictive performance (Concordance index) of a baseline model using age, sex, and BMI and a model including baseline plus metabolites. The results show that the prediction accuracy improves when adding metabolites to the model. Dots represent median values and shaded areas are 95% confidence intervals.

(C) Bar plot showing the main altered metabolic groups included in the composite outcome score.

(D) Dot plots showing individual outcome analysis, comparing the baseline model and baseline plus metabolites added. Metabolites improve the prediction accuracy of ARDS, AKI, supplemental oxygen requirement and prolonged hospitalization.

(E) Replication analysis of our model in two independent datasets by Su et al. (Su et al., 2020) and Shen et al. (Shen et al., 2020). This analysis was performed on the reduced 6-metabolite model. The Shen et al. study developed their own model, which is plotted in black (Shen et al., 2020).

A machine learning algorithm based on an ordinal response mixed effect model with LASSO regularization was used to generate our prediction model of the composite outcome measure using serum metabolic profiles and baseline patient demographics (age, sex and BMI) as combined inputs (Figure 5B). Proteins were not included in the model, because our study had around three times more metabolomics samples than proteomics samples. The final model included 32 metabolites and achieved an average concordance-index of d = 0.69 (SE = 0.017, 95% CI of [0.65, 0.72]), which is equivalent to a receiver operating characteristics area under the curve (ROC-AUC) on a binary outcome. This was a significant improvement (p value = 3E-5) over a baseline model containing only age, sex, and BMI, which had a performance of d = 0.59 (SD = 0.02, 95% CI of [0.56, 0.64]). The 32 metabolites in the final model included 14 amino acids, six nucleotides, six lipids and two metabolites related to energy metabolism (Figure 5C). Interestingly, our metabolite-based model showed improvement over the baseline model not only for predicting the composite outcome, but also for predicting some of its individual components (i.e., intubation, AKI, supplemental oxygen requirement, and length of hospital stay, Figure 5D).

We assessed the tradeoff between the number of metabolites incorporated into the model and the model’s prediction performance across a range of included metabolites. Our analysis revealed that most of the predictive power (d ∼ 0.68) was already achieved with the first six metabolites (cis-aconitic acid, hydrocinnamic acid, pantothenic acid, 7-methylguanosine, citrulline, methionine sulfoxide), after which prediction performance did not significantly improve (Figure S4). This finding suggests that a targeted assay of six metabolites could predict disease severity with this accuracy, although independent validation in other datasets remains to be established.

As a validation step, we tested the performance of our reduced six-metabolite model on blood metabolomics data from two previously published studies by Su et al. (2020) and Shen et al. (2020) (Figure 5E). The Su study reported COVID-19 severity using a WHO-based score with seven ordinal levels from mild to severe. Our model achieved a concordance index of d = 0.83, again outperforming a model that just consisted of age, sex, and BMI (Figure 5E, left). The Shen study differentiated two groups of COVID-19 patients, mild and severe. That study also developed a prediction model, and we extracted their patient prediction scores from the paper to be able to calculate a concordance index for comparison. The models were tested on two test cohorts, called ‘C2’ and ‘C3’ from the original publication (Figure 5E, right). Our 6-metabolite model consistently outperformed both the score from Shen et al. as well as the baseline model with age, sex, and BMI. Notably, overall concordance scores in the Shen dataset were substantially higher than in our own dataset, both as reported in the original paper as well as in the validation of our own model on their data. We believe this effect occurs because of the complex nature of our composite outcome score as opposed to a simple yes vs. no classification of severity, the small sample size in Shen et al., and the reporting of an unvalidated training set AUC of 0.95 in their study. Overall, this analysis demonstrates that the model replicates in independent datasets with varying definitions of severity, and that our model provides slightly better results compared to the previously reported prediction model.

Discussion

The novel coronavirus has ravaged the global healthcare system because of its high transmissibility and unpredictable clinical course that often affects multiple organ systems. Moreover, the long-term consequences of COVID-19 infection remain poorly understood. A full understanding of the pathogenesis of COVID-19 will require an unraveling of the mechanisms of inflammation, immune dysfunction, endothelial cell injury and dysregulated coagulation that underlie this disease.

In our study, we used an integrative proteomic-metabolomic analysis to identify global molecular signatures specific to the acute illness of COVID-19, whereas many prior metabolomic and proteomic studies have not assessed the interplay between proteins and metabolites. Our analyses establish associations of specific inflammation and vascular injury-related proteins with various metabolites during COVID-19, which appear to link inflammation with mitochondria-dependent energy metabolism and viral replication, as well as coagulation with fibrogenesis and glycolysis.

Our discovered network modules not only provide a better understanding of disease pathogenesis, but also facilitate novel potential therapeutic targets for COVID-19. The modules identified various proteins and metabolites involved in inflammatory and vascular injury processes, such as MMP 12, Cathepsin D and RAGE which, to the best of our knowledge, have not yet been studied as targets for therapeutic intervention in COVID-19. Of note, our modules contained IL-6, which is already a mainstay of treatment for severe disease (Brown et al., 2021), and several other molecules such as carnitine, niacinamide and IFN-gamma which others have been studying in the context of COVID-19 therapies (Li et al., 2021; van Laarhoven et al., 2021; Fu et al., 2021).

There is increasing evidence that evaluating symptoms and multiple clinical outcomes during acute disease is crucial in determining the risk of long COVID-19 (Sudre et al., 2021). To the best of our knowledge, we are the first group to develop a composite outcome measure in COVID-19 using multiple clinical indices in a prediction model that assesses not only COVID-19 disease severity but also the sequelae of COVID-19 that characterize post-acute COVID-19 syndrome (PACS). Compared to dichotomous outcome measures such as death and survival, our composite outcome score reflects a broader, more holistic assessment of COVID-19 morbidity in the hospital setting. Moreover, we were able to validate the model in two independent studies, thereby demonstrating its generalizability and translational potential.

Several key strengths underlie our study cohort. As opposed to the use of healthy controls reported in other COVID-19 studies (Shen et al., 2020; Arunachalam et al., 2020; Overmyer et al., 2021; Lucas et al., 2020; Su et al., 2020), our use of non-COVID patient samples, in the same hospital during the same period between March and April 2020, allowed us to investigate the interactions highly relevant to COVID-19 pathogenesis and clinical course. Additionally, we analyzed a relatively larger cohort compared to other studies, with hundreds of samples available for both metabolomic and proteomic analysis.

In conclusion, our investigation has sought to not only define the metabolomic and proteomic signatures of COVID-19, but also to explore interactions between metabolites and proteins that can serve as a roadmap for future mechanistic studies. We have furthermore proposed a novel clinical composite outcome score that can be used in a clinical prediction model for COVID-19. Ultimately, a better understanding of the pathophysiology of COVID-19 at the molecular level may lead to short-term and long-term targeted therapies.

Limitations of the study

As alterations in proteome and metabolome were analyzed in sera but not in lung tissues or bronchoalveolar lavage fluid, our results may not reflect what occurs at tissue-specific cellular levels. Furthermore, based on the current study design and methodology, the correlative relationships we report between metabolomic, and proteomic alterations and SARS-CoV-2 outcomes should be interpreted as purely correlative rather than causal in nature. Additional studies are required to define the mechanistic roles of individual molecules highlighted in this paper. Finally, as our study was only a single center investigation, our results will need to be validated in other cohorts.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Biological samples

Human Serum Samples Weill Cornell Medicine IRB Protocol #19-10020914 – NewYork-Presbyterian Core Laboratory N/A

Chemicals, peptides, and recombinant proteins

Methanol (LC/MS grade) Fisher scientific A456-4
Water (LC/MS grade) Fisher scientific W6-4
Acetonitrile (LC/MS grade) Fisher scientific A955-4
Ammonium acetate Fluka 631-61-8
Ammonium hydroxide solution Sigma-Aldrich 338818-100ML

Deposited data

Metabolomics, proteomics, and clinical data https://doi.org/10.6084/m9.figshare.19115972 N/A

Software and algorithms

XCalibur 4.1 ThermoFisher Scientific N/A

Other

Q Exactive Orbitrap mass spectrometer ThermoFisher Scientific N/A
Vanquish UPLC system ThermoFisher Scientific N/A
SeQuant ZIC-pHILIC column Millipore Sigma 1504600001

Resource availability

Lead contact

Further information and requests for resources should be directed to and will be fulfilled by the lead contact, Jan Krumsiek.

Materials availability

This study did not generate new unique reagents.

Experimental model and subject details

Cohort description and ethical approvals

This is a single-center prospective analysis of one cohort comparing hospitalized COVID-19 patients and non-COVID-19 controls. Our cohort was comprised of 330 patients with confirmed SARS-CoV-2 RT-PCR, and 97 non-COVID-19 controls with negative RT-PCR results who were hospitalized at the NewYork-Presbyterian Hospital/Weill Cornell Medical Center between March and April 2020. Remnant serum samples were matched with selected patients after which patients were deidentified. Controls were randomly selected patients admitted to the hospital with symptoms suspicious for COVID-19, but with negative SARS-CoV-2 RT-PCR. Seventy nine percent of control group patients had shortness of breath, fever, cough or chest pain which are commonly seen in COVID-19. Children (less than 18 years old) and pregnant women (confirmed by a positive beta-HCG test and/or medical records) were excluded. A full description of the cohort can be found in Tables 1, S1 and S2. The collection and analysis of the data and human samples satisfied the principles outlined in the Declaration of Helsinki and were approved by the local IRB (Weill Cornell Medicine IRB, protocol #19-10020914), with a waiver of informed consent.

Method details

Data collection

Data were obtained from the Weill Cornell Medicine COVID Institutional Data Repository (COVID-IDR), which is a high-quality registry of COVID-19 patients at NewYork-Presbyterian - Cornell with laboratory confirmed SARS-CoV-2 RT-PCR. The COVID-IDR houses both manually and automatically extracted Electronic Health Record (EHR) data. Demographics, comorbidities, and important dates of patients’ hospital course (admission, intubation, extubation, discharge, death) were extracted by a team of medical professionals and stored in the COVID-IDR. Laboratory tests, ventilation parameters, vital signs, and respiratory variables were additionally available via automated extraction through the Weill Cornell-Critical Care Database for Advanced Research (WC-CEDAR) within the COVID-IDR. WC-CEDAR (Schenck et al., 2021) is a critical care database originally designed to automatically extract, transform, and store EHR data on Intensive Care Unit (ICU) patients; it was expanded to include all hospitalized patients during New York City’s COVID-19 surge. Data not available within WC-CEDAR were manually extracted and recorded in REDCap.

Sample handling

Standard practices for serum collection and storage at the NewYork-Presbyterian/Weill Cornell Medical College include collecting venous blood into a serum-separating tube (SST), and serum is obtained by centrifuging at 1,500g for 7 minutes as soon as possible with a maximum time limit of 2 hours from the time of collection. The specimens are typically stored at 4°C for 1 to 5 days before coded/de-identified and then transferred into a −80°C freezer. Samples were thawed and inactivated in different ways: for the metabolomic analysis, x3 sample volume of HPLC grade ethanol were added; for the proteomic analysis, the samples were heat-inactivated in a water bath of 56°C for 15 minutes. After these processes, the samples were stored again at −80°C until the analyses were performed.

Metabolomic profiling

Targeted metabolite profiling was performed according to a method described in a previous publication (Chen et al., 2016). For metabolite extraction, 80 μL of pre-chilled methanol (−80 °C) was added to 20 μL of serum. The sample was vortexed for 1 min and then incubated at −80 °C for 2 hours before it was centrifuged at 20,000 g for 15 min at 4 °C to remove the pellet. The supernatant was transferred to a new Eppendorf tube and dried completely with a Speedvac for 30 min (with heat off). The dried sample was redissolved in HPLC grade water before it was applied to the hydrophilic interaction chromatography LC-MS. The sample injection order was randomized.

Metabolites were measured on a Q Exactive Orbitrap mass spectrometer (Thermo Scientific), which was coupled to a Vanquish UPLC system (Thermo Scientific) via an Ion Max ion source with a HESI II probe (Thermo Scientific). A Sequant ZIC-pHILIC column (2.1 mm i.d. × 150 mm, particle size of 5 μm, Millipore Sigma) was used for separation of metabolites. A 2.1 × 20 mm guard column with the same packing material was used for protection of the analytical column. Flow rate was set at 150 μL/min. Buffers consisted of 100% acetonitrile for mobile phase A, and 0.1% NH4OH/20 mM CH3COONH4 in water for mobile phase B. The chromatographic gradient ran from 85% to 30% A in 20 min followed by a wash with 30% A and re-equilibration at 85% A. The column temperature was set to 30 °C and the autosampler temperature was set to 4 °C. The Q Exactive was operated in full scan, polarity-switching mode with the following parameters: the spray voltage 3.0 kV, the heated capillary temperature 300 °C, the HESI probe temperature 350 °C, the sheath gas flow 40 units, the auxiliary gas flow 15 units. MS data acquisition was performed in the m/z range of 70–1,000, with 70,000 resolution (at 200 m/z). The AGC target was 3,000,000 and the maximum injection time was 100 ms. The MS data was processed using XCalibur 4.1 (Thermo Scientific) to extract the metabolite signal intensity for relative quantitation. Metabolites were identified using an in-house library established using chemical standards. Identification required exact mass (within 5ppm) and standard retention times. As a quality control, a mixture of standard compounds was injected thirteen times throughout the LC-MS data acquisition process for monitoring of the stability of LC retention time, the MS mass accuracy and the signal intensity. The median coefficient of variation for metabolite quantitation based on the quality control sample was 0.061. The data from the serum samples showed that both retention time and mass accuracy were highly stable throughout the experiment (Figure S5).

Proteomic profiling

Proteomics analysis was performed using the Olink platform (Uppsala, Sweden) at the Proteomics Core of Weill Cornell Medicine-Qatar, according to manufacturer’s instructions. We used the Inflammation, Cardiovascular II and Cardiovascular III panels. High throughput real-time PCR of reporter DNA lined to protein specific antibodies was performed on a 96-well integrated fluidic circuits chip (Fluidigm, San Francisco, CA). Each sample was spiked with quality controls to monitor the incubation, extension, and detection steps of the assay. Additionally, samples representing external, negative, and inter-plate controls were included in each analysis run. From raw data, real time PCR cycle threshold (Ct) values were extracted using Fluidigm reverse transcription polymerase chain reaction (RT-PCR) analysis software at a quality threshold of 0.5 and linear baseline correction. Ct values were further processed using the Olink NPX manager software (Olink, Uppsala, Sweden). Here, log2-transformed Ct values from each sample and analyte were normalized based on spiked-in extension controls and scale-inverted to obtain Normalized log2-scaled Protein Expression (NPX) values. NPX values were adjusted based on the median of inter plate controls (IPC) for each protein and intensity median scaled between all samples and plates.

Each metabolite and protein was annotated with pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (Kanehisa et al., 2021).

Data preprocessing

Children, pregnant women, and samples after intubation were excluded from all analyses. The metabolomics data was measured in three different batches. For each batch, data was preprocessed by filtering out samples with more than 50% missing values, followed by filtering out metabolites with more than 25% missing values, probabilistic quotient normalization (Dieterle et al., 2006), and log2 transformation. Two extreme outlier metabolites were manually removed (phosphorylcholine and adenosine monophosphate). The next step was to merge the different batches into a joint dataset. Batch 3 contained only control patients and could thus not be simply added by batch correction. To avoid issues created by this imbalanced experimental design, batches 2 and 3 contained an overlapping set of samples which were used for an anchor-based normalization by dividing each metabolite in batch 3 by the mean fold change of the overlapping samples. The anchor samples from batch 3 were then deleted and the batches combined using median-based batch correction (Do et al., 2018). Overall, this procedure eliminates batch effects and allows for a batch that only contains control samples. Missing values were then imputed using the k-nearest neighbor approach (Do et al., 2018).

Proteomics data preprocessing included the same steps of filtering, quotient normalization, logging, and missing value imputation with identical parameters as for the metabolomics data. Ten proteins were measured as duplicates on the Olink platform, so their expression values were averaged.

Quantification and statistical analysis

Statistical analysis

Differential expression of metabolites and proteins for both the COVID-19 vs. control analysis as well as the clinical parameter analysis within the COVID-19 cohort was assessed using the following linear mixed effect model:

met_protoutcome+time+(1|patient),

where met_prot is each individual metabolite or protein, outcome is either COVID-19 yes/no or the value of a clinical parameter, time is the day of sample taking as a factor, and (1|patient) is a random effect per patient to account for repeated measurements. P-values are reported for the significance of the outcome term.

Data preprocessing and statistical analysis was performed using the “maplet” toolbox for R (Chetnik et al., 2021) (https://github.com/krumsieklab/maplet).

Network construction

The dataset was first reduced to the samples that overlap between metabolomics and proteomics (n = 227), and corrected for age, sex, BMI, and COVID-19 status (yes/no). A Gaussian Graphical Model (GGM) based network was then constructed using the GeneNet algorithm (Schäfer and Strimmer, 2005) and drawing an edge for all partial correlations with an FDR smaller than 0.2. In a second step, this network was condensed to highlight the connections between molecules that were significantly different between COVID-19 and controls. To this end, a shortest-path distance matrix between all molecules was constructed and subset to the significant molecules. A minimum spanning tree (Prim, 1957) of this matrix was then constructed to visualize a simplified network.

Composite outcome

A detailed description of the construction of the composite outcome along with patient numbers in each group can be found in Figure S3.

Regularized linear mixed effect ordinal regression model

A new regression model was developed to deal with an ordinal outcome, repeated measurements, and feature selection with regularization. Repeated measurements are handled as a random effect, while age, sex, BMI, and metabolites are treated as fixed effects. The metabolites are penalized using an L1 LASSO-type regularization to obtain a sparse solution. The model is fitted using an mixed-effect ordinal regression model with complementary log-log link function (McCullagh, 1980), using maximum likelihood (ML) estimation as proposed by Ripatti and Pamgren (Ripatti and Palmgren, 2000). An optimal LASSO penalty parameter was estimated through an iterative algorithm for maximizing the Laplace approximation of the integrated model likelihood. This approach was adapted from Therneau (Therneau, 2020), which was originally developed for mixed effect Cox models. To obtain an unbiased estimate of the model performance, leave-one-out-cross-validation across the entire dataset was performed. The added value of metabolomics data over baseline clinical data was assessed by comparing the final model with a model only consisting of age, sex, and BMI.

Validation datasets

Metabolomics data were downloaded from Su et al. (2020) (n = 121) and Shen et al. (2020) (containing two validation sets, n = 10 and n = 19). The Su dataset contained all six metabolites from our reduced model as well as age, sex and BMI as baseline parameters. The Shen study also covered age, sex and BMI, and the first test dataset (“C2”) contained all six metabolites. The second test dataset (“C3”) was measured using a targeted assay of only 7 metabolites and 22 proteins, and the metabolites did not overlap with our model metabolites. Thus, we had to follow a more complex procedure for validation. We first applied our risk score in their training cohort, which contained all metabolites. In the training cohort, this score was then regressed on the available measurements in C3, i.e. modeling ‘score ∼ metabolite1 + … + metabolite7 + protein1 + …+ protein22’. The coefficients from this model were then used in C3 to derive a surrogate severity score, which we evaluated in Figure 5.

Acknowledgments

JK is supported by the National Institute of Aging of the National Institutes of Health under award 1U19AG063744. SJC is supported by National Heart, Lung and Blood Institute under award K08HL138285. KS is supported by ‘Biomedical Research Program’ funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation and multiple grants from the Qatar National Research Fund (QNRF).

Author contributions

Conception and design, M.B., S.A-M., A.C.R., R.B., E.S., A.M.K.C., S.J.C., and J.K.; Analysis and Interpretation, M.B., S.A-M., A.C.R., R.B., F.S., K.L.H., H.S., R.E., L.G-E, W.S., E.B., K.C., G.Z., E.S., K.S., J.J.C., Z.Z., S.R-B., H.S.Y., M.E.C., A.M.K.C., S.J.C., and J.K.; Drafting the Manuscript for Important Intellectual Content, M.B., S.A-M., A.C.R., R.B., F.S., E.B., K.S., E.S., A.M.K.C., S.J.C., and J.K.

Declaration of interests

A.M.K.C. is a cofounder and equity stock holder for Proterris, which develops therapeutic uses for carbon monoxide. A.M.K.C. has a use patent on CO. In addition, A.M.K.C. has a patent in COPD. J.K. is scientific advisor at iollo and holds equity in Chymia LLC and IP in PsyProtix.

Published: July 15, 2022

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.isci.2022.104612.

Contributor Information

Soo Jung Cho, Email: sjc9006@med.cornell.edu.

Jan Krumsiek, Email: jak2042@med.cornell.edu.

Supplemental information

Document S1. Figures S1–S5, Tables S1 and S2
mmc1.pdf (1.4MB, pdf)
Table S3. Metabolic effects of intubation, related to Figure 2
mmc2.xlsx (7.9MB, xlsx)
Table S4. Statistical results for COVID-19 vs. control analysis, related to Figure 2
mmc3.xlsx (63.5KB, xlsx)
Table S5. Statistical results for clinical manifestations (metabolomics), related to Figure 4
mmc4.xlsx (166.5KB, xlsx)
Table S6. Statistical results for clinical manifestations (proteomics), related to Figure 4
mmc5.xlsx (304.2KB, xlsx)
Data S1. Cytoscape version of multi-omics network, related to Figure 3
mmc6.zip (125.2KB, zip)

Data and code availability

References

  1. Ackermann M., Verleden S.E., Kuehnel M., Haverich A., Welte T., Laenger F., Vanstapel A., Werlein C., Stark H., Tzankov A., et al. Pulmonary vascular endothelialitis, thrombosis, and angiogenesis in Covid-19. N. Engl. J. Med. 2020;383:120–128. doi: 10.1056/nejmoa2015432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arons M.M., Hatfield K.M., Reddy S.C., Kimball A., James A., Jacobs J.R., Taylor J., Spicer K., Bardossy A.C., Oakley L.P., et al. Presymptomatic SARS-CoV-2 infections and transmission in a skilled nursing facility. N. Engl. J. Med. 2020;382:2081–2090. doi: 10.1056/NEJMoa2008457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arunachalam P.S., Wimmers F., Mok C.K.P., Perera R.A.P.M., Scott M., Hagan T., Sigal N., Feng Y., Bristow L., Tak-Yin Tsang O., et al. Systems biological assessment of immunity to mild versus severe COVID-19 infection in humans. Science. 2020;369:1210–1220. doi: 10.1126/science.abc6261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barberis E., Timo S., Amede E., Vanella V.V., Puricelli C., Cappellano G., Raineri D., Cittone M.G., Rizzi E., Pedrinelli A.R., et al. Large-scale plasma analysis revealed new mechanisms and molecules associated with the host response to SARS-CoV-2. Int. J. Mol. Sci. 2020;21:8623. doi: 10.3390/ijms21228623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Benedetti E., Pučić-Baković M., Keser T., Wahl A., Hassinen A., Yang J.Y., Liu L., Trbojević-Akmačić I., Razdorov G., Štambuk J., et al. Network inference from glycoproteomics data reveals new reactions in the IgG glycosylation pathway. Nat. Commun. 2017;8:1483. doi: 10.1038/s41467-017-01525-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bilaloglu S., Aphinyanaphongs Y., Jones S., Iturrate E., Hochman J., Berger J.S. Thrombosis in hospitalized patients with COVID-19 in a New York city Health system. JAMA. 2020:799–801. doi: 10.1001/jama.2020.13372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brown M.J., Alazawi W., Kanoni S. Interleukin-6 receptor antagonists in critically ill patients with Covid-19. N. Engl. J. Med. 2021;384:1491–1502. doi: 10.1056/NEJMc2108482. [DOI] [PubMed] [Google Scholar]
  8. Chen J., Wu C., Wang X., Yu J., Sun Z. The impact of COVID-19 on blood glucose: a systematic review and meta-analysis. Front. Endocrinol. 2020;11:574541. doi: 10.3389/fendo.2020.574541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen W.W., Freinkman E., Wang T., Birsoy K., Sabatini D.M. Absolute quantification of matrix metabolites reveals the dynamics of mitochondrial metabolism. Cell. 2016;166:1324–1337.e11. doi: 10.1016/j.cell.2016.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chetnik K., Benedetti E., Gomari D.P., Schweickart A., Batra R., Buyukozkan M., Wang Z., Arnold M., Zierer J., Suhre K., Krumsiek J. maplet: an extensible R toolbox for modular and reproducible omics pipelines. arXiv. 2021 doi: 10.1093/bioinformatics/btab741. https://ui.adsabs.harvard.edu/abs/2021arXiv210504305C Preprint at. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Di B., Jia H., Luo O.J., Lin F., Li K., Zhang Y., Wang H., Liang H., Fan J., Yang Z. Identification and validation of predictive factors for progression to severe COVID-19 pneumonia by proteomics. Signal Transduct. Target. Ther. 2020;5:217. doi: 10.1038/s41392-020-00333-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dieterle F., Ross A., Schlotterbeck G., Senn H. Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. Anal. Chem. 2006;78:4281–4290. doi: 10.1021/ac051632c. [DOI] [PubMed] [Google Scholar]
  13. Do K.T., Wahl S., Raffler J., Molnos S., Laimighofer M., Adamski J., Suhre K., Strauch K., Peters A., Gieger C., et al. Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies. Metabolomics. 2018;14:128. doi: 10.1007/s11306-018-1420-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Elezkurtaj S., Greuel S., Ihlow J., Michaelis E.G., Bischoff P., Kunze C.A., Sinn B.V., Gerhold M., Hauptmann K., Ingold-Heppner B., et al. Causes of death and comorbidities in hospitalized patients with COVID-19. Sci. Rep. 2021;11:4263. doi: 10.1038/s41598-021-82862-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fu W., Lei C., Ma Z., Qian K., Li T., Zhao J., Hu S. CAR macrophages for SARS-CoV-2 immunotherapy. Front. Immunol. 2021;12:669103. doi: 10.3389/fimmu.2021.669103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gisby J., Clarke C.L., Medjeral-Thomas N., Malik T.H., Papadaki A., Mortimer P.M., Buang N.B., Lewis S., Pereira M., Toulza F., et al. Longitudinal proteomic profiling of dialysis patients with COVID-19 reveals markers of severity and predictors of death. Elife. 2021;10:e64827. doi: 10.7554/elife.64827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Guan W.J., Ni Z.Y., Hu Y., Liang W.H., Ou C.Q., He J.X., Liu L., Shan H., Lei C.L., Hui D.S.C., et al. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 2020;382:1708–1720. doi: 10.1056/NEJMoa2002032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hirsch J.S., Ng J.H., Ross D.W., Sharma P., Shah H.H., Barnett R.L., Hazzan A.D., Fishbane S., Jhaveri K.D., Abate M., et al. Acute kidney injury in patients hospitalized with COVID-19. Kidney Int. 2020;98:209–218. doi: 10.1016/j.kint.2020.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hoelzer K., Shackelton L.A., Parrish C.R. Presence and role of cytosine methylation in DNA viruses of animals. Nucleic Acids Res. 2008;36:2825–2837. doi: 10.1093/nar/gkn121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Iastrebner M., Castro J., García Espina E., Lettieri C., Payaslian S., Cuesta M.C., Gutiérrez Fernández P., Mandrile A., Contreras A.P., Gervasoni S., et al. Ruxolitinib in severe COVID-19: results of a multicenter, prospective, single arm, open-label clinical study to investigate the efficacy and safety of ruxolitinib in patients with COVID-19 and severe acute respiratory syndrome. Rev. Fac. Cien. Med. Univ. Nac. Cordoba. 2021;78:294–302. doi: 10.31053/1853.0605.v78.n3.32800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kalil A.C., Patterson T.F., Mehta A.K., Tomashek K.M., Wolfe C.R., Ghazaryan V., Marconi V.C., Ruiz-Palacios G.M., Hsieh L., Kline S., et al. Baricitinib plus remdesivir for hospitalized adults with Covid-19. N. Engl. J. Med. 2020;384:795–807. doi: 10.1056/nejmoa2031994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Kanehisa M., Furumichi M., Sato Y., Ishiguro-Watanabe M., Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49:D545–D551. doi: 10.1093/nar/gkaa970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Krumsiek J., Suhre K., Evans A.M., Mitchell M.W., Mohney R.P., Milburn M.V., Wägele B., Römisch-Margl W., Illig T., Adamski J., et al. Mining the unknown: a systems approach to metabolite identification combining genetic and metabolic information. PLoS Genet. 2012;8:e1003005. doi: 10.1371/journal.pgen.1003005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Krumsiek J., Suhre K., Illig T., Adamski J., Theis F.J. Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data. BMC Syst. Biol. 2011;5:21. doi: 10.1186/1752-0509-5-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. León G., Herrera M., Vargas M., Arguedas M., Sánchez A., Segura Á., Gómez A., Solano G., Corrales-Aguilar E., Risner K., et al. Development and characterization of two equine formulations towards SARS-CoV-2 proteins for the potential treatment of COVID-19. Sci. Rep. 2021;11:9825. doi: 10.1038/s41598-021-89242-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Leung S.W.S., Shi Y. The glycolytic process in endothelial cells and its implications. Acta Pharmacol. Sin. 2021:251–259. doi: 10.1038/s41401-021-00647-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Li C., Ou R., Wei Q., Shang H. Carnitine and COVID-19 susceptibility and severity: a mendelian randomization study. Front. Nutr. 2021;8:780205. doi: 10.3389/fnut.2021.780205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Liu M., O’Connor R.S., Trefely S., Graham K., Snyder N.W., Beatty G.L. Metabolic rewiring of macrophages by CpG potentiates clearance of cancer cells and overcomes tumor-expressed CD47−mediated ‘don’t-eat-me’ signal. Nat. Immunol. 2019;20:265–275. doi: 10.1038/s41590-018-0292-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Lucas C., Wong P., Klein J., Castro T.B.R., Silva J., Sundaram M., Ellingson M.K., Mao T., Oh J.E., Israelow B., et al. Longitudinal analyses reveal immunological misfiring in severe COVID-19. Nature. 2020;584:463–469. doi: 10.1038/s41586-020-2588-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. McCullagh P. Regression models for ordinal data. J. Roy. Stat. Soc. B. 1980;42:109–127. doi: 10.1111/j.2517-6161.1980.tb01109.x. [DOI] [Google Scholar]
  31. McGonagle D., Ramanan A.V., Bridgewood C. Immune cartography of macrophage activation syndrome in the COVID-19 era. Nat. Rev. Rheumatol. 2021;17:145–157. doi: 10.1038/s41584-020-00571-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Meoni G., Ghini V., Maggi L., Vignoli A., Mazzoni A., Salvati L., Capone M., Vanni A., Tenori L., Fontanari P., et al. Metabolomic/lipidomic profiling of COVID-19 and individual response to tocilizumab. PLoS Pathog. 2021;17:e1009243. doi: 10.1371/journal.ppat.1009243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Monickaraj F., Mcguire P., Das A. Cathepsin D plays a role in endothelial–pericyte interactions during alteration of the blood–retinal barrier in diabetic retinopathy. FASEB J. 2018;32:2539–2548. doi: 10.1096/fj.201700781rr. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Nalbandian A., Sehgal K., Gupta A., Madhavan M.V., Mcgroder C., Stevens J.S., Cook J.R., Nordvig A.S., Shalev D., Sehrawat T.S., et al. Post-acute COVID-19 syndrome. Nat. Med. 2021;27:601–615. doi: 10.1038/s41591-021-01283-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Overmyer K.A., Shishkova E., Miller I.J., Balnis J., Bernstein M.N., Peters-Clarke T.M., Meyer J.G., Quan Q., Muehlbauer L.K., Trujillo E.A., et al. Large-scale multi-omic analysis of COVID-19 severity. Cell Syst. 2021;12:23–40.e7. doi: 10.1016/j.cels.2020.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Prim R.C. Shortest connection networks and some generalizations. Bell Syst. Tech. J. 1957;36:1389–1401. doi: 10.1002/j.1538-7305.1957.tb01515.x. [DOI] [Google Scholar]
  37. Ripatti S., Palmgren J. Estimation of multivariate frailty models using penalized partial likelihood. Biometrics. 2000;56:1016–1022. doi: 10.1111/j.0006-341x.2000.01016.x. [DOI] [PubMed] [Google Scholar]
  38. Roberts I., Wright Muelas M., Taylor J.M., Davison A.S., Xu Y., Grixti J.M., Gotts N., Sorokin A., Goodacre R., Kell D.B. Untargeted metabolomics of COVID-19 patient serum reveals potential prognostic markers of both severity and outcome. Metabolomics. 2021;18:6. doi: 10.1007/s11306-021-01859-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Schäfer J., Strimmer K. A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat. Appl. Genet. Mol. Biol. 2005;4:Article32. doi: 10.2202/1544-6115.1175. [DOI] [PubMed] [Google Scholar]
  40. Schenck E.J., Hoffman K.L., Cusick M., Kabariti J., Sholle E.T., Campion T.R. Critical carE database for advanced Research (CEDAR): an automated method to support intensive care units with electronic Health record data. J. Biomed. Inf. 2021;118:103789. doi: 10.1016/j.jbi.2021.103789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Shen B., Yi X., Sun Y., Bi X., Du J., Zhang C., Quan S., Zhang F., Sun R., Qian L., et al. Proteomic and metabolomic characterization of COVID-19 patient sera. Cell. 2020;182:59–72.e15. doi: 10.1016/j.cell.2020.05.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Soliman S., Faris M.E., Ratemi Z., Halwani R. Switching host metabolism as an approach to dampen SARS-CoV-2 infection. Ann. Nutr. Metab. 2020;76:297–303. doi: 10.1159/000510508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Stanley C.A., Hale D.E., Berry G.T., Deleeuw S., Boxer J., Bonnefont J.-P. A deficiency of carnitine–acylcarnitine translocase in the inner mitochondrial membrane. N. Engl. J. Med. 1992;327:19–23. doi: 10.1056/nejm199207023270104. [DOI] [PubMed] [Google Scholar]
  44. Su Y., Chen D., Yuan D., Lausted C., Choi J., Dai C.L., Voillet V., Duvvuri V.R., Scherler K., Troisch P., et al. Multi-omics resolves a sharp disease-state shift between mild and moderate COVID-19. Cell. 2020;183:1479–1495.e20. doi: 10.1016/j.cell.2020.10.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Sudre C.H., Murray B., Varsavsky T., Graham M.S., Penfold R.S., Bowyer R.C., Pujol J.C., Klaser K., Antonelli M., Canas L.S., et al. Attributes and predictors of long COVID. Nat. Med. 2021;27:626–631. doi: 10.1038/s41591-021-01292-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Takeshita A., Yasuma T., Nishihama K., D’Alessandro-Gabazza C.N., Toda M., Totoki T., Okano Y., Uchida A., Inoue R., Qin L., et al. Thrombomodulin ameliorates transforming growth factor-β1–mediated chronic kidney disease via the G-protein coupled receptor 15/Akt signal pathway. Kidney Int. 2020;98:1179–1192. doi: 10.1016/j.kint.2020.05.041. [DOI] [PubMed] [Google Scholar]
  47. Tang N., Li D., Wang X., Sun Z. Abnormal coagulation parameters are associated with poor prognosis in patients with novel coronavirus pneumonia. J. Thromb. Haemost. 2020;18:844–847. doi: 10.1111/jth.14768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Therneau T. 2020. Coxme and the Laplace approximation [Online]https://cran.r-project.org/web/packages/coxme/vignettes/laplace.pdf [Google Scholar]
  49. Thomas T., Stefanoni D., Reisz J.A., Nemkov T., Bertolone L., Francis R.O., Hudson K.E., Zimring J.C., Hansen K.C., Hod E.A., et al. COVID-19 infection alters kynurenine and fatty acid metabolism, correlating with IL-6 levels and renal status. JCI Insight. 2020;5:140327. doi: 10.1172/jci.insight.140327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tokunaga R., Zhang W., Naseem M., Puccini A., Berger M.D., Soni S., Mcskane M., Baba H., Lenz H.-J. CXCL9, CXCL10, CXCL11/CXCR3 axis for immune activation - a target for novel cancer therapy. Cancer Treat. Rev. 2018;63:40–47. doi: 10.1016/j.ctrv.2017.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Törnquist K., Asghar M.Y., Srinivasan V., Korhonen L., Lindholm D. Sphingolipids as modulators of SARS-CoV-2 infection. Front. Cell Dev. Biol. 2021;9:689854. doi: 10.3389/fcell.2021.689854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Valdés A., Moreno L.O., Rello S.R., Orduña A., Bernardo D., Cifuentes A. Metabolomics study of COVID-19 patients in four different clinical stages. Sci. Rep. 2022;12:1650. doi: 10.1038/s41598-022-05667-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. van Laarhoven A., Kurver L., Overheul G.J., Kooistra E.J., Abdo W.F., van Crevel R., Duivenvoorden R., Kox M., Ten Oever J., Schouten J., et al. Interferon gamma immunotherapy in five critically ill COVID-19 patients with impaired cellular immunity: a case series. Med (N Y) 2021;2:1163–1170.e2. doi: 10.1016/j.medj.2021.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Varga Z., Flammer A.J., Steiger P., Haberecker M., Andermatt R., Zinkernagel A.S., Mehra M.R., Schuepbach R.A., Ruschitzka F., Moch H. Endothelial cell infection and endotheliitis in COVID-19. Lancet. 2020;395:1417–1418. doi: 10.1016/s0140-6736(20)30937-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Wajner M., Amaral A.U. Mitochondrial dysfunction in fatty acid oxidation disorders: insights from human and animal studies. Biosci. Rep. 2015;36:e00281. doi: 10.1042/bsr20150240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Wiersinga W.J., Rhodes A., Cheng A.C., Peacock S.J., Prescott H.C. Pathophysiology, transmission, diagnosis, and treatment of coronavirus disease 2019 (COVID-19): a review. JAMA. 2020;324:782–793. doi: 10.1001/jama.2020.12839. [DOI] [PubMed] [Google Scholar]
  57. Wu P., Chen D., Ding W., Wu P., Hou H., Bai Y., Zhou Y., Li K., Xiang S., Liu P., et al. The trans-omics landscape of COVID-19. Nat. Commun. 2021;12:4543. doi: 10.1038/s41467-021-24482-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Xu S., Gu M., Wu K., Li G. Unraveling the role of hydroxyproline in maintaining the thermal stability of the collagen triple helix structure using simulation. J. Phys. Chem. B. 2019;123:7754–7763. doi: 10.1021/acs.jpcb.9b05006. [DOI] [PubMed] [Google Scholar]
  59. Zhang C., Shi L., Wang F.-S. Liver injury in COVID-19: management and challenges. Lancet Gastroenterol. Hepatol. 2020;5:428–430. doi: 10.1016/s2468-1253(20)30057-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Zhou F., Yu T., Du R., Fan G., Liu Y., Liu Z., Xiang J., Wang Y., Song B., Gu X., et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–1062. doi: 10.1016/s0140-6736(20)30566-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S5, Tables S1 and S2
mmc1.pdf (1.4MB, pdf)
Table S3. Metabolic effects of intubation, related to Figure 2
mmc2.xlsx (7.9MB, xlsx)
Table S4. Statistical results for COVID-19 vs. control analysis, related to Figure 2
mmc3.xlsx (63.5KB, xlsx)
Table S5. Statistical results for clinical manifestations (metabolomics), related to Figure 4
mmc4.xlsx (166.5KB, xlsx)
Table S6. Statistical results for clinical manifestations (proteomics), related to Figure 4
mmc5.xlsx (304.2KB, xlsx)
Data S1. Cytoscape version of multi-omics network, related to Figure 3
mmc6.zip (125.2KB, zip)

Data Availability Statement


Articles from iScience are provided here courtesy of Elsevier

RESOURCES