Dear Editor,
The early diagnosis and prognosis of COVID-19 remain major challenges. At present, there are few studies focusing on the changes of sera proteome between active and recovered COVID-19 patients, and almost no biomarkers in sera are used to predict the function disorder of the injured tissues and organs in COVID-19 patients. In this study, we applied sera proteomics in the active and recovered COVID-19 patients, as well as healthy volunteers, combining with pathological staining of the patients’ tissues to identify the potential prognosis biomarkers that can reflect specific organ damage.
We collected sera samples from 32 patients with COVID-19, including 15 patients with both active and recovered samples, and 19 healthy donors (HDs) (Supplementary Table S1), and we used the data-independent acquisition (DIA) method to analyze these samples (Fig. 1a). After performing quality control analyses, 426, 455 and 460 proteins were identified in the sera from HDs, COVID-19 patients, and recovered COVID-19 patients, respectively (Fig. 1b, Supplementary Fig. S1, and Supplementary Table S2). There were 93 proteins differentially expressed (Supplementary Fig. S2a and Table S3) in the COVID-19 patients compared with the HDs. The proteins upregulated in COVID-19 patients were found to be involved in the inflammatory response, vesicle transport and immune response (Supplementary Fig. S2b and Fig. S3a–c). By contrast, the proteins downregulated in COVID-19 patients were found to be involved in the lipid-related processes, wound healing and coagulation (Supplementary Fig. S2b and Fig. S3d–f).
We found that the differentially expressed proteins were enriched in various tissues, especially in liver and lung (Supplementary Fig. S4a, b), which are the target organs of SARS-CoV-2 infection.1 Then, we conducted the protein-protein interaction network of these lung- and liver-enriched differentially expressed proteins. Among the proteins enriched in liver, several proteins were involved in liver related functions, including the cell proliferation, retinoid, lipid and hyaluronan metabolism, vitamin transport, etc (Fig. 1c). These proteins included CD14 and RBP4. CD14, which is expressed in liver Kupffer cells, can be induced under lipopolysaccharide stimulation and is upregulated in inflammatory liver diseases. RBP4 is uniquely synthesized by hepatocytes, and its disorder is associated with several diseases, including retinal dystrophy, iris coloboma, comedogenic acne syndrome, and microphthalmia.2 The patients with COVID-19 had increased CD14 expression and decreased RBP4 expression (Fig. 1d), suggesting that these patients may have increased risk of inflammatory liver diseases and eye dysfunction.
Further, immunofluorescence staining of the lungs (Fig. 1e and Supplementary Fig. S4c) and livers (Fig. 1d and Supplementary Fig. S4c) showed decreased expression of tenascin XB, an essential regulator of collagen deposition by dermal fibroblasts, in patients with COVID-19, indicating an increased risk of lung/liver fibrosis in COVID-19 patients. Profilin-1 (PFN1), an actin-regulatory protein important in viral transcription activation and airway hyperresponsiveness, had been previously reported to be downregulated in non-severe COVID-19 patients compared to patients without COVID-19.3 By contrast, we found PFN1 to be upregulated not only in sera but also in tissues of the COVID-19 patients (Fig. 1d, e and Supplementary Fig. S4c), indicating the reliability of the biomarkers identified.
The principal coordinate analysis (PCoA) was performed, and the results indicated that the protein expression pattern in the recovered patients was closer to that in the HDs than that in the COVID-19 patients (Fig. 1f). This finding indicated that the sera molecular profile of recovered patients has returned to close to that of HDs. Then, unsupervised hierarchical clustering analysis (HCA) was performed, which resulted in the classification of the differentially expressed proteins among the three groups into three clusters (Fig. 1g and Supplementary Fig. S5). Cluster 1, consisting of the proteins that were downregulated in COVID-19 patients compared with HDs and recovered COVID-19 patients. These proteins could be used as prognosis biomarkers for monitoring the recovery process. Cluster 2, representing the upregulated proteins in the COVID-19 group that were not restored to the HDs levels in the recovered group, including antibodies, which were likely produced in response to SARS-CoV-2 infection. Among these antibodies, IGLC2, IGLC7, IGLV3-10, and IGLV3-19 had been previously reported to be upregulated in the B-cells of COVID-19 patients during COVID-19 pneumonia, and IGLC7 had been selected as a marker to distinguish severe and non-severe cases of COVID-19.4 Cluster 3, consisting of proteins upregulated in the COVID-19 patient compared to the HDs and recovered COVID-19 patient groups. As the expression of these proteins was upregulated after SARS-CoV-2 infection, while returned to HDs levels after recovery, they might be potential prognosis biomarkers.
We combined our dataset (proteins in Cluster 1 and 3 in Fig. 1g) with the published dataset,3,5 and we used HCA to identify trends in protein expression that might suggest COVID-19 severity (non-severe to severe COVID-19). The HCA results revealed two groups of proteins; the first group contained 23 sera proteins whose expression increased from HDs, patients with non-severe COVID-19, to patients with severe COVID-19 (Supplementary Fig. S6). Among them, 11 proteins were also included in Cluster 3 from our results (Fig. 1h). The second group contained 34 proteins whose expression decreased from HDs, patients with non-severe COVID-19 to patients with severe COVID-19 (Supplementary Fig. S6). Among them, 30 proteins were also included in Cluster 1 of our results (Fig. 1i). Next, we verified several of these potential biomarkers using a parallel reaction monitoring (PRM) strategy (Supplementary Table S4). The expression levels of the 10 proteins in the COVID-19 group were restored to the corresponding levels in HDs after disease recovery (Fig. 1j and Supplementary Fig. S7a), which further highlighted their values as prognostic biomarkers. Besides, the expression of three of these markers (SAA1, ORM1, and FGA) was confirmed in the lung (Fig. 1k) and/or liver (Supplementary Fig. S7b) tissues of COVID-19 patients using in situ histoimmunofluorescence. Among them, SAA1 and FGA were upregulated in the portal vein of liver, and CYP3A4 (a drug metabolizing enzyme, which is specifically expressed in central vein of liver) was low expressed in central vein of liver. Overall, these results suggested that the 41 proteins identified in these analyses might be prognostic biomarkers to monitor disease progression and can serve as quantifiable parameters to help evaluate therapeutic efficacy. In order to study whether the differentially expressed proteins in Cluster 1, 2 and 3 can indicate the disease caused by SARS-CoV-2 infection, different organ related diseases were analyzed (Supplementary Fig. S8). For example, proteins related to thrombosis (ORM1, CD14, FGA, etc.) in COVID-19 patients were upregulated, and central nervous system diseases (CETP, F13B, TF, etc.) were downregulated compared to those in recovered patients, indicating that the risk of these lung- and central nervous system-related diseases should be noted by health care providers during the treatment of COVID-19 patients.
In conclusion, this study has revealed a series of prognostic and diagnostic sera biomarkers of COVID-19, many of which are expressed in liver and lung tissues, and provided additional clues for the link between COVID-19 and multi-tissue/organ damages. More importantly, these biomarkers could provide some directions for rational evaluation of therapeutic efficacy and discovery of potential therapeutic drug targets of COVID-19.
Supplementary information
Acknowledgements
We thank Dr. Lei Zhao for his help in sample preparing, M.S. Dunqin Gao for her help in staining experiments, and Prof. Yunping Zhu for his help in proteomics data analysis. This work was supported by the National Science and Technology Major Project (2018ZX09711003) and the CAMS Innovation Fund for Medical Sciences (CIFMS) (2020-I2M-CoV19-001).
Author contributions
L.L., W.Z., J.M., and R.C. designed the experiments. L.L., M.L., and J.M. performed the proteomics experiments and bioinformatics analyses. R.C., W.L., and G.L. carried out sera sample preparation. R.C., W.L., D.M., and H.L. participated in COVID-19 tissue sample preparation. L.L., R.C., M.L., J.M., and S.Z. performed most of biological and functional experiments and analyzed the data. L.L., J.M., M.L., and R.C. wrote and edited the manuscript. W.Z., R.C., S.Z., and H.L. conceived the overall study. All authors made important comments regarding the manuscript.
Data availability
All proteomics raw data have been deposited to the ProteomeXchange Consortium via the iProX partner repository with the dataset identifier PXD021954.
Competing interests
The authors declare no competing interests.
Footnotes
These authors contributed equally: Ling Leng, Mansheng Li, Wei Li, Danlei Mou, Guopeng Liu.
Contributor Information
Hongjun Li, Email: lihongjun00113@126.com.
Ruiyuan Cao, Email: 21cc@163.com.
Wu Zhong, Email: zhongwu@bmi.ac.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41392-021-00612-5.
References
- 1.Leng L, et al. Pathological features of COVID-19-associated liver injury-a preliminary proteomics report based on clinical samples. Signal Transduct. Target Ther. 2021;6:9. doi: 10.1038/s41392-020-00406-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Khan KN, et al. Vitamin A deficiency due to bi-allelic mutation of RBP4: there’s more to it than meets the eye. Ophthalmic Genet. 2017;38:465–466. doi: 10.1080/13816810.2016.1227453. [DOI] [PubMed] [Google Scholar]
- 3.Shen B, et al. Proteomic and metabolomic characterization of COVID-19 patient Sera. Cell. 2020;182:59–72e15. doi: 10.1016/j.cell.2020.05.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.McClain MT, et al. Dysregulated transcriptional responses to SARS-CoV-2 in the periphery. Nat. Commun. 2021;12:1079. doi: 10.1038/s41467-021-21289-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Messner CB, et al. Ultra-high-throughput clinical proteomics reveals classifiers of COVID-19 infection. Cell Syst. 2020;11:11–24e14. doi: 10.1016/j.cels.2020.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All proteomics raw data have been deposited to the ProteomeXchange Consortium via the iProX partner repository with the dataset identifier PXD021954.