Coronavirus disease 2019 (COVID-19), the disease caused by infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has developed into a global pandemic that continues to pose an enormous threat to public health and the global economy. Since the first SARS-CoV-2 genome was released [1], thousands of genetic variants have been identified in SARS-CoV-2 strains isolated from worldwide patients. Some variants in the spike (S) protein have been reported to be associated with functional changes in viral transmissibility or viral loads, such as the D614G variant [2,3], the N501Y change, the P681H substitution and deletion of ΔH69-ΔV70 [4]. However, there is currently little direct evidence linking the genomic variants and clinical severity of COVID-19.
Based on the variants at two significantly linked SNP sites 8782 (in orf1ab) and 28 144 (in ORF8) (NC_045 512 as reference), SARS-CoV-2 genomes can be categorized into S lineage (U8782 and C28 144) or L lineage (C8782 and U28 144). The categorization was first proposed based on the analysis of 103 SARS-CoV-2 genomes in our previous study [5], subsequently confirmed by other studies [6–8]. Of note, the S and L lineage corresponds to the A and B lineage in Rambaut’s et al. A/B nomenclature system [8]. Among the 103 genomes, the L lineage was more prevalent than the S lineage (∼70% versus ∼28%), and evolutionary analysis inferred that S was ancestral, and L was the derived form [5,9]. As recently demonstrated [7], ∼99.8% of the SARS-CoV-2 strains sequenced from global samples during the pandemic can be categorized into either the L or S lineage, suggesting the delineation of L and S lineages is robust (see Supplementary data for the analysis of 127 119 SARS-CoV-2 genomes deposited in the Global Initiative on Sharing All Influenza Data (GISAID, https://www.gisaid.org; as of October 19, 2020)). Interestingly, the L lineage was more prevalent than S as the COVID-19 pandemic developed, and the S-genomes almost disappeared after the end of June 2020 (Fig. S1).
Did the patients with L- or S-lineage SARS-CoV-2 experience different clinical outcomes? To address this issue, we collected SARS-CoV-2 samples from 271 patients diagnosed with COVID-19 in the early outbreak of the pandemic from five Wuhan hospitals (see Patient data source section of Supplementary data, Table S1). These patients were randomly recruited with regard to the viral lineage. Their admission dates spanned two key time points: January 23, 2020, when Wuhan implemented the entire city lockdown, and February 14, 2020, when Wuhan adopted closed-off community management. Because the strict city lockdown was implemented rapidly, largely confining the spread of SARS-CoV-2 to a closed environment within Wuhan, these SARS-CoV-2 samples are well suited for investigating the difference in clinical features associated with the two SARS-CoV-2 lineages.
Among the 271 patients, SARS-CoV-2 in 73 (26.9%) cases were S lineage, and in 198 (73.1%) cases were L lineage (see Supplementary Methods, Workflow I, and Table S2 for details). Although many genetic variants arose in both the L and S lineages as the pandemic continued [7], we believe the L- and S-lineage viruses isolated from the 271 patients were very close to the reference genome (NC_045 512). The major differences between the two lineages surveyed here were located at sites 8782 and 28 144, and neither lineage carried the G614 variant in the S protein (see Supplementary data and Fig. S2 for details). We gathered clinical features from the 271 patients’ medical records for statistical analysis alongside the SARS-CoV-2 lineages. There was no significant difference in overall clinical outcomes (i.e. the proportion of patients that were recovered and discharged) between patients of the two lineages (Table S3). According to the Guidelines on the Diagnosis and Treatment of Novel Coronavirus issued by the National Health Commission, China (7th Edition), the 271 patients were divided into four groups of clinical severity: mild (n = 24), moderate (n = 98), severe (n = 117) and critical disease (n = 32). A significantly higher proportion of S-lineage patients were in the severe or critical condition relative to the L-lineage patients (P = 0.011, Fisher's exact test, Fig. 1A). Considering the relatively small number of patients in the mild (21 for L and 3 for S) and critical (18 for L and 14 for S) groups, we grouped the mild and moderate patients into a ‘non-serious’ category and severe and critical patients into a ‘serious’ category, as previously performed [10]. A significantly higher proportion of the S-lineage patients (69.9%, 51/73) fell into the serious category, relative to L-lineage patients (49.5%, 98/198) (P = 0.004, Fig. 1B).
Confounding factors such as age, gender and underlying medical conditions affect the symptoms and clinical severity of COVID-19 [11]. Here, Fisher's exact tests were used to examine the relationships between clinical severity (non-serious versus serious) and age (<65 versus ≥65 years old), underlying medical conditions (without versus with underlying diseases) and gender (female versus male). The results showed that elderly (≥65 years) patients, males, and patients with underlying medical conditions were more likely to appear in the serious category (all P values were < 0.05, Fig. S3 and Table S4), which confirmed the previous conclusions. To further evaluate the potential influences of these confounding factors on patients’ outcomes, we performed a multivariate logistic regression analysis of clinical severity against the variables, including the previous three confounding factors and viral lineage (L versus S). As shown in Fig. 1C, in the multivariate
analysis, both age and underlying medical conditions of patients were significantly associated with clinical severity (P = 0.002 for both factors), but the effect of gender on clinical severity was marginal (P = 0.073), and the possible interaction between age and underlying medical conditions had a non-significant effect on clinical severity (P > 0.4). Intriguingly, we still detected a significant association between viral lineage and the clinical severity of patients in the multivariate regression analysis, with S-lineage patients more likely falling into the serious category (OR = 2.19, 95% CI 1.15–4.29, P = 0.019, Fig. 1C). This result was robust when we changed the cutoff for age stratification (<60 versus ≥60 years old; P = 0.009, Fig. S4A) or treating age as a continuous variable (P = 0.034, Fig. S4B) in the multivariate logistic regression analysis. Thus, after excluding potential confounding factors (age, gender and underlying medical conditions), we still found that patients with S-lineage SARS-CoV-2 had significantly greater odds of experiencing severe disease than patients with L-lineage SARS-CoV-2.
Of note, a previous study surveyed 112 COVID-19 patients in Shanghai, China, and detected no significant difference in clinical severity between the L- and S-lineage patients [10]. The inconsistency between that study and ours might be partially because the fraction of serious cases in Hubei Province (Wuhan is the capital city of Hubei) tended to be higher than that in other areas of China (27.50% versus 5.21%, Fig. S5). Also, the hospitals from which the vast majority of patients were recruited were the designated hospitals to treat serious (severe/critical) COVID-19 patients in Wuhan, which gave rise to a higher proportion of serious cases in this study (∼55%) than the previous one (∼19%). Thus, the larger sample size and the higher proportion of serious cases gave us more power in the statistical tests to detect the difference.
How the L- and S-lineages differ in patients’ clinical severity remains unknown. Zhang et al. reported the ORF8 protein (28 144 was in ORF8) could mediate immune evasion by downregulating MHC-I molecules, but both L- and S-lineage SARS-CoV-2 ORF8 showed a similar effect on down-regulating MHC-I [12]. Yao et al. demonstrated that among the patient-derived SARS-CoV-2 isolates, S lineage showed lower viral titer in Vero cells compared to L lineage SARS-CoV-2 isolates [13]. However, the detailed mechanisms deserve further studies.
In summary, here, we analyzed 271 COVID-19 patients in the early outbreak in Wuhan and detected a significant difference in clinical severity between the L- and S-lineage patients. Although it remains unclear when and where the L and S lineages split, our finding is consistent with the hypothesis that the pathogenicity of SARS-CoV-2 might have been attenuated during the evolution from the S to L lineage. One limitation of this study is that most patients recruited in this study were from the designated hospitals to treat serious COVID-19 patients in Wuhan. The asymptomatic patients were under-represented in this study because relatively fewer of them were hospitalized in the early stage of the pandemic. Moreover, a recent population-level study on anti-SARS-CoV-2 antibodies in Wuhan showed that most people positive for pan-immunoglobulins were asymptomatic [14]. Thus, more studies are required to deepen our understanding of the connections between clinical manifestations and genetic variants of SARS-CoV-2 during the pandemic.
Supplementary Material
Acknowledgements
We thank all the patients who participated in this study and their families, and all the healthcare professionals who have worked on the front lines of the COVID-19 pandemic. We gratefully acknowledge the authors, originating and submitting laboratories of the sequences from GISAID (https://www.gisaid.org/). The list of specific acknowledgments is detailed in Table S5. We thank Dr. Chung-I Wu, Hong Wu, Jindong Zhao, Ruiping Xiao, Ning Zhang, Fangqing Zhao, Jie Cui, and Xionglei He for helpful discussions and recommendations related to this study.
Contributor Information
Ben Hu, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
Ran Liu, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China; CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China.
Xiaolu Tang, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Yuchen Pan, Department of Cardiology, Renmin Hospital of Wuhan University, China.
Ming Wang, Department of Clinical Laboratory, Renmin Hospital of Wuhan University, China.
Yongqing Tong, Department of Clinical Laboratory, Renmin Hospital of Wuhan University, China.
Guangming Ye, Department of Clinical Laboratory Medicine, Zhongnan Hospital of Wuhan University, China.
Gaigai Shen, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
Ruochen Ying, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Aisi Fu, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
Di Li, Department of Clinical Laboratory, Renmin Hospital of Wuhan University, China.
Wanxu Zhao, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
Jing Peng, Wuhan Dgensee Clinical Laboratory Co., Ltd., China.
Jie Guo, Wuhan Dgensee Clinical Laboratory Co., Ltd., China.
Dong Men, CAS Key Laboratory of Special Pathogens and Biosafety, Centre for Biosafety Mega-Sciences, Wuhan Institute of Virology, Chinese Academy of Sciences, China.
Xinmin Yao, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Yirong Wang, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Hong Zhang, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Zhihui Feng, Wuhan Dgensee Clinical Laboratory Co., Ltd., China.
Junping Yu, CAS Key Laboratory of Special Pathogens and Biosafety, Centre for Biosafety Mega-Sciences, Wuhan Institute of Virology, Chinese Academy of Sciences, China.
Liangjun Chen, Department of Clinical Laboratory Medicine, Zhongnan Hospital of Wuhan University, China.
Zixin Deng, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
Xuemei Lu, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, China.
Ya-Ping Zhang, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, China.
Yirong Li, Department of Clinical Laboratory Medicine, Zhongnan Hospital of Wuhan University, China.
Bende Liu, Department of Emergency Medicine, First People's Hospital of Jiangxia, China.
Lilei Yu, Department of Cardiology, Renmin Hospital of Wuhan University, China.
Yan Li, Department of Clinical Laboratory, Renmin Hospital of Wuhan University, China.
Jian Lu, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, China.
Tiangang Liu, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, China.
FUNDING
This project was supported by grants from the National Key Research and Development Program of China (No. 2018YFA0900400, Dr. Tiangang Liu), grants from the Emergency Project of Hubei Province (No. 2020FCA034, Dr. Yan Li and Dr. Tiangang Liu), Fundamental Research Funds for the Central Universities (No. 2042020kf1003, Dr. Tiangang Liu), grants from Special Fund for COVID-19 Emergency Research of Wuhan (No. EX20M02, Dr. Tiangang Liu), grants from the National Natural Science Foundation of China (No. 91 731 301, Dr. Jian Lu), grants from the Ministry of Science and Technology of the People's Republic of China (No. 2020YFC0847000, Drs. Xuemei Lu & Jian Lu), and grants from the Key Project for Anti-2019 Novel Coronavirus Pneumonia from the National Key Research and Development Program of China (No. 2020YFC0845500, Dr. Yirong Li).
Conflict of interest statement. None declared.
REFERENCES
- 1.Wu F, Zhao S, Yu Bet al. . Nature 2020; 579: 265–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Korber B, Fischer WM, Gnanakaran Set al. . Cell 2020; 182: 812–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Volz E, Hill V, McCrone JTet al. . Cell 2021; 184: 64–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Davies NG, Abbott S, Barnard Ret al. . Science 2021; 372: eabg3055. 10.1126/science.abg3055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Tang X, Wu C, Li Xet al. . Natl Sci Rev 2020; 7: 1012–23. 10.1093/nsr/nwaa036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Forster P, Forster L, Renfrew Cet al. . Proc Natl Acad Sci USA 2020; 117: 9241–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tang X, Ying R, Yao Xet al. . Sci Bull 2021; DOI: 10.1016/j.scib.2021.02.012. 10.1016/j.scib.2021.02.012 [DOI] [Google Scholar]
- 8.Rambaut A, Holmes EC, O’Toole Áet al. . Nat Microbiol 2020; 5: 1403–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Li T, Tang X, Wu Cet al. . Sci. China Life Sci. 2020; 63: 1608–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhang X, Tan Y, Ling Yet al. . Nature 2020; 583: 437–40. [DOI] [PubMed] [Google Scholar]
- 11.Wang D, Hu B, Hu Cet al. . JAMA 2020; 323: 1061–9. 10.1001/jama.2020.1585 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang Y, Zhang J, Chen Yet al. . bioRxiv 2020: p. 2020.05.24.111823. [Google Scholar]
- 13.Yao H, Lu X, Chen Qet al. . Cell Discov 2020; 6: 76. 10.1038/s41421-020-00226-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.He Z, Ren L, Yang Jet al. . Lancet 2021; 397: 1075–1084. 10.1016/S0140-6736(21)00238-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.