Abstract
Emerging variants enable the continuous spread of SARS‐CoV‐2 in humans. The factors contributing to behavioral differences in variants remain elusive despite associations with several Spike protein mutations. Exploring accessory proteins may provide a wider understanding of these differences since these proteins may affect viral processes that occur beyond infection. Various bioinformatics tools were utilized to identify significant accessory protein mutations and determine their structural and functional effects over time. The ViruClust web application was used to retrieve accessory protein amino acid sequences and determine mutation frequencies in these sequences across time. The structural and functional effects of the mutations were determined using Missense3D and PROVEAN, respectively. The accessory and Spike protein mutations were compared using mutation densities. Q57H and T151I of ORF3a; T21I and W27L of ORF6; G38V, V82A, and T120I of ORF7a; S31P and T40I of ORF7b; and R52I, C61F, and I121L of ORF8 were highly frequent in most variants of concern and were within known functional domains. Thus, these are good candidates for further experimental evaluation. Among the accessory proteins, ORF6 and ORF8 were highlighted because of their strong and weak correlation with Spike protein mutations, respectively.
Keywords: accessory proteins, bioinformatics, COVID‐19, mutations, SARS‐CoV‐2 variants
1. INTRODUCTION
The severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) virus, the causative agent of coronavirus disease 2019 (COVID‐19), has been causing a pandemic since 2020. Emerging SARS‐CoV‐2 variants have been fueling the continuous spread of this virus in humans despite widespread vaccination. So far, the origin of these variants is still unknown and the factors that drive the mutations in these variants remain elusive.
As of January 2022, the World Health Organization (WHO) has recognized and monitored 12 SARS‐CoV‐2 variants that are classified into variants of interest (VOI) and variants of concern (VOC). The WHO defines a VOI as a “SARS‐CoV‐2 isolate with mutations that lead to amino acid changes associated with established or suspected phenotypic implications.” Moreover, these variants have been shown to “cause community transmission, multiple COVID‐19 case clusters, or has been detected in multiple countries.” A VOC also has similar characteristics to a VOI. However, a VOC is “associated with an increase in transmissibility, change in the epidemiology, increase in virulence, change in disease outcome, or decrease in the effectiveness of diagnostics, vaccines, therapeutics, or public health and social measures” in a global scale. 1 So far, there are five VOC (Alpha, Beta, Gamma, Delta, and Omicron) and two VOI (Lambda and Mu).
The genetic basis for this classification has mostly been associated with the Spike protein, a viral membrane protein that facilitates entry of SARS‐CoV‐2 in susceptible cells by binding to the human ACE2 receptor (hACE2). The earliest known Spike protein mutation is the D614G mutation, which enables increased deposition of Spike protein in viral particles, increased binding affinity to hACE2, and increased infectivity of SARS‐CoV‐2 in human bronchial epithelial cells. 2 K417N/T, N439K, E484K, L452R, and N501Y mutations are also significant mutations since these also lead to increased infectivity and transmission of SARS‐CoV‐2 variants 3 , 4 , 5 and resistance to human antiviral immune response. 6 Aside from decreased hACE2 receptor binding affinity, Spike protein mutations such as P681R may also lead to increased infectivity through efficient Spike protein cleavage. 4
Despite associations between Spike protein mutations and SARS‐CoV‐2 variants, there are still aspects of these variants that cannot be explained solely by the mutations in the Spike protein. There seems to be a difference in the intrinsic virulence of the Alpha, Beta, Gamma, and Delta variants in humans even if they contain similar amounts of Spike protein mutations. 7 The severity of the disease caused by the Alpha and Beta variants did not differ significantly in SARS‐CoV‐2 infected Syrian hamsters. 8 However, Delta variant infection leads to more severe pathology in infected Syrian hamsters compared to the Alpha variant. 9 On the other hand, the recently emerging Omicron variant may have decreased virulence despite having a significantly higher number of mutations in the Spike protein than the Delta variant. 10 Therefore, there is a need to explore the inherent effects of SARS‐CoV‐2 mutations in the other proteins expressed by SARS‐CoV‐2.
Non‐Spike proteins may provide a wider landscape for understanding the contribution of genetic changes in the behavior of SARS‐CoV‐2. The potential of non‐Spike proteins in inducing behavioral changes in SARS‐CoV‐2 is exemplified by mutations in the nucleocapsid protein, a viral protein encapsulating the viral genetic material, such as the R203K and G204R mutations of the Alpha variant. These changes have been associated with increased production of viral particles leading to increased infectivity and virulence. 11
Aside from structural proteins (Spike, Nucleocapsid, Membrane, and Envelope proteins), SARS‐CoV‐2 can express a set of accessory proteins. This subset of viral proteins may not be involved in in vitro viral replication, but they may play a role in mediating different virus‐host interactions during infection. 12 There are at least nine accessory proteins that can be expressed by SARS‐CoV‐2: ORF3a, ORF3b, ORF6, ORF7a, ORF7b, ORF8, ORF9b, ORF10, and ORF14. 13 Different SARS‐CoV‐2 accessory proteins contribute to several viral processes. ORF9b interacts with TOM70 in transfected cells to facilitate immune modulation, 14 while ORF3a and ORF7b induce apoptosis in infected cells. 15 , 16 Thus, mutations in accessory proteins may give rise to possible mechanisms contributing to changes in the virulence of SARS‐CoV‐2 variants.
Using different bioinformatic platforms, this study aimed to identify significant amino acid mutations in the accessory proteins of SARS‐CoV‐2 variants and determine their likely effect on the structure and function of this set of proteins over time in silico. Furthermore, the study also aimed to determine how these mutations correlate with changes in the Spike protein.
2. METHODOLOGY
Amino acid sequences, via the ViruClust web application, 17 were collected and divided into two data sets, which represent the date of collection of the samples: early (March 01, 2021–April 30, 2021) and late (July 01, 2021–August 31, 2021). These data sets represent the emergence of the Alpha and Delta variants. As of writing, these are the most recent waves of COVID‐19 variants available in the ViruClust database. The geographical characteristics and lineage distribution of sequences in each group were identified. The amino acid sequences were also analyzed according to known variants: all the VOC (e.g., Alpha, Beta, Gamma, and Delta) and two representative VOI (e.g., Lambda and Mu). The ViruClust web application is available at http://gmql.eu/viruclust/.
The amino acid changes were characterized and compared across time using different bioinformatic tools and quantitative measurements of mutations (Figure 1A). The frequency of mutations was determined using ViruClust. The effects of the most frequent mutations in each variant on the secondary structure and biologic function of each accessory protein were determined using Missense3D 18 and PROVEAN, 19 respectively. Lastly, the mutations in the accessory proteins were compared to Spike protein mutations by calculating the mutation densities for each protein. The mutation density is the number of mutations divided by the amino acid length of the corresponding accessory protein. Then, the correlation coefficient of the mutation densities across the time points was compared for each accessory protein across the different variants. The Missense3D and PROVEAN web servers are available at http://www.sbg.bio.ic.ac.uk/%7Emissense3d and http://provean.jcvi.org, respectively.
For further information, details of the methodology can be found in the supplementary documents of this article.
3. RESULTS
3.1. Characteristics of sequences obtained from GISAID
A total of 1,324,216 amino acid sequences were retrieved using ViruClust (Figure 1B,C). The number of sequences obtained for the early and late groups was 809,963 and 514,253, respectively. Most of the collected sequences were from Europe and Asia. There were no sequences obtained from South America, which is common to both data sets. Furthermore, the Alpha variant was the predominant variant in Group A at 66%, while the Delta variant was more frequently observed in Group B at 49%.
3.2. Frequency of mutations in accessory protein genes across different variants
The ViruClust web application was used to identify missense and deletion mutations, which represent changes from the SARS‐CoV‐2 ancestral strain that was first identified in Wuhan, China. Moreover, the frequency of mutations was quantified as the percentage of the collected amino acid sequences for the specific variant with the specific point mutation plotted against the amino acid position of each accessory protein mutation. Only accessory proteins ORF3a, ORF6, ORF7a, ORF7b, and ORF8 were included in the study due to the availability of data in ViruClust. Therefore, the other SARS‐CoV‐2 accessory proteins such as ORF3b, ORF9b, ORF10, and ORF14 were not analyzed in this study.
ORF3a is a 275 amino acid multi‐pass transmembrane protein that forms an ion channel in the host cell membrane. 20 The Alpha and the Delta variant had the greatest number of ORF3a mutations in the early and late groups, respectively (Supporting Infomation: Figure 1). However, all ORF3a mutations in the Alpha variant occurred at less than 10% frequency (Figure 2A). The Delta variant had the highest increase (sevenfold) in the number of mutations between the two time points. The most frequent mutations in all variants are mostly within the ion channel domain of the protein, which includes the Q57H mutation (Figure 3A). This mutation is also present in both a VOC (Beta) and a VOI (Mu).
ORF6 is a nuclear protein with 61 amino acid residues. This protein may interact with the nuclear pore of SARS‐CoV‐2 infected cells via a putative binding site. 21 The mutations of this accessory protein occurred at less than 10% frequency in all the variants (Figures 2B and 3B and Supporting Information: Figure 2). The Delta variant accumulated the most ORF6 mutations between the time points with a fivefold increase. The most common mutations are T21I and W27L. The T21I mutation can be found in both a VOC (Alpha and Beta) and VOI (Lambda), while W27L is only present in VOCs (Alpha, Gamma, and Delta).
ORF7a is a single‐pass transmembrane protein with 121 amino acid residues. This protein contains a putative binding site for lymphocyte antigen an endoplasmic localization signal. 22 The Alpha variant had the highest number of ORF7a mutations in the early time point, while the Delta variant has the highest number of mutations in the late time point (Supporting Information: Figure 3). Overall, the Delta variant had the highest number of ORF7a mutations, with a fourfold increase, in between the two time points. Only the Delta variant had ORF7a mutations that have frequencies greater than 50% in the amino acid sequences in both time points (Figure 2C). The most consistent mutations in this variant are T120I and V82A. The T120I is within the di‐lysine motif of ORF7a, which is an important structure for endoplasmic reticulum localization of proteins 23 (Figure 3C). Mutations were also seen in the lymphocyte antigen‐binding site of ORF7a in both the Alpha and Delta variants. However, the most frequent mutations in the Delta variant were not within this functional domain.
ORF7b is predicted to be a 43 amino acid single‐pass transmembrane protein in the endoplasmic reticulum. 24 The data show that there is mostly low ORF7b mutation frequency of <1% in all variants except in the Delta variant (Figure 2D). Interestingly, the Lambda variant had no detectable mutations in the early time point (Supporting Information: Figure 4). Only the Delta variant had mutation frequencies that reached greater than 50% in both time points (Figure 3D). The most frequent mutation is T40I, which is only present in VOCs (Beta, Gamma, and Delta). This mutation has also increased frequency in Group B by threefold. The majority of the variations occur beyond the transmembrane domain.
ORF8 is a 121 amino acid single‐pass transmembrane protein in the endoplasmic reticulum 25 with an Immunoglobulin‐like (Ig‐like) domain. 26 The Alpha variant has ORF8 mutations that are present in all sequences in both time points such as Y73C, Q27* (* = no amino acid equivalent due to formation of a stop codon), and R52I. Moreover, the Alpha variant was able to retain the same set of mutations in both time points. The most frequent mutations were not maintained in the Delta variant through time (Figure 2 and Supporting Information: Figure 5). The most common ORF8 mutations across the variants were R52I and I121L. The bulk of the mutations across the variants encompasses the Ig‐like domain of ORF8 (Figure 3E).
All in all, the data show that point mutations occur in the functional domains of the majority of the accessory proteins. The results also reveal that ORF8 was the only accessory protein that did not accumulate high‐frequency mutations between the two time points in the Delta variant. This is in contrast with other variants, especially the Alpha variant, which had high‐frequency ORF8 mutations. Interestingly, ORF6 was the only accessory protein that did not contain any high‐frequency mutations across the variants.
3.3. Effect of mutations on the secondary structure of SARS‐CoV‐2 accessory proteins
The 10 most frequent mutations per accessory protein were selected for the determination of the possible structural and functional effects of these mutations. The effects of the mutations on secondary structure were determined using the Missense3D online platform.
Most of the selected mutations in ORF3a were predicted to have no structural damage to the protein (Figure 4A). Only the T151I mutation of ORF3a was found to lead to structural damage, which is a buried H‐bond breakage in both the Alpha and Beta variants.
All the selected mutations in ORF6 were predicted to have no structural damage to the protein (Figure 4B).
Most of the selected mutations in ORF7a were predicted to have no structural damage to the protein (Figure 4C). Only the G38V mutation was found to influence the secondary structure of ORF7a by introducing a steric clash in the Delta, Lambda, and Mu variants.
Most of the selected mutations in ORF7b were predicted to have no structural damage to the protein (Figure 4D). Only the S31P mutation was found to influence the secondary structure of ORF7b by introducing a steric clash in the Lambda variant.
Most of the selected mutations in ORF8 were predicted to have no structural damage to the protein (Figure 4E). Only the C61F mutation was found to influence the secondary structure of ORF8 by introducing a steric clash and disulfide breakage in the Alpha variant.
In summary, most of the point mutations in the accessory proteins did not contribute to a change in the secondary structure of these proteins. However, there were four mutations that can contribute to a change in the secondary structure such as T151I of ORF3a, G38V of ORF7a, S31P of ORF7b, and C61F of ORF8.
3.4. Effect of mutations on the biologic function of SARS‐CoV‐2 accessory proteins
The PROVEAN software was used to determine if the selected mutations can independently lead to either a possible loss‐of‐function or neutral outcome. A loss‐of‐function outcome results when the mutation leads to a nonfunctional protein. Meanwhile, a neutral outcome implies that the protein has retained its function despite the presence of the mutation. Like the Missense3D platform, only the individual effects of each mutation can be analyzed by PROVEAN.
In ORF3a, the number of loss‐of‐function and neutral mutations were similar in number across the VOC. (Figure 5A). However, more neutral ORF3a mutations were identified in the VOIs. The T151I mutation of ORF3a is a loss‐of‐function mutation.
In ORF6, the number of loss‐of‐function mutations were greater than neutral mutations (Figure 5B). Moreover, the VOCs have more loss of function mutations than neutral mutations.
In ORF7a and ORF7b, the number of loss‐of‐function mutations was greater than neutral mutations (Figure 5C,D). The G38V mutation of ORF7a and the S31P mutation of ORF7b are loss‐of‐function mutations.
In ORF8, the number of loss‐of‐function and neutral mutations were similar in number across all variants (Figure 5E). All the selected ORF8 mutations of the alpha variant were identified as loss‐of‐function mutations including the C61F mutation.
All in all, the proportion of loss‐of‐function and neutral mutations are the same in ORF3a and ORF8. Meanwhile, ORF6, ORF7a, and ORF7b have more loss‐of‐function mutations than neutral mutations (Figure 5G). Interestingly, despite not having any effect on the secondary structure of ORF6, some ORF6 mutations may still lead to a loss of function. T151I of ORF3a, G38V of ORF7a, S31P of ORF7b, and C61F of ORF8 were shown to be loss‐of‐function mutations.
3.5. Correlation of accessory protein mutations with Spike protein mutations
The same analyses on the frequency of mutations and the effect of the mutations on secondary structure and biologic function were done using the Spike protein sequences of each variant.
The D614G mutation is the only mutation that is present in all variants (Figure 2F and Supporting Information: Figure 6). The Delta variant is the only variant that was able to preserve its most frequent mutations (D614G, P681R, L452R, T478K, T19R) across time.
In terms of location, most of the mutations occur in the different structural and functional domains of the Spike protein (Figure 3F). All variants contain mutations in the receptor‐binding domain. All variants except the Beta and Lambda variants contain mutations in the transmembrane domain. All variants except the Beta variant contain mutations in the furin cleavage site of the Spike protein.
The effect of most of the selected mutations in the Spike protein was predicted to have no structural damage to the protein (Figure 4F). The D80A and T95I are mutations that may influence the secondary structure of the Spike protein. The former mutation may lead to the replacement of a buried charge and the breakage of a salt bridge in the Beta variant. Meanwhile, the latter mutation may lead to the breakage of a buried hydrogen bond in the Mu variant.
Most of the selected mutations of the Spike protein are neutral mutations except for the T716I mutation in the Alpha variant (Figure 5F). The D80A and T95I mutations are neutral mutations.
To determine the possible correlations between the mutations of accessory proteins to the Spike protein, the mutation densities of these proteins were calculated and compared across the different variants (Figure 6A,B). Correlation of the mutation densities of the accessory proteins to the Spike protein revealed that ORF7b was highly correlated (r > 0.9) with the Spike protein (r = 0.95) in the early group (Figure 6C). Meanwhile, all the accessory proteins were highly correlated with the Spike protein in the late group wherein ORF6 had the highest correlation (r = 0.99) (Figure 6D). ORF8 was the least correlated accessory protein with the Spike protein in both time points across the different variants.
4. DISCUSSION
This study identified several amino acid mutations in the accessory proteins of SARS‐CoV‐2 variants. Despite the limited contribution of these mutations to the secondary structure of these proteins, some are still good candidates for further investigation such as the Q57H and T151I mutations of ORF3a; T21I and W27L mutations of ORF6; G38V, V82A, and T120I mutations of ORF7a; S31P and T40I mutations of ORF7b; and R52I, C61F, and I121L mutations of ORF8. These may influence the function of these accessory proteins because these are found within known functional domains and are shown to be highly frequent in most VOC. It should be noted that there are still no experimental studies that can provide evidence on the effects of these mutations. Thus, the effects of these mutations should be verified through experimental studies.
This study also determined the correlation between the SARS‐CoV‐2 accessory and Spike protein mutations. Most Spike protein mutations occur in the different structural and functional domains of the Spike protein in different relative frequencies across the different variants. The DG14G mutation is the only Spike protein mutation present in all SARS‐CoV‐2 variants, which is consistent with previous reports. 27 The effect of most of the selected Spike protein mutations, along with D614G, were predicted to be neutral mutations that may not produce any structural damage to the protein. These findings are in contrast with the observed experimental effects of D614G of increasing infectivity, transmission, and replication kinetics of SARS‐CoV‐2 in vitro and in vivo. 28 These structural and functional damages were not apparent since only the effects of single mutations were examined and investigated. Therefore, these results should also be verified experimentally.
The correlations with the Spike protein mutations highlighted ORF6 and ORF8. ORF8 is not well conserved through time, which suggests that there may be no selection pressure to retain the structure and function of ORF8. Furthermore, the mutation density of this protein did not correlate well with the Spike protein relative to the other accessory proteins, which is also observed in the ORF8 of other beta coronavirus species. 29 The lack of consistency with the Spike protein may be because ORF8 may not contribute to the increased virulence in the Delta variant. Even if initial studies reveal that it may contribute to immune evasion 28 , 29 , 30 cytokine storm, 31 the contribution of ORF8 on the pathogenesis may not be reflected in real life. This observation is consistent with previous studies showing the presence of genomes with or without truncated ORF8 32 where these deletions were associated with mild disease. 33 Lastly, the highly random changes of ORF8 also implies that more studies are needed if ORF8 is to be considered as a therapeutic target.
On the other hand, ORF6 was the only accessory protein that did not have any high‐frequency mutations across the SARS‐CoV‐2 variants. Thus, the structure of the protein may be retained as SARS‐CoV‐2 evolves over time. The putative role of ORF6 in the pathogenesis of SARS‐CoV‐2 may be in mediating immune evasion by disrupting the activity of transcription factors associated with the innate immune response. 21 Like ORF8, so far, there is no evidence linking ORF6 mutations with increased infection or severity of COVID‐19. Thus, experimental studies should be done to verify the effects of the mutations. However, the strong correlation between the mutation of Spike protein and ORF6 suggests that ORF6 can be used as a possible surrogate marker for the Spike protein.
The findings of this study are consistent results with other in silico investigations on SARS‐CoV‐2 mutations. The highly variable mutations in ORF8 support that the gene is a mutational hotspot. 34 The Q57H and T151I mutations identified in ORF3a were also observed by another study utilizing a different bioinformatic approach. 35 Additionally, the Q57H mutation in ORF3a is also consistently observed in a mutational analysis done using sequences obtained from the early part of the COVID‐19 pandemic. 36
Despite the numerous findings on accessory protein mutations in this study, interpretation of these data should be done with caution due to some limitations. First, the time points used in the study reflect the transition when the Delta variant overtook the Alpha variant in terms of the number of sequences, which explains the discrepancy in the number of samples collected per variant between the two time points. Despite the uneven number of sequences, there were still retained amino acid sequences in ORF6. Secondly, there are effects of mutations that may have been missed by the bioinformatic tools since they only predict the effects of individual mutations and do not account for the possible synergistic effects of multiple mutations in influencing the structure and function of the accessory proteins. Lastly, it should be emphasized that these tools can only make predictions on the effects of mutations. Therefore, experimental validation must be done to ascertain if these mutations lead to changes in function.
Nonetheless, this study was still able to identify significant amino acid substitutions in the accessory proteins (Q57H and T151I in ORF3a; T21I and W27L in ORF6; G38V, V82A, and T120I in ORF7a; S31P and T40I in ORF7b; and R52I, C61F, and I121L in ORF8) that may alter their functions. Given the preliminary evidence that these accessory proteins largely contribute to immune evasion of SARS‐CoV‐2 in humans, these accessory proteins may play a role in facilitating viral pathogenesis. Thus, these mutations can be candidates for experimental studies to further broaden the understanding of the effects of these changes in viral processes downstream of infection.
5. CONCLUSION
SARS‐CoV‐2 continues to evolve into different variants across time. The Spike protein is the most implicated SARS‐CoV‐2 protein for the infectivity and pathogenesis of the virus, which may provide a skewed perspective because past and current research are mostly focused on this protein. Accessory proteins may also play a vital role in facilitating viral processes. This study was able to monitor the temporal changes in accessory proteins ORF3a, ORF6, ORF7a, ORF7b, and ORF8 using amino acid sequences from two‐time points during the transition from the Alpha variant to Delta variant dominance. All accessory proteins obtained several mutations that can be good candidates for further experimental evaluation. Knowing the effects of these mutations may facilitate the discovery of potential therapeutic targets, disease mechanisms, and factors that drive mutations. Furthermore, as some accessory proteins are correlated with the Spike protein, the mutations in future SARS‐CoV‐2 variants may be predicted. The succeeding SARS‐CoV‐2 variant would likely have mutations in ORF8 but may remain free from ORF6 mutations.
AUTHOR CONTRIBUTIONS
Christian Alfredo K. Cruz and Paul Mark B. Medina contributed in the conceptualization, data analysis, writing, and revision of this study article. Christian Alfredo K. Cruz performed the data collection and produced the figures of this study. Paul Mark B. Medina contributed to the critical evaluation of the data and figures of this manuscript.
CONFLICT OF INTEREST
The author declares no conflict of interest.
Supporting information
ACKNOWLEDGEMENTS
All authors were involved in the conceptualization, collection, analyses, and interpretation of the data presented in this article. The authors would like to thank the different laboratories that made their bioinformatics platforms available online: Luca Cilibrasi's team for the ViruClust database, Yoonwook Choi and Agnes Chan for the PROVEAN tool, Prof. Micheal Sternberg's team for the Missense3D portal, and Emma Hodcroft for the CoVariants website.
Cruz CAK, Medina PMB. Temporal changes in the accessory protein mutations of SARS‐CoV‐2 variants and their predicted structural and functional effects. J Med Virol. 2022;94:5189‐5200. 10.1002/jmv.27964
DATA AVAILABILITY STATEMENT
The data generated by this study are available in the main text and supplementary materials. Raw data can be retrieved from ViruClust, Missense3D, and PROVEAN web applications at http://gmql.eu/viruclust/ http://www.sbg.bio.ic.ac.uk/%7Emissense3d and http://provean.jcvi.org, respectively.
REFERENCES
- 1. Global Consultation on a Decision Framework for Assessing the Impact of SARS‐CoV‐2 Variants of Concern on Public Health Interventions, 29 March 2021: Meeting Report: World Health Organization; 2021.
- 2. Hou YJ, Chiba S, Halfmann P, et al. SARS‐CoV‐2 D614G variant exhibits efficient replication ex vivo and transmission in vivo. Science. 2020;370:1464‐1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Barton MI, MacGowan SA, Kutuzov MA, Dushek O, Barton GJ, van der Merwe PA. Effects of common mutations in the SARS‐CoV‐2 Spike RBD and its ligand, the human ACE2 receptor on binding affinity and kinetics. eLife. 2021;10:e70658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Liu Y, Liu J, Johnson BA, et al. Delta spike P681R mutation enhances SARS‐CoV‐2 fitness over Alpha variant. Cell Reports. 2022;39:110829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Liu Y, Liu J, Plante KS, et al. The N501Y spike substitution enhances SARS‐CoV‐2 infection and transmission. Nature. 2021;602:294‐299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Garcia‐Beltran WF, Lam EC, Denis KS, et al. Multiple SARS‐CoV‐2 variants escape neutralization by vaccine‐induced humoral immunity. Cell. 2021;184(9):2372‐2383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Ong SWX, Chiew CJ, Ang LW, et al. Clinical and virological features of severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) variants of concern: A retrospective cohort study comparing B. 1.1. 7 (Alpha), B. 1.351 (Beta), and B. 1.617. 2 (Delta). Clin Infect Dis. 2021:1‐9. [DOI] [PMC free article] [PubMed]
- 8. Abdelnabi R, Boudewijns R, Foo CS, et al. Comparing infectivity and virulence of emerging SARS‐CoV‐2 variants in Syrian hamsters. EBioMedicine. 2021;68:103403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Saito A, Irie T, Suzuki R, et al. Enhanced fusogenicity and pathogenicity of SARS‐CoV‐2 Delta P681R mutation. Nature. 2021;602:1‐10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Wolter N, Jassat W, Walaza S, et al. Early assessment of the clinical severity of the SARS‐CoV‐2 omicron variant in South Africa: a data linkage study. The Lancet. 2022;399:437‐446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Wu J, Shi Y, Pan X, et al. SARS‐CoV‐2 ORF9b inhibits RIG‐I‐MAVS antiviral signaling by interrupting K63‐linked ubiquitination of NEMO. Cell Rep. 2021;34(7):108761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Laguette N, Benkirane M. Shaping of the host cell by viral accessory proteins. Front Microbiol. 2015;6:142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Kim D, Lee JY, Yang JS, Kim JW, Kim VN, Chang H. The architecture of SARS‐CoV‐2 transcriptome. Cell. 2020;181(4):914‐921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Thorne LG, Bouhaddou M, Reuschl AK, et al. Evolution of enhanced innate immune evasion by SARS‐CoV‐2. Nature. 2022;602:487‐495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Ren Y, Shu T, Wu D, et al. The ORF3a protein of SARS‐CoV‐2 induces apoptosis in cells. Cell Mol Immunol. 2020;17(8):881‐883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Yang R, Zhao Q, Rao J, et al. SARS‐CoV‐2 accessory protein ORF7b mediates tumor necrosis factor‐α‐induced apoptosis in cells. Front Microbiol. 2021;12:654709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Cilibrasi L, Pinoli P, Bernasconi A, Canakoglu A, Chiara M, Ceri S. ViruClust: direct comparison of SARS‐CoV‐2 genomes and genetic variants in space and time. Bioinformatics. 2022;38:1988‐1994. [DOI] [PubMed] [Google Scholar]
- 18. Ittisoponpisan S, Islam SA, Khanna T, Alhuzimi E, David A, Sternberg MJ. Can predicted protein 3D structures provide reliable insights into whether missense variants are disease associated? J Mol Biol. 2019;431(11):2197‐2212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31(16):2745‐2747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Kern DM, Sorum B, Mali SS, et al. Cryo‐EM structure of SARS‐CoV‐2 ORF3a in lipid nanodiscs. Nat Struct Mol Biol. 2021;28:573‐582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Miorin L, Kehrer T, Sanchez‐Aparicio MT, et al. SARS‐CoV‐2 Orf6 hijacks Nup98 to block STAT nuclear import and antagonize interferon signaling. Proc Natl Acad Sci. 2020;117(45):28344‐28354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Zhou B, Thao TTN, Hoffmann D, et al. SARS‐CoV‐2 spike D614G change enhances replication and transmission. Nature. 2021;592(7852):122‐127. [DOI] [PubMed] [Google Scholar]
- 23. Bazan B, Wiktor M, Maszczak‐Seneczko D, Olczak T, Kaczmarek B, Olczak M. Lysine at position 329 within a C‐terminal dilysine motif is crucial for the ER localization of human SLC35B4. PLoS One. 2018;13(11):e0207521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Schaecher SR, Mackenzie JM, Pekosz A. The ORF7b protein of severe acute respiratory syndrome coronavirus (SARS‐CoV) is expressed in virus‐infected cells and incorporated into SARS‐CoV particles. J Virol. 2007;81(2):718‐731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Flower TG, Buffalo CZ, Hooy RM, Allaire M, Ren X, Hurley JH. Structure of SARS‐CoV‐2 ORF8, a rapidly evolving immune evasion protein. Proc Natl Acad Sci. 2021;118:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Tan Y, Schneider T, Leong M, Aravind L, Zhang D. Novel immunoglobulin domain proteins provide insights into evolution and pathogenesis of SARS‐CoV‐2‐related viruses. mBio. 2020;11(3):e00760‐20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Chakraborty C, Saha A, Sharma AR, Bhattacharya M, Lee SS, Agoramoorthy G. D614G mutation eventuates in all VOI and VOC in SARS‐CoV‐2: is it part of the positive selection pioneered by Darwin? Mol Ther Nucleic Acids. 2021;26:237‐241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Zhang J, Xiao T, Cai Y, et al. Membrane fusion and immune evasion by the spike protein of SARS‐CoV‐2 Delta variant. Science. 2021;374(6573):1353‐1360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Pereira F. Evolutionary dynamics of the SARS‐CoV‐2 ORF8 accessory gene. Infect Genet Evol. 2020;85:104525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Zhang Y, Chen Y, Li Y, et al. The ORF8 protein of SARS‐CoV‐2 mediates immune evasion through down‐regulating MHC‐Ι. Proc Natl Acad Sci. 2021;118(23):1‐12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Lin X, Fu B, Yin S, et al. ORF8 contributes to cytokine storm during SARS‐CoV‐2 infection by activating IL‐17 pathway. Iscience. 2021;24(4):102293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Hassan SS, Kodakandla V, Redwan EM, et al. An issue of concern: unique truncated ORF8 protein variants of SARS‐CoV‐2. PeerJ. 2022;10:e13136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Young BE, Fong SW, Chan YH, et al. Effects of a major deletion in the SARS‐CoV‐2 genome on the severity of infection and the inflammatory response: an observational cohort study. The Lancet. 2020;396(10251):603‐611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Badua CLDC, Baldo KAT, Medina PMB. Genomic and proteomic mutation landscapes of SARS‐CoV‐2. J Med Virol. 2021;93(3):1702‐1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Azad GK, Khan PK. Variations in Orf3a protein of SARS‐CoV‐2 alter its structure and function. Biochem Biophys Rep. 2021;26:100933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Chaudhuri D, Majumder S, Datta J, Giri K. In silico study of mutational stability of SARS‐CoV‐2 proteins. Protein J. 2021;40(3):328‐340. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data generated by this study are available in the main text and supplementary materials. Raw data can be retrieved from ViruClust, Missense3D, and PROVEAN web applications at http://gmql.eu/viruclust/ http://www.sbg.bio.ic.ac.uk/%7Emissense3d and http://provean.jcvi.org, respectively.