Skip to main content
Virus Research logoLink to Virus Research
. 2022 Dec 5;323:199016. doi: 10.1016/j.virusres.2022.199016

SARS‐CoV‐2 Non-structural protein 1(NSP1) mutation virulence and natural selection: Evolutionary trends in the six continents

Samira Salami Ghaleh a,1, Karim Rahimian b,1, Mohammadamin Mahmanzar c,1, Bahar Mahdavi d, Samaneh Tokhanbigli e, Mahsa Mollapour Sisakht f, Amin Farhadi g, Mahsa Mousakhan Bakhtiari h, Donna Lee Kuehu c, Youping Deng c,
PMCID: PMC9721189  NIHMSID: NIHMS1855899  PMID: 36473671

Highlights

  • The COVID-19 pandemic takes the position of the focus of attention in worldwide.

  • NSP1 is a conserved region on SARS-CoV-2.

  • E87D as the most frequent mutation on NSP1, effects on destabilization of the virus.

  • Host mRNA translation is impacted by mutations in the NSP1 region.

Keywords: COVID-19, SARS-CoV-2, NSP1, Mutations, Translational inhibition, E87D

Abstract

Objective

Rapid transmission and reproduction of RNA viruses prepare conducive conditions to have a high rate of mutations in their genetic sequence. The viral mutations make adapt the severe acute respiratory syndrome coronavirus 2 in the host environment and help the evolution of the virus then also caused a high mortality rate by the virus that threatens worldwide health. Mutations and adaptation help the virus to escape confrontations that are done against it.

Methods

In the present study, we analyzed 6,510,947 sequences of non-structural protein 1 as one of the conserved regions of the virus to find out frequent mutations and substitute amino acids in comparison with the wild type. NSP1 mutations rate divided into continents were different.

Results

Based on this continental categorization, E87D in global vision and also in Europe notably increased. The E87D mutation has signed up to January 2022 as the first frequent mutation observed. The remarkable mutations, H110Y and R24C have the second and third frequencies, respectively.

Conclusion

According to the important role of non-structural protein 1 on the host mRNA translation, developing drug design against the protein could be so hopeful to find more effective ways the control and then treatment of the global pandemic coronavirus disease 2019.

Abbreviations

Severe acute respiratory syndrome coronavirus 2

(SARS-CoV-2),

Open reading frames

(ORF),

Spike proteins

(S),

Envelope protein

(E),

Membrane protein

(M),

Nucleocapsid protein

(N)

Non-structural protein 1

(NSP1),

Coronavirus disease 2019

(COVID-19),

Amino acid sequence

(AAS),

Amino acid

(AA),

Interferon

(IFN).

1. Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is an unsegmented positive-sense single-stranded RNA virus that belongs to the β-coronavirus genus and Coronaviridae family, first reported in Wuhan (Wang et al., 2020; Ye et al., 2020). Phylogenetic analysis revealed that SARS-CoV-2 has a 79% sequence similarity with SARS-CoV that caused an outbreak in 2002 (Xu et al., 2020). Therefore, it is of vital importance to understand the differences and the relationships between these two types of viruses (Li et al., 2021).

The SARS-CoV-2 genome is ∼ 30 kb in length and composed of 13–15 (12 functional) open reading frames (ORFs) which are located 5′ to 3′ and known as ORF1a and ORF1b that encode 16 non-structural proteins (Fooladinezhad et al., 2022; Mazhari et al., 2021) (Fig. 1B). In the following, several structural proteins such as spike proteins (S), envelope protein (E), membrane protein (M), and nucleocapsid protein (N) encode via the structural region of the genome (Benedetti et al., 2020; Zhao et al., 2021) that have a significant role in virus pathogenicity and infectivity (Emam et al., 2021). Among these structural regions, there are accessory factors such as ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10. This segmentation is displayed in Fig. 1A. These gene products play important roles in viral entry and RNA synthesis, fusion, and survival in host cells, and also have the capability of gaining rapid mutations as the virus spreads (Banerjee et al., 2021; Laha et al., 2020; Naqvi et al., 2020).

Fig 1.

Fig 1

The schematic diagram illustrates an overview of the SARS-CoV-2 genome more detailed NSP1. (A) SARS-CoV-2 genome organization includes the ORF1ab sequence, the structural regions (S, E, M, and N) and accessory factors (ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10). (B) The ORF1ab sequence is self-cleaved into 16 non-structural proteins, and (C) NSP1 domains, 3D structure, and its amino acid sequence.

A major virulence factor of SARS-CoVs is the non-structural protein 1 (NSP1), which has a significant role in suppressing host gene expression by ribosome association. Binding NSP1 from SARS-CoV-2 to the 40S ribosomal subunit of the human cell as a host results in shutting down of mRNA translation both in vitro and in vivo. Cryo-electron microscopy analysis helps to clarify how they are placed together. When the NSP1 C-terminus (Fig. 1C) binds to the 40S subunit of the ribosome (Zhao et al., 2021), the formed complex prevents binding the mRNA to the ribosomal entry tunnel. NSP1 blocks gene I-dependent innate immune responses by retinoic acid-inducible that finally leads to eradication of the infection. The structural characterization of the inhibitory mechanism of NSP1 against SARS-CoV-2 can be useful for further significant insights into the drug design (Thoms et al., 2020). NSP1 acts as a ribosome gatekeeper and binds the small ribosomal subunit and stalls canonical mRNA translation at various stages during initiation (Kamitani et al., 2009; Lokugamage et al., 2012). In addition, degradation of host mRNA occurs because of binding to the ribosome which leads to endonucleolytic cleavage (Huang et al., 2011). Thus, NSP1 is a major viral virulence factor and because of this sensitive role of NSP1 in Coronavirus disease 2019 (COVID-19) tracking any structural mutation is vitally important (Nakagawa and Makino, 2021).

Over time, the increase in a number of patients suffering from COVID-19 and mortality rate differed in geographical regions. This difference appears because of multiple reasons that can affect these studied mutations (Zhou et al., 2021). Due to the modifications on NSP1 protein amino acids (AAs), the incidence of different variations on the SARS-CoV-2 was observed. These mutations can affect binding to the human 40S subunit in ribosomal complexes and then, change flexibility and stabilization in the virus. Translation inhibition takes place as a result of this phenomenon that occurs in both in vitro translation systems and in vivo (Schubert et al., 2020).

Despite global vaccination efforts, various mutations continue to be observed that can represent virus escape from vaccination and all other administered drugs (eg. Remdesivir, hydroxychloroquine, chloroquine, ribavirin, ritonavir, lopinavir, favipiravir, interferons, bevacizumab, azithromycin, etc.) (Dos Santos, 2021). Location-dependent species in each continent demonstrate vaccination has not been able to have the expected effect of controlling the disease (Rubin, 2021). Thus, having clear information from amino acid sequences (AASs) and the virus structure is useful and necessary to detect more details about the mutation dynamics of the SARS-CoV-2 genome (Albahli and Albattah, 2021).

In this regard, we aimed to investigate the amino acid mutation patterns and their specificities divided into geographical areas from the beginning of the pandemic to January 2022. In addition, we represented the link between mutations and incidence of them on different continents that can be helpful in the prediction of dynamic transmission in the future.

2. Materials and methods

2.1. Sequence and source

This study focuses on evaluating a big database of NSP1 AASs of the SARS-CoV-2 genome. NSP1 is located in the N-terminal region of the ORF1ab sequence that is susceptible to many mutations. The reference sequence is the Wuhan-2019 virus with access number ‘EPI_ISL_402124′ also known as wild type. All of the AAS samples were compared with the reference sequence. 180 AAs of the NSP1 region were extracted from GISAID (www.gisaid.org) (Elbe and Buckland‐Merrett, 2017; Khare et al.; Shu and McCauley, 2017) database belong period January 2020 to January 2022. Erasmus Medical Center grunted the accession to this database.

2.2. Sequence analyses and processing

NSP1 was extracted from other genes and after analyzing mutations and sequence alignment, FASTA files were processed by Python 3.8.0 software. ‘Numpy’ and ‘Pandas’ libraries optimized the whole process. Mutations were defined as each difference between sample and reference, then the location and replaced AA were reported. Less or more than 180 AAs belong to non-human samples such as pangolin or bat, and samples containing non-specified AAs (reported as X) were eliminated, and finally, 6510,947 refined samples were examined in the study. The identifying algorithm for detecting mutants is as follows:

In this algorithm, since all sequences have equal lengths, respectively ‘Refseq,’ and ‘seq’ refer to reference sequence and sample sequence. for refitem, seqitem in zip (refseq, seq) if (refitem! = seqitem)

Report a new mutant

Each sample's continent name and geographical coordinates were labeled on them. Pycountry-convert 0.5.8 software and ‘Titlecase’ library in Python were used to report global prevalence maps of mutations. The procedure of data refining was presented in Fig. 2.

Fig 2.

Fig 2

Data processing workflow steps are used to represent and validate NSP1 AAs mutations.

2.3. Secondary protein structure and dynamic prediction

Analyzing the mutational structure and molecular flexibility of NSP1 protein modeling on the E87D mutation as the most frequent was performed by the DynaMut web server (http://biosig.unimelb.edu.au/dynamut/). The PDB ID of protein (7K3N) was taken from the Protein Data Bank (https://www.rcsb.org/) and used for prediction on the DynaMut web server.

2.4. Statistical analysis

R 4.0.3 and Microsoft Power BI software were used to conduct data normalization and comparison charts outlining. GraphPad Prism 8.0.2 software were used for data visualization. To improve and compare the data's results, the normalized frequency of each region was reported. Then the number of mutations was divided by the number of sequences on that region comparable in equal proportions.

3. Results

3.1. Numbers and incidence of mutations in NSP1 AAS based on geographical areas

Relevant statistical analyses were performed to identify the potential mutations in the NSP1 protein structure. At the present, the number of AAS mutations in the whole of 6510,947 sequences was examined. Of the total number of NSP1 FASTA files, 277,242 data belong to not-matched length samples, 3434 data relate to non-human sequences, and 264,251 data belong to sequences that contained X. Finally, 544,927 data were not included in the analysis, and 5966,020 NSP1 data remained for examination in the study.

The statistical results showed that 6.3106% of AASs had one mutation, 0.2024% of AASs included two mutations, 0.0039% of AASs comprised three mutations, and 0.0010% of AASs contained more than four mutations. In contrast, 93.4819% of sequences had no mutation in their AASs. In addition, the frequency percentage of mutations was investigated in six geographical regions; North America, South America, Europe, Asia, Oceania, and Africa (Fig. 3A and Supplementary file1).

Fig 3.

Fig 3

The heat map of the number and approximate region of mutations in NSP1 of SARS-CoV-2. (A) Heat map chart of the mutations’ number in NSP1 of SARS-CoV-2 until January 2022 in each continent. (B) The NSP1 protein was divided into ten segments. The rate of each mutation per segmented 18 amino acids was indicated in the heat map of the protein. The highest frequency rate occurred in the 72 to 126 amino acid sequence of the NSP1 protein that we can consider as a hot spot region in NSP1. The percentage of data were used to draw heat maps. Supplementary file1 and Supplementary file 2.

In continental-related analysis regarding the frequency of mutations number, Oceania has the least frequency of samples that have one or more than one mutations in their AASs. Also, the top rank frequency of no mutant samples is dedicated to this area. In this way, the data regarding Oceania with 27,270 AASs comprising of no mutations in 96.2923% of sequences, one mutation in 3.6379%, and two mutations in 0.0696% of AASs, but in the examined AASs from this continent, more than two mutations were not detected. Europe was labeled the top rank frequency of one and two mutations in samples compared to other regions. Despite that, Europe has the least frequency of no mutant samples. Data related to Europe indicates 2931,894 AASs represent that no mutation was detected in 92.5615% of sequences, although one mutation in 7.1954% and two mutations in 0.2374% of their AASs were detected.

In Africa, we observed a high frequency of three or more than three mutations in samples. This continent with 31,907 AASs data revealed three mutations in 0.0094% of AASs, and more than four mutations in 0.0094% of AASs represented. The details about these frequencies are represented in Fig. 3A.

According to the high percentage of no mutant regions on NSP1 AASs in all geographical areas, the data demonstrate NSP1 as a conserved region that was preserved against a high replication and high mutation rate allowing RNA viruses to escape the immune system during the evolution (Hadj Hassine et al., 2022). The mutation frequency was determined based on the number of mutations in each segment relative to the total mass. The worldwide results depicted that the highest mutation frequency occurred in the region of 72 to 90 AA (0.0990%) and then in the districts of 108 to 126 AA (0.0549%). More details about the frequency of mutations in each geographical area and the location where they belong separately were mentioned in Fig. 3B and Supplementary file2.

3.2. Mutation's specificities according to geographical areas

More detail to survey the NSP1 AASs mutations, particularly the location of mutations in the protein structure and their frequency were investigated between January 2020 to January 2022. The first five frequent mutations regardless of geographical distribution were represented in Table 1.

Table 1.

NSP1 top five frequent mutations from January 2020 to January 2022.

Rank Residue Frequency (%)
Top 1 E(87)D 1.1754
Top 2 H(110)Y 0.3100
Top 3 R(24)C 0.2937
Top 4 S(100)N 0.1148
Top 5 E(37)D 0.1104

The statistical incidence of these five mutations based on the continents is listed in Table 2. The significant point is all of these mutations didn't appear among continents as the top mutations.

Table 2.

The incidence of the global NSP1 top five mutations is based on the continents.

Residue Variant frequency (%)
North America South America Europe Asia Oceania Africa
E(87)D 0.3475 0.1021 2.0835 0.1243 0.0806 0.1441
H(110)Y 0.3051 0.3862 0.3222 0.1946 1.5219 0.5077
R(24)C 0.3421 0.2229 0.2628 0.2001 0.1796 0.1723
S(100)N 0 0 0.1982 0 0 0
E(37)D 0.0728 0.1344 0.1480 0 0.0147 0.0752

Due to the results of our analysis, E87D in NSP1 was labeled the most frequent mutation up until January 2022 with a frequency rate of E87D at 1.1754%. Among these mutations, the E87D mutation was present on all continents except South America, as one of the top mutations. Therefore, Europe by 2.0835% of frequency is the highest, and Oceania by 0.0806% of frequency is the least ranked for E87D mutation. H110Y mutation has been observed as one of the top mutations with the highest frequency in Oceania and the least frequency in Asia by 1.5219% and 0.1946% of frequency respectively. R24C mutation has been remarked in North America as the highest prevalent mutation and in Africa as the least frequent mutation among the top mutations with 0.3421% and 0.1723% of frequency respectively. S100N mutation was detected only in Europe by 0.1982% frequency among the top mutations. E37D mutation has been shown in North America, South America, and Europe by 0.0728%, 0.1344%, and 0.1480% of frequency respectively.

In North America, R24C, E87D, and H110Y are the top three mutations that have more frequency than others, respectively. Also, there is more variety in the type of amino acid to which it is converted in the last ranks of mutation. South America has a different pattern in the frequency of the high-rank mutations. In South America, H110Y was labeled the first frequent mutation.

Regarding the Europe data, E87D is the first rank among mutations and there is a significant difference between the frequency of E87D and other mutations that have less frequency. This pattern has the most matches in the global ranking. In Asia, E148G is the most frequent mutation by 0.8849% of frequency that has a considerable discrepancy with other mutations. Indeed in the 148th position of AAs, glutamic acid (E) replaces Glycine (G). Oceania especially has a remarkable role in enhancing the H110Y mutation global rate by 1.5219% of frequency that should be considered. In Africa, E102Q is the first frequent mutation. In this mutation, glutamic acid converts to glutamine with 0.9120% of frequency, and also lysine replaces glutamic acid with 0.5391% of frequency causing structural modification in AAs.

The global distribution in mutation rates represents that the NSP1 region is one of the conserved regions in the virus genome during evolution. In diverse geographical distributions, different percentages of mutations were observed that may lead to significant impacts on mortality rate, drug resistance or vaccine escape, and severity of the disease (McLean et al., 2022; Mohammed, 2021). The top five mutations of each geographical area and which amino acids they have been replaced are shown in Fig. 4 based on the logarithm 10 of data frequency in the percentage of substituted AAs. The details about other happened mutations are available in the Supplementary file3.

Fig 4.

Fig 4

NSP1 top ten mutations with the highest frequency globally and geographic areas including North America, South America, Europe, Asia, Oceania, and Africa. The position of changed amino acids and substituted ones is shown differently based on the logarithm 10 of data frequency in the percentage of substituted AAs. The mutation frequency was evaluated for each of them by normalizing the number of genomes carrying a given mutation in an intended geographic area.

3.3. Evolutionary assessment of happened mutations in the position of AAs of top five mutations according to time and geographical regions

To detect the incidence of each mutation, AASs of each geographic area were analyzed over time by classifying them according to the month of sample collection from January 2020 until January 2022, as represented in the GISAID database (Elbe and Buckland‐Merrett, 2017; Khare et al.; Shu and McCauley, 2017).

The continuation of mutations was investigated and collected monthly. In the following, fluctuations with a prevalence rate higher than 0.01 per AASs were reported. In the global timeline, two fluctuations of mutations that were incident in the R24 and E87 positions are more significant than the prevalence of other mutations. R24 had increased at the beginning of the pandemic which had a higher rate than other mutations and also had a peak in May 2020 to almost July 2020. On the other hand, the peak of E87 mutation started in May 2021.

Three high slope peaks of D75 (February 2020 by the highest rate 0.03), and R24 (May 2020, by the highest rate 0.02) have occurred in North America. D144 mutation peak incidence in this area is different from others and it was prevalent with less slope than the start in March 2020 and raised in September 2020 to 0.02 frequency. In South America, I114 has three continuous peaks between July 2020 and February 2021. V86 peaked in April 2021 up to 0.01 rates. In the last month of study in this area, we detected raising the frequency of V84.

In Europe, particularly mutations that belong to the E87 position were increased and raised to a 0.03 frequency rate in July 2021. The frequency was overtaking during four months and got close to 0.04 rate in November 2021, the last month of the study. The timeline of Asia illustrated a peak of E148 in October 2021 with 0.09 frequency. Oceania has a peak of H110 mutation in May 2021 with the highest rate of 0.2. E102, the top mutation in Africa, reached a rate of 0.18 in November 2020. Detailed distribution of the five ranks in high mutation rates of NSP1 variants globally and on each continent is provided by the month of sample collection and displayed in Fig. 5. The details about other peaks of mutations that happened during the study timeline are also available in Supplementary file4.

Fig 5.

Fig 5

Plots time evolution pathways of top five high-rate mutations of NSP1 globally and in different geographic areas including North America, South America, Europe, Asia, Oceania, and Africa. The data was computed as the number of AASs having a given mutation over the total number of AASs according to the month of sample collection.

3.4. Variant of concern by the E102K mutation

The E102K mutation in the N-terminal of NSP1 results in substitutes glutamic acid (E) for lysine (K), a replacement from negative to positive charge. This positive charge may increase the affinity of NSP1 for negatively charged RNA. The noticed change can affect the replication rate and preferential expression of viral RNA in the infected cell and increase them. The E102K mutation that reported as the top mutation in Africa, was founded in the Tanzanian variant (Hadj Hassine et al., 2022).

3.5. Assessment of E87D mutation on dynamicity and flexibility of NSP1

To unveil the effect of E87D mutation on the structure of NSP1 protein, we used the DynaMut website for protein modeling. The variation in vibrational entropy energy (∆∆SvibENCoM) between wild and mutant types was calculated. The obtained data represent that the mutation at E87D increases the molecule flexibility on the NSP1 protein structure by a value 0.569 kcal.mol−1.K  1. On the other hand, binding affinity change caused by the alteration between glutamate and aspartate in position 87 can result in destabilizing the structure of NSP1 protein by a value −0.624 kcal/mol.

Investigation on the effects of this alteration in amino acids intramolecular interactions can give us a vision of the reason for destabilization in the protein structure after mutation. Although both glutamate and aspartate are brønsted base and have a negative net charge in mutant type, by deleting a CH2 in the structure of residue, the three hydrogen bonds that made the protein more stable in wild-type were eliminated (Fig. 6).

Fig 6.

Fig 6

Impact of E87D mutation on structural dynamics of the NSP1 protein. Amino acids are colored according to the vibrational entropy change upon mutation. The red color represents a gain in flexibility of NSP1 protein. Wild-type and mutant residues are colored in light-green and are also represented as sticks alongside the surrounding residues which are involved in any type of interaction.

4. Discussion

SARS-CoV-2, as RNA viruses are more susceptible to mutations than DNA viruses(Pachetti et al., 2020; Rahimian et al., 2022). NSP1 is known as a critical virulent factor with significant effects on the virus-host interaction interface, such as inhibiting host mRNA translation (Lokugamage et al., 2015; Narayanan et al., 2015), antagonizing interferon (IFN) signaling (Wathelet et al., 2007), and inducing inflammatory cytokines and chemokines (Law et al., 2007).

Furthermore, some of these effects on NSP1 are conserved in other β-CoVs and even α-CoVs. Given the very likely structural similarity between SARS-CoV-2 NSP1 and SARS-CoV NSP1, the information of SARS-CoV NSP1 functions could be highly helpful for understanding the biological and pathological roles of NSP1 in SARS-CoV-2 (Min et al., 2020).

This study was designed to analyze all mutations that occurred between January 2020 to January 2022, and regional patterns of mutations on NSP1 detected in the following six regions; North and South America, Europe, Asia, Oceania, and Africa. A recent investigation demonstrated the effects of tirilazad, phthalocyanine, and zk-806,450, which showed lower energy scores compared to alisporivir and cyclosporine, two compounds with in vitro activity against NSP1 that may have higher inhibition effectiveness (de Lima Menezes and da Silva, 2021). Hence, it is expected inhibiting NSP1 activity in the virus by these compounds leads to resuming host translation normally.

Esculin is a glucoside and naturally occurs in barley, and horse chestnut. It is given to improve capillary permeability and fragility and has been reported to inhibit collagenase and hyaluronidase enzymes. The molecule has been shown to have antioxidant and anti-inflammatory activity (Wishart et al., 2018). Esculin interacts mainly with R62, S63, A68, H72, and M74 (major interacting residues in the docking pose) through H-bond interactions with esculin. According to the preformed analysis, these interactions are located in the hotspot region of NSP1 that cause the influence of esculin on NSP1 protein function. This suggests the ability of esculin to not only inhibit NSP1 activity but also play a role against secondary symptoms such as inflammation. Hence, designing a drug against NSP1 in addition to other methods for treatment of COVID-19 likely gives a hopeful insight into possible control and finally eradicating the disease (Sharma et al., 2020).

As for previous studies, E87D mutation is located on the globular domain of in vitro protein that can affect the stability and flexibility of the virus. Therefore, this mutation can help in gaining flexibility and destabilization in the protein structure. Based on a recent study, a decrease in molecular flexibility was detected in H110Y NSP1 as the second frequent mutation and D75E which is one of the less frequent mutants. When the R24C, as the third frequent mutation, is happening in NSP1, flexibility was changed in the region containing AA residues E65, L64, Q63, P62, C24, D16, and V14. Similarly, in some less frequent mutants like D48G NSP1, an increase in molecular flexibility was detected at AA residues S40, R43, Q44, K47, and G48. These alerts in AAS might have effects on its natural function in terms of blocking host mRNA translation and evading the immune system (Mou et al., 2021).

Different reasons involve natural selection in determining the fate of mutations in each continent (Sanjuán and Domingo-Calap, 2021). Depending on conditions and every factor that affects the viral genome, different mutations with different symptoms in patients are possible. Mutations that have enough potential of resistance against the host immune system, can adapt to the conditions and then, increase proliferation, and facilitate virus transmission.

Based on our results, there are some less frequent mutations such as S100N, E37D, V28I, K120N, and P62S in NSP1. But due to evolutionary factors, based on our results, there are some less frequent mutations such as S100N, E37D, V28I, K120N, and P62S in NSP1. However, there is no significant increase in the frequency of these mutations, while the frequency of obvious mutation E87D, notably has increased. The top five mutations position on AASs of NSP1 protein were shown in Fig. 7.

Fig 7.

Fig 7

3D structure of NSP1 protein. The positions of top five frequent mutation on the protein structure are displayed in different colors (E87, R24, H110, S100, E37).

The tissue and cell expression patterns of known SARS-CoV-2 interacting human proteins, based on transcriptomics and antibody-based proteomics in the Human Protein Atlas database (https://www.proteinatlas.org/humanproteome/sars-cov-2) helped us to find the human genes that are related to NSP1 of SARS-CoV-2. COLGALT1, PKP2, POLA1, POLA2, PRIM1, and PRIM2 are the genes that encode collagen beta(1-O)galactosyltransferase 1, plakophilin 2, the catalytic subunit of DNA polymerase alpha 1, the accessory subunit of DNA polymerase alpha 2, DNA primase subunit 1, and 2 respectively. The products of these genes have low immune cell specificity and commonly medium tissue expression. Generally, these genes play a role in DNA replication except COLGALT1 and PKP2. Collagen beta(1-O)galactosyltransferase 1 by acting on collagen glycosylation facilitates the formation of collagen triple helix and plakophilin 2, which may play a role in junctional plaques (Chung et al., 2012; Hennet, 2019; Kim et al., 2020; Novelli et al., 2018; Parry et al., 2020; Toukoki and Gryllos, 2013).

There are two limitations in the present study that should be noted. One of the limitations is focusing only on AASs without considering the nucleotide sequences. Another limitation in this process is that samples reported from Europe had more rates than the data which were reported from other regions, and reciprocally Oceania is the one area that has the least sample rate. Thus, there may be other significant mutations present without extensive sequencing of samples that have not been available in the data.

5. Conclusion

Our findings suggest mutations of SARS-CoV-2 are expanding and help the virus adapt inside of hosts, providing conditions for virus survival and making other mutations in new geographical areas. E87D, H110Y, and R24C mutations increased in the timeline of the study as the first, second, and third frequent mutant sequences, respectively. The effects of these mutations on the flexibility and stabilizing of NSP1 and subsequently on survival features of the virus can have different results in patients that should be considered. To figure out the precise impact of the mutations on the disease, further studies should be done in the future.

Funding

This work was partially supported by the NIH grants 5P30GM114737, 5P20GM103466, 5U54MD007601, 1P20GM139753, 5P30CA071789, 2U54CA14372.

Authors’ contributions

K.R., M.M.S., M.M.B., and A.F. contributed to data collection. Y.D., M.M., and K.R. contribute to study design, K.R., and M.M. design workflow and code, and data analysis. B.M, K.R., and M.M., and A.F. contributed to data visualization. S.S.G. wrote the manuscript. S.T., M.M., D.L.K. and B.M. corrected the manuscript and provided useful comments. M.M., K.R., and B.M. monitored the accuracy of Additional data. B.M. designed graphical contents. Y.D. have final edited and supervised the work.

Data availability

The raw data supporting the conclusions of this article is available in supplementary file(s).

Declaration of competing interest

The authors declare that they have no conflicts of interest that might be relevant to the contents of this manuscript and the research was carried out regardless of commercial or financial relationships that may cause any conflict of interests.

Acknowledgments

The authors thank all of the researchers who have shared genome data openly via the Global Initiative on Sharing All Influenza Data (GISAID).

Footnotes

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.virusres.2022.199016.

Appendix. Supplementary materials

mmc1.xlsx (73.6KB, xlsx)
mmc2.xlsx (80.2KB, xlsx)
mmc3.xlsx (15.5KB, xlsx)
mmc4.xlsx (14.9KB, xlsx)

References

  1. Albahli S., Albattah W. Deep transfer learning for COVID-19 prediction: case study for limited data problems. Curr. Med. Imag. 2021;17(8):973. doi: 10.2174/1573405616666201123120417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Banerjee S., Seal S., Dey R., Mondal K.K., Bhattacharjee P. Mutational spectra of SARS-CoV-2 orf1ab polyprotein and signature mutations in the United States of America. J. Med. Virol. 2021;93(3):1428–1435. doi: 10.1002/jmv.26417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Benedetti F., Snyder G.A., Giovanetti M., Angeletti S., Gallo R.C., Ciccozzi M., Zella D. Emerging of a SARS-CoV-2 viral strain with a deletion in nsp1. J Transl Med. 2020;18(1):1–6. doi: 10.1186/s12967-020-02507-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chung J., Tsai S., James A.H., Thames B.H., Shytle S., Piedrahita J.A. Lack of genomic imprinting of DNA primase, polypeptide 2 (PRIM2) in human term placenta and white blood cells. Epigenetics. 2012;7(5):429–431. doi: 10.4161/epi.19777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. de Lima Menezes G., da Silva R.A. Identification of potential drugs against SARS-CoV-2 non-structural protein 1 (nsp1) J. Biomol. Struct. Dyn. 2021;39(15):5657–5667. doi: 10.1080/07391102.2020.1792992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Dos Santos W.G. Impact of virus genetic variability and host immunity for the success of COVID-19 vaccines. Biomed. Pharmacother. 2021;136 doi: 10.1016/j.biopha.2021.111272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Elbe S., Buckland-Merrett G. Data, disease and diplomacy: gISAID's innovative contribution to global health. Glob. Challenges. 2017;1(1):33–46. doi: 10.1002/gch2.1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Emam M., Oweda M., Antunes A., El-Hadidi M. Positive selection as a key player for SARS-CoV-2 pathogenicity: insights into ORF1ab, S and E genes. Virus Res. 2021 doi: 10.1016/j.virusres.2021.198472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Fooladinezhad, H., Shahidi, M., Mahmanzar, M., Mahdavi, B., Tokhanbigli, S., Sisakht, M.M., Moradi, B., Ganjalikhany, M.R., Rahimian, K., Ganjalikhani-Hakemi, M., 2022. SARS-CoV-2 NSP3, NSP4 and NSP6 mutations and Epistasis during the pandemic in the world: evolutionary trends and natural selections in six continents. medRxiv.
  10. Hadj Hassine I., Ben M'hadheb M., Menéndez-Arias L. Lethal mutagenesis of RNA viruses and approved drugs with antiviral mutagenic activity. Viruses. 2022;14(4):841. doi: 10.3390/v14040841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hennet T. Collagen glycosylation. Curr. Opin. Struct. Biol. 2019;56:131–138. doi: 10.1016/j.sbi.2019.01.015. [DOI] [PubMed] [Google Scholar]
  12. Huang C., Lokugamage K.G., Rozovics J.M., Narayanan K., Semler B.L., Makino S. SARS coronavirus nsp1 protein induces template-dependent endonucleolytic cleavage of mRNAs: viral mRNAs are resistant to nsp1-induced RNA cleavage. PLoS Pathog. 2011;7(12) doi: 10.1371/journal.ppat.1002433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Kamitani W., Huang C., Narayanan K., Lokugamage K.G., Makino S. A two-pronged strategy to suppress host protein synthesis by SARS coronavirus Nsp1 protein. Nat. Struct. Mol. Biol. 2009;16(11):1134–1140. doi: 10.1038/nsmb.1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Kim T.Y., Ji E.S., Lee J.Y., Kim J.Y., Yoo J.S., Szasz A.M., Dome B., Marko-Varga G., Kwon H.J. DNA polymerase alpha subunit B is a binding protein for erlotinib resistance in non-small cell lung cancer. Cancers (Basel) 2020;12(9):2613. doi: 10.3390/cancers12092613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Laha S., Chakraborty J., Das S., Manna S.K., Biswas S., Chatterjee R. Characterizations of SARS-CoV-2 mutational profile, spike protein stability and viral transmission. Infect, Genetic. Evol. 2020;85 doi: 10.1016/j.meegid.2020.104445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Law A.H., Lee D.C., Cheung B.K., Yim H.C., Lau A.S. Role for nonstructural protein 1 of severe acute respiratory syndrome coronavirus in chemokine dysregulation. J. Virol. 2007;81(1):416–422. doi: 10.1128/JVI.02336-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Li G., Zhang L., Du N. Relative synonymous codon usage of ORF1ab in SARS-CoV-2 and SARS-CoV. Genes Genomics. 2021;43(11):1351–1359. doi: 10.1007/s13258-021-01136-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lokugamage K.G., Narayanan K., Huang C., Makino S. Severe acute respiratory syndrome coronavirus protein nsp1 is a novel eukaryotic translation inhibitor that represses multiple steps of translation initiation. J. Virol. 2012;86(24):13598–13608. doi: 10.1128/JVI.01958-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Lokugamage K.G., Narayanan K., Nakagawa K., Terasaki K., Ramirez S.I., Tseng C.-T.K., Makino S. Middle East respiratory syndrome coronavirus nsp1 inhibits host gene expression by selectively targeting mRNAs transcribed in the nucleus while sparing mRNAs of cytoplasmic origin. J. Virol. 2015;89(21):10970–10981. doi: 10.1128/JVI.01352-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Mazhari, S., Alavifard, H., Rahimian, K., Karimi, Z., Mahmanzar, M., Sisakht, M.M., Bitaraf, M., Arefian, E., 2021. SARS-CoV-2 NSP-12 mutations survey during the pandemic in the world.
  21. McLean G., Kamil J., Lee B., Moore P., Schulz T.F., Muik A., Sahin U., Türeci Ö., Pather S. The impact of evolving SARS-CoV-2 mutations and variants on COVID-19 vaccines. MBio. 2022 doi: 10.1128/mbio.02979-21. e02979-02921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Min Y.-.Q., Mo Q., Wang J., Deng F., Wang H., Ning Y.-.J. SARS-CoV-2 nsp1: bioinformatics, potential structural and functional features, and implications for drug/vaccine designs. Front. Microbiol. 2020:2393. doi: 10.3389/fmicb.2020.587317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Mohammed M.E.A. SARS-CoV-2 proteins: are they useful as targets for COVID-19 drugs and vaccines? Curr. Mol. Med. 2021 doi: 10.2174/1566524021666210223143243. [DOI] [PubMed] [Google Scholar]
  24. Mou K., Mukhtar F., Khan M.T., Darwish D.B., Peng S., Muhammad S., Al-Sehemi A.G., Wei D.-.Q. Emerging mutations in Nsp1 of SARS-CoV-2 and their effect on the structural stability. Pathogens. 2021;10(10):1285. doi: 10.3390/pathogens10101285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Nakagawa K., Makino S. Mechanisms of coronavirus Nsp1-mediated control of host and viral gene expression. Cells. 2021;10(2):300. doi: 10.3390/cells10020300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Naqvi A.A.T., Fatima K., Mohammad T., Fatima U., Singh I.K., Singh A., Atif S.M., Hariprasad G., Hasan G.M., Hassan M.I. Insights into SARS-CoV-2 genome, structure, evolution, pathogenesis and therapies: structural genomics approach. Biochimica et Biophysica Acta (BBA)-Mol. Basis Dis. 2020;1866(10) doi: 10.1016/j.bbadis.2020.165878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Narayanan K., Ramirez S.I., Lokugamage K.G., Makino S. Coronavirus nonstructural protein 1: common and distinct functions in the regulation of host and viral gene expression. Virus Res. 2015;202:89–100. doi: 10.1016/j.virusres.2014.11.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Novelli V., Malkani K., Cerrone M. Pleiotropic phenotypes associated with PKP2 variants. Front. Cardiovasc. Med. 2018:184. doi: 10.3389/fcvm.2018.00184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Pachetti M., Marini B., Benedetti F., Giudici F., Mauro E., Storici P., Masciovecchio C., Angeletti S., Ciccozzi M., Gallo R.C. Emerging SARS-CoV-2 mutation hot spots include a novel RNA-dependent-RNA polymerase variant. J. Transl. Med. 2020;18(1):1–9. doi: 10.1186/s12967-020-02344-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Parry D.A., Tamayo-Orrego L., Carroll P., Marsh J.A., Greene P., Murina O., Uggenti C., Leitch A., Káposzta R., Merő G. PRIM1 deficiency causes a distinctive primordial dwarfism syndrome. Genes Dev. 2020;34(21–22):1520–1533. doi: 10.1101/gad.340190.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Rahimian K., Mahmanzar M., Mahdavi B., Arefian E., Kuehu D.L., Deng Y. SARS2Mutant: sARS-CoV-2 Amino-Acid mutation atlas database. bioRxiv. 2022 doi: 10.1093/nargab/lqad037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rubin R. COVID-19 vaccines vs variants—determining how much immunity is enough. JAMA. 2021;325(13):1241–1243. doi: 10.1001/jama.2021.3370. [DOI] [PubMed] [Google Scholar]
  33. Sanjuán R., Domingo-Calap P. Genetic diversity and evolution of viral populations. Encyclopedia of Virology. 2021:53. [Google Scholar]
  34. Schubert K., Karousis E.D., Jomaa A., Scaiola A., Echeverria B., Gurzeler L.-.A., Leibundgut M., Thiel V., Mühlemann O., Ban N. SARS-CoV-2 Nsp1 binds the ribosomal mRNA channel to inhibit translation. Nat. Struct. Mol. Biol. 2020;27(10):959–966. doi: 10.1038/s41594-020-0511-8. [DOI] [PubMed] [Google Scholar]
  35. Sharma A., Tiwari V., Sowdhamini R. Computational search for potential COVID-19 drugs from FDA-approved drugs and small molecules of natural origin identifies several anti-virals and plant products. J. Biosci. 2020;45(1):1–18. doi: 10.1007/s12038-020-00069-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shu Y., McCauley J. GISAID: global initiative on sharing all influenza data–from vision to reality. Eurosurveillance. 2017;22(13):30494. doi: 10.2807/1560-7917.ES.2017.22.13.30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Thoms M., Buschauer R., Ameismeier M., Koepke L., Denk T., Hirschenberger M., Kratzat H., Hayn M., Mackens-Kiani T., Cheng J. Structural basis for translational shutdown and immune evasion by the Nsp1 protein of SARS-CoV-2. Science. 2020;369(6508):1249–1255. doi: 10.1126/science.abc8665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Toukoki C., Gryllos I. PolA1, a putative DNA polymerase I, is coexpressed with PerR and contributes to peroxide stress defenses of group A Streptococcus. J. Bacteriol. 2013;195(4):717–725. doi: 10.1128/JB.01847-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Wang R., Hozumi Y., Yin C., Wei G.-.W. Mutations on COVID-19 diagnostic targets. GenomicsGenomics. 2020;112(6):5204–5213. doi: 10.1016/j.ygeno.2020.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Wathelet M.G., Orr M., Frieman M.B., Baric R.S. Severe acute respiratory syndrome coronavirus evades antiviral signaling: role of nsp1 and rational design of an attenuated strain. J. Virol. 2007;81(21):11620–11633. doi: 10.1128/JVI.00702-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wishart D.S., Feunang Y.D., Guo A.C., Lo E.J., Marcu A., Grant J.R., Sajed T., Johnson D., Li C., Sayeeda Z. DrugBank 5.0: a major update to the DrugBank database for. Nucl. Acids Res. 2018;46(D1):D1074–D1082. doi: 10.1093/nar/gkx1037. 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Xu J., Zhao S., Teng T., Abdalla A.E., Zhu W., Xie L., Wang Y., Guo X. Systematic comparison of two animal-to-human transmitted human coronaviruses: sARS-CoV-2 and SARS-CoV. Viruses. 2020;12(2):244. doi: 10.3390/v12020244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Ye C., Chiem K., Park J.-.G., Oladunni F., Platt R.N., Anderson T., Almazan F., de la Torre J.C., Martinez-Sobrido L. Rescue of SARS-CoV-2 from a single bacterial artificial chromosome. MBio. 2020;11(5) doi: 10.1128/mBio.02168-20. e02168-02120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Zhao K., Ke Z., Hu H., Liu Y., Li A., Hua R., Guo F., Xiao J., Zhang Y., Duan L. Structural basis and function of the N terminus of SARS-CoV-2 nonstructural protein 1. Microbiol Spectr. 2021;9(1) doi: 10.1128/spectrum.00169-21. e00169-00121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Zhou B., Kojima S., Kawamoto A., Fukushima M. COVID-19 pathogenesis, prognostic factors, and treatment strategy: urgent recommendations. J. Med. Virol. 2021;93(5):2694–2704. doi: 10.1002/jmv.26754. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.xlsx (73.6KB, xlsx)
mmc2.xlsx (80.2KB, xlsx)
mmc3.xlsx (15.5KB, xlsx)
mmc4.xlsx (14.9KB, xlsx)

Data Availability Statement

The raw data supporting the conclusions of this article is available in supplementary file(s).


Articles from Virus Research are provided here courtesy of Elsevier

RESOURCES