Abstract
The rapid evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is manifested by the emergence of an ever-growing pool of genetic lineages. The aim of this study was to analyze the genetic variability of SARS-CoV-2 in Jordan, with a special focus on the UK variant of concern. A total of 579 SARS-CoV-2 sequences collected in Jordan were subjected to maximum likelihood and Bayesian phylogenetic analysis. Genetic lineage assignment was undertaken using the Pango system. Amino acid substitutions were investigated using the Protein Variation Effect Analyzer (PROVEAN) tool. A total of 19 different SARS-CoV-2 genetic lineages were detected, with the most frequent being the first Jordan lineage (B.1.1.312), first detected in August 2020 (n = 424, 73.2%). This was followed by the second Jordan lineage (B.1.36.10), first detected in September 2020 (n = 62, 10.7%), and the UK variant of concern (B.1.1.7; n = 36, 6.2%). In the spike gene region, the molecular signature for B.1.1.312 was the non-synonymous mutation A24432T resulting in a deleterious amino acid substitution (Q957L), while the molecular signature for B.1.36.10 was the synonymous mutation C22444T. Bayesian analysis revealed that the UK variant of concern (B.1.1.7) was introduced into Jordan in late November 2020 (mean estimate); four weeks earlier than its official reporting in the country. In Jordan, an exponential increase in COVID-19 cases due to B.1.1.7 lineage coincided with the new year 2021. The highest proportion of phylogenetic clustering was detected for the B.1.1.7 lineage. The amino acid substitution D614G in the spike glycoprotein was exclusively present in the country from July 2020 onwards. Two Jordanian lineages dominated infections in the country, with continuous introduction/emergence of new lineages. In Jordan, the rapid spread of the UK variant of concern should be monitored closely. The spread of SARS-CoV-2 mutants appeared to be related to the founder effect; nevertheless, the biological impact of certain mutations should be further investigated.
Keywords: COVID-19, epidemiology, middle-income country, variant, mutation, UK variant, variant of concern
1. Introduction
The evolutionary analysis of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is appealing for several reasons.
First, this novel virus harbours a ribonucleic acid (RNA) genome, with replication using RNA-dependent RNA polymerase. This replicase enzyme has a minimal proofreading activity; the hallmark of rapidly-evolving viruses (e.g., influenza virus and hepatitis C virus) [1,2].
In addition, the pandemic nature of coronavirus disease 2019 (COVID-19), with more than 100 million detected cases so far, translates into a huge pool of susceptible hosts with varying selective pressure on the viral genome [3,4]. This resulted in rapid divergence of the novel virus from its common ancestor that crossed the species barrier, accompanied by a noticeable genetic diversity in less than a year [5,6,7].
Moreover, the evolutionary analysis of viruses is helpful for epidemiologic purposes [6,8]. Characterization of SARS-CoV-2 genetic variants (monophyletic clades designated as genetic lineages herein) is valuable to track the dynamics of its introduction and dissemination within a certain region [6,9]. Additionally, a consensus on SARS-CoV-2 nomenclature and classification is invaluable for correlating the genetic diversity of the virus with potential biological differences (e.g., antigenicity, transmissibility, virulence) [5,10].
Besides the aforementioned points, phylogenetic studies on SARS-CoV-2 are facilitated by the burgeoning availability of genetic sequences of the virus, as a result of advancement in sequencing technology [5]. As of 3 February 2021; the total number of full-genome SARS-CoV-2 sequences exceeded 400,000 in the global science initiative and primary source for genomic data of influenza viruses (GISAID) [11].
To characterize the genetic lineages of SARS-CoV-2, a comprehensive and dynamic nomenclature/classification scheme was proposed by Rambaut et al. and was referred to as “Pango nomenclature system (from the first-person Latin verb meaning ‘I set’, ‘I fix’ or ‘I record’)” [5,12].
In the Pango system, the following notable lineages have been described so far: (1) lineage A, which dates back to December 2019, and contains the root of COVID-19 pandemic; (2) lineage B, which has Chinese sequences in its base, with branching indicating global exports; (3) lineage B.1, which dates back to January 2020 corresponding to the Italian outbreak; (4) lineage B.1.1.7 (a.k.a. the UK variant of concern) that was detected in the UK in September 2020 with N501Y and P681H as the most notable amino acid substitutions; (5) lineage B.1.351 which dates back to December 2020 (the South African variant) with N501Y, K417N, and E484K as the most notable amino acid substitutions; and (6) lineage P.1 (the Brazilian variant) that was first detected in December 2020 with N501Y, and E484K as the most notable amino acid substitutions [5,13,14,15,16,17].
Amino acid substitutions have been described in SARS-CoV-2 isolates across the world, with possible biological value pending further evidence [18,19,20,21]. Substitutions in the surface (spike) glycoprotein is of particular concern considering its importance in receptor binding and being a target for neutralizing antibodies [22]. The most notable example of such substitutions is the replacement of aspartic acid by glycine at position 614 of the spike glycoprotein of the virus (D614G). The predominance of such substitution was observed in various regions including the Middle East and North Africa [23,24]. Another amino acid substitutions with potential significance include N501Y and E484K in the receptor binding domain of the spike glycoprotein [18,25].
In Jordan, the COVID-19 epidemic went through several phases. In March 2020, mitigation measures took place including travel restrictions, wide lockdowns and curfew, enforcement of masks and social distancing, and prohibition of large gatherings [26,27]. This resulted in clusters of cases during the first five months. However, the inevitable SARS-CoV-2 dissemination started in August/September 2020 when the first wave of community transmission took place [27,28]. Following the peak in active cases and mortality in November 2020, a decline in the number of newly diagnosed cases and deaths was reported in 20 December 2020 and January 2021. The first reporting of B.1.1.7 UK variant of concern dated back to 24 December 2020, and the start of COVID-19 vaccination in the country took place in mid-January 2021 [27]. Several challenges face the country amid the current COVID-19 epidemic including the widespread prevalence of misinformation and conspiracy beliefs, in addition to vaccine hesitancy [26,29,30]. Additionally, the scarcity of molecular and epidemiologic studies on the virus in Jordan can hamper the control efforts in the country.
Thus, the aims of the current study were: (1) to describe the genetic diversity of SARS-CoV-2 in Jordan by analysis of the virus lineages; (2) to analyze the proportion of phylogenetic clustering indicative of local virus spread among the major lineages in the country; and (3) to estimate the timing of the introduction of the UK variant of concern (B.1.1.7) into the country.
2. Results
2.1. Characteristics of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Jordanian Dataset
The total number of GISAID SARS-CoV-2 sequences collected in Jordan and utilized in this study were 579 full-genome sequences. Stratified per month, the number of SARS-CoV-2 sequences collected in the country together with total number of COVID-19 cases and deaths in Jordan are shown in (Table 1). The majority of SARS-CoV-2 sequences were collected in October 2020 (n = 344, 59.4%) and September (n = 103, 17.8%), which coincided with surge in number of newly diagnosed COVID-19 cases in the country.
Table 1.
Month | Newly Diagnosed COVID-19 1 Cases | COVID-19 Related Deaths | Number of SARS-CoV-2 2 Sequences | Percentage of Sequences Compared to New Cases |
---|---|---|---|---|
March 2020 | 274 | 5 | 22 | 8.0292% |
April 2020 | 179 | 3 | 6 | 3.3520% |
May 2020 | 286 | 1 | 0 | 0 |
June 2020 | 393 | 0 | 4 | 1.0178% |
July 2020 | 61 | 2 | 2 | 3.2787% |
August 2020 | 841 | 4 | 15 | 1.7836% |
September 2020 | 9791 | 46 | 103 | 1.0520% |
October 2020 | 60,782 | 768 | 344 | 0.5660% |
November 2020 | 146,823 | 1922 | 43 | 0.0293% |
December 2020 | 75,064 | 1083 | 2 | 0.0027% |
January 2021 | 30,539 | 447 | 38 | 0.1244% |
1 COVID-19: Coronavirus disease 2019; 2 SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.
2.2. Description of the Genetic Lineages of SARS-CoV-2 in Jordan Using the “Pango” System
Using the Pango system, a total of 19 different genetic lineages were found in Jordan. The most common was the first Jordan lineage designated B.1.1.312 (n = 424, 73.2%), followed by the second Jordan lineage designated B.1.36.10 (n = 62, 10.7%), and the UK variant B.1.1.7 (n = 36, 6.2%).
Over a period of 10 months, the predominant lineage shifted from other B lineages during (March–April 2020), to B.1 during (June–July 2020), while the first Jordan lineage B.1.1.312 dominated infections from August 2020 until November 2020, with the concomitant presence of the second Jordan lineage B.1.36.10 over the same period. The last two months (December 2020–January 2021) were dominated by the UK variant B.1.1.7 (Figure 1).
The molecular signature found consistently in the Spike gene region of the first Jordan lineage B.1.1.312 was the replacement of adenine by thymine at position 24,432 (A24432T) of the reference genome NC_045512 (thymine instead of uracil since the results were those of DNA sequencing). This mutation was non-synonymous resulting in the replacement of glutamine (Q) by leucine (L) at position 957 of the spike glycoprotein (Q957L).
The molecular signature in the Spike gene region for the second Jordan lineage B.1.36.10 was C22444T (a synonymous mutation).
Using the Tamura–Nei model, the evolutionary divergence for both B.1.1.312 and B.1.36.10 from the reference SARS-CoV-2 sequence NC_045512, was 0.00064 substitutions/site, while the divergence from the reference sequence was the highest for B.1.1.7 (0.00188 substitutions/site). Assessing within-lineage genetic diversity using the same model revealed the highest diversity within B.1.36.10 sequences (0.00023 substitutions/site, and within B.1.1.312 sequences (0.00022 substitutions/site), while the genetic diversity was the lowest among the UK lineage B.1.1.7 (0.00008 substitutions/site).
2.3. The Proportion of Phylogenetic Clustering among the Three Most Common Lineages in Jordan
To determine sequence clustering among the three most common genetic lineages of SARS-CoV-2 in Jordan, we conducted maximum likelihood (ML) phylogeny construction. Using the Spike gene region, the proportion of phylogenetic clustering was the highest among the B.1.1.7 lineage sequences (35/35, 100.0%) followed by B.1.36.10 sequences (19/62, 30.6%), and B.1.1.312 sequences (126/414, 30.4%).
A higher proportion of phylogenetic clustering for the Jordan lineages was detected using the open reading frame 1ab (ORF1ab) region, with the highest proportion of clustering also seen among the lineage B.1.1.7 sequences (32/35, 91.4%), followed by B.1.36.10 (44/62, 71.0%) and B.1.1.312 (176/401, 43.9%, Figure 2). Please refer to the Materials and Methods section for the explanation of difference in B.1.1.312 number of sequences for the two sub-genomic regions (Supplementary S1).
2.4. Amino Acid Substitutions in the Surface Glycoprotein of the Three Major Genetic Lineages in Jordan
For the three major genetic lineages in Jordan (B.1.1.312; B.1.36.10 and B.1.1.7), an assessment of amino acid substitutions in the spike glycoprotein compared to that in the reference sequence (YP_009724390) was undertaken.
The amino acid substitution D614G was detected in the vast majority of sequences (n = 566, 97.8%), and the wild type (D614) was last identified in June 2020.
The amino acid substitutions N501Y and P681H besides the deletion Δ69/70 were consistently found among the lineage B.1.1.7 sequences, while N501I was detected in a single sequence from the first Jordan lineage B.1.1.312.
The following amino acid substitutions were totally absent from the sequences that were analyzed in this study: K417N and E484K.
Using the Protein Variation Effect Analyzer (PROVEAN) tool, two amino acid substitutions were predicted to be deleterious for the spike glycoprotein: T716I detected among B.1.1.7 sequences and Q957L found in the first Jordan lineage B.1.1.312 (Table 2).
Table 2.
SARS-CoV-2 Lineage | Amino Acid Substitution | PROVEAN 1 Score | Prediction (Cutoff = −2.5) |
---|---|---|---|
UK variant of concern (B.1.1.7) | H69_V70del | 0.808 | Neutral |
V143_Y144del | 1.318 | Neutral | |
N501Y | −0.090 | Neutral | |
A570D | −0.682 | Neutral | |
D614G | 0.598 | Neutral | |
P681H | 0.060 | Neutral | |
T716I | −3.293 | Deleterious | |
S982A | −1.505 | Neutral | |
D1118H | −1.142 | Neutral | |
First Jordan lineage (B.1.1.312) | D614G | 0.598 | Neutral |
Q957L | −2.929 | Deleterious | |
Second Jordan Lineage (B.1.36.10) | D614G | 0.598 | Neutral |
1 Variants with a score equal to or below −2.5 are considered “deleterious,” and variants with a score above −2.5 are considered “neutral” in the Protein Variation Effect Analyzer (PROVEAN) tool.
2.5. The UK Variant of Concern was Introduced into Jordan in Late November 2020
Bayesian analysis of the UK variant of concern (B.1.1.7) lineage, with 35 SARS-COV-2 S sequences collected in Jordan between 24 December 2020 and 6 January 2021 revealed that the time to the most recent common ancestor (tMRCA) of this lineage in Jordan was 21 November 2020 (95% highest posterior density interval: 17 November 2020–24 December 2020). Coalescent analysis using a Bayesian skyline plot showed a rapid exponential increase in the number of effective infections between 1 January 2021 and 5 January 2021 (Figure 3).
3. Discussion
In this study, we utilized molecular clock and coalescent analyses to describe the timeline of introduction of the genetic lineage B.1.1.7—commonly known as the UK variant of concern—and its spread in Jordan. Additionally, we employed the Pango classification system, which facilitates the classification and nomenclature of SARS-CoV-2 genetic lineages, containing molecular signatures that can be helpful to track its introduction/emergence and spread [5]. This approach can be used to evaluate public health measures including control and mitigation practices [31]. The negative consequences of the current COVID-19 pandemic necessitates such in-depth epidemiologic studies, which can be helpful to plan effective preventive strategies [32,33].
The major result of this study revealed that the genetic lineage B.1.1.7 was introduced into Jordan about four weeks earlier than the official reporting of its introduction into the country [27]. Bayesian skyline coalescent analysis showed that the exponential increase in infections as a result of the B.1.1.7 lineage coincided with the new year 2021, following a lag phase of several weeks. It is known that the human behavior can drive a surge in infections if a super spreader event takes place in a large gathering [34,35]. However, this hypothesis needs further evaluation using contact tracing data together with dense sampling to reconstruct the evolutionary history of this lineage in the country.
Despite the need for further evidence regarding the biological significance of B.1.1.7 lineage, several studies reported on the rapid dissemination of this lineage in UK among several other countries [6,16,36,37]. This proposed change in virus behavior can be related to enhanced binding between the spike glycoprotein of this lineage and its receptor; and this enhancement has been proposed to be the result of N501Y amino acid substitution [18,38].
Additionally, we used the Pango classification system to describe the molecular epidemiology of COVID-19 in Jordan [5]. Since the first introduction of the novel coronavirus into humans, the expanding genetic diversity of the virus demanded a scheme to classify and name monophyletic clades, which would facilitate the study of epidemiologic features of the virus including its spread. This would also provide a consensus to study the possible biological significance of such lineages [39,40]. In this study, we adopted the approach conceived by Rambaut et al., that can help in analyzing patterns of introduction and spread of this novel virus in a certain region [5,12].
Community transmission of SARS-CoV-2 in Jordan became apparent in August 2020, and was dominated by three genetic lineages starting with the first and second Jordan lineages (B.1.1.312 and B.1.36.10), and it was recently driven by the UK variant of concern (B.1.1.7.). The emergence/introduction of the two Jordan lineages can be mostly related to a founder effect, since no discernible advantageous or neutral mutations were detected among the two lineages [41,42,43]. The molecular signature of the second Jordan lineage (B.1.36.10) was found in earlier sequences collected in Turkey [44]. This might point to a possibility of introduction of this lineage into Jordan in early September 2020, considering that travelers coming from Turkey (classified as a green country at that time) were not required to be quarantined [45].
One result that should be investigated further is the higher proportion of phylogenetic clustering for the B.1.1.7 lineage compared to the two Jordan lineages. This indicates a higher proportion of domestic transmission, which can be linked to enhanced transmissibility of the virus. However, such a result is pending further evidence to support the current observations linking such a genetic lineage with a higher transmission [37].
In line with several previous studies, genetic analysis of SARS-CoV-2 in Jordan showed the shift into B lineage, harboring the spike D614G amino acid substitution, with all sequences collected in Jordan harboring this substitution from July 2020 onwards [23,24]. This amino acid substitution was present in the country as early as March 2020, which hints to the effects of virus genetic changes on its epidemic behavior, despite the need for further evidence to support such a correlation [21,46,47,48].
Study Strengths and Limitations
The current study used the state-of-the-art phylogenetic inference methods to characterize the molecular epidemiology of SARS-CoV-2 in Jordan. Additionally, this study can be considered among the first studies in the Middle East and North Africa region utilizing the Pango classification system to characterize the genetic diversity of SARS-CoV-2 to the best of our knowledge.
Limitations of this study included potential sampling bias in time, which was manifested by variation in sequencing proportion in relation to new cases diagnosed each month; with 1.3% sequencing rate out of the newly diagnosed cases before October 2020 and 0.1% thereafter.
Another caveat of this study can be the enhanced surveillance of passengers (and their contacts) coming from UK or other countries where the UK variant of concern was reported. This may have caused the dominance of B.1.1.7 lineage among sequences collected in December 2020–January 2021.
4. Materials and Methods
4.1. Compilation of SARS-CoV-2 Jordanian Dataset and Epidemiologic Data
All SARS-CoV-2 genetic sequences that were collected in Jordan were retrieved from GISAID, as of 30 January 2021 [11]. The Jordanian sequences were aligned together with the reference SARS-CoV-2 sequence Wuhan-Hu-1 (accession number: NC_045512). Multiple sequence alignment was undertaken through a multiple alignment program for amino acid or nucleotide sequences (MAFFT v.7) [49].
Data on daily COVID-19 diagnosed cases and deaths in Jordan were retrieved from Coronavirus Source Data, and covered the period from 3 March 2020 to 29 January 2021 [50].
4.2. SARS-CoV-2 Lineage Assignment
To describe the genetic lineages of the sequences in the SARS-CoV-2 Jordanian dataset, we utilized Phylogenetic Assignment of Named Global Outbreak Lineages (Pangolin) [51]. The Pangolin tool follows the ‘Pango’ nomenclature system for classifying SARS-CoV-2 genomic sequences [5,12].
The measurement of within-lineage genetic distances was done using MEGA6, which was also used to detect the following amino acid substitutions/deletions in the spike glycoprotein sequence: D614G, E484K, N501Y, P681H, 69–70del, and K417N [52].
Genetic divergence from the reference sequence of SARS-CoV-2 and within-lineage genetic diversity were assessed using the Tamura–Nei model as implemented in MEGA6 [52,53].
4.3. Assessment Spike Protein of the Major Lineages in Jordan
For the three major SARS-CoV-2 lineages circulating in Jordan (B.1.1.312; B.1.36.10; and B.1.1.7), we used the Protein Variation Effect Analyzer (PROVEAN) tool in order to assess the possible functional changes in the spike glycoprotein compared to that in the reference sequence (YP_009724390) [54].
4.4. Maximum Likelihood Phylogenetic Analysis
To conduct the ML phylogenetic analysis, we used two sub-genomic parts of the dataset: (1) ORF1ab (NC_045512 positions: 266–21,555); with the following Jordanian sequences removed for having long (>10) stretches of ambiguous (N) bases: EPI_ISL_429992; EPI_ISL_429995; EPI_ISL_430008; EPI_ISL_430013; EPI_ISL_450189; EPI_ISL_636390; EPI_ISL_730391; EPI_ISL_730473; EPI_ISL_730545; EPI_ISL_755118; EPI_ISL_755120; EPI_ISL_755121; EPI_ISL_755122; EPI_ISL_755123; EPI_ISL_755124; EPI_ISL_755125; EPI_ISL_755126; EPI_ISL_755127; EPI_ISL_755128; EPI_ISL_755129; EPI_ISL_755131; EPI_ISL_755237; EPI_ISL_755238; EPI_ISL_755239; EPI_ISL_755240; EPI_ISL_755243; EPI_ISL_755247; EPI_ISL_755267; EPI_ISL_878495; which yielded a dataset with 550 Jordanian sequences; (2) Spike S (NC_045512 positions: 21,563–25,384); with the following Jordanian sequences removed for having long (> 10) stretches of ambiguous (N) bases: EPI_ISL_430013; EPI_ISL_450189; EPI_ISL_755118; EPI_ISL_755120; EPI_ISL_755121; EPI_ISL_755123; EPI_ISL_755125; EPI_ISL_755126; EPI_ISL_755128; EPI_ISL_755131; EPI_ISL_878495; EPI_ISL_430008; EPI_ISL_730543; EPI_ISL_730545; which yielded a dataset with 565 Jordanian sequences.
Phylogeny construction for the two sub-genomic Jordanian datasets using the ML approach was done using PhyML v3 [55]. The Smart Model Selection (SMS) was used for selection of the most appropriate nucleotide substitution model, depending on the Akaike Information Criterion (AIC) [56]. Models that were used for construction of ML trees were: GTR + G for ORF1ab; and HKY85 + I for S region.
4.5. Timing of B.1.1.7 Lineage (UK Variant) Introduction into Jordan
To estimate the time to the most recent common ancestor (tMRCA) for the B.1.1.7 lineage in Jordan, we used the Bayesian Markov chain Monte Carlo (MCMC) method in BEAST v1.8.4 [57]. The following criteria were used for Bayesian evolutionary analysis by sampling trees (BEAST) analysis: HKY nucleotide substitution model with discrete gamma-distributed rate heterogeneity, uncorrelated relaxed clock model with a uniform rate prior (initial value of 0.0065) and a Bayesian skyline tree density model [23]. A single run with 200 million chain length was performed, with samples of trees and parameters collected every 20,000 steps after discarding a burn-in of 20%. Convergence was checked for using Tracer v1.6.0. with all parameters having effective sample sizes (ESSs) of >200. Construction of the Bayesian skyline plot was done in Tracer; and assembly of the maximum clade credibility (MCC) tree was done using TreeAnnotator available in BEAST package [57]. Visualization of the trees in this study was undertaken in FigTree [58].
5. Conclusions
In the current study, molecular characterization of SARS-CoV-2 in Jordan was undertaken for the first time to the best of our knowledge. A recent report by Edyth Parker et al investigated the emergence of lineage B.1.1.7 in Jordan and revealed the current dominance of this lineage in Jordan [59]. Two Jordan lineages dominated infections in the country, with a recent introduction of the lineage B.1.1.7. This UK variant of concern was present in the country several weeks before its official reporting, with an exponential propagation over the first few days of the new year 2021.
The introduction of new lineages in the country appeared to be related to founder effect; nevertheless, the biological significance of certain mutations should be further evaluated. An important note should be clarified, which is related to the distinction that should be made between the epidemiologic and contact tracing value of determination of virus lineages as opposed to the identification and characterization of novel strain, subtypes or types of viruses that have distinct biological features. Thus, continuous surveillance of genetic variability of SARS-CoV-2 is recommended to track the emergence of new genetic variants, with subsequent studies of its potential biological significance.
The media hype about the UK variant of concern seems justified considering its rapid spread and the number of amino acid changes detected in the spike glycoprotein of this lineage, which can have important effects on antigenicity and transmissibility. In turn, this can have implications for the current vaccine formulations and resurgence of new waves of infection.
Acknowledgments
We sincerely thank the originating lab (Biolab Diagnostic Laboratories) and the submitting lab (Andersen lab at Scripps research) who have sequenced and shared the full genome data for SARS-CoV-2 in the GISAID database.
Abbreviations
AIC | Akaike Information Criterion |
aLRT-SH | Approximate likelihood Shimodaira-Hasegawa statistical support |
BEAST | Bayesian evolutionary analysis by sampling trees |
COVID-19 | Coronavirus disease 2019 |
D614G | Replacement of aspartic acid by glycine at position 614 of the spike glycoprotein |
del | Deletion |
E484K | Replacement of glutamic acid by lysine at position 484 of the spike glycoprotein |
ESS | Effective sample size |
GISAID | The global science initiative and primary source for genomic data of influenza viruses |
K417N | Replacement of lysine by asparagine at position 417 of the spike glycoprotein |
MCC | Maximum clade credibility |
MCMC | Markov chain Monte Carlo |
ML | Maximum likelihood |
N501Y | Replacement of asparagine by tyrosine at position 501 of the spike glycoprotein |
Ne | Effective population size |
ORF | Open reading frame |
P681H | Replacement of proline by histidine at position 681 of the spike glycoprotein |
PROVEAN | Protein Variation Effect Analyzer tool |
RNA | Ribonucleic acid |
S | Spike gene of SARS-CoV-2 |
SARS-CoV-2 | Severe acute respiratory syndrome coronavirus 2 |
tMRCA | Time to the most recent common ancestor |
UK | United Kingdom |
Supplementary Materials
The following are available online at https://www.mdpi.com/2076-0817/10/3/302/s1. Supplementary S1: Detailed maximum likelihood phylogenetic trees for the Jordanian SARS-CoV-2 sequences.
Author Contributions
Conceptualization, M.S. and A.M.; methodology, M.S. and A.M.; validation, M.S.; formal analysis, M.S.; investigation, M.S. and A.M.; data curation, M.S. and A.M.; writing—original draft preparation, M.S.; writing—review and editing, M.S. and A.M.; visualization, M.S.; supervision, M.S. and A.M.; project administration, M.S. and A.M.; All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data authors can be contacted directly via GISAID website: https://www.gisaid.org/. The da-tasets analysed during the current study (ML analyses files, xml files without sequences, Tracer log files) are available from the corresponding author (M.S.) on a reasonable request and considering the terms of use by GISAID.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Duffy S., Shackelton L.A., Holmes E.C. Rates of evolutionary change in viruses: Patterns and determinants. Nat. Rev. Genet. 2008;9:267–276. doi: 10.1038/nrg2323. [DOI] [PubMed] [Google Scholar]
- 2.Sevajol M., Subissi L., Decroly E., Canard B., Imbert I. Insights into RNA synthesis, capping, and proofreading mechanisms of SARS-coronavirus. Virus Res. 2014;194:90–99. doi: 10.1016/j.virusres.2014.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Worldometer COVID-19 Coronavirus Pandemic. [(accessed on 31 January 2021)]; Available online: https://www.worldometers.info/coronavirus/
- 4.Sironi M., Hasnain S.E., Rosenthal B., Phan T., Luciani F., Shaw M.A., Sallum M.A., Mirhashemi M.E., Morand S., Gonzalez-Candelas F., et al. SARS-CoV-2 and COVID-19: A genetic, epidemiological, and evolutionary perspective. Infect. Genet. Evol. 2020;84:104384. doi: 10.1016/j.meegid.2020.104384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rambaut A., Holmes E.C., O’Toole A., Hill V., McCrone J.T., Ruis C., du Plessis L., Pybus O.G. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020;5:1403–1407. doi: 10.1038/s41564-020-0770-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.du Plessis L., McCrone J.T., Zarebski A.E., Hill V., Ruis C., Gutierrez B., Raghwani J., Ashworth J., Colquhoun R., Connor T.R., et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science. 2021;371:708–712. doi: 10.1126/science.abf2946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Koyama T., Platt D., Parida L. Variant analysis of SARS-CoV-2 genomes. Bull. World Health Organ. 2020;98:495. doi: 10.2471/BLT.20.253591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pybus O.G., Tatem A.J., Lemey P. Virus evolution and transmission in an ever more connected world. Proc. Biol. Sci. 2015;282:20142878. doi: 10.1098/rspb.2014.2878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Nie Q., Li X., Chen W., Liu D., Chen Y., Li H., Li D., Tian M., Tan W., Zai J. Phylogenetic and phylodynamic analyses of SARS-CoV-2. Virus Res. 2020;287:198098. doi: 10.1016/j.virusres.2020.198098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tang X., Wu C., Li X., Song Y., Yao X., Wu X., Duan Y., Zhang H., Wang Y., Qian Z. On the origin and continuing evolution of SARS-CoV-2. Natl. Sci. Rev. 2020;7:1012–1023. doi: 10.1093/nsr/nwaa036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Elbe S., Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob. Chall. 2017;1:33–46. doi: 10.1002/gch2.1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rambaut A., Holmes E.C., O’Toole A., Hill V., McCrone J.T., Ruis C., du Plessis L., Pybus O.G. Addendum: A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2021;6:415. doi: 10.1038/s41564-021-00872-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wise J. Covid-19: New coronavirus variant is identified in UK. BMJ. 2020;371:m4857. doi: 10.1136/bmj.m4857. [DOI] [PubMed] [Google Scholar]
- 14.Kupferschmidt K. New mutations raise specter of ‘immune escape’. Am. Assoc. Adv. Sci. 2021;371:329–330. doi: 10.1126/science.371.6527.329. [DOI] [PubMed] [Google Scholar]
- 15.Faria N.R., Morales Claro I., Candido D., Moyses Franco L.A., Andrade P.S., Coletti T.M., Silva C.A.M., Sales F.C., Manuli E.R., Aguiar R.S. Genomic Characterisation of an Emergent SARS-CoV-2 Lineage in Manaus: Preliminary Findings. [(accessed on 3 February 2021)]; Available online: https://virological.org/t/genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-manaus-preliminary-findings/586.
- 16.Volz E., Mishra S., Chand M., Barrett J.C., Johnson R., Geidelberg L., Hinsley W.R., Laydon D.J., Dabrera G., O’Toole Á., et al. Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from linking epidemiological and genetic data. medRxiv. 2021 doi: 10.1101/2020.12.30.20249034. [DOI] [Google Scholar]
- 17.Tegally H., Wilkinson E., Giovanetti M., Iranzadeh A., Fonseca V., Giandhari J., Doolabh D., Pillay S., San E.J., Msomi N., et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv. 2020 doi: 10.1101/2020.12.21.20248640. [DOI] [Google Scholar]
- 18.Luan B., Wang H., Huynh T. Molecular Mechanism of the N501Y Mutation for Enhanced Binding between SARS-CoV-2’s Spike Protein and Human ACE2 Receptor. bioRxiv. 2021 doi: 10.1101/2021.01.04.425316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yurkovetskiy L., Wang X., Pascal K.E., Tomkins-Tinch C., Nyalile T.P., Wang Y., Baum A., Diehl W.E., Dauphin A., Carbone C., et al. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant. Cell. 2020;183:739–751. doi: 10.1016/j.cell.2020.09.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Plante J.A., Liu Y., Liu J., Xia H., Johnson B.A., Lokugamage K.G., Zhang X., Muruato A.E., Zou J., Fontes-Garfias C.R., et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature. 2020 doi: 10.1038/s41586-020-2895-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.van Dorp L., Richard D., Tan C.C., Shaw L.P., Acman M., Balloux F. No evidence for increased transmissibility from recurrent mutations in SARS-CoV-2. Nat. Commun. 2020;11:1–8. doi: 10.1038/s41467-020-19818-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Robson B. COVID-19 Coronavirus spike protein analysis for synthetic vaccines, a peptidomimetic antagonist, and therapeutic drugs, and analysis of a proposed achilles’ heel conserved region to minimize probability of escape mutations and drug resistance. Comput. Biol. Med. 2020;121:103749. doi: 10.1016/j.compbiomed.2020.103749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sallam M., Ababneh N.A., Dababseh D., Bakri F.G., Mahafzah A. Temporal increase in D614G mutation of SARS-CoV-2 in the Middle East and North Africa. Heliyon. 2021;7:e06035. doi: 10.1016/j.heliyon.2021.e06035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Korber B., Fischer W.M., Gnanakaran S., Yoon H., Theiler J., Abfalterer W., Hengartner N., Giorgi E.E., Bhattacharya T., Foley B., et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell. 2020;182:812–827. doi: 10.1016/j.cell.2020.06.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Voloch C.M., Silva F.R.D., de Almeida L.G.P., Cardoso C.C., Brustolini O.J., Gerber A.L., Guimarães A.P.d.C., Mariani D., Costa R.M.d., Ferreira O.C., et al. Genomic characterization of a novel SARS-CoV-2 lineage from Rio de Janeiro, Brazil. medRxiv. 2020 doi: 10.1101/2020.12.23.20248598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sallam M., Dababseh D., Yaseen A., Al-Haidar A., Taim D., Eid H., Ababneh N.A., Bakri F.G., Mahafzah A. COVID-19 misinformation: Mere harmless delusions or much more? A knowledge and attitude cross-sectional study among the general public residing in Jordan. PLoS ONE. 2020;15:e0243264. doi: 10.1371/journal.pone.0243264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ministry of Health, The Hashemite Kingdom of Jordan COVID-19 Updates in Jordan. [(accessed on 3 February 2021)]; Available online: https://corona.moh.gov.jo/en.
- 28.Reuters Jordan to Close Border with Syria after Spike in COVID-19 Cases. [(accessed on 3 February 2021)]; Available online: https://www.reuters.com/article/idUSL1N2FE0IK.
- 29.Sallam M., Dababseh D., Eid H., Al-Mahzoum K., Al-Haidar A., Taim D., Yaseen A., Ababneh N.A., Bakri F.G., Mahafzah A. High Rates of COVID-19 Vaccine Hesitancy and Its Association with Conspiracy Beliefs: A Study in Jordan and Kuwait among Other Arab Countries. Vaccines. 2021;9:42. doi: 10.3390/vaccines9010042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Sallam M., Dababseh D., Yaseen A., Al-Haidar A., Ababneh N.A., Bakri F.G., Mahafzah A. Conspiracy Beliefs Are Associated with Lower Knowledge and Higher Anxiety Levels Regarding COVID-19 among Students at the University of Jordan. Int. J. Environ. Res. Public Health. 2020;17:4915. doi: 10.3390/ijerph17144915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lam T.T., Hon C.C., Tang J.W. Use of phylogenetics in the molecular epidemiology and evolutionary studies of viral infections. Crit. Rev. Clin. Lab. Sci. 2010;47:5–49. doi: 10.3109/10408361003633318. [DOI] [PubMed] [Google Scholar]
- 32.Scudiero O., Lombardo B., Brancaccio M., Mennitti C., Cesaro A., Fimiani F., Gentile L., Moscarella E., Amodio F., Ranieri A., et al. Exercise, Immune System, Nutrition, Respiratory and Cardiovascular Diseases during COVID-19: A Complex Combination. Int. J. Environ. Res. Public Health. 2021;18:904. doi: 10.3390/ijerph18030904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Park M., Cook A.R., Lim J.T., Sun Y., Dickens B.L. A Systematic Review of COVID-19 Epidemiology Based on Current Evidence. J. Clin. Med. 2020;9:967. doi: 10.3390/jcm9040967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chang S., Pierson E., Koh P.W., Gerardin J., Redbird B., Grusky D., Leskovec J. Mobility network models of COVID-19 explain inequities and inform reopening. Nature. 2021;589:82–87. doi: 10.1038/s41586-020-2923-3. [DOI] [PubMed] [Google Scholar]
- 35.Koizumi N., Siddique A.B., Andalibi A. Assessment of SARS-CoV-2 transmission among attendees of live concert events in Japan using contact-tracing data. J. Travel Med. 2020;27:taaa096. doi: 10.1093/jtm/taaa096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Leung K., Shum M.H., Leung G.M., Lam T.T., Wu J.T. Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom, October to November 2020. Eurosurveillance. 2021;26:2002106. doi: 10.2807/1560-7917.ES.2020.26.1.2002106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Graham M.S., Sudre C.H., May A., Antonelli M., Murray B., Varsavsky T., Kläser K., Canas L.S., Molteni E., Modat M., et al. The effect of SARS-CoV-2 variant B.1.1.7 on symptomatology, re-infection and transmissibility. medRxiv. 2021 doi: 10.1101/2021.01.28.21250680. [DOI] [Google Scholar]
- 38.Makowski L., Olson-Sidford W., W-Weisel J. Biological and Clinical Consequences of Integrin Binding via a Rogue RGD Motif in the SARS CoV-2 Spike Protein. Viruses. 2021;13:146. doi: 10.3390/v13020146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lauring A.S., Hodcroft E.B. Genetic Variants of SARS-CoV-2-What Do They Mean? JAMA. 2021;325:529–531. doi: 10.1001/jama.2020.27124. [DOI] [PubMed] [Google Scholar]
- 40.Callaway E. The coronavirus is mutating-does it matter? Nature. 2020;585:174–177. doi: 10.1038/d41586-020-02544-6. [DOI] [PubMed] [Google Scholar]
- 41.Diez-Fuertes F., Iglesias-Caballero M., Garcia-Perez J., Monzon S., Jimenez P., Varona S., Cuesta I., Zaballos A., Jimenez M., Checa L., et al. A Founder Effect Led Early SARS-CoV-2 Transmission in Spain. J. Virol. 2021;95:e01583-20. doi: 10.1128/JVI.01583-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ruan Y., Luo Z., Tang X., Li G., Wen H., He X., Lu X., Lu J., Wu C.-I. On the founder effect in COVID-19 outbreaks: How many infected travelers may have started them all? Natl. Sci. Rev. 2020;8:nwaa246. doi: 10.1093/nsr/nwaa246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Farkas C., Fuentes-Villalobos F., Garrido J.L., Haigh J., Barria M.I. Insights on early mutational events in SARS-CoV-2 virus reveal founder effects across geographical regions. PeerJ. 2020;8:e9255. doi: 10.7717/peerj.9255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.DemIr A.B., Benvenuto D., AbacioGlu H., Angeletti S., Ciccozzi M. Identification of the nucleotide substitutions in 62 SARS-CoV-2 sequences from Turkey. Turk. J. Biol. 2020;44:178–184. doi: 10.3906/biy-2005-69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Roya News Updated List of Green, Yellow and Red Countries Eligible to Travel to and from Jordan. [(accessed on 3 February 2021)]; Available online: https://en.royanews.tv/news/22149/Updated-list-of-green-yellow-and-red-countries-eligible-to-travel-to-and-from-Jordan.
- 46.Maitra A., Sarkar M.C., Raheja H., Biswas N.K., Chakraborti S., Singh A.K., Ghosh S., Sarkar S., Patra S., Mondal R.K., et al. Mutations in SARS-CoV-2 viral RNA identified in Eastern India: Possible implications for the ongoing outbreak in India and impact on viral structure and host susceptibility. J. BioSci. 2020;45:76. doi: 10.1007/s12038-020-00046-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhang L., Jackson C.B., Mou H., Ojha A., Peng H., Quinlan B.D., Rangarajan E.S., Pan A., Vanderheiden A., Suthar M.S., et al. SARS-CoV-2 spike-protein D614G mutation increases virion spike density and infectivity. Nat. Commun. 2020;11:6013. doi: 10.1038/s41467-020-19808-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Volz E., Hill V., McCrone J.T., Price A., Jorgensen D., O’Toole A., Southgate J., Johnson R., Jackson B., Nascimento F.F., et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell. 2021;184:64–75. doi: 10.1016/j.cell.2020.11.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Rozewicki J., Li S., Amada K.M., Standley D.M., Katoh K. MAFFT-DASH: Integrated protein sequence and structural alignment. Nucleic Acids Res. 2019;47:W5–W10. doi: 10.1093/nar/gkz342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ritchie H. Our World in Data: Coronavirus Source Data. [(accessed on 30 January 2021)]; Available online: https://ourworldindata.org/coronavirus-source-data.
- 51.O’Toole Á., McCrone J.T., Scher E. Phylogenetic Assignment of Named Global Outbreak LINeages (Pangolin COVID-19 Lineage Assigner) [(accessed on 30 January 2021)]; Available online: https://pangolin.cog-uk.io/
- 52.Tamura K., Stecher G., Peterson D., Filipski A., Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 2013;30:2725–2729. doi: 10.1093/molbev/mst197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Tamura K., Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol. Biol. Evol. 1993;10:512–526. doi: 10.1093/oxfordjournals.molbev.a040023. [DOI] [PubMed] [Google Scholar]
- 54.Choi Y., Sims G.E., Murphy S., Miller J.R., Chan A.P. Predicting the functional effect of amino acid substitutions and indels. PLoS ONE. 2012;7:e46688. doi: 10.1371/journal.pone.0046688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Guindon S., Dufayard J.F., Lefort V., Anisimova M., Hordijk W., Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
- 56.Lefort V., Longueville J.E., Gascuel O. SMS: Smart Model Selection in PhyML. Mol. Biol. Evol. 2017;34:2422–2424. doi: 10.1093/molbev/msx149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Drummond A.J., Suchard M.A., Xie D., Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Rambaut A. FigTree v1. 4. [(accessed on 24 August 2017)]; Available online: http://tree.bio.ed.ac.uk/software/figtree/
- 59.Parker E., Anderson C., Zeller M., Havens J., Tibi A., Hussein L., Ali S., Saddedin B., Robles-Sikisaka R., Kurzban E., et al. The emergence of the B.1.1.7 lineage in Jordan. [(accessed on 17 March 2021)]; Available online: https://virological.org/t/the-emergence-of-the-b-1-1-7-lineage-in-jordan/634.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data authors can be contacted directly via GISAID website: https://www.gisaid.org/. The da-tasets analysed during the current study (ML analyses files, xml files without sequences, Tracer log files) are available from the corresponding author (M.S.) on a reasonable request and considering the terms of use by GISAID.