Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 May 15;128:104441. doi: 10.1016/j.jcv.2020.104441

Investigating the genomic landscape of novel coronavirus (2019-nCoV) to identify non-synonymous mutations for use in diagnosis and drug design

Manish Tiwari a,1, Divya Mishra b,*,1
PMCID: PMC7227581  PMID: 32425659

Abstract

Novel coronavirus has wrecked medical and health care facilities claiming ∼5% death tollsglobally. All efforts to contain the pathogenesis either using inhibitory drugs or vaccines largelyremained futile due to a lack of better understanding of the genomic feature of this virus. In thepresent study, we compared the 2019-nCoV with other coronaviruses, which indicated that batSARS like coronavirus could be a probable ancestor of the novel coronavirus. The proteinsequence similarity of pangolin-hCoV and bat-hCoV with human coronavirus was higher ascompared to their nucleotide similarity denoting the occurrence of more synonymous mutationsin the genome. Phylogenetic and alignment analysis of 591 novel coronaviruses of differentclades from Group I to Group V revealed several mutations and concomitant amino acidchanges. Detailed investigation on nucleotide substitution unfolded 100 substitutions in thecoding region of which 43 were synonymous and 57 were of non-synonymous type. The nonsynonymous substitutions resulting into 57 amino acid changes were found to be distributed overdifferent hCoV proteins with maximum on spike protein. An important di-amino acid change RGto KR was observed in ORF9 protein. Additionally, several interesting features of the novelcoronavirus genome have been highlighted in respect to various other human infecting viruseswhich may explain extreme pathogenicity, infectivity and simultaneously the reason behindfailure of the antiviral therapies.

Summary

This study presents a comprehensive phylogenetic analysis of SARS-CoV2 isolates to understand discrete mutations that are occurring between patient samples. The analysis unravel various amino acid mutations in the viral proteins which may provide an explanation for varying treatment efficacies of different inhibitory drugs and a future direction towards a combinatorial treatment therapies based on the kind of mutation in the viral genome.

Keywords: Bat-hCoV, Coronavirus, Pangolin-hCoV, Phylogeny, SARS, Synonymous and non-synonymous substitutions

1. Introduction

Current and last two decades saw emergence of zoonotic coronavirus (CoV) crossing the species barrier ultimately infecting human species resulting in pandemics such as severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS) [1,2]. An apocalyptic threat is posed by a sempiternal pathogen ruining the health and the economies on global scale. A severe pneumonia outbreak starting in December 2019 in the Wuhan city, Hubei Province, China was caused by novel CoV referred as “2019 novel coronavirus” or “2019-nCoV” [3,4]. CoVs are RNA viruses with wide host pathogenesis in mammals including humans, pangolins and birds. Genetically the CoVs were categorised into four major genera: Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus [5]. The alpha and beta CoVs infect mammals whereas the gamma and delta CoVs infect birds [6]. Primary symptoms associated with CoV infection include respiratory, hepatic, enteric and neurological diseases. Previous investigation showed that there are 6 type of CoVs (hCoV-NL63, hCoV-229E, hCoV-OC43, hCoV-HKU1, SARS-CoV, and MERS-CoV) which can infect the human species. HCoV-NL63, hCoV-229E belongs to alphaCoV genus while rest belongs to betaCoV genus. [6]. The betaCoVs appears to be prima-facie genre of CoVs which will peril universal human civilization in upcoming decades. Recently, the 2019-nCoV outbreak spread from China to the intercontinental arena and already infected 0.3 million people globally claiming ∼13,000 (∼4.3 %) deaths till 21st March 2020 (https://www.worldometers.info/coronavirus/#countries). China and Italy were the epicentres until now and chances for more calamitous centres cannot be ruled out in near future. Genome sequence analysis of SARS, MERS and 2019-nCoV confirmed its presence in betaCoVs family and divergence from the other two viruses [4]. The 2019-nCoV is a positive-strand RNA viruses with ∼29 Kb genome size, 125 nm in diameter and 6 to 11 open reading frames (ORFs) [7]. Viral genome encodes for 4 major structural proteins namely envelope (E), spike (S), membrane (M) and 3–5 nucleocapsid (N) proteins. The genome starts with short untranslated regions (5′ UTR) followed by genes 5′-replicase (rep gene), S, E, M, N and 3′ UTR [7]. Two-third of the genome is represented by the rep gene at 5′ end which encodes for non-structural protein (Nsp). Spike protein is responsible for receptor binding and corresponding viral entry into the host and hence important target for future drugs to restrict the viral titre [8,9]. Viral assembly relies primarily on M and E proteins and RNA synthesis is achieved by action of N protein [7].

To mitigate the severity of 2019-nCoV, researchers around the world are trying to develop antibodies and vaccine against this deadly virus. The problem with the delay in antiviral medication is superficial understanding of the virus. A dire need is to unravel the mutations in the viral genome and concomitant amino acid changes occurring presumably due to varying geographical location or upon interaction with the diverse human immune system. Various reports compared the SARS, MERS, bat and pangolin coronaviruses and paved way for significant findings, still leaving a lacunae in terms of the variations in the hCoV genomes and comparison with the previous available viruses resources. The present study deals with the mutations in the hCoV genomes and resulting change in amino acids.

2. Material and methods

To analyse the phylogenetic relationship between different coronaviruses, 591 genomes were downloaded from Global Initiative on Sharing All Influenza Database (GISAID) (https://www.gisaid.org/). The hCoV is an RNA virus and the deposited sequences are in DNA format. To prevent anomaly in the data represented, complete genomes and only high coverage datasets were utilized. The genomic sequences were aligned using MUSCLE program (v3.8.31) [10]. The alignments were utilized to deduce various nucleotide substitutions and maximum likelihood phylogenetic tree with 1000 bootstrap was constructed by RAxML program [11]. The alignment and tree were visualized using Jalview 2.11.0 [12] and iTOL respectively [13]. Different substitutions and resulting amino acid changes were analyzed between human, bat, pangolin and SARS coronavirus genomes. To deduce a mutation or amino acid change only those confirmed in three individual genomes were considered (replicates for biological significance).

3. Results and discussion

3.1. Comparative genomic analyses of human novel coronavirus with other coronaviruses

Genomic features may provide an important clue about the relatedness and evolution of the organism. In order to get an insight into the similitude and dissimilitude between human novel coronavirus (hCoV) and other coronaviruses, the genome sequence of human novel coronavirus (hCoV) were compared with bat coronavirus (GU190215.1) [14], severe acute respiratory syndrome-related coronavirus strain BtKY72 (KY352407.1) [15], bat SARS-like coronavirus isolate bat-SL-CoVZC45 (MG772933.1) [16], hCoV-19/pangolin/Guangdong/1/2019|EPI ISL 410721 (pangolin-hCoV) and hCoV-19/bat/Yunnan/RaTG13/2013|EPI ISL (bat-hCoV) which revealed approximately 81 %, 81 %, 89 %, 90 % and 96 % similarity, respectively (Table 1 ).

Table 1.

The percentage similarity of genomes of bat coronavirus, bat SARS like coronavirus, SARS coronavirus, pangolin-hCoV and bat-hCoV compared to 2019-hCoV genome.

Coronavirus Identity with hCoVs Adenine Guanine Thymine Cytosine
GU190215.1 Bat coronavirus BM48-31/BGR/2008 80.84 % 8130 6151 9304 5691
KY352407.1 Severe acute respiratory syndrome-related coronavirus strain BtKY72 80.84 % 8354 6052 9440 5428
MG772933.1 Bat SARS-like coronavirus isolate bat-SL-CoVZC45 89.12 % 8740 6020 9468 5574
hCoV19/pangolin/Guangdong/1/2019|EPI ISL 410721 90 % 8956 5862 9465 5542
hCoV19/bat/Yunnan/RaTG13/2013|EPI ISL 402131 96 % 8912 5857 9587 5489

To further assess the relationship between hCoV and other coronaviruses, alignment and phylogenetic analysis was carried out. Alignment of hCoV with above mentioned viruses showed that several nucleotide sites were unique in hCoV sequences when compared to other coronaviruses (Table S1). Among these sites, C:T (hCoV :other coronavirus) is the most prevalent substitution followed by T:C, G:A and A:G (Table S1). Many regions were absent in bat and SARS coronavirus genome when compared to hCoV, bat-hCoV and pangolin-hCoV. Among these regions, one of largest portion is of 391 nt (28026–28417) coding for ORF8 protein in hCoV putatively involved in interspecies transmission [17]. Genomic similarities and alignment indicate that several mutation events over the time is responsible for emergence of human novel coronavirus. Further a phylogenetic analysis between these viruses displayed that hCoVs are closer to bat/SARS-like virus (MG772933.1) and distant from SARS coronavirus (KY352407.1) and bat coronavirus (GU190215.1) (Fig. 1 ). These results demonstrate that SARS coronavirus and bat coronavirus (GU190215.1) could be apparent ancestor of other coronaviruses studied in the investigation.

Fig. 1.

Fig. 1

Phylogenetic relationship of hCoVs with other coronavirus. Phylogenetic analysis of bat coronavirus (GU190215.1), severe acute respiratory syndrome-related coronavirus strain BtKY72 (KY352407.1), bat SARS-like coronavirus isolate, bat-SL-CoVZC45 (MG772933.1), pangolin-hCoV, bat-hCoV and hCoV using the maximum-likelihood method (RAxML) keeping the bootstrap value 1000. Human coronavirus (hCoV, pangolin-hCoV, bat-hCoV) and bat SARS-like coronavirus falls in one clade while Severe acute respiratory syndrome-related coronavirus strain BtKY72 (KY352407.1) and bat coronavirus (GU190215.1) in another clade.

Scrutiny of nucleotide and amino acid in coding region of the genome revealed that the hCoV genome share 92.67 % and 96.92 % similarity at nucleotide level with pangolins and bat hCoV genome, whereas the similarity level increased up to 97.82 % and 98.67 % at amino acid level (Table 2 ). This indicates that most substitutions taking place were of synonymous type. Among various protein coding genes Nsp4-10, Nsp12-14, Nsp16, M, E and ORF6 shared highly conserve amino acid composition between bat-hCoV and hCoV with >99 % similarity, especially Nsp7-10, Nsp16, E and ORF6 share 100 % amino acid similarity (Table 2). The 100 % similarity in these regions across 591 hCoVs, bat and pangolin-hCoV mark them to be a probable target region for future antibodies and vaccine therapy. Notably, Nsp2 and Nsp14 region in hCoVs were most diversified in terms of nucleotide when compared to pangolin and bat-hCoV, whereas ORF10 and E regions were the least diverse (Table 2).

Table 2.

The comparison of nucleotide and amino acid similarity of pangolin-hCoV and bat-hCoV with hCoV.

Gene(s) Nucleotide
Amino acid
Pangolin hCoV/hCoV Bat hCoV/hCoV Pangolin hCoV/hCoV Bat hCoV/hCoV
Nsp1 91.68 96.29 95 96.11
Nsp2 88.47 95.4 93.89 98.28
Nsp3 88.7 95.61 92.4 96.66
Nsp4 90.13 95.87 97.6 99.6
Nsp5 93.98 95.86 99.35 99.67
Nsp6 92.87 95.98 98.97 99.66
Nsp7 91.97 97.59 100 100
Nsp8 93.43 98.15 100 100
Nsp9 88.86 98.82 100 100
Nsp10 95.92 98.32 100 100
Nsp12 91.38 97.82 99.46 99.57
Nsp13 92.18 98.06 99.83 99.83
Nsp14 88.56 97.22 98.86 99.24
Nsp15 92 96.24 97.69 97.69
Nsp16 92.51 95.64 100 100
ORF3A 93.24 96.26 97.1 97.83
ORF4/E 99.12 99.56 100 100
ORF5/M 93.62 95.37 98.65 99.55
ORF6 95.7 98.38 96.72 100
ORF7A 93.44 95.63 97.54 97.54
ORF7B 92.42 99.24 95.35 97.67
ORF8 92.08 96.72 94.26 94.26
ORF9/N 96.11 96.83 97.62 98.81
ORF10 99.15 99.15 97.37 97.37
SPIKE 89.17 92.89 highly variable 97.41
average 92.6676 96.916 97.81916667 98.67

3.2. Phylogenetic analyses of human novel coronavirus

We investigated the phylogenetic analysis of 591 genomic sequences of hCoV obtained from GISAID database using RAxML methods. The phylogram was majorly divided into 5 groups based on their clade division. Bat and pangolin-hCoV were categorized in the group I and all other hCoVs were categorized in group II to V (Fig. 2 ). Group II comprises of the human 2019-nCoV mainly belonging to different province of China. However, few exceptions were also from South Korea, Japan, Vietnam, Chile, USA, India, Belgium, Spain, Germany and France hCoVs. Group III mainly comprised of the hCoVs belonging to USA while Group IV represented the hCoVs of mixed type population belonging to several countries distributed over continents. Group V possess the hCoV from European countries along with few hCoVs of America and one from Taiwan. To understand mystery underlying the clustering pattern of the hCoVs, bat and pangolin-hCoV were used as a reference sequence to observe the nucleotide substitution in hCoV members in different groups. Interestingly, hCoV members (Group II and Group III) falling in proximity to Group I have less substitution in the genome sequences (Table S2). The T:C (GroupV-hCoV:bat-hCoV and GroupV-hCoV:pangolin-hCoV) substitution were frequent in Group V as compared to hCoV representing other groups (Table S2). The genomic signature of USA-hCoVs present in Group V is very different from USA-hCoVs of Group III. This could be indicative of differences between direct and community transmission of the virus. Member belonging to each subgroup has distinct genomic features in terms of nucleotide substitution (Table S2).

Fig. 2.

Fig. 2

Phylogenetic relationship among 2019-nCoV. The phylogenetic tree of 519 hCoVs sequence were divided in 5 groups. Bat and pangolin hCoVs were categorised in group I and rest of the hCoVs in group II—V. The phylogram was constructed by maximum likelihood keeping the bootstrap value 1000.

3.3. Non-synonymous substitutions and associated amino acid changes

Genomic comparison of 591 hCoV sequences among the human as well as with pangolin-hCoV and bat-hCoV revealed several sites possessing substitutions which clearly indicated the mutation in viral genome either according to the geographical locations or upon interaction with the human immune system. The nucleotide substitution in hCoV genomes were predominantly of transition type with ∼45 % being C:T (Fig. 3 A). A detailed investigation of the nucleotide substitution in the coding region of hCoVs genome with perspective of encoded amino acids revealed 43 synonymous and 57 non-synonymous substitutions (Table S3). The proteins Nsp1, Nsp5, Nsp7-10, Nsp14-16, ORF4, ORF7a, ORF7b and ORF10 mainly possessed synonymous substitutions and hence were mostly devoid of amino acid changes (Fig. 3B).

Fig. 3.

Fig. 3

Types of substitution. (A) The histogram representing types of substitution. The y-axis denotes the substitution frequency and x-axis denotes the type of substitution. (B) The histogram representing the synonymous and non-synonymous types in various genes. The y-axis denotes the substitution frequency and x-axis denotes the region of mutation of hCoV sequence.

The 57 amino acid changes were distributed over 12 regions in the ∼30 kb genome. The number of amino acids substitutions varied between different regions such as 7 in Nsp2, 10 in Nsp3, 5 in Nsp4, 3 in Nsp6, 1 in Nsp12, 4 in Nsp13, 11 in Spike, 3 in ORF3a, 2 in ORF5, 1 in ORF6, 2 in ORF8 and 8 in ORF9 (Fig. 4 ). Intriguingly, various important non-synonymous mutations were observed majorly in European and US continent while the mutations were mostly synonymous in Asian continent. These interesting observations can be used to infer the reason behind larger infectivity and pathogenicity in these regions (Table S3). Further two type of amino acid change viz., conservative and radial replacements were intensively studied with respect to previous reports stating the effect of such changes on the enzymatic activities. Mutations were most prevalent in the spike region followed by Nsp2, Nsp3 and ORF9 (N) (Table S3). Spike region determines the specific binding to host receptor and initiation of viral replication. This region is reported to be the most potent and indispensable for viral attachment and entry into host system. The RRAR amino acids found only in the human CoVs spike region has proved to be essential for binding to host receptor [18]. We observed similar region in the hCoV genomes studied (23713–23724 region in nucleotide alignment), although there was mutation in two hCoV-England nucleotide sequences (CTCCGCGGCGGG in place of CTCCTCGGCGGG) but the resulting amino acid remained same in all hCoV genomes. These findings corroborate the essentiality of RRAR sequence for viral infection to host system. We found different type of mutation in hCoV spike protein at different places such as leucine to valine (L8V), glutamine to histidine (Q675H and also found in ORF3a:Q57 H), glutamine to lysine (Q239 K) and aspartate to glycine (D614 G and also found in ORF5, D3G) might have potential role to augment viral infection (Table S3). Previous investigations showed mutations such as leucine to valine change in retroviral envelope protein, glutamine to lysine in influenza virus, glutamine to histidine and aspartate to glycine in H1N1 had a severe impact in virus entry, replication and cross infectivity to other species [[19], [20], [21]].

Fig. 4.

Fig. 4

The genomic landscape of the hCoV genome representing amino acid changes. The non-synonymous mutations resulting into 57 amino acid changes in respective proteins are marked. The numerals on the arrow represent the position corresponding to amino acid change.

Additionally, mutations were present in structural proteins such as, glycine to valine mutation in ORF3a (G196 V and G251 V). Similar amino acid change imparts resistance against inhibitor drug saquinavir in the human immunodeficiency virus type 1 (HIV-1). This might provide an explanation why drugs used for treating HIV became a failure in case of hCoV infection [22]. Notably, in ORF9 region the nucleotide sequence GGG changed to AAC in European and American continent resulting in a change of amino acid from RG to KR (AGGGGA coding for RG changed to AAACGA coding for KR, 28993–28995 in nucleotide alignment).

Furthermore, several amino acid changes were also observed in the non-structural proteins (Nsp) of the hCoVs which may affect the virulence and titer. Threonine to isoleucine substitution was observed in different Nsp proteins (Nsp2:T85I, Nsp3:T127I and T1030I and Nsp4:T295I) mainly in European and US samples. Earlier reports established that threonine to isoleucine substitution increased viral infectivity of Ebola virus and resistance to ganciclovir in human cytomegalovirus [23,24]. Importantly, alanine to valine substitution in non-structural protein, NS2A in Zika virus affects viral RNA synthesis and results in vivo viral attenuation [25]. This mutated virus also induce a comprehensive protection against lethal challenge proposed by the wild type Zika virus. Falling in similar lines alanine to valine substitutions in non-structural proteins (Nsp3, A1187 V, Nsp4, A457 V and Nsp6, A46 V) could reduce viral lethality of hCoVs (Table S3). These mutations might pave way towards identification of less lethal strains and help to raise immunity to counteract the noxious strains. An isoleucine to valine mutation (Nsp2, I559 V and Nsp3, I797 V) and methionine to isoleucine (Nsp4, M33I) were observed in hCoVs. Change of isoleucine to valine in polymerase subunit PB2 of influenza virus resulted in critically enhanced activity of reconstituted polymerase complex [26] and M to I substitution in HIV-1 reverse transcriptase imparted resistance to nucleoside analog 2′,3′-dideoxy-3′thiacytidine (3TC) [27]. Interestingly, presence of a non-synonymous substitution in RNA Dependent RNA Polymerase (RDRP) region in majority of European hCoV samples resulted in change of amino acid from proline to leucine (P314 L). It will be quite interesting to validate the effect of this substitution on RDRP activity as one of the previous study established that similar change of proline-to-leucine substitution (P236 L) of HIV-1 reverse transcriptase, imparts resistance against a highly specific inhibitor bisheteroarylpiperazines (BHAPs) [28]. These examples clearly show that amino acid changes may significantly affect the functional competency of polymerase and the associated subunits.

In conclusion, present study enlightens about several types of mutation such as deletion, insertion and substitutions present in 2019-nCoV samples. These mutations may vary at different geographical distribution or interaction with different host systems. Few mutations also resulted in change of amino acid which may provide an explanation for failure of previously employed antiviral therapies. This research will better equip the researchers to utilize the mutated amino acid information for drug targets in particular geography and less cases of failure. Beside the substitution resulting into transformation to a more virulent strain there are number of highly conserved regions in the hCoV genome which can be used as target for inhibitory drugs and vaccine development for a large repertoire of strains. Finally, we believe that our data provide useful information pertaining the changes in genomic and proteomic features which could serve as a guide to design the future antiviral therapies and diagnostics.

Author contributions

M.T. performed the computational analysis, D.M. prepared all the figures and tables. M.T. and D.M. designed the project and wrote the article.

Declaration of competing Interest

The Authors declare no conflict of interest.

Acknowledgements

We kindly acknowledge National Institute of Plant Genome Research (NIPGR) and Department of Biotechnology, Govt. of India (http://www.dbtindia.nic.in).

Footnotes

Appendix A

Supplementary material related to this article can be found, in the online version, at doi:https://doi.org/10.1016/j.jcv.2020.104441.

Appendix A. Supplementary data

The following are Supplementary data to this article:

mmc1.xlsx (14.5KB, xlsx)
mmc2.xlsx (20.3KB, xlsx)
mmc3.xlsx (17.3KB, xlsx)

References

  • 1.Drosten C., Günther S., Preiser W., van der Werf S., Brodt H.-R., Becker S., Rabenau H., Panning M., Kolesnikova L., Fouchier R.A.M. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 2003;348:1967–1976. doi: 10.1056/NEJMoa030747. [DOI] [PubMed] [Google Scholar]
  • 2.Zaki A.M., van Boheemen S., Bestebroer T.M., Osterhaus A.D.M.E., Fouchier R.A.M. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 2012;367:1814–1820. doi: 10.1056/NEJMoa1211721. [DOI] [PubMed] [Google Scholar]
  • 3.Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., Zhang L., Fan G., Xu J., Gu X. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhu N., Zhang D., Wang W., Li X., Yang B., Song J., Zhao X., Huang B., Shi W., Lu R. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Li F. Structure, function, and evolution of coronavirus spike proteins. Annu. Rev. Virol. 2016;3:237–261. doi: 10.1146/annurev-virology-110615-042301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tang Q., Song Y., Shi M., Cheng Y., Zhang W., Xia X.-Q. Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition. Sci. Rep. 2015;5:17155. doi: 10.1038/srep17155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Song Z., Xu Y., Bao L., Zhang L., Yu P., Qu Y., Zhu H., Zhao W., Han Y., Qin C. From SARS to MERS, thrusting coronaviruses into the spotlight. Viruses. 2019;11 doi: 10.3390/v11010059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Du L., He Y., Zhou Y., Liu S., Zheng B.-J., Jiang S. The spike protein of SARS-CoV — a target for vaccine and therapeutic development. Nat. Rev. Microbiol. 2009;7:226–236. doi: 10.1038/nrmicro2090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Du L., Yang Y., Zhou Y., Lu L., Li F., Jiang S. MERS-CoV spike protein: a key target for antivirals. Expert Opin. Ther. Targets. 2017;21:131–143. doi: 10.1080/14728222.2017.1271415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Edgar R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Waterhouse A.M., Procter J.B., Martin D.M.A., Clamp M., Barton G.J. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–1191. doi: 10.1093/bioinformatics/btp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Letunic I., Peer B. Interactive tree of life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007;23:127–128. doi: 10.1093/bioinformatics/btl529. [DOI] [PubMed] [Google Scholar]
  • 14.Drexler J.F., Gloza-Rausch F., Glende J., Corman V.M., Muth D., Goettsche M., Seebens A., Niedrig M., Pfefferle S., Yordanov S. Genomic characterization of severe acute respiratory syndrome-related coronavirus in European bats and classification of coronaviruses based on partial RNA-dependent RNA polymerase gene sequences. J. Virol. 2010;84:11336–11349. doi: 10.1128/JVI.00650-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Tao Y., Tong S. Complete genome sequence of a severe acute respiratory syndrome-related coronavirus from Kenyan bats. Microbiol. Resour. Announ. 2019;8 doi: 10.1128/MRA.00548-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hu D., Zhu C., Ai L., He T., Wang Y., Ye F., Yang L., Ding C., Zhu X., Lv R. Genomic characterization and infectivity of a novel SARS-like coronavirus in Chinese bats. Emerg. Microbes Infect. 2018;7:154. doi: 10.1038/s41426-018-0155-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lau S.K.P., Feng Y., Chen H., Luk H.K.H., Yang W.-H., Li K.S.M., Zhang Y.-Z., Huang Y., Song Z.-Z., Chow W.-N. Severe acute respiratory syndrome (SARS) coronavirus ORF8 protein is acquired from SARS-related coronavirus from greater horseshoe bats through recombination. J. Virol. 2015;89 doi: 10.1128/JVI.01048-15. 10532 LP – 10547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Walls A.C., Park Y.-J., Tortorici M.A., Wall A., McGuire A.T., Veesler D. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020 doi: 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Côté M., Zheng Y.-M., Li K., Xiang S.-H., Albritton L.M., Liu S.-L. Critical role of leucine-valine change in distinct low pH requirements for membrane fusion between two related retrovirus envelopes. J. Biol. Chem. 2012;287:7640–7651. doi: 10.1074/jbc.M111.334722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Glinsky G.V. Genomic analysis of pandemic (H1N1) 2009 reveals association of increasing disease severity with emergence of novel hemagglutinin mutations. Cell Cycle. 2010;9:958–970. doi: 10.4161/cc.9.5.10913. [DOI] [PubMed] [Google Scholar]
  • 21.Yamada S., Hatta M., Staker B.L., Watanabe S., Imai M., Shinya K., Sakai-Tagawa Y., Ito M., Ozawa M., Watanabe T. Biological and structural characterization of a host-adapting amino acid in Influenza virus. PLoS Pathog. 2010;6 doi: 10.1371/journal.ppat.1001034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Hong L., Zhang X.-J., Foundling S., Hartsuck J.A., Tang J. Structure of a G48H mutant of HIV-1 protease explains how glycine-48 replacements produce mutants resistant to inhibitor drugs. FEBS Lett. 1997;420:11–16. doi: 10.1016/s0014-5793(97)01477-4. [DOI] [PubMed] [Google Scholar]
  • 23.Kurosaki Y., Ueda M.T., Nakano Y., Yasuda J., Koyanagi Y., Sato K., Nakagawa S. Different effects of two mutations on the infectivity of Ebola virus glycoprotein in nine mammalian species. J. Gen. Virol. 2018;99:181–186. doi: 10.1099/jgv.0.000999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wolf D.G., Smith I.L., Lee D.J., Freeman W.R., Flores-Aguilar M., Spector S.A. Mutations in human cytomegalovirus UL97 gene confer clinical resistance to ganciclovir and can be detected directly in patient plasma. J. Clin. Invest. 1995;95:257–263. doi: 10.1172/JCI117648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Marquez-Jurado S., Nogales A., Avila-Perez G., Iborra F.J., Martinez-Sobrido L., Almazan F. An alanine-to-valine substitution in the residue 175 of Zika virus NS2A protein affects viral RNA synthesis and attenuates the virus in vivo. Viruses. 2018;10 doi: 10.3390/v10100547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rolling T., Koerner I., Zimmermann P., Holz K., Haller O., Staeheli P., Kochs G. Adaptive mutations resulting in enhanced polymerase activity contribute to high virulence of influenza A virus in mice. J. Virol. 2009;83:6673–6680. doi: 10.1128/JVI.00212-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Julias J.G., Boyer P.L., McWilliams M.J., Alvord W.G., Hughes S.H. Mutations at position 184 of human immunodeficiency virus type-1 reverse transcriptase affect virus titer and viral DNA synthesis. Virology. 2004;322:13–21. doi: 10.1016/j.virol.2004.01.004. [DOI] [PubMed] [Google Scholar]
  • 28.Fan N., Evans D.B., Rank K.B., Thomas R.C., Tarpley W.G., Sharma S.K. Mechanism of resistance to U-90152S and sensitization to L-697,661 by a proline to leucine change at residue 236 of human immunodeficiency virus type 1 (HIV-1) reverse transcriptase. FEBS Lett. 1995;359:233–238. doi: 10.1016/0014-5793(95)00051-a. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.xlsx (14.5KB, xlsx)
mmc2.xlsx (20.3KB, xlsx)
mmc3.xlsx (17.3KB, xlsx)

Articles from Journal of Clinical Virology are provided here courtesy of Elsevier

RESOURCES