Abstract
The newly assigned subfamily Avulavirinae in the family Paramyxoviridae includes avian paramyxoviruses (APMVs) isolated from a wide variety of avian species across the globe. Till date, 21 species of APMVs are reported and their complete genome sequences are available in GenBank. The APMV genome comprises of a single stranded, negative sense, non-segmented RNA comprising six transcriptional units (except APMV-6 with seven units) each coding for a structural protein. Additionally, by co-transcriptional RNA editing of phosphoprotein (P) gene, two mRNAs coding for accessory viral proteins, V and W, are generated along with unedited P mRNA. However, in APMV-11, the unedited mRNA codes for V protein while +2 edited mRNA translates to P protein, similar to members of subfamily Rubulavirinae in the same family. Such RNA editing in paramyxoviruses enables maximizing the coding capacity of their smaller genome. The three proteins of P gene: P, V and W, share identical N terminal but varied C terminal sequences that contribute to their unique functions. Here, we analyzed the P gene editing site, V and W sequences of all 21 APMV species known so far (55 viruses) by using bioinformatics and report their genetic variations and molecular evolution. The variations observed in the sequence and hexamer phase positions of the P gene editing sites is likely to influence the levels and relative proportions of P, V and W proteins’ expressions which could explain the differences in the pathogenicity of APMVs. The V protein sequences of APMVs had conserved motifs similar to V proteins of other paramyxoviruses including the seven cysteine residues involved in MDA5 interference, STAT1 degradation and interferon antagonism. Conversely, W protein sequences of APMVs were distinct. High sequence homology was observed in both V and W proteins between strains of the same species than between species except in APMV-3 which was the most divergent APMV species. The estimates of synonymous and non-synonymous substitution rates suggested negative selection pressure on the V and W proteins within species indicating their low evolution rate. The molecular clock analysis revealed higher conservation of V protein sequence compared to W protein indicating the important role played by V protein in viral replication, pathogenesis and immune evasion. However, we speculate the genetic diversity of W proteins could impact the degree of pathogenesis, variable interferon antagonistic activity and the wide host range exhibited by APMV species. Phylogenetically, V proteins of APMVs clustered into three groups similar to the recent classification of APMVs into three new genera while no such pattern could be deciphered in the analysis of W proteins except that strains of same species grouped together. This is the first comprehensive study describing in detail the genetic variations and the molecular evolution of P gene edited, accessory viral proteins of Avian paramyxoviruses.
Subject terms: Phylogenetics, Evolution, Molecular evolution, Virology, Viral evolution
Introduction
Avian paramyxoviruses (APMVs) are a group of paramyxoviruses known to infect a variety of bird species across the globe. Till date, 21 species of APMVs have been identified and the list is expected to grow with increase in viral surveillance in wild and domesticated birds. With recent ICTV 2019 classification, these viruses now belong to three genera, Metaavulavirus (APMV-2, -5, -6, -7, -8, -10, -11, -14, -15 and -20), Orthoavulavirus (APMV-1, -9, -12, -13, -16, APV-A, APV-B and APV-C) and Paraavulavirus (APMV-3 and -4) within the new subfamily Avulavirinae under family Paramyxoviridae in the order Mononegavirales1. APMV-1 to -9 were isolated before 1980, APMV-10 to -13 were identified by 2015 and all the other APMVs were reported in the recent years2.
Avian Paramyxoviruses are enveloped with a single stranded, non-segmented, negative sense RNA genome of size 13 to 17 kb2. The prototype virus, APMV-1 of genus Orthoavulavirus, also well known as Newcastle disease virus or NDV, is the most extensively studied virus in this group. NDV causes severe economically important disease in poultry. There are five pathotypes of NDV based on the clinical signs exhibited by infected chickens: (a) viscerotropic velogenic or highly virulent, pantropic NDV causing severe mortality (b) neurotropic velogenic or highly virulent NDV, specifically causing neurological illnesses and high mortality (c) mesogenic or moderately virulent NDV, with mortalities as high as 50% and reducing egg production (d) lentogenic–either respiratory or enteric type NDV, low virulence and causing low reduction in egg production and (e) asymptomatic or avirulent NDV. However, the pathotype classification is not always clear-cut3. There are more than thousand strains of NDV which have been isolated, sequenced and found to exhibit wide spectrum of virulence. The viral RNA genome encodes for six genes arranged in tandem, each coding for six structural proteins, 3’-N-P-M-F-HN-L-5’4,5. N is nucleocapsid protein, each N protomer is known to bind exactly 6 nucleotides of genomic and antigenomic RNA of most paramyxoviruses thus imposing a hexamer phase on the entire RNA genome. In nature, the genomic length of paramyxoviruses is polyhexameric (6n + 0) which is found to be necessary for efficient replication and this is called the ‘rule of six’6,7. N together with P, phosphoprotein and L, large polymerase protein, forms viral RNA dependent RNA polymerase complex essential for viral genome transcription and replication; M, matrix protein, is seen within the envelope, aids in virus assembly and budding; two viral glycoproteins, F, fusion protein and HN, hemagglutinin-neuraminidase protein, are studded on the envelope and assist with fusion of virus with host membrane and receptor binding, respectively4. NDV F protein is known as the virulence determinant; the virulent strains have unique multiple basic amino acids, at least three arginine (R) or lysine (K) residues, at fusion protein cleavage site starting at amino acid position 113, and a phenylalanine residue at position 1173. APMV-6 is also known to express an additional small hydrophobic (SH) protein from SH gene located between F and HN genes8. Further, by co-transcriptional RNA editing of P gene, two mRNAs, V and W are expressed4,9–12. Also, in certain paramyxoviruses, by a process of alternative transcription initiation in P gene (+1 reading frame), accessory C proteins are generated13,14. Thus by these mechanisms, paramyxoviruses are able to efficiently utilize over 95% of their small RNA genome for expression of viral proteins12.
The P gene carries a slippery sequence, a stretch of adenosine (A) nucleotides and guanosine (G) nucleotides called the ‘editing site’ where insertions of 1 G or 2 G nucleotides occur during transcription of P gene by the stuttering viral polymerase that reiteratively reads the template base12,15. A single G nucleotide addition leads to +1 frameshift in the ORF, generating V mRNA with a frequency of 25 to 35% and two G nucleotides addition leads to +2 frameshift in the ORF generating W mRNA with a frequency of 2 to 8.5% and the unedited mRNA (60-70%) codes for P protein in NDV12,16. Among the paramyxoviruses, members of the subfamily Rubulavirinae and APMV-11 of genus Metaavulavirus, encode V protein from their unedited transcript, while P protein is coded by +2 frameshift and W protein is expressed by +1 frameshift17,18. The resulting three mRNAs from P gene (P, V and W mRNAs) share common N terminal sequences and differ both in length and amino acid composition in their C terminal region. Their specific functions are dictated by their unique C terminal sequences. Studies on V protein of APMV-1 and other paramyxoviruses have revealed that V protein is multifunctional, targets STAT1 degradation, interferes with MDA5, is interferon antagonist19–22, inhibits apoptosis23,24, assist in viral replication20,25 and plays important roles in tissue tropism, virulence determination11,20,26 and host range restriction21. On the other hand, there is very limited information about the function of W protein. The W protein of Nipah virus has been shown to impact viral pathogenesis and support the virus to evade the host immunity27–29. In APMV-1, the nuclear localization of W protein and its incorporation into the virion has been recently reported16,30.
The complete genome sequences of all 21 species of APMV have been described individually17,31–62. A comprehensive comparative analysis of complete genome and structural genes of 20 species of Avulaviruses has been recently published2. Three clades of viruses were concluded based on phylogenomic analysis of 20 APMVs: Clade I included APMV-2, -5, -6, -7, -8, -10, -11, -14, -15 and -20 (currently classified under new genus Metaavulavirus), clade II comprised of APMV -1, -9, -12, -13, -16, APV-A, -B and -C (currently assigned under genus Orthoavulavirus) and clade III included APMV-3 and -4 (currently under new genus Paraavulavirus)1,2. One of the viruses, previously classified as APMV-17 (South Korean “avian paramyxovirus 17”) has now been proposed as a separate species (APMV-21) based on phylogenies of complete genomes, complete F and L genes, PASC and STD analysis2,63. Nevertheless, very little is known about the P gene edited accessory viral proteins of APMVs. We have examined and analyzed 55 viruses belonging to all 21 APMV species identified till date and discuss here the genetic diversity and molecular evolution of P gene edited proteins, V and W.
Materials and methods
Sequence information
The full length sequences of P gene available for all the 21 species of APMVs were obtained from National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/). Additionally, publications reporting the complete genome sequence of these viruses were also referred for identification of P gene editing site, for prediction of sequences of V and W proteins. A total of 55 viruses belonging to 21 species of APMVs were analyzed in this study which included (their GenBank accession numbers are provided in Table 1) four strains each of avirulent APMV-1, moderately virulent APMV-1 and highly virulent APMV-1; eight isolates of APMV-6; five isolates each of APMV-2 and APMV-8; four isolates of APMV-10; three isolates of APMV-13; two isolates each of APMV-3, APMV-4, APMV-5; one isolate each of APMV-7, APMV-9, APMV-11, APMV-12, APMV-14, APMV-15, APMV-16, APMV-20, APMV-21 and APV-A, APV-B and APV-C. Detailed information of these isolates along with metadata such as hosts, year and location of isolation in addition to their sequence information are provided in Table 1. The complete sequences of P gene ORF, V proteins and W proteins used in this study that were either directly collected from NCBI or derived by prediction using DNASTAR software suite are provided in Supplementary Files S1, S2 and S3.
Table 1.
S.No | APMV species | Genbank Id | Year of Isolation | Place of isolation | Isolated from | P ORF (Nt) /P Protein length (aa) | P gene ‘RNA editing site’ sequence (numbers indicate the position of nucleotide in the genome, editing site is bolded and hexamer phase of the start of the short C run in edit site is underlined) | Subunit hexamer phasing at the P gene mRNA editing site | V ORF (Nt) /V Protein length (aa) | W ORF (Nt) /W Protein length (aa) | No. of unique amino acids in the C terminal region of W protein | |
1 | APMV -1 (NDV) | Velo-genic | KJ808820.1 | 2012 | China | Pigeon | 1188 /395 | 2283CGAU UUUUCC CCGG | 5 | 720/239 | 684/227 | 94 |
2 | FJ386394.2 | 2009 | China | Chicken | 1188 /395 | 2277CGAU UUUUCC CGGG | 5 | 720/239 | 666/221 | 88 | ||
3 | GU187941.1 | 1987 | India | Chicken | 1188/395 | 2277CGAU UUUUCC CGGG | 5 | 720/239 | 531/196 | 62 | ||
4 | JQ015296.1 | 2011 | China | Chicken | 1188/395 | 2283CGAU UUUUCC CGGG | 5 | 720/239 | 534/177 | 43 | ||
5 | Meso-genic | KX761866.1 | 2010 | China | Duck | 1188/395 | 2277CGAU UUUUCC CGGG | 5 | 720/239 | 684/227 | 94 | |
6 | KT445901.1 | 1945 | Palestine | Duck | 1188/395 | 2277CGAU UUUUCC CGGG | 5 | 720/239 | 666/221 | 88 | ||
7 | AY562986.1 | 1993 | USA | Anhinga | 1188/395 | 2283UGAU UUUUCC CGGG | 5 | 720/239 | 540/179 | 45 | ||
8 | KJ736742.1 | 1998 | Germany | Pigeon | 1188/395 | 2283CGAU UUUUCC CCGG | 5 | 720/239 | 414/137 | 3 | ||
9 | Lento-genic | KM885162.1 | 2013 | China | Duck | 1200/399 | 2277CGAU UUUUCC CGGG | 5 | 738/245 | 552/183 | 49 | |
10 | JN688862.1 | 2009 | China | Chicken | 1200/399 | 2277CGAU UUUUCC CGGG | 5 | 738/245 | 552/183 | 49 | ||
11 | JX193078.1 | 2009 | China | Duck | 1188/395 | 2277CGAU UUUUCC CGGG | 5 | 720/239 | 540/179 | 45 | ||
12 | AB524405.1 | 1991 | Alaska | Goose | 1200/399 | 2277CGAU UUUUCC CGGG | 5 | 738/245 | 552/183 | 49 | ||
13 | APMV - 2 | HM159993.1 | 2006 | England | Chicken | 1200/399 | 2089CAAUUU UUCCCC UU | 3 | 699/232 | 624/207 | 67 | |
14 | HQ896023.1 | 1999 | China | Gallus gallus | 1200/399 | 2089CAAUUU UUCCCC UU | 3 | 699/232 | 624/207 | 67 | ||
15 | EU338414.1 | 1956 | USA | Chicken | 1200/399 | 2089CAAUUU UUCCCC UU | 3 | 699/232 | 624/207 | 67 | ||
16 | HM159994.1 | 1980 | Kenya | Gadwell | 1200/399 | 2089CAAUUU UUCCCC UU | 3 | 699/232 | 624/207 | 67 | ||
17 | HQ896024.1 | 2001 | China | Gallus gallus | 1200/399 | 2089CAAUUU UUCCCC UU | 3 | 699/232 | 624/207 | 67 | ||
18 | APMV - 3 | EU403085.1 | 1975 | Netherland | PKT | 1161/386 | 2078AAAAA UUUCCC CCG | 4 | 747/248 | 378/125 | 14 | |
19 | EU782025.1 | 1968 | USA | Turkey | 1173/390 | 2083AAAAAU UUCCCC CG | 3 | 759/252 | 384/127 | 12 | ||
20 | APMV - 4 | FJ177514.1 | 1975 | China | Duck | 1182/393 | 2052G AAAAUU UCCCCC C | 2 | 675/224 | 414/137 | 2 | |
21 | JX133079.1 | 2010 | South Africa | Alopochen aegyptiacus | 1182/393 | 2052G AAAAUU UCCCCC C | 2 | 675/224 | 414/137 | 2 | ||
22 | APMV - 5 | GU206351.1 | 1974 | Japan | Budgerigar | 1341/446 | 2359AAUUUU UUCCCG GG | 3 | 834/277 | 564/187 | 5 | |
23 | LC168750.1 | 1975 | Japan | Melopsitt-acus undulatus | 1341/446 | 2357GU AAUUUU UUCCGG G | 3 | 834/277 | 564/187 | 5 | ||
24 | APMV - 6 | JX522537.1 | 2011 | China | Anas poecilorhyn-cha | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 825/274 | 489/162 | 4 | |
25 | KP762799.1 | 2013 | Kazakhstan | Red-crested pochard | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 825/274 | 489/162 | 5 | ||
26 | EU622637.2 | 1977 | Hong Kong | Duck | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 807/268 | 534/177 | 5 | ||
27 | AY029299.1 | 1998 | Taiwan | Duck | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 807/268 | 534/177 | 20 | ||
28 | KF267717.1 | 2011 | China | Anas poecilorhyncha | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 825/274 | 489/162 | 20 | ||
29 | KT962980.1 | 2009 | Russia | Anas crecca | 1293/430 | 2139AGUU UUUUCC CCUU | 5 | 825/274 | 594/197 | 5 | ||
30 | EF569970.1 | 2003 | Russia | Domestic goose | 1293/430 | 2146AGU UUUUUC CCCUU | 6 | 807/268 | 534/177 | 40 | ||
31 | GQ406232.1 | 2007 | Italy | Duck | 1293/430 | 2134AGU UUUUUC CCGCU | 6 | 816/271 | 474/157 | 4 | ||
32 | APMV - 7 | FJ231524.1 | 1975 | USA | Dove | 1185/394 | 2150AAUUU UUUCCC AAU | 4 | 753/250 | 378/125 | 1 | |
33 | APMV - 8 | FJ215863.2 | 1976 | USA | Canada goose | 1218/405 | 2095UAAUUU UUCCCG GG | 3 | 717/238 | 612/203 | 65 | |
34 | MF448515.1 | 2013 | Kazakhstan | Little stint | 1218/405 | 2095UAAUUU UUCCCG GG | 3 | 717/238 | 519/172 | 34 | ||
35 | JX901129.1 | 1978 | Japan | Pintail duck | 1218/405 | 2095UAAUUU UUCCCG GG | 3 | 717/238 | 519/172 | 34 | ||
36 | FJ619036.1 | 1976 | USA | Canada goose | 1218/405 | 2095UAAUUU UUCCCG GG | 3 | 717/238 | 519/172 | 34 | ||
37 | MF448514.1 | 2013 | Kazakhstan | Whooper swan | 1218/405 | 2095UAAUUU UUCCCG GG | 3 | 717/238 | 519/172 | 34 | ||
38 | APMV - 9 | EU910942.1 | 1978 | USA | Domestic duck | 1260/419 | 2319GAUU UUUUCC CGAG | 5 | 792/263 | 585/194 | 56 | |
39 | APMV - 10 | HM755886.2 | 2007 | United Kingdom | Penguin | 1218/405 | 2094A GAAUUU UUCCCG UG | 3 | 741/246 | 519/172 | 34 | |
40 | HM147142.3 | 2007 | Falkland Islands | Penguin | 1218/405 | 2094A GAAUUU UUCCCG UG | 3 | 741/246 | 519/172 | 34 | ||
41 | HM755887.2 | 2007 | United Kingdom | penguin | 1218/405 | 2094A GAAUUU UUCCCG UG | 3 | 741/246 | 519/172 | 34 | ||
42 | HM755888.2 | 2007 | United Kingdom | penguin | 1218/405 | 2094A GAAUUU UUCCCG UG | 3 | 741/246 | 519/172 | 34 | ||
43 | APMV - 11 | JQ886184.1 | 2010 | France | Common snipe | 1344 /447(2 G insertion) | 2333AA AAUUCU UCCCCC A | 2 | 834/277 (V mRNA is the unedited transcript) | 546/181(1 G insertion) | 15 | |
44 | APMV - 12 | KC333050.1 | 2005 | Italy | Wigeon | 1218/405 | 2324UGAUU UUUUCC CUAG | 5 | 762/253 | 564/187 | 31 | |
45 | APMV - 13 | KU646513.1 | 2013 | Kazakhstan | Wild goose | 1194/397 | 2318UGAUU UUUUCC CGUC | 5 | 726/241 | 453/150 | 2 | |
46 | KX119151.2 | 2011 | Ukraine | White-fronted goose | 1194/397 | 2318UGAUU UUUUCC CGUC | 5 | 726/241 | 453/150 | 2 | ||
47 | LC041132.1 | 2000 | Japan | Ansersp | 1194/397 | 2318UGAUU UUUUCC CGUC | 5 | 726/241 | 453/150 | 2 | ||
48 | APMV - 14 | KX258200.1 | 2011 | Japan | Duck | 1227/408 | 2094A AAAUUU UCCCCG GC | 2 | 759/252 | 471/156 | 12 | |
49 | APMV – 15 | KX932454.2 | 2012 | Brazil | Calidrisfuscicollis | 1248/415 | 2039AA UUUUUC CCCCCU G | 6 | 759/252 | 438/145 | 2 | |
50 | APMV - 16 | KY511044.1 | 2014 | South Korea | Wild duck | 1200/399 | 2289UGAU UUUUUC CCAGU | 6 | 738/245 | 423/140 | 5 | |
51 | APV-A (APMV – 17) | KY452442.1 | 2014 | Antarctica | Pygoscelispapua | 1209/402 | 2105AAUUUUUUCCCCAGU | Hexamer phasing position not determined as their genome lengths did not conform to ‘rule of six’ | 771/256 | 492/163 | 12 | |
52 | APV-B (APMV – 18) | KY452443.1 | 2014 | Antarctica | Pygoscelispapua | 1179/392 | 2116AAUUUUUUCCCCAGU | 711/236 | 531/176 | 35 | ||
53 | APV-C (APMV – 19) | KY452444.1 | 2014 | Antarctica | Pygoscelispapua | 1134/377 | 2153GAAUUUUUUCCCAGU | 666/221 | 393/130 | 4 | ||
54 | APMV - 20 | MF033136.1 | 2014 | Kazakhstan | Gull | 1296/431 | 2171AA AAUUUU CCCCCU C | 1 | 792/263 | 498/165 | 17 | |
55 | APMV - 21 | MF594598.1 | 2015 | South Korea | Wild birds | 1320/439 | 2270CGAUU UUUUCC CGUC | 5 | 915/304 | 432/143 | 3 |
A summary on the viruses analyzed in this study with specifications of their P, V and W protein sequences.
Nt: nucleotides, aa: amino acids.
Sequence alignment, comparison and prediction of conserved motifs/domains
Multiple sequence alignments of V and W proteins were performed using the TCOFFEE multiple alignment algorithm, mode ‘expresso’ and the sequence similarities were colored through ESPript64–66. All residues/amino acid positions mentioned in the results and discussion correspond to APMV-1 strain KJ808820.1, the strain that appears first in the alignment file. Individual sequences were also analyzed in NCBI’s interface, conserved domain (CD)-search67 and aligned sequences were run in DREME version 5.0.5 software68 to identify conserved motifs/domains. The intraclade amino acid percentage identity was estimated using Megalign software from DNASTAR.
Prediction of Nuclear Localization Signal (NLS) and Nuclear Export Signal (NES) in V and W proteins
The nuclear localization signal (NLS) in V and W proteins of APMV species were identified using online tool, cNLS mapper with a cut-off score of 5.0 that predicted NLS specific to the importin αβ pathway69. The presence of nuclear export signal (NES) in V and W proteins of APMV species was predicted using online tool, NetNES 1.1 server that predicted leucine-rich NES using a combination of neural networks and hidden Markov models70 and using LocNES that predicted the classical NESs in CRM1 cargoes71.
Phylogenetic analysis and evolutionary divergence
Phylogenetic analysis was performed using MEGA7 software. For drawing the phylogenetic trees, evolutionary history was inferred by using the Maximum Likelihood method with JTT matrix-based model72 for V proteins and Dayhoff matrix based model for W proteins73. For drawing the phylogenetic tree of V proteins, bootstrap consensus tree inferred from 500 replicates was taken to represent the evolutionary history of the taxa analyzed74. Branches corresponding to partitions, reproduced in less than 80% bootstrap replicates, were collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. Initial trees for heuristic search were obtained automatically by applying Neighbor-Joining and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then topology with superior log likelihood value was selected. A discrete gamma distribution was used for V proteins tree to model evolutionary rate differences among sites (16 categories (+G, parameter = 2.4693)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 5.67% sites). For drawing the phylogenetic tree of W proteins, a discrete gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.1488)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 0.70% sites). Analysis of both the trees involved 55 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 141 and 71 positions in the final dataset for drawing V and W proteins’ phylogenetic trees, respectively.
The estimates of evolutionary divergence over sequence pairs between groups were analyzed for V and W proteins of all 21 APMV species using MEGA775. Briefly, based on the maximum likelihood fits of 56 different amino acid substitution models, the final analyses were conducted in MEGA7 using JTT matrix-based model72 for V proteins and Dayhoff matrix based model73 for W proteins. The rate variation among sites was modeled with a gamma distribution (shape parameter = 1). The analysis included 55 amino acid sequences. All positions containing gaps and missing data were eliminated.
Selection pressure analysis
The number of nonsynonymous substitutions per nonsynonymous site (dN), the number of synonymous substitutions per synonymous site (dS), and the dN/dS ratios for the nucleotide sequences of V and W proteins of all 21 species were analyzed for the entire sequence and also their shared N terminal and unique C terminal regions. The shared portion in the N-terminus of all the three proteins was considered up to the RNA editing site (KKG motif). The C terminal regions of V and W proteins of all 21 species were considered after RNA editing site (KKG motif). The dN/dS ratio of 21 species of APMV nucleotide sequences were estimated by DnaSP v6.12.03 software76. The protein was considered under positive selection or diversifying when the dN/dS ratio is >1 and negative or purifying selection when dN/dS ratio <1.
Evolutionary rate analysis
To estimate evolutionary rates of different APMV species in V and W nucleotide sequences, the substitution rate analysis was performed by BEAST v 1.10.4 software77. The substitution model GTR and site heterogeneity model G + I was found to be the best by MEGA7 and was used here to study the substitution rate of V and W sequences. The tree prior coalescent, constant size was used for individual and all the species. The uncorrelated relaxed clock with lognormal was implemented. The MCMC chain 4 × 108 cycles was used to reach the ESS value more than 200 to converge the data except for W proteins of APMV-8 strains where MCMC chain length of 2 × 108 cycles was used. The final data analysis was performed using tracer v 1.7.1 software.
Results
RNA editing site and prediction of V and W protein sequences
Previous reports suggested identical P gene RNA editing sites for APMV-1, -2, -4, -5, -6, -7, -8, -9, -10, -12, -13, -15, -20, APV -A, -B and –C and varied P gene editing site sequences for APMV-3, -11, -14 and -16. We observed the following conserved pattern in the RNA editing sequences among APMVs: U3C6 for APMV-4; U4C4 for APMV-14; U4C5 for APMV-20; U5C3 for APMV -1 (all strains except KJ736742.1 and KJ808820.1), APMV-8 and APMV-10; U5C4 for APMV -1 (strains KJ736742.1 and KJ808820.1) and APMV-2; U5C6 for APMV-15; U6C3 for APMV -5 (strain GU206351.1), APMV -7, -9, -12, -13, -16, -19, -21; U6C2 for APMV-5 (strain LC168750.1); U6C4 for APMV -6, -17, -18 and UUCUUC5 for APMV-11. Further, variations were observed in the cis-acting sequence at the editing site: the sequences immediately upstream of the editing site were3’AA in APMV -2, -3, -4, -5, -7, -8, -10, -11, -14, -15, -17, -18, -19, -20 and3’GA in APMV -1, -9, -12, -13, -16 and -21 (all orthoavulaviruses) while3’AG was conserved among APMV-6 strains.
The hexamer phase of the start of the template C run in the P gene editing site (Table 1) revealed that this position was conserved within each species except in APMV-3 and for one strain of APMV-6 (KT962980.1). The hexamer phasing positions for APV -A, -B, -C were not determined as their genome lengths did not conform to ‘rule of six’. The start of the C run was at hexamer position 1 for APMV-20; at hexamer position 2 for APMV -4, -11 and -14; at hexamer position 3 for APMV -2, -3 (EU782025), -5, -8 and -10; at hexamer position 4 for APMV-3 (EU403085) and APMV-7; at hexamer position 5 for APMV-1, one strain of APMV-6 (KT962980.1), APMV -9, -12, -13, and -21 and at hexamer position 6 for APMV -15, -16 and all strains of APMV-6 except one strain (KT962980.1).
All APMVs except APMV-11 expressed P protein from unedited mRNA. The editing site of APMV-11 resembled that of other paramyxoviruses that insert 2G for generating P mRNA or the ‘genomic V’ viruses12. The P protein of APMV-11 is derived from 2G nucleotides insertion, W protein from single G nucleotide addition and unedited mRNA expresses the V protein17. The V and W protein sequences of the other 20 APMV species were predicted by insertion of single G and two G nucleotides at the P gene RNA editing site, respectively.
Amino acid sequence analysis: percentage identity and conservations
The V and W protein sequences of all 21 APMV species shared common N terminal region with P protein. The variations in the amino acid sequences were minimum within the first 60 amino acids in N terminal region. The N-terminal portions of P, V and W proteins of metaavulaviruses and orthoavulaviruses showed closer identity than paraavulaviruses. In ortho and paraavulaviruses, the N- and C- terminal regions showing high homology up to 93%. Metaavulaviruses showed 100% identity in all their C- and N- terminal regions. The C terminal region of W-protein sequences showed 0.0–100% identity, as both the amino acid composition and length variations at C-terminal portion for all the species were higher (Table 2). As described previously, the soyuz1 and soyuz2 motifs were observed within N terminal region in all APMVs except in APMV-3 strains78. Additionally, conserved domains (CD) were predicted in APMV-1 mesogenic strain Komarov (CD between aa 25 to 167) for large tegument protein UL36 (superfamily member PHA03247), in APMV-14 (CD between aa 31 and 144) for gene regulated by oestrogen in breast cancer- GREB1 (superfamily member, pfam15782) and in APMV-21 (CD between aa 53 and 120) for Tumor necrosis factor receptor superfamily member cd13415.
Table 2.
Genus Orthoavulavirus | Genus Metaavulavirus | Genus Paraavulavirus | |
---|---|---|---|
P complete | 31.8–99.5 | 22.6–100 | 24.6–91.9 |
V complete | 13.5–99.2 | 26.1–100 | 18.6–92.9 |
V- N terminal | 31.0–98.0 | 22.5–100 | 19.5–93.3 |
V- C terminal | 30.3–98.9 | 26.3–100 | 27.0–92.1 |
W complete | 26.3–99.5 | 19.6–100 | 14.8–92.7 |
W- N terminal | 31.0–99.2 | 22.4–100 | 16.7–93.3 |
W- C terminal | 0.0–100 | 0.0–100 | 50.0–100 |
P complete, V complete and W complete refer to the entire protein sequences. N terminal refers to N terminal region of respective proteins shared with P protein and C terminal refers to the unique C terminal region of respective proteins. The amino acid sequences of a particular protein of all virus strains, within the particular genus, were analyzed and the percentage amino acid identity was calculated using Lasergene DNASTAR MegAlign software.
Comparison of V protein sequences of APMV species
The V protein of APV-C was the shortest (221 aa, MW: 23.73 kDa) and that of APMV-21 was the longest (304 aa, MW: 31.98 kDa) among the 21 APMV species. The length of V protein (in aa) conserved within species was as follows: APMV-2 strains (232 aa), APMV-4 strains (224 aa), APMV-5 strains (277 aa), APMV-8 strains (238 aa), APMV-10 strains (246 aa) and APMV-13 strains (241 aa). However, in APMV-1, -3 and -6, variation in the length of V protein was observed between strains within the same species. Between species, the similarity in the V protein length was observed as follows: V protein length of 252 aa was observed in APMV-14, -15 and one strain of APMV-3; APMV-11 and -5 showed 277 aa long V protein; APMV -9 and -20 had 263 aa long V protein while the V protein length of APMV-16 and two lentogenic strains of APMV-1 were 245 aa. The lowest amino acid identity (10.4%) was observed between V proteins of APMV-3 strain Wisconsin and APV-A. The lowest amino acid divergence was noticed between APV-B and APV-C (70.4) and both showed identity of 53.7% at amino acid level which was the highest between APMV species (Supplementary File S4).
The multiple sequence alignment of V protein sequences of APMV species revealed higher amino acid conservations in both N (majorly in the first 60 amino acids) and C terminal regions (Fig. 1). All viruses in this study had the following conserved motifs similar to V proteins of other known paramyxoviruses (i) KKG motif in the N terminal region, which is the coding sequences at the P gene mRNA editing site (residues 132-134, corresponding to APMV-1 strain KJ808820.1, the first strain in the alignment file) except in APMV-3, APMV-4, APMV-12, APMV-13, APMV-14 and APMV-20 (ii) HRRE motif (residues 177 – 180, corresponding to APMV-1 strain KJ808820.1), (iii) WCNP motif (residues 195-198, corresponding to APMV-1 strain KJ808820.1) and (iv) conserved seven cysteine-rich domain. Interestingly, the following amino acids were also conserved in the C terminal region in majority of APMV species with few exceptions: Proline residues at five positions: (a) position 175 in ten species (except in APMV -3,-4,-6, -7, -8, -9, -10, -11, -15, -20 and -21), (b) position 198 in all APMVs, (c) position 202 (except in APMV-4), (d) position 207 in all APMVs and (e) position 218 in all APMVs; Glycine residues at three positions: (a) position 176 (except in APMV-4 and -6), (b) position 188 (except in certain strains of APMV-1 and APMV-7, -9, -11, -12, -13, -14, -15, -16, APV -A and -B), (c) position 215 (except in a single lentogenic strain, KM885162, of APMV-1, all strains of APMV-3, all strains of APMV-10 and interestingly in all these viruses, the glycine residue was replaced with arginine residue); Serine residues at two positions: (a) position 182 (except in APMV -4, -11, -20) and (b) position 194 (except in APMV -3, -4, -5, -6, -12, -14); Arginine residue at position 208 (except in APMV -3, -4, -5, -7, -8, -11, -13, -16, APV-A, -B and –C); Leucine residue at position 223 (except in APMV-5, -6, -7, -8 and -11) and Aspartic acid residue at position 227 (except in APMV-2, -3, -7, -9, -11 and -21). The percentage amino acid conservation in the C terminal region of V proteins of all 21 APMV species was between 30% (in the longest V protein, that of APMV-21) and 48% (in the shortest V protein, that of APV-C). The NLS were predicted in V proteins of APMV-5 and APMV-20 by cNLS mapper with a cut off score of 5.0. The NES were identified only in APMV-5 strains (Table 3b).
Table 3.
APMV species | Genbank ID | Predicted Nuclear localization signal sequence and position in W proteins | Predicted Nuclear export signal sequence and positionin W proteins |
---|---|---|---|
3a: NLS and NES within W proteins | |||
APMV -1 | FJ386394.2 | 138RAPKRGTTNVRLNSREVNPAAETVRKDRRTK168 |
210IPLYL214 (predicted by NetNES 1.1 Server); 105VDTQLRTGASNSLLL119,202SDQGRAKTIPLYLRI216, 203DQGRAKTIPLYLRIM217 (predicted by LocNES) |
JQ015296.1 | 106DTQLKTGASNSLLSMLDKLSNKSSNAKKGAH136 | None predicted | |
KX761866.1 | 106DTQLKTGASNSLLSMLDKLSNKSSNAKKGAH136 | 210ILLHLWIMSSYLSTL224 (predicted by LocNES) | |
KT445901.1 | 138RAPKRGTTNVRLNSREVNPAAETVRKDRRTK168 |
210IPLYL214 (predicted by NetNES 1.1 Server); 105VDTQLRTGASNSLLL119, 202SDQGRAKTIPLYLRI216, 203DQGRAKTIPLYLRIM217 (predicted by LocNES) |
|
AY562986.1 | 106DAQLKTGASNSLLSMLDKLSNKSPNTKKGAH136 |
13LEI15 (predicted by NetNES 1.1 Server); 1MATFTDAEIDDILEIn (predicted by LocNES) |
|
APMV-6 | GQ406232.1 | 125RKLEKQGKSSGKTTADSSPGRDPSPSKKGARn | 13LELSSDVI20(predicted by NetNES 1.1 Server) |
APMV-9 | EU910942.1 | 157RRRKTRHPADQPTHRDTAWSPERPPSCQSR186 |
12LMEQSGLVI20 (predicted by NetNES 1.1 Server 125KEGATGGLLDMLDRI139 (predicted by LocNES) |
APMV-20 | MF033136.1 | 117KSNMGKDLDSALAKLEKKAASIKSDK142 (within N terminal region) | None predicted |
3b: NLS and NES within V proteins | |||
APMV species | Genbank ID | Predicted Nuclear localization signal sequence and position in V proteins | Predicted Nuclear export signal sequence and positionin V proteins |
APMV-5 | GU206351.1 | 197PTTKHTAKPNQDRLSNQKRHR217 | 4TDDQAILDLLTL-N |
LC168750.1 | nPTTKHTAKSNQDRLSNQKRHR217 | 4TDDQAILDLLTL-N | |
APMV-20 | 117KSNMGKDLDSALAKLEKKAASIKSDK142 (within N terminal region) | None predicted |
Comparison of W protein sequences of APMV species
The length of W protein varied between 125 and 227 amino acids (aa) with calculated molecular weights between 13.30 kDa and 24.38 kDa. APMV-3 strain Netherland and APMV-7 had the shortest W protein (125 aa) and two strains of APMV-1, mesogenic strain KX761866.1 and velogenic strain KJ808820.1 (227 aa) had the longest W protein.
The length (in aa) of W protein was conserved among all the strains of APMV-2 (207 aa), all the strains of APMV-4 (137 aa), all the strains of APMV-5 (187 aa), all the strains of APMV-10 (172 aa) and all the strains of APMV-13 (150 aa). Also, similarity in the W protein length was noticed between the following species: W protein length of 172 aa was observed in all strains of APMV-10 and all strains of APMV-8 except strain FJ215863.2; W protein length of 177 aa was deduced in one strain of APMV-1 (JQ015296.1) and 3 strains of APMV-6 (EU622637.2, AY029299.1, EF569970.1), while the W protein length of APMV-12 and APMV-5 were 187 aa. Variations in the W protein length between strains within the same species were observed in APMV-1 (227, 221, 196, 183, 179, 177, 137 aa), APMV-3 (125, 127 aa), APMV-6 (157, 162, 177, 197 aa) and APMV-8 (172, 203 aa). APMV-1 strains analyzed in this study, had the longest unique C terminal region when compared to other APMV species (Table 1 and Fig. 2).
The lowest amino acid percentage identity (2.4) was observed between W proteins of APMV-12 and APMV-3 strain Netherland. Incidentally, APMV-3 strain Netherland also showed the lowest amino acid identity with W proteins of other APMV species. The lowest amino acid divergence (83.7) was noticed between APMV-1 isolate HN1007 (KX761866.1) and APMV-16, their W protein amino acid identity was 48.6% which was the highest homology observed between APMV species (Supplementary File S5).
The NLS were identified in five out of the twelve strains of APMV-1, in one of the eight strains of APMV-6 (GQ406232.1) and in APMV-9 in their C terminal region while NLS in W protein of APMV-20 was observed in the N terminal region (shared with P and V proteins). The presence of NES were predicted in these viruses except in one strain of APMV-1 (JQ015296.1) and APMV-20 (Table 3a).
Phylogenetic tree and evolutionary distance analysis
Based on their V protein sequences, phylogenetically the APMV species formed three distinct groups: group 1 consisted of APMV -3 strains, group 2 consisted of APMV -1, -9, -12, -13, -16, -21, APV -A, -B and -C (all orthoavulaviruses) and group 3 consisted of APMV -2, -5, -6, -7, -8, -10, -11, -14, -15, -20 and -4 (all metaavulaviruses and one paraavulavirus) (Fig. 3). The highest evolutionary divergence of 2.20 was observed between APMV-3 & APMV-4 and APMV-3 & APMV-12 followed by a divergence value of 1.91 between APMV-3 & APMV-5 and APMV-3 & APMV-11. The lowest divergence was between APV-A and APV-B (0.47) followed by APMV-9 & APMV-21 (0.50). The distance between the strains of the same species was noticed more in APMV-3 (0.4) followed by APMV-1 (0.297), which was further reiterated by their lower percentage of amino acid homology (Table 4).
Table 4.
The number of amino acid substitutions per site from averaging over all sequence pairs between groups is shown. Standard error estimate(s) are shown above the diagonal. Analyses were conducted using the JTT matrix-based model. Evolutionary analyses were conducted in MEGA7.
The phylogenetic tree obtained from W protein sequences analysis showed clustering of strains of the same species (Fig. 4). The evolutionary distance analyses of W proteins of APMV species revealed that APMV-3 species is more divergent than other APMV species. The highest evolutionary divergence was noticed between APMV-3 & APV-C species (10.322) followed by APMV-3 & APMV-13 (8.594) and APMV-3 & APMV-12 (8.270). The lowest divergence was observed between APMV-9 & APMV-21 (0.400) followed by APV-A & APV-B (0.407). The distance between the strains of the same species was more in APMV-3 (0.619) followed by APMV-1 (0.256) which was also apparent from their lower percentage of amino acid homology (Table 5).
Table 5.
The number of amino acid substitutions per site from averaging over all sequence pairs between groups is shown. Standard error estimate(s) are shown above the diagonal. Analyses were conducted using the Dayhoff matrix based model. The rate variation among sites was modeled with a gamma distribution (shape parameter = 1). Evolutionary analyses were conducted in MEGA7.
Selection pressure analysis
The dN/dS ratio was used to determine the natural selection pressure acting on the P gene edited products. The dN/dS ratio was estimated by DnaSP v6.12.03 for APMV species that comprised of more than one strain. The dN/dS values were significantly less than 1 for both V and W sequences (complete, N- and C-terminal regions) of most species explaining that they are under negative selection pressure. Only the C terminal region of V proteins of APMV-3 strains showed positive selection with dN/dS> 1 (Table 6).
Table 6.
Species | V -complete dN/dS ratio | W -complete dN/dS ratio | P-V-W shared N terminal region dN/dS ratio | V -C terminal region dN/dS ratio | W -C terminal region dN/dS ratio |
---|---|---|---|---|---|
APMV-1 | 0.435 | 0.583 | 0.604 | 0.660 | 0.893 |
APMV-2 | 0.637 | 0.115 | 0.459 | 0.000 | — |
APMV-3 | 0.489 | 0.651 | 0.212 | 1.092 | 0.437 |
APMV-4 | 0.327 | 0.776 | 0.636 | 0.987 | — |
APMV-5 | 0.450 | 0.646 | 0.534 | 0.631 | 0.297 |
APMV-6 | 0.513 | 0.600 | 0.582 | 0.935 | 0.000 |
APMV-8 | 0.439 | 0.548 | 0.529 | — | — |
APMV-10 | 0.388 | 0.684 | 0.534 | 0.00 | 0.00 |
APMV-13 | 0.398 | 0.320 | 0.502 | 0.143 | 0.00 |
Species wise calculation of dN/dS ratios. Only those species comprising of more than one strain were included for the substitution calculation. The dN/dS ratio was estimated by DnaSP v6.12.0374. The dN/dS value was zero in those where the number of non-synonymous substitutions was zero and the dN/dS value could not be determined (−) in those where the number of synonymous substitutions were zero.
Evolutionary rate analysis
The substitution rate of the V and W nucleotide sequences of APMV species that comprised of more than two strains were estimated by uncorrelated relaxed clock with lognormal using BEAST software. APMV-10 comprised of four strains, which were 98.65% to 100% identical to each other and hence the substitution rate could not be determined. The substitution rate was highest in APMV-13 followed by APMV-2, APMV-6 for both V and W proteins. The overall substitution rate was 7.37 × 10−5 for V protein and 8.07 × 10−5 for W protein (Table 7).
Table 7.
Species | No. of strains | V protein substitution rate | W protein substitution rate |
---|---|---|---|
APMV-1 | 12 | 5.15 × 10−4 | 1.00 × 10−4 |
APMV-2 | 5 | 3.38 × 10−2 | 5.51 × 10−2 |
APMV-6 | 8 | 2.20 × 10−2 | 2.22 × 10−2 |
APMV-8 | 5 | 7.89 × 10−3 | 8.21 × 10−3 |
APMV-13 | 3 | 2.45 × 10−2 | 9.64 × 10−2 |
Total | 55 (21 APMV species) | 7.37 × 10−5 | 8.07 × 10−5 |
The best clock determined was uncorrelated relaxed clock with lognormal. The BEAST analysis could not done in species with less than 3 strains (APMV-3,-4,-5,-7,-9,-11,-12,-14,-15,-16,-17,-18,-19,-20 and -21) and data for APMV-10 could not be determined because APMV-10 strains were 98.65% to 100% identical to each other. The final data analysis was performed using tracer v 1.7.1 software.
Discussion
Avian paramyxoviruses are known to infect a variety of bird species across the globe. Currently 21 species (previously called as serotypes) of APMVs are characterized and more viruses could be identified in future with improved viral surveillance programs. Paramyxoviruses with their small genome have a unique strategy of maximizing their genomic information by expressing viral proteins through co-transcriptional RNA editing. This helps to avoid error catastrophe caused by higher mutation rates often associated with larger genomes. Additionally, these viruses follow the ‘rule of six’ for efficient replication. Though, detailed studies on APMV structural genes and their complete genomes are available, a comparative knowledge of their accessory proteins expressed through RNA editing is lacking. In this study, using bioinformatics approach, we analyzed the P gene editing site, predicted and studied the protein sequences of edited products- V and W, of all 21 APMV species (55 viruses) known till date.
The hexamer phasing at the P gene editing site within each virus group is conserved7. We observed conserved hexamer phasing between certain APMV species and also, within each APMVs except in APMV-3 and for one strain of APMV-6. The hexamer phase is known to regulate the mRNA editing pattern, though subtle, it is important; for example, in human and bovine parainfluenza virus type 3 (PIV-3) in which the hexamer positions at P gene editing site are 2 to 5, higher mRNA editing frequency (~70%) and more number of G insertions (1 to 6 at equal frequencies) are observed while least mRNA editing ~30% with only 1 to 3 G insertions occur in Sendai virus wherein hexamer phase position is 1 at the P gene editing site 7,79,80. Based on the hexamer phasing position, it is anticipated that, in all APMVs except in APMV -20, the editing frequency could be extensive with possibilities of more number of G insertions. However, the cis-acting sequence of P gene editing site in APMV-20 (3’AA), suggests higher mRNA editing frequency and increased number of G insertions as reported in human and bovine PIV-381,82. Thus APMVs seem to follow PIV-3 RNA editing phenotype.
Another interesting observation is the unique editing site sequence of APMV-11 (3’A4UUCUUC5), in which the unedited mRNA translates to V protein and it has been suggested that 2G insertions in mRNA translates to P protein17. In rubulaviruses with P gene editing site sequence of3’A3UUCUC4, realignment of the nascent mRNA/template hybrid during 1G insertion would mean non permissible A:C base pairing hence the minimum insertion expected is 2G83. The base pairing between the nascent chain and the template genome to form a hybrid is important to prevent transcriptional slippage by the polymerase80. Similarly, in APMV-11, 1G and 2G insertions would lead to unstable A:C base pairing, hence 3G insertion (V protein) could be the minimum number of insertions expected, also, while a 4G insertion would translate to W protein, a 5G insertion would lead to P protein synthesis. It needs to be explored if APMV-11 expresses more V protein (from both unedited mRNA and 3G insertions) than other paramyxoviruses. Furthermore, it will be interesting to study if deletions in addition to G insertions could happen in APMV-11 and other APMVs with longer C runs at the editing site as described previously with recombinant Sendai virus and PIV-3 minigenomes83.
Three factors, the editing site (sequence and the length of C runs), the type of sequences immediately upstream of the editing site (cis-acting sequence) and the hexamer phase positions are known to decide the editing phenotype (i.e. number of G insertions, deletions and frequency of mRNA editing) which further can influence the virus pathogenicity81,82,84,85. Among APMVs, variations are observed in (i) P gene editing site, (ii) hexamer phase at the editing site and also (iii) the sequences immediately upstream of the editing site, all of which will determine the expression levels and relative proportions of P, V and W proteins in the APMVs which in turn could explain their differences in replication, pathogenicity and virus-host interactions.
With respect to the length and amino acid composition of V and W proteins, there were huge variations between species than within species, which was also reiterated by their dN/dS estimates. The V proteins were more conserved than W proteins. Higher sequence identity for V proteins was observed between the strains of the same species (exception was APMV-3) more often than between species. Phylogenetically, V protein analysis of APMV species grouped viruses similar to the individual gene-based phylogeny2, all the members of genus Orthoavulavirus clustered into one group and all members of genus Metaavulvirus along with one of the avian paraavulaviruses (APMV-4) formed the second group while the avian paraavulavirus, APMV-3, formed an outgroup. This was further affirmed by evolutionary distance analysis. The phylogenetic analysis and the evolutionary distance data of both V and W proteins clearly showed that APMV-3 strains are the most divergent.
The N terminal region of V and W proteins which is shared with P protein, showed highest conservation among APMVs. In their C terminal region, the V proteins of paramyxoviruses carry conserved arginine and isoleucine residues upstream of highly conserved seven cysteine residues (zinc binding domain), known to play important roles in MDA5 interference, STAT1 degradation and blocking interferon signaling to evade host immunity86–88. The V proteins of all 21 APMV species analyzed in this study had the seven cysteine residues and remarkably, many other amino acids were also conserved in their C terminal region. Though similar observations have been made earlier in other paramyxovirus V proteins, their functional importance is unknown yet86.
The V and W genes were found to be under negative selection pressure with dN/dS <1 in all the species. This shows the conserved nature of the non-structural viral proteins within the species and probably indicates their functional importance, which is yet to be completely explored. Furthermore, the substitution rates of APMV species determined by molecular clock was varying across the species and was higher between species than within species. The substitution rates for W proteins was higher than V proteins except for APMV-1, where more strains were available for comparison. Though the substitution rate was slightly higher for V nucleotide sequence (5.15 × 10−4) of APMV-1 compared to W nucleotide sequence (1.0 × 10−4), it did not lead to changes in amino acid sequence as evident from dN/dS ratio estimates suggesting negative selection pressure. The higher conservation of V protein sequence implies its significant role in virus biology such as replication, pathogenesis and immune evasion.
In contrast to V proteins, the APMV W proteins were highly disordered, showed little sequence conservation when compared to V proteins and their divergence values were higher when compared to V proteins. The evolutionary data analysis of W proteins suggested higher sequence identity among strains of same species and higher variability between species. There were no conserved sequences or motifs in the C terminal region of W proteins except that most them carried large number of basic amino acids suggesting W protein to be highly basic as described previously12. The exceptions were one strain of APMV-1 (KJ736742.1), APMV-4, APMV-7 and APV-C, and all of them had shorter W protein length. This genetic diversity seen in the W proteins may determine the degree of pathogenesis, variable interferon antagonistic activity and the wide host range exhibited by the APMV species.
The likelihood of W mRNA occurrence could be less than that of V mRNA, because of two unstable base pairing created by the two mismatches (2G) during polymerase stuttering. This skepticism becomes more compelling and doubts rise to whether W protein is expressed at all in those APMV species whose predicted W protein sequences have only fewer amino acid residues in their C terminal region- single (in APMV-7) or two (in APMV-4, APMV-13 & APMV-15) or three (in APMV-1 isolate R75/98) or four (in APMV-6, JX522537.1 & APV-C) or five (in APMV-5 & APMV-16) amino acids. However, equal or higher frequencies of insertions of 1G and 2G during RNA editing have been reported in certain paramyxoviruses such as Nipah virus and Bovine Parainfluenza virus type 389,90.
The presence of W mRNA of APMV-1 was first accounted in 1993 with a frequency of about 10%12, and the W protein expression from APMV-1 lentogenic strain Clone 30 and from APMV-1 lentogenic strain La Sota and velogenic strain SG10 was recently confirmed16,91]. We had earlier shown that W protein of APMV-1 mesogenic strain Komarov compartmentalized in the nucleus using plasmid system30, while the same has also been documented during virus infection in cells in the above two studies. Here, we report NLS and NES of W proteins predicted only in certain APMV species, also, we could identify NLS only in five out of twelve strains of APMV-1 implying that the not all the W proteins of APMV-1 strains localize in the nucleus. The W protein sequence analysis of nearly 1000 strains of APMV-1 in our lab show variations in the W protein length between strains (unpublished data),which was also reported recently in an analysis of 286 strains of NDV91, furthermore, W proteins of only about 50% of the strains analyzed by us are predicted to localize into the nucleus (data not shown) leading us to speculate that these differences in W proteins can attribute to the wide spectrum of pathogenicity and virulence observed in Newcastle disease.
Among paramyxoviruses, the W protein of Nipah and Hendra viruses, are the most well characterized. The nuclear localization of W protein of Nipah virus was found to modify p53 expression and activity92, sequester inactive STAT1 within nucleus93, prevent IRF3 phosphorylation, inhibit IFN signaling mediated both by the virus and TLR-329, modulate host immunity, influence the disease course and viral pathogenesis specifically neurovirulence27,28,94. Intriguingly, neither the lack of W protein nor its cytoplasmic localization in APMV-1 strain clone 30 had any effect on viral replication in cell culture16. Though no conserved motifs could be identified between the W proteins of APMVs and Nipah virus, it will be interesting to study if similar roles are executed by W proteins of any or all the APMV species. To our knowledge, this is the first comprehensive and comparative evolutionary study of the P gene edited accessory viral proteins of APMVs. The information obtained by this study will enable designing future studies to understand the specific functions of conserved motifs/amino acids of V and W proteins and decipher their evolutionary significance on the virus and as well as on the host.
Supplementary information
Acknowledgements
The authors thank their lab members, National Institute of Animal Biotechnology and Department of Biotechnology, Government of India for supporting the study. This work was funded by the National Institute of Animal Biotechnology core grant (#C0007) and the Department of Biotechnology, Government of India (#BT/PR8740/AGR/36/772/2013).
Author contributions
M.S. conceived and designed the study, P.L.R. collected the data, M.S., P.L.R. and R.G. analyzed the data, M.S., P.L.R., R.G. drafted the manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history
10/2/2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
Supplementary information
is available for this paper at 10.1038/s41598-020-66252-x.
References
- 1.Amarasinghe GK, et al. Taxonomy of the order Mononegavirales: update 2019. Archives of Virology. 2019;164:1967–1980. doi: 10.1007/s00705-019-04247-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Aziz-ul-Rahman MM, Shabbir MZ. Molecular Phylogenetics and Evolution Comparative evolutionary and phylogenomic analysis of Avian avulaviruses. Molecular Phylogenetics and Evolution. 2018;127:931–951. doi: 10.1016/j.ympev.2018.06.040. [DOI] [PubMed] [Google Scholar]
- 3.OIE. Infection with Newcastle disease virus. In OIE - Terrestrial Animal Health Code. Preprint at, http://www.oie.int/fileadmin/Home/eng/Health_standards/tahc/current/chapitre_nd.pdf (2019).
- 4.Lamb, R. A. & Parks, G. D. Paramyxoviridae: the viruses and their replication (Fields vir; & P. M. H. (Eds. B. N. Fields, D. N. Knipe, ed.). Lippincott, Williams, and Wilkins (2007).
- 5.Millar NS, Emmerson PT. Molecular Cloning and Nucleotide Sequencing of Newcastle Disease. Virus. Newcastle Disease. 1988;8:79–97. [Google Scholar]
- 6.Calain P, Roux L. The rule of six, a basic feature for efficient replication of Sendai virus defective interfering RNA. Journal of General Virology. 1993;67:4822–4830. doi: 10.1128/jvi.67.8.4822-4830.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kolakofsky D, et al. Paramyxovirus RNA synthesis and the requirement for hexamer genome length: the rule of six revisited. Journal of Virology. 1998;72:891–899. doi: 10.1128/jvi.72.2.891-899.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chang PC, et al. Complete nucleotide sequence of avian paramyxovirus type 6 isolated from ducks. Journal of General Virology. 2001;82:2157–2168. doi: 10.1099/0022-1317-82-9-2157. [DOI] [PubMed] [Google Scholar]
- 9.Chambers P, Samson AC. Non-structural proteins in Newcastle disease virus-infected cells. Journal of General Virology. 1982;58(Pt 1):1–12. doi: 10.1099/0022-1317-58-1-1. [DOI] [PubMed] [Google Scholar]
- 10.Locke DP, et al. Newcastle disease virus phosphoprotein gene analysis and transcriptional editing in avian cells. Virus Research. 2000;69:55–68. doi: 10.1016/s0168-1702(00)00175-1. [DOI] [PubMed] [Google Scholar]
- 11.Mebatsion T, Verstegen S, De Vaan LTC, Römer-Oberdörfer A, Schrier CC. A recombinant Newcastle Disease Virus with low-level V protein expression is immunogenic and lacks pathogenicity for chicken embryos. Journal of Virology. 2001;75:420–428. doi: 10.1128/JVI.75.1.420-428.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Steward M, Vipond IB, Millar NS, Emmerson PT. RNA editing in Newcastle disease virus. Journal of General Virology. 1993;74:2539–2547. doi: 10.1099/0022-1317-74-12-2539. [DOI] [PubMed] [Google Scholar]
- 13.Giorgi C, Blumberg BM, Kolakofsky D. Sendai virus contains overlapping genes expressed from a single mRNA. Cell. 1983;35:829–836. doi: 10.1016/0092-8674(83)90115-0. [DOI] [PubMed] [Google Scholar]
- 14.Kolakofsky, D., Vidal, S. & Curran, J. Paramyxovirus RNA Synthesis and P Gene Expression. In K. D.W. (Ed.), The Paramyxoviruses,The Viruses. Springer, Boston, MA (1991).
- 15.Vidal S, Curran J, Kolakofsky D. A stuttering model for paramyxovirus P mRNA editing. The EMBO Journal. 1990;9:2017–2022. doi: 10.1002/j.1460-2075.1990.tb08330.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Karsunke J, et al. W protein expression by Newcastle disease virus. Virus Research. 2019;263:207–216. doi: 10.1016/j.virusres.2019.02.003. [DOI] [PubMed] [Google Scholar]
- 17.Briand FX, Henry A, Massin P, Jestin V. Complete genome sequence of a novel avian paramyxovirus. Journal of Virology. 2012;86:7710–7710. doi: 10.1128/JVI.00946-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Paterson RG, Lamb RA. RNA editing by G-nucleotide insertion in Mumps virus P-gene mRNA transcripts. Journal of Virology. 1990;64:4137–4145. doi: 10.1128/jvi.64.9.4137-4145.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Childs KS, Andrejeva J, Randall RE, Goodbourn S. Mechanism of mda-5 inhibition by paramyxovirus V proteins. Journal of Virology. 2009;83:1465–73. doi: 10.1128/JVI.01768-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Huang Z, Krishnamurthy S, Panda A, Samal SK. Newcastle disease virus V protein is associated with viral pathogenesis and functions as an alpha interferon Antagonist. Journal of Virology. 2003;77:8676–8685. doi: 10.1128/JVI.77.16.8676-8685.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Park MS, García-Sastre A, Cros JF, Basler CF, Palese P. Newcastle Disease Virus V protein is a determinant of host range restriction. Journal of Virology. 2003;77:9522–9532. doi: 10.1128/JVI.77.17.9522-9532.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Park MS, et al. Newcastle disease virus (NDV)-based assay demonstrates interferon-antagonist activity for the NDV V protein and the Nipah virus V, W, and C proteins. Journal of Virology. 2003;77:1501–1511. doi: 10.1128/JVI.77.2.1501-1511.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chu Z, et al. Newcastle Disease Virus V protein inhibits cell apoptosis and promotes viral replication by targeting CacyBP/SIP. Frontiers in Cellular and Infection Microbiology. 2018;8:304. doi: 10.3389/fcimb.2018.00304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang C, et al. Newcastle disease virus V protein inhibits apoptosis in DF-1 cells by downregulating TXNL1. Veterinary Research. 2018;49:102. doi: 10.1186/s13567-018-0599-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Chu Z, et al. Newcastle disease virus V protein promotes viral replication in HeLa cells through the activation of MEK/ERK signaling. Viruses. 2018;10:489. doi: 10.3390/v10090489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Alamares JG, Elankumaran S, Samal SK, Iorio RM. The interferon antagonistic activities of the V proteins from two strains of Newcastle disease virus correlate with their known virulence properties. Virus Research. 2010;147:153–157. doi: 10.1016/j.virusres.2009.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Satterfield BA, et al. The immunomodulating V and W proteins of Nipah virus determine disease course. Nature Communications. 2015;6:7483. doi: 10.1038/ncomms8483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Satterfield BA, Geisbert TW, Mire CE. Inhibition of the host antiviral response by Nipah virus: current understanding and future perspectives. Future Virology. 2016;11:331–344. [Google Scholar]
- 29.Shaw ML, Cardenas WB, Zamarin D, Palese P, Basler CF. Nuclear localization of the Nipah virus W protein allows for inhibition of both virus-and toll-like receptor 3-triggered signaling pathways. Journal of Virology. 2005;79:6078–6088. doi: 10.1128/JVI.79.10.6078-6088.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Vaidyanathan SP, Gawai S. & Subbiah, M. Poster Presentation, ICID 2016: Elucidation of the role of non-structural viral protein (W) of Newcastle disease virus. International Journal of Infectious Diseases. 2016;45:337–338. [Google Scholar]
- 31.Abolnik C, De Castro M, Rees J. Full genomic sequence of an African Avian Paramyxovirus Type 4 strain isolated from a wild duck. Virus Genes. 2012;45:537–541. doi: 10.1007/s11262-012-0805-y. [DOI] [PubMed] [Google Scholar]
- 32.Fereidouni S, et al. Next-generation sequencing of five new avian paramyxoviruses 8 isolates from Kazakhstan indicates a low genetic evolution rate over four decades. Archives of Virology. 2018;163:331–336. doi: 10.1007/s00705-017-3593-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Heiden S, Grund C, Höper D, Mettenleiter TC, Römer-Oberdörfer A. Pigeon paramyxovirus type 1 variants with polybasic F protein cleavage site but strikingly different pathogenicity. Virus Genes. 2014;49:502–506. doi: 10.1007/s11262-014-1111-7. [DOI] [PubMed] [Google Scholar]
- 34.Hiono, T., Matsuno, K., Tuchiya, K., Lin, Z., Okamatsu, M. & Sakoda, Y. Complete genome sequence of the avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/TI/75. Genome Announcement4, pii: e01005-16 (2016). [DOI] [PMC free article] [PubMed]
- 35.Jeong J, et al. Complete genome sequence of a novel avian paramyxovirus isolated from wild birds in South Korea. Archives of Virology. 2018;163:223–227. doi: 10.1007/s00705-017-3588-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Karamendin K, et al. Novel avian paramyxovirus isolated from gulls in Caspian seashore in Kazakhstan. PLoS One. 2017;12:1–15. doi: 10.1371/journal.pone.0190339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Karamendin K, et al. Complete genome sequence of avian paramyxovirus strain APMV-6/red-crested pochard/Balkhash/5842/2013 from Kazakhstan. Genome Announcements. 2015;3:e00158–15. doi: 10.1128/genomeA.00158-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kim SH, Subbiah M, Samuel AS, Collins PL, Samal SK. Roles of the fusion and hemagglutinin-neuraminidase proteins in replication, tropism, and pathogenicity of avian paramyxoviruses. Journal of Virology. 2011;85:8582–8596. doi: 10.1128/JVI.00652-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kumar S, Nayak B, Collins PL, Samal SK. Complete genome sequence of avian paramyxovirus type 3 reveals an unusually long trailer region. Virus Research. 2008;137:189–197. doi: 10.1016/j.virusres.2008.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kumar S, Nayak B, Samuel AS, Xiao S, Collins PL, Samal SK. Complete genome sequence of avian paramyxovirus-3 strain Wisconsin: Evidence for the existence of subgroups within the serotype. Virus Research. 2010;149:78–85. doi: 10.1016/j.virusres.2009.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kydyrmanov AI, et al. Novel avian paramyxovirus isolated from gulls in Caspian seashore in Kazakhstan. PLoS One. 28. 2017;12(12):e0190339. doi: 10.1371/journal.pone.0190339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lee HJ, et al. A novel avian paramyxovirus (Putative Serotype 15) isolated from wild birds. Frontiers in Microbiology. 2017;8:786. doi: 10.3389/fmicb.2017.00786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li X, Zhang S, Wang H, Zhao J, Zhang G. Genomic characterization of two avian paramyxovirus type 2 isolates from chickens in China. Virus Genes. 2011;43:55–59. doi: 10.1007/s11262-011-0604-x. [DOI] [PubMed] [Google Scholar]
- 44.Miller PJ, et al. Evidence for a new avian paramyxovirus serotype 10 detected in Rockhopper Penguins from the Falkland Islands. Journal of Virology. 2010;84:11496–11504. doi: 10.1128/JVI.00822-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Nayak B, Kumar S, Collins PL, Samal SK. Molecular characterization and complete genome sequence of avian paramyxovirus type 4 prototype strain duck/Hong Kong/D3/75. Virology Journal. 2008;5:124. doi: 10.1186/1743-422X-5-124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Neira V, et al. Novel avulaviruses in penguins, Antarctica. Emerging Infectious Diseases. 2017;23:1212–1214. doi: 10.3201/eid2307.170054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Paldurai A, Subbiah M, Kumar S, Collins PL, Samal SK. Complete genome sequences of avian paramyxovirus type 8 strains goose/Delaware/1053/76 and pintail/Wakuya/20/78. Virus Research. 2009;142:144–153. doi: 10.1016/j.virusres.2009.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Tirumurugaan KG, et al. Genotypic and pathotypic characterization of Newcastle disease viruses from India. PLoS One. 2011;6:e28414. doi: 10.1371/journal.pone.0028414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Samuel AS, Kumar S, Madhuri S, Collins PL, Samal SK. Complete sequence of the genome of avian paramyxovirus type 9 and comparison with other paramyxoviruses. Virus Research. 2009;142:10–18. doi: 10.1016/j.virusres.2008.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Samuel AS, Paldurai A, Kumar S, Collins PL, Samal SK. Complete genome sequence of avian paramyxovirus (APMV) serotype 5 completes the analysis of nine APMV serotypes and reveals the longest APMV genome. PLoS One. 2010;5:1–13. doi: 10.1371/journal.pone.0009269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Subbiah M, Nayak S, Collins PL, Samal SK. Complete genome sequences of avian paramyxovirus serotype 2 (APMV-2) strains Bangor, England and Kenya: Evidence for the existence of subgroups within serotype 2. Virus Research. 2010;152:85–95. doi: 10.1016/j.virusres.2010.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Subbiah M, Xiao S, Collins PL, Samal SK. Complete sequence of the genome of avian paramyxovirus type 2 (strain Yucaipa) and comparison with other paramyxoviruses. Virus Research. 2008;137:40–48. doi: 10.1016/j.virusres.2008.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Subbiah M, et al. Pathogenesis of two strains of avian paramyxovirus serotype 2, Yucaipa and Bangor, in chickens and turkeys. Avian Diseases. 2010;54:1050–1057. doi: 10.1637/9380-041910-Reg.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Terregino C, et al. Antigenic and genetic analyses of isolate APMV/wigeon/Italy/3920-1/2005 indicate that it represents a new avian paramyxovirus (APMV-12) Archives of Virology. 2013;158:2233–2243. doi: 10.1007/s00705-013-1735-2. [DOI] [PubMed] [Google Scholar]
- 55.Thampaisarn R, et al. Characterization of avian paramyxovirus serotype 14, a novel serotype, isolated from a duck fecal sample in Japan. Virus Research. 2017;228:46–57. doi: 10.1016/j.virusres.2016.11.018. [DOI] [PubMed] [Google Scholar]
- 56.Thomazelli LM, et al. Novel avian paramyxovirus (APMV-15) isolated from a migratory bird in South America. PLoS One. 2017;12:1–7. doi: 10.1371/journal.pone.0177214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Tian Z, et al. Complete nucleotide sequence of Avian Paramyxovirus type 6 strain JL isolated from mallard ducks in China. Journal of Virology. 2012;86:13112. doi: 10.1128/JVI.02317-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Tsunekuni R, Ito H, Otsuki K, Kida H, Ito T. Genetic comparisons between lentogenic Newcastle disease virus isolated from waterfowl and velogenic variants. Virus Genes. 2010;40:252–255. doi: 10.1007/s11262-009-0427-1. [DOI] [PubMed] [Google Scholar]
- 59.Wu W, et al. Molecular and antigenic characteristics of Newcastle disease virus isolates from domestic ducks in China. Infection, Genetics and Evolution. 2015;32:34–43. doi: 10.1016/j.meegid.2015.02.016. [DOI] [PubMed] [Google Scholar]
- 60.Xiao S, et al. Complete genome sequence of avian paramyxovirus type 7 (strain Tennessee) and comparison with other paramyxoviruses. Virus Research. 2009;145:80–91. doi: 10.1016/j.virusres.2009.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Xiao S, et al. Complete genome sequences of avian paramyxovirus serotype 6 prototype strain Hong Kong and a recent novel strain from Italy: Evidence for the existence of subgroups within the serotype. Virus Research. 2010;150:61–72. doi: 10.1016/j.virusres.2010.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Yamamoto E, Ito H, Tomioka Y, Ito T. Characterization of novel avian paramyxovirus strain APMV/Shimane67 isolated from migratory wild geese in Japan. The Journal of Veterinary Medical Science. 2015;77:1079–1085. doi: 10.1292/jvms.14-0529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Aziz-ul-Rahman & M. Z. Shabbir. One (1) new species in the genus Avulavirus (Mononegavirales: Paramyxoviridae). Taxonomic proposal document submitted to ICTV, code assigned 2019.014M (2019).
- 64.Armougom F, et al. Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Research. 2006;34:W604–W608. doi: 10.1093/nar/gkl092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Notredame, C., Higgins, D. G. & Heringa, J. T-coffee: a novel method for fast and accurate multiple sequence alignment11Edited by J. Thornton. Journal of Molecular Biology302, 205–217 (2000). [DOI] [PubMed]
- 66.Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Research. 2014;42:W320–W324. doi: 10.1093/nar/gku316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Marchler-Bauer A, et al. CDD: NCBI’s conserved domain database. Nucleic Acids Research. 2015;43:D222–D226. doi: 10.1093/nar/gku1221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Bailey TL. DREME: motif discovery in transcription factor ChIP-seq data. Bioinformatics. 2011;27:1653–1659. doi: 10.1093/bioinformatics/btr261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kosugi, S., Hasebe, M., Tomita, M. & Yanagawa, H. Systematic identification of cell cycle-dependent yeast nucleocytoplasmic shuttling proteins by prediction of composite motifs. In Proceedings of the National Academy of Sciences106,10171-10176 (2009). [DOI] [PMC free article] [PubMed]
- 70.la Cour T, et al. Analysis and prediction of leucine-rich nuclear export signals. Protein Engineering, Design and Selection. 2004;17:527–536. doi: 10.1093/protein/gzh062. [DOI] [PubMed] [Google Scholar]
- 71.Xu D, et al. LocNES: a computational tool for locating classical NESs in CRM1 cargo proteins. Bioinformatics. 2015;31:1357–1365. doi: 10.1093/bioinformatics/btu826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Jones DT, Taylor WR, Thornton JM. The rapid generation of mutation data matrices. Computer Applications in the Biosciences. 1992;8:275–282. doi: 10.1093/bioinformatics/8.3.275. [DOI] [PubMed] [Google Scholar]
- 73.Schwartz RM, Dayhoff MO. Matrices for detecting distant relationships. In Atlas of protein sequence and structure. 1979;5:353–358. [Google Scholar]
- 74.Felsenstein J. Confidence Limits On Phylogenies: An Approach Using the Bootstrap. Evolution. 1985;39:783–791. doi: 10.1111/j.1558-5646.1985.tb00420.x. [DOI] [PubMed] [Google Scholar]
- 75.Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Molecular Biology and Evolution. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Rozas J, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Molecular biology and evolution. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
- 77.Suchard MA, et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evolution. 2018;4:vey016. doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Karlin D, Belshaw R. Detecting remote sequence homology in disordered proteins: Discovery of conserved motifs in the N-termini of mononegavirales phosphoproteins. PLoS One. 2012;7:e31719. doi: 10.1371/journal.pone.0031719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Kolakofsky D. Paramyxovirus RNA synthesis, mRNA editing, and genome hexamer phase: A review. Virology. 2016;498:94–98. doi: 10.1016/j.virol.2016.08.018. [DOI] [PubMed] [Google Scholar]
- 80.Iseni F, et al. Chemical modification of nucleotide bases and mRNA editing depend on hexamer or nucleoprotein phase in Sendai virus nucleocapsids. RNA. 2002;8:1056–1067. doi: 10.1017/s1355838202029977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Hausmann S, Garcin D, Morel AS, Kolakofsky D. Two nucleotides immediately upstream of the essential A6G3 slippery sequence modulate the pattern of G insertions during Sendai virus mRNA editing. Journal of Virology. 1999;73:343–351. doi: 10.1128/jvi.73.1.343-351.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Hausmann S, Garcin D, Delenda C, Kolakofsky D. The versatility of paramyxovirus RNA polymerase stuttering. Journal of Virology. 1999;73:5568–5576. doi: 10.1128/jvi.73.7.5568-5576.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Jacques JP, Hausmann S, Kolakofsky D. Paramyxovirus mRNA editing leads to G deletions as well as insertions. The EMBO Journal. 1994;13:5496–503. doi: 10.1002/j.1460-2075.1994.tb06884.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Mebatsion T, Verstegen S, De Vaan LT, Römer-Oberdörfer A, Schrier CC. A recombinant Newcastle disease virus with low-level V protein expression is immunogenic and lacks pathogenicity for chicken embryos. Journal of Virology. 2001;75:420–428. doi: 10.1128/JVI.75.1.420-428.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Mebatsion T, de Vaan LT, de Haas N, Römer-Oberdörfer A, Braber M. Identification of a mutation in editing of defective Newcastle disease virus recombinants that modulates P-gene mRNA editing and restores virus replication and pathogenicity in chicken embryos. Journal of Virology. 2003;77:9259–9265. doi: 10.1128/JVI.77.17.9259-9265.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Ramachandran A, Horvath CM. Dissociation of paramyxovirus interferon evasion activities: universal and virus-specific requirements for conserved V protein amino acids in MDA5 interference. Journal of Virology. 2010;84:11152–11163. doi: 10.1128/JVI.01375-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Steward M, Samson AC. & Errington W, E. P. The Newcastle disease virus V protein binds zinc. Archives of Virology. 1995;140:1321–1328. doi: 10.1007/BF01322759. [DOI] [PubMed] [Google Scholar]
- 88.Qiu X, et al. Newcastle Disease Virus V Protein Targets Phosphorylated STAT1 to Block IFN-I Signaling. PloS One. 2016;11:e0148560–e0148560. doi: 10.1371/journal.pone.0148560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Kulkarni S, et al. Nipah Virus edits its P Gene at High Frequency to Express the V and W Proteins. Journal of Virology. 2009;83:3982–3987. doi: 10.1128/JVI.02599-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Pelet T, Curran J, Kolakofsky D. The P gene of bovine parainfluenza virus 3 expresses all three reading frames from a single mRNA editing site. The EMBO Journal. 1991;10:443–448. doi: 10.1002/j.1460-2075.1991.tb07966.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Yang Y, et al. Appropriate amount of W protein of avian avulavirus 1 benefits viral replication and W shows strain-dependent subcellular localization. Virology. 2019;538:71–85. doi: 10.1016/j.virol.2019.09.013. [DOI] [PubMed] [Google Scholar]
- 92.Martinez-Gil, L., Vera-Velasco, N. M. & Mingarro, I. Exploring the Human-Nipah Virus Protein-Protein Interactome. Journal of Virology91, pii: e01461-17. (2017). [DOI] [PMC free article] [PubMed]
- 93.Ciancanelli MJ, Volchkova VA, Shaw ML, Volchkov VE, Basler CF. Nipah Virus Sequesters Inactive STAT1 in the Nucleus via a P Gene-Encoded Mechanism. Journal of Virology. 2009;83:7828–7841. doi: 10.1128/JVI.02610-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Yoneda M, et al. The nonstructural proteins of Nipah virus play a key role in pathogenicity in experimentally infected animals. PloS One. 2010;5:e12709–e12709. doi: 10.1371/journal.pone.0012709. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.