Skip to main content
PeerJ logoLink to PeerJ
. 2014 Sep 2;2:e556. doi: 10.7717/peerj.556

Evidence that ebolaviruses and cuevaviruses have been diverging from marburgviruses since the Miocene

Derek J Taylor 1,, Matthew J Ballinger 1, Jack J Zhan 1, Laura E Hanzly 1, Jeremy A Bruenn 1
Editor: Claus Wilke
PMCID: PMC4157239  PMID: 25237605

Abstract

An understanding of the timescale of evolution is critical for comparative virology but remains elusive for many RNA viruses. Age estimates based on mutation rates can severely underestimate divergences for ancient viral genes that are evolving under strong purifying selection. Paleoviral dating, however, can provide minimum age estimates for ancient divergence, but few orthologous paleoviruses are known within clades of extant viruses. For example, ebolaviruses and marburgviruses are well-studied mammalian pathogens, but their comparative biology is difficult to interpret because the existing estimates of divergence are controversial. Here we provide evidence that paleoviral elements of two genes (ebolavirus-like VP35 and NP) in cricetid rodent genomes originated after the divergence of ebolaviruses and cuevaviruses from marburgviruses. We provide evidence of orthology by identifying common paleoviral insertion sites among the rodent genomes. Our findings indicate that ebolaviruses and cuevaviruses have been diverging from marburgviruses since the early Miocene.

Keywords: Ebolavirus, Marburgvirus, Paleovirus, Cricetidae, VP35, NP, Filoviruses, Divergence estimation

Introduction

Knowledge of the timescale of evolution is a critical part of understanding host-virus interactions. Studies of viral systems that have evolved for tens of millions of years would perhaps be complicated by host shifts, broad geographic distributions, and functional divergences. Knowledge of divergence times might also affect design of vaccines and programs that identify emerging pathogens. However, the timescale of viral evolution has remained controversial (Gilbert & Feschotte, 2010; Holmes, 2003; Patel, Emerman & Malik, 2011; Sharp & Simmonds, 2011; Wertheim & Kosakovsky Pond, 2011). Fossil and geographic calibrations are normally absent and evolutionary rates based on isolation dates of historical strains often grossly underestimate long-term divergences. Part of the underestimation is due to the failure of commonly used models to accommodate the strong purifying selection of viral proteins (Duchêne, Holmes & Ho, 2014; Patel, Emerman & Malik, 2011; Wertheim & Kosakovsky Pond, 2011). But other aspects of viruses such as variation in replication rate also affect clock-based estimates (Hicks & Duffy, 2014; Holmes, 2003). Even models that accommodate purifying selection will eventually encounter a mutational saturation problem (Wertheim & Kosakovsky Pond, 2011). Age estimation using co-phylogeny with the host seems more promising, but detailed co-phylogenies are still uncommon and can be complicated by host jumping (Holmes, 2003).

Another potentially reliable source of minimum divergence times is endogenous paleoviral elements (Katzourakis et al., 2007). Over the last decade, evidence of the most unexpected class of paleoviral elements, Non-retroviral Endogenous RNA Viral Elements (NERVEs), has been provided for each major eukaryotic group by sequencing across the integration boundaries of putative viral elements and host genomes (Crochu et al., 2004; Horie et al., 2010; Liu et al., 2010; Tanne & Sela, 2005; Taylor & Bruenn, 2009; Taylor, Leach & Bruenn, 2010). BLAST searches of animal genome databases alone suggest that representatives of all known viral genome architectures are involved in the formation of paleoviral elements (Belyi, Levine & Skalka, 2010; Katzourakis & Gifford, 2010). Agreement of the NERVE phylogeny with the host phylogeny is evidence of insertion in the genome of a common ancestor. This pattern can be complicated by the formation of non-orthologous copies from independent insertions, duplications and horizontal transfers (Taylor & Bruenn, 2009). But, these complications become less important in phylogenies with greater taxonomic representation. Even stronger support for orthology is provided by evidence of common integration sites for NERVEs (Katzourakis & Gifford, 2010; Taylor, Leach & Bruenn, 2010). If the host genomic flanking sequences show significant similarity (microsynteny), then it is unlikely that NERVE insertions are independent (given the large number of possible insertions sites in eukaryotic genomes). Ballinger et al. (2014), for example, were able to identify microsyntenous NERVEs from a novel bunyavirid for the genus Drosophila and estimate a minimum date of 42 MY. As with the hosts, the paleoviral sequences are “sister” phylogenetic groups supporting a single origin. So, the strongest paleoviral calibrations satisfy two conditions: evidence of a common integration site in the host genomes and of a similar phylogeny of host and paleovirus.

Age estimation by synteny has limitations. As with real fossils, the age estimates based on paleoviruses are minimum dates based on the material currently available. The synteny of the oldest copies may be difficult to establish because of chromosomal evolution that occurred post-integration. An additional source of uncertainty arises from the dating of actual host fossils and of host divergences. It also may be imagined that NERVE-virus phylogenetic comparisons suffer because the mutation rate of RNA viruses is orders of magnitude greater than the mutation rate of the hosts. However, this reasoning ignores the growing evidence of strong purifying selection in viruses—a high mutation rate is not necessarily reflected in the amino acid substitution rate. Indeed, ancient NERVEs would simply be undetectable for viruses that diverged rapidly at the amino acid level.

Still, very few paleoviral calibrations are available for internal nodes of phylogenies of extant RNA viruses. Taylor et al. (2011) reported that the family Filoviridae must be at least 13 MY old because fossil copies of the NP and VP35-like genes have been integrated at common sites shared among the mouse (Mus musculus) and the rat (Rattus norvegicus) NCBI reference genomes. However, paleoviral calibrations that would permit estimation of a minimum divergence date for extant ebolaviruses, cuevaviruses and marburgviruses are unknown. Molecular estimates of the age of the common ancestor of extant known filovirids fall into two time ranges. One range is coincident with the rise of agriculture in humans from 7,100 to 10,400 years ago (Carroll et al., 2013; Suzuki & Gojobori, 1997). The other range is from the Middle Pleistocene at 155,000 years ago (Negredo et al., 2011). Still others have stated that the oldest extant filovirids (and other RNA viruses) are in a divergence zone that is simply recalcitrant to molecular clock dating (Wertheim & Kosakovsky Pond, 2011). In such a zone, even models that have been corrected for purifying selection will fail to a temporal signal that has been destroyed by synonymous substitutions.

Here we show that the limitations for using the molecular clock to date RNA viruses can be mitigated by the discovery and dating of orthologous paleoviral elements within clades of extant RNA viruses. From congruent evidence of two genes we report that the divergence of the known extant filovirids (marburgviruses, ebolaviruses, and cuevaviruses) is likely older than the Miocene ancestor of the hamsters and voles—a separation that is orders of magnitude greater than previous Holocene and Middle Pleistocene estimates of divergence.

Materials & Methods

PCR and DNA sequencing

We obtained a tail clipping of a dead specimen of a meadow vole from western New York state. We extracted DNA using the Epicentre Quickextract kit. PCR reactions with primers based on the genome assembly of Microtus ochrogaster (Wagner, 1842) were designed to amplify from the exon across the VP35-like gene insert boundary and from the intergenic region across the NP-like boundary of putative orthologs of hamster loci. Primers based on the assembly of the genome of the prairie vole (Microtus ochrogaster) for the VP-35-like region were: GAGCAGGCTTTTGCTTTGATTCCAG (forward), CTGATCTCAGCTATCTCACCTGCTAAGA (reverse). For the NP-like region primers were: TGCATTGCTTGGCCGTTCTGTATGC (forward) and ATAAGACATGCTCCTTGTCTTGAAG (reverse). The 5′ end of the mitochondrial COI gene region was also PCR amplified using custom primers based on published sequences of Microtus: TTACAGTCTAATGCTTTACTCAGCC (forward), ACTTCTGGGTGTCCGAAGAATCAG (reverse). PCR products were purified and submitted for Sanger sequencing at the Roswell Park Cancer Institute’s Biopolymer facility. Chromatograms were assembled and trimmed using Geneious 7.0 (Biomatters).

Bioinformatics

Genomic sequences from the NCBI WGS and reference genome databases were obtained by using protein sequences of NP and VP35 of Ebola virus as queries. We used the tBLASTn algorithm with mammals as a taxonomic delimiter. Resulting contigs with an expect value <10−5 were retained and exported. The NP and VP35 protein sequences from Ebola virus were then used to search for significant matches of the contigs using the FASTA program tfasty (Pearson, 2004). Translated sequences were then prepared as a FASTA format alignment by changing the NCBI header to a user-friendly name with a Python script. Sequences were submitted to the E-INS-I algorithm of MAFFT for multiple sequence alignment (Katoh & Standley, 2014). The resulting alignment file was then submitted to the transitive consistency score (TCS) algorithm of T-Coffee to assess alignment reliability (Chang, Di Tommaso & Notredame, 2014). Unfiltered and filtered alignment files in FASTA format for each filovirid-like gene are provided in the Supplemental Information. The lowest scoring categories of columns were successively filtered from the alignments using a Python script to assess the effect of rapidly evolving or differently evolving sites on branch support. To assess possible effects of increased rate evolution at the tips of the tree, the sequences of ancestral nodes of endogenous viral clades were estimated using the three ancestral reconstruction methods (Delport et al., 2010) in HyPhy (joint maximum likelihood, marginal maximum likelihood, and mode of the posterior distribution of characters) with a JTT + gamma substitution model. Midpoint and outgroup rooting was carried out in Figtree 1.4, with the outgroup being clades of mammalian filovirid-like NERVEs outside of the clade of extant known filovirids. Seaview 4.5.2 (Gouy, Guindon & Gascuel, 2010) was further used to explore rooting the tree “at the point in the tree that minimizes the variance of root-to-tip distances”. Protein models were fit to the resulting alignments using Partitionfinder protein (Lanfear et al., 2012). Phylograms were estimated using Bayesian MCMC using MrBayes 3.2.2 (Ronquist et al., 2012) as implemented at the CIPRES science gateway (Miller, Pfeiffer & Schwartz, 2010). Priors for MrBayes included the amino acid model fixed as JTT (Jones). The sampling frequency was every 1000 generations with the MCMC analysis continuing until the average standard deviation of split frequencies was less than 0.01. We used a burnin fraction of 0.25 and a random starting tree. Branch reliability was assessed with Bayesian posterior probability values and by approximate likelihood ratio tests (aLRT). Maximum likelihood was carried out in PhyML 3.1 (Guindon et al., 2009) as implemented in Seaview 4.5.2 with the subtree pruning and regrafting search algorithm (SPR). Phylograms were visualized in Figtree 4.1 (Rambaut, 2012) and Adobe Illustrator.

Microsynteny of NERVE insertion sites was assessed by carrying out a BLAST search with NERVEs as the queries and the annotated reference genomes of rodents as the databases. NERVE-containing segments were compared among rodents after using the progressive alignment algorithm in the Mauve (Darling, Mau & Perna, 2010) plugin of Geneious 7 (Biomatters). Patristic genetic distances (measured from branches on a gene tree) based on nucleotide alignments of filovirid-like regions in rodents and their extracted intronic or intergenic backgrounds were estimated in Seaview 4.5.2 (Gouy, Guindon & Gascuel, 2010) using the HKY distance (Hasegawa, Kishino & Yano, 1985) with site rate variation being optimized in four categories.

Results and Discussion

Significant Blast expect values were found for 50 NP-like sequences from mammalian genomes and 11 VP35-like sequences. Only one assembly per species was retained for the analysis. We detected several previously unknown filovirid-like NERVEs from rodent genomes. These included NERVEs from the Upper Galilee Mountains blind mole rat (Spalax galili), the golden hamster (Mesocricetus auratus), the prairie vole (Microtus ochrogaster), and the North American Deermouse (Peromyscus maniculatus bairdii). An additional NERVE sequence was amplified by PCR from the meadow vole (Microtus pennsylvanicus). Our cytochrome c oxidase subunit 1 mitochondrial sequence is consistent with the taxonomic identification as it yielded a 99% identity score with sequences from meadow voles (e.g., JQ350481.1). Additional known filovirid-like sequences from mouse and rat genomes and those present in EST libraries or isolated from mammalian genomes by PCR (Taylor, Leach & Bruenn, 2010; Taylor et al., 2011) were excluded because we focused on the relationships within the clade of known extant filovirids.

Both the NP-like (Fig. 1) and the VP35-like (Fig. 2) sequence phylogenies revealed a clade of cricetid rodent sequences within the clade of extant filovirids. Indeed, both genes had cricetid clades paired with ebolaviruses and cuevaviruses to the exclusion of marburgviruses. Taylor et al. (2011) had previously identified this position for the genomes of a single rodent, the striped dwarf hamster (Cricetulus barabensis griseus), but here we have found support for sequences of other cricetid rodents forming a monophyletic clade. This is the most closely related clade of endogenous mammalian genes known for filovirids. The phylogenetic positions are strongly supported by posterior probabilities and aLRTs.

Figure 1. Phylogenetic relationships of filovirid NP-like paleoviruses in mammalian genomes and amino acid sequences from extant filovirids.

Figure 1

Bayesian posterior probabilities for the extant filovirus clade greater than 0.95 are shown as black circles. The phylogeny is based on an alignment with transitive consistency scores <3 filtered. The Blue colors represent branches leading to rodent sequences. Red colors represent branches leading to extant viral sequences. Black bars represent branches leading to non-rodent mammalian sequences. Taxonomic labels indicate phylogenetic placement of sequences from specimens assigned to the given taxon.

Figure 2. Phylogenetic relationships of filovirid VP35-like paleoviruses in mammalian genomes and amino acid sequences from extant filovirids.

Figure 2

Bayesian posterior probabilities for the extant filovirus clade greater than 0.95 are shown as black circles. The phylogeny is based on an alignment with transitive consistency scores <3 filtered. Blue colors represent branches leading to rodent sequences. Red colors represent branches leading to extant viral sequences. Black bars represent branches leading to non-rodent mammalian sequences. Taxonomic labels indicate phylogenetic placement of sequences from specimens assigned to the given taxon.

The occurrences are unlikely to be assembly artifacts because the genomes in question are NCBI reference genomes with strong sequence coverage. The striped dwarf hamster (C. griseus) has independent genome assemblies that agree on the insert locations. Also, the only mammalian species in this filovirid clade are cricetid rodents, some of which have identical insertion locations in their genomes. The pattern of shared insertion among monophyletic taxa is a prediction of common ancestry rather than of assembly artifacts. Finally, we carried out PCR in the meadow vole using primers designed to flank the VP35-like region of the prairie vole (which has a genome project). The PCR reaction was positive and the sequence had strong identity to the microtine sequence from the genome assembly (Fig. 3). Excluding indels the sequence across this putative insert region showed 94% (583 nt) identity between the genome assembly of the pairie vole (M. ochrogaster) and the PCR product of the meadow vole (M. pennsylvanicus). Our partial sequence is consistent with an orthologous insert of a VP35-like sequence in the genome of the meadow vole (M. pennsylvanicus) and confirms the assembled location of this region in rodents in the 3′ intron of the Tax1-binding protein 1 (TAX1BP1) gene locus.

Figure 3. DNA sequence validation of integration for the filoviral VP35-like sequence in voles of the genus Microtus.

Figure 3

The section of the intron common to rodent introns is highlighted in gray and the proposed filovirid-like insert is highlighted in red. Sequence comparisons (colored blocks are differences) between the PCR product (black bar) for the meadow vole (M. pennsylvanicus) and the genome assembly for the prairie vole (M. ochrogaster) are shown for (A) the shared intron of rodent genomes and (B) the proposed insert containing a filovirid VP35-like sequence in genomes of cricetid rodents.

We explored the possibility of systematic error contributing to the pairing of cricetid sequences with ebolaviruses. Long branch attraction (LBA) can occur with real data even under model-based approaches that account for among-site rate variation (Anderson & Swofford, 2004; Omilian & Taylor, 2001; Taylor & Piel, 2004). In some cases, distant outgroups can play a role in LBA (Sanderson et al., 2000). It is expected that support for LBA groupings will be reduced if sites that are rapidly evolving or that lack agreement among pairwise alignments are reduced in the data. However, with the VP35-like and NP-like genes, successively filtering such sites according to the transitive consistency score either increased support for the observed cricetid/ebolavirus pairing or had no effect on support (Fig. 4). Support for the internal position eroded only when the number of sites had been reduced to less than 15% of the data for the NP gene and 34% for the VP35 gene. We also note that similar LBA conditions between the genes are lacking because the branch length patterns are reversed for the two genes. For the VP35 phylogeny, marburgviruses have the longest distance to the root among extant viruses, but for the NP gene, ebolaviruses have the longest distance to the root. Yet, in each case the cricetid sequences group with ebolaviruses and cuevaviruses. The increase in support for this clade with filtering, then, is most likely a result of culling evolutionary noise from the mammalian genes. Most of the NERVEs are pseudogenes that accumulate evolutionary noise in the form of indels and reading frame disruptions.

Figure 4. Graphs of phylogenetic support values for the branch that groups rodent sequences with ebolaviruses and cuevaviruses to the exclusion of marburgviruses.

Figure 4

(A) the NP-like region and (B) the VP35-like region. The x-axis represents the size of the alignment after culling sites according to their transitive consistency scores (TCS). Note that successive removal of the sites that most disagree among pairwise alignments fails to erode support for the branch in question until the alignment size is small. aLRTs are approximate likelihood ratio tests.

To further explore a role for outgroups in affecting the relationships of the ingroup we carried out several analyses with new alignments that omitted outgroup taxa. Every analysis for the VP35-like gene alignment grouped the cricetid sequences inside the clade of extant filovirids with strong support (Table 1). The ingroup analysis using the complete NP-like alignment indicated that cricetid sequences grouped outside of the extant filovirids. However, replacing the cricetid clade with ancestral reconstructions of this clade of pseudogenes moved the cricetid NP-like sequences internal to the extant filovirids. Ancestral reconstruction of the NERVE clade reduced gaps that may have contributed to a biased attraction of extant viral genera with fewer indels. In support of this notion, removing the most rapidly evolving sites and using ancestral reconstructions of the cricetid copies gave the same results as the outgroup-rooted sequences for both genes. So, phylogenetic analysis of ingroup sequences alone after reducing the most rapidly evolving and rapidly eroding sites (including indels) indicates that the position of the cricetid sequences within extant filovirids is unlikely to be the result of an outgroup bias.

Table 1. Exploration of rooting, taxon set, and filtering of rapidly evolving sites on the phylogenetic position of filovirid-like endogenous genes in cricetid rodents.

Likelihood scores and aLRT branch support values are given for the best topologies found. Observed ingroup topologies are shown in Newick format (brackets and commas) where E stands for ebolaviruses, L for Lloviu virus, C for cricetid filovirid-like and M for marburgvirus sequences.

Alignment Sites
remaining
Rooting ((E, L, C), M) ((E, L), (C, M)) ((E, L, M), C)
VP35 390 Outgroup, midpoint, and minimum variance −8451.3 (0.60)
NP 991 Outgroup, midpoint, and minimum variance −40584.3 (0.96)
VP35 (TCS filtered) 237 Outgroup, midpoint, and minimum variance −5360.3 (0.78)
NP (TCS filtered) 391 Outgroup, midpoint, and minimum variance −22024.7 (0.98)
VP35 ingroup 381 Minimum variance −6468.8 (1.0)
VP35 ingroup (TCS filtered) 280 Minimum variance −5061.6 (0.96)
NP ingroup 858 Minimum variance −19899.3 (1.0)
NP ingroup (TCS filtered) 487 Minimum variance −10813.9 (1.0)
VP35 ingroup ancestral NIRV 381 Minimum variance −10813.9 48 (0.99)
NP ingroup ancestral NIRV 858 Minimum variance −10834.9 (1.0)
VP35 ingroup ancestral NIRV (rapid sites filtered) 295 Minimum variance −3648.6 (1.0)
NP ingroup ancestral NIRV(rapid sites filtered) 534 Minimum variance −5692.5 (1.0)

Microsynteny was observed among a monophyletic clade of VP35-like NERVEs in cricetid rodents. The (Microtus, (Cricetulus, Mesocricetus)) grouping agrees with rodent taxonomy, with the microsynteny being apparent at the genic and the nucleotide level. Namely, these cricetid rodents share a VP35-like insert in the 3′-most intron of the Tax1-binding protein 1 (TAX1BP1) gene locus (Fig. 5). This insertion site is also identical to that of the partial VP35-like gene from the meadow vole (M. pennsylvanicus) that we amplified by PCR. TAX1BP1 is involved in the down regulation of inflammation genes (Verstrepen et al., 2011). Interestingly, both the TAX1BP1 of mammals (Parvatiyar, Barber & Harhaj, 2010) and VP35 of ebolaviruses (Basler et al., 2003; Hartman, Towner & Nichol, 2004; Hartman et al., 2008) inhibit IRF (interferon regulatory factor)3, a critical transcription factor for the initiation of viral innate immunity in mammals. The host’s inhibition of the interferon pathway is necessary to prevent the deleterious effects of inflammation. A C-terminal motif (PRACQKSLR) (Hartman, Towner & Nichol, 2004; Hartman et al., 2008) of the VP35 of ebolaviruses targets the same transcription factor to inhibit mammalian innate immunity to viruses. Unlike the more divergent VP35-like NERVEs from bats (Belyi, Levine & Skalka, 2010), the cricetid VP35-like NERVEs show evidence of conservation of the three basic residues affecting the interferon response. Specifically, two of the striped dwarf hamster (C. griseus) NERVEs and one of the ancestral reconstructions have these important residues (PRPCQKSIR). The shared insertion of a viral gene that inhibits IFR3 in a mammalian gene that plays a key role in the inhibition of IFR3 immediately suggests selective maintenance of the integration. The VP35 of Ebola virus fails to inhibit the type I interferon response in hamsters (Ebihara et al., 2013). Also, Ebola virus infections in hamsters cause downregulation of proinflammatory cytokines, while still inducing a type I interferon response (Ebihara et al., 2013). Although intron sequence variation is known to be functional in mammals (Praetorius et al., 2013), we are unaware of functional studies of the VP35-interrupted TAX1BP1 of rodents. Functional studies are needed to address the possibility that co-option of a viral interferon pathway regulator (VP35) contributes to the immune response of hamsters.

Figure 5. Cartoons comparing an orthologous genomic region among rodents that (A) lack a filovirid VP35-like insert; and (B) possess an orthologous filovirid VP35-like insert.

Figure 5

Closeups of the upstream and downstream putative insertion boundaries are shown revealing microsynteny at the nucleotide level (the blocks of differing colors are nucleotides). A gray bar represents intronic sequence and a red bar represents the putative filovirid VP35-like insert region.

Microsynteny was also observed for the NP-like genes of cricetid rodents (Fig. 6). The oldest case of microsynteny involved inserts in the same intergenic region between the gliomedin and cytochrome P450 19A1-like loci shared by rodents of the following genera: Microtus, Mesocricetus, and Cricetulus. Hamsters of the genera Cricetulus and Mesocricetus had very high similarity of flanking sequences (Fig. 6) indicating a shared insert site. The insert in the vole genome was just under 10 kbp from the hamster insert site. Possible causes of the differing insert location could be a modest rearrangement, an assembly artifact, an independent insert in the same intergenic region, or tandem duplication followed by loss of the original insert. We sequenced across the putative insert boundary using DNA from the meadow vole (M. pennsylvanicus) as a template to assess the assembly. The assembly and location of the insert was verified by the sequence, which had at least 91% nucleotide sequence identity with the assembly of the pairie vole (M. ochrogaster). We then compared the pairwise sequence divergences of putatively orthologous NERVEs, under the expectation that orthologous NERVEs would evolve at a similar or slower rate than the background intron or intergenic region of their insertion. The NP-like insert of the vole evolved at about twice the rates of the intronic, intergenic and other NERVE comparisons, suggesting that the vole insert may not be orthologous to the hamster insert (Fig. 7). A dotplot comparison of this sequence revealed no evidence for recombination. We conclude that there is strong evidence for the orthology of NP-like inserts for hamsters of the genera Cricetulus and Mesocricetus in the intergenic region between gliomedin and cytochrome P450 19A1-like loci. However, the NP-like insert of the vole may be an independent insert or a paralog. If true, then gene order evidence alone for mammals may be too crude a measure to evaluate orthology for NERVEs.

Figure 6. Cartoon comparing the aligned genomic regions of cricetid rodents that contain a putative orthologous filovirid NP-like sequence.

Figure 6

For this genomic segment, the assembly from the North American deermouse (Peromyscus manipulates bairdi) lacks a detectable insert. Specimens of Microtus ochrogaster, Cricetulus griseus, and Mesocricetus auratus share an insert in the same intergenic region. However, the insert of the prairie vole (M. ochrogaster) is about 10 kbp upstream of the shared insert of members of the other species. The identity bar reveals strong sequence similarity in this intergenic region and flanking the insert site of the striped dwarf hamster (C. griseus) and the golden hamster (M. auratus). Most non-identity is due to indels in the intergenic region.

Figure 7. A graph comparing genetic distance (patristic) among putatively orthologous filovirid-like gene inserts in cricetid rodent genomes.

Figure 7

(A) compares pairwise distances for the VP35-like inserts with those based on the intronic background. (B) compares distances of the NP-like inserts with those of their intergenic sequence background.

The combination of monophyly, phylogenetic agreement with rodent subfamilies, topological agreement between genes, and microsynteny indicate that the cricetid rodent VP35-like insertion was present in the common ancestor of hamsters and voles and the NP-like insertion was present in the common ancestor of hamsters of the genera Cricetulus and Mesocricetus. It may be argued that integrated viruses evolve by a different mode than do extant viruses or that integrated viruses are too distant from extant viruses to be biologically relevant. However, strong purifying selection has been demonstrated to occur in both RNA viruses (Wertheim & Kosakovsky Pond, 2011) and in their integrated eukaryotic versions (Taylor et al., 2011). Some of these show both RNA expression products and purifying selection (Ballinger et al., 2014). In at least one case an integrated RNA viral gene produces a protein product (Taylor et al., 2013). Under strong purifying selection, similarity at the amino acid level can be preserved despite differences in the mutation rates for many millions of years. Our analysis of the transient consistency scores indicates that if differing modes of evolution exist, they have little effect on the major relationships of the sequences from inferred amino acids. Given our evidence for orthology of the ebolavirus-like genes, we can provide a minimum estimate of the age of the insert as the age of the common ancestor of hamsters and voles. Molecular clock estimates using fossil calibrations agree that hamsters and voles had a common ancestor in the Miocene (Abramson et al., 2009; Fabre et al., 2012; Horn et al., 2011; Jansa, Barker & Heaney, 2006; Parada et al., 2013; Steppan, Adkins & Anderson, 2004). Indeed, these studies indicate a divergence date of about 18 MY ago (error bars span much of the Miocene). The common ancestor of hamsters of the genera Cricetulus and Mesocricetus has also been estimated in the Miocene at 7–12 MY ago (Neumann et al., 2006). If the phylogenetic placement of the cricetid NERVEs within known extant filovirids is correct, then the divergence of marburgviruses from other filovirids (ebolaviruses and cuevaviruses) must also be at least as old as the Miocene. This age is orders of magnitude older than previously thought and will likely aid in understanding the comparative biology of filovirids. The differing genome architecture, transcriptional editing, and immunological reactivity of cuevavirions, ebolavirions, and marburgvirions (Kuhn et al., 2010) had a much longer time to evolve than the rise of agriculture. Our results provide strong evidence that molecular clock based estimates for extant filovirids have been severely underestimated as predicted by the saturation problem posed by Wertheim & Kosakovsky Pond (2011). However, our methods also provide a solution to this problem—dating of orthologous paleoviruses.

Small rodents appear overrepresented amongst the NERVEs of filovirids. The analysis here further bolsters this pattern with the mouse-related clade being represented in each of the three deep clades of filovirids (fossil and extant). Indeed, the genome of the North American deermouse (P. maniculatus bairdii) has NERVEs from all three major filovirid clades, suggesting multiple historical integrations of divergent filoviral lineages. It is unknown why the genomes of mouse-like rodents appear overrepresented in the list of mammals with filovirid-like NERVEs. Because NERVEs originate as rare macromutations, nearly all inserts will be lost by failing to achieve integration into germ-line cells. Those that become endogenous will most likely disappear after genetic drift or selection. Casual infection alone, then, is unlikely to result in the fixation of numerous long-lived integrations in rodents (Johnson, 2010). The evidence of repeated genomic and evolutionary interactions of filovirids (including the extant clade of filovirids) with cricetid rodents should be considered when comparing the differing immunological responses among mammals to infections with modern filovirids (Wahl-Jensen et al., 2012).

Cricetid rodents have captured orthologous NP and VP35-like gene segments from filovirids that group phylogenetically within the extant filovirids. The sharing of these genomic sections provides the first evidence that extant known filovirids have been diverging since the Miocene. The results show that fossil copies of RNA viruses can provide minimum estimates of divergence in a divergence range that is recalcitrant to present molecular clock methods with extant viruses. Our results also bolster evidence that mouse-like rodents have had repeated genomic interactions with filovirids. Our finding of a filoviral insert that interrupts an important regulator of the innate antiviral response also informs hypotheses regarding the possible biological significance to the mammalian host of such inserts.

Supplemental Information

Supplemental Information 1. Multiple sequence alignment files of filovirus-like sequences in mammals in fast format.

NP-like and VP35-like alignment files from mammals and viruses. Stop codons are treated as gaps.

DOI: 10.7717/peerj.556/supp-1

Acknowledgments

We thank Dr. Solon Morse for the tail clipping of the meadow vole.

Funding Statement

The author declares there was no funding for this work.

Additional Information and Declarations

Competing Interests

Jeremy Bruenn is an Academic Editor for PeerJ. The authors declare there are no competing interests.

Author Contributions

Derek J. Taylor conceived and designed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Matthew J. Ballinger and Laura E. Hanzly conceived and designed the experiments, performed the experiments, analyzed the data, reviewed drafts of the paper.

Jack J. Zhan performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables, reviewed drafts of the paper.

Jeremy A. Bruenn analyzed the data, reviewed drafts of the paper.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

GenBank KM189810KM189812.

References

  • Abramson et al. (2009).Abramson I, Lebedev S, Tesakov S, Bannikova A. Supraspecies relationships in the subfamily Arvicolinae (Rodentia, Cricetidae): an unexpected result of nuclear gene analysis. Molecular Biology. 2009;43:834–846. doi: 10.1134/S0026893309050148. [DOI] [PubMed] [Google Scholar]
  • Anderson & Swofford (2004).Anderson FE, Swofford DL. Should we be worried about long-branch attraction in real data sets? Investigations using metazoan 18S rDNA. Molecular Phylogenetics and Evolution. 2004;33:440–451. doi: 10.1016/j.ympev.2004.06.015. [DOI] [PubMed] [Google Scholar]
  • Ballinger et al. (2014).Ballinger MJ, Bruenn JA, Hay J, Czechowski D, Taylor DJ. Discovery and evolution of bunyavirids in arctic phantom midges and ancient bunyavirid-like sequences in insect genomes. Journal of Virology. 2014;88:8783–8794. doi: 10.1128/JVI.00531-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Basler et al. (2003).Basler CF, Mikulasova A, Martinez-Sobrido L, Paragas J, Mühlberger E, Bray M, Klenk H-D, Palese P, García-Sastre A. The Ebola virus VP35 protein inhibits activation of interferon regulatory factor 3. Journal of Virology. 2003;77:7945–7956. doi: 10.1128/JVI.77.14.7945-7956.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Belyi, Levine & Skalka (2010).Belyi VA, Levine AJ, Skalka AM. Unexpected inheritance: multiple integrations of ancient bornavirus and ebolavirus/marburgvirus sequences in vertebrate genomes. PLoS Pathogens. 2010;6:e556. doi: 10.1371/journal.ppat.1001030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Carroll et al. (2013).Carroll SA, Towner JS, Sealy TK, McMullan LK, Khristova ML, Burt FJ, Swanepoel R, Rollin PE, Nichol ST. Molecular evolution of viruses of the family Filoviridae based on 97 whole-genome sequences. Journal of Virology. 2013;87:2608–2616. doi: 10.1128/JVI.03118-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Chang, Di Tommaso & Notredame (2014).Chang J-M, Di Tommaso P, Notredame C. TCS: a new multiple sequence alignment reliability measure to estimate alignment accuracy and improve phylogenetic tree reconstruction. Molecular Biology and Evolution. 2014;31:1625–1637. doi: 10.1093/molbev/msu117. [DOI] [PubMed] [Google Scholar]
  • Crochu et al. (2004).Crochu S, Cook S, Attoui H, Charrel RN, De Chesse R, Belhouchet M, Lemasson J-J, de Micco P, de Lamballerie X. Sequences of flavivirus-related RNA viruses persist in DNA form integrated in the genome of Aedes spp. mosquitoes. Journal of General Virology. 2004;85:1971–1980. doi: 10.1099/vir.0.79850-0. [DOI] [PubMed] [Google Scholar]
  • Darling, Mau & Perna (2010).Darling AE, Mau B, Perna NT. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS ONE. 2010;5:e556. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Delport et al. (2010).Delport W, Poon AF, Frost SD, Pond SLK. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26:2455–2457. doi: 10.1093/bioinformatics/btq429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Duchêne, Holmes & Ho (2014).Duchêne S, Holmes EC, Ho SY. Analyses of evolutionary dynamics in viruses are hindered by a time-dependent bias in rate estimates. Proceedings of the Royal Society B: Biological Sciences. 2014;281:e556. doi: 10.1098/rspb.2014.0732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ebihara et al. (2013).Ebihara H, Zivcec M, Gardner D, Falzarano D, LaCasse R, Rosenke R, Long D, Haddock E, Fischer E, Kawaoka Y, Feldmann H. A Syrian golden hamster model recapitulating ebola hemorrhagic fever. Journal of Infectious Diseases. 2013;207:306–318. doi: 10.1093/infdis/jis626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fabre et al. (2012).Fabre PH, Hautier L, Dimitrov D, Douzery EJ. A glimpse on the pattern of rodent diversification: a phylogenetic approach. BMC Evolutionary Biology. 2012;12:88. doi: 10.1186/1471-2148-12-88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gilbert & Feschotte (2010).Gilbert C, Feschotte C. Genomic fossils calibrate the long-term evolution of hepadnaviruses. PLoS Biology. 2010;8:e556. doi: 10.1371/journal.pbio.1000495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gouy, Guindon & Gascuel (2010).Gouy M, Guindon S, Gascuel O. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Molecular Biology and Evolution. 2010;27:221–224. doi: 10.1093/molbev/msp259. [DOI] [PubMed] [Google Scholar]
  • Guindon et al. (2009).Guindon S, Delsuc F, Dufayard J-F, Gascuel O. Estimating maximum likelihood phylogenies with PhyML. Methods in Molecular Biology. 2009;537:113–137. doi: 10.1007/978-1-59745-251-9_6. [DOI] [PubMed] [Google Scholar]
  • Hartman et al. (2008).Hartman AL, Bird BH, Towner JS, Antoniadou Z-A, Zaki SR, Nichol ST. Inhibition of IRF-3 activation by VP35 is critical for the high level of virulence of Ebola virus. Journal of Virology. 2008;82:2699–2704. doi: 10.1128/JVI.02344-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Hartman, Towner & Nichol (2004).Hartman AL, Towner JS, Nichol ST. A C-terminal basic amino acid motif of Zaire ebolavirus VP35 is essential for type I interferon antagonism and displays high identity with the RNA-binding domain of another interferon antagonist, the NS1 protein of influenza A virus. Virology. 2004;328:177–184. doi: 10.1016/j.virol.2004.07.006. [DOI] [PubMed] [Google Scholar]
  • Hasegawa, Kishino & Yano (1985).Hasegawa M, Kishino H, Yano T-A. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution. 1985;22:160–174. doi: 10.1007/BF02101694. [DOI] [PubMed] [Google Scholar]
  • Hicks & Duffy (2014).Hicks AL, Duffy S. Cell tropism predicts long-term nucleotide substitution rates of mammalian RNA viruses. PLoS Pathogens. 2014;10:e556. doi: 10.1371/journal.ppat.1003838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Holmes (2003).Holmes EC. Molecular clocks and the puzzle of RNA virus origins. Journal of Virology. 2003;77:3893–3897. doi: 10.1128/JVI.77.7.3893-3897.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Horie et al. (2010).Horie M, Honda T, Suzuki Y, Kobayashi Y, Daito T, Oshida T, Ikuta K, Jern P, Gojobori T, Coffin JM. Endogenous non-retroviral RNA virus elements in mammalian genomes. Nature. 2010;463:84–87. doi: 10.1038/nature08695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Horn et al. (2011).Horn S, Durka W, Wolf R, Ermala A, Stubbe A, Stubbe M, Hofreiter M. Mitochondrial genomes reveal slow rates of molecular evolution and the timing of speciation in beavers (Castor), one of the largest rodent species. PLoS ONE. 2011;6:e556. doi: 10.1371/journal.pone.0014622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jansa, Barker & Heaney (2006).Jansa SA, Barker FK, Heaney LR. The pattern and timing of diversification of Philippine endemic rodents: evidence from mitochondrial and nuclear gene sequences. Systematic Biology. 2006;55:73–88. doi: 10.1080/10635150500431254. [DOI] [PubMed] [Google Scholar]
  • Johnson (2010).Johnson WE. Endless forms most viral. PLoS Genetics. 2010;6:e556. doi: 10.1371/journal.pgen.1001210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Katoh & Standley (2014).Katoh K, Standley DM. MAFFT: iterative refinement and additional methods. Methods in Molecular Biology. 2014;1079:131–146. doi: 10.1007/978-1-62703-646-7_8. [DOI] [PubMed] [Google Scholar]
  • Katzourakis & Gifford (2010).Katzourakis A, Gifford RJ. Endogenous viral elements in animal genomes. PLoS Genetics. 2010;6:e556. doi: 10.1371/journal.pgen.1001191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Katzourakis et al. (2007).Katzourakis A, Tristem M, Pybus OG, Gifford RJ. Discovery and analysis of the first endogenous lentivirus. Proceedings of the National Academy of Sciences of the United States of America. 2007;104:6261–6265. doi: 10.1073/pnas.0700471104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kuhn et al. (2010).Kuhn JH, Becker S, Ebihara H, Geisbert TW, Johnson KM, Kawaoka Y, Lipkin WI, Negredo AI, Netesov SV, Nichol ST. Proposal for a revised taxonomy of the family Filoviridae: classification, names of taxa and viruses, and virus abbreviations. Archives of Virology. 2010;155:2083–2103. doi: 10.1007/s00705-010-0814-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lanfear et al. (2012).Lanfear R, Calcott B, Ho SY, Guindon S. PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Molecular Biology and Evolution. 2012;29:1695–1701. doi: 10.1093/molbev/mss020. [DOI] [PubMed] [Google Scholar]
  • Liu et al. (2010).Liu H, Fu Y, Jiang D, Li G, Xie J, Cheng J, Peng Y, Ghabrial SA, Yi X. Widespread horizontal gene transfer from double-stranded RNA viruses to eukaryotic nuclear genomes. Journal of Virology. 2010;84:11876–11887. doi: 10.1128/JVI.00955-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Miller, Pfeiffer & Schwartz (2010).Miller MA, Pfeiffer W, Schwartz T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. Gateway Computing Environments Workshop (GCE), 2010; 2010. pp. 1–8. [Google Scholar]
  • Negredo et al. (2011).Negredo A, Palacios G, Vázquez-Morón S, González F, Dopazo H, Molero F, Juste J, Quetglas J, Savji N, de la Cruz Martínez M. Discovery of an ebolavirus-like filovirid in Europe. PLoS Pathogens. 2011;7:e556. doi: 10.1371/journal.ppat.1002304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Neumann et al. (2006).Neumann K, Michaux J, Lebedev V, Yigit N, Colak E, Ivanova N, Poltoraus A, Surov A, Markov G, Maak S. Molecular phylogeny of the Cricetinae subfamily based on the mitochondrial cytochrome b and 12S rRNA genes and the nuclear vWF gene. Molecular Phylogenetics and Evolution. 2006;39:135–148. doi: 10.1016/j.ympev.2006.01.010. [DOI] [PubMed] [Google Scholar]
  • Omilian & Taylor (2001).Omilian AR, Taylor DJ. Rate acceleration and long-branch attraction in a conserved gene of cryptic daphniid (Crustacea) species. Molecular Biology and Evolution. 2001;18:2201–2212. doi: 10.1093/oxfordjournals.molbev.a003767. [DOI] [PubMed] [Google Scholar]
  • Parada et al. (2013).Parada A, Pardiñas UF, Salazar-Bravo J, Delía G, Palma RE. Dating an impressive Neotropical radiation: molecular time estimates for the Sigmodontinae (Rodentia) provide insights into its historical biogeography. Molecular Phylogenetics and Evolution. 2013;66:960–968. doi: 10.1016/j.ympev.2012.12.001. [DOI] [PubMed] [Google Scholar]
  • Parvatiyar, Barber & Harhaj (2010).Parvatiyar K, Barber GN, Harhaj EW. TAX1BP1 and A20 inhibit antiviral signaling by targeting TBK1-IKKi kinases. Journal of Biological Chemistry. 2010;285:14999–15009. doi: 10.1074/jbc.M110.109819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Patel, Emerman & Malik (2011).Patel MR, Emerman M, Malik HS. Paleovirology–ghosts and gifts of viruses past. Current Opinion in Virology. 2011;1:304–309. doi: 10.1016/j.coviro.2011.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Pearson (2004).Pearson W. Finding protein and nucleotide similarities with FASTA. Current Protocols in Bioinformatics. 2004:3.9.1–3.9.23. doi: 10.1002/0471250953.bi0309s04. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Praetorius et al. (2013).Praetorius C, Grill C, Stacey SN, Metcalf AM, Gorkin DU, Robinson KC, Van Otterloo E, Kim RS, Bergsteinsdottir K, Ogmundsdottir MH. A polymorphism in IRF4 affects human pigmentation through a tyrosinase-dependent MITF/TFAP2A pathway. Cell. 2013;155:1022–1033. doi: 10.1016/j.cell.2013.10.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Rambaut (2012).Rambaut A. FigTree v1. 4. University of Edinburgh, Edinburgh, UK; 2012. Available at http://tree bio ed ac uk/software/figtree . [Google Scholar]
  • Ronquist et al. (2012).Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic Biology. 2012;61:539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sanderson et al. (2000).Sanderson MJ, Wojciechowski MF, Hu J-M, Khan TS, Brady SG. Error, bias, and long-branch attraction in data for two chloroplast photosystem genes in seed plants. Molecular Biology and Evolution. 2000;17:782–797. doi: 10.1093/oxfordjournals.molbev.a026357. [DOI] [PubMed] [Google Scholar]
  • Sharp & Simmonds (2011).Sharp PM, Simmonds P. Evaluating the evidence for virus/host co-evolution. Current Opinion in Virology. 2011;1:436–441. doi: 10.1016/j.coviro.2011.10.018. [DOI] [PubMed] [Google Scholar]
  • Steppan, Adkins & Anderson (2004).Steppan SJ, Adkins RM, Anderson J. Phylogeny and divergence-date estimates of rapid radiations in muroid rodents based on multiple nuclear genes. Systematic Biology. 2004;53:533–553. doi: 10.1080/10635150490468701. [DOI] [PubMed] [Google Scholar]
  • Suzuki & Gojobori (1997).Suzuki Y, Gojobori T. The origin and evolution of Ebola and Marburg viruses. Molecular Biology and Evolution. 1997;14:800–806. doi: 10.1093/oxfordjournals.molbev.a025820. [DOI] [PubMed] [Google Scholar]
  • Tanne & Sela (2005).Tanne E, Sela I. Occurrence of a DNA sequence of a non-retro RNA virus in a host plant genome and its expression: evidence for recombination between viral and host RNAs. Virology. 2005;332:614–622. doi: 10.1016/j.virol.2004.11.007. [DOI] [PubMed] [Google Scholar]
  • Taylor et al. (2013).Taylor DJ, Ballinger MJ, Bowman SM, Bruenn JA. Virus-host co-evolution under a modified nuclear genetic code. PeerJ. 2013;1:e556. doi: 10.7717/peerj.50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taylor & Bruenn (2009).Taylor DJ, Bruenn J. The evolution of novel fungal genes from non-retroviral RNA viruses. BMC Biology. 2009;7:88. doi: 10.1186/1741-7007-7-88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taylor et al. (2011).Taylor DJ, Dittmar K, Ballinger MJ, Bruenn JA. Evolutionary maintenance of filovirid-like genes in bat genomes. BMC Evolutionary Biology. 2011;11:336. doi: 10.1186/1471-2148-11-336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taylor, Leach & Bruenn (2010).Taylor DJ, Leach RW, Bruenn J. Filovirids are ancient and integrated into mammalian genomes. BMC Evolutionary Biology. 2010;10:193. doi: 10.1186/1471-2148-10-193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taylor & Piel (2004).Taylor DJ, Piel WH. An assessment of accuracy, error, and conflict with support values from genome-scale phylogenetic data. Molecular Biology and Evolution. 2004;21:1534–1537. doi: 10.1093/molbev/msh156. [DOI] [PubMed] [Google Scholar]
  • Verstrepen et al. (2011).Verstrepen L, Verhelst K, Carpentier I, Beyaert R. TAX1BP1, a ubiquitin-binding adaptor protein in innate immunity and beyond. Trends in Biochemical Sciences. 2011;36:347–354. doi: 10.1016/j.tibs.2011.03.004. [DOI] [PubMed] [Google Scholar]
  • Wahl-Jensen et al. (2012).Wahl-Jensen V, Bollinger L, Safronetz D, de Kok-Mercado F, Scott DP, Ebihara H. Use of the Syrian hamster as a new model of ebola virus disease and other viral hemorrhagic fevers. Viruses. 2012;4:3754–3784. doi: 10.3390/v4123754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wertheim & Kosakovsky Pond (2011).Wertheim JO, Kosakovsky Pond SL. Purifying selection can obscure the ancient age of viral lineages. Molecular Biology and Evolution. 2011;28:3355–3365. doi: 10.1093/molbev/msr170. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information 1. Multiple sequence alignment files of filovirus-like sequences in mammals in fast format.

NP-like and VP35-like alignment files from mammals and viruses. Stop codons are treated as gaps.

DOI: 10.7717/peerj.556/supp-1

Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES