Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2017 Aug;25:23–27. doi: 10.1016/j.coviro.2017.06.006

Origin of the retroviruses: when, where, and how?

Alexander Hayward 1
PMCID: PMC5962544  PMID: 28672160

Highlights

  • Retroviruses are a virus family of considerable medical and veterinary importance.

  • Until recently, very little was known about deep retroviral origins.

  • New research supports a marine origin of retroviruses, ∼460–550 million years ago.

  • The evolutionary events leading to the origin of retroviruses remain obscure.

  • Improved understanding of Metaviridae diversity and evolution are required for this.

Abstract

Retroviruses are a virus family of considerable medical and veterinary importance. Additionally, it is now clear that endogenous retroviruses (ERVs) comprise significant portions of vertebrate genomes. Until recently, very little was known about the deep evolutionary origins of retroviruses. However, advances in genomics and bioinformatics have opened the way for great strides in understanding. Recent research employing a wide variety of bioinformatic approaches has demonstrated that retroviruses evolved during the early Palaeozoic Era, between 460 and 550 million years ago, providing the oldest inferred date estimate for any virus group. This finding presents an important framework to investigate the evolutionary transitions that led to the emergence of the retroviruses, offering potential insights into the infectious origins of a major group of pathogenic viruses.


Current Opinion in Virology 2017, 25:23–27

This review comes from a themed issue on Paleovirology

Edited by Aris Katzourakis

For a complete overview see the Issue and the Editorial

Available online 30th June 2017

http://dx.doi.org/10.1016/j.coviro.2017.06.006

1879-6257/© 2017 The Author. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Introduction

The origin of species was famously described as ‘that mystery of mysteries’ by the 19th Century scientist Sir Charles Lyell. Since Charles Darwin presented his theory of natural selection, thereby solving the mystery, much progress has been made in understanding speciation. But a major challenge remains elucidating the evolutionary transitions leading to the emergence of whole new groups of taxa. This is especially so for viruses, with their vast diversity and multiple independent origins. Unravelling viral beginnings is particularly challenging because viruses typically have small genomes, rapid rates of sequence evolution, and scant fossil remains. Yet, over recent decades it has become apparent that viruses occasionally leave a historical record in their host's genomes in the form of endogenous viral elements (EVEs) [1, 2], which can provide valuable information for discerning their histories. As a consequence of their lifecycle, this record is particularly rich for retroviruses. To replicate, a retrovirus must integrate a copy of its genome into the host's genome as a provirus. If this occurs in a host germ cell, the provirus may be inherited by the host's progeny and down the generations as an endogenous retrovirus (ERV). All vertebrate genomes examined so far contain ERVs, providing a wealth of opportunities to study retroviral ecology and evolution, including the origin of this medically and veterinarily important viral group [3]. In combination with modern genomic technologies, analysis of ERV sequences is facilitating rapid advances in the understanding of retroviral evolution. Here, we review recent progress in discerning retroviral origins, outlining key advances and highlighting outstanding research questions (Box 1).

Box 1. Retroviruses.

Retroviruses form a family of enveloped RNA viruses (the Retroviridae), entirely restricted to vertebrate hosts [4]. Exogenous retroviruses (XRVs) are transmitted horizontally among hosts. The number of described XRVs is small, and is distributed across 7 genera, containing just 53 described species [5]. Meanwhile, ERVs, which are inherited vertically in the genomes of their hosts, represent a great wealth of retroviral sequence diversity that has accumulated over millions of years of vertebrate–retrovirus interactions. ERVs accumulate mutations at the background rate of sequence mutation in their host's genome, gradually degrading until their sequences are no longer recognisable. ERV taxonomy is not covered by the guidelines of the International Committee on Taxonomy of Viruses (ICTV). Consequently, here we refer to ERVs as retrovirus‘-like’, with reference to their placement in phylogenetic analyses [3•, 6].

The when, where, and how of retrovirus origins

When did retroviruses originate?

Several methods have been employed to estimate the age of retroviral lineages over evolutionary timescales. These approaches can be divided broadly into those that reconstruct ERV integration histories, and those that analyse host–virus co-phylogenetic relationships. We discuss the relevance of these for examining ancient retroviral origins below, providing a brief overview of recent developments of particular significance.

When a retrovirus undergoes reverse transcription to produce viral DNA, identical sequences called long terminal repeats (LTRs) are generated at both ends of the viral genome [4]. After integration, LTRs diverge independently at a rate approximately equal to the background (neutral) rate of host genome sequence evolution [7]. Consequently, measures based on divergence between 5′ and 3′ LTRs can be used to calculate approximate proviral integration dates by adopting the formula t = k/2N, where t is time, k is LTR divergence in substitutions per site, and N is the neutral rate of evolution for the host genome. This approach has been widely applied to estimate integration dates for a range of ERV lineages, from diverse host genomes, including primates [8, 9], bats [10], amphibians [11, 12••] and fish [12••]. However, gene conversion and recombination between LTRs can mislead results [13]. Further, while LTR divergence estimates can be accurate over relatively recent timescales, poor data for neutral mutation rates in ancestral host genomes and the cumulative effects of sequence erosion over ancient time periods limit the utility of this method for addressing the ultimate origins of retroviruses.

An alternative approach to estimate the age of retroviral lineages is to screen host genomes for orthologous ERVs (those shared due to integration in a common ancestor). Since multiple integrations and fixations of ERVs at the exact same location are extremely unlikely, a shared ERV locus is strong evidence of inheritance from the common ancestor where the original germ line integration occurred. Thus, identifying the node in host phylogeny whereby descendent taxa share an orthologous ERV can provide a minimum integration estimate. Using this approach, analysis of lenti-like ERVs orthologous to RELIK (the first identified endogenous lentiretrovirus [14]) from across the order Lagomorpha, provided a minimum age of ∼12 million years for integration of the ancestral element [15]. Similarly, analyses of ERV-L elements orthologous across placental mammals placed integration of an ancestral element at 104–110 mya [16••]. However, ‘look-back time’ using orthologues is limited by computational and practical difficulties associated with recovering ERV and host flanking sequences [16••], and no older orthologous ERVs have yet been identified. A similar approach, employing host genome duplication events instead of orthology, can also be used to date retroviral lineages [17], but is subject to the same limitations.

The age of ancient retroviral lineages can also be estimated by considering host–virus coevolutionary relationships. Since host switching is frequent among many retroviral lineages [3•, 18], such analyses are largely restricted to spumaretroviruses (foamy viruses), which show relatively few cross-species transmission events [19••, 20]. Additionally, spumaretroviruses are the most basal currently recognised retroviral genus [3], providing a suitable group with which to explore retroviral origins. Incorporation of ancient spuma-like ERVs from sloth genomes (SloEFVs) and exogenous spumaretroviruses isolated from diverse mammalian hosts in several analyses have supported strict host–virus cophylogeny, establishing that spumaviruses are at least 100 million years old [17]. More recently, a study that included ERVs screened from a wide diversity of host taxa from primitive vertebrate lineages, provided an estimated date of origin for spumaretroviruses/spuma-like viruses of ∼455–473 mya during the Ordovician [12••]. This suggests that retroviruses emerged alongside their vertebrate hosts during the early Palaeozoic Era, ∼460–550 mya [12••]. Such an ancient postulated origin constitutes an exciting development in the study of retrovirology, and virology more generally, providing the oldest date directly inferred for any viral group [12••].

Where did retroviruses originate?

A retroviral origin during the Ordovician period or earlier means that retroviruses must have evolved within the marine environment [12••]. Vertebrates were wholly restricted to the sea during the Ordovician, and the first tetrapods did not evolve until the late Devonian. Even considering the possibility that retroviruses evolved outside of the Vertebrata, life on land during the Ordovician was extremely limited, making it unlikely that retroviral origins lie anywhere other than the World's ancient oceans.

It is currently unclear in which host lineage retroviruses originated. The more basal branches of retrovirus phylogeny remain poorly resolved [3•, 12••]. However, given broad congruence between phylogeny for spumaretroviruses and their hosts, and date estimates calculated using rates of viral evolution [12••], it is probable that retroviruses were already present before the most recent common ancestor of the chondrichthyans (sharks, rays and chimaeras) and osteichthyans (bony fishes and tetrapods). A small number of ERVs that form a lineage lying more basal to the spumaretroviruses are present in the genome of the sea lamprey, which belongs to a jawless fish lineage estimated to have evolved ∼500 mya [3]. However, these ERVs may originate from a host-switching event rather than being modern descendants of early retroviruses [3].

How did retroviruses originate?

Retroviruses bear much similarity to transposable elements in the LTR retrotransposon group. For example, both groups share a similar genomic content, including LTRs and group specific antigen (gag) and polymerase (pol) genes. Indeed, the family Retroviridae is generally classified within the LTR retrotransposon group, alongside Metaviridae (Ty3-gypsy-like elements), Pseudoviridae (Ty1-copia-like elements), and Bel-Pao elements [5, 21, 22]. Retroviral sequences share most similarity with Ty3-gypsy-like elements, and early phylogenetic analyses suggested that Metaviridae is the group most closely related to Retroviridae [23•, 24, 25•], which remains the prevailing view [5, 26]. A commonly stated difference between Retroviridae and Metaviridae is that retroviral elements possess an envelope gene (env, coding for transmembrane and host receptor binding proteins that facilitate viral transmission and infection [4]). However, this distinction is somewhat blurry, since ERVs often loose env [27], essentially making them analogous to LTR-retrotransposons. Meanwhile, multiple evolutionary transitions are apparent in several Ty3-gypsy-like lineages, whereby a third open reading frame coding for an env-like gene allows Ty3-gypsy-like elements to act similarly to infectious retroviruses [28, 29]. The retroviral env gene appears to have a different origin to env-like genes examined from the Metaviridae, which are proposed to have diverse origins [28, 29]. Specifically, it is hypothesised that there have been at least three acquisitions of an envelope-like gene from disparate viral sources in Metaviridae, with: insect errantiviruses (i.e. gypsy and closely related lineages) acquiring an envelope-like gene from a class of insect baculoviruses (double-stranded DNA viruses with no RNA intermediate); Cer elements acquiring an envelope-like gene from phleboviruses (a genus of negative-sense single-stranded RNA viruses in the family Bunyaviridae); and Tas elements potentially acquiring an envelope-like gene from herpesviruses (double-stranded DNA viruses with no RNA intermediate). The realisation that some lineages in Metaviridae include members with envelope-like genes has led to some murkiness in nomenclature, with Ty3-gypsy-like elements containing env-like genes sometimes being referred to as retroviruses [29, 30, 31, 32, 33, 34]. Meanwhile, the pararetroviruses, although closely related to LTR retrotransposons [35], are reverse transcribing viruses that possess a viral genome consisting of a circular double stranded DNA molecule, that do not require integration into the host genome for replication. The pararetroviruses include the enveloped Hepadnaviridae, which infect vertebrates (e.g. Hepatitis B virus), and the Caulimoviridae, which infect plants (e.g. Cauliflower mosaic virus, CaMV).

The close relationship between Metaviridae and Retroviridae presents a kind of evolutionary chicken-and-egg scenario over which came first, since Ty3-gypsy-like elements may have evolved from an exogenous viral ancestor and lost their env gene (with subsequent independent gains of env-like genes in certain lineages), or retroviruses may have evolved from an LTR retrotransposon ancestor and gained an env gene (Figure 1). Given the extremely deep timescales involved, so far it has not been possible to explicitly resolve the question using outgroup comparison to root the Metaviridae-Retroviridae association (e.g. using members of the Pseudoviridae). Thus, considering the ultimate origins of the retroviruses, the above framework presents three alternative hypotheses: (i) Retroviridae forms a sister group to the Metaviridae, (ii) Retroviridae originates from a lineage within the Metaviridae (i.e. Metaviridae is paraphyletic), or, (iii) Metaviridae originates from a lineage within the Retroviridae (i.e. Retroviridae is paraphyletic) (Figure 2). Retroviridae is entirely restricted to vertebrate hosts, whereas Metaviridae has a wide range of hosts from across the Eukaryota [5]. Consequently, it is tempting to speculate that Retroviridae originates from a lineage within the Metaviridae, that flourished alongside the evolution of its vertebrate hosts in early Palaeozoic seas. However, the situation could be yet more complicated. For example, some authors have suggested that Retroviridae is polyphyletic, comprising of at least three separate lineages that each share a closer relationship with a lineage of Ty3-gypsy-like element [36].

Figure 1.

Figure 1

Which came first, the virus or the retrotransposon? LTR transposons may have evolved from a viral ancestor by loosing an envelope gene (with subsequent gains of env-like genes in some cases) (a). Alternatively, Retroviruses may have evolved from an LTR transposon ancestor by gaining an envelope gene (b).

Figure 2.

Figure 2

Three key hypotheses to elucidate the ultimate evolutionary origins of Retroviridae: (a) Retroviridae is sister to Metaviridae, (b) Retroviridae originates from a paraphyletic Metaviridae, or (c) Metaviridae originates from a paraphyletic Retroviridae.

A key limitation to resolving the origin of retroviruses is that testing among competing hypotheses requires a deep-time phylogenetic approach, at the very limit of existing methods of analysis. Current approaches to estimating retroviral phylogeny typically employ alignments based on a contiguous section of the pol gene. While these methods work well for estimating retroviral phylogeny over relatively recent timescales and among closely related taxa, the rapid rate of retrovirus sequence evolution generally means that they do not perform well over deep evolutionary timescales or diverse sets of taxa. Recently, an alternative approach utilising multiple short motifs from across the retroviral genome was developed (with the exclusion of env, which evolves too rapidly to be useful at deep timescales), and utilised to infer broad-scale evolutionary patterns across retroviral diversity [3•, 18, 37]. By focussing on short, slowly evolving sequences associated with essential retroviral functions, such as the active sites of core enzymes, this approach offers a means of reducing noise and maximising signal in phylogenetic datasets. It is likely that similar approaches may offer a useful means of disentangling the evolutionary relationships leading to the origin of the Retroviridae in the future. Such approaches necessarily avoid the utilisation of env because: (i) the surface component (SU) of the retroviral envelope is the interface between host and virus, and is subject to particularly high substitution rates as a consequence of ongoing host–virus antagonistic coevolutionary cycles; and, (ii) phylogenetic analyses have provided evidence of the promiscuous acquisition of env, whereby recombination events lead to the envelope gene having a separate evolutionary history from the rest of the retroviral genome (i.e. so-called ‘env-swapping’ or ‘env-snatching’) [38, 39, 40].

Conclusions and future directions

Recent years have seen great developments in our understanding of retrovirus biology, including a deeper knowledge of retroviral phylogeny, diversity, and evolution. This has been fuelled by rapid advancements in genomics and bioinformatics, driving novel insights into what is traditionally considered to be a difficult group. High among the list of achievements is a much improved understanding of the evolution and origin of retroviral lineages, facilitated, for example, by rigorous screening of diverse host genomes and the development of novel and elegant forms of analysis [12••]. However, many challenges remain to more fully elucidate the deep evolutionary origins of retroviruses. More basal regions of retrovirus phylogeny are still relatively poorly known, with whole new retroviral clades being discovered and requiring better characterisation [3•, 12••]. Improving knowledge of phylogenetic relationships in this region of retroviral phylogeny may help to narrow down the ultimate date of retroviral origin, and illuminate the host lineages in which the first retroviruses evolved. Additionally, greater scrutiny of the evolutionary transitions leading up to the origin of the retroviruses is required. A current poor understanding of Metaviridae phylogeny complicates this. Further work to reconstruct the phylogeny of Metaviridae is also important to trace the evolution of the retroviral envelope gene and env-like genes in Ty3-gypsy-like elements, which is an enigmatic topic of potential medical importance. Further progress in these areas will likely require genomic screening of even greater swathes of host taxonomic diversity, alongside continued methodological developments. But as is clear from recent progress, persistence offers considerable scope for significant advancements, making the elucidation of retroviral origins an exciting research field to work in.

Conflict of interest statement

The authors declare no conflict of interest.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

This work was supported by a British Biological Sciences Research Council (BBSRC) David Phillips Fellowship to AH (grant number: BB/N020146/1).

References

  • 1.Katzourakis A., Gifford R.J. Endogenous viral elements in animal genomes. PLoS Genet. 2010;6:pe1001191. doi: 10.1371/journal.pgen.1001191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Holmes E.C. The evolution of endogenous viral elements. Cell Host Microbe. 2011;10:368–377. doi: 10.1016/j.chom.2011.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3•.Hayward A., Cornwallis C.K., Jern P. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution. Proc Natl Acad Sci U S A. 2015;112:464–469. doi: 10.1073/pnas.1414980112. [DOI] [PMC free article] [PubMed] [Google Scholar]; This paper uses a newly developed approach to estimating retroviral phylogeny to provide novel insights into retrovirus phylogeny, diversity, and macroevolution, using ERVs mined from a large number of host genomes.
  • 4.Coffin J.M. Cold Spring Harbor Laboratory Press; 1997. Retroviruses. [PubMed] [Google Scholar]
  • 5.King A.M.Q., Carstens E.B., Lefkowitz E.J. Elsevier; 2012. Virus Taxonomy: Classification and Nomenclature of Viruses: Ninth Report of the International Committee on Taxonomy of Viruses; pp. 477–495. [Google Scholar]
  • 6.Jern P., Sperber G.O., Blomberg J. Use of Endogenous Retroviral Sequences (ERVs) and structural markers for retroviral phylogenetic inference and taxonomy. Retrovirology. 2005;2 doi: 10.1186/1742-4690-2-50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dangel A.W. Complement component C4 gene intron 9 as a phylogenetic marker for primates — long terminal repeats of the endogenous Retrovirus Erv-K(C4) are a molecular clock of evolution. Immunogenetics. 1995;42:41–52. doi: 10.1007/BF00164986. [DOI] [PubMed] [Google Scholar]
  • 8.Gifford R.J. A transitional endogenous lentivirus from the genome of a basal primate and implications for lentivirus evolution. Proc Natl Acad Sci U S A. 2008;105:20362–20367. doi: 10.1073/pnas.0807873105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Brown K., Emes R.D., Tarlinton R.E. Multiple groups of endogenous epsilon-like retroviruses conserved across primates. J Virol. 2014;88:12464–12471. doi: 10.1128/JVI.00966-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cui J. Identification of diverse groups of endogenous gammaretroviruses in mega- and microbats. J Gen Virol. 2012;93(Pt 9):2037–2045. doi: 10.1099/vir.0.043760-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Sinzelle L. Characterization of a Xenopus tropicalis endogenous retrovirus with developmental and stress-dependent expression. J Virol. 2011;85:2167–2179. doi: 10.1128/JVI.01979-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12••.Aiewsakun P., Katzourakis A. Marine origin of retroviruses in the early Palaeozoic Era. Nat Commun. 2017;8:13954. doi: 10.1038/ncomms13954. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors provide the first direct evidence that retroviruses are a truly ancient group, stretching back to the origins of the vertebrates, employing novel analyses and a thorough screen of spuma-like ERVs from the genomes of diverse host taxa.
  • 13.Kijima T.E., Innan H. On the estimation of the insertion time of LTR retrotransposable elements. Mol Biol Evol. 2010;27:896–904. doi: 10.1093/molbev/msp295. [DOI] [PubMed] [Google Scholar]
  • 14.Katzourakis A. Discovery and analysis of the first endogenous lentivirus. Proc Natl Acad Sci U S A. 2007;104:6261–6265. doi: 10.1073/pnas.0700471104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Keckesova Z. Identification of a RELIK orthologue in the European hare (Lepus europaeus) reveals a minimum age of 12 million years for the lagomorph lentiviruses. Virology. 2009;384:7–11. doi: 10.1016/j.virol.2008.10.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16••.Lee A. Identification of an ancient endogenous retrovirus, predating the divergence of the placental mammals. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120503. doi: 10.1098/rstb.2012.0503. [DOI] [PMC free article] [PubMed] [Google Scholar]; Demonstration of an ancient retroviral orthologue that predates the divergence of the plancental mammals.
  • 17•.Katzourakis A. Macroevolution of complex retroviruses. Science. 2009;325:1512. doi: 10.1126/science.1174149. [DOI] [PubMed] [Google Scholar]; Application of multiple sources of evidence to show an ancient and stable co-evolutionary relationship between spumaretroviruses and mammals.
  • 18.Hayward A., Grabherr M., Jern P. Broad-scale phylogenomics provides insights into retrovirus–host evolution. Proc Natl Acad Sci U S A. 2013;110:20146–20151. doi: 10.1073/pnas.1315419110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19••.Switzer W.M. Ancient co-speciation of simian foamy viruses and primates. Nature. 2005;434:376–380. doi: 10.1038/nature03341. [DOI] [PubMed] [Google Scholar]; This article was the first to demonstrate ancient primate–spumaretrovirus cospeciation, highlighting an extremely low rate of spumavirus evolution and strong congruence between host–virus phylogeny.
  • 20.Katzourakis A. Discovery of prosimian and afrotherian foamy viruses and potential cross species transmissions amidst stable and ancient mammalian co-evolution. Retrovirology. 2014;11:p61. doi: 10.1186/1742-4690-11-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wicker T. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;8:973–982. doi: 10.1038/nrg2165. [DOI] [PubMed] [Google Scholar]
  • 22.Kapitonov V.V., Jurka J. A universal classification of eukaryotic transposable elements implemented in Repbase. Nat Rev Genet. 2008;9:411–412. doi: 10.1038/nrg2165-c1. author reply 414. [DOI] [PubMed] [Google Scholar]
  • 23•.Doolittle R.F. Origins and evolutionary relationships of Retroviruses. Quart Rev Biol. 1989;64:1–30. doi: 10.1086/416128. [DOI] [PubMed] [Google Scholar]; Early, pioneering work on the evolutionary origin of retroviruses, much of the discussion remains relevant today.
  • 24.Yue X., Eickbush T.H. Similarity of reverse transcriptase-like sequences of viruses, transposable elements, and mitochondrial introns. Mol Biol Evol. 1988;5:675–690. doi: 10.1093/oxfordjournals.molbev.a040521. [DOI] [PubMed] [Google Scholar]
  • 25•.Xiong Y., Eickbush T.H. Origin and evolution of retroelements based upon their reverse-transcriptase sequences. Embo J. 1990;9:3353–3362. doi: 10.1002/j.1460-2075.1990.tb07536.x. [DOI] [PMC free article] [PubMed] [Google Scholar]; An early seminal study concerning the evolutionary origin of retroelement lineages.
  • 26.Eickbush T.H., Jamburuthugoda V.K. The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res. 2008;134:221–234. doi: 10.1016/j.virusres.2007.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Magiorkinis G. Env-less endogenous retroviruses are genomic superspreaders. Proc Natl Acad Sci U S A. 2012;109:7385–7390. doi: 10.1073/pnas.1200913109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Eickbush T.H., Malik H.S. Origins and evolution of retrotransposons, in mobile DNA II. Am Soc Microbiol. 2002:1111–1144. [Google Scholar]
  • 29.Malik H.S., Henikoff S., Eickbush T.H. Poised for contagion: evolutionary origins of the infectious abilities of invertebrate retroviruses. Genome Res. 2000;10:1307–1318. doi: 10.1101/gr.145000. [DOI] [PubMed] [Google Scholar]
  • 30.Terzian C., Pelisson A., Bucheton A. Evolution and phylogeny of insect endogenous retroviruses. BMC Evol Biol. 2001;1:p3. doi: 10.1186/1471-2148-1-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Alberola T.M., de Frutos R. Molecular structure of a gypsy element of Drosophila subobscura (gypsyDs) constituting a degenerate form of insect retroviruses. Nucleic Acids Res. 1996;24:914–923. doi: 10.1093/nar/24.5.914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Misseri Y. Comparative sequence analysis and predictions for the envelope glycoproteins of insect endogenous retroviruses. Trends Microbiol. 2003;11:253–256. doi: 10.1016/s0966-842x(03)00119-7. [DOI] [PubMed] [Google Scholar]
  • 33.Song S.U. An Env-like protein encoded by a Drosophila retroelement — evidence that gypsy is an infectious retrovirus. Genes Dev. 1994;8:2046–2057. doi: 10.1101/gad.8.17.2046. [DOI] [PubMed] [Google Scholar]
  • 34.Kim A. Retroviruses in invertebrates: the gypsy retrotransposon is apparently an infectious retrovirus of Drosophila melanogaster. Proc Natl Acad Sci U S A. 1994;91:1285–1289. doi: 10.1073/pnas.91.4.1285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Krupovic M., Koonin E.V. homologous capsid proteins testify to the common ancestry of retroviruses, caulimoviruses, pseudoviruses, and metaviruses. J Virol. 2017;91 doi: 10.1128/JVI.00210-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Llorens C., Fares M.A., Moya A. Relationships of gag-pol diversity between Ty3/Gypsy and Retroviridae LTR retroelements and the three kings hypothesis. BMC Evol Biol. 2008;8:p276. doi: 10.1186/1471-2148-8-276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Groenen M.A.M. Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012;491:393–398. doi: 10.1038/nature11622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Henzy J.E., Johnson W.E. Pushing the endogenous envelope. Philos Trans R Soc Lond B Biol Sci. 2013;368:p20120506. doi: 10.1098/rstb.2012.0506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Vargiu L. Classification and characterization of human endogenous retroviruses; mosaic forms are common. Retrovirology. 2016;13:p7. doi: 10.1186/s12977-015-0232-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kim F.J. Emergence of vertebrate retroviruses and envelope capture. Virology. 2004;318:183–191. doi: 10.1016/j.virol.2003.09.026. [DOI] [PubMed] [Google Scholar]

RESOURCES