Summary
Mammalian hosts deploy a multitude of germline-encoded mechanisms to detect and restrict virus infection. These must avoid pathological responses to endogenous retroviruses (ERVs) and other endogenous retrotransposable elements (RTEs) – viruses and virus-like genomic parasites that have invaded the host germline and are passed down the generations as host genes. Although the location, specificity and sensitivity of innate pattern-recognition receptors (PRRs) and restriction factors are tuned to facilitate discrimination of infecting viruses from those that are part of self, immune cross-reactions do occur. The RTE viral heritage may therefore compromise the ability of the host to respond to virus infection without risking pathology. Nevertheless, it also provides opportunities for RTEs to be co-opted as an alarm amplification system, repurposed as antiviral factors and contribute to the evolution of antiviral genes.
Introduction
The evolutionary success of an organism is determined by replication of its genome in the face of exposure to a wide range of infectious parasites. All organisms have evolved defenses against infection (1). In mammals, distinct germline-encoded defenses cell-autonomously sense and neutralize invading viruses, and trigger signaling cascades alerting neighboring cells and the immune system to the presence of the invader. These defenses target evolutionarily conserved structures, such as viral capsids, which protect the invading genomes, viral envelope glycoproteins, which attach and fuse virus particles with the target cells, and viral genomes and their replication intermediates, all of which present patterns distinct from the host.
Restriction factors are germline-encoded host defense molecules that can directly limit virus replication by interfering with one or more steps in virus replication or by restricting the availability of cellular resources. Pattern-recognition receptors (PRRs) can sense virus-specific patterns and initiate a signaling cascade, leading to the production of interferons (IFNs) and induction of IFN-stimulated genes (ISGs) with antiviral activity. Although some restriction factors and PRRs are expressed constitutively, many are IFN-inducible, bolstering antiviral defenses following virus sensing. Moreover, certain restriction factors may either directly initiate pro-inflammatory signaling cascades or indirectly facilitate sensing by dedicated PRRs, connecting the network of innate antiviral defenses.
However, these defenses cannot evolve to provide complete resistance – otherwise, there would be no selection pressure to maintain them in the absence of exposure (1). Rather they provide sufficient resistance to ensure the survival of the host, so both host and virus species can survive. Moreover, these mechanisms can be potentially as damaging for the host as for the virus, and their deployment must be specific and proportionate to the risk of infection. During the ensuing arms race, viruses evolve countermeasures that allow them to evade or subvert host defenses and establish acute or chronic infection in somatic cells.
The situation becomes more complicated when we consider retroviruses, viruses that must integrate into host DNA as part of their replication cycle. Some can invade the host germline, thus becoming endogenous and blurring the boundaries between host and virus. These elements, termed endogenous retroviruses (ERVs), join other types of retrotransposable elements (RTEs) in the genome, with which they share a common ancestry (2) and the ability to amplify their DNA copies in the host germline through the reverse transcription and reintegration of their RNA transcripts into DNA. There are now over 4 million RTE integrations in the human genome – making up ˜42% of our DNA – which belong to distinguishable groups (2, 3), with distinct replication life-cycles and with specific consequences for the host innate and adaptive immune systems (Figure 1).
Figure 1. Distinct immunogenic features of exogenous viruses and RTEs.
Exogenous viruses typically target one or few cell types, where they produce excessive amounts of viral nucleic acids, often with distinctive molecular patterns and recognized by common or cell type-specific sensors. Their virions also access specialized sentinel cells, such as pDCs, which become the predominant sources of IFN. In contrast, RTEs are present and can be reactivated in every nucleated cell, but are rarely transcribed at high levels, or produce specific molecular patterns and extracellular particles that would access pDCs.
Through accumulated mutations and deletions, the vast majority of RTEs have lost the ability to replicate or retrotranspose. In addition, they are suppressed by epigenetic mechanisms developed in response to these genomic parasites. Nevertheless, some RTEs are still able to produce replication intermediates and proteins that can activate antiviral immunity. This creates a challenge for the host to discriminate between the few RTEs that retain pathogenic potential from those that no longer pose a threat or have even been co-opted in host physiology, whilst still maintaining sufficient responsiveness to exogenous virus infection. Here, we review how host antiviral defenses have evolved in the context of RTEs, and discuss the surprising opportunities and ammunition against other infections that RTEs provide for their host.
Guarding against cellular entry and exit of viruses
Perhaps most effective defenses against infection are barriers to the ingress of the invader’s genetic material through the membrane of the host cell. Entry of enveloped viruses requires attachment to cellular receptors and fusion of lipid membranes. Several host factors have evolved to block this step or reduce its efficiency. Where this defense succeeds, no further host action may be necessary. Processes required for virus entry are, therefore, frequently targeted by host restriction factors.
A recurring mechanism in host defense is the expression of intact or mutated envelope glycoproteins from ERVs, which blocks entry of exogenous viruses in a manner resembling viral superinfection resistance, whereby infection with a virus that uses a particular cellular entry receptor prevents entry of other viruses that use the same receptor. This has led to the convergence in evolution of exogenous retrovirus restriction in many species, including mice, cats and sheep (Table 1) (4). A preserved copy of the human endogenous retrovirus (HERV)-T envelope glycoprotein that can bind its cognate receptor, MCT-1, without causing membrane fusion, may have led to the extinction of this family of exogenous retroviruses in our primate ancestors (5). Syncytin-1, derived from a HERV-W envelope glycoprotein and co-opted in human placentation, as well as the phylogenetically distant ERVH48-1 envelope glycoprotein (also known as Suppressyn) bind to ASCT2 as a receptor. ASCT2 acts as a cellular receptor for several other retroviruses, including currently circulating primate retroviruses, and Syncytin-1 and ERVH48-1 binding to ASCT2 can, consequently, inhibit infection with diverse endogenous and exogenous retroviruses (6, 7).
Table 1. RTE-related antiviral sensors and restriction factors.
Only mammalian antiviral sensors and restriction factors that are directly derived or shaped by RTEs are included. The species in which they were first described is also indicated. eMLV, ecotropic MLV; enJSRV, endogenous JSRV; ERV-DC, feline endogenous gammaretrovirus of the domestic cat; FeLV, Feline leukemia virus; MCF, mink cell focus-forming; MERV, murine endogenous retrovirus.
Antiviral factor | Relation to RTEs | Ligand or target | Antiviral activity | ||
---|---|---|---|---|---|
Gene | Protein | RTEs | Exogenous viruses | ||
Fv4 (mouse) | Fv4 | eMLV Env-derived | mCAT-1 | eMLV | eMLV |
Rmcf (mouse) | Rmcf | MCF virus Env-derived | XPR1 | Nonecotropic MLV |
Nonecotropic MLV |
Refrex1 (cat) | Refrex-1 | ERV-DC Env-derived | COPT1 | ERV-DC | ERV-DC |
Felix (cat) | FeLIX | FeLV-B Env-derived | PiT1 | FeLV-B | FeLV-B |
ERVS71-1 (human) | ENVT1 | HERV-T Env-derived | MCT-1 | HERV-T (defective) | HERV-T (extinct) |
ERVH48-1 (human) | Suppressyn | ERVH48-1 Env-derived | ASCT2 | ERVH48-1 (defective) |
RD114/mammalian type-D retroviruses |
BST2 (human) | BST2; Tetherin |
Antagonized by HERV-K(HML-2) Env |
Env, budding | ERVs, Exosomes, LINEs | Retroviruses, Enveloped Viruses |
Fv1 (mouse) | Fv1 | MERV-L Gag-derived | Capsid uncoating | MLV | MLV, diverse retroviruses |
enJS56A1 (sheep) | enJS56A1 | JSRV Gag-derived | Capsid assembly | enJSRV | JSRV |
Apobec3 (mouse) | ABEC3 | MLV insertion in the Rfv3 allele | DNA, RNA | ERVs, LINEs | MLV |
APOBEC3B (human) | ABC3B | Regulation by nearby HERV-H | DNA, RNA | ERVs, LINEs | Retroviruses, Hepadnaviruses |
AIM2 (human) | AIM2 | IFN inducibility by MER41 |
Cytoplasmic DNA | Untested | Diverse DNA and RNA viruses |
IFI6 (human) | IFI6 | IFN inducibility by MER41 | Unclear | Untested | Diverse DNA and RNA viruses |
GBP2 (human) | GBP2 | Driven by ERV9 LTR12 |
Env processing | HERV-K(HML-2) pseudotypes | Diverse enveloped viruses |
GBP5 (human) | GBP5 | Driven by ERV9 LTR12 |
Env processing | HERV-K(HML-2) pseudotypes | Diverse enveloped viruses |
TLR7 (human) | TLR7 | Regulation by nearby HERV-H | Endosomal ssRNA | ERVs, LINEs, SINEs | Diverse ssRNA viruses |
TLR8 (human) | TLR8 | Regulation by nearby HERV-H | Endosomal ssRNA | ERVs, LINEs, SINEs | Diverse ssRNA viruses |
Efficient transmission of exogenous mouse mammary tumor viruses (MMTVs) relies on induction of T cell proliferation by virally-encoded superantigens. Endogenous MMTV proviruses promote thymic deletion of superantigen-reactive T cells (3), thus providing relative resistance to exogenous MMTV infection through a unique mechanism.
Viral egress also represents an important target for virus restriction where ERV products may play a role. ERV envelope glycoproteins can inhibit retroviruses that use unrelated receptors. For example, defective envelope glycoproteins from the most recently endogenized HERV-K(HML-2) family have been shown to limit production of human immunodeficiency virus-1 (HIV-1) in an incompletely understood manner (8). Tetherin (encoded by BST2) is a restriction factor that prevents the release of enveloped virus particles from infected cells by physically tethering them to the plasma membrane of the producer cell. It is active against most tested viruses, including HERV-K(HML-2). In addition to this direct activity, virus assembly at the plasma membrane can be sensed as a molecular pattern by Tetherin, triggering NF-κB-dependent proinflammatory responses, and Tetherin-mediated accumulation of virus particles may also facilitate sensing by other receptors (9).
Several exogenous viruses can inactivate Tetherin, albeit using different mechanisms. Such antagonistic activity seems to be retained by the envelope glycoprotein of some HERV-K(HML-2) proviruses, expression of which at the plasma membrane can inactivate Tetherin (10), potentially facilitating instead of inhibiting exogenous virus release. As HERV-K(HML-2) envelope glycoproteins are variably expressed between individuals and certain proviruses are insertionally polymorphic – present in some individuals but not others – they may differentially affect susceptibility to exogenous infection by disparate effects on Tetherin (10).
The apparent success of restriction mechanisms blocking viral entry and exit may be responsible for the evolution of an RTE life cycle involving intracellular genome amplification without the need for cell-to-cell transmission or somatic cell infection (11), further complicating the host’s task in distinguishing endogenous from exogenous virus replication (3).
Targeting incoming viral genomes
If blocks at the cell membrane are circumvented, a second tier of defenses awaits. These include restriction factors that act to prevent the synthesis of complete complementary DNA (cDNA) copies of the viral genome or its access to the host genome for integration (12). A key target is the viral capsid protein that forms a shell enclosing the viral replication machinery and preventing sensing of viral nucleic acids. Examples of restriction factors that target the viral capsid include TRIM5α, a member of the large family of proteins bearing the Tripartite Motif, many of which contribute to antiviral immunity, and Fv1 which is derived from a murine endogenous retrovirus L family (MuERV-L) RTE and has followed an evolutionary pathway spawning variants with specific recognition properties for different retroviruses (13). TRIM5α appears to act by causing premature uncoating of the viral core inhibiting cDNA synthesis or exposing the viral nucleic acids to TREX1, which digests cytosolic DNA and cDNA, while Fv1 most likely prevents core transit to the nucleus (14).
As with antiviral restriction factors targeting envelope glycoprotein-mediated virus entry and exit, those targeting the viral capsid can also act at multiple steps of the viral life-cycle. For example, the Gag protein from an endogenous HERV-K(HML-2) provirus can poison HIV-1 core assembly (15), as does the Gag protein of endogenous Jaagsiekte sheep retrovirus (JSRV), which inhibits assembly of the cognate infectious retrovirus (16). These examples further underscore the repeated co-option of ERV proteins as restriction factors that provide genetic resistance to retroviral infection.
Triggering an IFN response
Successful entry of a virus particle into a cell warrants a response, proportional to the extent of subsequent virus replication. This requires fast and accurate detection of the invader, coupled to immediate block of replication in the infected cell, as well as of virus spread to other cells. Detection is achieved by a battery of strategically placed PRRs, including Toll-like receptors (TLRs), RIG-I-like receptors (RLRs), NOD-like Receptors (NLRs) and AIM2-like receptors (ALRs), that recognize diverse pathogen-associated molecular patterns (PAMPs), but may also bind ligands produced by the host cell. Several mechanisms have been described that maximize the ability of PRRs to sense infection, whilst limiting reactivity with self-generated ligands, including those from RTEs (17).
PRRs can evolve specificity for structures that are produced exclusively or primarily during virus replication. They may also be expressed in a cell-specific manner or predominantly in specialized sentinels, such as plasmacytoid dendritic cells (pDCs) (Figure 1). Nevertheless, discrimination between self- and virus-produced PRR ligands is not always possible, necessitating additional mechanism of regulation.
Owing to 5’ capping and other modifications, Pol II-transcribed, host mRNAs are not sensed by RIG-I, which recognizes blunt-ended, double-stranded RNA (dsRNA), unmethylated at the 2’-O position, and with 5’ di- or tri-phosphates. However, Pol III-transcribed host RNAs, including 5S rRNA, and RTE RNAs, including the non-autonomous short interspersed nuclear elements (SINE), lack 5’ capping and, instead, avoid sensing by RIG-I by forming complex secondary structures or by interacting with RNA-binding proteins which shield them from detection (18). The dsRNA sensor MDA5, encoded by IFIH1, recognizes dsRNA longer than 300 base pairs (bp) produced during replication of certain viruses, as well as by the host cell. Ancestrally derived from the 7SL RNA gene, the primate-specific Alu subfamily of SINE have accumulated to over 1 million copies in the human genome, thus increasing the probability that two or more Alu repeats may be present in the same RNA transcript. Inverted Alu repeats that are sufficiently similar and proximal can form intramolecular RNA hairpins for the entire length of the Alu sequence, which averages ˜280 bp. This length, together with imperfect complementarity of the dsRNA region, further disrupted by ADAR-mediated editing, considerably reduces the ability of such hairpins to activate MDA5, suggesting the length of Alu sequence and tolerance of MDA5 to dsRNA mismatches are tuned to avoid this interaction (19). This balance is disrupted by gain-of-function IFIH1 mutations or loss of ADAR activity, triggering spontaneous IFN production in immune disorders, including Aicardi-Goutières syndrome (AGS) and systemic lupus erythematosus (SLE) (20). Additional mechanisms that limit the availability of endogenous ligands for PRRs include nucleoside or nucleotide metabolizing enzymes such as SAMHD1, TREX1 and HELZ2, which digests highly structured RNAs (21).
When endogenous ligands cannot be efficiently modified or removed, their sensing can still be limited by regulated PRR expression and localization. Endosomal localization of the ligand-biding domains of dsRNA sensor TLR3, single-stranded RNA (ssRNA) sensors TLR7 and TLR8, and unmethylated CpG DNA sensor TLR9, also prevents detection of cytosolic ligands, and differences in intracellular localization also determine the targeting of host or virus RNA by ZAP (22). Although several PRRs are constitutively expressed in anticipation of infection, several require IFN to induce their transcription or a different isoform of the protein is induced following an IFN response. For example, in response to IFN, the ADAR gene produces a larger isoform that, in addition to dsRNA-binding domains, contains a domain (Zα) binding Z-form DNA or RNA. IFN signaling is a prerequisite for expression of the only other known mammalian protein with Zα domains, ZBP1, which can sense Z-form DNA or RNA, leading to further IFN induction or necroptosis.
Certain PRRs, including those that are inducible by IFN, can regulate the IFN response at the resolution phase of an infection. The ZC3HAV1 gene encodes two isoforms of the RNA-binding protein ZAP that differ in length. While the longer isoform is constitutively expressed and targets viral RNA, the expression of the shorter isoform requires IFN and targets the mRNA of IFN genes for degradation, thereby negatively regulating the IFN response (22). Therefore, the IFN response to exogenous virus infection appears to be finely tuned by coordinated regulation of sensing and signaling cascades, as well as by negative feedback loops.
Innate sensing of RTEs
An evolutionary trade-off between prevention of overt PRR engagement by RTEs and the necessity to maintain sensitivity to exogenous virus implies that RTE sensing may not be entirely avoided, particularly when homeostasis is disrupted. Moreover, as not all RTEs have been fully disarmed through evolution, the host must be prepared to sense their dysregulation to prevent further spread in the germline, as well as in somatic cells. Failure of such mechanisms is indicated by the frequent resurrection of ERVs in sensor- and immune-deficient mice (23, 24).
At least three conditions contribute to RTE products being expressed and sensed by cells. Firstly, epigenetic repression, which otherwise prevents RTE transcription, can be reversed during cellular infection, damage or malignant transformation, leading to the production of excessive RTE nucleic acids and proteins. Indeed, several exogenous viruses can upregulate RTE transcription, either through direct transactivation (25) or indirectly, in the case of SINE, through elevated transcription by host Pol III (18). Exogenous viruses also induce pro-inflammatory mediators, and target RTE repressors, restriction and other antiviral factors (26), all of which enhance RTE transcription indirectly. Infection with exogenous retroviruses in particular, can complement replication defects in ERVs, leading to their mobilization. Secondly, IFN production during viral infection or other pathologies, will additionally upregulate PRR expression which may facilitate detection of RTE products. Thirdly, to complete their own replication, viruses may also counteract restriction factors and sensor regulators, such as SAMHD1, Tetherin, PKR, and the APOBEC3 family, further enhancing the availability of RTE products for sensing.
The increased availability of RTE-derived ligands, coupled with increased PRR expression and reduced antiviral factor expression now permits the sensing of RTEs, which can contribute to the overall antiviral response of the host (Figure 2). Sensing of RTEs in the absence of infection or malignant transformation can directly cause or increase the risk of age-related inflammation, systemic autoimmunity or severe interferonopathies (3). However, it may be less pathogenic during virus infection, which should independently trigger an antiviral response. While certain viruses can benefit from the RTEs they induce (27), the contribution of induced RTEs to the antiviral response can enhance resistance to exogenous virus infection (26, 28).
Figure 2. RTE activation and sensing triggered by viral infection.
Infection with exogenous viruses increases RTE transcription and availability either directly, through transactivation by viral transcription factors, and indirectly thought pro-inflammatory mediators or by inactivating repressors restriction and other antiviral factors that normally silence or control RTEs. In parallel, IFN production during viral infection upregulates PRR expression. The combination of increased RTE transcription and reduced control, and increased PRR availability can initiate innate immune responses to RTEs, which, in turn, amplify responses to the exogenous infection.
RTEs as regulators of adjacent innate immune gene function
Owing to their mutagenic potential and the regulatory sequences they carry, RTEs are a major force of host genome innovation. RTE integrations near or within genes can modify gene function on longer evolutionary timescales and with more profound consequences, and innate immune genes involved in sensing or restriction are no exception. RTE regulatory effects on genes that, in turn, affect resistance to infection or control RTE activity may have strong evolutionary effects.
For example, an intronic integration of a murine leukemia virus (MLV) long terminal repeat (LTR) in murine Apobec3 enhances its expression and, consequently, its antiviral activity, thereby providing resistance to exogenous MLV infection (29). In human embryonic stem cells (ESCs), a HERV-H integration has been co-opted as a functional enhancer of APOBEC3G, and genome-wide HERV-H transcriptional activity is positively correlated with expression of the entire APOBEC3 locus, as well as of the restriction factor IFITM1 (30). A subfamily of murine B2 SINE repeats was found to contain STAT1-binding sites and function as IFN-inducible enhancers (31), and MER41_BT and Bov-A2 repeats play a similar role in bovine cells (32). Human MER41 repeats provide IFN inducibility to several antiviral sensors and effectors (33), including AIM2, a cytosolic dsDNA sensor, and IFI6, which exhibits broad antiviral activity (34, 35), but may also prevent RIG-I activation through competition for RNA ligands (36). Solitary LTR12C – single copies of the ERV9 subfamily LTR – also regulate expression and IFN inducibility of the downstream genes GBP2 and GBP5, which inhibit envelope glycoproteins of exogenous viruses, as well as of ERVs, such as HERV-K(HML-2), being processed by the cellular protease furin (37, 38). An intronic L1M2a element has been co-opted as an enhancer of human IFNAR1, regulating both constitutive and IFN-inducible expression in B cells (39). Genetic loss of TRIM28 in murine dendritic cells leads to the induction of proinflammatory and antiviral genes, including Ifitm1 and Gbp2b, which has been attributed to regulatory control by adjacent derepressed RTEs (40). A copy of the Lx9c11 murine long interspersed nuclear element (LINE) subfamily has created the non-coding RNA Lx9c11-RegoS, which is indispensable in preventing lethal hyperinflammation during virus infection, through regulation of the genes encoding the Schlafen (SLFN) family of proteins (41).
Their cis-regulatory potential allows RTE integrations that are fixed in the genome to impact immune gene function negatively, too. Gene-dysregulating effects of RTEs are normally minimized by their epigenetic repression and suppression of their exonisation, but are particularly pronounced following malignant transformation, which is associated with RTE derepression, and may also manifest in other contexts. Transcriptional activation of a HERV-H provirus integrated downstream of the tandem TLR7 and TLR8 loci on human chromosome X initiates antisense transcription that is associated with loss of TLR7 and TLR8 transcription in several cancer types (42). Similarly, antisense transcription driven by a MER11C integration downstream of human APOBEC3B reduces its expression, without affecting expression of other APOBEC3 members on chromosome 22 in tumors of the male germline (42). Moreover, hypoxia-driven transcriptional activation of an intronic HERV-E provirus reduces the expression of RNGTT (42), encoding an enzyme that initiates 5’ capping of newly synthesized mRNAs, a modification essential for mRNA stability, translation and prevention of RIG-I sensing.
The cis-regulatory potential of germline RTEs may also modify the function of proteins involved in viral infection and concomitant innate immune response. ACE2 is a major entry receptor for seasonal and pandemic human coronaviruses, HCoV-NL63, SARS-CoV and SARS-CoV-2; and higher-affinity ACE2 binding is a critical but readily acquired trait for animal coronavirus spillover to humans (43). Evolutionary ancient intronic integrations of MIRb and LTR16A1 repeats act as the promoter of an IFN-inducible, truncated ACE2 isoform, thus coupling the IFN response to the induction of an ACE2 protein form that cannot mediate virus entry (44). A primate-specific, intronic AluJr repeat has been co-opted as an alternative terminal exon of IFNAR2, creating non-functional decoy receptor for type I IFNs that lacks the intracellular domain responsible for downstream signaling, thereby regulating the antiviral response (45). These examples highlight the immense potential of RTEs to alter or tune the function and expression of innate immune genes, with direct consequences for resistance to viral infection.
Retention of viral heritage hinders co-option or causes pathology
Although RTEs are now part of the host genetic constitution and can be co-opted in essential host processes, this peaceful coexistence can sometimes be disrupted, causing such processes to fail.
The fusogenic potential of ERV envelope glycoproteins has been recurrently co-opted by placental mammals, in the form of the Syncytin genes, and function during the formation of the syncytiotrophoblast, a multinucleated layer of cells covering the placenta (46). Syncytin-mediated fusion of trophoblast cells follows the same principles as fusion during virus entry and may be subject to restriction by the same factors that have evolved to prevent the latter. Indeed, syncytiotrophoblast formation can be adversely affected by IFITMs (47). These are IFN-inducible restriction factors that inhibit the entry of several viruses by targeting envelope glycoproteins, and IFITM1 may also suppress several RTE families in human embryonic stem cells (48). Excessive activity of IFITMs induced by IFN or congenital infection can target Syncytins, by virtue of their retroviral origin, ultimately leading to complications in pregnancy (47). Furthermore, the antiviral activity of GBP5 can also inhibit the processing and fusogenic function of Syncytin-1 (49). These findings highlight a balance between the need to control potential pathogenic RTEs or exogenous viruses and the exapted function of Syncytins at that stage of human embryo development.
The promoter and enhancer function of HERV LTRs that has been co-opted in the transcription and IFN inducibility of innate immune genes represents another example where host mechanisms evolved to limit infection may modulate this alternative function. RTEs are epigenetically repressed, in part by a multi-gene family of KZNPs, which recruit the repressor TRIM28 (also known as KAP1) (50). Together with specialized mechanisms, such as PIWI-interacting RNAs (piRNAs), silencing RTEs specifically in the germline (51), KZNP-mediated repression may reduce the immediate risk to adjacent genes posed by cis-regulatory activity of newly-acquired RTEs, which can then be tested over longer evolutionary time for potentially beneficial activities through gradual or somatic cell type-specific release from KZNP control. However, excessive or persistent KZNP-mediated repression can restrict the evolutionary advantage that such cis-regulatory activity may provide. Indeed, reduced KZNP activity in subsets of elite controllers of HIV-1 infection has been correlated with increased LTR-driven expression of key antiviral genes, including GBP2 and GBP5 (52), suggesting that KZNP targeting of proviral LTRs may modulate the ability of exapted cis-regulatory LTRs to drive or otherwise regulate antiviral genes.
Conclusions and future directions
Although the interaction of RTEs with host innate immunity may be pathogenic when dysregulated, or hinder the co-option of RTEs in physiological processes of the host, it also provides opportunities for the evolution of novel immune function. The current evidence supports a model, whereby the transcriptional responsiveness of RTEs to viral infection or malignant transformation may provide an alarm amplification system for such types of danger. Exogenous viruses can outpace host innate immunity with faster evolution of countermeasures. In contrast, RTE integrations follow the much slower evolutionary pace of the host genome. This increased genetic stability, together with their presence in every host cell, creates an ideal source of innate immune stimulation that can augment the immunogenicity of infected or transformed cells.
Inactivation of RTEs, either during the replication process or over evolutionary time also offers the opportunity to repurpose their individual components in the antiviral response, as exemplified by the utilization of defective ERV envelope glycoproteins and capsids as restriction factors. This creates a cycle, initiated by widespread infection with an exogenous retrovirus, occasionally leading to its endogenization and eventual inactivation of endogenous copies through mutation (Figure 3). Expression of defective components from the now endogenous copies may block infection with the cognate circulating retrovirus, ultimately leading to its extinction, and endogenous copies become fixed in the host germline (Figure 3). While the antiviral activities of defective ERVs might have been effective against ancient viral epidemics caused by their exogenous counterparts, they do not necessarily protect against modern-day or future viruses (53). The introduction of a retrovirus that bypasses existing ERV-derived antiviral activities initiates another cycle, until effective antiviral activities evolve further or are acquired de novo by the host.
Figure 3. A proposed cycle of retroviral epidemic and evolution of innate resistance.
Infection with a new exogenous retrovirus may occasionally cause germline infection leading to endogenization of one or more proviral copies (ERV). With long evolutionary time, ERVs are co-opted to produce defective products that interfere and restrict infection with related retroviruses, and achieve fixation in the host genome. When such restriction is efficient, it drives related exogenous viruses to extinction, but may not necessarily protect against more distant exogenous retroviruses, which can cause the next epidemic.
In addition to their ability to trigger an IFN response and to the co-option of their components as restriction factors, genomic RTE integrations that are eventually allowed by the host to function as regulatory sequences contribute to genomic innovation, including the control of innate immunity genes. Identification of such functions may not be trivial. RTEs in the genome can be recognized by homology with consensus sequences. However, high sequence similarity between copies of the same RTE family within the same genome, as well as sequence and insertional polymorphisms between genomes complicate their analyses. Even more challenging is the annotation of transcripts that utilize or overlap with RTEs, particularly transcript isoforms that may characterize pathological conditions, such as malignant transformation (54, 55), although it will be facilitated by more comprehensive annotation of the diversity and distribution of RTEs in the human reference genome (56), and by advances in long-read RNA sequencing methodologies. The IFN response is also associated with characteristic shift in the transcriptome (57), and it would be important to characterize the contribution of RTEs in the creation of the IFN-inducible transcript isoforms. A potential contribution of RTEs is further supported by the association of their epigenetic state with inter-individual variation in the innate immune response to HIV-1 (52) and Influenza A (58). More comprehensive genomic and transcriptomic analyses of RTEs, including their inter-individual variation, will be an essential step toward our understanding of the evolutionary constraints our innate immune system faces in its fight against current and future exogenous viruses.
Teaser.
Mammalian hosts continue to sense, control and even enlist endogenous retroelements in the evolution of antiviral defenses
Acknowledgments
Funding
This work was supported by the Francis Crick Institute (CC2088), which receives its core funding from Cancer Research UK, the UK Medical Research Council, and the Wellcome Trust. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant no. 101018670).
Footnotes
Author contributions: G.K. and J.P.S. researched the literature and wrote the manuscript.
Competing interests: G.K. is a scientific cofounder of EnaraBio and a member of its scientific advisory board. G.K. has consulted for EnaraBio, Repertoire Immune Medicines, ErVimmune and AdBio Partners. The authors declare no other competing interests.
References and notes
- 1.Cohn M. The immune system: a weapon of mass destruction invented by evolution to even the odds during the war of the DNAs. Immunol Rev. 2002;185:24–38. doi: 10.1034/j.1600-065x.2002.18504.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wells JN, Feschotte C. A Field Guide to Eukaryotic Transposable Elements. Annu Rev Genet. 2020;54:539–561. doi: 10.1146/annurev-genet-040620-022145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kassiotis G. The Immunological Conundrum of Endogenous Retroelements. Annu Rev Immunol. 2023;41:99–125. doi: 10.1146/annurev-immunol-101721-033341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Malfavon-Borja R, Feschotte C. Fighting fire with fire: endogenous retrovirus envelopes as restriction factors. J Virol. 2015;89:4047–4050. doi: 10.1128/JVI.03653-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Blanco-Melo D, Gifford RJ, Bieniasz PD. Co-option of an endogenous retrovirus envelope for host defense in hominid ancestors. Elife. 2017;6 doi: 10.7554/eLife.22519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Frank JA, et al. Evolution and antiviral activity of a human protein of retroviral origin. Science. 2022;378:422–428. doi: 10.1126/science.abq7871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Khare S, et al. Receptor-recognition and antiviral mechanisms of retrovirus-derived human proteins. Nat Struct Mol Biol. 2024;31:1368–1376. doi: 10.1038/s41594-024-01295-6. [DOI] [PubMed] [Google Scholar]
- 8.Terry SN, et al. Expression of HERV-K108 envelope interferes with HIV-1 production. Virology. 2017;509:52–59. doi: 10.1016/j.virol.2017.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Galão RP, Le Tortorec A, Pickering S, Kueck T, Neil SJ. Innate sensing of HIV-1 assembly by Tetherin induces NFκB-dependent proinflammatory responses. Cell Host Microbe. 2012;12:633–644. doi: 10.1016/j.chom.2012.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lemaître C, Harper F, Pierron G, Heidmann T, Dewannieux M. The HERV-K human endogenous retrovirus envelope protein antagonizes Tetherin antiviral activity. J Virol. 2014;88:13626–13637. doi: 10.1128/JVI.02234-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Belshaw R, Katzourakis A, Paces J, Burt A, Tristem M. High copy number in human endogenous retrovirus families is associated with copying mechanisms in addition to reinfection. Mol Biol Evol. 2005;22:814–817. doi: 10.1093/molbev/msi088. [DOI] [PubMed] [Google Scholar]
- 12.Malim MH, Bieniasz PD. HIV Restriction Factors and Mechanisms of Evasion. Cold Spring Harb Perspect Med. 2012;2:a006940. doi: 10.1101/cshperspect.a006940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yap MW, Young GR, Varnaite R, Morand S, Stoye JP. Duplication and divergence of the retrovirus restriction gene Fv1 in Mus caroli allows protection from multiple retroviruses. PLoS Genet. 2020;16:e1008471. doi: 10.1371/journal.pgen.1008471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wolf D, Goff SP. Host restriction factors blocking retroviral replication. Annu Rev Genet. 2008;42:143–163. doi: 10.1146/annurev.genet.42.110807.091704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Monde K, et al. Molecular mechanisms by which HERV-K Gag interferes with HIV-1 Gag assembly and particle infectivity. Retrovirology. 2017;14:27. doi: 10.1186/s12977-017-0351-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mura M, et al. Late viral interference induced by transdominant Gag of an endogenous retrovirus. Proc Natl Acad Sci U S A. 2004;101:11117–11122. doi: 10.1073/pnas.0402877101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bartok E, Hartmann G. Immune Sensing Mechanisms that Discriminate Self from Altered Self and Foreign Nucleic Acids. Immunity. 2020;53:54–77. doi: 10.1016/j.immuni.2020.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Naesens L, Haerynck F, Gack MU. The RNA polymerase III-RIG-I axis in antiviral immunity and inflammation. Trends Immunol. 2023;44:435–449. doi: 10.1016/j.it.2023.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ahmad S, et al. Breaching Self-Tolerance to Alu Duplex RNA Underlies MDA5-Mediated Inflammation. Cell. 2018;172:797–810.:e713. doi: 10.1016/j.cell.2017.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Vinuesa CG, Grenov A, Kassiotis G. Innate virus-sensing pathways in B cell systemic autoimmunity. Science. 2023;380:478–484. doi: 10.1126/science.adg6427. [DOI] [PubMed] [Google Scholar]
- 21.Luqman-Fatah A, Nishimori K, Amano S, Fumoto Y, Miyoshi T. Retrotransposon life cycle and its impacts on cellular responses. RNA Biol. 2024;21:11–27. doi: 10.1080/15476286.2024.2409607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Schwerk J, et al. RNA-binding protein isoforms ZAP-S and ZAP-L have distinct antiviral and immune resolution functions. Nat Immunol. 2019;20:1610–1620. doi: 10.1038/s41590-019-0527-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Young GR, et al. Resurrection of endogenous retroviruses in antibody-deficient mice. Nature. 2012;491:774–778. doi: 10.1038/nature11599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Yu P, et al. Nucleic acid-sensing Toll-like receptors are essential for the control of endogenous retrovirus viremia and ERV-induced tumors. Immunity. 2012;37:867–879. doi: 10.1016/j.immuni.2012.07.018. [DOI] [PubMed] [Google Scholar]
- 25.Evans EF, Saraph A, Tokuyama M. Transactivation of Human Endogenous Retroviruses by Viruses. Viruses. 2024;16 doi: 10.3390/v16111649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hale BG. Antiviral immunity triggered by infection-induced host transposable elements. Curr Opin Virol. 2022;52:211–216. doi: 10.1016/j.coviro.2021.12.006. [DOI] [PubMed] [Google Scholar]
- 27.Karijolich J, Abernathy E, Glaunsinger BA. Infection-Induced Retrotransposon-Derived Noncoding RNAs Enhance Herpesviral Gene Expression via the NF-κB Pathway. PLoS Pathog. 2015;11:e1005260. doi: 10.1371/journal.ppat.1005260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Rookhuizen DC, et al. Induction of transposable element expression is central to innate sensing. bioRxiv. 2021:2021.2009.2010.457789 [Google Scholar]
- 29.Sanville B, et al. Adaptive evolution of Mus Apobec3 includes retroviral insertion and positive selection at two clusters of residues flanking the substrate groove. PLoS Pathog. 2010;6:e1000974. doi: 10.1371/journal.ppat.1000974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Singh M, et al. A new human embryonic cell type associated with activity of young transposable elements allows definition of the inner cell mass. PLoS Biol. 2023;21:e3002162. doi: 10.1371/journal.pbio.3002162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Horton I, Kelly CJ, Dziulko A, Simpson DM, Chuong EB. Mouse B2 SINE elements function as IFN-inducible enhancers. Elife. 2023;12 doi: 10.7554/eLife.82617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kelly CJ, Chitko-McKown CG, Chuong EB. Ruminant-specific retrotransposons shape regulatory evolution of bovine immunity. Genome Res. 2022;32:1474–1486. doi: 10.1101/gr.276241.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chuong EB, Elde NC, Feschotte C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science. 2016;351:1083–1087. doi: 10.1126/science.aad5497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Le Pen J, Rice CM. The antiviral state of the cell: lessons from SARS-CoV-2. Curr Opin Immunol. 2024;87:102426. doi: 10.1016/j.coi.2024.102426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Richardson RB, et al. A CRISPR screen identifies IFI6 as an ER-resident interferon effector that blocks flavivirus replication. Nat Microbiol. 2018;3:1214–1223. doi: 10.1038/s41564-018-0244-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Villamayor L, et al. Interferon alpha inducible protein 6 is a negative regulator of innate immune responses by modulating RIG-I activation. Front Immunol. 2023;14:1105309. doi: 10.3389/fimmu.2023.1105309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Braun E, et al. Guanylate-Binding Proteins 2 and 5 Exert Broad Antiviral Activity by Inhibiting Furin-Mediated Processing of Viral Envelope Proteins. Cell Rep. 2019;27:2092–2104.:e2010. doi: 10.1016/j.celrep.2019.04.063. [DOI] [PubMed] [Google Scholar]
- 38.Srinivasachar Badarinarayan S, et al. HIV-1 infection activates endogenous retroviral promoters regulating antiviral gene expression. Nucleic Acids Res. 2020;48:10890–10908. doi: 10.1093/nar/gkaa832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Buttler CA, Ramirez D, Dowell RD, Chuong EB. An intronic LINE-1 regulates IFNAR1 expression in human immune cells. Mob DNA. 2023;14:20. doi: 10.1186/s13100-023-00308-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Chikuma S, et al. TRIM28 Expression on Dendritic Cells Prevents Excessive T Cell Priming by Silencing Endogenous Retrovirus. J Immunol. 2021;206:1528–1539. doi: 10.4049/jimmunol.2001003. [DOI] [PubMed] [Google Scholar]
- 41.Bartonicek N, et al. The retroelement Lx9 puts a brake on the immune response to virus infection. Nature. 2022;608:757–765. doi: 10.1038/s41586-022-05054-9. [DOI] [PubMed] [Google Scholar]
- 42.Loong J, et al. Retroelement co-option disrupts the cancer transcriptional programme. Genome Med. 2025;17:48. doi: 10.1186/s13073-025-01479-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Starr TN, et al. ACE2 binding is an ancestral and evolvable trait of sarbecoviruses. Nature. 2022;603:913–918. doi: 10.1038/s41586-022-04464-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ng W, et al. Tissue-specific and interferon-inducible expression of nonfunctional ACE2 through endogenous retroelement co-option. Nat Genet. 2020;52:1294–1302. doi: 10.1038/s41588-020-00732-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Pasquesi GIM, et al. Regulation of human interferon signaling by transposon exonization. Cell. 2024;187:7621–7636.:e7619. doi: 10.1016/j.cell.2024.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Lavialle C, et al. Paleovirology of 'syncytins', retroviral env genes exapted for a role in placentation. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120507. doi: 10.1098/rstb.2012.0507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Buchrieser J, et al. IFITM proteins inhibit placental syncytiotrophoblast formation and promote fetal demise. Science. 2019;365:176–180. doi: 10.1126/science.aaw7733. [DOI] [PubMed] [Google Scholar]
- 48.Fu Y, et al. IFITM1 suppresses expression of human endogenous retroviruses in human embryonic stem cells. FEBS Open Bio. 2017;7:1102–1110. doi: 10.1002/2211-5463.12246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Krchlikova V, et al. Inhibition of placental trophoblast fusion by guanylate-binding protein 5. Sci Adv. 2025;11:eadt5388. doi: 10.1126/sciadv.adt5388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Rosspopoff O, Trono D. Take a walk on the KRAB side. Trends Genet. 2023;39:844–857. doi: 10.1016/j.tig.2023.08.003. [DOI] [PubMed] [Google Scholar]
- 51.Abajorga M, Yurkovetskiy L, Luban J. piRNA Defense Against Endogenous Retroviruses. Viruses. 2024;16 doi: 10.3390/v16111756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Singh M, et al. Transposable elements may enhance antiviral resistance in HIV-1 elite controllers. Genome Biol. 2025;26:28. doi: 10.1186/s13059-025-03484-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Johnson WE. Origins and evolutionary consequences of ancient endogenous retroviruses. Nat Rev Microbiol. 2019;17:355–370. doi: 10.1038/s41579-019-0189-2. [DOI] [PubMed] [Google Scholar]
- 54.Attig J, et al. LTR retroelement expansion of the human cancer transcriptome and immunopeptidome revealed by de novo transcript assembly. Genome research. 2019;29:1578–1590. doi: 10.1101/gr.248922.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Jang HS, et al. Transposable elements drive widespread expression of oncogenes in human cancers. Nat Genet. 2019;51:611–617. doi: 10.1038/s41588-019-0373-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Hoyt SJ, et al. From telomere to telomere: The transcriptional and epigenetic state of human repeat elements. Science. 2022;376:eabk3112. doi: 10.1126/science.abk3112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ueda MT, et al. Functional and dynamic profiling of transcript isoforms reveals essential roles of alternative splicing in interferon response. Cell Genom. 2024;4:100654. doi: 10.1016/j.xgen.2024.100654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Chen X, et al. Transposable elements are associated with the variable response to influenza infection. Cell Genom. 2023;3:100292. doi: 10.1016/j.xgen.2023.100292. [DOI] [PMC free article] [PubMed] [Google Scholar]