Skip to main content
EMBO Reports logoLink to EMBO Reports
. 2018 Jul 30;19(10):e45000. doi: 10.15252/embr.201745000

KAP1 regulates endogenous retroviruses in adult human cells and contributes to innate immune control

Christopher HC Tie 1, Liane Fernandes 1, Lucia Conde 2, Luisa Robbez‐Masson 1, Rebecca P Sumner 1, Tom Peacock 1, Maria Teresa Rodriguez‐Plata 1, Greta Mickute 1, Robert Gifford 3, Greg J Towers 1, Javier Herrero 2, Helen M Rowe 1,
PMCID: PMC6172469  PMID: 30061100

Abstract

Endogenous retroviruses (ERVs) have accumulated in vertebrate genomes and contribute to the complexity of gene regulation. KAP1 represses ERVs during development by its recruitment to their repetitive sequences through KRAB zinc‐finger proteins (KZNFs), but little is known about the regulation of ERVs in adult tissues. We observed that KAP1 repression of HERVK14C was conserved in differentiated human cells and performed KAP1 knockout to obtain an overview of KAP1 function. Our results show that KAP1 represses ERVs (including HERV‐T and HERV‐S) and ZNF genes, both of which overlap with KAP1 binding sites and H3K9me3 in multiple cell types. Furthermore, this pathway is functionally conserved in adult human peripheral blood mononuclear cells. Cytosine methylation that acts on KAP1 regulated loci is necessary to prevent an interferon response, and KAP1‐depletion leads to activation of some interferon‐stimulated genes. Finally, loss of KAP1 leads to a decrease in H3K9me3 enrichment at ERVs and ZNF genes and an RNA‐sensing response mediated through MAVS signaling. These data indicate that the KAP1‐KZNF pathway contributes to genome stability and innate immune control in adult human cells.

Keywords: 5‐azacytidine, adult human cells, endogenous retroviruses, epigenetic control, innate immune genes, KAP1 (KRAB‐associated protein 1), retrotransposons, SETDB1

Subject Categories: Chromatin, Epigenetics, Genomics & Functional Genomics; Immunology; Transcription

Introduction

Retrotransposons occupy at least half of the human genome, and individual loci reflect the genetic conflict experienced by the host throughout evolution 1. Retrotransposons are divided into two groups, those with long terminal repeats (LTR) and those without (non‐LTR). LTR retrotransposons are known as endogenous retroviruses (ERVs) because they are derived from exogenous retroviruses. ERVs constitute around 9% of the human genome 2 with important roles in human health and disease. For example, LTRs of the primate‐specific retrovirus, MER41, function as natural poised enhancers for a network of interferon‐induced genes 3. The co‐option of ERVs into normal processes such as development and immune defense is the result of co‐evolution of ERVs and their regulatory DNA sequences, which are scattered across the genome, with their hosts over millions of years 1.

Endogenous retroviruses are subject to transcriptional repression in adult tissues through DNMT1‐dependent cytosine methylation 4. This serves to prevent expression of their nucleic acids and proteins, which could potentially trigger an autoimmune response. Treatment of cancer cells with DNA demethylating agents (based on 5‐azacytidine, 5‐AZA) leads to the reactivation of ERVs 5, 6, 7 and induction of interferon‐stimulated genes (ISGs). This signaling pathway proceeds through MDA‐5, MAVS, and IRF7 and is thought to result from RNA sensing of double‐stranded RNA derived from ERVs. This mechanism may contribute to anti‐tumor immunity in patients treated with these drugs and remarkably, cancer initiating cells pretreated with 5‐AZA form fewer tumors in mice in a MAVS‐dependent manner 6. Reactivation of ERVs in adult cells has also been linked to autoimmune diseases such as multiple sclerosis 8. Despite the presence and importance of ERV transcriptional regulation in adult tissues, however, very little is known about the epigenetic control pathways in operation.

In contrast, it is well known that the KAP1 and krüppel‐associated box domain zinc‐finger protein (KZNF) pathway initiates epigenetic silencing at ERVs early in development 9, 10, 11, 12, 13. KZNFs (known as KZFPs in mouse) recognize DNA target sequences mainly within repetitive DNA 14, 15, 16, and recruit KAP1 and epigenetic modifiers including HP1, SETDB1, and DNMTs 17, 18, 19. KAP1 has been shown to bind to certain ERVs in human CD4+ T cells 11 and plays a functional role in ERV regulation in human development in neural progenitor cells 20. KAP1 has been depleted in several differentiated murine cell types with little effect on ERVs 21, 22. SETDB1, in contrast, which exerts KAP1‐dependent and KAP1‐independent roles in ERV silencing 13, has an established role in repressing ERVs in somatic cells. It is augmented in certain cancer cells where it functions in immune evasion 23, and it is necessary for the silencing of ERVs in mouse B lymphocytes 24 and in mouse embryonic fibroblasts 25.

The potential role and relevance of KAP1 in differentiated cells, particularly in adult human tissues, is a fundamental and open question, which we set out to address here by employing CRISPR/Cas9‐mediated genome editing.

Results

The human retrovirus HERVK14C is repressed by KAP1 in undifferentiated and differentiated cells including adult PBMCs

We selected the ERV lineage HERVK14C as a tool to explore KAP1 function in differentiated cells because it has been reported to be KAP1‐regulated in undifferentiated human embryonic stem cells (ESCs) 11. HERVK14C is restricted to Great Apes and Old World Monkeys (Fig 1A) and is a low copy number ERV (Fig EV1A), making it relatively easy to study. We designed two primer sets to detect specific HERVK14C loci (Fig 1B) and employed KAP1 depletion and qRT–PCR to confirm that KAP1 represses these ERVs in undifferentiated OCT4‐expressing embryonic NTERA‐2 cells (Figs 1C and EV1B). We then focused on two differentiated cell lines, HeLa and 293T cells, and generated KAP1 knockout clones using CRISPR/Cas9 genome editing. We validated KAP1 knockout and complementation functionally using described KAP1‐KZNF reporters 9, 26 (Fig EV1C and D), and we discovered that like in undifferentiated cells, KAP1 represses HERVK14C in differentiated cell lines (Fig 1D and E). Two other KAP1‐regulated retrotransposons, L1PA4 (an L1 subfamily) and SVA D (a SVA subfamily), in contrast, were less affected by KAP1 depletion (Fig 1C–E). One possibility why SVAs may be harder to resurrect could be due to their enriched cytosine methylation (Fig EV1E). Most importantly, we found using shRNA‐mediated RNAi that KAP1 is necessary to regulate ERVs not only in cell lines but also in adult peripheral blood mononuclear cells (PBMCs), and to a lesser extent in CD4+ T cells (Figs 1F and EV1F–H), in which the HERVK14C 5′LTR was enriched for cytosine methylation (Fig EV1I). Overall, these data suggest that KAP1 regulation persists in adult human cells.

Figure 1. The human retrovirus HERVK14C is repressed by KAP1 in undifferentiated and differentiated cells including adult PBMCs.

Figure 1

  • A
    Schematic diagram showing the age of HERVK14C.
  • B
    Venn diagram showing the chromosome copies detected by the two primer sets.
  • C
    qRT–PCR expression of endogenous repeats following shRNA‐mediated KAP1 depletion in NTERA‐2 cells. Results were normalized to β2 microglobulin (B2M). KAP1 expression levels were verified by qRT–PCR and Western blot. A representative experiment of two experiments is shown. Two‐tailed unpaired t‐test P‐values are as follows: 0.002300 (HERVK14C_2; “shControl” compared to “shKAP1” at time points Day 4 and Day 6 combined).
  • D, E
    qRT–PCR expression of endogenous repeats following KAP1 knockout in HeLa (D) and 293T cells (E). Results were normalized to B2M. KAP1 expression levels were verified by qRT–PCR and Western blot. For (E), we found that clone I (also treated with KAP1 sgRNA) retained KAP1 protein expression (E, right panel), so we only used clones II, III, and IV to explore phenotype (E, left panel), which represented validated KAP1 knockouts. Two‐tailed unpaired t‐tests were done to compare the controls (“Hela WT” in panel D or “293T WT” in panel E) against all respective KO clones (4 HeLa KO clones in panel D or 3 293T KO clones in panel E). P‐values are as follows: (D) HERVK14C_1: 0.0017, HERVK14C_2: 0.0044, SVA D VNTR: 0.0243; (E) HERVK14C_2: < 0.0001. Clones 8, 12, and 15 (from D) were selected for mRNA‐sequencing. n = 4.
  • F
    qRT–PCR expression of endogenous repeats following shRNA‐mediated KAP1 depletion in PBMCs (day 6 post‐transduction). Results were normalized to B2M. Unpaired t‐tests: P‐values: 0.0394 (L1PA4), 0.0079 (HERVK14C), 0.0009 (SVA D VNTR), 0.0033 (GAPDH), <0.001 (B2M). n = 3.
Data information: All error bars show standard deviation (SD). All numbers above bars depict fold changes compared to control cells (to one decimal place). ***P < 0.001, **P < 0.01, and *P < 0.05.Source data are available online for this figure.

Figure EV1. Validation of cell types and functional validation of KAP1 knockout.

Figure EV1

  • A
    Chromosome map showing the HERVK14C loci with at least one internal open reading frame and long terminal repeats. Note that many copies are present on the Y chromosome so are not relevant in female cells.
  • B
    NTERA‐2 and 293T cells were stained for OCT4 intracellularly and were subjected to flow cytometry analysis. A PE isotype control antibody was used as a negative control.
  • C
    KAP1 knockout HeLa cells were validated using a known KAP1‐KZFP target sequence 12, 26: Wild‐type HeLa cells, KAP1 knockout cells, and KAP1 reconstituted knockout cells (+KAP1 cDNA) were doubly transduced with a PBS‐Pro‐GFP reporter vector (from 10) and a vector expressing either the cognate ZFP, Zfp809, or a control ZFP, Zfp819 (left). KAP1 expression levels are shown by Western blot (right). In “Mock” controls, the ZFP expression vectors were replaced with same volume of media. Error bars show SD, N = 3. Two‐tailed unpaired t‐test P‐values are as follows: HeLa wild type: 0.033; KAP1 KO clone_1: 0.3185; KAP1 KO clone_2: 0.0036. Note that P‐values are not available for the KAP1 KO (bulk) bars only due to N < 3.
  • D
    KAP1 knockout 293T cells were validated using known KAP1‐KZNF target sequences (constructs were a kind gift from David Haussler) 9. KAP1 wild‐type and knockout 293T cells were co‐transfected with the luciferase reporter construct, a Renilla control plasmid and constructs expressing either ZNF91 or ZNF93 (see Materials and Methods). Two‐tailed unpaired t‐test P‐values are as follows: WT SVA_ZNF91: < 0.0001; WT LINE1_ZNF93: < 0.0001; KO SVA_ZNF91: 0.6008; KO LINE1_ZNF93: 0.0841. Error bars show SD, N = 3.
  • E
    DNA methylation analysis of endogenous SVAs in 293T cells. Primers were designed on a SVA copy on chromosome 7 but primers recognize 219 copies of SVAs, some of which exhibit CpG deletions or mutations (shown by “x” on the CpG map). PCR for the OCT4 gene body was used as a positive endogenous control for cytosine methylation.
  • F
    Shows an independent PBMC experiment with a different donor to that shown in Fig 1F with the same time point (day 6). Expression was normalized to B2M. Note that KD efficiency is measured on the bulk population of cells and not reflective of the KD efficiency within the population of cells most efficiently transduced, which is typically lower for PBMCs than cell lines. Two‐tailed unpaired t‐test P‐values are as follows: <0.0001 (LIPA4) and (KAP1 normalized to GAPDH).
  • G, H
    Primary CD4+ T cells were activated and transduced with KAP1 knockdown or control vectors. Cells were harvested at different time points for (G) Western blot and (H) qRT–PCR with B2M normalization. Two‐tailed unpaired t‐test P‐values are as follows: 0.64 (HERVK14C_1; “shControl” compared to “Day 9 shKAP1.2”) and 0.04 (HERVK14C_2; “shControl” compared to “Day 9 shKAP1.2”). Error bars show SEM, n = 2.
  • I
    DNA methylation status of the HERVK14C LTR region on chromosome 15 in CD4+ T cells as tested using bisulfite sequencing.
Data information: All numbers above bars depict fold changes compared to control cells (to one decimal place). ***P < 0.001, **P < 0.01, and *P < 0.05.Source data are available online for this figure.

A common role for KAP1 in repressing ERVs and ZNF genes

To gain insight into the role of KAP1 in differentiated cells, we selected KAP1 knockout HeLa cell clones (from Fig 1D) for mRNA‐sequencing. Sequencing reads were mapped to the human genome and to RepBase to assess the global expression of genes and repetitive elements and we focused on significant changes (where differential effects were > 2‐fold with P‐values < 0.05). Results reveal that KAP1 knockout induces a very specific phenotype involving overexpression of ERVs and ZNF genes (Fig 2). ERVs (HERV‐S, HERV‐K, and HERV‐T, all of which are derived from exogenous retroviruses 27, 28) were the only class of repetitive elements overexpressed in knockout HeLa cells (Fig 2A and Dataset EV1), and as expected, these ERV families were also bound by KAP1 according to ENCODE data (Fig 2A and Appendix Fig S1A). These ERVs are present in diverse primates and other mammals (Fig 2B). Of note, several retrotransposons were downregulated in knockout cells, which might represent indirect effects (Appendix Fig S1B). The fact that few ERVs were reactivated may relate to the low number of active ERV transcriptional units in the human compared with the mouse genome 29.

Figure 2. A common role for KAP1 in repressing ERVs and ZNF genes through H3K9me3.

Figure 2

  1. Boxplots showing repeats significantly upregulated (> 2‐fold where P < 0.05 using DESeq2) in knockout (N = 3) compared to wild‐type (N = 3) HeLa cells based on mRNA‐sequencing data. *P = 0.0174 (HERV‐S), **P = 0.0047 (HERV‐K14C), ***P = 0.00013 (HERV‐T LTR6B), and ***P = 1.90E‐06 (HERV‐T HERVS71). HERV‐T and HERV‐S but not HERVK14C also reached significance when only adjusted P‐values were considered, where differences are compared to the whole of RepBase. KAP1 binding (ENCODE data) is shown in Appendix Fig S1A. All numbers above bars depict fold changes compared to control cells (to one decimal place). Error bars show SD.
  2. Evolutionary tree showing the age of KAP1‐repressed HERVs identified in (A). Here, HERV‐K refers to HERVK14C. Estimated ages of stated lineages are given and marked with a star. Myr: million years.
  3. The 38 upregulated genes identified (> 2‐fold where P adj < 0.05) were converted to DAVID IDs and used for gene ontology analysis. Three gene clusters were significantly enriched (P‐value < 0.05, drawn on the plot as a dotted line) in the data set: zinc‐finger proteins (P = 2.2 × 10−7), negative regulators of metabolic processes (P = 0.0036), and angiogenesis (P = 0.011). Venn diagrams on the right show numbers of upregulated genes, ZNFs and KZNFs, and the overlap.
  4. The nature of conserved KAP1 binding sites between human ESCs 11 and 293Ts (ENCODE) (see Fig EV2D) is shown (left pie chart) compared to their abundance in the genome (right pie chart).
  5. The 614 KAP1 common binding sites (see Fig EV2D) were interrogated for their nearest gene, and this gene list was converted to DAVID IDs and used for gene ontology analysis. Four gene clusters were significantly enriched (P‐value < 0.05, drawn on the plot as a dotted line): zinc‐finger proteins (P = 1.1 × 10−19), fibronectin folding (P = 0.016), protein phosphatase (P = 0.033), and synapse (P = 0.047). Venn diagrams on the right show numbers of KAP1‐bound loci, ZNFs and KZNFs, and the overlap.
  6. Genomic coordinates of the common KAP1 sites identified in Fig EV3D were subjected to ChIP‐seq correlation analyses using ChIP‐Cor software (see Materials and Methods). Each plot shows duplicate ChIP‐seq experiments from ENCODE. See Fig EV3B for complete data.

Interrogation of the transcriptome showed that KAP1 knockout also affects several hundred cellular genes (Fig EV2A and B, Dataset EV2). When we focused on upregulated genes (> 2‐fold where P adj < 0.05), we found that strikingly, ZNF genes, of which there were 13 (9 of which were KZNFs) are the class of cellular genes most significantly overexpressed (Fig 2C, with the highest 6.81× overexpressed, Fig EV2C). All of these KZNFs are also direct KAP1‐binding targets (using ENCODE data; Fig EV2C), illustrating that KAP1 plays a functional role in transcriptional regulation of these genes, a notion that was previously only hypothesized 30, 31.

Figure EV2. Cellular genes are among the targets of KAP1 regulation.

Figure EV2

  1. Examples of cellular genes upregulated (ZNF432, ZNF569, SNAI1, CYP17A1, CTGF shown) or downregulated (KAP1 shown) upon KAP1 knockout in HeLa cells based on mRNA‐sequencing data. BigWig files were uploaded to the UCSC genome browser as URL links and the same scale (stated) applied to all samples.
  2. The top 100 most significantly downregulated genes (> 2‐fold with a P adj‐value < 0.05) were converted to DAVID IDs and used for gene ontology analysis. All shown gene clusters were significantly enriched (P‐values < 0.05) in the data set: signal peptide (P = 8.9 × 10−19), glycoprotein (P = 3.6 × 10−19), cell adhesion (P = 2.5 × 10−10), extracellular matrix (P = 1.6 × 10−8), EGF‐like region (P = 0.000004), wounding response (P = 0.000006), plasma membrane (P = 0.000029), cell‐cell adhesion (P = 0.000160), and innate immune response (P = 0.000780).
  3. Table showing that all upregulated KZNFs (from Fig 2C) are KAP1‐bound according to ENCODE data.
  4. Venn diagrams showing that some KAP1 binding sites are common between human ESCs and 293T cells; 614 common binding sites were identified based on their presence in duplicate KAP1 ChIP‐sequencing experiments in human ESCs as well as in duplicate KAP1 ChIP‐sequencing experiments in 293T cells. Data are from 11 and ENCODE (293T cells).

Conserved KAP1 binding and H3K9me3 at ERVs and ZNF genes

Since KAP1 repression of ERVs and ZNF genes is a phenotype that has also been observed in ESCs 10, 11, we hypothesized that KAP1 may bind ERVs and ZNF genes at common genomic loci in both undifferentiated and differentiated cells. To address this question, we determined whether any KAP1 binding sites were common between human ESCs and differentiated cells (293T cells) using public ChIP‐seq data 11 and ENCODE data. We identified 614 common peaks (Fig EV2D and Dataset EV3). We found these loci to be highly enriched for ERVs compared to their abundance in the genome (Fig 2D). We determined the nearest gene to each of these sites, and interestingly, gene ontology analysis showed that the most common gene cluster was ZNFs (of which there were 61, including 40 KZNFs; Fig 2E), mirroring our functional data for the upregulated genes (Fig 2C). Finally, examining the occurrence of LINE1 elements and ERVs within KAP1 binding sites in (i) human ESCs or in (ii) 293T cells or (iii) within common sites revealed a similar pattern of distribution, except that ERV1 elements were slightly enriched within the common sites (Fig EV3A). This suggests that the landscape of KAP1 binding may be mainly conserved between these cell types. We then found using ENCODE data that common KAP1 sites correlate with the silent chromatin mark H3K9me3, as well as with KAP1 and SETDB1 in multiple cell types including primary adult cells (PBMCs; Figs 2F and EV3B for the whole dataset). There was no correlation with active histone marks (Fig EV3B).

Figure EV3. Conserved KAP1 binding and H3K9me3 at ERVs and ZNFs.

Figure EV3

  1. Comparison of the distribution of KAP1‐bound ERVs and LINE1 elements between undifferentiated cells (human ESCs), differentiated cells (293Ts), and KAP1 common sites (sites conserved between human ESCs and 293T cells). Here, all genomic ERVs or LINE1s were intersected with the stated KAP1 peaks. Total number of intersections (15,399 for KAP1 ESC, 2,519 for KAP1 293T, and 742 for KAP1 common) is different to the total number of KAP1 peaks per dataset (shown) because some peaks intersect multiple or no repeats. For more information on the identity of each common KAP1 peak, see Dataset EV3.
  2. Complete data of which some is shown in Fig 2F: Genomic coordinates of the common KAP1 sites identified in Fig EV2D were subjected to ChIP‐seq correlation analyses using ChIP‐Cor software (see Materials and Methods). Each plot shows duplicate ChIP‐seq experiments from ENCODE.
  3. The binding sites (where known 14, 16) within repeats and genes of the top 100 most highly expressed (defined by RNA‐seq RPKM) KZNFs in 293T cells. These KZNFs were also confirmed to be expressed at the protein level (http://www.proteinatlas.org/cell).
  4. The binding sites within repeats and genes of KAP1 at common KAP1 sites.
  5. KZNFs within the top 100 most highly expressed group that are known to bind ERVs were assessed for their age and type of ERV that they bind to.

KZNFs expressed in differentiated human cells target ERVs

We reasoned that a subset of KZNFs must be widely expressed and at a sufficient level to recruit KAP1 to common DNA target sites. We therefore identified the top one hundred most highly expressed KZNFs in 293T cells using mRNA‐sequencing data 32, verified them to be expressed at the protein level (the human protein atlas 33), and recorded their targets where known 14, 16 (Fig EV3C and Dataset EV4). Many of these KZNFs bind to ERVs (Fig EV3C), reflecting KAP1 binding at common loci (Fig EV3D). Of note, ZNF274 is widely expressed (Dataset EV4) and known to bind the 3′ ends of ZNF genes 30. Interestingly, the ERV‐binding KZNFs are widely conserved among primates or mammals and recognize ERVL or ERV1 sequences (Fig EV3E). These data show that KAP1 and KZNFs repress ERVs and ZNF genes in multiple cell types and suggest that core KZNF‐ERV interactions may be conserved between species.

PBS‐dependent de novo repression of ERVs is conserved in primary adult cells

We next asked whether differentiated cells were equipped with high enough levels of KAP1 and its associated epigenetic machinery to induce de novo repression of incoming ERV sequences. KAP1/KZNFs are known to target several retroviruses in ESCs including HERV‐K through the primer binding site (PBS) 11, 12, 34 as well as other sites 21, 35. We selected the KAP1‐bound and repressed HERVK14C integrant on chromosome 15 as a model to measure de novo KAP1 repression in a reporter assay. This provirus contains a variant lysine PBS sequence (PBSChr15), which we cloned upstream of the HERVK14C LTR into a reporter vector (termed PBSChr15‐LTR‐GFP) and tested alongside a vector containing the consensus PBS‐lys1,2 sequence (PBS‐LTR‐GFP) or a control vector with the HERVK14C LTR alone (LTR‐GFP; Fig 3A). Using this system, we could demonstrate that the HERVK14C chromosome 15 PBS sequence and the consensus PBS sequence could induce KAP1‐dependent reporter repression in both undifferentiated and differentiated cells (Fig 3B and C). Most importantly, we also detected PBS‐dependent reporter repression in PBMCs, in which we verified that repression was not due to lack of reporter integration (Fig 3D). This suggested that a subset of KZNFs must be expressed in multiple differentiated cell types to recruit KAP1 to ERVs. To find out how many KZNFs are expressed at the mRNA level in HeLa cells, 293T cells, macrophages, and CD4+ T cells, we analyzed mRNA‐sequencing data and found 77 (Fig 3E, Dataset EV5) to be conserved. We verified that several KZNFs are also expressed at the protein level (Fig EV4).

Figure 3. PBS‐dependent de novo repression of ERVs is conserved in primary cells.

Figure 3

  1. Schematic diagram showing reporter constructs used. The 18‐bp PBS sequence was cloned upstream of the HERVK14C LTR promoter (which is identical in all vectors) in an antisense orientation.
  2. NTERA‐2 cells were transduced with either an empty vector (WT) or the same vector containing an shRNA against KAP1 (KD) prior to puromycin selection and transduction with increasing doses of GFP reporter vectors. GFP expression was analyzed 72 h post‐reporter transduction. A representative experiment of two experiments is shown. Two‐tailed unpaired t‐test *P‐values: PBS‐LTR‐GFP: 0.0148; PBSChr15‐LTR‐GFP: 0.0377.
  3. The same as (B) except that here KAP1 wild‐type (WT) and knockout (KO) 293T cells were used. *P‐values: PBS‐LTR‐GFP: 0.0044; PBSChr15‐LTR‐GFP: 0.0077.
  4. PBMCs were transduced with increasing doses of GFP reporter vectors (vectors were normalized to the same number of infectious units after titering them on permissive cells), left plot. Vectors were integrated at similar levels as measured by GFP Taqman qPCR, right plot. A 293T cell line with a single vector copy integrant (PGK‐GFP) was used as a control to estimate the absolute copy numbers. A representative experiment of three experiments is shown here. Two‐tailed unpaired t‐test **P‐value = 0.0042 (doses 3 and 4). Error bars show SD.
  5. The expression of 340 KZNFs was assessed in multiple cell types using public mRNA‐sequencing data, and Venn diagrams show the overlap in expression profiles. KZNFs were considered to be expressed in a particular cell type when their expression value was > 0.5 RPKM in all replicates.

Figure EV4. KZNFs are expressed at the protein level in differentiated cells.

Figure EV4

Western blots showing the protein expression levels of ZNF37A, ZNF33A, and ZNF320 in HeLa cells, 293T cells, and NTERA‐2 cells. Arrowhead indicates expected size of ZNF33A.Source data are available online for this figure.

Cytosine methylation acts on KAP1‐regulated retrotransposons and prevents innate immune activation

Endogenous retroviruses are subject to epigenetic repression in cancer cells, and disruption of cytosine methylation with anti‐cancer 5‐AZA‐based drugs leads to immune activation 5, 6, but it is unclear which epigenetic factors regulate 5‐AZA‐modulated ERVs. We asked whether KAP1, like cytosine methylation, is involved in controlling an intrinsic innate immune response by depleting KAP1 or treating cells with 5‐AZA and measuring derepression of ERVs and induction of ISGs. We found that depletion of KAP1 alone leads to activation of the interferon‐stimulated chemokines, CCL5 and CXCL10 in Hela cells but not in PBMCs (Fig 4A, Appendix Fig S2A and B). Of note, KAP1 depletion was less effective in PBMCs (Figs 1F and EV1F). In contrast, 5‐AZA treatment leads to potent ISG induction in 293T cells, HeLa cells, and primary cells (Fig 4B, Appendix Fig S2C). 5‐AZA is thought to act through reducing DNA methylation at retrotransposons rather than at ISGs. We verified that these ISG promoters were not methylated (ENCODE reduced representation bisulfite sequencing data and Ref. 6) and found that 5‐AZA treatment leads to SVA demethylation (Fig 4C). KAP1 depletion has little effect on SVA DNA methylation, perhaps explaining its only modest effect on ISGs (Fig 4C). The HERVK14C LTR, in contrast to SVAs, retains only low levels of DNA methylation that are little impacted by KAP1 depletion or 5‐AZA (Appendix Fig S2D).

Figure 4. Cytosine methylation acts on KAP1‐regulated retrotransposons and prevents innate immune activation.

Figure 4

  1. qRT–PCR expression of endogenous repeats (left) and interferon stimulated genes (ISGs) (right) following shRNA‐mediated KAP1 depletion in HeLa cells (day 6 post‐transduction), normalized to B2M and GAPDH, respectively. n = 2. See also Appendix Fig S2A and B.
  2. qRT–PCR expression of endogenous repeats (left) and ISGs (right) following 5‐AZA treatment of HeLa cells and PBMCs (day 6 post‐transduction). A representative experiment of two is shown in each case. Results were normalized to GAPDH and B2M (GAPDH results shown). n = 2. See also Appendix Fig S2C for results in 293T cells.
  3. DNA methylation at endogenous SVAs was measured over 18 CpGs in the stated treatment groups (day 5 post‐transductions, or day 2 post‐5‐AZA treatment). Each point represents the average methylation state of one sequence with at least 10 sequences analyzed per group. Mann–Whitney U‐tests were performed to compare shControl to shKAP1 (P = 0.6593) and DMSO to 5‐AZA (P = 0.0027). 5‐AZA was added to all experiments at 7 μM.
Data information: All error bars show SD of technical triplicates. All numbers above bars depict fold changes compared to control cells (to one decimal place).

KAP1 contributes to innate immune control

Once ISGs are activated, negative regulation of the interferon response prevents chronic immune activation 36. Interestingly, we found that 12 immune response genes (Appendix Table S3) were downregulated in stable KAP1 knockout HeLa cell clones, suggesting that they may have been first activated. Of note, negative phenotypes would have been outselected during single‐cell cloning of knockout clones and we found that initial KAP1 depletion caused a growth defect (Appendix Fig S2E) as described for SETDB1 depletion 25. Therefore, we asked whether KAP1 depletion would lead to more potent immune induction in cells where reactivation of ERV nucleic acids and proteins is most pronounced. For this, we employed mRNA‐sequencing data from naïve mouse ESCs depleted of epigenetic factors 37. Indeed, we found that KAP1 depletion led to activation of genes involved in innate and adaptive immunity (Fig EV5A and B). SETDB1 depletion revealed similar results with both depletions overlapping in their activation of innate immune genes as the most significant gene cluster induced (P = 0.0017; Fig EV5C and D). Overall, these results suggest that KAP1 contributes to the regulation of retrotransposons and innate immune genes.

Figure EV5. Common genes induced upon KAP1 or SETDB1 depletion are most enriched in innate immunity.

Figure EV5

  1. We recently found that KAP1 and SETDB1 depletion exerts the greatest impact on retrotransposon transcriptional and protein upregulation in naïve mESCs 37. Data from this work (GSE107840) were therefore subjected to DAVID gene ontology analyses to see whether innate immune genes were induced. We focused on genes induced > 2‐fold with adjusted P‐values < 0.05.
  2. 983 genes were induced upon KAP1 depletion with 18% involved in immune or cell death‐related clusters, which are listed and highlighted. Only clusters with P‐values < 0.05 were included. Both innate and adaptive immune pathways were activated.
  3. 456 genes were induced upon SETDB1 depletion with 29% involved in immune or cell death‐related clusters as listed and highlighted. A clear innate immunity phenotype was activated.
  4. 183 identical genes were induced in either KAP1‐ or SETDB1‐depleted naive mESCs. Innate immunity was the most highly enriched gene cluster.

KAP1 depletion leads to a decrease in H3K9me3 at ERVs and a MAVS‐dependent innate immune response

A key question was how KAP1 is regulating ERVs and the innate immune response in differentiated human cells. We hypothesized that KAP1 regulates ERVs through H3K9me3 maintenance as is the case in ESCs 10, 38. We established that H3K9me3 was enriched at ZNF239, HERVK14C, and SVAs in both HeLa cells and 293T cells (Fig 5A). We then used 293T cells due to their amenity to genetic manipulation and depleted KAP1 or SETDB1 (Fig 5B, left), and found that either depletion led to a decrease in H3K9me3 at ZNF239 and HERVK14Cs, as well as HERVK14C reactivation (Fig 5B, right).

Figure 5. KAP1 depletion leads to a decrease in H3K9me3 at ERVs and a MAVS‐dependent innate immune response.

Figure 5

  1. H3K9me3 ChIP‐PCR where error bars show SD of three independent IPs. Results show the IP enrichment relative to the total input normalized to GAPDH as a negative control region. IgG control IPs gave only background enrichments, shown.
  2. 293T cells were transduced with the stated shRNAs and cells harvested 4 days after puro selection for Western blot to verify KAP1 and SETDB1 depletion (left) and ChIP (right). Cell numbers were normalized per treatment group before sonication and IPs were performed in triplicate. Results show IP relative to total input and GAPDH normalized. IgG control IPs gave only background enrichments shown. One representative experiment of two is shown. Unpaired t‐tests were performed. P‐values: ZNF239: KAP1, 0.0292; SETDB1, 0.0137; HERVK14C_1: KAP1, 0.0227; SETDB1, 0.0082; HERVK14C_2: SETDB1, 0.0462; SVA D VNTR, SETDB1, 0.0064. The inlay shows qRT–PCR expression analysis of HERVK14C in the same experiment, normalized to B2M or GAPDH as stated. All error bars show SD.
  3. Venn diagram showing H3K9me3‐enriched sites (18,271) as those identified in both ChIP replicates in the human cell line K562 (left) using data from Ref. 39. Right: % of ISGs or random genes intersecting a H3K9me3 or KAP1 ChIP‐seq peak. 5 kb upstream and downstream of each gene was also included. N = 437 genes per group.
  4. An IFIT1 promoter‐based luciferase reporter THP‐1 cell line was used or 293T cells or HeLa cells transduced with an IFNB promoter‐based luciferase reporter. These cell lines were transduced with an shKAP1 or shControl vector or a vFLIP‐expressing vector and luciferase measured 6 days later. Error bars show SD. Unpaired t‐tests were used to compare shControl to shKAP1 samples: P‐values = 0.0122 (THP‐1), 0.0001 (HeLa), 0.0001 (293T cells). n = 3.
  5. WT, STING KO, and MAVS KO THP‐1 cells were verified by Licor blot.
  6. Cells from (E) were untreated or transduced with shControl or shKAP1 vectors and luciferase read over time (left). Cells were also treated with control stimulants (right). Results show the average luciferase secreted over two time points (days 5 and 6 for the left plot and days 1 and 2 for the right plot) with all error bars showing SEM. HT‐DNA: herring testis DNA. n = 2.
Data information: ***P < 0.001, **P < 0.01, and *P < 0.05.Source data are available online for this figure.

We next asked whether ISGs could be directly regulated by KAP1 and H3K9me3 by employing KAP1 and H3K9me3 ChIP‐seq data 11, 39. For KAP1, we had identified 6,148 binding sites (Fig EV2D), and for H3K9me3, we found 18,271 enriched sites (common to duplicate ChIPs, Fig 5C, left). We identified 437 ISGs (genes induced 10‐fold upon IFN treatment), using the tool: http://www.interferome.org 40 and also produced three independent sets of 437 randomly selected genes. These genes (including 5 kb each side) were intersected with the H3K9me3 or KAP1 peaks, which showed no significant differences of KAP1 (present on 6% of ISGs) or silent chromatin (H3K9me3, present on 14% of ISGs) per group (Fig 5C, right). This suggests that KAP1 does not regulate ISGs by directly binding to them. This is also important for understanding the mechanism of 5‐AZA treatment of cancer since 5‐AZA can impact on H3K9me3 deposition 41.

Finally, we asked whether the innate immune response activated upon KAP1 depletion is caused by an RNA‐sensing or a DNA‐sensing response. To address this question, we first selected THP‐1 cells, in which we detected robust induction of a luciferase‐based ISG reporter upon KAP1 depletion (Fig 5D). The reporter was IFIT‐1‐promoter‐driven luciferase in THP‐1 cells and IFNβ‐promoter‐driven luciferase in 293T and HeLa cells. Here, viral FLIP (vFLIP) (from KSHV) encoded from a lentiviral vector served as a positive control because it constitutively activates NFκB and the immune response 42. We then employed CRISPR/Cas9 genome editing to disrupt the innate adaptors STING or MAVS in THP‐1 cells (Fig 5E) and performed ISG reporter assays. MDA‐5 and RIG‐I RNA‐sensing pathways are dependent on MAVS signaling 43, whereas the cGAS DNA‐sensing pathway is dependent on STING 44. Results revealed that ISG reporter induction observed upon KAP1 depletion is mainly dependent on MAVS signaling, consistent with an RNA‐sensing innate immune response (Fig 5F). As expected, MAVS KO THP‐1 cells were unable to mount a response to poly:IC but were competent for DNA sensing (of herring testis DNA, HT‐DNA), whereas the reverse was true for STING KO THP1 cells (Fig 5F).

Discussion

The present work highlights the previously unknown role for KAP1 in repressing ERVs and ZNF genes in differentiated human cells including adult cells, and in contributing to innate immune control. KAP1 is relevant to the mode of action of anti‐cancer 5‐AZA‐based drugs since KAP1 and cytosine methylation repress some of the same ERVs and suppress MAVS‐dependent RNA sensing (see Fig 6 for a model). However, KAP1 knockout in differentiated cells impacts on ERVs and cellular genes only modestly compared to the more pronounced effects observed following KAP1 knockout in mouse or human ESCs 11, 22. This may relate to the increased requirement for KAP1 in early development to establish epigenetic marks 13. Of note, the fact that 5‐AZA exerts a dramatic effect on ISGs compared with KAP1 depletion could be due to a combination of effects this drug has on chromatin 41.

Figure 6. Model.

Figure 6

KAP1 regulates ERVs and ZNF genes in differentiated human cells including adult PBMCs and contributes to innate immune control. KAP1 and SETDB1 binding 30 are detected at ERVs and ZNF genes in differentiated cells and overlap with the silent chromatin mark H3K9me3 as well as some cytosine methylation, which we detect at HERVK14C and SVAs. KAP1 inactivation reveals that the KAP1‐KZNF pathway is functionally intact and required in differentiated cells including PBMCs. KAP1 depletion leads to a decrease in H3K9me3 at retrotransposons and to reactivation of HERVs and SVAs. These retrotransposons or potentially other endogenous RNAs produce double‐stranded RNA structures, which activate a type I interferon response through MAVS signaling. KAP1 depletion is not sufficient for global ISG induction, suggesting it exerts partial redundancy at repressing ERVs with other epigenetic mechanisms in differentiated human cells.

We found that a cohort of KZNFs are widely expressed at the mRNA and protein level in human cells and may restrict incoming retroviruses. Intriguingly, the mouse KZFP, ZFP809 is rapidly degraded at the protein level in mouse differentiated cells, restricting its potency mainly to ESCs 45. An open question is whether KZNF expression is modulated by viruses, interferon treatment, or other stimuli as an antiviral strategy. Finally, it is possible that KAP1 repression of common loci is dynamic and KAP1‐bound ERV regulatory sequences may have been co‐opted to activate cellular gene networks. We found here that KAP1 regulates HERV‐T and HERVK14C, both of which show evidence of co‐option 27, 46. KAP1 is often enriched at regulatory hubs together with other transcription factors (Appendix Fig S1).

Future work will be necessary to determine exactly which factors act in addition to KAP1 to maintain ERV silencing in differentiated cells. Many epigenetic regulators exert cross‐talk with DNA methylation including SETDB1 47, which can act independently or in concert with KAP1 25 and which itself has a reported role in innate immune control in cancer cells 23. Another key question concerns the identities of the potential immuno‐stimulatory RNAs that may derive from retrotransposons producing secondary structure. Note, however, that to date there is no direct link known between ERV upregulation and an ISG response.

Hundreds of KZNFs have recently been described to target specific sequences within repetitive DNA, including ERVs 14, 15, 16. The relevance of such interactions, however, remains unknown. Our work illustrates that cell lines rather than ESCs could be used to determine the individual functions of a core set of KZNFs that are widely expressed through their targeted gene knockout. Most interestingly, KAP1 and widely expressed KZNFs in combination with other epigenetic factors may become new drug targets for cancer with the aim to reactivate subsets of ERVs and other retrotransposons and harness their natural ability to trigger innate and adaptive anti‐tumor immunity.

Materials and Methods

Cell culture

Human teratoma‐derived NTERA‐2 cells (kind gift from Peter Andrews, University of Sheffield) were cultured in Dulbecco's modified Eagle's medium (DMEM, Gibco) high glucose, supplemented with 2 mM l‐Glutamine, 10% fetal calf serum (FCS), and 1% Penicillin/Streptomycin (P/S). They were split 1:2 or 1:3 every 3–4 days by cell scraping. HEK293T (293T) and HeLa cells were grown in standard DMEM + 10% FCS and P/S and split 1:4 every 2 days using trypsin. Human primary CD4+ T cells were grown in Roswell Park Memorial Institute medium (RPMI, Gibco) supplemented with 10% human serum and 10 U/ml of recombinant IL‐2 and activated using αCD3 and αCD28 pre‐coated flasks for 72 h during which time IL‐2 was increased to 25 U/ml. Peripheral blood mononuclear cells (PBMCs) were grown in RPMI supplemented with 20% FCS and 10 U/ml of IL‐2 and activated using 3 μg/ml of Phytohemagglutinin‐M (PHA) for 72 h. The THP‐1 ISG reporter cell line (THP‐1‐IFIT‐1‐GLuc) was a gift from Veit Hornung. 5‐azacytidine (5‐AZA) was added at 7 μM or vehicle only (DMSO).

CRISPR/Cas9 genome editing

Guide RNAs (sgRNAs, see Appendix Table S2) specific to several different exons of KAP1 (exons 1 and 9) were designed using the website: http://crispr.mit.edu/and cloned into the PX459 plasmid (Addgene), which was then transfected into HeLa and HEK293 T cells. After 24 h, the cells were subjected to puromycin selection for 24 h or until control cells had completely died. The bulk population was then used for single‐cell cloning by limiting dilution. Knockout was assessed across a panel of clones by Western blotting using the KAP1 antibody, MAB3662 (Millipore), and validated functionally using KAP1‐KZNF reporter assays. For knockout of STING or MAVS, sgRNAs (listed in Appendix Table S2) were cloned into lentiCRISPRv2 and THP‐1 cells transduced, selected with puromycin and single cell cloned as above.

Primary cell isolation

peripheral blood mononuclear cells were isolated from a buffy coat or from fresh blood from healthy donors using lymphoprep (Axis‐Shield). CD4+ T cells were obtained from the isolated PBMCs using the CD4+ T cell Isolation Kit (Miltenyi Biotec) according to manufacturer's instructions. The purity of the cells was verified by flow cytometry following antibody staining. Antibodies used are shown in Appendix Table S1.

Western blotting

1–2 × 106 cells were washed with PBS and lysed in NuPAGE LDS sample buffer (Thermo Fisher) with 5% β‐mercaptoethanol. Samples were sonicated for 90 s at 20 Hz and heated at 95°C for 5 min. Lysates were then loaded onto handcast 10% SDS–polyacrylamide gels in tris/glycine/SDS buffer and mini‐PROTEAN tanks (Bio‐Rad), followed by wet transfers onto polyvinylidene difluoride (PVDF) membranes. Antibodies used for blotting the membrane are listed below. All secondary antibodies were horseradish peroxidase‐conjugated (GE healthcare), and membranes were developed using ECL kits (ECL, Prime or Select kits from Amersham).

shRNA lentiviral vectors and transduction

Hairpin sequences against human mRNAs were designed using the Clonetech RNAi designer website (http://bioinfo.clontech.com/rnaidesigner/) and annealed into oligo duplexes. The duplexes were then cloned into an shRNA vector (HIV SIREN) at BamHI‐EcoRI sites, and the products were checked by sequencing. VSV‐G‐pseudotyped lentiviral vectors were produced by co‐transfecting 293T cells in 10‐cm plates with 1.5 μg of the shRNA plasmid, 1 μg p8.91, and 1 μg pMDG2 encoding VSV‐G. The supernatant was harvested on two consecutive days (from 48 h post‐transfection) and used neat or concentrated via ultracentrifugation (20,000 × g for 2 h at 4°C) for primary cells. For immune activation assays, vectors were pretreated with DNase to remove potential DNA from the vector preparations. Two days post‐transduction, the cells were selected with puromycin (2.5 μg/ml) for 48 h (or until control cells had completely died) and qRT–PCR was done 6 days post‐transduction.

RNA extraction and quantification

Total RNA was extracted using an RNeasy mini kit (Qiagen) and DNase (Ambion AM1907) treated. 500 ng of RNA was used for cDNA production using SuperScript II Reverse Transcriptase (ThermoFisher) and random primers following the manufacturer's instructions. mRNA expression levels were quantified using quantitative reverse transcription PCR (qRT–PCR) using an ABI 7500 Real Time PCR System (Applied Biosystems). SYBR green Fast PCR mastermix (Life Technologies) was used. CT values for the test genes were normalized against those of Gapdh or B2M using the −ΔΔC t method to calculate fold change. See Appendix Table S2 for primer sequences.

DNA methylation analysis

DNA was harvested using a DNeasy Blood & Tissue Kit (Qiagen), and 1 μg of DNA was used for bisulfite conversion using an EpiTect Bisulfite Kit (Qiagen) following the manufacturer's protocol. 4 μl of converted DNA was then amplified through PCR using the primer pairs described in the (Appendix Table S2). Primers were designed using the site: http://urogene.org/methprimer/and the PCR products were cloned using the TOPO TA‐Cloning Kit (Thermo Fisher Scientific), and the T7P primer was used to sequence the products. DNA methylation status of the TOPO clones was measured using the QUMA online tool (http://quma.cdb.riken.jp) by the Riken Institute.

GFP reporter assay

The HERVK14C LTR was cloned into a PGK‐GFP plasmid in place of the PGK promoter at XhoI‐BamHI sites. The consensus lysine PBS sequence was cloned upstream of the LTR after annealing primers into the XhoI site, while the Chromosome 15 HERVK14C‐specific lysine PBS was cloned into the backbone through a PCR strategy into the XhoI‐BamHI sites. The final products were verified via sequencing.

Cells were plated at a concentration of 105 cells/ml in 24‐well plates. After 6 h, wells were transduced with VSV‐pseudotyped GFP vectors at increasing doses (normalized between vectors by the number of transducing units, which were calculated by first titering vectors on KAP1 knockout 293T cells) and fresh media was replenished after 24 h. After a further 48 h, the cells were fixed in 1% PFA and washed in PBS and GFP was read using flow cytometry.

Intracellular OCT4 staining

1 × 106 cells per condition were fixed and permeabilized using intracellular staining buffers (eBioscience, 00‐5523). The cells were then stained with OCT4 or isotype control antibodies (see Antibody list above), washed, and analyzed by flow cytometry.

Luciferase reporter assays

Luciferase assays were conducted according to the Promega Dual Luciferase Kit instructions. 293T cells were plated 6 h before transfection at a concentration of 105 cells/ml in 24‐well plates. Following ratios defined before 9, 200 ng of KRAB ZNF plasmid DNA, 20 ng of Luciferase reporter plasmid DNA, and 2 ng of pRT‐TK_Renilla control plasmid were co‐transfected (10:1:0.1 ratio) using 1.5 μl of Fugene 6 (Promega) and 30 μl of Opti‐MEM (Gibco). Forty‐eight hours post‐transfection, cells were lysed and luciferase was measured using the Dual Luciferase® assay kit (Promega, E1910) in an opti‐plate using a Glomax 96 microplate Luminometer's (Promega) Dual Glow program. The Renilla‐encoding plasmid was used as an internal control for transfection efficiency normalization. The firefly to Renilla ratio was then further normalized against the empty vector or negative control as repression readouts and expressed in percentages where the control is set to 100%. The IFN‐β promoter‐driven F‐luciferase lentiviral vector plasmid 48 was a kind gift from Jan Rehwinkel. For assays using the IFIT1‐GLuc THP‐1 cells, supernatant was harvested and coelenterazine (2 μg/ml, Prolume) was used as a substrate for luminescence readings. Controls were herring testis DNA (HT‐DNA, 1 μg/ml, Sigma) and poly I:C (1 μg/ml Invivogen) and a vFLIP‐expressing vector (kind gift from Mary Collins). Data are presented as fold induction over the mock‐treated sample for each cell line.

mRNA‐sequencing and analysis

Total RNA samples were processed using Illumina's TruSeq Stranded mRNA LT sample preparation kit (RS‐122‐2101) according to manufacturer's instructions with some deviations: Libraries to be multiplexed in the same run were pooled in equimolar quantities and calculated from Qubit and Bioanalyser fragment analysis. Samples were sequenced on a NextSeq 500 instrument (Illumina, San Diego, USA) using a 43 bp paired end run resulting in > 15 million reads per sample. Run data were demultiplexed and converted to fastq files using Illumina's bcl2fastq Conversion Software v2.16. Fastq files were then aligned to the human genome NCBI build 37.2 using Tophat 2.014 and then deduplicated using Picard Tools 1.79. Reads per transcript were counted using HTSeq, and differential expression was estimated using the BioConductor package DESeq2. P‐values were adjusted for multiple testing with the Benjamini–Hochberg false discovery rate (FDR) procedure. Genes were considered upregulated or downregulated where differential affects were > 2‐fold and where adjusted P‐values were < 0.05. For analysis of repeats, TrimGalore v0.4.0 was used to remove adaptors and trim read ends using default parameters. Reads were mapped against the RepBase v20.06 human library using Bowtie2v2.2.4. The samtools v.1.19 idxstat utility was used to extract the number of mapped reads per repeat that were inputted to DESeq2 to identify differentially expressed repeats. P‐values were adjusted for multiple testing with the Benjamini–Hochberg false discovery rate (FDR) procedure. Gene ontology analysis was conducted using the DAVID website: https://david-d.ncifcrf.gov/ 49, 50. Public mRNA‐sequencing data were used to determine whether KZNFs were expressed in different cell types (see Data accessibility for accession numbers). A gene was considered to be expressed if it had an RPKM value of > 0.5 in all replicates. The top 100 KZNFs expressed in 293T cells were determined by sorting mRNA‐sequencing data on RPKM values.

Chromatin immunoprecipitation (ChIP)

ChIP experiments were performed as described 22 but using a Bioruptor Pico for sonication (four cycles, 15 s ON, 90 s OFF) and a H3K9me3 antibody (ab8898, Abcam). Chromatin immunoprecipitation‐sequencing data for human ESCs and 293T cells were downloaded from NCBI Gene Expression Omnibus (GEO) under accession numbers GSE57989 (HuESC) and GSE27929 (293T). TrimGalore v0.4.0 was used to remove adaptors and trim read ends, and reads were mapped against the human genome (hg19 assembly) using Bowtie2 v2.2.4. Peaks were called in each replicate using Macs2 v 2.1.1, and the bioconductor package DiffBind (https://bioconductor.org/packages/DiffBind) was used to construct Venn diagrams and identify overlapping ESC and 293T peaks. Human repeat and gene locations were downloaded from the UCSC browser (RepeatMasker and RefGene tables), and the repeats and genes closest to the overlapping ESC/293T peaks were identified using bedtools v2.17.0. Chip‐sequencing correlations were analyzed using the Chip‐Cor website: http://ccg.vital-it.ch/chipseq/chip_cor.php.

Statistical analysis

All error bars show standard deviation (SD) or standard error of the mean (SEM), as stated. Where n ≥ 3, Student's t‐tests or other statistical tests were used, as stated. A P‐value of < 0.05 was considered statistically significant (***P < 0.001, **P < 0.01, and *P < 0.05).

Ethics statement

Healthy adult blood donors provided written informed consent. Culture of primary peripheral blood mononuclear cells from blood donors has been reviewed and granted ethical permission by the National Research Ethics Service through The Joint UCL/UCLH Committees on the Ethics of Human Research (Committee Alpha) 2 December 2009; reference number 06/q0502/92.

Data accessibility

mRNA‐sequencing data are available on the NCBI Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) database (accession number GSE114998), and other data are included in this article and its supplementary information files. Accession numbers for the public data are as follows: 293T: GSE27929 (KAP1 ChIP‐seq), HuESC: GSE57989 (KAP1 ChIP‐seq, mRNA‐seq), 293T: GSE44267 (mRNA‐seq), macrophages: GSE36952 (mRNA‐seq), CD4+ T cells: GSE69549 (mRNA‐seq), K562 cells: GSE95374 (H3K9me3 ChIP‐seq), Naïve mESCs: GSE107840 (mRNA‐seq).

Author contributions

CHCT conceived, designed and performed experiments, analyzed the data, and wrote the paper. LF, LR‐M, RPS, TP, MTR‐P, GM, RG, and GJT contributed to reagents, experiments, and ideas. LC and JH performed and conceived bioinformatics analyses. HMR conceived the study, performed experiments, analyzed data, and wrote the paper. All authors read and approved the final manuscript.

Conflict of interest

The authors declare that they have no conflict of interest.

Supporting information

Appendix

Expanded View Figures PDF

Dataset EV1

Dataset EV2

Dataset EV3

Dataset EV4

Dataset EV5

Source Data for Expanded View

Review Process File

Source Data for Figure 1

Source Data for Figure 5

Acknowledgements

We thank Connor Husovsky for technical assistance and Steen Ooi, Pierre Maillard, and members of UCL Cruciform wing 3.3 for advice. We thank Peter Andrews for the NTERA‐2 cells; Didier Trono for reagents previously generated in his laboratory; David Haussler for the SVA, LINE1, ZNF91, and ZNF93 constructs; and Mary Collins for the vFLIP‐expressing vector. We also thank the UCL Genomics Unit and Tony Brooks for carrying out the mRNA‐sequencing library preparation, sequencing, and initial data analysis. This work was supported through a UCL Grand Challenges Studentship awarded to CHCT (code 518099) and a Sir Henry Dale Fellowship, which is jointly funded by the Wellcome Trust and Royal Society (Grant number 101200/Z/13/Z) awarded to HMR. RG is funded by the MRC (MC_UU_12014/10) and GJT, RS, TP, and MTR by Wellcome Trust (108183) and ERC (339223) grants awarded to GJT. LF is funded through an ERC grant (678350) awarded to HMR. Lucia Conde and Javier Herrero were funded through a Cancer Research UK‐University College London (CRUK‐UCL) Centre Award [C416/A25145]. The authors acknowledge the use of the UCL Legion High Performance Computing Facility (Legion@UCL), and associated support services, in the completion of this work.

EMBO Reports (2018) 19: e45000

References

  • 1. Chuong EB, Elde NC, Feschotte C (2017) Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet 18: 71–86 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W et al (2001) Initial sequencing and analysis of the human genome. Nature 409: 860–921 [DOI] [PubMed] [Google Scholar]
  • 3. Chuong EB, Elde NC, Feschotte C (2016) Regulatory evolution of innate immunity through co‐option of endogenous retroviruses. Science 351: 1083–1087 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Howard G, Eiges R, Gaudet F, Jaenisch R, Eden A (2008) Activation and transposition of endogenous retroviral elements in hypomethylation induced tumors in mice. Oncogene 27: 404–408 [DOI] [PubMed] [Google Scholar]
  • 5. Chiappinelli KB, Strissel PL, Desrichard A, Li H, Henke C, Akman B, Hein A, Rote NS, Cope LM, Snyder A et al (2015) Inhibiting DNA methylation causes an interferon response in cancer via dsRNA including endogenous retroviruses. Cell 162: 974–986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Roulois D, Loo Yau H, Singhania R, Wang Y, Danesh A, Shen SY, Han H, Liang G, Jones PA, Pugh TJ et al (2015) DNA‐demethylating agents target colorectal cancer cells by inducing viral mimicry by endogenous transcripts. Cell 162: 961–973 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Goel S, DeCristo MJ, Watt AC, BrinJones H, Sceneay J, Li BB, Khan N, Ubellacker JM, Xie S, Metzger‐Filho O et al (2017) CDK4/6 inhibition triggers anti‐tumour immunity. Nature 548: 471–475 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Antony JM, van Marle G, Opii W, Butterfield DA, Mallet F, Yong VW, Wallace JL, Deacon RM, Warren K, Power C (2004) Human endogenous retrovirus glycoprotein‐mediated induction of redox reactants causes oligodendrocyte death and demyelination. Nat Neurosci 7: 1088–1095 [DOI] [PubMed] [Google Scholar]
  • 9. Jacobs FM, Greenberg D, Nguyen N, Haeussler M, Ewing AD, Katzman S, Paten B, Salama SR, Haussler D (2014) An evolutionary arms race between KRAB zinc‐finger genes ZNF91/93 and SVA/L1 retrotransposons. Nature 516: 242–245 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Rowe HM, Jakobsson J, Mesnard D, Rougemont J, Reynard S, Aktas T, Maillard PV, Layard‐Liesching H, Verp S, Marquis J et al (2010) KAP1 controls endogenous retroviruses in embryonic stem cells. Nature 463: 237–240 [DOI] [PubMed] [Google Scholar]
  • 11. Turelli P, Castro‐Diaz N, Marzetta F, Kapopoulou A, Raclot C, Duc J, Tieng V, Quenneville S, Trono D (2014) Interplay of TRIM28 and DNA methylation in controlling human endogenous retroelements. Genome Res 24: 1260–1270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Wolf D, Goff SP (2007) TRIM28 mediates primer binding site‐targeted silencing of murine leukemia virus in embryonic cells. Cell 131: 46–57 [DOI] [PubMed] [Google Scholar]
  • 13. Wolf G, Yang P, Fuchtbauer AC, Fuchtbauer EM, Silva AM, Park C, Wu W, Nielsen AL, Pedersen FS, Macfarlan TS (2015) The KRAB zinc finger protein ZFP809 is required to initiate epigenetic silencing of endogenous retroviruses. Genes Dev 29: 538–554 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Imbeault M, Helleboid PY, Trono D (2017) KRAB zinc‐finger proteins contribute to the evolution of gene regulatory networks. Nature 543: 550–554 [DOI] [PubMed] [Google Scholar]
  • 15. Najafabadi HS, Mnaimneh S, Schmitges FW, Garton M, Lam KN, Yang A, Albu M, Weirauch MT, Radovani E, Kim PM et al (2015) C2H2 zinc finger proteins greatly expand the human regulatory lexicon. Nat Biotechnol 33: 555–562 [DOI] [PubMed] [Google Scholar]
  • 16. Schmitges FW, Radovani E, Najafabadi HS, Barazandeh M, Campitelli LF, Yin Y, Jolma A, Zhong G, Guo H, Kanagalingam T et al (2016) Multiparameter functional diversity of human C2H2 zinc finger proteins. Genome Res 26: 1742–1752 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Ivanov AV, Peng H, Yurchenko V, Yap KL, Negorev DG, Schultz DC, Psulkowski E, Fredericks WJ, White DE, Maul GG et al (2007) PHD domain‐mediated E3 ligase activity directs intramolecular sumoylation of an adjacent bromodomain required for gene silencing. Mol Cell 28: 823–837 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Matsui T, Leung D, Miyashita H, Maksakova IA, Miyachi H, Kimura H, Tachibana M, Lorincz MC, Shinkai Y (2010) Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET. Nature 464: 927–931 [DOI] [PubMed] [Google Scholar]
  • 19. Rowe HM, Friedli M, Offner S, Verp S, Mesnard D, Marquis J, Aktas T, Trono D (2013) De novo DNA methylation of endogenous retroviruses is shaped by KRAB‐ZFPs/KAP1 and ESET. Development 140: 519–529 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Brattas PL, Jonsson ME, Fasching L, Nelander Wahlestedt J, Shahsavani M, Falk R, Falk A, Jern P, Parmar M, Jakobsson J (2017) TRIM28 controls a gene regulatory network based on endogenous retroviruses in human neural progenitor cells. Cell Rep 18: 1–11 [DOI] [PubMed] [Google Scholar]
  • 21. Ecco G, Cassano M, Kauzlaric A, Duc J, Coluccio A, Offner S, Imbeault M, Rowe HM, Turelli P, Trono D (2016) Transposable elements and their KRAB‐ZFP controllers regulate gene expression in adult tissues. Dev Cell 36: 611–623 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Rowe HM, Kapopoulou A, Corsinotti A, Fasching L, Macfarlan TS, Tarabay Y, Viville S, Jakobsson J, Pfaff SL, Trono D (2013) TRIM28 repression of retrotransposon‐based enhancers is necessary to preserve transcriptional dynamics in embryonic stem cells. Genome Res 23: 452–461 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Cuellar TL, Herzner AM, Zhang X, Goyal Y, Watanabe C, Friedman BA, Janakiraman V, Durinck S, Stinson J, Arnott D et al (2017) Silencing of retrotransposons by SETDB1 inhibits the interferon response in acute myeloid leukemia. J Cell Biol 216: 3535–3549 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Collins PL, Kyle KE, Egawa T, Shinkai Y, Oltz EM (2015) The histone methyltransferase SETDB1 represses endogenous and exogenous retroviruses in B lymphocytes. Proc Natl Acad Sci USA 112: 8367–8372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Kato M, Takemoto K, Shinkai Y (2018) A somatic role for the histone methyltransferase Setdb1 in endogenous retrovirus silencing. Nat Commun 9: 1683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Wolf D, Goff SP (2009) Embryonic stem cells use ZFP809 to silence retroviral DNAs. Nature 458: 1201–1204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Blanco‐Melo D, Gifford RJ, Bieniasz PD (2017) Co‐option of an endogenous retrovirus envelope for host defense in hominid ancestors. Elife 6: e22519 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Tristem M (2000) Identification and characterization of novel human endogenous retrovirus families by phylogenetic screening of the human genome mapping project database. J Virol 74: 3715–3730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Babaian A, Mager DL (2016) Endogenous retroviral promoter exaptation in human cancer. Mob DNA 7: 24 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Frietze S, O'Geen H, Blahnik KR, Jin VX, Farnham PJ (2010) ZNF274 recruits the histone methyltransferase SETDB1 to the 3′ ends of ZNF genes. PLoS One 5: e15082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. O'Geen H, Squazzo SL, Iyengar S, Blahnik K, Rinn JL, Chang HY, Green R, Farnham PJ (2007) Genome‐wide analysis of KAP1 binding suggests autoregulation of KRAB‐ZNFs. PLoS Genet 3: e89 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Zuin J, Dixon JR, van der Reijden MI, Ye Z, Kolovos P, Brouwer RW, van de Corput MP, van de Werken HJ, Knoch TA, van IJcken WF et al (2014) Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci USA 111: 996–1001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S et al (2010) Towards a knowledge‐based human protein atlas. Nat Biotechnol 28: 1248–1250 [DOI] [PubMed] [Google Scholar]
  • 34. Wolf D, Hug K, Goff SP (2008) TRIM28 mediates primer binding site‐targeted silencing of Lys 1,2 tRNA‐utilizing retroviruses in embryonic cells. Proc Natl Acad Sci USA 105: 12521–12526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Sadic D, Schmidt K, Groh S, Kondofersky I, Ellwart J, Fuchs C, Theis FJ, Schotta G (2015) Atrx promotes heterochromatin formation at retrotransposons. EMBO Rep 16: 836–850 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Taylor KE, Mossman KL (2013) Recent advances in understanding viral evasion of type I interferon. Immunology 138: 190–197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Robbez‐Masson L, Tie CHC, Conde L, Tunbak H, Husovsky C, Tchasovnikarova IA, Timms RT, Herrero J, Lehner PJ, Rowe HM (2018) The HUSH complex cooperates with TRIM28 to repress young retrotransposons and new genes. Genome Res 28: 836–845 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Tao Y, Yen MR, Chitiashvili T, Nakano H, Kim R, Hosohama L, Tan YC, Nakano A, Chen PY, Clark AT (2018) TRIM28‐regulated transposon repression is required for human germline competency and not primed or naive human pluripotency. Stem Cell Reports 10: 243–256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Liu N, Lee CH, Swigut T, Grow E, Gu B, Bassik MC, Wysocka J (2018) Selective silencing of euchromatic L1s revealed by genome‐wide screens for L1 regulators. Nature 553: 228–232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Rusinova I, Forster S, Yu S, Kannan A, Masse M, Cumming H, Chapman R, Hertzog PJ (2013) Interferome v2.0: an updated database of annotated interferon‐regulated genes. Nucleic Acids Res 41: D1040–D1046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Komashko VM, Farnham PJ (2010) 5‐azacytidine treatment reorganizes genomic histone modification patterns. Epigenetics 5: 229–240 [DOI] [PubMed] [Google Scholar]
  • 42. Rowe HM, Lopes L, Brown N, Efklidou S, Smallie T, Karrar S, Kaye PM, Collins MK (2009) Expression of vFLIP in a lentiviral vaccine vector activates NF‐{kappa}B, matures dendritic cells, and increases CD8+ T‐cell responses. J Virol 83: 1555–1562 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Seth RB, Sun L, Ea CK, Chen ZJ (2005) Identification and characterization of MAVS, a mitochondrial antiviral signaling protein that activates NF‐kappaB and IRF 3. Cell 122: 669–682 [DOI] [PubMed] [Google Scholar]
  • 44. Sun L, Wu J, Du F, Chen X, Chen ZJ (2013) Cyclic GMP‐AMP synthase is a cytosolic DNA sensor that activates the type I interferon pathway. Science 339: 786–791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Wang C, Goff SP (2017) Differential control of retrovirus silencing in embryonic cells by proteasomal regulation of the ZFP809 retroviral repressor. Proc Natl Acad Sci USA 114: E922–E930 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Goke J, Lu X, Chan YS, Ng HH, Ly LH, Sachs F, Szczerbinska I (2015) Dynamic transcription of distinct classes of endogenous retroviral elements marks specific populations of early human embryonic cells. Cell Stem Cell 16: 135–141 [DOI] [PubMed] [Google Scholar]
  • 47. Leung D, Du T, Wagner U, Xie W, Lee AY, Goyal P, Li Y, Szulwach KE, Jin P, Lorincz MC et al (2014) Regulation of DNA methylation turnover at LTR retrotransposons and imprinted loci by the histone methyltransferase Setdb1. Proc Natl Acad Sci USA 111: 6690–6695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Hertzog J, Dias Junior AG, Rigby RE, Donald CL, Mayer A, Sezgin E, Song C, Jin B, Hublitz P, Eggeling C et al (2018) Infection with a Brazilian isolate of Zika virus generates RIG‐I stimulatory RNA and the viral NS5 protein blocks type I IFN induction and signaling. Eur J Immunol 48: 1120–1136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. da Huang W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4: 44–57 [DOI] [PubMed] [Google Scholar]
  • 50. da Huang W, Sherman BT, Lempicki RA (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37: 1–13 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix

Expanded View Figures PDF

Dataset EV1

Dataset EV2

Dataset EV3

Dataset EV4

Dataset EV5

Source Data for Expanded View

Review Process File

Source Data for Figure 1

Source Data for Figure 5

Data Availability Statement

mRNA‐sequencing data are available on the NCBI Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) database (accession number GSE114998), and other data are included in this article and its supplementary information files. Accession numbers for the public data are as follows: 293T: GSE27929 (KAP1 ChIP‐seq), HuESC: GSE57989 (KAP1 ChIP‐seq, mRNA‐seq), 293T: GSE44267 (mRNA‐seq), macrophages: GSE36952 (mRNA‐seq), CD4+ T cells: GSE69549 (mRNA‐seq), K562 cells: GSE95374 (H3K9me3 ChIP‐seq), Naïve mESCs: GSE107840 (mRNA‐seq).


Articles from EMBO Reports are provided here courtesy of Nature Publishing Group

RESOURCES