Skip to main content
PLOS Pathogens logoLink to PLOS Pathogens
. 2024 Mar 1;20(3):e1011716. doi: 10.1371/journal.ppat.1011716

The impact of HTLV-1 expression on the 3D structure and expression of host chromatin

Hiroko Yaguchi 1, Anat Melamed 1, Saumya Ramanayake 1, Helen Kiik 1, Aviva Witkover 1, Charles R M Bangham 1,*
Editor: Edward William Harhaj2
PMCID: PMC10936777  PMID: 38427693

Abstract

A typical HTLV-1-infected individual carries >104 different HTLV-1-infected T cell clones, each with a single-copy provirus integrated in a unique genomic site. We previously showed that the HTLV-1 provirus causes aberrant transcription in the flanking host genome and, by binding the chromatin architectural protein CTCF, forms abnormal chromatin loops with the host genome. However, it remained unknown whether these effects were exerted simply by the presence of the provirus or were induced by its transcription. To answer this question, we sorted HTLV-1-infected T-cell clones into cells positive or negative for proviral plus-strand expression, and then quantified host and provirus transcription using RNA-seq, and chromatin looping using quantitative chromosome conformation capture (q4C), in each cell population. We found that proviral plus-strand transcription induces aberrant transcription and splicing in the flanking genome but suppresses aberrant chromatin loop formation with the nearby host chromatin. Reducing provirus-induced host transcription with an inhibitor of transcriptional elongation allows recovery of chromatin loops in the plus-strand-expressing population. We conclude that aberrant host transcription induced by proviral expression causes temporary, reversible disruption of chromatin looping in the vicinity of the provirus.

Author summary

The human T cell leukemia virus HTLV-1 causes an aggressive leukemia or lymphoma in ~5% of people infected with the virus, and a further 1–4% develop a chronic inflammatory disease that leads to progressive paralysis of the legs. HTLV-1 is a retrovirus, like HIV, and these viruses insert themselves–as a ‘provirus’—into the DNA of the T lymphocytes that they infect: this is largely why the viruses are very difficult to eradicate. The provirus remains dormant for most of the time, but it can be reactivated when the T cell meets a change in temperature or pH, or a number of other conditions.

We recently discovered that HTLV-1, when it is integrated into the host DNA, changes the 3D structure of the genome in the infected cell, and interferes with the normal function of the host genes that lie near HTLV-1 in the genome. What we have now found is that the change in the 3D structure of the genome is caused simply by the presence of the provirus, but that when the provirus is reactivated the 3D structural changes are temporarily removed, and nearby host genes can be abnormally activated by the provirus.

Introduction

The 3D structure of chromatin is important in many biological processes, including transcription regulation, DNA replication, cell cycle and differentiation. The CCCTC-binding factor (CTCF) is a key player in chromatin loop formation [1,2]. CTCF binds a non-palindromic 20-nucleotide DNA motif at ~50,000 sites in the human genome [2]; two CTCF molecules bound to different genomic sites can dimerize, forming the boundary of chromatin loops extruded through the cohesin complex. Chromatin looping plays a critical part in gene regulation by regulating the contacts between specific enhancers and promoters.

Human T cell leukemia virus type 1 (HTLV-1, also known as human T-lymphotropic virus type 1), mainly infects CD4+ T-cells in vivo. Most carriers remain asymptomatic, but in 5% to 10% of the infected individuals, HTLV-1 infection leads to either an aggressive T-cell malignancy known as adult T-cell leukemia/lymphoma (ATL), or a chronic progressive neuro-inflammatory condition called HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP; referred to hereafter as HAM) [3]. HTLV-1 persists in the host mainly by clonal proliferation of infected cells, and a typical host carries >104 long-lived HTLV-1-infected T-cell clones [4]: each clone can be distinguished by the unique integration site of the single-copy provirus in the host genome [5].

The HTLV-1 provirus has two strands (Fig 1). The plus strand, transcribed from the 5′ LTR, encodes the structural proteins, the transcriptional transactivator protein Tax, the regulator of mRNA splicing and transport Rex, and minor accessory proteins. The minus strand, transcribed from the 3′ LTR, encodes the regulatory protein HBZ [6].

Fig 1. Structure and expression of the HTLV-1 provirus in the host genome.

Fig 1

(A) Definitions regarding classification of the direction of transcription. Here we use ‘same sense’ to denote transcription from the same strand of the host genome as the proviral plus strand (e.g. tax gene). We refer to flanking host transcription upstream and downstream of 5′LTR-driven plus-strand proviral expression respectively as (a) same (plus) sense, 5′ side of the provirus and (b) same sense, 3′ side of the provirus. Similarly, we refer to transcription upstream and downstream of 3′LTR-driven minus-strand proviral expression respectively as (c) antisense, 3′ side of the provirus and (d) antisense, 5′ side of the provirus. In this study we focus on the effects of plus-strand expression. (B) Diagram of HTLV-1 proviral genome and splicing pattern of the regulatory genes tax (encoded in plus strand) and HBZ (encoded in minus strand); red boxes represent exons. The 4C viewpoint containing the CTCF binding site (BS) is shown.

Single-cell heterogeneity in HTLV-1 proviral expression has been shown in naturally HTLV-1-infected T cell clones [79]. At a given time, a small proportion of cells express intense bursts of tax, the proportion varying between the clones [7] (Tables 1 and 2). By contrast, the minus strand is expressed in approximately 50% of circulating infected T cells at a given time [8]. Each HTLV-1-infected T-cell clone has its own pattern of proviral expression, that is, the frequency, intensity and duration of the transient transcriptional burst of the proviral plus-strand [1012]. Since the HTLV-1 proviral sequence varies little within the host, these clone-specific differences are thought to be largely due to the unique genomic integration site of the provirus; other factors may include the antigen specificity and epigenetic modifications of the host cell.

Table 1. Clone list.

Clone Genomic location of dominant integration site, hg 19* Provirus orientation ** Subject Clone derived from tax expression reporter Reference
HA1 chr07:18905519 R HEZ Asymptomatic carrier - Newly established
11.63 chr19:33829548 F TBW HAM/TSP patient - [5,10]
TBX4B chr22:44323198 F TBX HAM/TSP patient - [5,10]
3.60 chr04:70567285 F TBJ HAM/TSP patient - [5,10]
d2EGFP-TBX4B chr22:44323198 F TBX HAM/TSP patient GFP [12]
d2EGFP-11.63 chr19:33829548 F TBW HAM/TSP patient GFP Newly established
d2EGFP-11.50 chr19:28282587 R TBW HAM/TSP patient GFP [11]
Timer -TBX4B chr22:44323198 F TBX HAM/TSP patient Timer protein [11]
Timer- 3.60 chr04:70567285 F TBJ HAM/TSP patient Timer protein [11]

* hg 19 denotes human Genome Build 19

** Orientation of the provirus relative to the hg19 chromosome coordinates. F: provirus is oriented 5′ to 3′ in the chromosomal plus-strand. R: provirus is oriented 5′ to 3′ in the chromosomal minus-strand.

Table 2. Tax expression of HTLV-1-infected T cell clones.

Clone Tax-positive (%)
3.60 4 ± 0.8
TBX4B 8 ± 0.1
11.63 36 ± 5.5
11.50 40 ± 2.5
HA1 56 ± 3.7

T cell clones (clone details as shown in Table 1) were stained for Tax protein and analysed by flow cytometry. Mean ± SD (n = 2 or 3 biological replicate experiments) of Tax-positive fraction of viable cells are shown. Note: The reporter clones (GFP or Timer) shown in Table 1 were established from clones shown in this table. While the transcriptional burst of tax is short-lived, Tax protein persists for some days in each cell. In the present experiment, the cells were selected on the basis of Tax protein expression: at any one time, therefore, the percentage of Tax protein-positive cells at any instant in these clones in vitro is relatively high [11,12].

Disorganization of 3D chromatin structure can cause diseases by rewiring interactions between genes and regulatory elements [13], and certain mutations in CTCF are linked with human disease [1416]. We previously showed that the HTLV-1 provirus contains a CTCF binding site (BS): the provirus creates novel loops with the host genome, inducing transcriptional deregulation in the host genome flanking the provirus [10,17]. However, it remained unknown whether these effects were exerted simply by the presence of the provirus or by its transcription.

To answer this question, we sorted naturally-infected HTLV-1+ T cell clones to obtain subsets of provirus-expressing cells and non-expressing cells, and analysed the effects of proviral expression on chromatin looping, using quantitative circular chromosome conformation capture (q4C) [10], and on host transcription, using RNA-seq.

Results

HTLV-1 proviral plus-strand expression reduces chromatin looping between the provirus and the host genome

We hypothesized that the chromatin loops between the provirus and host genome region are regulated by expression of the provirus. To test this hypothesis, we focused on the proviral plus-strand, which is expressed in intense intermittent transcriptional bursts [7]. We used the HTLV-1 transcriptional transactivator protein Tax as a marker of plus-strand expression. T cell clones naturally infected with HTLV-1, each of which has a unique integration site (Table 1), were crosslinked with paraformaldehyde, flow-sorted into Tax+ and Taxpopulations (S1 Fig), and subjected to q4C assay, using a fragment of the proviral sequence containing its CTCF binding site as the q4C viewpoint [10] (Fig 1B).

In the Taxpopulation of clones 3.60 and HA1, we identified q4C peaks (long-range chromatin loops between the provirus and the host genome) (Fig 2A): the identified peaks often overlapped CTCF binding sites, as previously reported in the study using unsorted T cells [10].

Fig 2. HTLV-1 plus-strand expression results in fewer chromatin loops (number of q4C peaks) between provirus and host genome.

Fig 2

(A) q4C profiles of Taxand Tax+ cells of two different clones (3.60 and HA1). For each clone, the top panel depicts the q4C profile in the 5′ and 3′ host genome flanking the provirus (two biological duplicates), quantified as the normalized frequency of ligation events in overlapping windows (window width 10 kb, step 1 kb). On the horizontal axis, positive values denote positions extending from the 3′ LTR side of the provirus; negative values denote positions 5′ of the 5′LTR. VP–viewpoint in q4C (proviral integration site). Diamonds mark the positions of reproducible chromatin contact sites identified by the peak calling algorithm. CTCF track–open arrowheads denote positions of CTCF-binding sites (BS); the filled arrowhead denotes the CTCF-BS in the provirus. Genes track shows RefSeq protein-coding genes in the flanking host genome. The q4C profiles of remaining clones are shown in S2 and S3 Figs. (B) Number of identified peaks in non-expressing (Tax- or GFP-) and expressing (Tax+ or GFP+) subsets isolated from 6 clones: total (all peaks) and (C) peaks with or without a CTCF binding site.

Unexpectedly, the frequency of the chromatin loops identified in Taxcells was much lower in Tax+ cells, although the technical peak at the viewpoint (VP), which is always present in a successful 4C-seq assay [18], remained in Tax+ cells (Fig 2A).

Tax protein is a transcriptional transactivator that stimulates transcription of both HTLV-1 proviral genes and many host genes. To exclude the possibility that the anti-Tax antibody used in intracellular staining altered chromatin looping, we performed q4C assays using T-cell clones transduced with a reporter construct which expresses a modified EGFP with a half-life of ~2h (d2EGFP-TBX4B) when stimulated by Tax protein [11]. The GFP signal intensity was positively correlated with Tax expression [12]. Provirus-expressing (GFP+) cells and non-expressing (GFP) cells were flow-sorted after fixation in 1% paraformaldehyde and subjected to the q4C assay. Consistent with the results from intracellular Tax staining (Fig 2A), GFP+ cells also showed fewer chromatin loops compared with GFP cells (S2 Fig), and the 4C profiles of GFP+ d2EGFP-TBX4B cells were consistent with those of sorted Tax+ TBX4B cells. We conclude that the lower frequency of chromatin looping observed in Tax+ cells was associated with proviral transcription and was not caused by staining Tax protein. Also, the observation of similar changes in looping frequency in both the transduced and non-transduced cells indicates that transduction per se did not affect the results of q4C analysis.

A decrease in the number of q4C peaks at distant chromatin contacts in the host genome was observed in provirus plus-strand-expressing subsets in all T cell clones investigated (Figs 2B, S2 and S3), in regions either with or without CTCF binding sites (Fig 2C).

To corroborate these observations by an independent technique, 3C-qPCR was performed to quantify the frequency of the interaction between the provirus and the two host regions where the principal 4C peaks were identified in the Taxpopulation of clone 11.63 (S3 Fig), using specific primers designed to amplify the junction between the provirus and the host genome (S4A Fig and S1 Table). The results showed a reproducible significant decrease in chromatin contact frequency between the provirus and both peaks in Tax+ cells (S4C and S4D Fig), although there was no difference in the contact frequency observed within the provirus (S4B Fig).

Next, to quantify the difference in chromatin loop formation between provirus-expressing and non-expressing cells, peak regions were defined in the non-expressing cell population, and peaks were then sought at the same sites in the provirus-expressing cell population. (Fig 3A) At each site, we then compared the normalized peak height of the q4C profile in non-expressing cells with that in provirus-expressing cells. The results show that HTLV-1 plus-strand expression resulted in a significant reduction in the frequency of these chromatin loops, especially on the 3′ side of the integration site (Fig 3B). This reduction was observed in both regions with and without CTCF binding sites (Fig 3C).

Fig 3.

Fig 3

HTLV-1 plus-strand expression results in a significant reduction in contact frequency (q4C peak height) with the host genome (A) Schematic to show quantification of change in peak frequency with plus-strand expression. First, reproducible q4C peaks were identified in the non-expressing (Tax-negative) cells, as previously described [10]. Then, peaks were sought (using the same algorithm) in each corresponding genomic location in the plus-strand-expressing cell population. (B) Normalized peak height of q4C peaks identified in all clones analysed, respectively on the 5′ and 3′ sides of the provirus. Peak height is defined as the maximum number of ligation events per region (normalised to total ligation events in a sample) calculated for each peak region. p = 0.027 and p = 9.7 * 10−13 for upstream and downstream regions, respectively (unpaired two-tailed Wilcoxon test. (C) Normalized peak height of q4C peaks identified in all clones analysed, comparing peaks that contain a CTCF site and those without CTCF sites. Peak height was significantly greater in non-expressing cells, both in peaks with a CTCF site and those without (p = 0.0018 and p = 1.8 * 10−13, respectively, unpaired two-tailed Wilcoxon test).

HTLV-1 proviral plus-strand expression drives host transcription downstream in cis

Throughout this work, we refer to the direction of flanking host transcription relative to the orientation of the provirus. Thus, host transcription upstream (5′) of the 5′LTR, in the same sense as the proviral plus-strand, is denoted as ‘same-sense, 5′ side of provirus’ (Fig 1 arrow a), and transcription in the same sense 3′ of the 3′LTR as ‘same-sense, 3′ side of provirus’ (Fig 1 arrow b). Similarly, we refer to host transcription in the same sense as the proviral minus-strand as ‘antisense, 3′ side of provirus’ (Fig 1 arrow c) and ‘antisense, 5′ side of provirus’ (Fig 1 arrow d).

Next, we asked whether the aberrant host transcription is associated with proviral transcription or simply with the presence of the provirus, regardless of transcription. The landscape of host transcription flanking the provirus has been investigated in ATL cases [19,20], in which there is usually a dominant single malignant clone. However, there are typically widespread epigenetic and transcriptional abnormalities in malignant cells, and expression of the proviral plus-strand is frequently lost in ATL clones [21]. We previously reported abnormal in cis host transcription near the integration site in naturally-infected, non-malignant HTLV-1-positive T-cell clones [10], but since only a fraction of cells express the proviral plus-strand at a given instant, we could not distinguish whether the abnormal host transcription was associated with proviral plus-strand expression, or whether there was constitutive activation irrespective of the plus-strand burst, perhaps by the proviral enhancer.

RNA isolated from fresh, unfixed cells is required for optimal RNA-seq results. However, intracellular staining of Tax protein requires prior fixation. Therefore, to elucidate whether the aberrant host transcription was induced by proviral plus-strand transcription, we used d2EGFP-transduced clones (11.63, 11.50 and TBX4B) (see Table 1) [11], from which GFP+ (Tax-expressing) and GFP(Tax-non-expressing) cells were isolated by live-cell sorting; total cellular RNA was extracted and subjected to a stranded RNA-seq analysis. The proviral plus-strand was highly transcribed in all 3 clones in GFP+ cells but not in GFPcells (Fig 4A and 4B), confirming that GFP+ cell-sorting enriched a cell population with active HTLV-1 plus-strand transcription.

Fig 4. Proviral and host transcription and splicing in live-sorted T cell clones.

Fig 4

(A) RNA-seq analysis of HTLV-1 proviral expression in live-sorted d2EGFP clones (TBX4B, 11.50 and 11.63). (B) Tax expression measured by qPCR with primers specific for tax mRNA or 18S ribosomal RNA (18S rRNA). qPCR plots are expression values normalized to 18S rRNA. Data represent a mean of two biological replicates; error bars are SEM. AU—arbitrary units. (C) Host RNA expression 30kb on either side of the proviral integration site. On the horizontal axis, positive values denote positions extending from the 3′ LTR side of the provirus; negative values denote positions 5′ of the 5′LTR. Each row shows the transcription density (normalized RNA-seq read count) flanking that genomic position in the clone indicated at the right-hand side. In each case, transcription orientation and positions are shown relative to the integrated provirus. Read density shown in blue shows transcription in the same orientation as the proviral plus-strand (same sense); red shows transcription in the opposite sense to the proviral plus-strand (antisense). (D) Identification of splice sites of viral-host fusion transcripts in d2EGFP-TBX4B clone cells. Coverage tracks of same sense transcription (blue) and antisense transcription (red) in Integrative Genomics Viewer (IGV). Exons of PNPLA3 in the 3′ side of the integration site are highlighted in yellow. (E) Fusion transcripts between an HTLV-1 plus-strand major splice donor (red, proviral exon H1 or H2) and the canonical splice acceptor site in the host PNPLA3 gene (blue, PNPLA3 exon 3) were identified in GFP+ (HTLV-1 plus-strand-expressing) cells. To identify splice sites of fusion transcripts, reads were aligned to a reference genome (hg19) containing the HTLV-1 provirus (AB513134) genome in the TBX4B clone integration site at chr22:44323198. Fusion transcripts are shown with fused sequences.

Host transcription in the same sense on the 3′ side of the provirus near the clone’s respective proviral integration site was greater in GFP+ cells than in GFPcells in all 3 clones (Fig 4C). For example, in clone d2EGFP-TBX4B, HTLV-1 is integrated between exon 2 and exon 3 of the PNPLA3 gene (Fig 4D), which is not normally expressed in T cells (Fig 4C). The PNPLA3 gene (same-sense, 3′ side of the provirus) was highly transcribed in the GFP+ population, but was not transcribed in GFPcells of clone d2EGFP-TBX4B, or in two other clones (d2EGFP-11.50 and d2EGFP-11.63) regardless of Tax expression, suggesting that the observed increase in PNPLA3 mRNA did not result from Tax-mediated transactivation (Fig 4C). Aberrant transcription in the same-sense on the 5′ side of the provirus was also seen in two clones (11.50 and TBX4B) at a low level (Fig 4C and 4D).

In addition to splicing between the 3′LTR and the host genome (antisense, 5′ side of provirus), which has also been reported in ATL cases [19], in plus-strand-expressing cells we found that the exons of 5′LTR (exon H1/exon H2 in Fig 1) were spliced out to form not only normal viral mRNAs but also viral-host fusion transcripts, by fusion to downstream host plus-strand transcripts. For example, in the GFP+ subset of d2EGFP-TBX4B cells, proviral exons H1 or H2 were found to be fused to exon 3 of PNPLA3 (Fig 4D and 4E). The observed events always occurred between a canonical HTLV-1 splice donor and a canonical host splice acceptor. Such splicing events were observed even when no host gene was present in the flanking region (S5 Fig). We then reanalysed the RNA-seq data obtained previously [11] and found a novel transcript in clone Timer-3.60 cells expressing the plus-strand, formed by splicing out of the proviral plus-strand exon H1 and fusion to novel host exons (S5B Fig). These splicing events again followed the canonical GT|AG mRNA processing rule. Fusion was observed between a host splice acceptor and proviral splice donor up to 120 kb away. By contrast, splicing was not observed between a host splice donor and an HTLV-1 splice acceptor; although low-level host transcription in the same-sense on the 5′ side of the provirus was induced by proviral plus-strand expression in two clones (Fig 4C and 4D).

Treatment of T cell clones with an inhibitor of transcriptional elongation allows recovery of chromatin loops in the Tax+ population

The observation of upregulated host transcription near the integration site and decreased chromatin loops in the provirus-expressing cell populations raised the question whether provirus-induced host transcription disrupts chromatin loop formation between the provirus and host genome. To test this hypothesis, we treated the infected cells with flavopiridol, an inhibitor of transcriptional elongation. Flavopiridol was reported to inhibit both elongation and readthrough transcription induced by influenza virus [22]. Treatment of cells with 1 nM flavopiridol for 1.5 hours did not affect Tax protein expression (S6 Fig) so the cells could still be sorted for Tax protein. Flavopiridol treatment also did not reduce tax mRNA expression significantly. However, the level of aberrant host transcription in the same sense, 3′ side of the provirus was significantly decreased (Fig 5A and 5B).

Fig 5. Treatment of T cell clones with a transcriptional elongation inhibitor allows recovery of chromatin loop formation in the Tax+ population.

Fig 5

(A) After treatment with 1 nM flavopiridol (FP) for 1.5 hrs, total RNA was extracted from clone 11.63 and subjected to RT-qPCR for tax and three regions in the 3′ flanking host genome, respectively a: +188 bp, b: +535 bp and c: +3,198 bp from the 3′ end of the provirus. (B) Relative expression intensity (normalized to 18s rRNA) of tax and the host genome at positions a, b and c. Data are mean ± SEM. (N = 3). * P<0.05 (paired t-test). (C) q4C profiles of mock-treated Tax- cells (top track) and Tax+ cells (middle track), and flavopiridol (FP)-treated Tax+ cells (bottom track). Diamonds mark the positions of reproducible chromatin contact sites identified by the peak calling algorithm. Open arrowheads denote positions of CTCF-BS; the filled arrowhead denotes the CTCF-BS in the provirus. Gene track shows RefSeq protein-coding genes in the flanking host genome.

After flavopiridol treatment, the cells were stained intracellularly, sorted for Tax protein, and then subjected to q4C assay. While the frequency of chromatin loops between the provirus and the host genome was lower in Tax+ cells than in Taxcells (as in Fig 1 above), chromatin loops remained in Tax+ cells when they were treated with flavopiridol (Fig 5C). We conclude that inhibition of transcription elongation by flavopiridol reduced host transcription and allowed the preservation and restoration of chromatin loops.

Discussion

The results presented here lead to two main conclusions. First, q4C analysis of sorted T cell clones for plus-strand expression revealed that the chromatin loops formed between the provirus and the immediately flanking host genome are reduced or lost during plus-strand proviral transcription. Second, plus-strand proviral transcription causes (1) aberrant host transcription on the same-strand, chiefly on the 3′ side of the provirus and to a lesser extent on the 5′ side; (2) fusion transcripts between provirus and host RNA; the 5′LTR (exon H1/exon H2) was fused to host exons in the same sense on the 3′ side of the provirus, although intra–HTLV-1 splicing remained intact.

In clone TBX4B we observed that transcriptional activation of a host gene more distant (1.4 Mb) from the provirus was associated with expression of the wild-type provirus, but not with the provirus in which the CTCF binding site was knocked out (S7 Fig). We postulate that, while the nearby chromatin loops are diminished or lost during HTLV-1 proviral plus-strand transcription, more long-range CTCF-dependent loops can remain intact. This result (S7 Fig) suggests that the long-range enhancer effect of HTLV-1 requires the CTCF binding to the provirus and active viral transcription. Further investigation will be needed to clarify this mechanism.

The chromatin loops formed by the provirus reported here and previously [10] are made with adjacent regions of the host genome. The frequency of these loops declines rapidly within ~5 Mb of the provirus. These observations are consistent with the size distribution of normal chromatin loops in the human genome [23], and with the mechanism of loop extrusion by which these loops are formed [24]. It is known that contacts can also be made with more distant locations on the genome, including other chromosomes; however, such distant contacts are typically much less frequent, although certain trans-chromosomal contacts may be evolutionarily conserved [25].

Aberrant read-through transcription and splicing is a known feature of retroviral transcription [26]. Previous studies reporting RNA-seq analysis in ATL cases [19,20] showed aberrant antisense expression in the host genome flanking the 5′ side of the provirus; the 3′LTR sequence was spliced out and fused to host exons on the antisense strand on the 5′ side of the provirus. However, aberrant host transcription in the same sense as the proviral plus-strand (either 5′ or 3′ to the provirus), and splicing between the 5′LTR and host exons in the same sense on the 3′ side of the provirus, were rarely observed. In the non-malignant clones in the present study, although viral-host transcripts in the minus-strand were sometimes observed, plus-strand transcription always dominated.

We observed splicing between the 5′LTR and host exons in the plus-strand on the 3′ side of the provirus (Figs 4D, 4E and S5), even when there was no identified host gene in either strand. The observation that proviral-host chimeric RNA splicing events can activate cryptic host exons was recently reported in HIV-1-infected Jurkat T cell clones carrying reporter proviruses integrated into the introns of three cancer-related genes [27]. Our observations show that HTLV-1 plus-strand expression has the potential to disrupt host cell functions by creating novel transcripts, with potential biological functions, under the transcriptional control of the HTLV-1 5′LTR. To explore this possibility, further research is needed to test the stability and activity of these aberrant mRNAs.

Whereas frequent splicing was observed between a proviral donor and host acceptor, we did not observe splicing between a host donor and proviral acceptor, even if the host genome 5′ side region was transcribed. This observation implies that any same-sense host transcription on the 5′ side of the provirus stops at the integration site, whereas LTR-driven transcription can result in read-through into the host genome in the same sense on the 3′ side of the provirus.

CTCF, which binds to the HTLV-1 provirus, can regulate mRNA splicing: further work is required to test the possibility that CTCF bound to the provirus causes aberrant splicing in the flanking host genes.

The results reported here show that the aberrant host transcription induced by the HTLV-1 provirus differs from the host gene transcriptional landscape at the HIV-1 integration site [27]. No aberrant host transcription was observed in the antisense strand in HIV-1-infected cells. By contrast with HIV-1, HTLV-1 can induce host transcription in the same sense on the 5′ side of the provirus (Fig 4C), whereas both proviruses induce strong same-sense transcription on the 3′ side of the provirus. Whereas downstream transcription can be at least partially explained as readthrough transcripts from the provirus, same-sense transcription on the 5′ side of the provirus in an HTLV-1+ clone cannot be explained by readthrough from the 5′LTR. Two observations suggest that this same-sense transcription on the 5′ side of the provirus is due to an enhancer effect of the HTLV-1 5′LTR. First, the aberrant same-sense transcription was seen only in GFP+ cells (Fig 4C and 4D); second, the transcription correlated with the level of plus-strand proviral transcription in timer protein reporter clones (S8 Fig).

An unexpected finding in this study was that aberrant loop formation between the provirus and host chromatin is suppressed by proviral plus-strand transcription. Since CTCF appears to remain bound to the provirus during expression of tax [8], the observed decrease in chromatin looping cannot be attributed to loss of CTCF binding. The observations that proviral transcription increases aberrant transcription and splicing in the flanking host genome (Figs 4 and S5) and that reducing proviral-expression-induced host transcription with an inhibitor of transcriptional elongation allows recovery of chromatin loops in the Tax+ population (Fig 5) suggest that aberrant host transcription induced by the proviral expression disrupts chromatin looping. This inference is consistent with the report by Heinz et al. (2018) [22] that during influenza A virus (IAV) infection the IAV non-structural protein 1 (NS1) induces global inhibition of transcription termination of highly transcribed genes and causes readthrough transcription for hundreds of kilobases, resulting in disruption of chromatin interactions.

Chromatin looping between the HTLV-1 provirus and the host genome is mediated by CTCF binding to the provirus [14]. We postulated that CTCF binding confers a selective advantage on the provirus; however, the mechanism of this putative advantage is not known. At least three non-mutually-exclusive possibilities can be identified. First, through its function as an epigenetic barrier, CTCF may regulate epigenetic modification of the provirus [28] Second, CTCF (again through its barrier function) might block unwanted activation of plus-strand proviral transcription by the recently reported intragenic enhancer [29]. Third, CTCF may control the intranuclear position of the chromatin containing the provirus, which is correlated with selective survival of HTLV-1-infected T cell clones in vivo [30].

Methods

Ethics statement

All donors gave written informed consent in accordance with the Declaration of Helsinki to donate blood samples to the Communicable Diseases Research Tissue Bank, approved by the UK National Research Ethics Service (15/SC/0089).

Cells

The details of HTLV-1-infected T-lymphocyte clones used in this study are shown in Table 1. All clones were derived as previously described [11] from peripheral blood mononuclear cells (PBMCs) of donors attending the National Centre for Human Retrovirology (NCHR) at Imperial College Healthcare NHS Trust, St Mary’s Hospital, London. The identification of genomic insertion sites by LMPCR was described elsewhere [31]. The cells were maintained in RPMI-1640 (Sigma, R0883) supplemented with L-glutamine, penicillin+streptomycin and 20% fetal bovine serum (Gibco, 10500–064) in 5% CO2 at 37°C. IL-2 (Miltenyi Biotec, 130-097-745) was supplemented (100 unit/ml) into the culture twice a week. The integrase inhibitor raltegravir (Selleck Chemicals, MK-0518) was used at 10 μM throughout the culture to prevent secondary infection.

3C and q4C assay of sorted cells

HTLV-1- infected T cell clones were stained with LIVE/DEAD Fixable Near-IR Dead Cell Stain kit (Invitrogen, L34976) to enabling gating on live cells and then crosslinked in phosphate-buffered saline (PBS) containing 1% formaldehyde for 10 min at room temperature. Then the Tax protein was stained intracellularly with anti-Tax-AF647 (clone LT-4; 0.4 μg/ml), using a Foxp3 staining kit (eBioscience, 00-5523-00) and viable Tax+ and Tax- cells were sorted (S1 Fig). For the Tax expression reporter clones (d2EGFP clones), the cells were stained with LIVE/DEAD Fixable Near-IR Dead Cell Stain kit, crosslinked as above, and viable GFP+ cells and GFP- cells were sorted. After sorting, q4C was performed [10].

q4C analysis was carried out as previously described [10]. Reads spanning 100 to 150 bp were inspected for quality using FastQC, and filtered for correct amplification using the NlaIII restriction sequence and preceding 4 bases (total 8 bases) as an identifying barcode for the q4C viewpoint using Cutadapt. Reads where the first NlaIII site was incompletely digested (therefore containing an additional NlaIII fragment) were further trimmed by Cutadapt. Finally, Trimgalore was used to trim low quality reads (<20) and remove the Ilumina adaptor sequence, keeping reads of minimum 30 b in length. Trimmed reads were aligned to a combined reference of human (hg19) and viral (AB513134) genomes using Bowtie2. Ligation sites were quantified from aligned reads using Perl and R scripts. Ligation sites were quantified in windows of 10 kb across the chromosomes in order of base coordinates. Peaks were called as described previously [10] using a three-state hidden Markov model, requiring agreement between both samples (biological replicates).

Real-time 3C qPCR was described elsewhere [17]. Primer pairs and probe (S1 Table) were used to detect long-range chromatin contacts between the provirus and host genome region at Peak 1 (34.5 kb downstream of the provirus) and Peak 2 (68 kb downstream of the provirus) of chromosome 19. As control, we used a primer pair and Taqman probe to detect the contacts of two regions in the provirus, and a 101 bp sequence from the provirus was amplified to normalize the qPCR using internal control primer sets and probe (S1 Table). Data analysis was done with LinRegPCR software (version 2014.5). Six replicates were performed on each library, using a QuantStudio 7 Flex Real-Time PCR System utilizing TaqMan Gene Expression Master Mix (Applied Biosystems).

RNA-seq and RT-qPCR

d2EGFP clones were stained with LIVE/DEAD, and live cells were sorted with a BD FACSAria lll cell sorter under containment level 3 (CL3) conditions, as described elsewhere [11]. Total RNA was extracted from the sorted cells using an miRNeasy kit (Qiagen) and RNA-seq libraries were prepared using the Ribo-Zero Plus rRNA Depletion kit and TruSeq Stranded mRNA HT Sample Prep Kit and sequenced with the NovaSeq6000 (150 bp paired-end reads).

RNA-seq reads were inspected for quality using FastQC, and trimmed using Trimgalore to remove low-quality reads and Ilumina adapter sequences. Trimmed reads were aligned against a combined reference of human (hg19) and viral (AB513134) genomes using GSNAP v. 2019-06-10. Read coverage was counted using Bedtools coverage after split to exons using Bedtools bamtobed tool using the resolution denoted in each figure.

Reads were aligned to a combined reference of human (hg19) and viral (AB513134) inserted into the integration site at chr22:44323198, chr19:33829548 and chr04:70567285 to identify splice sites of fusion transcripts in TBX4B, 11.63 and 3.60, respectively.

RNA-seq of timer protein reporter clones was described elsewhere [11]. RNA-seq of TBX4B clones containing wild-type CTCF Binding site (BS) in the provirus and mutant (Mut) clone containing a mutated CTCF-BS were performed as described elsewhere [10].

For RT-qPCR, first-strand cDNA was synthesized with First Strand cDNA Synthesis Kit using random primers. Primers used are shown in S1 Table. Real-time qPCR was then performed in 6 replicates on each library, using a QuantStudio 7 Flex Real-Time PCR System utilizing Fast SYBR Green Master Mix (Applied Biosystems).

Supporting information

S1 Table. DNA sequence of primers and probes used in 3C-qPCR and RT-qPCR.

(TIFF)

ppat.1011716.s001.tiff (9.4MB, tiff)
S1 Fig. Flow sorting of Taxand Tax+ cells.

HTLV-1-infected T cell clone 11.63 was stained for live cells, crosslinked in 1% formaldehyde, stained intracellularly for Tax protein, and flow-sorted to isolate Taxand Tax+ subsets (see Methods).

(TIFF)

ppat.1011716.s002.tiff (9.9MB, tiff)
S2 Fig. Flow sorting of provirus-expressing and non-expressing cells by an independent technique produced similar q4C profiles.

(A) q4C profiles of Tax(upper panel) and Tax+ (lower panel) cells sorted from clone TBX4B after intracellular staining of Tax. (B) q4C profile of non-expressing (GFP-) (upper panel) and provirus-expressing (GFP+) (lower panel) cells isolated from d2EGFP-TBX4B clones, selected by GFP signal (without Tax staining).

(TIFF)

ppat.1011716.s003.tiff (9.9MB, tiff)
S3 Fig. Transcription of the proviral plus-strand is accompanied by a loss of contacts between the provirus and flanking host chromatin.

(A) q4C profiles of Tax(upper panel) and Tax+ (lower panel) cells from clone 11.63. (B) q4C profiles of non-expressing (GFP) (upper panel) and provirus-expressing (GFP+) cells (lower panel) from clone d2EGFP-11.50. Vertical axes show the normalized frequency of chromatin contacts between the provirus and the host genome.

(TIFF)

ppat.1011716.s004.tiff (9.9MB, tiff)
S4 Fig. Quantitative 3C (3C-qPCR) analysis confirmed decreased frequencies of chromatin looping.

(A) q4C profile of Taxcells of clone 11.63. The technical peak seen in the q4C viewpoint (VP) and two of the main peaks (Peak 1) and (Peak 2) identified in the output of the Taxfraction of clone 11.63. (B) As control, the frequency of chromatin interactions was quantified by 3C-qPCR on sorted Taxand Tax+ cells, using a primer pair and Taqman probe to detect the contacts between two regions: VP and another region in the provirus (S1 Table). There was no significant difference between Tax+ and Taxcells (combined p value = 0.932, Fisher’s method of combining p values). (C and D) Primer pairs and probe were used to detect long-range chromatin contacts between the provirus and host genome region at Peak 1 (C) or Peak 2 (D). Results of 3C-qPCR of two biological replicates (rep) are shown. Peak1 contact frequency was significantly higher in Taxcells than in Tax+ cells (combined p value 0.012, Fisher’s method). Peak2 contact frequency was significantly higher in Tax- cells than in Tax+ cells (combined p value 0.000607, Fisher’s method).

(TIFF)

ppat.1011716.s005.tiff (9.9MB, tiff)
S5 Fig. Fusion transcripts between HTLV-1 provirus and host genome.

(A) Identification of splice sites of fusion transcripts in the plus-strand-expressing cells of clone d2EGFP-11.63 and (B) in timer protein reporter clone Timer-3.60. Plus-strand fusion transcripts between HTLV-1 exon1 (H1) and same sense, 3′ side host gene exon (blue) or novel host exons (green) are shown with fused sequences.

(TIFF)

ppat.1011716.s006.tiff (9.9MB, tiff)
S6 Fig. Treatment with flavopiridol for 1.5 hrs did not alter Tax protein expression.

Clone 11.63 cells were treated with 1nM flavopiridol for 1.5 hrs and then stained for Live/Dead and then Tax protein.

(TIFF)

ppat.1011716.s007.tiff (9.9MB, tiff)
S7 Fig. Expression of distant host genes correlates with expression of tax.

(A) Normalized mRNA read counts of two genes that lie >1.4 Mb from the provirus in clone Timer-TBX4B, in the four successive phases of the HTLV-1 plus-strand transcriptional burst: DN–double negative (HTLV-1 silent); blue–early burst; DP double-positive (mid-burst); red–late burst. Results of two independent experiments are shown. Expression of both SMC1B and RIBC2 closely followed the trajectory of the HTLV-1 burst in clone Timer-TBX4B, but not in the unrelated HTLV-1-infected clone Timer-3.60. Data from [11]. (B) Knockout of the CTCF binding site in the provirus in clone Timer-TBX4B (middle panel) abolished the transcription of both SMC1B and RIBC2 observed in the wild-type clone (lower panel). Results of two independent experiments are shown. Neither gene was expressed in an unrelated HTLV-1-infected clone ED. The results suggest that maintenance of a CTCF-dependent chromatin loop between the host genome and the provirus is required for the burst of transcription of these distant genes associated with the HTLV-1 plus-strand burst.

(TIFF)

ppat.1011716.s008.tiff (3.9MB, tiff)
S8 Fig. RNA-seq analysis of HTLV-1 proviral expression in live-sorted cells of clone Timer-TBX4B.

Cells were sorted into four populations based on the fluorescence of the Timer protein, DN–double negative (HTLV-1 silent); blue–early burst; DP double-positive (mid-burst); red–late burst. [11].(A) Coverage tracks in IGV of plus strand HTLV-1 provirus transcription and (B) host gene PNPLA3 transcription. (C) Transcription in PNPLA3 exons 1 and 2 (note range on vertical axis 0 to 500).

(TIFF)

ppat.1011716.s009.tiff (9.3MB, tiff)

Acknowledgments

We thank Parisa Amjadi from the CL3 Cell Sorting Facility at The Centre for Immunology and Vaccinology at Imperial College London for cell sorting. We are grateful to Tomas Fitzgerald at the European Bioinformatics Institute and Aris Aristodemou for helpful discussion. We thank Laurence Game at MRC London Institute of Medical Sciences for q4C sequencing, and Oxford Genomics Centre for cDNA library preparation and RNA sequencing.

Data Availability

Sequence data have been deposited at the European Nucleotide Archive, (ENA, https://www.ebi.ac.uk/ena/browser/home), accession number PRJEB71982.

Funding Statement

This work was supported by a Wellcome Trust UK Investigator Award to C.R.M.B. (ref. 207477). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Baranello L, Kouzine F, Levens D. CTCF and cohesin cooperate to organize the 3D structure of the mammalian genome. Proc Natl Acad Sci U S A. 2014;111(3):889–90. Epub 2014/01/09. doi: 10.1073/pnas.1321957111 ; PubMed Central PMCID: PMC3903212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Phillips JE, Corces VG. CTCF: master weaver of the genome. Cell. 2009;137(7):1194–211. Epub 2009/07/01. doi: 10.1016/j.cell.2009.06.001 ; PubMed Central PMCID: PMC3040116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Bangham CRM, Matsuoka M. Human T-cell leukemia virus types 1 and 2. In: Howley PM, Knipe DM, editors. Fields Virology. RNA Viruses. 3. 7th ed: Wolters Kluwer; 2023. p. 527–57. [Google Scholar]
  • 4.Laydon DJ, Sunkara V, Boelen L, Bangham CRM, Asquith B. The relative contributions of infectious and mitotic spread to HTLV-1 persistence. PLoS Comput Biol. 2020;16(9):e1007470. Epub 2020/09/18. doi: 10.1371/journal.pcbi.1007470 ; PubMed Central PMCID: PMC7524007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cook LB, Rowan AG, Melamed A, Taylor GP, Bangham CR. HTLV-1-infected T cells contain a single integrated provirus in natural infection. Blood. 2012;120(17):3488–90. Epub 2012/09/08. doi: 10.1182/blood-2012-07-445593 ; PubMed Central PMCID: PMC3482858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kulkarni A, Bangham CRM. HTLV-1: Regulating the Balance Between Proviral Latency and Reactivation. Front Microbiol. 2018;9:449. Epub 2018/04/05. doi: 10.3389/fmicb.2018.00449 ; PubMed Central PMCID: PMC5867303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Billman MR, Rueda D, Bangham CRM. Single-cell heterogeneity and cell-cycle-related viral gene bursts in the human leukaemia virus HTLV-1. Wellcome Open Res. 2017;2:87. Epub 2017/10/25. doi: 10.12688/wellcomeopenres.12469.2 ; PubMed Central PMCID: PMC5645716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Miura M, Miyazato P, Satou Y, Tanaka Y, Bangham CRM. Epigenetic changes around the pX region and spontaneous HTLV-1 transcription are CTCF-independent. Wellcome Open Res. 2018;3:105. Epub 2019/01/08. doi: 10.12688/wellcomeopenres.14741.2 ; PubMed Central PMCID: PMC6305241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mahgoub M, Yasunaga JI, Iwami S, Nakaoka S, Koizumi Y, Shimura K, et al. Sporadic on/off switching of HTLV-1 Tax expression is crucial to maintain the whole population of virus-induced leukemic cells. Proc Natl Acad Sci U S A. 2018;115(6):E1269–E78. Epub 2018/01/24. doi: 10.1073/pnas.1715724115 ; PubMed Central PMCID: PMC5819419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Melamed A, Yaguchi H, Miura M, Witkover A, Fitzgerald TW, Birney E, et al. The human leukemia virus HTLV-1 alters the structure and transcription of host chromatin in cis. Elife. 2018;7. Epub 2018/06/26. doi: 10.7554/eLife.36245 ; PubMed Central PMCID: PMC6019074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kiik H, Ramanayake S, Miura M, Tanaka Y, Melamed A, Bangham CRM. Time-course of host cell transcription during the HTLV-1 transcriptional burst. PLoS Pathog. 2022;18(5):e1010387. Epub 2022/05/17. doi: doi: 10.1371/journal.ppat.1010387 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ramanayake S, Moulding DA, Tanaka Y, Singh A, Bangham CRM. Dynamics and consequences of the HTLV-1 proviral plus-strand burst. PLoS Pathog. 2022;18(11):e1010774. Epub 2022/11/29. doi: 10.1371/journal.ppat.1010774 ; PubMed Central PMCID: PMC9731428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Anania C, Lupianez DG. Order and disorder: abnormal 3D chromatin organization in human disease. Brief Funct Genomics. 2020;19(2):128–38. Epub 2020/02/07. doi: 10.1093/bfgp/elz028 ; PubMed Central PMCID: PMC7115703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Boyling A, Perez-Siles G, Kennerson ML. Structural Variation at a Disease Mutation Hotspot: Strategies to Investigate Gene Regulation and the 3D Genome. Front Genet. 2022;13:842860. Epub 2022/04/12. doi: 10.3389/fgene.2022.842860 ; PubMed Central PMCID: PMC8990796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Spielmann M, Lupianez DG, Mundlos S. Structural variation in the 3D genome. Nat Rev Genet. 2018;19(7):453–67. Epub 2018/04/26. doi: 10.1038/s41576-018-0007-0 . [DOI] [PubMed] [Google Scholar]
  • 16.Norton HK, Phillips-Cremins JE. Crossed wires: 3D genome misfolding in human disease. J Cell Biol. 2017;216(11):3441–52. Epub 2017/09/01. doi: 10.1083/jcb.201611001 ; PubMed Central PMCID: PMC5674879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Satou Y, Miyazato P, Ishihara K, Yaguchi H, Melamed A, Miura M, et al. The retrovirus HTLV-1 inserts an ectopic CTCF-binding site into the human genome. Proc Natl Acad Sci U S A. 2016;113(11):3054–9. Epub 2016/02/29. doi: 10.1073/pnas.1423199113 ; PubMed Central PMCID: PMC4801255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Krijger PHL, Geeven G, Bianchi V, Hilvering CRE, de Laat W. 4C-seq from beginning to end: A detailed protocol for sample preparation and data analysis. Methods. 2020;170:17–32. Epub 2019/07/29. doi: 10.1016/j.ymeth.2019.07.014 . [DOI] [PubMed] [Google Scholar]
  • 19.Kataoka K, Nagata Y, Kitanaka A, Shiraishi Y, Shimamura T, Yasunaga J, et al. Integrated molecular analysis of adult T cell leukemia/lymphoma. Nat Genet. 2015;47(11):1304–15. Epub 2015/10/06. doi: 10.1038/ng.3415 . [DOI] [PubMed] [Google Scholar]
  • 20.Rosewick N, Durkin K, Artesi M, Marcais A, Hahaut V, Griebel P, et al. Cis-perturbation of cancer drivers by the HTLV-1/BLV proviruses is an early determinant of leukemogenesis. Nat Commun. 2017;8:15264. Epub 2017/05/24. doi: 10.1038/ncomms15264 ; PubMed Central PMCID: PMC5457497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Takeda S, Maeda M, Morikawa S, Taniguchi Y, Yasunaga J, Nosaka K, et al. Genetic and epigenetic inactivation of tax gene in adult T-cell leukemia cells. Int J Cancer. 2004;109(4):559–67. Epub 2004/03/03. doi: 10.1002/ijc.20007 . [DOI] [PubMed] [Google Scholar]
  • 22.Heinz S, Texari L, Hayes MGB, Urbanowski M, Chang MW, Givarkes N, et al. Transcription Elongation Can Affect Genome 3D Structure. Cell. 2018;174(6):1522–-36.e22.. Epub 2018/08/23. doi: 10.1016/j.cell.2018.07.047 ; PubMed Central PMCID: PMC6130916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kempfer R, Pombo A. Methods for mapping 3D chromosome architecture. Nat Rev Genet. 2020;21(4):207–26. Epub 2019/12/19. doi: 10.1038/s41576-019-0195-2 . [DOI] [PubMed] [Google Scholar]
  • 24.Davidson IF, Peters JM. Genome folding through loop extrusion by SMC complexes. Nat Rev Mol Cell Biol. 2021;22(7):445–64. Epub 2021/03/27. doi: 10.1038/s41580-021-00349-7 . [DOI] [PubMed] [Google Scholar]
  • 25.Lohia R, Fox N, Gillis J. A global high-density chromatin interaction network reveals functional long-range and trans-chromosomal relationships. Genome Biol. 2022;23(1):238. Epub 2022/11/11. doi: 10.1186/s13059-022-02790-z ; PubMed Central PMCID: PMC9647974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Coffin JM, Hughes SH, Varmus HE. The Interactions of Retroviruses and their Hosts. In: Coffin JM, Hughes SH, Varmus HE, editors. Retroviruses. Cold Spring Harbor (NY)1997. [PubMed] [Google Scholar]
  • 27.Liu R, Yeh YJ, Varabyou A, Collora JA, Sherrill-Mix S, Talbot CC Jr., et al. Single-cell transcriptional landscapes reveal HIV-1-driven aberrant host gene transcription as a potential therapeutic target. Sci Transl Med. 2020;12(543). Epub 2020/05/15. doi: 10.1126/scitranslmed.aaz0802 ; PubMed Central PMCID: PMC7453882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cheng X, Joseph A, Castro V, Chen-Liaw A, Skidmore Z, Ueno T, et al. Epigenomic regulation of human T-cell leukemia virus by chromatin-insulator CTCF. PLoS Pathog. 2021;17(5):e1009577. Epub 2021/05/22. doi: 10.1371/journal.ppat.1009577 ; PubMed Central PMCID: PMC8174705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Matsuo M, Ueno T, Monde K, Sugata K, Tan BJY, Rahman A, et al. Identification and characterization of a novel enhancer in the HTLV-1 proviral genome. Nat Commun. 2022;13(1):2405. Epub 2022/05/04. doi: 10.1038/s41467-022-30029-9 ; PubMed Central PMCID: PMC9065021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Melamed A, Fitzgerald TW, Wang Y, Ma J, Birney E, Bangham CRM. Selective clonal persistence of human retroviruses in vivo: Radial chromatin organization, integration site, and host transcription. Sci Adv. 2022;8(17):eabm6210. Epub 2022/04/30. doi: 10.1126/sciadv.abm6210 ; PubMed Central PMCID: PMC9054021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gillet NA, Malani N, Melamed A, Gormley N, Carter R, Bentley D, et al. The host genomic environment of the provirus determines the abundance of HTLV-1-infected T-cell clones. Blood. 2011;117(11):3113–22. Epub 2011/01/14. doi: 10.1182/blood-2010-10-312926 ; PubMed Central PMCID: PMC3062313. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Susan R Ross, Edward William Harhaj

1 Nov 2023

Dear Dr. Bangham,

Thank you very much for submitting your manuscript "The impact of HTLV-1 expression on the 3D structure and expression of host chromatin" for consideration at PLOS Pathogens. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by three independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Edward William Harhaj, Ph.D.

Academic Editor

PLOS Pathogens

Susan Ross

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: The manuscript by Yaguchi et al describes studies of 3D conformation analysis and HTLV-1 transcription in infected cell lines and reported that plus strand transcription reduces aberrant chromatin loops with the CTCF binding site in the integrated provirus. The results are not interpretable in light of the fact that some of data (e.g. Fig 2) were not printed properly. In addition, key controls are missing. The functional significance of the chromatin loops with regards to infected cellular transcription, proliferation, or survival were not examined. Lastly, the manuscript is very difficult to read.

Reviewer #2: Previously, this group reported that the HTLV-1 genome contains a CTCF binding site involved in creating abnormal chromatin looping with the host cell genome. In the current manuscript, Yaguchi et al. seeks to analyze the effect of HTLV-1 transcription on chromatin architecture and transcription of neighboring DNA. From several HTLV-1-infected clones, the authors sorted the cells into populations based on the presence or absence of Tax expression (i.e. sense transcription). Using chromosome conformation capture (q4C and 3C) they observed that the provirus sense transcription reduces chromatin loops with the provirus, regardless of the presence of CTCF sites in the host genomic regions. In addition, correlating with the loss of chromatin loops, sense transcription is linked to increased transcription mainly from the 3’ side of the provirus, probably due to read-through transcription. This is an interesting study that confirms the profound effect of HTLV-1 integration and transcription on the host cell chromatin architecture and host transcription. A few comments below will help to clarify the manuscript and the findings.

Reviewer #3: HTLV-1 proviral DNA contains a CTCF binding site thought to be involved in regulating the formation of chromatin loops. This manuscript by Yaguchi et al. examined the impact of the HTLV-1 provirus on the structure and transcriptional activities of the host DNA sequences nearby. HTLV-1-infected cells undergoing Tax-mediated viral transcription or not were isolated and analyzed by a technique known as the quantitative chromosome conformation capture (q4C) and RNAseq. The study suggests that Tax-mediated proviral plus strand transcription alters the mRNA transcription and splicing of the host exons downstream of the viral 3’ LTR and reduces the formation of chromatin loops. The CTCF site in HTLV-1 proviral DNA appears to affect the transcriptional activation of a host gene distal to the proviral DNA.

Overall, the authors employed state-of-the-art techniques in the study. The experiments were well-controlled, and the data obtained were convincing. The most likely explanation for the “aberrant” host mRNA transcription observed is the transcriptional readthrough, initiating at the 5’ LTR (driven by Tax), elongating across 3’ LTR into the host gene, and followed by mRNA splicing with the HTLV-1 1st exon spliced with the host exon immediately downstream of the 3’ LTR. The reduction in chromatin loop formation during active transcription is novel and interesting. The notion that chromatin loop formation may allow the HTLV-1 provirus to activate distal genes (via Tax-bound HTLV-1 enhancer?) is hinted at but not fully developed. The authors can improve the manuscript further by addressing the following comments.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: Major Concerns

1) The graphs in Fig 2A are missing. The numbers of peaks listed in Fig 2B are very small and inadequate for any firm conclusions.

2) Almost all of the results are from subjects with HAM/TSP. Are these results unique to this subset?

3) It is unclear whether or not fixation, flow sorting, or transduction of reporters affected the chromatin conformation analysis or transcription results. Studies should have been performed to examine chromatin loops and transcription not involving the provirus to show that there is no effect of Tax expression.

4) Why does transcription affect chromatin contacts outside the provirus but not within the provirus? This seems counterintuitive.

5) The changes in 4C profile in Fig 5C with flavoperidol are subtle and limited to a single cell line, and thus not convincing. Why did flavoiridol not decrease Tax mRNA levels?

Reviewer #2: 1) Figure 3 is missing some information. Are we looking at all the q4C peak regions of all cell clones analyzed? The genomic position (distance from integration site) is missing numbers (Mb) to denote the positions analyzed.

2) Figure 5 does not show a reduction of HTLV-1 sense transcription correlating with a reduction of 3’ host sense transcription, since the elongation inhibitor does not significantly affect Tax mRNA. The authors should explain or speculate on reasons for this discrepancy (e.g., Tax (HTLV-1) mRNA stability?). Alternative experiments should be developed to better correlates HTLV sense transcription with read-through transcription and chromatin loops. Possibly, the authors could show RNA polymerase II occupancy (phosphorylation status?) at the HTLV-1 provirus before and after flavopiridol treatment.

3) The authors show by q4C that the sense transcription affects close and long-range chromatin loops; however, they focus on transcription in closed proximity of the provirus. While they show that provirus read-through sense transcription affects transcription in the neighboring host genome, they do not show the effect of sense transcription/chromatin changes on transcription at distal host genomic sites, and therefore, the study appears incomplete. The study would be made more thorough by determining the status of transcription of the genes located further downstream of the HTLV-1 provirus. Specifically, the authors could analyze genes further downstream of regions a, b and c in Figure 5, clone 11.63. If the provirus transcription affects long-range chromatin loops, one would think that the transcription of genes within these loops would also be affected. It seems that S7 Fig in the Discussion section is an attempt to demonstrate long-range effects, but without much explanation, this figure is difficult to interpret. Also, the data were derived from a different experimental design.

4) Integrating S7 Fig and S8 Fig in the Discussion without much explanation does not make the reading and comprehension of the manuscript easy. The manuscript would benefit from having these figures described in more detail at appropriate points in Results section.

Reviewer #3: 1. The chromatin loops formed by the proviral DNA involve only host DNA sequences that are adjacent to the integration sites and syntenic. Is this because of the constraint of the q4C technique or the nature of chromatin loops in general? The authors should discuss these points in more depth.

2. It is reported that there are between 15000 and 40000 CTCF binding sites in the human genome. Do most chromatin regions that HTLV-1 proviral DNA interacts with (as mapped in this study) contain one or more CTCF sites?

3. It should be pointed out to the readers that the so-called aberrant host transcription and splicing observed is primarily due to mRNA transcriptional readthrough and splicing (Fig. 4, Fig. S5). This is well known for retroviral transcription, and HTLV-1 is no exception.

4. The TBX4B study (Fig. S7) suggests the long-range “enhancer” effect of HTLV-1 requires the CTCF site and active viral transcription (with the recruitment of Tax-CREB-CBP/p300?). Is this observed in other HTLV-1-infected cell clones in this study?

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: Minor Concerns

6) In Table 1, if the designations of subjects are patient initials they should be removed. Also, it is not clear to what the provirus orientation is compared – the centrosome?

7) Table 2 should be combined with Table 1. Why did they not look at Tax expression in all the cell clones? Why were so many cells positive in 3 cell lines (36-56% of cells) when the text says that “a small proportion of cells express intense bursts of Tax.”

8) Fig S2 lacks a label on the x-axis. Why are the patterns so different in the top and bottom figures of each of S2 Fig and S3 Fig which presumably are replicate analyses?

9) Fig 3A is unclear – is this just a schematic without actual data? why are there 2 peaks downstream of the provirus and only 1 upstream? It is surprising that there are so manyt contact sites without CTCF sites. In Fig 3 legend what is meant by “….both in both peaks with a CTCF site…” In Fig 3C, what are each of the panels?

10) How were the genes selected in S7 Fig for analysis? Can these results be generalized?

Reviewer #2: 1) The authors show that read-through of same sense transcription in the host genome could lead to production of aberrant mRNAs. They then comment in the Discussion that read-through of same sense transcription in the host genome was rarely observed in other previous studies using ATL cells (Kataoka et al., Rosewick et al.). Could these observations be interpreted as infected clones that produce aberrant mRNAs from read-through, same-sense transcription being selected against becoming ATL cells?

2) Additionally, did the authors determine whether these aberrant mRNAs are stable? Could they be exported from the nucleus and possibly translated? A discussion regarding this aspect would be interesting since they mentioned “potential biological functions” of these transcripts in the Discussion.

3) Table 1: R and F legends are missing

4) Figure 2: In figure 2A, clone HA1 is also labelled clone 3.60. Figure 2B, C, D: 11.63 and 3.60 appear to have the same khaki color and cannot be discriminated; d2EGFP-11.50 is not visible.

5) In S6 Fig, clone 11.60 is mentioned. Should it be clone 11.63?

Reviewer #3: None

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here on PLOS Biology: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Decision Letter 1

Susan R Ross, Edward William Harhaj

12 Feb 2024

Dear Dr. Bangham,

We are pleased to inform you that your manuscript 'The impact of HTLV-1 expression on the 3D structure and expression of host chromatin' has been provisionally accepted for publication in PLOS Pathogens.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Edward William Harhaj, Ph.D.

Academic Editor

PLOS Pathogens

Susan Ross

Section Editor

PLOS Pathogens

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************************************************

Reviewer Comments (if any, and for reference):

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #2: All my comments were addressed.

Reviewer #4: In this manuscript by Yaguchi et al., authors found that HTLV-1 plus strand transcription induces aberrant transcription in the flanking genome and splicing with downstream splice acceptor sites. Upon enhanced plus strand transcription, the frequency of chromatin loop formation with the host chromatin in the vicinity of the integrated provirus declined but could be recovered upon short-term application of an inhibitor of transcription elongation. This suggests that HTLV-1 proviral gene expression leads to reversible disruption of chromatin loops in the vicinity of the integrated provirus.

This study is well-written and reports both expected and unexpected novel findings. HTLV-1 integration impacts transcription of flanking host genes, which is known from other retroviruses including HTLV-1/BLV, but convincingly shown in this study using several single-cell clones from HAM/TSP patients applying innovative techniques. The study also provides examples of aberrant splicing following HTLV-1 integration. The negative impact of HTLV-1 plus strand transcription on chromatin looping is the highlight of this study and of great interest to the scientific community since it is the first report to show that not the presence of the provirus, but its transcription is responsible for disruption of chromatin looping.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #2: (No Response)

Reviewer #4: This manuscript had already been reviewed and the authors provide a detailed point-to-point response to all comments raised by the reviewers. Although I agree that it would be interesting to see whether the q4C profile of more clones than one changes upon blocking transcription elongation (Fig 5C), the data provided and the arguments raised by the authors convince me.

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #2: (No Response)

Reviewer #4: none

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #4: No

Acceptance letter

Susan R Ross, Edward William Harhaj

27 Feb 2024

Dear Professor Bangham,

We are delighted to inform you that your manuscript, "The impact of HTLV-1 expression on the 3D structure and expression of host chromatin," has been formally accepted for publication in PLOS Pathogens.

We have now passed your article onto the PLOS Production Department who will complete the rest of the pre-publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Pearls, Reviews, Opinions, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript, if you opted to have an early version of your article, will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. DNA sequence of primers and probes used in 3C-qPCR and RT-qPCR.

    (TIFF)

    ppat.1011716.s001.tiff (9.4MB, tiff)
    S1 Fig. Flow sorting of Taxand Tax+ cells.

    HTLV-1-infected T cell clone 11.63 was stained for live cells, crosslinked in 1% formaldehyde, stained intracellularly for Tax protein, and flow-sorted to isolate Taxand Tax+ subsets (see Methods).

    (TIFF)

    ppat.1011716.s002.tiff (9.9MB, tiff)
    S2 Fig. Flow sorting of provirus-expressing and non-expressing cells by an independent technique produced similar q4C profiles.

    (A) q4C profiles of Tax(upper panel) and Tax+ (lower panel) cells sorted from clone TBX4B after intracellular staining of Tax. (B) q4C profile of non-expressing (GFP-) (upper panel) and provirus-expressing (GFP+) (lower panel) cells isolated from d2EGFP-TBX4B clones, selected by GFP signal (without Tax staining).

    (TIFF)

    ppat.1011716.s003.tiff (9.9MB, tiff)
    S3 Fig. Transcription of the proviral plus-strand is accompanied by a loss of contacts between the provirus and flanking host chromatin.

    (A) q4C profiles of Tax(upper panel) and Tax+ (lower panel) cells from clone 11.63. (B) q4C profiles of non-expressing (GFP) (upper panel) and provirus-expressing (GFP+) cells (lower panel) from clone d2EGFP-11.50. Vertical axes show the normalized frequency of chromatin contacts between the provirus and the host genome.

    (TIFF)

    ppat.1011716.s004.tiff (9.9MB, tiff)
    S4 Fig. Quantitative 3C (3C-qPCR) analysis confirmed decreased frequencies of chromatin looping.

    (A) q4C profile of Taxcells of clone 11.63. The technical peak seen in the q4C viewpoint (VP) and two of the main peaks (Peak 1) and (Peak 2) identified in the output of the Taxfraction of clone 11.63. (B) As control, the frequency of chromatin interactions was quantified by 3C-qPCR on sorted Taxand Tax+ cells, using a primer pair and Taqman probe to detect the contacts between two regions: VP and another region in the provirus (S1 Table). There was no significant difference between Tax+ and Taxcells (combined p value = 0.932, Fisher’s method of combining p values). (C and D) Primer pairs and probe were used to detect long-range chromatin contacts between the provirus and host genome region at Peak 1 (C) or Peak 2 (D). Results of 3C-qPCR of two biological replicates (rep) are shown. Peak1 contact frequency was significantly higher in Taxcells than in Tax+ cells (combined p value 0.012, Fisher’s method). Peak2 contact frequency was significantly higher in Tax- cells than in Tax+ cells (combined p value 0.000607, Fisher’s method).

    (TIFF)

    ppat.1011716.s005.tiff (9.9MB, tiff)
    S5 Fig. Fusion transcripts between HTLV-1 provirus and host genome.

    (A) Identification of splice sites of fusion transcripts in the plus-strand-expressing cells of clone d2EGFP-11.63 and (B) in timer protein reporter clone Timer-3.60. Plus-strand fusion transcripts between HTLV-1 exon1 (H1) and same sense, 3′ side host gene exon (blue) or novel host exons (green) are shown with fused sequences.

    (TIFF)

    ppat.1011716.s006.tiff (9.9MB, tiff)
    S6 Fig. Treatment with flavopiridol for 1.5 hrs did not alter Tax protein expression.

    Clone 11.63 cells were treated with 1nM flavopiridol for 1.5 hrs and then stained for Live/Dead and then Tax protein.

    (TIFF)

    ppat.1011716.s007.tiff (9.9MB, tiff)
    S7 Fig. Expression of distant host genes correlates with expression of tax.

    (A) Normalized mRNA read counts of two genes that lie >1.4 Mb from the provirus in clone Timer-TBX4B, in the four successive phases of the HTLV-1 plus-strand transcriptional burst: DN–double negative (HTLV-1 silent); blue–early burst; DP double-positive (mid-burst); red–late burst. Results of two independent experiments are shown. Expression of both SMC1B and RIBC2 closely followed the trajectory of the HTLV-1 burst in clone Timer-TBX4B, but not in the unrelated HTLV-1-infected clone Timer-3.60. Data from [11]. (B) Knockout of the CTCF binding site in the provirus in clone Timer-TBX4B (middle panel) abolished the transcription of both SMC1B and RIBC2 observed in the wild-type clone (lower panel). Results of two independent experiments are shown. Neither gene was expressed in an unrelated HTLV-1-infected clone ED. The results suggest that maintenance of a CTCF-dependent chromatin loop between the host genome and the provirus is required for the burst of transcription of these distant genes associated with the HTLV-1 plus-strand burst.

    (TIFF)

    ppat.1011716.s008.tiff (3.9MB, tiff)
    S8 Fig. RNA-seq analysis of HTLV-1 proviral expression in live-sorted cells of clone Timer-TBX4B.

    Cells were sorted into four populations based on the fluorescence of the Timer protein, DN–double negative (HTLV-1 silent); blue–early burst; DP double-positive (mid-burst); red–late burst. [11].(A) Coverage tracks in IGV of plus strand HTLV-1 provirus transcription and (B) host gene PNPLA3 transcription. (C) Transcription in PNPLA3 exons 1 and 2 (note range on vertical axis 0 to 500).

    (TIFF)

    ppat.1011716.s009.tiff (9.3MB, tiff)
    Attachment

    Submitted filename: 240108 - Yaguchi et al - response to reviewers.docx

    ppat.1011716.s010.docx (62KB, docx)

    Data Availability Statement

    Sequence data have been deposited at the European Nucleotide Archive, (ENA, https://www.ebi.ac.uk/ena/browser/home), accession number PRJEB71982.


    Articles from PLOS Pathogens are provided here courtesy of PLOS

    RESOURCES