eLife. 2020 Sep 3;9:e57189. doi: 10.7554/eLife.57189

Repurposing of KLF5 activates a cell cycle signature during the progression from a precursor state to oesophageal adenocarcinoma

Connor Rogerson 1, Samuel Ogden 1, Edward Britton 1; The OCCAMS Consortium, Yeng Ang 2,3, Andrew D Sharrocks 1
Editors: Irwin Davidson 4, Kevin Struhl 5
PMCID: PMC7544504  PMID: 32880368

Abstract

Oesophageal adenocarcinoma (OAC) is one of the most common causes of cancer deaths. Barrett’s oesophagus (BO) is the only known precancerous precursor to OAC, but our understanding about the molecular events leading to OAC development is limited. Here, we have integrated gene expression and chromatin accessibility profiles of human biopsies and identified a strong cell cycle gene expression signature in OAC compared to BO. Through analysing associated chromatin accessibility changes, we have implicated the transcription factor KLF5 in the transition from BO to OAC. Importantly, we show that KLF5 expression is unchanged during this transition, but instead, KLF5 is redistributed across chromatin to directly regulate cell cycle genes specifically in OAC cells. This new KLF5 target gene programme has potential prognostic significance as high levels correlate with poorer patient survival. Thus, the repurposing of KLF5 for novel regulatory activity in OAC provides new insights into the mechanisms behind disease progression.

Research organism: Human

eLife digest

Acid fluids present in the gut can sometimes ‘go up’ and damage the oesophagus, the pipe that connects the mouth and the stomach. As a result, a small number of individuals can develop Barrett’s oesophagus, a condition where cells in the lining of the lower oesophagus show abnormal shapes. In certain patients, these cells then become cancerous, but exactly how this happens is unknown. This lack of understanding contributes to late diagnoses, limited treatment and low survival rates.

Many cancers feature ‘signature’ mutations in a set of genes that controls how a cell can multiply. Yet, in the case of cancers of the lower oesophagus, known genetic changes have had a limited impact on our understanding of the emergence of the disease. Here, Rogerson et al. focused instead on non-genetic changes and studied transcription factors, the proteins that bind to regulatory regions of the DNA to switch genes on and off.

A close inspection of cancer cells in the lower oesophagus revealed that, in that state, a transcription factor called KLF5 controls the abnormal activation of genes involved in cell growth. This is linked to the transcription factor adopting a different pattern of binding onto regulatory regions in diseased cells. Crucially, when the cell growth genes regulated by KLF5 are activated, patients have lower survival rates. Further work is now required to examine whether this finding could help to identify patients who are most at risk from developing cancer. More broadly, the results from the work by Rogerson et al. demonstrate how transcription factors can be repurposed in a disease context.

Introduction

Oesophageal cancer is the eighth most common cancer worldwide, and its 5-year survival rate of 15% makes it the sixth most common cause of cancer-related death (Ferlay et al., 2015; Pennathur et al., 2013). A subtype of oesophageal cancer, oesophageal adenocarcinoma (OAC), is the predominant subtype in many Western countries and its incidence is rising rapidly (Coleman et al., 2018). Patients with OAC often present at a late stage with advanced disease (Smyth et al., 2017). The lack of molecular knowledge of OAC, combined with the lack of tailored therapies, contributes to the low survival of OAC patients.

The accepted model of OAC development is the progression from an intestinal metaplastic condition of the lower oesophagus, known as Barrett’s oesophagus (BO), to OAC through increasing stages of dysplasia (Burke and Tosh, 2012; Spechler and Souza, 2014). Many mutations found in OAC are also present in BO, especially TP53, which suggests a stepwise transition to OAC (Ross-Innes et al., 2015; Stachler et al., 2015). Focal amplifications differ as they largely occur in OAC compared to BO (Lin et al., 2012; Stachler et al., 2015; Yamamoto et al., 2016). The amplified genes can be grouped into functional biological pathways with the RAS-ERK signalling pathway (e.g. ERBB2; EGFR; KRAS) and GATA transcription factors (GATA4; GATA6) being the most common (Frankell et al., 2019; Lin et al., 2012; Cancer Genome Atlas Research Network et al., 2017). The morphology of BO differs from the oesophageal epithelia by the presence of a columnar epithelium and secretory goblet cells, rather than squamous epithelium (reviewed in Spechler and Souza, 2014). Genomic and transcription events have been observed to differ between BO and OAC. Mutations in TP53 are more frequent in BO from patients that had progressed to OAC (Stachler et al., 2018) and SMAD4 mutations appear to occur exclusively in OAC, although at a low frequency (Weaver et al., 2014). Increased TGFβ signalling through other SMAD family members, SMAD2/3, promotes growth in OAC cells (Blum et al., 2019). Additionally, increased expression and increased activity of AP-1 transcription factors occurs in the transition from BO to OAC (Blum et al., 2019; Britton et al., 2017; Maag et al., 2017). Despite these studies, the definitive molecular mechanisms of progression to OAC are poorly understood and biomarkers to identify patients at risk of progression are lacking.

Changes to the chromatin landscape have been implicated in many cancers and chromatin accessibility changes during tumourigenesis are a major factor in altering regulatory element activity (Britton et al., 2017; Corces et al., 2018; Davie et al., 2015; Denny et al., 2016; Kelso et al., 2017; Rendeiro et al., 2016; Tome-Garcia et al., 2018; Zhou and Guo, 2018). We recently used Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) to ascertain the molecular basis of BO and identified a set of transcription factors that define the BO chromatin landscape and are retained in OAC (Rogerson et al., 2019). Here, we took a similar approach to discover important transcriptional regulators (Figure 1A) that are specifically operational in OAC and hence contribute to the molecular basis of disease progression from BO to OAC. We compared the open chromatin landscape in BO and OAC patient biopsies and uncovered KLF5 as an important transcriptional regulator that is repurposed to directly drive a cell cycle gene expression signature during the progression of BO to OAC.

Figure 1. Oesophageal adenocarcinoma (OAC) tumourigenesis is associated with enhanced cell cycle gene activity.

(A) Schematic of possible transcription factor network induction during OAC development. Coloured ovals represent different transcription factors. (B) Scatter plot of significant differentially (±1.5 x, Q-value <0.05) expressed genes between human Barrett’s oesophagus n = 13 and human OAC n = 12 samples (Maag et al., 2017). (C) Violin plots of expression of differentially expressed genes between Barrett’s oesophagus (n = 13) and oesophageal adenocarcinoma (n = 12) from discovery dataset (D; Maag et al., 2017) and validation dataset (V; BO = 3; OAC n = 4). Genesets are shown for upregulated (left) and downregulated (right) in OAC. (D) Gene set enrichment analysis of differentially expressed genes. The top two upregulated gene sets are shown with normalised enrichment score (NES) and Q-value. (E) Biological pathway GO term analysis of upregulated genes. The top 10 terms are shown. See also Figure 1—figure supplement 1.

Figure 1—figure supplement 1. Differential gene expression analysis between Barrett’s and OAC patient samples.

(A) PCA plot of 13 Barrett’s oesophagus (blue) and 12 OAC (red) RNA-seq samples (Maag et al., 2017). (B) GSEA plots of significantly downregulated gene sets in OAC compared to Barrett’s oesophagus. Normalised enrichment score (NES) and Q-value are shown. (C) Biological pathway GO term analysis of significantly downregulated genes in OAC compared to Barrett’s oesophagus. (D, E) Box plots of Log2(1+FPKM) expression from BO (blue) and OAC (red) samples in discovery (D; Maag et al., 2017) and validation (V; this study) datasets for upregulated example genes (D) and downregulated example genes (E). Whiskers represent minimum and maximum values and one-way ANOVA p-value is shown.

Results

Enhanced cell cycle activity defines BO progression to OAC

To begin to understand the molecular events that distinguish OAC from the BO precursor state, we first established the differential gene expression profiles between BO and OAC. We analysed public human BO and OAC RNA-seq data (Maag et al., 2017). These samples separate well after principal component analysis (PCA), therefore we retained all samples for further analysis (Figure 1—figure supplement 1A). Performing differential gene expression analysis, we identified 905 differentially expressed genes between BO and OAC (±1.5 x; Q-value <0.05; Figure 1B; Supplementary file 1). Of these 905 genes, 465 are upregulated in OAC and 440 are downregulated in OAC compared to BO. To validate these findings, we analysed RNA-seq data from our own sample collection (3 BO and 3 OAC). Genes that were upregulated in OAC in the discovery dataset were significantly upregulated in the validation dataset, and likewise for genes downregulated in OAC (Figure 1C). To gain insights into the biological pathways behind these differentially expressed genes, we used two approaches. Firstly, Gene Set Enrichment Analysis (GSEA) uncovered two cell cycle associated terms, ‘G2M checkpoint’ and ‘E2F1 targets’, as the most significant upregulated gene sets in OAC (Figure 1D). Conversely, ‘Fatty acid metabolism’ and ‘p53 pathway’ are the most significant downregulated gene sets (Figure 1—figure supplement 1B). Secondly, biological pathway gene ontology analysis of upregulated genes revealed many cell cycle associated terms, such as ‘Nuclear division’, ‘Regulation of mitotic cell cycle’ and ‘DNA replication’ (Figure 1E). Example genes such as CDC25B, CENPI and E2F1 all showed significant upregulation in OAC compared to BO in both datasets (Figure 1—figure supplement 1D). Analysis of downregulated genes uncovered metabolism-associated terms, such as ‘alcohol metabolic process’, ‘monocarboxylic acid metabolic process’ and ‘Lipid catabolic process’ (Figure 1—figure supplement 1C). Representative example genes from these pathways such as IDI1, ADH4 and CIDEC all show significant downregulation in both datasets (Figure 1—figure supplement 1E). These initial results indicate a strong upregulation of genes associated with cell cycle processes during the progression from BO to OAC, accompanied by the inactivation of genes controlled by the p53 pathway and genes associated with metabolism.
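
As an illustration only, the thresholding step described above can be sketched in Python. The function name, the toy gene labels and their values are hypothetical. The sketch assumes that the ±1.5x cut-off is applied symmetrically on the log2 scale together with Q < 0.05; it is not the pipeline used in the study.

```python
import math

def classify_genes(results, min_fold=1.5, max_q=0.05):
    """Split genes into up/down lists using fold-change and Q-value cut-offs.

    `results` maps gene name -> (log2 fold change OAC vs BO, Q-value).
    """
    log2_cut = math.log2(min_fold)
    up, down = [], []
    for gene, (log2fc, q) in results.items():
        if q >= max_q:
            continue  # not significant, ignore regardless of fold change
        if log2fc >= log2_cut:
            up.append(gene)
        elif log2fc <= -log2_cut:
            down.append(gene)
    return sorted(up), sorted(down)

# Hypothetical values for illustration only (not taken from the study).
toy = {
    "GENE_A": (1.2, 0.001),   # passes both cut-offs, upregulated
    "GENE_B": (-0.9, 0.01),   # passes both cut-offs, downregulated
    "GENE_C": (0.3, 0.001),   # significant but fold change too small
    "GENE_D": (2.0, 0.20),    # large fold change but not significant
}
up, down = classify_genes(toy)
print(up, down)
```

Working on the log2 scale keeps the cut-off symmetric, so a 1.5-fold increase and a 1.5-fold decrease are treated identically.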

Chromatin accessibility changes in the transition from BO to OAC

To identify putative transcriptional regulators that may drive the transition to OAC and impact on this enhanced cell cycle profile, we analysed the accessible chromatin landscape using ATAC-seq from patient biopsies. To supplement our previous ATAC-seq datasets from BO and OAC patients (Britton et al., 2017; Rogerson et al., 2019), we performed ATAC-seq on two additional OAC biopsies, which were quality-checked and reproducible (Figure 2—figure supplement 1A and B). We wanted to focus on the differentially expressed genes in OAC compared to BO, therefore we generated a set of accessible regions representing potential regulatory regions that are associated with this set of genes. We took all ATAC-seq peaks from all samples within +/- 250 kb of a TSS of a differentially expressed gene (Figure 2A). After merging so that only unique peaks remained, 35,220 regions were used for further analyses (Supplementary file 2). We first performed principal component analysis on normalised ATAC-seq signal of all BO and OAC samples to identify differences between samples (Figure 2B). This led to clustering of all BO samples and clustering of most OAC samples. OAC samples T_003 and T_005 did not cluster with the other OAC samples and were therefore removed from the subsequent differential accessibility analysis. We then carried out differential accessibility analysis between BO and OAC on this peak set (Figure 2C; Supplementary file 2). A total of 1495 regions were significantly differentially accessible (±2 x; Q-value <0.1), the majority of which increased in accessibility (1327/1495). An example gene locus which shows differential accessibility in OAC is centred on KRT19 (Figure 2D). Within this locus, both gene promoters (14%; 1/7) and distal regulatory regions (86%; 6/7) gain accessibility in OAC. 
To assess whether the observed changes in accessibility near differentially expressed genes are common to other OACs, we compared our ATAC-seq data to independent, previously published ATAC-seq datasets from TCGA-ESCA oesophageal adenocarcinoma samples (Figure 2D, bottom; Figure 2E; Corces et al., 2018). TCGA-ESCA samples showed similar open chromatin peak profiles and clustered with our OAC samples with the exception of one sample, which clusters with our BO samples (Figure 2E). The chromatin accessibility profiles near genes differentially expressed in OAC are therefore reproducible across patients.
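
The peak selection and merging steps described in this section can be sketched as follows. This is a minimal illustration, not the code used in the study: the function names and toy coordinates are hypothetical, and it assumes that a peak counts as "within ±250 kb" when any part of it overlaps the window around a TSS.

```python
def peaks_near_tss(peaks, tss_sites, window=250_000):
    """Keep peaks (chrom, start, end) that overlap a +/- window around any TSS."""
    kept = []
    for chrom, start, end in peaks:
        for tss_chrom, tss_pos in tss_sites:
            if (chrom == tss_chrom
                    and start <= tss_pos + window
                    and end >= tss_pos - window):
                kept.append((chrom, start, end))
                break  # one matching TSS is enough
    return kept

def merge_peaks(peaks):
    """Merge overlapping intervals per chromosome so each region is unique."""
    merged = []
    for chrom, start, end in sorted(peaks):
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            last = merged[-1]
            merged[-1] = (chrom, last[1], max(last[2], end))
        else:
            merged.append((chrom, start, end))
    return merged

# Hypothetical coordinates for illustration only.
toy_peaks = [
    ("chr1", 100_000, 100_500),
    ("chr1", 100_400, 100_900),   # overlaps the previous peak
    ("chr1", 900_000, 900_500),   # too far from the TSS
    ("chr2", 100_000, 100_500),   # different chromosome
]
toy_tss = [("chr1", 300_000)]
regions = merge_peaks(peaks_near_tss(toy_peaks, toy_tss))
print(regions)
```

For genome-scale peak sets an interval tree or a dedicated tool would be faster than the nested loop; the quadratic scan is kept here only for readability.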

Figure 2. Altered chromatin accessibility landscape during OAC carcinogenesis.

(A) Schematic of ATAC-seq analysis. All peaks within ±250 kb of the TSS of a differentially expressed gene were assessed for differential accessibility between Barrett’s oesophagus and OAC. (B) Principal Component Analysis plot of log2(1+FPKM) ATAC-seq signal from all accessible regions within ±250 kb of a differentially expressed gene TSS from all human Barrett’s oesophagus (B; n = 4) and oesophageal adenocarcinoma samples (T; n = 6). (C) Heatmap of z-score ATAC-seq signal from human Barrett’s oesophagus (B; n = 4) and OAC (T; n = 4) samples at differentially accessible regions (±2 x; Q < 0.1). Hierarchical clustering of samples and regions performed using 1-Pearson correlation. (D) Example UCSC browser view of BO, OAC and TCGA ESCA ATAC-seq data surrounding the KRT19 locus with differentially accessible regions highlighted in red. (E) Correlation plot of Pearson correlation of log2(1+FPKM) ATAC-seq signal at differentially accessible regions. Hierarchical clustering performed using 1-Pearson correlation and the two main clusters are highlighted blue (BO) and red (OAC). TCGA samples are indicated by asterisks. See also Figure 2—figure supplement 1.

Figure 2—figure supplement 1. ATAC-seq analysis of patient OAC samples.

(A) Fragment length plot of log10 transformed ATAC-seq signal from technical replicates 1 and 2 of the T_007 OAC sample (left). Correlation plot of ATAC-seq signal from technical replicates 1 and 2 of the T_007 OAC sample at ATAC-seq peaks called from replicate 1 (right). Pearson correlation coefficient is shown. (B) Fragment length plot of log10 transformed ATAC-seq signal from technical replicates 1 and 2 of the T_008 OAC sample (left). Correlation plot of ATAC-seq signal from technical replicates 1 and 2 of the T_008 OAC sample at ATAC-seq peaks called from replicate 1 (right). Pearson correlation coefficient is shown. (C) Bar chart of percentage targets and percentage background of de novo discovered motifs at regions with significant decreased chromatin accessibility in OAC. De novo motifs, called transcription factor with motif match score (in brackets) and P-value are shown.

Next, we harnessed the differential accessibility data to uncover the identities of transcription factors bound in these regions. De novo motif discovery in regions that become more accessible in OAC revealed significantly enriched motifs for the AP-1, KLF, TBX, NFκB and p53 transcription factor families (Figure 3A; Supplementary file 3). AP-1 and KLF were clearly the most frequent motifs in the differential regions and showed the strongest match scores for the consensus motif. Regions that showed decreased chromatin accessibility in OAC are enriched in EWSR1-FLI1, ASCL2, GLI2, E2F and ZBTB18 motifs, albeit with relatively low match scores (Figure 2—figure supplement 1C; Supplementary file 3). To further assess which transcription factors might be involved in gene expression control, we carried out footprinting analysis on differentially accessible regions from our ATAC-seq datasets (Figure 3B; Bentsen et al., 2020). In differentially accessible regions, motifs for KLF (e.g. KLF4, KLF5 and KLF1) and AP-1 (e.g. FOS, JUNB, JUND, JUN and FOSL1/2) transcription factors showed the highest footprinting score in OAC, whereas motifs for homeobox transcription factors (e.g. HNF1A, HOXA5 and NKX2-5), ARID3A and MEF transcription factors (e.g. MEF2A and MEF2C) showed more footprinting in BO. To provide more evidence for transcription factor occupancy, we then plotted ATAC-seq signal across their motifs. Both FOS (AP-1) and KLF4 (KLF) motifs show a clear increase in footprint depth in OAC, indicative of more transcription factor binding (Figure 3C). We have previously identified AP-1 as an important regulator in OAC (Britton et al., 2017), but the role of KLF transcription factors in OAC is poorly understood. We therefore focussed on the potential role of KLF transcription factors in the progression of BO to OAC.

Figure 3. KLF5 controls a cell cycle gene expression programme in OAC.

(A) Bar chart of percentage targets and percentage background of de novo discovered motifs at increased accessible regions in OAC compared to Barrett’s oesophagus. De novo motif, called transcription factor with motif match score (brackets) and P-value are shown. (B) Scatter plot of differential footprinting depth around human transcription factor motifs in differentially accessible regions in BO and OAC tissue. Significant motifs with more footprint depth in BO are labelled blue and in OAC labelled red. KLF TF-binding motifs are highlighted in orange and AP1 motifs are in blue font. (C) BO (blue) and OAC (red) ATAC-seq signal at FOS (AP1) and KLF4 (KLF) motifs in differentially accessible regions. (D) Expression (FPKM) of KLF family transcription factors in OAC RNA samples (n = 12; Maag et al., 2017). (E) Pie chart of percentage of upregulated genes in patient OAC samples that are also downregulated with siKLF5 treatment. p-Value shown. (F) Biological pathway GO term analysis of OAC upregulated and siKLF5 downregulated genes and a random gene selection. (G) Gene set enrichment analysis of genes that are upregulated in OAC and downregulated by siKLF5 treatment. Top two downregulated gene sets are shown. Normalised enrichment scores (NES) and Q-values are shown. (H) Example UCSC Genome Browser view of KLF5 ChIP-seq binding at the CDC25B locus. KLF5 peaks highlighted in red. See also Figure 3—figure supplements 1 and 2.

Figure 3—figure supplement 1. Identification of the KLF5-regulated cistrome in OE19 cells.

(A) Immunoblot of protein lysate from OE19 cells treated with either siNT or siKLF5 and probed with antibodies against KLF5 and ERK1/2. (B) Correlation plots of RNA-seq data from three biological replicates. Pearson correlations are shown. (C) Scatter plot of significantly differentially expressed genes (red are up- and blue are down-regulated genes;±1.3 x, Q-value <0.05) in OE19 cells with siKLF5 treatment. (D) Biological processes GO term analysis of downregulated genes with siKLF5 treatment. Cell cycle related GO terms are indicated with arrows. (E) Biological processes of GO term analysis of upregulated genes with siKLF5. Metabolism related GO terms are indicated with arrows. (F) Gene set enrichment analysis of downregulated genes with siKLF5 treatment. Cell cycle related gene sets, normalised enrichment scores (NES) and Q-values are shown. (G) Gene set enrichment analysis of upregulated genes with siKLF5 treatment. Metabolism-related gene sets, normalised enrichment scores (NES) and Q-values are shown. (H) Bar chart of fold change in expression of cell-cycle-related genes in siKLF5-treated OE19 cells (blue; n = 3) or patient OAC samples (red).
Figure 3—figure supplement 2. KLF5 ChIP-seq analysis in OE19 cells.

(A) Immunoblot of KLF5 from OE19 cells following ChIP in different fractions; IgG FT (flow through from non-specific IgG precipitation), KLF5 FT (flow through from KLF5 precipitation), IgG elute (material eluted following IgG precipitation) KLF5 IP (material eluted following KLF5 immunoprecipitation). Arrows indicating immunoprecipitated protein (KLF5) and asterisks are IgG heavy chain. (B) Correlation plot of KLF5 ChIP-seq data from OE19 cells from two biological replicates. Pearson correlation coefficient is shown. (C) Venn diagrams showing overlap of peaks called by each replicate (Rep) for KLF5 ChIP-seq from OE19 cells. The numbers of peaks in each sector are given. (D) Bar chart of percentage targets and percentage background of de novo discovered motifs in OE19 KLF5 ChIP-seq peaks. De novo motif, called transcription factor with motif match score (in brackets) and p-value are shown. Note that the KLF5 motif is shown in the reverse orientation to those in other figures. (E) Cumulative distribution of the closest KLF5 ChIP-seq peaks to the TSS of genes that were significantly down or upregulated following siKLF5 treatment compared to expressed genes (FPKM >1) that were not significantly deregulated by siKLF5 treatment. Median distances to the TSS are shown.

KLF5 controls expression of cell cycle genes in OAC

To identify a specific KLF transcription factor that may be bound to these accessible regions, we analysed the expression of individual KLF transcription factors in OAC samples (Figure 3D). KLF5 was clearly the most highly expressed member of the KLF family in OAC. KLF5 has been previously implicated in oesophageal squamous cell carcinoma as a tumour suppressor (Tarapore et al., 2013) and has been identified as pro-tumorigenic in gastric cancer via amplifications (Chia et al., 2015). To determine the gene regulatory functions of KLF5, we carried out siRNA-mediated knockdowns of KLF5 in OE19 cells, a cell line that we identified as having a similar chromatin landscape to OAC biopsies (Rogerson et al., 2019) and that exhibits strong tumourigenic properties (Hassan et al., 2017). Knockdown of KLF5 was evident after 3 days of siRNA transfection (Figure 3—figure supplement 1A) and RNA-seq replicates were highly correlated (Figure 3—figure supplement 1B). Differential expression analysis identified 4934 genes (2637 upregulated and 2297 downregulated) with significant changes in gene expression (±1.3 x; Q-value <0.05; Figure 3—figure supplement 1C; Supplementary file 4). Biological pathway GO term analysis revealed several enriched terms including ‘DNA replication’ and ‘Regulation of mitotic cell cycle’ for downregulated genes, and terms involving ‘Oxidative phosphorylation’ and ‘mitochondrial gene expression’ for upregulated genes (Figure 3—figure supplement 1D and E). The terms associated with downregulated genes are reminiscent of the terms enriched in genes upregulated in OAC (see Figure 1). Moreover, GSEA also found similar gene sets: ‘mitotic spindle’, ‘G2M checkpoint’ and ‘E2F targets’ for downregulated genes and ‘oxidative phosphorylation’, ‘xenobiotic metabolism’ and ‘fatty acid metabolism’ for upregulated genes (Figure 3—figure supplement 1F and G).
Since the genes regulated by KLF5 are involved in similar processes as the genes aberrantly expressed in OAC, we asked whether any of the same genes are in each dataset. 21% (97/465) of the genes upregulated in OAC significantly overlap with those downregulated with siKLF5 (Figure 3E) and many of these are associated with cell cycle related functions, including genes encoding core cell cycle proteins like CCNE1, E2F1 and various MCM proteins (Figure 3—figure supplement 1H). Further analysis of the biological pathways enriched within these 97 genes identified very similar GO terms to those enriched in genes upregulated in OAC compared to BO (Figure 3F). GSEA also identified the same gene set terms: ‘G2M checkpoint’ and ‘E2F1 targets’ (Figure 3G).
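
The text does not state which test was used to assess the overlap between the two gene sets. One common choice is a one-sided hypergeometric test, sketched below. The counts 97, 465 and 2297 are taken from the text; the background of 20,000 genes is a hypothetical assumption made only so that the example runs, and the function name is illustrative.

```python
from math import comb

def hypergeom_tail(overlap, set_a, set_b, universe):
    """P(X >= overlap) when set_a genes are drawn at random from `universe`
    genes, of which set_b belong to the second list."""
    total = comb(universe, set_a)
    upper = min(set_a, set_b)
    tail = sum(
        comb(set_b, k) * comb(universe - set_b, set_a - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total

# 465 genes upregulated in OAC, 2297 genes downregulated by siKLF5, 97 shared.
# ASSUMPTION: a background of 20,000 genes (not stated in the source).
p_value = hypergeom_tail(overlap=97, set_a=465, set_b=2297, universe=20_000)
percent_overlap = round(100 * 97 / 465)
print(percent_overlap, p_value)
```

The resulting p-value depends strongly on the chosen background, so a real analysis would restrict the universe to genes that were detectably expressed in both datasets.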

Next, we asked whether these genes are directly regulated by KLF5, and carried out replicate ChIP-seq for KLF5 in OE19 cells; the replicates were highly correlated (Figure 3—figure supplement 2A and B). We therefore took the overlap of peaks between biological replicates forward for downstream analyses, resulting in 13,542 peaks (Figure 3—figure supplement 2C; Supplementary file 5). These peaks are highly enriched in the KLF5 motif, demonstrating the validity of the dataset, and also in AP1 (FRA1) and GATA (GATA6) motifs, which we have previously revealed in genome-wide studies as implicated in OAC (Britton et al., 2017; Rogerson et al., 2019; Figure 3—figure supplement 2D). Focussing on the 97 genes that are upregulated in OAC and also downregulated after KLF5 depletion, 97% have a KLF5 ChIP-seq peak within 0.5 Mb of the TSS and the median distance between a KLF5 ChIP-seq peak and the TSS of all significantly downregulated genes was 11,975 bp (Figure 3—figure supplement 2E). In contrast, KLF5-binding regions are further away (>20 kb) from the TSS of genes that were either unaffected by KLF5 depletion or whose expression was increased. This is indicative of direct activation by KLF5. An example gene is CDC25B, which harbours multiple KLF5 ChIP-seq peaks surrounding its locus (Figure 3H). Collectively, these results suggest a direct activator role of KLF5 in controlling cell cycle genes in OAC.
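
The distance-to-TSS comparison can be illustrated with a short sketch. It assumes, for simplicity, that all positions lie on a single chromosome and that each peak is represented by its centre; the function name and the toy coordinates are hypothetical and do not come from the study.

```python
from bisect import bisect_left
from statistics import median

def nearest_peak_distance(tss, sorted_peak_centres):
    """Distance in bp from a TSS to the closest peak centre (list must be sorted)."""
    i = bisect_left(sorted_peak_centres, tss)
    # Only the neighbours on either side of the insertion point can be closest.
    candidates = sorted_peak_centres[max(i - 1, 0): i + 1]
    return min(abs(tss - centre) for centre in candidates)

# Hypothetical coordinates for illustration only.
peak_centres = sorted([1_000, 50_000, 200_000])
tss_positions = [10_000, 60_000, 190_000]
distances = [nearest_peak_distance(t, peak_centres) for t in tss_positions]
median_distance = median(distances)
print(distances, median_distance)
```

Computing this median separately for downregulated, upregulated and unaffected genes gives the kind of comparison summarised in the cumulative distribution plot.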

The KLF5 cistrome is reconfigured during the progression from BO to OAC

Having determined a role for KLF5 in controlling cell-cycle-associated gene expression in OAC cells, we sought to determine the mechanism through which KLF5 acquires these functions. We first asked whether KLF5 expression changes in the transition from BO to OAC; however, no increase in expression was found (Figure 4A). An alternative mechanism might be the redistribution of KLF5 binding to different regulatory elements in OAC. We therefore hypothesised that KLF5 is active in both BO and OAC but may regulate specific genes in OAC by binding at different loci.

Figure 4. KLF5 binds to distinct regions in OE19 cells.

(A) Expression Log2(1+FPKM) of KLF5 in BO and OAC tissue. (B) Heatmap of KLF5 ChIP-seq signal at regions (peak centre ±5 kb) significantly bound in OE19 only (+2x; Q-value <0.05), shared regions (no significant change) and regions bound in CP-A only (−2x; Q-value <0.05). (C) Tag density plot of KLF5 ChIP-seq signal at regions (peak centre ±2.5 kb) bound in OE19 only, shared regions and regions bound in CP-A only. (D) Genome browser tracks showing KLF5 ChIP-seq and ATAC-seq in CP-A and OE19 cells at the CCNE1 locus. Differentially bound regions are highlighted in red. (E) Bar chart of percentage targets and percentage background of de novo discovered motifs at regions bound in OE19 only, shared regions and regions bound in CP-A only. De novo motifs, called transcription factor with match scores and P-values shown. (F) UPSET plot of DNA motifs found in 371 OE19-specific KLF5 binding regions that are located within loci (±250 kb) containing genes upregulated in OAC and downregulated with KLF5 depletion. The motifs identified in E (KLF5, GATA1, FOXA2, FRA1 and TCF7L2) found within each peak are shown. (G) Venn diagram showing the overlap in genes downregulated in OE19 cells following treatment with siRNAs targeting KLF5 and GATA6. (H) Heatmap of ATAC-seq signal at the KLF5 binding regions from (F) in the indicated cell lines (left) or patient derived tissue (right). Regions were subject to k-means hierarchical clustering (k = 2). See also Figure 4—figure supplements 1 and 2.

Figure 4—figure supplement 1. KLF5 ChIP-seq analysis in CP-A cells.

(A) Immunoblot of CP-A and OE19 protein lysate probed with antibodies against KLF5 and ERK1/2. (B) Immunoblot of KLF5 from CP-A cells following ChIP in different fractions; IgG FT (flow through from non-specific IgG precipitation), KLF5 FT (flow through from KLF5 precipitation), IgG elute (material eluted following IgG precipitation) KLF5 IP (material eluted following KLF5 immunoprecipitation). Arrows indicating immunoprecipitated protein (KLF5) and asterisks are IgG heavy chain. (C) Scatter plots of KLF5 ChIP-seq data from CP-A from two biological replicates. Pearson correlation coefficient shown. (D) Venn diagrams showing overlap of peaks called by each replicate for KLF5 ChIP-seq from CP-A cells. The numbers of peaks in each sector are given. (E) Bar chart of percentage targets and percentage background of de novo discovered motifs at CP-A KLF5 ChIP-seq peaks. De novo motif, called transcription factor with motif match score (in brackets) and p-value shown. (F) Example UCSC genome browser tracks of KLF5 ChIP-seq and ATAC-seq data from CP-A and OE19 cells at the NKX3-1 (left) and JAG1 loci (right). CP-A-specific peaks are highlighted in blue and shared peaks are highlighted in grey.
Figure 4—figure supplement 2. Integrative analysis of KLF5 ChIP-seq data in OE19 and CP-A cells.

(A) Tag density plots of normalised ATAC-seq signal at OE19 specific KLF5 ChIP-seq regions from CP-A (blue) and OE19 (red) cells (top) and merged BO (blue) and merged OAC (red) tissue (bottom). (B) Top GO terms of nearest genes associated with OE19-specific (top) or CP-A-specific (bottom) KLF5 peaks which show either increased (top) or decreased (bottom) expression in OAC. (C) Euler diagram of 299 OE19-specific KLF5-binding regions that are located within loci (±250 kb) containing genes upregulated in OAC and downregulated with KLF5 depletion. The motifs identified in Figure 4E (KLF5, GATA1, FOXA2, FRA1 and TCF7L2) found within each peak are shown (note that an additional 71 regions cannot be depicted due to the small numbers involved). The regions in the white circle contain none of the indicated motifs. (D) Heatmap of KLF5, GATA6 and HNF4A ChIP-seq signal from OE19 cells at OE19 specific KLF5 ChIP-seq peaks nearby KLF5-activated target genes. (E) RT-qPCR analysis of the indicated genes following treatment of CP-A cells with non-targeting (NT) or KLF5 siRNAs for 72 hr (n = 3; P-values **=<0.01; ns = not significant). (F) Tag density plots of normalised ATAC-seq signal at cluster 1 (left) and cluster 2 (right) regions (see Figure 4G) from CP-A (blue) and OE19 (red) cells (top) and merged BO (blue) and merged OAC (red) samples (bottom). (G) Bar chart of percentage targets and percentage background of de novo discovered motifs in cluster 1 (top) and cluster 2 (bottom) regions. De novo motif, called transcription factor with motif match score and p-value are shown.

CP-A cells are derived from non-dysplastic BO and do not exhibit strong tumourigenic properties (Lin et al., 2012), so we compared KLF5 expression in BO-derived CP-A and OAC-derived OE19 cells and found that KLF5 is expressed at similar levels in both (Figure 4—figure supplement 1A). We therefore used these cell lines to model KLF5 activity in BO and OAC. To gain a more comprehensive view of KLF5 function, we performed ChIP-seq for KLF5 in CP-A cells and used spike-in normalisation to better assess differential binding relative to OE19 cells. Anti-KLF5 antibodies precipitated KLF5 in CP-A cells (Figure 4—figure supplement 1B) and biological replicates were highly reproducible (Figure 4—figure supplement 1C). We took the overlap of peaks between biological replicates forward for downstream analyses, resulting in 13,526 peaks (Figure 4—figure supplement 1D; Supplementary file 5). Motif analysis showed high enrichment of the KLF5 motif, further demonstrating the quality of the data (Figure 4—figure supplement 1E). KLF5 peaks from CP-A and OE19 cells were merged, generating a combined peak set of 21,353 peaks. Differential binding analysis revealed an altered KLF5 binding profile between CP-A and OE19 cells (Figure 4B,C; Supplementary file 5): 8608 peaks show more binding in OE19 cells (40%), 9230 peaks are shared between the two cell lines (44%) and 3515 show more binding in CP-A cells (16%). An example locus is CCNE1, which demonstrates increased KLF5 binding in OE19 cells at the promoter and putative enhancers associated with open chromatin regions (Figure 4D). Reciprocally, specific binding of KLF5 in CP-A cells is evident at the NKX3-1 locus and common binding of KLF5 in both CP-A and OE19 cells is evident at the JAG1 locus (Figure 4—figure supplement 1F).
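
Taking the overlap of peaks between two biological replicates can be sketched as below. This is an assumption-laden illustration rather than the procedure used in the study: the function name is hypothetical, intervals are treated as half-open, and a replicate 1 peak is retained whenever it shares at least one base with any replicate 2 peak.

```python
def overlapping_peaks(rep1, rep2):
    """Return rep1 peaks (chrom, start, end) that overlap at least one rep2 peak."""
    by_chrom = {}
    for chrom, start, end in rep2:
        by_chrom.setdefault(chrom, []).append((start, end))
    shared = []
    for chrom, start, end in rep1:
        # Half-open intervals overlap when each starts before the other ends.
        if any(start < e and s < end for s, e in by_chrom.get(chrom, [])):
            shared.append((chrom, start, end))
    return shared

# Hypothetical peak coordinates for illustration only.
rep1_peaks = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 100, 200)]
rep2_peaks = [("chr1", 150, 250), ("chr2", 300, 400)]
reproducible = overlapping_peaks(rep1_peaks, rep2_peaks)
print(reproducible)
```

Stricter definitions, such as requiring a minimum fraction of reciprocal overlap, would give a smaller but more conservative peak set.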

We next asked whether the regions that exhibit differential binding are enriched for specific transcription factor motifs. Regions that are bound by KLF5 in OE19 cells are enriched in motifs for KLF, GATA, Forkhead, AP-1 and TCF transcription factors, whereas regions bound by KLF5 in CP-A cells are enriched for a different set of motifs, with motifs for the TEAD, RUNX and p53 transcription factor families detected in addition to KLF and AP-1 motifs (Figure 4E; Supplementary file 6). These results are in keeping with our previous work showing AP-1 and GATA6 functionality in OAC (Britton et al., 2017; Rogerson et al., 2019). Regions specifically bound by KLF5 in OAC cells also exhibited increased accessibility in OE19 cells and, importantly, accessibility is also elevated around these binding sites in OAC tissue (Figure 4—figure supplement 2A). These findings are therefore consistent with a broad role of KLF5 in OAC.

To further probe the potential biological significance of the differentially bound KLF5 regions, we associated these with the nearest gene and determined the enriched GO terms for genes associated with cell-type-specific KLF5 peaks that also show preferential expression in BO or OAC. OE19-specific KLF5-binding events are associated with genes involved in ‘cell division’ control, whereas CP-A-specific KLF5 binding is associated with ‘epithelial cell differentiation’ (Figure 4—figure supplement 2B). The latter observation is consistent with the potential loss of cell identity in OAC. However, since oncogenic events during the progression from BO to OAC are poorly understood, we decided to focus on regions that acquire KLF5 binding in OE19 cells. To relate specific KLF5-binding events to gene expression changes, we took the set of 97 genes that are upregulated in OAC and downregulated by KLF5 depletion (i.e. activated by KLF5; Figure 3E) and found that there are 371 OE19-specific KLF5-binding peaks within a 0.5 Mb locus centred on the TSS. To further explore how KLF5 activates these genes in OAC, we assessed the transcription factor binding motif distribution (identified in Figure 4E) within this set of OE19-specific KLF5 peaks. We detected KLF5 binding motifs in 257/371 of these regions, and strikingly, 56% (145/257) of the peaks also house a mixture of FOXA, AP-1, GATA and TCF motifs, in addition to the KLF motif (Figure 4F; Figure 4—figure supplement 2C; Supplementary file 7). This suggests that KLF5 functions in a combinatorial manner with these other transcription factors to activate gene transcription during progression from BO to OAC. However, a large portion of these peaks (44%) contain only a KLF motif suggesting a more independent role for KLF5 in these regions (Figure 4F). 
To test whether these motif enrichments reflect transcription factor binding, we integrated our ChIP-seq data of transcription factors active in OAC (GATA6 and HNF4A; Rogerson et al., 2019), with KLF5-binding data. Since only GATA motifs are enriched in these regions we would expect co-binding with GATA6 and not HNF4A. We therefore compared ChIP-seq profiles for these transcription factors, and see extensive co-binding of GATA6 at these sites but no evidence of co-binding with HNF4A (Figure 4—figure supplement 2D). Finally, the predicted target gene co-regulation by KLF5 and GATA6 was validated by depletion of each factor in OE19 cells, which leads to a large significant overlap in downregulated genes (Figure 4G). However, this co-regulated gene set contains only two of the cell cycle associated genes regulated by KLF5, suggesting that this combination of transcription factors is not directly involved in controlling this process. Turning back to the cell cycle genes directly activated in OAC through OAC-specific binding of KLF5, we tested whether KLF5 is relevant for their expression in CP-A cells. As expected from the lower KLF5-binding levels in these cells, depletion of KLF5 had little effect on these genes (Figure 4—figure supplement 2E), consistent with a newly acquired function in OAC cells.
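The association of OE19-specific KLF5 peaks with KLF5-activated genes described above (peaks lying within a 0.5 Mb locus centred on the TSS, i.e. ±250 kb) can be illustrated with a minimal sketch. The class and function names, the toy coordinates and the choice to test the peak centre (rather than any overlap with the window) are assumptions made for illustration and are not taken from the analysis pipeline used in this study.

```python
from dataclasses import dataclass

WINDOW = 250_000  # +/- 250 kb either side of the TSS, i.e. a 0.5 Mb locus


@dataclass(frozen=True)
class Peak:
    chrom: str
    start: int
    end: int

    @property
    def centre(self) -> int:
        # Assumption: a peak is assigned by its midpoint, not by any overlap.
        return (self.start + self.end) // 2


def peaks_near_tss(peaks, tss_by_gene, window=WINDOW):
    """Map each gene to the peaks whose centre lies within `window` bp of its TSS.

    tss_by_gene: {gene_name: (chromosome, tss_position)}
    """
    hits = {}
    for gene, (chrom, tss) in tss_by_gene.items():
        hits[gene] = [
            p for p in peaks
            if p.chrom == chrom and abs(p.centre - tss) <= window
        ]
    return hits


# Toy example with hypothetical coordinates (not real loci).
example_peaks = [
    Peak("chr1", 1_000, 1_500),      # ~99 kb from the TSS: kept
    Peak("chr1", 400_000, 400_500),  # ~300 kb from the TSS: excluded
    Peak("chr2", 1_000, 1_500),      # wrong chromosome: excluded
]
example_tss = {"GENE_A": ("chr1", 100_000)}
assigned = peaks_near_tss(example_peaks, example_tss)
```

Because a peak can fall within the window of more than one gene, this scheme allows one peak to be assigned to several genes, which differs from a nearest-gene assignment.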

To establish whether the 371 KLF5-bound regions that are associated with KLF5-sensitive genes are relevant to OAC, we turned back to our ATAC-seq data and clustered the data to reveal two clusters. One set of regions is already partially open in CP-A cells and increases in accessibility in OE19 cells (cluster 1), and another set is closed in CP-A cells and becomes more accessible in OE19 cells (cluster 2) (Figure 4H, Figure 4—figure supplement 2F, left). Importantly, the same pattern of accessibility is evident using ATAC-seq signal from BO and OAC tissue (Figure 4H, Figure 4—figure supplement 2F, right). To identify any potential differences between these clusters, we performed motif analysis (Figure 4—figure supplement 2G; Supplementary file 8). The most common motifs in both clusters were KLF motifs, and the most striking difference is the large proportion of AP-1 motifs specifically associated with cluster 1, suggesting a potential role for AP-1 in priming binding of KLF5 to these regions.

Together, these results indicate that the DNA-binding profile of KLF5 differs between BO and OAC cells, and that this altered binding is associated with chromatin opening. The altered binding profile in OAC reflects a direct role for KLF5 in controlling genes involved in the cell cycle.

KLF5 converges with ERBB2 on cell cycle gene regulation and controls cell proliferation in OAC

Our results indicate a role of KLF5 in controlling increased cell cycle gene expression in OAC; however, it is unclear how this relates to genetic events that potentially impact on the same process. Genomic amplifications in signalling receptors, such as ERBB2, are common in OAC (32% of OACs have an ERBB2 amplification; Cancer Genome Atlas Research Network et al., 2017) and occur during the transition from BO to OAC (Stachler et al., 2015). As the ERK pathway is implicated in promoting cell proliferation and is controlled by ERBB2, we investigated whether ERBB2 signalling impacts on KLF5-mediated gene regulatory events. First, we sought evidence for a link with transcription factor activity, and performed ATAC-seq on OE19 cells to investigate whether depletion of ERBB2 could alter chromatin accessibility. OE19 cells contain an amplification of the ERBB2 locus (Dahlberg et al., 2004) and are dependent on ERBB2 for their proliferation (Hong et al., 2012). ERBB2 levels were efficiently reduced after 72 hr of siRNA treatment and phosphorylation of downstream targets (ERK and AKT) was reduced (Figure 5—figure supplement 1A). ATAC-seq data were reproducible and of good quality (Figure 5—figure supplement 1B and C). We performed differential accessibility analysis, which identified 717 regions with decreased chromatin accessibility and 733 regions with increased accessibility (Figure 5A; Supplementary file 9). De novo motif analysis of the regions that exhibit reduced chromatin accessibility following ERBB2 depletion revealed that the majority contain AP-1-binding motifs, as expected from the established connections between ERK pathway signalling and AP-1 transcription factors. However, the binding motif for KLF transcription factors was also detected, albeit in a subset of the regions (Figure 5B; Supplementary file 10).
We then used our KLF5 ChIP-seq dataset from OE19 cells to validate KLF5 binding at regions with reduced chromatin accessibility following ERBB2 depletion (Figure 5—figure supplement 1D). These regions are relevant in the context of OAC as they also show increased chromatin accessibility in OAC tissue compared to BO (Figure 5C). The convergence of ERBB2 signalling on KLF5 transcription factor activity suggested that they might also converge on the same genes. We therefore carried out RNA-seq in OE19 cells treated with siRNA against ERBB2. The RNA-seq data were highly reproducible (Figure 5—figure supplement 1E) and identified 778 downregulated and 664 upregulated genes (two-fold change; FDR < 0.05, FPKM > 1) (Figure 5—figure supplement 1F). There is a large, statistically significant overlap between directly activated KLF5 target genes and genes downregulated by ERBB2 depletion. Moreover, a closer comparison reveals that the expression of the majority of the directly activated KLF5 target genes was reduced upon ERBB2 knockdown (Figure 5D). Most of these common target genes are cell cycle related. These results therefore indicate that ERBB2 and KLF5 converge on a similar set of regulatory regions to drive the expression of cell cycle regulatory genes. To establish whether ERBB2 can redistribute KLF5 binding and activate its target genes, we created BO-derived CP-A cell lines that stably overexpress ERBB2 to mimic the effect of amplification seen in OAC. These cells exhibit high levels of ERBB2 expression, maintain ERK and AKT activation in serum-starved conditions (Figure 5—figure supplement 2A), and exhibit growth factor-independent proliferation (Figure 5—figure supplement 2B). Several cell cycle related genes that are activated by KLF5 in OAC cells are also activated by ERBB2 overexpression in BO cells (Figure 5—figure supplement 2C).
However, we were unable to detect any increases in KLF5 occupancy at a panel of KLF5-binding regions associated with cell cycle genes (Figure 5—figure supplement 2D). These findings therefore reaffirm the convergence of ERBB2 signalling and KLF5 on the activation of a cell cycle gene signature but ERBB2 is not sufficient to trigger KLF5 redistribution.

Figure 5. KLF5 controls cell proliferation in OAC.

(A) Heatmap of ATAC-seq signal at regions (peak centre ±2.5 kb) with significantly differential accessibility (±2 x; Q < 0.05) in OE19 cells treated with siERBB2. (B) Bar chart of percentage targets and percentage background of de novo discovered motifs at regions closed with siERBB2 treatment in OE19 cells. De novo motifs, called transcription factor with motif match scores (brackets) and p-values are shown. (C) Tag density plot of ATAC-seq signal from BO and OAC tissue at regions that demonstrate KLF5 binding in OE19 cells and reduced chromatin accessibility in OE19 cells upon siERBB2 treatment. (D) Heatmap of z-score of expression of KLF5 direct target genes in OE19 cells treated with either siNT or siERBB2. Cell cycle related genes are indicated with a black bar. (E) Bar chart showing the % relative growth of OE19 cells treated with either siNT or siKLF5. p-Value is shown (n = 3). (F) Bar chart showing the % relative growth of OE19-dCas9-KRAB cells treated with either non-targeting guides or guides targeting the KLF5 TSS. p-Value is shown (n = 3). (G) Bar chart showing the % relative growth of OE19 cells treated with either siNT or siRNA against the indicated target genes. p-Values are shown (n = 3). (H) Kaplan-Meier curves of patient survival across 24 months for high (above median; red) or low (below median; blue) expression of the 97 KLF5 target genes (left), KLF5 (middle) or CLSPN (right). p-Values are shown. (I) Expression levels (FPKM) of CLSPN expression in BO and OAC patient samples. (J) Model of KLF5 action in BO and OAC. KLF5 binds chromatin in BO and is re-purposed in OAC to bind and regulate cell-cycle-related genes. See also Figure 5—figure supplements 1, 2 and 3.

Figure 5—figure supplement 1. ERBB2 and KLF5 regulate an overlapping set of genes.

(A) Immunoblot of OE19 protein lysate from cells treated with siERBB2 for up to 4 days or siNT probed with antibodies against ERBB2, phosphorylated AKT (pAKT), AKT, phosphorylated ERK (pERK) and ERK. (B) Correlation plots of replicates of ATAC-seq data from OE19 cells treated with either siNT or siERBB2 for 72 hr. Pearson correlations are shown. (C) Fragment length plots of replicates of ATAC-seq data from OE19 cells treated with either siNT or siERBB2. (D) Pie chart showing the percentage of ATAC-seq peaks showing a reduction of chromatin accessibility upon siERBB2 treatment that overlap with a KLF5 ChIP-seq peak in OE19 cells. Percentages and P-value (Fisher exact) for the overlap are shown. (E) Correlation plots of replicates of RNA-seq data from OE19 cells treated with either siNT or siERBB2 for 72 hr. Pearson correlations are shown. (F) Venn diagrams showing the overlap between the 97 direct KLF5-activated genes and genes either down- (left) or up- (right) regulated following siERBB2 treatment of OE19 cells (two fold change, FDR < 0.05, FPKM > 1). p-Values are hypergeometric tests.
Figure 5—figure supplement 2. ERBB2 overexpression drives growth factor independent proliferation and gene expression in BO-derived CP-A cells.

(A) Western blots of the indicated proteins or phosphorylated (p) sites in CP-A cells stably transfected with empty or ERBB2 expressing vectors grown in serum free medium for 48 hr (lanes 1 and 2) or serum-free media and stimulated with complete media for 15 min (lanes 3 and 4). (B) Relative growth of CP-A cells stably transfected with empty or ERBB2 expressing vectors grown in serum-free media for 6 days (n = 3; ** = p value<0.01, two-way ANOVA). (C) RT-qPCR analysis of the indicated genes (normalised to GAPDH) in serum starved CP-A cells stably transfected with empty or ERBB2 expressing vectors (n = 3; p-values, *=<0.05, **=<0.01). (D) ChIP-qPCR of KLF5 binding to regulatory regions associated with the indicated genes in CP-A cells stably transfected with empty or ERBB2 expressing vectors grown in serum-free media for 48 hr. Non-specific IgG is provided as a control (n = 3; ns = non-significant increases).
Figure 5—figure supplement 3. KLF5 drives cell cycle progression in OE19 cells.

(A) Immunoblot of OE19 cell lysate from cells treated with either siNT or siKLF5 for 6 days and probed with antibodies against KLF5 and Tubulin. (B) Immunoblot of lysate from OE19-dCas9-KRAB cells treated with a non-targeting guide or a pool of three guides targeting the KLF5 TSS and probed with antibodies against KLF5 and Tubulin. (C) RT-qPCR of RNA from OE19 cells treated with either siNT, siCCNE1, siCDC25B, siKIF14, siCLSPN or siNR4A1. Expression of each knocked-down gene was measured relative to GAPDH and p-values are shown (n = 3). (D) Bar chart of the percentages of OE19 cells in G1, S and G2/M phase of the cell cycle following treatment with either siNT, siCCNE1, siCDC25B, siKIF14, siCLSPN or siNR4A1 for 6 days. P-values were calculated by comparing each knockdown to siNT and are shown (n = 3).

Finally, we assessed whether defective KLF5-driven cell cycle gene regulation led to proliferative defects in OAC cells. We first depleted KLF5 in OE19 cells using siRNA which resulted in the reduction of KLF5 protein (Figure 5—figure supplement 3A), and the growth of cells was significantly impeded after siKLF5 treatment (Figure 5E). Second, we validated this growth defect by using CRISPR interference technology. Stable transfection of dCas9-KRAB and subsequent transfection of sgRNAs targeting the promoter of KLF5 (sgKLF5) into OE19 cells resulted in the reduction of KLF5 protein levels (Figure 5—figure supplement 3B). CRISPRi knockdown of KLF5 also significantly reduced the growth of OE19 cells (Figure 5F), mirroring the result with siKLF5. We further explored the role of KLF5 in cell growth and cell cycle progression by performing similar assays while perturbing KLF5 target genes (CCNE1, CDC25B, KIF14, CLSPN and NR4A1). All these genes showed significant reductions in expression upon siRNA treatment (Figure 5—figure supplement 3C). The growth of OE19 cells was significantly reduced with the treatment of siRNA against CCNE1, KIF14 and CLSPN (Figure 5G). Knockdown of these genes also significantly altered cell cycle patterns, particularly knockdown of CLSPN which induced a prominent S-phase block (Figure 5—figure supplement 3D). These results provide more evidence for the role of KLF5 in the growth of cells and highlight the role of KLF5 target genes in this phenotype.

To assess whether the expression of KLF5 and its target genes has any clinical relevance, we sourced OAC expression and survival data (Cancer Genome Atlas Research Network et al., 2017) and plotted the survival of patients with high and low expression (above or below the median) of KLF5 itself and of KLF5 target genes up to 24 months (Figure 5H). Patients with higher expression of KLF5 showed no difference in survival, whereas patients with high target gene expression exhibited a significantly lower survival rate compared to those with low expression. This result is in keeping with the hypothesis that it is the activation of KLF5 target genes by its redistribution across chromatin, rather than its expression level, that is important. It is noteworthy that CLSPN expression alone is predictive of patient survival (Figure 5H) and its enhanced expression in OAC compared to BO makes this a useful potential biomarker (Figure 5I).
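The median split and survival curves described above can be sketched as follows. This is an illustrative implementation of the standard Kaplan-Meier product-limit estimator, not the code used to generate Figure 5H; the function names, the toy follow-up times and the handling of patients exactly at the median (placed in the low group) are assumptions.

```python
from statistics import median


def median_split(expression):
    """Split patients into (high, low) groups at the median expression value.

    expression: {patient_id: expression_value}
    Assumption: patients exactly at the median are placed in the low group.
    """
    cut = median(expression.values())
    high = [p for p, v in expression.items() if v > cut]
    low = [p for p, v in expression.items() if v <= cut]
    return high, low


def kaplan_meier(times, events, horizon=24):
    """Kaplan-Meier survival estimate up to `horizon` months.

    times:  follow-up time in months for each patient
    events: True if death was observed at that time, False if censored
    Returns a list of (time, survival_probability) at each observed death.
    """
    curve = []
    survival = 1.0
    at_risk = len(times)
    for t in sorted(set(times)):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e)
        leaving = sum(1 for ti in times if ti == t)  # deaths plus censored
        if t <= horizon and deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Toy data (hypothetical): five patients, one censored at 10 months.
toy_curve = kaplan_meier([5, 10, 10, 20, 30], [True, True, False, True, True])
```

In practice each group returned by `median_split` would be passed separately to `kaplan_meier`, and the two curves compared with a suitable statistical test.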

Collectively, these results confirm the functional role of KLF5 in cell cycle control in OAC and its convergence of action with the ERBB2 signalling pathway. This is clinically important as patients with highly expressed KLF5 target genes have a worse prognosis than those without.

Discussion

Genome sequencing efforts of patients with BO and OAC have provided insights into the molecular causes of BO and OAC and show the mutational relationships between these disease states (Ross-Innes et al., 2015; Stachler et al., 2015). This has provided evidence for a model of OAC developing from BO. The molecular mechanisms involved in progression to OAC are poorly understood; however, BO offers a therapeutic window of opportunity to identify those more at risk of OAC development. In addition to genetic events, epigenetic changes and alterations to the chromatin landscape are also likely to play an important role in disease progression. Here, we demonstrate that there are marked changes in chromatin accessibility and associated gene expression, indicating active changes at the chromatin level during carcinogenesis. One of the major contributing factors to this change is the transcription factor KLF5. KLF5 is re-purposed in OAC cells and its chromatin-binding profile is massively rewired to drive increased expression of cell cycle associated genes (Figure 5J). Conversely, this rewiring results in the loss of KLF5 binding to many regulatory regions occupied in Barrett’s cells. This loss is potentially associated with the loss of cell identity, and may also contribute to the development of the cancer phenotype.

Cell cycle deregulation is one of the key hallmarks of cancer (Hanahan and Weinberg, 2011) and here we uncovered a cell cycle gene expression signature, comprised of genes that are overexpressed in OAC. Recent research identified the cell-cycle as a perturbed pathway in OAC and suggested the possibility of CDK4/6 inhibitors as a therapeutic treatment (Frankell et al., 2019; Mourikis et al., 2019). We have previously uncovered a deregulated FOXM1 regulatory network active in OAC, a key regulator of late cell cycle gene expression (Wiseman et al., 2015). By integrating ATAC-seq data to identify upstream regulators of this signature, we also uncovered AP-1 and KLF5 as putative transcription factors in this process. We have previously identified AP-1 as an important factor in OAC (Britton et al., 2017) and others have shown an increase in AP-1 family transcription factors between non-dysplastic BO and low-grade dysplastic BO (Maag et al., 2017). What is less clear is the role of KLF5 in the progression of BO to OAC. KLF5 has been shown to have a tumour promoting function in pancreatic (He et al., 2018) and basal-like breast cancer (Qin et al., 2015). KLF5 is also frequently amplified in gastric cancer (Chia et al., 2015; Zhang et al., 2018) and has recently been shown to regulate gene expression in OAC in combination with other transcription factors, GATA6, ELF3 and EHF (Chen et al., 2020). This was reinforced by a recent study that identified KLF5 as a master transcription factor on which OAC cell-lines were dependent (Reddy et al., 2019). Paradoxically, KLF5 has been shown to have a tumour suppressor role in oesophageal squamous cell carcinoma (Tarapore et al., 2013) and breast cancer (Chen et al., 2002). The expression of the related protein, KLF4, together with three other genes, was able to stratify OAC from BO, albeit KLF4 expression is reduced in progression from BO to OAC (Maag et al., 2017).

Previous studies have begun to suggest a role for KLF5 in cell cycle control. For example, KLF5 binds to a CCNE1 promoter proximal element in bladder cancer cells (Pattison et al., 2016) and KLF5 increases the expression of Ccnb1 and Mcm2 downstream of oncogenic Ras in fibroblasts (Nandan et al., 2005). Here, we provide evidence that KLF5 exhibits a widespread role, directly controlling cell proliferation through activation of cell cycle associated genes. We also show that reduction of KLF5 levels, or several of its target genes, in OAC cells impairs growth. Indeed, this is exemplified by CLSPN, which may have therapeutic potential as its gene product, Claspin, has recently been shown to have a broader role in cancer cell viability by protecting cancer cells from replication stress (Bianco et al., 2019). KLF5 directly binds and regulates core cell cycle genes, for example CDC25B, CCNE1 and MCM2, some of which encode cell cycle transcription factors, for example E2F1 and MYBL2, thus providing a mechanism for propagating its effects on cell cycle control. We also show KLF5 expression is almost unchanged between BO and OAC. By profiling KLF5 chromatin binding in BO and OAC cells, we have demonstrated an altered KLF5 binding profile. The regions bound by KLF5 specifically in OAC cells are enriched in motifs for several transcription factors, including the GATA family, which suggests a combinatorial regulatory code. This is in keeping with our finding that there is extensive overlap between the binding of KLF5 and GATA6, which is reinforced by recent studies that show that KLF5 binds with GATA6 in OAC (Chen et al., 2020) and gastric cancer (Chia et al., 2015).

The overlap in regulatory potential with GATA6 provides a plausible link to one of the major genetic events that drive the BO to OAC transition. Our work also suggests a link to another major pathway that is activated through gene amplification in OAC, the ERBB2-driven RAS-ERK pathway. Knockdown of ERBB2 reduced the expression of many KLF5 target genes and KLF5 motifs were found at regions with reduced chromatin accessibility upon ERBB2 knockdown. However, ERBB2 overexpression in BO cells is insufficient to trigger KLF5 redistribution, indicating that other pathways contribute to KLF5 redistribution in OAC, although this needs further investigation. Nevertheless, it is clear that ERBB2 signalling and KLF5 activity converge on the same cell cycle genes and both are required for their activation, indicating functional synergy. The signalling pathways are less clear in the context of BO, the precancerous precursor. We see enrichment of the TEAD motif only in CP-A cells and not OE19 cells, suggesting that KLF5 may be operating through the Hippo signalling pathway in BO. In other contexts, KLF5 has been shown to cooperate with TEAD transcription factors, downstream of YAP/TAZ (Wang et al., 2015) and KLF5 is stabilised by YAP in breast cancer cells (Zhi et al., 2012). Further work is needed to substantiate these links in BO.

In summary, we have used integrative analysis of RNA-seq and ATAC-seq from BO and OAC patient samples to uncover a cell cycle signature regulated by KLF5. Using a multi-omics approach, we found an oncogenic role of KLF5 in OAC, a transcription factor that has not been shown to be mutated, amplified and/or over-expressed in OAC. This study highlights the power of supplementing expression data with genome-wide chromatin profiling methods such as ATAC-seq. This provides molecular insights into the mechanisms by which BO progresses to OAC and identifies a signature of transcription factor gene targets that have potential prognostic significance and could be used as biomarkers in the clinic.

Materials and methods

Cell lines, cell culture conditions and patient biopsies

OE19 and CP-A cells were purchased from ATCC and tested negative for mycoplasma. OE19 cells were maintained in Gibco RPMI 1640 (ThermoFisher, 52400) supplemented with 10% Gibco fetal bovine serum (ThermoFisher, 10270) and 1% Gibco penicillin/streptomycin (ThermoFisher, 15140122). CP-A cells were cultured in keratinocyte serum free media (ThermoFisher, 17005042) supplemented with 5 μg/L EGF (ThermoFisher, 10450–013), 50 mg/L bovine pituitary extract (ThermoFisher, 13028014), 10% Gibco fetal bovine serum (ThermoFisher, 10270) and 1% Gibco penicillin/streptomycin (ThermoFisher, 15140122). Cell-lines were authenticated by STR profiling and routinely tested for mycoplasma.

OE19-dCas9-KRAB stable cells were generated by transfecting 1 × 10⁶ OE19 cells with 7.5 μg Cas9 plasmid with guides targeting the AAVS1 locus (Addgene #42230; 5’- GGGGCCACTAGGGACAGGAT-3’) and 7.5 μg donor plasmid (pAAVS1-Puro-DNR; Origene GE100024) containing doxycycline inducible dCas9-KRAB with Fugene HD (Promega, E2311), as per manufacturer’s instructions. After 7 hr, media was replaced and supplemented with 7.5 μM RS-1 (Sigma-Aldrich, R9782) and 1 μM SCR7 pyrazine (Sigma-Aldrich, SML1546), to promote homologous recombination and to inhibit non-homologous end joining respectively. Media was changed the next day and cells were selected with puromycin (0.75 μg/ml) for 14 days. Selected colonies were re-plated to grow single clones and clones screened for dCas9-KRAB protein expression by immunoblotting. OE19-dCas9-KRAB cells were cultured with 100 ng/mL doxycycline (Sigma-Aldrich, D3447) to induce dCas9-KRAB.

To create CP-A-ERBB2 (overexpressing ERBB2) and CP-A-empty (control) cells we first created the pHAGE-empty plasmid (pAS4940) by excising the ERBB2 coding sequence from pHAGE-ERBB2 (Addgene #116734) using XhoI (NEB, R0146S) followed by re-ligation of the vector. HEK293T cells were transfected with either pHAGE-ERBB2 or pHAGE-empty target plasmids, plus pMD2.G (Addgene, #12259) and psPAX2 (Addgene, #12260), using Polyfect (Qiagen, 301107). Viral particles were precipitated from media using PEG-it (System Biosciences, LV810A-1). CP-A-empty and CP-A-ERBB2 stable cells were generated by transducing 1 × 10⁶ CP-A cells with lentiviral particles containing either pHAGE-ERBB2 or pHAGE-empty using polybrene (EMD Millipore, TR-1003) at MOI of 1 for 24 hr. Transduced cells were grown for 2 days in full media before selection using puromycin (0.75 μg/ml) for 14 days.

Fresh frozen OAC 2 mm biopsies were obtained from consenting patients undergoing endoscopy. Tissue collection was approved by the ethics committee of Salford Royal NHS Foundation Trust (04/Q1410/57). Patient consent was obtained in written form and signed by the patient and doctor. Patient biological replicates are defined as separate patients, and cell-line biological replicates are defined as separate cell-line cultures, processed at the same time.

Protein extraction and immunoblotting

Cells were lysed directly in RIPA buffer and incubated on ice for 5 min. The lysate was then sonicated in a water bath sonicator (Diagenode Bioruptor) for 5 min, 30 s on/off and protein quantified using Pierce BCA Assay Kit (ThermoFisher, 23227). Lysates were supplemented with SDS-PAGE loading dye to a final concentration of 1x and boiled for 10 min. Equal amounts of protein were separated on a 10% polyacrylamide gel and transferred to a nitrocellulose membrane (GE life sciences, 1060002) using a Pierce Power Station (ThermoFisher). Membranes were blocked using Odyssey blocking buffer (Licor, 927–40000) and then incubated with antibodies against KLF5 (abcam, ab137676), Tubulin (Sigma-Aldrich, T9026), ERBB2 (ThermoFisher, MA5-14057), phospho-ERBB2 (Cell Signalling Technologies, 6942S), AKT (Cell Signalling Technologies, 2920S), phospho-AKT (Cell Signalling Technologies, 9106S), ERK1/2 (Cell Signalling Technologies, 4695S) or phospho-ERK1/2 (Cell Signalling Technologies, 9106S) overnight at 4°C. Membranes were incubated with IRDye secondary antibodies (Licor, 925–32212, 925–32213) and imaged using a Li-Cor Odyssey scanner.

RT-qPCR

RT-qPCR was carried out using QuantiTect SYBR Green RT-PCR Kit (Qiagen, 204243) using the primer pairs detailed in Supplementary file 11. Relative gene expression was calculated using the ΔΔCT method relative to levels of GAPDH mRNA.
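The ΔΔCT calculation referred to above can be written out as a short worked example. The function name and the CT values are hypothetical, and the sketch assumes 100% amplification efficiency (a doubling of product per cycle), which is the usual simplification behind the 2^(-ΔΔCT) formula.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Relative expression by the delta-delta-CT method.

    The reference gene (GAPDH in this study) normalises for input RNA;
    the control sample (e.g. siNT) sets the baseline of 1.0.
    Assumes perfect amplification efficiency (product doubles each cycle).
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_treated - delta_control
    return 2 ** (-delta_delta_ct)


# Hypothetical CT values: the target crosses threshold two cycles later
# after knockdown while the reference gene is unchanged.
fold = relative_expression(
    ct_target_treated=26.0, ct_ref_treated=18.0,
    ct_target_control=24.0, ct_ref_control=18.0,
)
```

With these illustrative numbers ΔΔCT is 2, so the target is expressed at one quarter of the control level.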

siRNA and sgRNA transfection

200,000 cells were plated on a six-well plate and incubated for 24 hr. 100 pmol of either control non-targeting (siNT; Dharmacon, D-001810-10-0020), siKLF5 (Dharmacon, L-013571-00-0005), or siERBB2 (Dharmacon, L-003126-00-0005) SMARTpool siRNA was transfected per well using Lipofectamine RNAiMAX (Thermofisher, 13778150) as per the manufacturer’s instructions and incubated for 72 hr. Modified full length sgRNAs were designed using E-CRISP (Heigwer et al., 2014; available at http://www.e-crisp.org/E-CRISP) using the KLF5 TSS (±200 bp) as input and obtained from Synthego. 9 pmol of either control non-targeting sgRNA (5’- GUAAGGCUAUGAAGAGAUAC-3’) or sgKLF5 pool (5’-GUGCGCUCGCGGUUCUCUCG-3’; 5’- AGGACGUUGGCGUUUACGUG-3’; 5’- GCGUCAAGUGUCAGUAGUCG-3’) was transfected per well using Lipofectamine RNAiMAX (Thermofisher, 13778150). Media was changed after 72 hr for longer treatments.

RNA extraction, RNA-seq processing and analysis

RNA was extracted from cells using an RNeasy RNA extraction Kit (Qiagen, 74136) and quality checked using Nanodrop 1000 (ThermoFisher). Paired-end RNA-seq libraries were generated using TruSeq stranded mRNA library kit (Illumina) and sequenced on a HiSeq 4000 platform (Illumina) by the University of Manchester Genomic Technologies Core Facility. Reads were trimmed using Trimmomatic v0.32 (Bolger et al., 2014), quality checked using FastQC (available at: http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc) and aligned to RefSeq transcript annotation of GRCh37 (hg19) using STAR (Dobin et al., 2013). Reads aligned to chromosomes 1–22 and chromosome X were retained. The Cufflinks package v2.2.1 (Trapnell et al., 2012) was used to calculate gene expression levels using Cuffnorm, and to analyse differential gene expression using Cuffdiff. Default parameters were used in both instances. Significant gene expression changes were defined by a fold change of ±1.3 and a Q-value of <0.05. For ERBB2 knockdown experiments, counts for genes were determined using featureCounts (Liao et al., 2014). Log2 transformed counts were obtained using the DESeq2 variance stabilising transformation (VST) function.
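The significance thresholds stated above can be expressed as a small filter. This sketch is illustrative rather than a reproduction of the Cuffdiff workflow: the function name is hypothetical, and it assumes that a fold change of ±1.3 means a linear ratio of at least 1.3 for upregulation and at most 1/1.3 for downregulation.

```python
import math


def classify_gene(fold_change, q_value, fc_cutoff=1.3, q_cutoff=0.05):
    """Label a gene 'up', 'down' or 'ns' (not significant).

    fold_change: linear expression ratio (condition / control), must be > 0
    q_value:     multiple-testing corrected p-value
    Assumption: the cutoff is applied symmetrically on the log2 scale.
    """
    if q_value >= q_cutoff or fold_change <= 0:
        return "ns"
    log_fc = math.log2(fold_change)
    limit = math.log2(fc_cutoff)
    if log_fc >= limit:
        return "up"
    if log_fc <= -limit:
        return "down"
    return "ns"


# Hypothetical values for four genes: (fold change, Q-value).
calls = [classify_gene(fc, q) for fc, q in
         [(1.5, 0.01), (0.5, 0.01), (1.2, 0.01), (2.0, 0.20)]]
```

Keeping the cutoffs as parameters allows the same function to be reused with the two-fold thresholds quoted for other comparisons in this study.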

Crystal violet assay

200,000 cells were plated on a six-well plate and siRNA/sgRNA treatment started after 24 hr incubation. At specific time-points after treatment plates were washed with PBS and fixed with 4% paraformaldehyde for 10 min. Plates were then washed twice with PBS and kept at 4°C. Cells were then stained by first incubating plates at room temperature for 10 min in 0.1% Triton X-100 with gentle shaking and then incubated at room temperature for 30 min in 0.1% crystal violet (Sigma-Aldrich, HT90132) with gentle shaking. Plates were extensively washed with water multiple times and left to dry. The dye was solubilised with 10% acetic acid for 10 min with gentle shaking and absorbance was read at 590 nm. Values for siNT at each time-point were used as 100% growth.
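The normalisation described above, in which siNT at each time-point is set to 100% growth, can be sketched as follows. The function name, the data layout and the absorbance readings are hypothetical; the sketch also assumes that replicate wells are averaged before normalisation and that no blank subtraction is applied.

```python
def relative_growth(absorbance, control="siNT"):
    """Express A590 readings as % growth relative to the control.

    absorbance: {time_point: {treatment: [replicate A590 readings]}}
    The control mean at each time-point is defined as 100% growth.
    """
    result = {}
    for time_point, by_treatment in absorbance.items():
        control_values = by_treatment[control]
        control_mean = sum(control_values) / len(control_values)
        result[time_point] = {
            treatment: 100 * (sum(values) / len(values)) / control_mean
            for treatment, values in by_treatment.items()
        }
    return result


# Hypothetical readings for a single time-point.
readings = {"day 3": {"siNT": [1.0, 1.0], "siKLF5": [0.5, 0.5]}}
growth = relative_growth(readings)
```

Normalising within each time-point means the values report growth relative to the control on that day, not absolute growth over time.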

Propidium iodide staining assay

Cells were trypsinised and collected as a single-cell suspension, washed with cold PBS, then fixed in 70% ethanol and stored at −20°C for at least 2 hr. Cells were then resuspended in staining solution (50 μg/mL propidium iodide (Sigma, P4170), 100 μg/mL RNase (Sigma, R4642)) and incubated at room temperature for 30 min. Cells were analysed by the University of Manchester Flow Cytometry Core Facility on a LSRFortessa. Percentages of cells in each cell cycle phase were calculated using ModFit LT (http://www.vsh.com/products/mflt/).

ATAC-seq, processing and analysis

Patient samples were processed as previously described (Britton et al., 2017) and omni-ATAC-seq was performed as previously described (Corces et al., 2017). ATAC-seq libraries (~8 per lane) were sequenced on a HiSeq 4000 platform (Illumina) by the University of Manchester Genomic Technologies Core Facility. Reads were quality checked using FastQC (available at: http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc) and aligned to GRCh37 (hg19) using Bowtie2 v2.3.0 (Langmead and Salzberg, 2012) with the following options: -X 2000 --dovetail. Unique reads (>q30) aligned to chromosomes 1–22 and chromosome X were retained. Peaks were called using MACS2 v2.1.1 (Zhang et al., 2008) with the following parameters: -q 0.01 --nomodel --shift -75 --extsize 150 -B --SPMR. Peaks called from individual samples were merged using mergePeaks.pl (using d = 250 parameter) from the HOMER package v4.9 (Heinz et al., 2010) and resized to peak summit ±250 bp to generate a peak set on which to perform differential accessibility analyses. Amplifications in patient biopsies were removed as described previously (Denny et al., 2016) using a fold change of 16. bedGraph files were converted into BigWig files using bedGraphToBigWig and visualised in the UCSC Genome Browser (Kent et al., 2002).
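Resizing a peak to its summit ±250 bp can be sketched as below. The sketch assumes BED-style 0-based, half-open coordinates and clips intervals at the chromosome start; the function name and the example peaks are hypothetical and this is not the HOMER implementation.

```python
def resize_to_summit(chrom, summit, flank=250, chrom_size=None):
    """Return a (chrom, start, end) interval centred on the summit.

    Assumptions: coordinates are 0-based and half-open, so an unclipped
    interval spans 2 * flank bases; the start is clipped at 0 and the end
    at chrom_size when a chromosome size is supplied.
    """
    start = max(0, summit - flank)
    end = summit + flank
    if chrom_size is not None:
        end = min(end, chrom_size)
    return (chrom, start, end)


# Hypothetical peak summits
summits = [("chr1", 1000), ("chr2", 100)]
resized = [resize_to_summit(chrom, pos) for chrom, pos in summits]
print(resized)
```

Fixed-width regions give every peak the same length, which keeps read counts comparable across regions in the downstream differential analysis.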

For comparing BO and OAC ATAC-seq, the Cufflinks package v2.2.1 (Trapnell et al., 2012) was used to calculate chromatin accessibility levels using Cuffnorm, and differential chromatin accessibility was analysed using Cuffdiff. Default parameters were used in both instances. Significant chromatin accessibility changes were defined as a fold change of ±2 and a Q-value of <0.1.

For ERBB2 knockdown experiments, differential accessibility was calculated using DESeq2 (Love et al., 2014). Alignment files of biological repeats were combined, peaks were recalled, and peaks from both conditions were then merged using mergePeaks.pl (using d = 250 parameter) from the HOMER package v4.9 (Heinz et al., 2010) and resized to peak summit ±250 bp. featureCounts from the SUBread package (Liao et al., 2014) was used to count reads within peaks from ATAC-seq samples and these were used as input for DESeq2 to calculate differential accessibility using default settings. A linear fold change of ±2 and a Q-value of <0.05 were used as a cut-off for further analyses.

ATAC-seq data visualisation

ATAC-seq fragment size was visualised using a custom python script. Correlation plots between technical replicates were visualised using multiBamSummary and plotCorrelation from the deepTools package (Ramírez et al., 2016). Tag density plots and heatmaps were also generated using computeMatrix and plotProfile or plotHeatmap tools from the deepTools package. ATAC-seq counts were also visualised using Morpheus (https://software.broadinstitute.org/morpheus/) and hierarchical clustering was performed with this software using 1-Pearson’s correlation unless otherwise stated. Correlation plots of samples were visualised using the similarity matrix tool from Morpheus.
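A fragment size distribution of the kind described above can be tabulated from the template length field of SAM records. The following is a minimal sketch and not the authors' script (provided as Source code 1): it assumes paired-end alignments in SAM text format, counts each pair once through its positive TLEN value (column 9), and uses hypothetical example records.

```python
from collections import Counter


def fragment_sizes(sam_lines, max_size=2000):
    """Count fragment lengths from SAM records.

    Assumptions: TLEN is column 9; only positive values are counted so
    that each read pair contributes once; fragments longer than max_size
    are ignored (2000 mirrors the -X 2000 alignment option).
    """
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        tlen = int(fields[8])
        if 0 < tlen <= max_size:
            counts[tlen] += 1
    return counts


# Hypothetical SAM records (sequence and quality replaced by "*")
sam = [
    "@HD\tVN:1.6",
    "r1\t99\tchr1\t100\t42\t50M\t=\t200\t150\t*\t*",
    "r1\t147\tchr1\t200\t42\t50M\t=\t100\t-150\t*\t*",
    "r2\t99\tchr1\t500\t42\t50M\t=\t650\t200\t*\t*",
    "r2\t147\tchr1\t650\t42\t50M\t=\t500\t-200\t*\t*",
    "r3\t99\tchr2\t300\t42\t50M\t=\t400\t150\t*\t*",
    "r3\t147\tchr2\t400\t42\t50M\t=\t300\t-150\t*\t*",
]

size_counts = fragment_sizes(sam)
print(sorted(size_counts.items()))
```

The resulting counts can be passed to any plotting library to draw the fragment length histogram.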

De novo motif discovery

To analyse ATAC-seq or ChIP-seq peaks for enriched transcription factor motifs, genomic coordinates were analysed using findMotifsGenome.pl with -cpg -mask -size 200 -bg parameters from the Homer package (v4.7; Heinz et al., 2010). Background sequences were total accessible regions from all samples for ATAC-seq analysis and whole genome for ChIP-seq analysis.

Gene set enrichment analysis and gene ontology analysis

Pre-ranked genes (ranked by log2(fold change)) were subject to gene set enrichment analysis from hallmark gene sets (h.all.v6.2) using GSEAPreranked from GSEA v3.0 (Subramanian et al., 2005). Gene ontology analysis was carried out using Metascape (Zhou et al., 2019; metascape.org).
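Preparing a pre-ranked gene list can be sketched as follows. The sketch assumes a tab-separated two-column layout (gene, score) sorted from the highest to the lowest log2 fold change; the gene names, values and function name are hypothetical.

```python
def make_rank_lines(log2_fold_changes):
    """Return tab-separated 'gene<TAB>score' lines sorted by descending
    log2 fold change."""
    ranked = sorted(log2_fold_changes.items(),
                    key=lambda item: item[1], reverse=True)
    return [f"{gene}\t{score:.4f}" for gene, score in ranked]


# Hypothetical log2 fold changes
example = {"GENE_A": 0.5, "GENE_B": -1.0, "GENE_C": 2.0}
rank_lines = make_rank_lines(example)
print("\n".join(rank_lines))
```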

Footprinting analysis

To analyse footprinting signatures in ATAC-seq data, the TOBIAS package was used (v0.5.1; Bentsen et al., 2020; available at https://github.molgen.mpg.de/loosolab/TOBIAS). Merged BAM files from each condition were processed using ATACorrect, footprint scores were calculated using FootprintScores and differential footprinting analysis was performed using BINDetect. Footprinting plots across identified footprints at TF motifs were plotted using plotProfile from the deepTools package (v2.5.0; Ramírez et al., 2016).

ChIP-qPCR and ChIP-seq and analysis

ChIP-qPCR and ChIP-seq analyses were carried out as described previously (Wiseman et al., 2015). For ChIP-qPCR, 2.5 × 10⁶ cells and 1 μg antibody were used, and samples were analysed using a Rotor-Gene SYBR Green PCR Kit (Qiagen, 204074). For ChIP-seq, 1 × 10⁷ cells, 5 µg target protein antibody, 1 µg Spike-in antibody (Active Motif, 61686) and 50 μl Protein A Dynabeads were used. 20 ng Spike-in Drosophila chromatin (Active Motif, 53083) was added to chromatin preps for Spike-in normalisation, as described previously (Egan et al., 2016). Normal rabbit IgG (Millipore, 12–370) antibody was used in parallel as a control. DNA libraries were prepared using the TruSeq ChIP sample prep kit (Illumina) and sequenced on a HiSeq 4000 (Illumina) platform. Sequencing reads were aligned to GRCh37 (hg19) and dm6 using Bowtie2 v2.2.3 (Langmead et al., 2009). Reads aligning to the Drosophila genome were counted and used to generate scale factors. BAM files were then scaled to the sample with the lowest number of Drosophila reads. Only reads with a mapping quality >q30 were retained. Peak calling was performed on individual replicates using MACS2 v2.1.1 (Zhang et al., 2008) using default parameters with the additional --SPMR parameter. bedGraph files were converted to BigWig using bedGraphToBigWig and visualised in the UCSC Genome Browser. The overlap of peaks between two biological replicates was calculated using BEDtools v2.26.0 (Quinlan and Hall, 2010) using bedtools intersect with default settings and the -f 0.3 parameter. Peaks present in both datasets were taken forward for further analysis.
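The spike-in scaling step can be sketched as below. The sketch assumes that each sample's scale factor is the lowest Drosophila read count across samples divided by that sample's own Drosophila read count, which is one reading of "scaled to the sample with the lowest number of Drosophila reads"; the sample names and counts are hypothetical.

```python
def spike_in_scale_factors(drosophila_read_counts):
    """Return a per-sample scale factor relative to the sample with the
    fewest Drosophila (spike-in) reads, which receives a factor of 1.0."""
    lowest = min(drosophila_read_counts.values())
    return {sample: lowest / count
            for sample, count in drosophila_read_counts.items()}


# Hypothetical Drosophila read counts per sample
dm_counts = {"OE19_rep1": 200_000, "OE19_rep2": 400_000, "CPA_rep1": 250_000}
factors = spike_in_scale_factors(dm_counts)
print(factors)
```

A factor below 1.0 indicates the fraction of a sample's reads that would be kept when downsampling its BAM file to match the reference sample.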

Differential binding analysis was performed using DESeq2 (Love et al., 2014). Overlaps from biological repeats were merged using bedtools merge to generate a final set of peaks. featureCounts from the SUBread package (Liao et al., 2014) was used to count reads within peaks from ChIP-seq samples and these were used as input for DESeq2 to calculate differential binding using default settings. A linear fold change of ±2 and a Q-value of <0.05 were used as a cut-off for further analyses.

ChIP-seq visualisation

Heatmaps of ChIP-seq signal were generated using computeMatrix and plotHeatmap from the deepTools package (Ramírez et al., 2016). Tag density plots were generated using computeMatrix and plotProfile tools from the deepTools package. Correlation of biological replicates was visualised using multiBigwigSummary and plotCorrelation. Euler diagrams were generated using the eulerr R package (available at eulerr.co).

Principal component analysis

Principal component analysis was performed using the prcomp function in R (v3.5.1, R Core Team, 2018) using log2 transformed RNA-seq or ATAC-seq normalised counts. Principal component scores were then plotted in Excel.

Patient survival analysis

Average expression of KLF5 or KLF5 target genes (OAC upregulated and siKLF5 downregulated) was calculated per patient and patients were ranked by average target gene expression. The median was calculated and patients were classified as either above or below median expression. Survival (months) was plotted for each group and a log-rank test was carried out using GraphPad Prism v8.
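The median split can be sketched as follows. The sketch assumes that a patient whose average equals the median is placed in the "below" group, a detail the text does not specify; the patient identifiers and expression values are hypothetical, and the log-rank test itself is not reproduced here.

```python
from statistics import mean, median


def classify_by_median(expression_by_patient):
    """Average target gene expression per patient, then split patients
    at the median of those averages.

    Assumption: ties with the median are assigned to the 'below' group.
    """
    scores = {patient: mean(values)
              for patient, values in expression_by_patient.items()}
    cutoff = median(scores.values())
    groups = {patient: ("above" if score > cutoff else "below")
              for patient, score in scores.items()}
    return scores, cutoff, groups


# Hypothetical expression values of target genes per patient
expression = {
    "P1": [1.0, 3.0],
    "P2": [4.0, 6.0],
    "P3": [0.5, 1.5],
    "P4": [7.0, 9.0],
}

scores, cutoff, groups = classify_by_median(expression)
print(cutoff, groups)
```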

Statistical analysis

To determine statistical significance between two groups, a Student’s unpaired two-tailed t-test was carried out using GraphPad Prism v7. To assess the changes in expression of a group of genes, a one-way ANOVA test was carried out in GraphPad Prism v7. To assess the significance of gene/region overlaps derived from sequencing data, a hypergeometric distribution test was carried out using the phyper function in R. p-values <0.05 were considered significant.
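The overlap test can be illustrated with a minimal sketch of the upper-tail hypergeometric probability. The exact phyper arguments used in the study are not stated; the sketch assumes the common convention of reporting P(X ≥ observed overlap), and the function name and numbers are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(overlap, set_a, set_b, universe):
    """Probability of observing at least `overlap` shared items when
    `set_b` items are drawn without replacement from a universe of
    `universe` items, of which `set_a` belong to the first set."""
    total = comb(universe, set_b)
    upper = min(set_a, set_b)
    favourable = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(overlap, upper + 1)
    )
    return favourable / total


# Hypothetical example: two sets of 5 items from a universe of 10 share 4
p_value = hypergeom_upper_tail(overlap=4, set_a=5, set_b=5, universe=10)
print(p_value)
```

Under this convention the equivalent R call would be phyper(overlap - 1, set_a, universe - set_a, set_b, lower.tail = FALSE).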

Datasets

All data were obtained from ArrayExpress, unless stated otherwise. ATAC-seq data from human BO, OAC tissue and OE19 cells were obtained from E-MTAB-5169 (Britton et al., 2017) and E-MTAB-6751 (Rogerson et al., 2019). BO and OAC RNA-seq data were obtained from E-MTAB-4054 (Maag et al., 2017) and European Genome-phenome Archive (EGA)(EGAD00001005915). GATA6 and HNF4A ChIP-seq were obtained from E-MTAB-6858 and siGATA6 RNA-seq from E-MTAB-6756. The Cancer Genome Atlas OAC ATAC-seq data were obtained from the GDC data portal (portal.gdc.cancer.gov; Corces et al., 2018).

Data access

All sequencing data are deposited in ArrayExpress. Additional OAC ATAC-seq data are available at E-MTAB-8447 and additional BO and OAC RNA-seq data are available at E-MTAB-8584. siKLF5 RNA-seq data are available at E-MTAB-8446. KLF5 ChIP-seq data are available at E-MTAB-8568. siERBB2 ATAC-seq and RNA-seq data are available at E-MTAB-8576 and E-MTAB-8579 respectively. CP-A ATAC-seq data are available at E-MTAB-8994.

Acknowledgements

We thank Guanhua Yan for excellent technical assistance, staff in the Bioinformatics and Genomic Technologies core facilities, and Mike Jackson in the Flow Cytometry facility. We also thank Elizabeth Ratcliffe for collection of BO samples, and Nicoletta Bobola and Shen-Hsi Yang for critical appraisal of the manuscript. This work was funded by grants from the Wellcome Trust (103857/Z/14/Z) and Cancer Research UK (through the Manchester Cancer Research Centre).

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Cell line (H. sapiens) OE19 ECACC 96071721
Cell line (H. sapiens) CP-A ATCC KR-42421
Cell line (H. sapiens) OE19-dCas9-KRAB This study OE19 transfected with vector to express dCas9-KRAB under doxycycline control
Cell line (H. sapiens) CP-A-ERBB2 This study CP-A stably overexpressing ERBB2
Cell line (H. sapiens) CP-A-empty This study CP-A cells containing an empty vector control
Biological sample (H. sapiens) Barrett’s oesophagus biopsies Salford NHS FT Freshly isolated from patients undergoing endoscopy
Biological sample (H. sapiens) Oesophageal adenocarcinoma biopsies Salford NHS FT Freshly isolated from patients undergoing endoscopy
Transfected construct (human) SmartPool siRNA against KLF5 Horizon Discovery L-013571-00-0005
Transfected construct (human) SmartPool siRNA against ERBB2 Horizon Discovery L-003126-00-0005
Transfected construct (human) SmartPool non-targeting siRNA Horizon Discovery D-001810-10-0020
Transfected construct (human) Full length non-targeting guide RNA Synthego 5’-GUAAGGCUAUGAAGAGAUAC-3’
Transfected construct (human) Full length guide RNAs targeting KLF5 TSS Synthego 5’-GUGCGCUCGCGGUUCUCUCG-3’; 5’-AGGACGUUGGCGUUUACGUG-3’; 5’-GCGUCAAGUGUCAGUAGUCG-3’
Antibody Rabbit monoclonal KLF5 antibody Abcam ab137676 (1:10000) for western blot; 5 µg for ChIP-seq
Antibody Mouse monoclonal tubulin antibody Sigma-Aldrich T9026 (1:2000) for western blot
Antibody Spike-in antibody Active Motif 61686 1 µg for ChIP-seq
Antibody Mouse monoclonal ErbB2 antibody ThermoFisher MA5-14057 (1:1000)
Antibody Mouse monoclonal AKT antibody Cell Signaling Technology 2920 (1:2000)
Antibody Rabbit monoclonal phospho-Akt (S473) antibody Cell Signaling Technology 4060S (1:2000)
Antibody Rabbit monoclonal Erk1/2 antibody Cell Signaling Technology 4695S (1:1000)
Antibody Mouse monoclonal phospho-Erk1/2 (T202,Y204) Cell Signaling Technology 9106S (1:2000)
Antibody Donkey anti-mouse secondary antibody (800CW) Licor 925-32212 (1:10,000)
Antibody Donkey anti-rabbit secondary antibody (700CW) Licor 925-32213 (1:10,000)
Recombinant DNA reagent pX330-U6-Chimeric_BB-CBh-hSpCas9 (plasmid) Addgene #42230 AAVS guide RNA sequence 5’-GGGCCACTAGGGACAGGAT-3’
Recombinant DNA reagent pAAVS1-Puro-TRE-dCas9-KRAB-DNR (plasmid) This study pAS-4939
Recombinant DNA reagent pHAGE-ERBB2 Addgene 116734
Recombinant DNA reagent pHAGE-empty This study pAS-4940
Recombinant DNA reagent pMD2.G Addgene 12259
Recombinant DNA reagent psPAX2 Addgene 12260
Sequence-based reagent Primers This study See Supplementary file 11
Commercial assay or kit Lipofectamine RNAiMAX ThermoFisher 13778150
Commercial assay or kit Fugene HD Promega E2311
Commercial assay or kit QuantiTect SYBR Green RT-PCR Kit Qiagen 204243
Commercial assay or kit RNeasy Plus Mini Kit Qiagen 74134
Commercial assay or kit RNase-free DNase set Qiagen 79254
Commercial assay or kit Ampure XP beads Beckman Coulter Agencourt A63881
Commercial assay or kit TruSeq stranded RNA library kit v2 Illumina RS-122–2001
Commercial assay or kit Nextera DNA library prep kit Illumina FC-121–1031
Commercial assay or kit Nextera Index kit Illumina FC-121–1012
Commercial assay or kit NEBNext high fidelity 2x PCR master mix NEB M0541
Commercial assay or kit DNA Clean and Concentrator Zymo D4013
Commercial assay or kit Polyfect Qiagen 301107
Commercial assay or kit PEG-it System Biosciences LV810A-1
Commercial assay or kit Polybrene EMD Millipore TR-1003
Chemical compound, drug RS-1 Sigma-Aldrich R9782 Used at final concentration 7.5 μM
Chemical compound, drug SCR7 pyrazine Sigma-Aldrich SML1546 Used at final concentration 1 μM
Chemical compound, drug Doxycycline Sigma-Aldrich D3447 Used at final concentration of 100 ng/mL
Chemical compound, drug propidium iodide Sigma P4170 Used at 50 μg/mL
Peptide, recombinant protein RNase Sigma R4642 Used at 100 μg/mL
Peptide, recombinant protein EGF ThermoFisher 10450–013 5 μg/L
Peptide, recombinant protein Bovine pituitary extract ThermoFisher 13028014 Used at 50 mg/L
Software, algorithm Trimmomatic Bolger et al., 2014 V0.34 http://www.usadellab.org/cms/?page=trimmomatic
Software, algorithm Bowtie2 Langmead and Salzberg, 2012 v2.3.0 http://bowtie-bio.sourceforge.net/bowtie2/index.shtml
Software, algorithm Star Dobin et al., 2013 V2.5.4 https://github.com/alexdobin/STAR
Software, algorithm Macs2 Zhang et al., 2008 v2.1.1 https://github.com/taoliu/MACS
Software, algorithm Cufflinks Trapnell et al., 2012 v2.2.1 http://cole-trapnell-lab.github.io/cufflinks/
Software, algorithm DEseq2 Love et al., 2014 V1.22.2 https://bioconductor.org/packages/release/bioc/html/DESeq2.html
Software, algorithm TOBIAS Bentsen et al., 2020 v0.5.1 https://github.com/loosolab/TOBIAS
Software, algorithm featureCounts Liao et al., 2014 V1.6.2 http://subread.sourceforge.net
Software, algorithm FastQC v0.11.4 https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Software, algorithm bedtools Quinlan and Hall, 2010 v2.26.0 https://bedtools.readthedocs.io/en/latest/
Software, algorithm DeepTools Ramírez et al., 2016 V2.5.0 https://deeptools.readthedocs.io/en/develop/
Software, algorithm GSEA Subramanian et al., 2005 V3.0 http://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Main_Page
Software, algorithm Homer Heinz et al., 2010 v4.9 http://homer.ucsd.edu/homer/
Software, algorithm R R Core Team (2018) v3.5.1 https://www.r-project.org/
Software, algorithm GraphPad Prism V8.0 www.graphpad.com
Other Crystal violet Sigma Aldrich HT90132 Used at concentration of 0.1%
Other Gibco RPMI 1640 ThermoFisher 52400
Other Gibco fetal bovine serum ThermoFisher 10270
Other Gibco penicillin/streptomycin ThermoFisher 15140122
Other Keratinocyte SFM (1x) ThermoFisher 17005042

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Yeng Ang, Email: Yeng.Ang@srft.nhs.uk.

Andrew D Sharrocks, Email: andrew.d.sharrocks@manchester.ac.uk.

Irwin Davidson, Institut de Génétique et de Biologie Moléculaire et Cellulaire, France.

Kevin Struhl, Harvard Medical School, United States.

Funding Information

This paper was supported by the following grants:

  • Cancer Research UK Clinical PhD and PhD funding to Connor Rogerson, Edward Britton, Yeng Ang, Andrew D Sharrocks.

  • Wellcome Programme grant and studentship 103857/Z/14/Z to Samuel Ogden, Andrew D Sharrocks.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Investigation, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Formal analysis, Writing - review and editing.

Investigation, Writing - review and editing.

Resources.

Conceptualization, Resources, Supervision, Project administration, Writing - review and editing.

Conceptualization, Resources, Supervision, Funding acquisition, Project administration, Writing - review and editing.

Ethics

Human subjects: Ethical approval was via the ethics committee of Salford Royal NHS Foundation Trust (04/Q1410/57). Patient consent was obtained in written form and signed by the patient and doctor.

Additional files

Source code 1. ATAC fragment size visualisation.
Supplementary file 1. Differentially expressed genes in OAC.

Significantly (±1.5 x; Q-value <0.05) differentially expressed genes between BO (n = 13) and OAC (n = 12) (Maag et al., 2017).

elife-57189-supp1.xlsx (104.8KB, xlsx)
Supplementary file 2. Differentially accessible regions within ±250 kb of TSS of a DEG.

(A) Total accessible regions from BO (n = 4) and OAC (n = 6) samples. (B) Significant differentially accessible open regions (+2x; Q-value <0.1). (C) Significant differentially accessible closed regions (−2x; Q-value <0.1).

elife-57189-supp2.xlsx (3.9MB, xlsx)
Supplementary file 3. DNA motifs enriched in OAC-specific open chromatin regions.

Top ten motifs found by de novo motif discovery and their associated transcription factors that are enriched in ‘open in OAC’ (top) or ‘closed in OAC’ (bottom).

elife-57189-supp3.xlsx (12.9KB, xlsx)
Supplementary file 4. siKLF5 RNA-seq analysis.

Significant differentially expressed genes with siKLF5 treatment (±1.3 x, Q-value <0.05)

elife-57189-supp4.xlsx (499.5KB, xlsx)
Supplementary file 5. KLF5 ChIP-seq datasets.

(A) ChIP-seq peaks in OE19 cells. (B) ChIP-seq peaks in CP-A cells. (C) Differentially bound KLF5 ChIP-seq peaks (CP-A vs OE19).

elife-57189-supp5.xlsx (2.7MB, xlsx)
Supplementary file 6. De novo analysis of DNA motif enrichment in KLF5 ChIP-seq peak datasets.
elife-57189-supp6.xlsx (13.9KB, xlsx)
Supplementary file 7. (A) Frequency of KLF5, GATA1, FOXA2, FRA1 and TCF7L2 motifs within OE19 specific KLF5 ChIP-seq regions. 1 denotes present and 0 absent. (B) Overlaps of motifs and the basis of Figure 4G (A. KLF5; B. GATA1; C. FOXA2; D. FRA1; E. TCF7L2).
elife-57189-supp7.xlsx (31.1KB, xlsx)
Supplementary file 8. DNA motifs enriched in cluster 1 and cluster 2 regions.

Top 10 motifs found by de novo motif discovery and their associated transcription factors that are enriched in cluster 1 (top) or cluster 2 (bottom).

elife-57189-supp8.xlsx (13.3KB, xlsx)
Supplementary file 9. Genomic coordinates of regions on OE19 cells that show a decrease in ATAC-seq signal upon treatment of siERBB2 for 72 hr.
elife-57189-supp9.xlsx (27.1KB, xlsx)
Supplementary file 10. De novo discovered motifs from regions that exhibit reduced chromatin accessibility upon treatment of siERBB2 for 72 hr.

De novo motifs, % targets and % background, called transcription factor with match score and p-value are shown.

elife-57189-supp10.xlsx (13.9KB, xlsx)
Supplementary file 11. List of PCR primers used in RT-qPCR and ChIP-qPCR experiments.
Transparent reporting form

Data availability

All sequencing data are deposited in ArrayExpress. Additional OAC ATAC-seq data are available at E-MTAB-8447 and additional BO and OAC RNA-seq data are available at E-MTAB-8584. siKLF5 RNA-seq data are available at E-MTAB-8446. KLF5 ChIP-seq data are available at E-MTAB-8568. siERBB2 ATAC-seq and RNA-seq data are available at E-MTAB-8576 and E-MTAB-8579 respectively.

The following datasets were generated:

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8447

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8446

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8568

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8584

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8576

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8579

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. EGA. EGAD00001005915

Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8994

The following previously published datasets were used:

Britton E, Rogerson C, Mehta S, Li Y, Li X, The OCCAMS Consortium. Fitzgerald RC, Ang YS, Sharrocks AD. 2017. Open chromatin profiling identifies AP1 as a transcriptional regulator in oesophageal adenocarcinoma. ArrayExpress. E-MTAB-5169

Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6751

Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6756

Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6758

Maag JLV, Fisher OM, Levert-Mignon A, Kaczorowski DC, Thomas ML, Hussey DJ, Watson DI, Wettstein A, Bobryshev YV, Edwards M, Dinger ME, Lord RV. 2017. Novel Aberrations Uncovered in Barrett's Esophagus and Esophageal Adenocarcinoma Using Whole Transcriptome Sequencing. ArrayExpress. E-MTAB-4054

Corces MR, Granja JM, Shams S, Louie BH, Seoane JA, Zhou W, Silva TC, Groeneveld C, Wong CK, Cho SW, Satpathy AT, Mumbach MR, Hoadley KA, Robertson AG, Sheffield NC, Felau I, Castro MAA, Berman BP, Staudt LM, Zenklusen JC, Laird PW, Curtis C, The TCGA Network. Greenleaf WJ, Chang HY. 2018. The chromatin accessibility landscape of primary human cancers. GDC Data Portal. TCGA-ESCA

References

  1. Bentsen M, Goymann P, Schultheis H, Klee K, Petrova A, Wiegandt R, Fust A, Preussner J, Kuenne C, Braun T, Kim J, Looso M. ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation. Nature Communications. 2020;11:4267. doi: 10.1038/s41467-020-18035-1.
  2. Bianco JN, Bergoglio V, Lin YL, Pillaire MJ, Schmitz AL, Gilhodes J, Lusque A, Mazières J, Lacroix-Triki M, Roumeliotis TI, Choudhary J, Moreaux J, Hoffmann JS, Tourrière H, Pasero P. Overexpression of claspin and timeless protects cancer cells from replication stress in a checkpoint-independent manner. Nature Communications. 2019;10:910. doi: 10.1038/s41467-019-08886-8.
  3. Blum AE, Venkitachalam S, Ravillah D, Chelluboyina AK, Kieber-Emmons AM, Ravi L, Kresak A, Chandar AK, Markowitz SD, Canto MI, Wang JS, Shaheen NJ, Guo Y, Shyr Y, Willis JE, Chak A, Varadan V, Guda K. Systems biology analyses show hyperactivation of transforming growth factor-β and JNK signaling pathways in esophageal cancer. Gastroenterology. 2019;156:1761–1774. doi: 10.1053/j.gastro.2019.01.263.
  4. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.
  5. Britton E, Rogerson C, Mehta S, Li Y, Li X, Fitzgerald RC, Ang YS, Sharrocks AD, OCCAMS consortium. Open chromatin profiling identifies AP1 as a transcriptional regulator in oesophageal adenocarcinoma. PLOS Genetics. 2017;13:e1006879. doi: 10.1371/journal.pgen.1006879.
  6. Burke ZD, Tosh D. Barrett's metaplasia as a paradigm for understanding the development of cancer. Current Opinion in Genetics & Development. 2012;22:494–499. doi: 10.1016/j.gde.2012.08.001.
  7. Cancer Genome Atlas Research Network. Integrated genomic characterization of oesophageal carcinoma. Nature. 2017;541:169–175. doi: 10.1038/nature20805.
  8. Chen C, Bhalala HV, Qiao H, Dong J-T. A possible tumor suppressor role of the KLF5 transcription factor in human breast Cancer. Oncogene. 2002;21:6567–6572. doi: 10.1038/sj.onc.1205817. [DOI] [PubMed] [Google Scholar]
  9. Chen L, Huang M, Plummer J, Pan J, Jiang YY, Yang Q, Silva TC, Gull N, Chen S, Ding LW, An O, Yang H, Cheng Y, Said JW, Doan N, Dinjens WNM, Waters KM, Tuli R, Gayther SA, Klempner SJ, Berman BP, Meltzer SJ, Lin D-C, Koeffler HP. Master transcription factors form interconnected circuitry and orchestrate transcriptional networks in oesophageal adenocarcinoma. Gut. 2020;69:630–640. doi: 10.1136/gutjnl-2019-318325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chia NY, Deng N, Das K, Huang D, Hu L, Zhu Y, Lim KH, Lee MH, Wu J, Sam XX, Tan GS, Wan WK, Yu W, Gan A, Tan AL, Tay ST, Soo KC, Wong WK, Dominguez LT, Ng HH, Rozen S, Goh LK, Teh BT, Tan P. Regulatory crosstalk between lineage-survival oncogenes KLF5, GATA4 and GATA6 cooperatively promotes gastric Cancer development. Gut. 2015;64:707–719. doi: 10.1136/gutjnl-2013-306596. [DOI] [PubMed] [Google Scholar]
  11. Coleman HG, Xie SH, Lagergren J. The epidemiology of esophageal adenocarcinoma. Gastroenterology. 2018;154:390–405. doi: 10.1053/j.gastro.2017.07.046. [DOI] [PubMed] [Google Scholar]
  12. Corces MR, Trevino AE, Hamilton EG, Greenside PG, Sinnott-Armstrong NA, Vesuna S, Satpathy AT, Rubin AJ, Montine KS, Wu B, Kathiria A, Cho SW, Mumbach MR, Carter AC, Kasowski M, Orloff LA, Risca VI, Kundaje A, Khavari PA, Montine TJ, Greenleaf WJ, Chang HY. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nature Methods. 2017;14:959–962. doi: 10.1038/nmeth.4396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Corces MR, Granja JM, Shams S, Louie BH, Seoane JA, Zhou W, Silva TC, Groeneveld C, Wong CK, Cho SW, Satpathy AT, Mumbach MR, Hoadley KA, Robertson AG, Sheffield NC, Felau I, Castro MAA, Berman BP, Staudt LM, Zenklusen JC, Laird PW, Curtis C, Greenleaf WJ, Chang HY, Cancer Genome Atlas Analysis Network The chromatin accessibility landscape of primary human cancers. Science. 2018;362:eaav1898. doi: 10.1126/science.aav1898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dahlberg PS, Jacobson BA, Dahal G, Fink JM, Kratzke RA, Maddaus MA, Ferrin LJ. ERBB2 amplifications in esophageal adenocarcinoma. The Annals of Thoracic Surgery. 2004;78:1790–1800. doi: 10.1016/j.athoracsur.2004.05.037. [DOI] [PubMed] [Google Scholar]
  15. Davie K, Jacobs J, Atkins M, Potier D, Christiaens V, Halder G, Aerts S. Discovery of transcription factors and regulatory regions driving in vivo tumor development by ATAC-seq and FAIRE-seq open chromatin profiling. PLOS Genetics. 2015;11:e1004994. doi: 10.1371/journal.pgen.1004994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Denny SK, Yang D, Chuang CH, Brady JJ, Lim JS, Grüner BM, Chiou SH, Schep AN, Baral J, Hamard C, Antoine M, Wislez M, Kong CS, Connolly AJ, Park KS, Sage J, Greenleaf WJ, Winslow MM. Nfib promotes metastasis through a widespread increase in chromatin accessibility. Cell. 2016;166:328–342. doi: 10.1016/j.cell.2016.05.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Egan B, Yuan CC, Craske ML, Labhart P, Guler GD, Arnott D, Maile TM, Busby J, Henry C, Kelly TK, Tindell CA, Jhunjhunwala S, Zhao F, Hatton C, Bryant BM, Classon M, Trojer P. An alternative approach to ChIP-Seq normalization enables detection of Genome-Wide changes in histone H3 lysine 27 trimethylation upon EZH2 inhibition. PLOS ONE. 2016;11:e0166438. doi: 10.1371/journal.pone.0166438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. International Journal of Cancer. 2015;136:E359–E386. doi: 10.1002/ijc.29210. [DOI] [PubMed] [Google Scholar]
  20. Frankell AM, Jammula S, Li X, Contino G, Killcoyne S, Abbas S, Perner J, Bower L, Devonshire G, Ococks E, Grehan N, Mok J, O'Donovan M, MacRae S, Eldridge MD, Tavaré S, Fitzgerald RC, Oesophageal Cancer Clinical and Molecular Stratification (OCCAMS) Consortium. The landscape of selection in 551 esophageal adenocarcinomas defines genomic biomarkers for the clinic. Nature Genetics. 2019;51:506–516. doi: 10.1038/s41588-018-0331-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hanahan D, Weinberg RA. Hallmarks of Cancer: the next generation. Cell. 2011;144:646–674. doi: 10.1016/j.cell.2011.02.013. [DOI] [PubMed] [Google Scholar]
  22. Hassan MS, Awasthi N, Li J, Schwarz MA, Schwarz RE, Holzen U, PLOS ONE Staff. A novel intraperitoneal metastatic xenograft mouse model for survival outcome assessment of esophageal adenocarcinoma. PLOS ONE. 2017;12:e0171824. doi: 10.1371/journal.pone.0171824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. He P, Yang JW, Yang VW, Bialkowska AB. Krüppel-like factor 5, increased in pancreatic ductal adenocarcinoma, promotes proliferation, Acinar-to-Ductal metaplasia, pancreatic intraepithelial neoplasia, and tumor growth in mice. Gastroenterology. 2018;154:1494–1508. doi: 10.1053/j.gastro.2017.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Heigwer F, Kerr G, Boutros M. E-CRISP: fast CRISPR target site identification. Nature Methods. 2014;11:122–123. doi: 10.1038/nmeth.2812. [DOI] [PubMed] [Google Scholar]
  25. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Molecular Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hong J, Katsha A, Lu P, Shyr Y, Belkhiri A, El-Rifai W. Regulation of ERBB2 receptor by t-DARPP mediates trastuzumab resistance in human esophageal adenocarcinoma. Cancer Research. 2012;72:4504–4514. doi: 10.1158/0008-5472.CAN-12-1119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kelso TWR, Porter DK, Amaral ML, Shokhirev MN, Benner C, Hargreaves DC. Chromatin accessibility underlies synthetic lethality of SWI/SNF subunits in ARID1A-mutant cancers. eLife. 2017;6:e30506. doi: 10.7554/eLife.30506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Research. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
  32. Lin L, Bass AJ, Lockwood WW, Wang Z, Silvers AL, Thomas DG, Chang AC, Lin J, Orringer MB, Li W, Glover TW, Giordano TJ, Lam WL, Meyerson M, Beer DG. Activation of GATA binding protein 6 (GATA6) sustains oncogenic lineage-survival in esophageal adenocarcinoma. PNAS. 2012;109:4251–4256. doi: 10.1073/pnas.1011989109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Maag JLV, Fisher OM, Levert-Mignon A, Kaczorowski DC, Thomas ML, Hussey DJ, Watson DI, Wettstein A, Bobryshev YV, Edwards M, Dinger ME, Lord RV. Novel aberrations uncovered in Barrett's Esophagus and Esophageal Adenocarcinoma Using Whole Transcriptome Sequencing. Molecular Cancer Research. 2017;15:1558–1569. doi: 10.1158/1541-7786.MCR-17-0332. [DOI] [PubMed] [Google Scholar]
  35. Mourikis TP, Benedetti L, Foxall E, Temelkovski D, Nulsen J, Perner J, Cereda M, Lagergren J, Howell M, Yau C, Fitzgerald RC, Scaffidi P, Ciccarelli FD, Oesophageal Cancer Clinical and Molecular Stratification (OCCAMS) Consortium. Patient-specific Cancer genes contribute to recurrently perturbed pathways and establish therapeutic vulnerabilities in esophageal adenocarcinoma. Nature Communications. 2019;10:1–17. doi: 10.1038/s41467-019-10898-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nandan MO, Chanchevalap S, Dalton WB, Yang VW. Krüppel-like factor 5 promotes mitosis by activating the cyclin B1/Cdc2 complex during oncogenic Ras-mediated transformation. FEBS Letters. 2005;579:4757–4762. doi: 10.1016/j.febslet.2005.07.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Pattison JM, Posternak V, Cole MD. Transcription factor KLF5 binds a cyclin E1 polymorphic intronic enhancer to confer increased bladder Cancer risk. Molecular Cancer Research. 2016;14:1078–1086. doi: 10.1158/1541-7786.MCR-16-0123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Pennathur A, Gibson MK, Jobe BA, Luketich JD. Oesophageal carcinoma. The Lancet. 2013;381:400–412. doi: 10.1016/S0140-6736(12)60643-6. [DOI] [PubMed] [Google Scholar]
  39. Qin J, Zhou Z, Chen W, Wang C, Zhang H, Ge G, Shao M, You D, Fan Z, Xia H, Liu R, Chen C. BAP1 promotes breast Cancer cell proliferation and metastasis by deubiquitinating KLF5. Nature Communications. 2015;6:8471. doi: 10.1038/ncomms9471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Reddy J, Fonseca MAS, Corona RI, Nameki R, Segato Dezem F, Klein IA, Chang H, Chaves-Moreira D, Afeyan L, Malta TM, Lin X, Abbasi F, Font-Tello A, Sabedot T, Cejas P, Rodríguez-Malavé N, Seo J-H, Lin D-C, Matulonis U, Karlan BY, Gayther SA, Gusev A, Noushmehr H, Long H, Freedman ML, Drapkin R, Abraham BJ, Young RA, Lawrenson K. Predicting master transcription factors from pan-cancer expression data. bioRxiv. 2019. doi: 10.1101/839142. [DOI] [PMC free article] [PubMed]
  43. Rendeiro AF, Schmidl C, Strefford JC, Walewska R, Davis Z, Farlik M, Oscier D, Bock C. Chromatin accessibility maps of chronic lymphocytic leukaemia identify subtype-specific epigenome signatures and transcription regulatory networks. Nature Communications. 2016;7:11938. doi: 10.1038/ncomms11938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Rogerson C, Britton E, Withey S, Hanley N, Ang YS, Sharrocks AD. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its precancerous precursor state. Genome Research. 2019;29:723–736. doi: 10.1101/gr.243345.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Ross-Innes CS, Becq J, Warren A, Cheetham RK, Northen H, O'Donovan M, Malhotra S, di Pietro M, Ivakhno S, He M, Weaver JMJ, Lynch AG, Kingsbury Z, Ross M, Humphray S, Bentley D, Fitzgerald RC. Whole-genome sequencing provides new insights into the clonal architecture of Barrett's esophagus and esophageal adenocarcinoma. Nature Genetics. 2015;47:1038–1046. doi: 10.1038/ng.3357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Smyth EC, Lagergren J, Fitzgerald RC, Lordick F, Shah MA, Lagergren P, Cunningham D. Oesophageal cancer. Nature Reviews Disease Primers. 2017;3:17048. doi: 10.1038/nrdp.2017.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Spechler SJ, Souza RF. Barrett's esophagus. New England Journal of Medicine. 2014;371:836–845. doi: 10.1056/NEJMra1314704. [DOI] [PubMed] [Google Scholar]
  48. Stachler MD, Taylor-Weiner A, Peng S, McKenna A, Agoston AT, Odze RD, Davison JM, Nason KS, Loda M, Leshchiner I, Stewart C, Stojanov P, Seepo S, Lawrence MS, Ferrer-Torres D, Lin J, Chang AC, Gabriel SB, Lander ES, Beer DG, Getz G, Carter SL, Bass AJ. Paired exome analysis of Barrett's esophagus and adenocarcinoma. Nature Genetics. 2015;47:1047–1055. doi: 10.1038/ng.3343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Stachler MD, Camarda ND, Deitrick C, Kim A, Agoston AT, Odze RD, Hornick JL, Nag A, Thorner AR, Ducar M, Noffsinger A, Lash RH, Redston M, Carter SL, Davison JM, Bass AJ. Detection of mutations in Barrett's Esophagus Before Progression to High-Grade Dysplasia or Adenocarcinoma. Gastroenterology. 2018;155:156–167. doi: 10.1053/j.gastro.2018.03.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. PNAS. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Tarapore RS, Yang Y, Katz JP. Restoring KLF5 in esophageal squamous cell Cancer cells activates the JNK pathway leading to apoptosis and reduced cell survival. Neoplasia. 2013;15:472–IN3. doi: 10.1593/neo.122126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Tome-Garcia J, Erfani P, Nudelman G, Tsankov AM, Katsyv I, Tejero R, Bin Z, Walsh M, Friedel RH, Zaslavsky E, Tsankova NM. Analysis of chromatin accessibility uncovers TEAD1 as a regulator of migration in human glioblastoma. Nature Communications. 2018;9:4020. doi: 10.1038/s41467-018-06258-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and cufflinks. Nature Protocols. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Wang C, Nie Z, Zhou Z, Zhang H, Liu R, Wu J, Qin J, Ma Y, Chen L, Li S, Chen W, Li F, Shi P, Wu Y, Shen J, Chen C. The interplay between TEAD4 and KLF5 promotes breast Cancer partially through inhibiting the transcription of p27Kip1. Oncotarget. 2015;6:17685–17697. doi: 10.18632/oncotarget.3779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Weaver JMJ, Ross-Innes CS, Shannon N, Lynch AG, Forshew T, Barbera M, Murtaza M, Ong CJ, Lao-Sirieix P, Dunning MJ, Smith L, Smith ML, Anderson CL, Carvalho B, O'Donovan M, Underwood TJ, May AP, Grehan N, Hardwick R, Davies J, Oloumi A, Aparicio S, Caldas C, Eldridge MD, Edwards PAW, Rosenfeld N, Tavaré S, Fitzgerald RC, OCCAMS Consortium. Ordering of mutations in preinvasive disease stages of esophageal carcinogenesis. Nature Genetics. 2014;46:837–843. doi: 10.1038/ng.3013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Wiseman EF, Chen X, Han N, Webber A, Ji Z, Sharrocks AD, Ang YS. Deregulation of the FOXM1 target gene network and its coregulatory partners in oesophageal adenocarcinoma. Molecular Cancer. 2015;14:69. doi: 10.1186/s12943-015-0339-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Yamamoto Y, Wang X, Bertrand D, Kern F, Zhang T, Duleba M, Srivastava S, Khor CC, Hu Y, Wilson LH, Blaszyk H, Rolshud D, Teh M, Liu J, Howitt BE, Vincent M, Crum CP, Nagarajan N, Ho KY, McKeon F, Xian W. Mutational spectrum of Barrett's stem cells suggests paths to initiation of a precancerous lesion. Nature Communications. 2016;7:10380. doi: 10.1038/ncomms10380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS). Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Zhang X, Choi PS, Francis JM, Gao GF, Campbell JD, Ramachandran A, Mitsuishi Y, Ha G, Shih J, Vazquez F, Tsherniak A, Taylor AM, Zhou J, Wu Z, Berger AC, Giannakis M, Hahn WC, Cherniack AD, Meyerson M. Somatic superenhancer duplications and hotspot mutations lead to oncogenic activation of the KLF5 transcription factor. Cancer Discovery. 2018;8:108–125. doi: 10.1158/2159-8290.CD-17-0532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Zhi X, Zhao D, Zhou Z, Liu R, Chen C. YAP promotes breast cell proliferation and survival partially through stabilizing the KLF5 transcription factor. The American Journal of Pathology. 2012;180:2452–2461. doi: 10.1016/j.ajpath.2012.02.025. [DOI] [PubMed] [Google Scholar]
  61. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nature Communications. 2019;10:1523. doi: 10.1038/s41467-019-09234-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Zhou B, Guo R. Genomic and regulatory characteristics of significant transcription factors in colorectal Cancer metastasis. Scientific Reports. 2018;8:17836. doi: 10.1038/s41598-018-36168-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Irwin Davidson
Reviewed by: Irwin Davidson

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

In this study, the Sharrocks laboratory used an integrated genomics approach to reveal a key role of the transcription factor KLF5 in esophageal adenocarcinoma. The results show that KLF5 genome occupancy is rewired during the transformation process to drive the expression of key cell cycle genes required for proliferation.

Decision letter after peer review:

Thank you for submitting your article "Repurposing of KLF5 activates a cell cycle signature during the progression from a precursor state to oesophageal cancer" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Irwin Davidson as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Kevin Struhl as the Senior Editor.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

As the editors have judged that your manuscript is of interest, but as described below that additional experiments are required before it is published, we would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). First, because many researchers have temporarily lost access to the labs, we will give authors as much time as they need to submit revised manuscripts. We are also offering, if you choose, to post the manuscript to bioRxiv (if it is not already there) along with this decision letter and a formal designation that the manuscript is 'in revision at eLife'. Please let us know if you would like to pursue this option. (If your work is more suitable for medRxiv, you will need to post the preprint yourself, as the mechanisms for us to do so are still in development.)

Summary:

This study uses integrated genomics approaches on both patient material and model cell lines to investigate changes in chromatin accessibility (ATAC-seq) and KLF5 genomic occupancy in Barrett's Esophagus (BO) compared to esophageal adenocarcinoma (OAC) in an attempt to explain the changes in gene expression that drive the transition from the pre-cancerous to the cancerous state. The results show changes in ATAC-seq accessibility and redistribution of KLF5 genomic occupancy, in particular around a collection of 97 genes that appear to be key KLF5 targets involved in the cell cycle and up-regulated in OAC. The study further shows that signaling by ERBB2, a gene frequently amplified during BO to OAC progression, partly regulates the redistribution of KLF5 to sites regulating the cell cycle genes.

While the reviewers were convinced by the data showing the redistribution of KLF5 genomic occupancy between the BO and OAC states and appreciated the integration of patient data with model cell lines, one of the major conclusions is that this redistribution participates actively in the transition between the two states. This is implied by the loss of function data, but in the opinion of the reviewers, gain of function showing that ERBB2 overexpression drives KLF5 redistribution and the OAC gene expression program is also necessary. Other revisions are also required that involve data analyses and changes to the text. We will be happy to receive a revised version of the manuscript once the revisions have been completed.

Essential revisions:

1) The authors show that siRNA-mediated silencing of ERBB2 results in loss of KLF5 binding (not shown directly but through ATAC-seq) at specific sites selectively occupied in OAC. In a complementary experiment, can the authors show that gain of ERBB2 expression in CP-A cells (or in another pertinent experimental system) enhances cell proliferation, promotes KLF5 redistribution and enhances expression of the key KLF5 cell cycle genes? Such gain of function experiments would greatly strengthen the conclusions from the loss of function experiments shown in the paper.

2) The authors should discuss a potentially opposing hypothesis for the role of KLF5 in OAC, that loss of KLF5 binding sites between CP-A and OE19 cells (Figure 4B) also has a direct role in gene expression changes and OAC phenotypes. Do these binding events correlate with genes with lower expression in OE19 cells (and OAC tissues relative to controls)? The focus on potential gain of function might occlude key gene regulatory processes resulting from a loss of KLF5 occupancy during the BO to OAC switch. Ultimately, this alternative hypothesis could detract from the overall impact of the findings.

3) The authors demonstrate that KLF5 and GATA6 knockdown affects a statistically enriched set of target genes in OE19 cells (Figure 4G), but this is only relevant if these same genes are NOT regulated by KLF5 and GATA6 in non-OAC cell lines (like the CP-A cells). If these genes are truly de novo targets in OAC, they should be KLF5/GATA6 independent in CP-A or other lines. Can the authors comment on this?

4) The authors should fully describe the RNA-seq results of the siERBB2 experiment. How many genes are de-regulated? If the genes in Figure 5D correspond to only a small subset of those de-regulated by ERBB2 silencing, then this would call their relevance into question and there is no indication of their statistical significance compared to the full repertoire of ERBB2 regulated genes.

eLife. 2020 Sep 3;9:e57189. doi: 10.7554/eLife.57189.sa2

Author response


[…] While the reviewers were convinced by the data showing the redistribution of KLF5 genomic occupancy between the BO and OAC states and appreciated the integration of patient data with model cell lines, one of the major conclusions is that this redistribution participates actively in the transition between the two states. This is implied by the loss of function data, but in the opinion of the reviewers, gain of function showing that ERBB2 overexpression drives KLF5 redistribution and the OAC gene expression program is also necessary. Other revisions are also required that involve data analyses and changes to the text. We will be happy to receive a revised version of the manuscript once the revisions have been completed.

In the original manuscript, we demonstrated that ERBB2 and KLF5 converge on the same signature of cell cycle genes that are more active in OAC. In our conclusions, we were careful to state that this was convergence rather than direct pathway intersection. We used this as further evidence to demonstrate the importance of the KLF5 regulon for gene regulation in OAC. However, the reviewers make a valid point that we should test whether the functional interaction is direct or indirect. We now demonstrate that overexpression of ERBB2 in a Barrett’s cell line (CP-A) is sufficient to drive growth factor-independent activation of a panel of cell cycle genes from the KLF5 regulon (new Figure 5—figure supplement 2C), consistent with its ability to drive growth factor-independent cell proliferation (Figure 5—figure supplement 2B). However, ERBB2 did not promote enhanced KLF5 binding to the regulatory regions of cell cycle genes in these cells (new Figure 5—figure supplement 2D). These results are consistent with ERBB2 and KLF5 converging on the same genes but with no direct link between ERBB2 overexpression and KLF5 redistribution. Importantly, we now also show that KLF5 does not regulate a panel of cell cycle genes in Barrett’s (CP-A) cells (new Figure 4—figure supplement 2E) which we previously showed to be regulated and bound by KLF5 in OAC cells. This provides further evidence for KLF5 specifically functioning in the context of OAC to drive cell cycle gene activation.

Thus, our original conclusions are still valid and we have now altered the Discussion to be more definitive about the lack of direct links between ERBB2 and KLF5 redistribution, the trigger for which still needs to be determined through future studies.

Essential revisions:

1) The authors show that siRNA-mediated silencing of ERBB2 results in loss of KLF5 binding (not shown directly but through ATAC-seq) at specific sites selectively occupied in OAC. In a complementary experiment, can the authors show that gain of ERBB2 expression in CP-A cells (or in another pertinent experimental system) enhances cell proliferation, promotes KLF5 redistribution and enhances expression of the key KLF5 cell cycle genes? Such gain of function experiments would greatly strengthen the conclusions from the loss of function experiments shown in the paper.

See the discussion above. We have demonstrated that ERBB2 is sufficient to drive growth factor-independent cell proliferation of Barrett’s (CP-A) cells (new Figure 5—figure supplement 2A and B) and enhances expression of a panel of KLF5 target genes in these cells (see new Figure 5—figure supplement 2C). This is fully consistent with our loss of function experiments performed in OAC cells. However, we were unable to detect increases in KLF5 occupancy at a panel of regulatory regions (new Figure 5—figure supplement 2D), indicating that ERBB2 alone is insufficient to trigger KLF5 redistribution. In the original paper we concluded that KLF5 and ERBB2 converge on the same set of genes but not necessarily through direct interaction. These new data have allowed us to reaffirm this conclusion and rule out a causative effect of ERBB2 on KLF5 redistribution. This is not surprising given the complexities of cancer cell rewiring, and future studies will be needed to investigate the underlying mechanism further. Note that our ATAC-seq data do point to a loss of KLF5 binding at a subset of regions following ERBB2 inhibition, but these are not relevant to the cell cycle phenotype we observe here.

2) The authors should discuss a potentially opposing hypothesis for the role of KLF5 in OAC, that loss of KLF5 binding sites between CP-A and OE19 cells (Figure 4B) also has a direct role in gene expression changes and OAC phenotypes. Do these binding events correlate with genes with lower expression in OE19 cells (and OAC tissues relative to controls)? The focus on potential gain of function might occlude key gene regulatory processes resulting from a loss of KLF5 occupancy during the BO to OAC switch. Ultimately, this alternative hypothesis could detract from the overall impact of the findings.

To address this issue, we examined the types of genes associated with the KLF5 regions that are lost in OAC cells. The resulting GO terms suggest a role in epithelial development, which may have been retained in CP-A cells from an earlier stage (new Figure 4—figure supplement 2B). We have added discussion of this to the paper and point out a potential role for this change (i.e. KLF5 binding loss) in the transition to OAC, potentially through loss of developmental identity. Further work will be needed to develop this hypothesis, which is outside the scope of the current study.

3) The authors demonstrate that KLF5 and GATA6 knockdown affects a statistically enriched set of target genes in OE19 cells (Figure 4G), but this is only relevant if these same genes are NOT regulated by KLF5 and GATA6 in non-OAC cell lines (like the CP-A cells). If these genes are truly de novo targets in OAC, they should be KLF5/GATA6 independent in CP-A or other lines. Can the authors comment on this?

To address this point, we depleted KLF5 in CP-A cells and tested a panel of genes that are dependent on KLF5 for their activity in OE19 cells. No significant decreases in target gene expression were observed (new Figure 4—figure supplement 2E), demonstrating that they are KLF5-independent in Barrett’s cells but become dependent on KLF5 activity in OAC-derived OE19 cells. We have not tested GATA6 dependency, as this is not central to this paper on the cell cycle functions of repurposed KLF5, and the contribution of GATA6 to the broader gene regulatory network warrants extensive further work that is beyond the scope of the current paper.

4) The authors should fully describe the RNA-seq results of the siERBB2 experiment. How many genes are de-regulated? If the genes in Figure 5D correspond to only a small subset of those de-regulated by ERBB2 silencing, then this would call their relevance into question and there is no indication of their statistical significance compared to the full repertoire of ERBB2 regulated genes.

We tested the significance of the overall overlap between KLF5-activated direct target genes (i.e. down with siKLF5) and ERBB2-activated genes (i.e. down with siERBB2 treatment) and show that this overlap is highly significant, whereas reciprocally regulated genes show a negligible, non-significant overlap (new Figure 5—figure supplement 1F). This reaffirms our conclusion that ERBB2 signalling and KLF5 converge on the same set of genes, which are largely involved in cell cycle control (note that in Figure 5D, we do not apply a fold-change cut-off for the effect of ERBB2, to better illustrate the directionality of change).
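As an illustration of how the significance of an overlap between two gene sets can be assessed, the sketch below computes an upper-tail hypergeometric probability. The response does not name the test that was used, so the choice of a hypergeometric test is an assumption, and the function name `overlap_p_value` and all gene counts in the example are hypothetical rather than taken from the study.

```python
from math import comb


def overlap_p_value(universe: int, set_a: int, set_b: int, overlap: int) -> float:
    """Probability of at least `overlap` shared genes between two random gene sets.

    universe: number of genes considered (hypothetical background size)
    set_a:    size of the first gene set (e.g. genes down with siKLF5)
    set_b:    size of the second gene set (e.g. genes down with siERBB2)
    overlap:  observed number of genes present in both sets
    """
    max_overlap = min(set_a, set_b)
    # Number of ways to draw set_b genes from the universe.
    total = comb(universe, set_b)
    # Sum the hypergeometric probabilities for overlaps >= the observed one.
    tail = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


# Hypothetical counts for illustration only (not values from the paper).
p = overlap_p_value(universe=20000, set_a=500, set_b=400, overlap=120)
print(f"p = {p:.3g}")
```

With these made-up counts, the overlap expected by chance is about 10 genes (500 × 400 / 20000), so an observed overlap of 120 yields a very small p-value, whereas an overlap close to 10 would not.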

Associated Data

    Data Citations

    1. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8447 [DOI] [PMC free article] [PubMed]
    2. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8446 [DOI] [PMC free article] [PubMed]
    3. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8568 [DOI] [PMC free article] [PubMed]
    4. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8584 [DOI] [PMC free article] [PubMed]
    5. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8576 [DOI] [PMC free article] [PubMed]
    6. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8579 [DOI] [PMC free article] [PubMed]
    7. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. EGA. EGAD00001005915 [DOI] [PMC free article] [PubMed]
    8. Rogerson C, Ogden S, Britton E, The OCCAMS Consortium. Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8994 [DOI] [PMC free article] [PubMed]
    9. Britton E, Rogerson C, Mehta S, Li Y, Li X, The OCCAMS Consortium. Fitzgerald RC, Ang YS, Sharrocks AD. 2017. Open chromatin profiling identifies AP1 as a transcriptional regulator in oesophageal adenocarcinoma. ArrayExpress. E-MTAB-5169 [DOI] [PMC free article] [PubMed]
    10. Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6751 [DOI] [PMC free article] [PubMed]
    11. Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6756 [DOI] [PMC free article] [PubMed]
    12. Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6758 [DOI] [PMC free article] [PubMed]
    13. Maag JLV, Fisher OM, Levert-Mignon A, Kaczorowski DC, Thomas ML, Hussey DJ, Watson DI, Wettstein A, Bobryshev YV, Edwards M, Dinger ME, Lord RV. 2017. Novel Aberrations Uncovered in Barrett's Esophagus and Esophageal Adenocarcinoma Using Whole Transcriptome Sequencing. ArrayExpress. E-MTAB-4054 [DOI] [PubMed]
    14. Corces MR, Granja JM, Shams S, Louie BH, Seoane JA, Zhou W, Silva TC, Groeneveld C, Wong CK, Cho SW, Satpathy AT, Mumbach MR, Hoadley KA, Robertson AG, Sheffield NC, Felau I, Castro MAA, Berman BP, Staudt LM, Zenklusen JC, Laird PW, Curtis C, The TCGA Network. Greenleaf WJ, Chang HY. 2018. The chromatin accessibility landscape of primary human cancers. GDC Data Portal. TCGA-ESCA [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Source code 1. ATAC fragment size visualisation.
    Supplementary file 1. Differentially expressed genes in OAC.

    Significantly (±1.5 x; Q-value <0.05) differentially expressed genes between BO (n = 13) and OAC (n = 12) (Maag et al., 2017).

    elife-57189-supp1.xlsx (104.8KB, xlsx)
    Supplementary file 2. Differentially accessible regions within ±250 kb of TSS of a DEG.

    (A) Total accessible regions from BO (n = 4) and OAC (n = 6) samples. (B) Significant differentially accessible open regions (+2x; Q-value <0.1). (C) Significant differentially accessible closed regions (−2x; Q-value <0.1).

    elife-57189-supp2.xlsx (3.9MB, xlsx)
    Supplementary file 3. DNA motifs enriched in OAC-specific open chromatin regions.

    Top ten motifs found by de novo motif discovery and their associated transcription factors that are enriched in ‘open in OAC’ (top) or ‘closed in OAC’ (bottom).

    elife-57189-supp3.xlsx (12.9KB, xlsx)
    Supplementary file 4. siKLF5 RNA-seq analysis.

    Significant differentially expressed genes with siKLF5 treatment (±1.3 x, Q-value <0.05)

    elife-57189-supp4.xlsx (499.5KB, xlsx)
    Supplementary file 5. KLF5 ChIP-seq datasets.

    (A) ChIP-seq peaks in OE19 cells. (B) ChIP-seq peaks in CP-A cells. (C) Differentially bound KLF5 ChIP-seq peaks (CP-A vs OE19).

    elife-57189-supp5.xlsx (2.7MB, xlsx)
    Supplementary file 6. De novo analysis of DNA motif enrichment in KLF5 ChIP-seq peak datasets.
    elife-57189-supp6.xlsx (13.9KB, xlsx)
    Supplementary file 7. (A) Frequency of KLF5, GATA1, FOXA2, FRA1 and TCF7L2 motifs within OE19-specific KLF5 ChIP-seq regions. 1 denotes present and 0 absent. (B) Overlaps of motifs and the basis of Figure 4G (A. KLF5; B. GATA1; C. FOXA2; D. FRA1; E. TCF7L2).
    elife-57189-supp7.xlsx (31.1KB, xlsx)
    Supplementary file 8. DNA motifs enriched in cluster 1 and cluster 2 regions.

    Top 10 motifs found by de novo motif discovery and their associated transcription factors that are enriched in cluster 1 (top) or cluster 2 (bottom).

    elife-57189-supp8.xlsx (13.3KB, xlsx)
    Supplementary file 9. Genomic coordinates of regions in OE19 cells that show a decrease in ATAC-seq signal after 72 hr of siERBB2 treatment.
    elife-57189-supp9.xlsx (27.1KB, xlsx)
    Supplementary file 10. De novo discovered motifs from regions that exhibit reduced chromatin accessibility after 72 hr of siERBB2 treatment.

    De novo motifs, % of targets, % of background, and the called transcription factor with its match score and p-value are shown.

    elife-57189-supp10.xlsx (13.9KB, xlsx)
    Supplementary file 11. List of PCR primers used in RT-qPCR and ChIP-qPCR experiments.
    Transparent reporting form

    Data Availability Statement

    All sequencing data are deposited in ArrayExpress. Additional OAC ATAC-seq data are available at E-MTAB-8447 and additional BO and OAC RNA-seq data are available at E-MTAB-8584. siKLF5 RNA-seq data are available at E-MTAB-8446. KLF5 ChIP-seq data are available at E-MTAB-8568. siERBB2 ATAC-seq and RNA-seq data are available at E-MTAB-8576 and E-MTAB-8579, respectively.

    The following datasets were generated:

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8447

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8446

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8568

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8584

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8576

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8579

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. EGA. EGAD00001005915

    Rogerson C, Ogden S, Britton E, The OCCAMS Consortium, Yeng A, Sharrocks AD. 2020. Repurposing of KLF5 activates a cell cycle signature during the progression to Oesophageal Adenocarcinoma. ArrayExpress. E-MTAB-8994

    The following previously published datasets were used:

    Britton E, Rogerson C, Mehta S, Li Y, Li X, The OCCAMS Consortium, Fitzgerald RC, Ang YS, Sharrocks AD. 2017. Open chromatin profiling identifies AP1 as a transcriptional regulator in oesophageal adenocarcinoma. ArrayExpress. E-MTAB-5169

    Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6751

    Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6756

    Rogerson C, Britton E, Withey S, Hanley N, Ang Y, Sharrocks AD. 2019. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its pre-cancerous precursor state. ArrayExpress. E-MTAB-6758

    Maag JLV, Fisher OM, Levert-Mignon A, Kaczorowski DC, Thomas ML, Hussey DJ, Watson DI, Wettstein A, Bobryshev YV, Edwards M, Dinger ME, Lord RV. 2017. Novel Aberrations Uncovered in Barrett's Esophagus and Esophageal Adenocarcinoma Using Whole Transcriptome Sequencing. ArrayExpress. E-MTAB-4054

    Corces MR, Granja JM, Shams S, Louie BH, Seoane JA, Zhou W, Silva TC, Groeneveld C, Wong CK, Cho SW, Satpathy AT, Mumbach MR, Hoadley KA, Robertson AG, Sheffield NC, Felau I, Castro MAA, Berman BP, Staudt LM, Zenklusen JC, Laird PW, Curtis C, The TCGA Network, Greenleaf WJ, Chang HY. 2018. The chromatin accessibility landscape of primary human cancers. GDC Data Portal. TCGA-ESCA


