Abstract
NKX2-1, encoding a homeobox transcription factor, is amplified in approximately 15% of non-small cell lung cancers (NSCLC), where it is thought to drive cancer cell proliferation and survival. However, its mechanism of action remains largely unknown. To identify relevant downstream transcriptional targets, here we carried out a combined NKX2-1 transcriptome (NKX2-1 knockdown followed by RNAseq) and cistrome (NKX2-1 binding sites by ChIPseq) analysis in four NKX2-1-amplified human NSCLC cell lines. While NKX2-1 regulated genes differed among the four cell lines assayed, cell proliferation emerged as a common theme. Moreover, in 3 of the 4 cell lines, epidermal growth factor receptor (EGFR) was among the top NKX2-1 upregulated targets, which we confirmed at the protein level by western blot. Interestingly, EGFR knockdown led to upregulation of NKX2-1, suggesting a negative feedback loop. Consistent with this finding, combined knockdown of NKX2-1 and EGFR in NCI-H1819 lung cancer cells reduced cell proliferation (as well as MAP-kinase and PI3-kinase signaling) more than knockdown of either alone. Likewise, NKX2-1 knockdown enhanced the growth-inhibitory effect of the EGFR-inhibitor erlotinib. Taken together, our findings implicate EGFR as a downstream effector of NKX2-1 in NKX2-1 amplified NSCLC, with possible clinical implications, and provide a rich dataset for investigating additional mediators of NKX2-1 driven oncogenesis.
Introduction
Lung cancer accounts for the largest number of cancer-related deaths in the United States [1]. There are two major classes, small cell lung cancer and non-small cell lung cancer (NSCLC), the latter representing about 85% of cases, and including adenocarcinoma, squamous cell carcinoma, and large cell carcinoma histologies [2]. Among NSCLCs, recognized cancer drivers include activating mutations in EGFR, KRAS, BRAF and ERBB2 (HER2), as well as rearrangements of ALK and ROS1 [3]. Some of these, only recently identified, are now valuable therapeutic targets, underscoring the importance of defining new molecular targets and mechanisms.
NKX2-1 (also called TITF1 and TTF-1) encodes a homeobox transcription factor, and is found amplified at cytoband 14q13 in about 15% of NSCLCs (predominantly adenocarcinomas) [4, 5]. Understanding its mechanisms may provide new insight into lung carcinogenesis, and new strategies for therapy. NKX2-1 is expressed in the normal developing lung (as well as thyroid and forebrain) [6], where it is essential for organogenesis. In the adult lung, NKX2-1 expression is restricted to club cells (Clara) and type II pneumocytes, where it regulates surfactant production.
For years, NKX2-1 expression has been used by pathologists to deduce a lung origin of carcinomas [7]. Recently, NKX2-1 has been linked more directly to lung cancer, where the gene locus is amplified in some cases, leading to enhanced lung cancer cell proliferation and survival [8–11]. While its dual roles in driving lung development and differentiation on the one hand, and lung cancer (often viewed as de-differentiation) on the other seem paradoxical, NKX2-1 fits well into an emerging class of “lineage-survival” oncogenes—often master transcriptional regulators of normal cell lineage that become deregulated in cancers derived from that lineage [12]. Other examples include androgen receptor (AR) in prostate cancer, and MITF in melanoma.
Recent studies have identified candidate downstream mediators and collaborators of NKX2-1 oncogenesis, including ROR1 [13] and LMO3 [14]. Nonetheless, the mechanisms by which NKX2-1 contributes to lung carcinogenesis remain largely unknown. Indeed, in some contexts (mainly in the mouse), Nkx2-1 seems to function more as a tumor suppressor, inhibiting Kras-driven lung cancer and lung cancer metastasis [15, 16]. Here, to uncover oncogenic mechanisms in human lung cancer, we carried out a combined transcriptome (NKX2-1 knockdown followed by RNAseq) and cistrome (NKX2-1 binding sites by ChIP-seq) analysis in NKX2-1 amplified NSCLC cell lines. Among our findings, we identify EGFR as a downstream effector of NKX2-1 driven cell proliferation. Our results provide new insight into the mechanisms of NKX2-1 mediated lung cancer, and a dataset for continued exploration.
Materials and Methods
Cell culture
NCI-H1819, NCI-H661, HCC1195 and HCC1833 cell lines were obtained from Dr. John Minna (University of Texas Southwestern Medical Center) [17–20], where they were authenticated by short tandem repeat analysis. All cell lines were grown at 37°C in RPMI-1640 medium with L-glutamate, supplemented with 10% (vol/vol) fetal bovine serum and 1X Pen/Strep.
NKX2-1 isoform expression
A full-length NKX2-1 cDNA clone (in pOTB7) was obtained from Origene. DNA constructs corresponding to NKX2-1 transcript variant 1 (NM_001079668.2; encoding 401 amino acids) and NKX2-1 transcript variant 2 (NM_003317; encoding 371 amino acids, absent the N-terminal 30 amino acids of isoform 1) were PCR-amplified from the Origene plasmid template, sequence-verified, and then subcloned into the BamHI and XhoI sites of pCDNA6 (Invitrogen). PCR primers were: variant 1-F TCGAGGATCCCATGTGGTCCGGAGGCAG; variant 2-F TCGAGGATCCCATGTCGATGAGTCCAAAGCAC; variant 1/2-R GATCCTCGAGTCACCAGGTCCGACCG. Expression constructs were transfected into 293T cells (American Type Culture Collection) using Lipofectamine 2000 (Life Technologies) following the manufacturer’s protocol, and cell lysates collected 48 hrs later.
siRNA transfection
For siRNA transfection, 25,000–100,000 cells per 6-well plate well or 750,000–1,500,000 cells per 10cm plate were seeded and transfected using Lipofectamine 2000 (Life Technologies) following the manufacturer’s protocol. All siRNAs were On-TARGETplus pools purchased from Dharmacon/GE Healthcare (Table A in S1 File) and transfected at a final siRNA concentration of 50nM (unless otherwise specified) for 16 hr.
Western blot
Cells were lysed in RIPA buffer (Millipore) supplemented with 1mM sodium orthovanadate, 5mM NaF, 1mM PMSF, and 1X protease inhibitor cocktail (Roche). Then 40ug lysate was run on a 4–12% polyacrylamide gel (Biorad) and transferred to PVDF membrane (Biorad). Primary antibodies used were NKX2-1 (Santa Cruz Biotechnology, H-190, 1:200), EGFR (Cell Signaling, D38B1, 1:1000), MAPK (Erk1/2) (Cell Signaling, 137F5, 1:1000), p-MAPK (Erk1/2) (Thr202/Tyr204) (Cell Signaling, D12.14.4E, 1:1000), AKT (Cell Signaling, C67E7, 1:1000), p-AKT (Ser473) (Cell Signaling, D9E, 1:1000), and α-tubulin (Santa Cruz Biotechnology, 1:1000). All secondary antibodies were HRP-conjugated and SuperSignal West Pico Chemiluminescence (Pierce) used for detection. Western blots were quantified using ImageJ. All western blot results are representative of multiple independent experiments.
Cell proliferation/viability assay
Cell proliferation/viability was quantified 0, 24, 48, 72 and 96 hrs post-transfection by colorimetric assay based on the metabolic cleavage of the tetrazolium salt WST-1 in viable cells, according to the manufacturer protocol (Roche). In some experiments, erlotinib (Santa Cruz Biotechnology) was added at indicated concentrations (or vehicle control). IC50 values were determined by fitting a nonlinear log (inhibitor) versus response curve using GraphPad Prism.
RNAseq
Cell lines were transfected with either a NKX2-1-targeting siRNA pool or a non-targeting control (NTC) siRNA pool for 16 hrs. Total protein (to verify NKX2-1 knockdown) and RNA (by Qiagen RNAeasy Mini) were collected 40 hrs post-transfection. qRT-PCR was done first to verify reduced transcript for NKX2-1, as well as a known NKX2-1 regulated gene, SFTPB [21]. For q-RT-PCR, cDNA was made using SuperScript II (Life Technologies), and then qPCR was done in at least triplicate using TaqMan reagents (Applied Biosystems) on an ABS Fast 7500 instrument. Relative transcript levels were calculated by the ΔCT method, and normalized to GAPDH. Primers used are listed in Table B in S1 File. Barcoded cDNA libraries were then prepared from total RNA using the Illumina TruSeq RNAseq kit, and sequenced (single-end 36-bp reads) on an Illumina HiSeq instrument to a depth of 24–63 million reads per sample (Stanford Center for Genomics and Personalized Medicine) (Table C in S1 File). Sequence reads were mapped to the RefSeq transcriptome and transcripts quantified as reads per kilobase per million reads (RPKM) using DNAnexus. Subsequent analysis was confined to adequately expressed transcripts, defined as RPKM ≥1 in both the NKX2-1 and control knockdowns. Gene Set Enrichment Analysis was done using the desktop package [22]. The complete dataset of raw RNAseq reads is available at the NCBI Sequence Read Archive (Accession SRP045118).
Q-RT-PCR
For select genes, RNAseq results were subsequently validated by Q-RT-PCR. Reverse transcription was performed using SuperScript II reagents as per the manufacturer’s protocol (Life Technologies). qPCR was done in at least triplicate using TaqMan or SYBR Green reagents (Applied Biosystems) on an ABS Fast 7600 instrument. Relative transcript levels were calculated by the ΔCT method, and normalized to GAPDH. Primers used are listed in Table B in S1 File.
Chromatin immunoprecipitation (ChIP) seq
Five million cells per immunoprecipitation (IP) or control input sample were cross-linked in 1% formaldehyde at 25°C for 10 min, washed in ice cold PBS, lysed in cell lysis buffer (5mM PIPES (pH8), 85mM KCl, 1% Igepal, 1X Roche Complete Mini Protease Inhibitor Cocktail) at 4°C for 15 min, mechanically homogenized with 25 strokes of a dounce homogenizer, lysed in nuclear lysis buffer (50mM Tris (pH8), 10mM EDTA (pH8), 1% (w/v) SDS, 1X Roche Complete Mini Protease Inhibitor Cocktail) 30 min and sonicated with a Bioruptor XL to obtain 200-600bp chromatin fragments. Sonicated fragments were diluted 10-fold in IP buffer (20mM Tris (pH8), 2mM EDTA (pH8), 2% Triton X-100, 0.2% Sodium Deoxycholate, 200mM NaCl) and incubated overnight with a 1:1 mix of Protein A/Protein G beads blocked (5mg/mL BSA, 1X Roche Complete Mini Protease Inhibitor Cocktail) and pre-incubated with 5μg anti-NKX2-1 antibody (Santa Cruz Biotechnology, H-190). For ChIP-PCR experiments, rabbit IgG (Santa Cruz Biotechnology, sc-2027) was used as a negative control. Samples were washed 3x in IP wash buffer (50mM Tris (pH8), 150mM NaCl, 0.05% Triton X-100), once in TE buffer, and eluted from the beads into IP Elution buffer (1% SDS, 100mM NaHCO3) at 65°C for 1 hr. Crosslinks were reversed by incubation at 65°C for 16 hrs in modified elution buffer (1% SDS, 100mM NaHCO3, 200mM NaCl). ChIP’ed DNA was purified using a Qiagen PCR clean up kit. To first validate ChIP antibody specificity, qPCR was performed with custom TaqMan probes designed to a known NKX2-1 binding site (SFTPB promoter) [21], in comparison to a negative control (an irrelevant gene, WNT5A), and normalized to GAPDH. Primers used are listed in Table B in S1 File. Barcoded ChIP and input DNA libraries were then prepared using the Illumina TruSeq ChIPseq kit, and then sequenced (single-end 36-bp reads) on an Illumina HiSeq instrument to a depth of 18–60 million reads per sample (Stanford Center for Genomics and Personalized Medicine) (Table C in S1 File). Sequence reads were mapped to the human genome (NCBI36, hg18) and significant binding peaks called using DNAnexus. Binding peaks were annotated to the nearest gene (within 100kb) using a custom Perl script. MEME [23] was used for de novo motif scanning of NKX2-1 binding peaks (100bp centered on the top 500 binding peaks associated with genes). Enrichment of canonical pathways was evaluated by the Molecular Signatures Database [24] “compute overlap” function. The complete dataset of raw ChIPseq reads is available at the NCBI Sequence Read Archive (Accession SRP045118).
Integration of TCGA data
Processed TCGA lung adenocarcinoma [25] (n = 488) data, including DNA copy number (Affymetrix SNP 6.0) and transcript levels (RNAseq), were accessed from the Broad Firehose pipeline. Cases with NKX2-1 amplification were defined by NKX2-1 tumor/normal ratios > 1.5, and absence of amplification by tumor/normal <1.1. NKX2-1 high-expression was defined as the top 25 percentile of specimens. Genes differentially expressed between NKX2-1 amplified/highly-expressed (n = 30) and non-amplified/highly-expressed (n = 60) were identified by two-class SAM analysis [26], using a false discovery rate (FDR) <0.05.
Results
RNAseq analysis of the NKX2-1 regulated transcriptome
To investigate mechanisms of NKX2-1 mediated oncogenesis, we first sought to identify NKX2-1 regulated genes in NKX2-1 amplified NSCLC cell lines. We previously characterized two NSCLC cell lines, HCC1195 and HCC1833, that harbor NKX2-1 amplification and are dependent on NKX2-1 (based on knockdown studies) for cell proliferation [10]. Here, we surveyed additional NSCLC lines [19, 27], and identified two more NKX2-1 amplified and dependent lines, H1819 and H661, for a total of four cell lines (Fig 1). Three of the four cell lines derive from lung adenocarcinomas (H1819, HCC1195, HCC1833) while the fourth (H661) is of large cell carcinoma origin. Analysis of two lines (HCC1195 and HCC1833) indicated that the shorter of two reported NKX2-1 isoforms (with variant N-termini) [6] was the one predominantly expressed (Figure A in S2 File).
To identify NKX2-1 regulated genes in each of the four cell lines, we transfected the cells with an On-TARGETplus siRNA pool to knock down NKX2-1, or a non-targeting control (NTC) siRNA pool. (Because we had previously shown that independent siRNAs targeting NKX2-1 exhibited NKX2-1 knockdown and reduced cell proliferation comparable to the NKX2-1 siRNA pool [10] (Figure B, Panels A, B and C in S2 File), most experiments were done using the siRNA pool, with subsequent validation of select key findings using independent siRNAs.) We then assayed resultant transcriptome changes by RNAseq, comparing control and NKX2-1 knockdown cells. Transcriptomes were assayed 40 hrs post siRNA-transfection, a time point at which we observed knockdown of both the NKX2-1 transcript and protein, as well as reduced transcript of surfactant protein B (SFTPB), a known NKX2-1 regulated target [21] (Fig 2A).
Confining the analysis to well-measured genes (RPKM ≥1), approximately 10,000 genes were expressed in each of the cell lines, with 9,041 genes common across all four cell lines (Figure C, Panel A in S2 File, and Table D in S1 File). The number of genes substantially altered by NKX2-1 knockdown, defined as at least 25% decreased or increased expression, ranged from 1,160 to 1,615 among the four cell lines (Figure C, Panel B in S2 File, and Table E in S1 File). Despite this, few genes by that criterion were consistently altered across all four cell lines. Upon NKX2-1 knockdown, only 4 genes were consistently downregulated (NBL1, UNC84B, RRBP1, and FA2H), and only 9 genes were consistently upregulated (C7ORF60, CHD7, HIST1H2BC, HIST1H2BD, HIST1H2BE, ID3, PIK3CB, TP53INP1, ZBTB4) (Fig 2B). These genes have diverse cellular roles (Table F in S1 File), among which RRBP1 functions in endoplasmic reticulum stress response and has been found upregulated in lung cancer [28], and overexpression of the ID3 transcriptional repressor has been reported to reduce lung cancer growth in vivo [29].
To explore themes rather than individual genes, we also carried out a two-class gene set enrichment analysis (GSEA) [24], comparing transcript levels (across all four cell lines) following NKX2-1 knockdown vs. non-targeting control. The top enriched conical pathways included several relating to cell-cycle and cell proliferation (Fig 2C, and Table G in S1 File), consistent with the known role of NKX2-1 in cell proliferation [10, 30]. GSEA assessment of each cell line individually also showed enrichment of biology relating to cell proliferation (Table H in S1 File).
Although few genes met the stringent threshold for altered expression across all four cell lines (only 4 consistently downregulated and 9 upregulated genes), we investigated whether more genes might show a similar trend. Examining the top 100 downregulated and upregulated genes in each cell line (Table I in S1 File), a heatmap visualization (Fig 2D) revealed shared expression patterns among 3 of the 4 cell lines (H1819, HCC1195 and H661), particularly among genes downregulated with NKX2-1 knockdown. This finding, substantiated by correlation analysis (Fig 2D), suggested that these three cell lines might best model NKX2-1 amplified NSCLC.
Focusing on the three cell lines (H1819, HCC1195 and H661) with shared expression patterns, 69 genes were consistently downregulated (at least 25%) upon NKX2-1 knockdown (Fig 2B, and Table J in S1 File), indicating that NKX2-1 positively regulates their expression. Although several were of possible biologic interest, notable among them was epidermal growth factor receptor (EGFR), ranked respectively 8th (of 763), 68th (of 504), and 18th (of 708) among genes downregulated upon NKX2-1 knockdown in the three cell lines. Reduced expression of EGFR, as well as other select genes (to substantiate RNAseq results), was verified by Q-RT-PCR (Figure C, Panel C in S2 File). Two independent siRNAs targeting NKX2-1 each also reduced EGFR transcript levels (by Q-RT-PCR; Figure B, Panel D in S2 File) and protein levels (by western blot; Figure B, Panel E in S2 File), supporting EGFR reduction to be an on-target RNA interference phenotype; other candidate NKX2-1 regulated genes were not similarly evaluated. We had not previously succeeded in rescue of NKX2-1 knockdown by ectopic expression of NKX2-1 (unpublished), possibly due to non-optimal NKX2-1 expression levels. Therefore, we did not attempt to further verify on-target RNA interference by rescuing the NKX2-1 knockdown effect on EGFR levels.
ChIPseq analysis of the NKX2-1 cistrome
While the above strategy (NKX2-1 knockdown coupled with RNAseq) revealed NKX2-1 regulated genes, it did not distinguish direct from indirect NKX2-1 transcriptional targets. Therefore, in parallel, we carried out chromatin immunoprecipitation (ChIP)-seq to define NKX2-1 genome binding sites and associated genes in the same four NKX2-1 amplified/dependent NSCLC cell lines. ChIP-PCR of the SFTB promoter was done prior to ChIP-seq to validate the specificity of the anti-NKX2-1 antibody for ChIP (Fig 3A).
In the four cell lines, ChIPseq identified 150–1,844 NKX2-1 binding site-associated genes (nearest gene within 100Kb) (Fig 3B, and Table K in S1 File). Most binding sites occurred within 100Kb of genes, as would be expected for promoters or enhancers (Figure D, Panel A in S2 File). De novo motif analysis recovered the known Nkx2 binding site (CACTY) [31] (Fig 3C), validating the ChIPseq data quality. In addition, motif analysis of NKX2-1 binding peaks identified nearby enrichment of the FOXA1 motif in HCC1833 cells, consistent with the reported interaction of NKX2-1 with FOXA1 [32]. Other NKX2-1 binding site associated transcription factor motifs in HCC1833 included MYOD1, NFE2 and TCF3 (Fig 3C).
Despite the large number of NKX2-1 binding sites, only a small subset (2 associated genes: PVT1 and ZC3H7A) was identified across all four NKX2-1 amplified cell lines (Fig 3B), mirroring the findings of the transcriptome analysis. Of note, NKX2-1 binding peaks were found associated with EGFR in all three cell lines (H1819, H661 and HCC1195) sharing similar transcriptional responses to NKX2-1 knockdown (Table K in S1 file, Fig 3D, and Figure D, Panel B in S2 File). NKX2-1 binding site motifs could be identified within the ChIPseq binding peaks at EGFR (Figure D, Panel B in S2 File). When considered across all cell lines together, genes nearest (and within 100Kb) to NKX2-1 binding sites were enriched for specific canonical pathways, including MAPK signaling, axon guidance, focal adhesion, and cell-cell communication (Table L in S1 File), possibly reflecting NKX2-1’s dual roles in cell proliferation and epithelial differentiation.
Integrative -omic analysis identifies EGFR as downstream effector of amplified NKX2-1
To identify direct transcriptional targets of NKX2-1, we integrated the above RNAseq and ChIPseq datasets. Among the four different cell lines, 2–13% of genes whose expression changed ≥25% upon NKX2-1 knockdown also had nearby (within 100Kb) NKX2-1 binding peaks (Tables M and N in S1 File, and Figure E in S2 File). Conversely, 6–17% of genes associated with NKX2-1 binding sites exhibited substantial expression changes (≥25%) following NKX2-1 knockdown. To further whittle down the list of interesting candidates, and to ensure relevance to primary NSCLC samples, we cross-compared our list of genes to transcriptome data of 488 primary lung adenocarcinomas from The Cancer Genome Atlas (TCGA) [25]. In particular, we identified the subset of NKX2-1 upregulated presumptive direct targets whose expression was significantly higher (by two-class SAM analysis) in TCGA lung adenocarcinomas with NKX2-1 amplification/overexpression (n = 20), compared to adenocarcinomas without amplification (but still expressing NKX2-1 in a non-amplified context) (n = 60) (Table 1). Notable among the remaining candidates was EGFR, which in summary was downregulated with NKX2-1 knockdown in 3 of 4 cell lines, was associated with a called ChIP peak in the same three cell lines, and was significantly overexpressed (FDR = 0.054) in primary NSCLCs with NKX2-1 amplification/overexpression. Decidedly, our subsequent follow-up experiments focused on EGFR.
Table 1. Integration of RNAseq, ChIPseq and TCGA data.
Cell line | Gene | RNAseq (siNKX2-1/siNTC) | ChIPseq peak rank | TCGA FDR a (%) |
---|---|---|---|---|
H1819 | EGFR | 0.349 | 84 | 5.4 |
EFR3B | 0.716 | 123 | 1.1 | |
H661 | EGFR | 0.397 | 1638 | 5.4 |
PAX9 | 0.632 | 1400 | 0.3 | |
MRPL42 | 0.644 | 1922 | 0.3 | |
TSEN15 | 0.644 | 2163 | 3.0 | |
RAI14 | 0.649 | 1989 | 9.2 | |
MAP2K6 | 0.667 | 2523 | 0.3 | |
LOC654342 | 0.696 | 2906 | 3.6 | |
GNG4 | 0.696 | 1070 | 5.4 | |
MED30 | 0.714 | 1551 | 0.8 | |
RALA | 0.722 | 3122 | 1.1 | |
MCTP1 | 0.733 | 261 | 7.8 | |
ARL4A | 0.750 | 401 | 9.2 | |
HCC1195 | CCL20 | 0.465 | 908 | 7.8 |
CNGA3 | 0.478 | 1542 | 7.8 | |
EGFR | 0.568 | 1388 | 5.4 | |
TREM1 | 0.571 | 1350 | 6.4 | |
MMP13 | 0.573 | 1367 | 9.2 | |
ETV5 | 0.600 | 437 | 6.4 | |
PENK | 0.677 | 1564 | 9.2 | |
IL1RAP | 0.683 | 1330 | 5.4 | |
NKX2-8 | 0.691 | 978 | 2.6 | |
ABCA4 | 0.691 | 328 | 5.4 | |
NAMPT | 0.709 | 1176 | 1.5 | |
HMGA2 | 0.712 | 1148 | 4.4 | |
ASPHD2 | 0.714 | 1199 | 1.5 | |
C5orf4 | 0.739 | 347 | 9.2 | |
EFR3B | 0.744 | 537 | 1.1 | |
LPIN1 | 0.745 | 698 | 5.4 | |
FAM101A | 0.750 | 275 | 1.1 | |
HCC1833 | NGF | 0.513 | 7 | 4.4 |
CCDC14 | 0.622 | 202 | 0.3 | |
HMP19 | 0.654 | 216 | 7.8 | |
HMGA2 | 0.676 | 193 | 4.4 | |
BCL2 | 0.706 | 95 | 0.6 | |
ESPL1 | 0.741 | 184 | 0.0 |
aAmplified/highly-expressed vs. non-amplified/highly-expressed (FDR<10%)
EGFR is moderately amplified in H1819 cells, and is wildtype (i.e. no activating mutations) in all four lines [33–36]. EGFR protein expression varies, but was highest in H1819 (Figure F, Panel A in S2 File). Consistent with the RNAseq results, in all three cell lines (H1819, HCC1195 and H661) where NKX2-1 knockdown led to reduced EGFR transcript, NKX2-1 knockdown also led to a 40–90% reduction of EGFR protein, assessed by western blot (Fig 4A). Consistent with EGFR possibly functioning as a downstream effector of NKX2-1, knock down of EGFR (using 200nM siRNA [37]) led to a comparable reduction in cell proliferation, measured by WST-1 assay (Fig 4B). Unexpectedly, knockdown of EGFR led to increased protein levels of NKX2-1 (Fig 4C), suggesting negative feedback regulation. Two independent siRNAs targeting EGFR (Figure B, Panels F and G in S2 File) each also increased NKX2-1 protein levels (by western blot; Figure B, Panel H in S2 File), supporting an on-target RNA interference phenotype. Interestingly, EGFR knockdown did not alter NKX2-1 transcript levels (Figure B, Panel I in S2 File), suggesting that the increased NKX2-1 protein levels observed were likely the consequence of post-transcriptional regulation.
Consistent with a negative feedback loop, knock down of NKX2-1 together with EGFR reduced H1819 cell proliferation significantly more than knockdown of either alone (Fig 5A). (The combined knockdown could not be tested in HCC1195 and H661 cells, because the higher siRNA concentration required to knockdown EGFR was toxic to those cells.) Furthermore, combined NKX2-1 and EGFR knockdown reduced MAP-kinase (p-MAPK levels) and PI3-kinase signaling (p-AKT levels)—known growth/survival signaling pathways downstream of EGFR [38]—more than knockdown of either alone (Fig 5B).
The above results suggested that NKX2-1 knockdown might collaborate with small molecule inhibition of EGFR kinase. To test this, we challenged H1819 cells with NKX2-1 knockdown (or non-targeting control) to a ~50% growth inhibitory concentration (1.0 μM; Figure F, Panel B in S2 File) of the EGFR inhibitor erlotinib. Notably, NKX2-1 knockdown significantly enhanced the growth inhibitory effect of erlotinib (Fig 5C) on the NKX2-1 amplified NSCLC cells.
Discussion
NKX2-1 is a transcription factor, so its oncogenic effects when amplified are presumed to be mediated through transcriptional regulation of key downstream targets. To identify those targets, here we performed a combined transcriptome (NKX2-1 knockdown followed by RNAseq) and cistrome (NKX2-1 bound genes by ChIPseq) analysis in four NSCLC cell lines exhibiting NKX2-1 amplification and growth-dependency. Results were further integrated with TCGA genome and transcriptome data from primary NSCLC cases, from which we identified and further studied EGFR as a downstream target.
An unexpected finding from the knockdown/RNAseq experiments was the relatively small overlap of substantially up/downregulated genes across the cell lines. This likely reflects the heterogeneity among different patient’s tumors, where distinct cell types of origin, differentiation patterns, and/or tumor (epi)genetic alterations may influence the landscape of genes regulated by NKX2-1. Despite this, 3 of the 4 cell lines shared meaningful gene expression patterns, exhibited enrichment for select processes (e.g. cell proliferation), and nominated specific genes (EGFR).
The combined analysis of NKX2-1 binding sites further identified the subset of likely direct NKX2-1 transcriptional targets (though we note that indirect transcriptional targets may also have important biological functions). Finally, the integration with TCGA data highlighted those direct targets with likely relevance to actual primary tumors. While EGFR emerged as the most interesting candidate, other genes with intriguing connections to lung cancer biology, including HMGA2, BCL2, and others (Table 1) [15, 39], will be the focus of future studies. Also notable was the absence of ROR1 and LMO3, both nominated from prior cistrome and/or tumor transcriptome studies [13, 14]. In our dataset, ROR1 displayed increased expression following NKX2-1 knockdown in one cell line (HCC1833), and was associated with a ChIP peak in two cell lines (H661 and H1195). LMO3 showed decreased expression in one cell line (HCC1195), and was associated with a ChIP peak in two cell lines (H1819 and HCC1195), and thus also did not meet our prioritization criteria.
From our integrative -omic analysis, EGFR emerged as a top candidate with obvious biological and clinical interest. EGFR is amplified and/or mutated in at least 10–15% of NSCLC (predominantly adenocarcimomas) [3], and those activating alterations predict clinical response to EGFR inhibitors including erlotinib and gefitinib [40]. EGFR mutations tend to occur in lung adenocarcinomas from female non-smokers [41]. In contrast, KRAS mutations, despite ostensibly functioning just downstream of EGFR, tend to occur in adenocarcinomas with mucinous histology from male smokers [3, 41]. Our findings suggest that NKX2-1 amplification mediates its oncogenic effects at least in part through upregulation/activation of EGFR and its downstream signaling pathways (MAPK and PI3K).
Notably, our findings are consonant with prior observations from genetically-engineered mouse models. In particular, Nkx2-1 was recently shown to be required for Egfr-driven murine lung cancer [16]. As suggested from our data, Nkx2-1 might be required for the upregulated expression of Egfr transcript and/or protein (albeit the Egfr transgene is not present at its native genome locus). In contrast, in Kras mutant mice Nkx2-1 loss was shown to accelerate lung cancer (mucinous adenocarcinoma) [16] and potentiate lung cancer metastasis [15]. NKX2-1 likely plays distinct roles in the context of EGFR-driven and KRAS-driven lung cancers, with Egfr-driven murine tumors perhaps more closely modeling NKX2-1 amplified human lung cancer.
Our findings further suggest a feedback inhibition loop between NKX2-1 and EGFR, which regulates downstream signaling (summarized in Fig 6). In support of this, combined knockdown of NKX2-1 and EGFR further reduced MAPK and PI3K signaling, with resultant diminished cell proliferation. However, it is possible that NKX2-1 also transcriptionally regulates other inputs into MAPK and PI3K signaling (as suggested by our NKX2-1 binding site analysis; Table L in S1 File), and/or other pathways affecting cell proliferation and survival. Indeed, it is rather over-simplistic to presume that a transcription factor such as NKX2-1 acts through only a single downstream target (whether EGFR, or ROR1 or LMO3 as proposed by others). Moreover, NKX2-1 is subject to post-translational modification (e.g. phosphorylation and acetylation) affecting its activity [4] that could potentially influence its modulation of target genes and hence oncogenesis.
Notably, NKX2-1 knockdown enhanced the growth-suppressive effect of EGFR inhibition, whether by siRNA or by the small molecule inhibitor erlotinib. EGFR inhibitors including erlotinib are used clinically to treat lung cancers carrying activating EGFR mutations, but are less effective against lung cancers with wildtype EGFR [40]. Our findings suggest that concomitant NKX2-1 inhibition may augment the efficacy of EGFR inhibitors in NKX2-1 amplified lung tumors. Although transcription factors are not considered ideal drug targets, recent successes in this area [42] suggest that future investigations based on this strategy may be warranted.
Further studies should define the precise mechanisms by which EGFR effectuates NKX2-1 oncogenesis in NSCLC. Nevertheless, our studies newly link two key NSCLC oncogenes, NKX2-1 and EGFR, thereby uncovering an important new area of investigation and possible therapeutic strategy. Furthermore, the RNAseq and ChIPseq datasets should provide a rich resource for mining additional mediators and cooperators of NKX2-1 oncogenesis.
Supporting Information
Acknowledgments
We thank members of the Pollack lab for helpful comments.
Data Availability
The complete dataset of raw RNAseq and ChIPseq reads is available at the NCBI Sequence Read Archive (Accession SRP045118).
Funding Statement
This research was supported by a grant from the Tobacco-Related Disease Research Program (21XT-0054). NC was supported by fellowships from the Gates Millennium Foundation and Stanford Cancer Biology Program, and JMD from the Stanford Tumor Biology Training Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.American Cancer Society. Cancer Facts & Figures 2014. American Cancer Society Annual Report. 2014.
- 2. Herbst RS, Heymach JV, Lippman SM. Lung cancer. The New England journal of medicine. 2008;359(13):1367–80. Epub 2008/09/26. 10.1056/NEJMra0802714 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Cooper WA, Lam DC, O'Toole SA, Minna JD. Molecular biology of lung cancer. J Thorac Dis. 2013;5 Suppl 5:S479–90. Epub 2013/10/29. 10.3978/j.issn.2072-1439.2013.08.03 jtd-05-S5-S479 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Yamaguchi T, Hosono Y, Yanagisawa K, Takahashi T. NKX2-1/TTF-1: an enigmatic oncogene that functions as a double-edged sword for cancer cell survival and progression. Cancer cell. 2013;23(6):718–23. Epub 2013/06/15. 10.1016/j.ccr.2013.04.002 . [DOI] [PubMed] [Google Scholar]
- 5. Mu D. The complexity of thyroid transcription factor 1 with both pro- and anti-oncogenic activities. The Journal of biological chemistry. 2013;288(35):24992–5000. Epub 2013/07/03. 10.1074/jbc.R113.491647 R113.491647 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Boggaram V. Thyroid transcription factor-1 (TTF-1/Nkx2.1/TITF1) gene regulation in the lung. Clinical science. 2009;116(1):27–35. Epub 2008/11/29. 10.1042/CS20080068 . [DOI] [PubMed] [Google Scholar]
- 7. Ordonez NG. Thyroid transcription factor-1 is a marker of lung and thyroid carcinomas. Adv Anat Pathol. 2000;7(2):123–7. Epub 2000/03/18. . [DOI] [PubMed] [Google Scholar]
- 8. Kendall J, Liu Q, Bakleh A, Krasnitz A, Nguyen KC, Lakshmi B, et al. Oncogenic cooperation and coamplification of developmental transcription factor genes in lung cancer. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(42):16663–8. Epub 2007/10/11. 10.1073/pnas.0708286104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Weir BA, Woo MS, Getz G, Perner S, Ding L, Beroukhim R, et al. Characterizing the cancer genome in lung adenocarcinoma. Nature. 2007;450(7171):893–8. Epub 2007/11/06. 10.1038/nature06358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Kwei KA, Kim YH, Girard L, Kao J, Pacyna-Gengelbach M, Salari K, et al. Genomic profiling identifies TITF1 as a lineage-specific oncogene amplified in lung cancer. Oncogene. 2008;27(25):3635–40. Epub 2008/01/24. 10.1038/sj.onc.1211012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Tanaka H, Yanagisawa K, Shinjo K, Taguchi A, Maeno K, Tomida S, et al. Lineage-specific dependency of lung adenocarcinomas on the lung development regulator TTF-1. Cancer research. 2007;67(13):6007–11. Epub 2007/07/10. 10.1158/0008-5472.CAN-06-4774 . [DOI] [PubMed] [Google Scholar]
- 12. Garraway LA, Sellers WR. Lineage dependency and lineage-survival oncogenes in human cancer. Nature reviews Cancer. 2006;6(8):593–602. Epub 2006/07/25. 10.1038/nrc1947 . [DOI] [PubMed] [Google Scholar]
- 13. Yamaguchi T, Yanagisawa K, Sugiyama R, Hosono Y, Shimada Y, Arima C, et al. NKX2-1/TITF1/TTF-1-Induced ROR1 is required to sustain EGFR survival signaling in lung adenocarcinoma. Cancer cell. 2012;21(3):348–61. Epub 2012/03/24. 10.1016/j.ccr.2012.02.008 . [DOI] [PubMed] [Google Scholar]
- 14. Watanabe H, Francis JM, Woo MS, Etemad B, Lin W, Fries DF, et al. Integrated cistromic and expression analysis of amplified NKX2-1 in lung adenocarcinoma identifies LMO3 as a functional transcriptional target. Genes & development. 2013;27(2):197–210. Epub 2013/01/17. 10.1101/gad.203208.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Winslow MM, Dayton TL, Verhaak RG, Kim-Kiselak C, Snyder EL, Feldser DM, et al. Suppression of lung adenocarcinoma progression by Nkx2-1. Nature. 2011;473(7345):101–4. Epub 2011/04/08. 10.1038/nature09881 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Maeda Y, Tsuchiya T, Hao H, Tompkins DH, Xu Y, Mucenski ML, et al. Kras(G12D) and Nkx2-1 haploinsufficiency induce mucinous adenocarcinoma of the lung. The Journal of clinical investigation. 2012;122(12):4388–400. Epub 2012/11/13. 10.1172/JCI64048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Banks-Schlegel SP, Gazdar AF, Harris CC. Intermediate filament and cross-linked envelope expression in human lung tumor cell lines. Cancer research. 1985;45(3):1187–97. Epub 1985/03/01. . [PubMed] [Google Scholar]
- 18. Gazdar AF, Minna JD. NCI series of cell lines: an historical perspective. J Cell Biochem Suppl. 1996;24:1–11. Epub 1996/01/01. . [DOI] [PubMed] [Google Scholar]
- 19. Phelps RM, Johnson BE, Ihde DC, Gazdar AF, Carbone DP, McClintock PR, et al. NCI-Navy Medical Oncology Branch cell line data base. J Cell Biochem Suppl. 1996;24:32–91. Epub 1996/01/01. . [DOI] [PubMed] [Google Scholar]
- 20. Wistuba II, Bryant D, Behrens C, Milchgrub S, Virmani AK, Ashfaq R, et al. Comparison of features of human lung cancer cell lines and their corresponding tumors. Clinical cancer research: an official journal of the American Association for Cancer Research. 1999;5(5):991–1000. Epub 1999/06/03. . [PubMed] [Google Scholar]
- 21. Cao Y, Vo T, Millien G, Tagne JB, Kotton D, Mason RJ, et al. Epigenetic mechanisms modulate thyroid transcription factor 1-mediated transcription of the surfactant protein B gene. The Journal of biological chemistry. 2010;285(3):2152–64. Epub 2009/11/13. 10.1074/jbc.M109.039172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Subramanian A, Kuehn H, Gould J, Tamayo P, Mesirov JP. GSEA-P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics. 2007;23(23):3251–3. Epub 2007/07/24. doi: btm369 [pii] 10.1093/bioinformatics/btm369 . [DOI] [PubMed] [Google Scholar]
- 23. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic acids research. 2009;37(Web Server issue):W202–8. Epub 2009/05/22. 10.1093/nar/gkp335 gkp335 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(43):15545–50. Epub 2005/10/04. 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Comprehensive molecular profiling of lung adenocarcinoma. Nature. 2014;511(7511):543–50. Epub 2014/08/01. 10.1038/nature13385 nature13385 [pii]. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proceedings of the National Academy of Sciences of the United States of America. 2001;98(9):5116–21. Epub 2001/04/20. 10.1073/pnas.091062498 091062498 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Gazdar AF, Girard L, Lockwood WW, Lam WL, Minna JD. Lung cancer cell lines as tools for biomedical discovery and research. J Natl Cancer Inst. 2010;102(17):1310–21. Epub 2010/08/04. 10.1093/jnci/djq279 djq279 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Tsai HY, Yang YF, Wu AT, Yang CJ, Liu YP, Jan YH, et al. Endoplasmic reticulum ribosome-binding protein 1 (RRBP1) overexpression is frequently found in lung cancer patients and alleviates intracellular stress-induced apoptosis through the enhancement of GRP78. Oncogene. 2013;32(41):4921–31. Epub 2013/01/16. 10.1038/onc.2012.514 onc2012514 [pii]. . [DOI] [PubMed] [Google Scholar]
- 29. Chen FF, Liu Y, Wang F, Pang XJ, Zhu CD, Xu M, et al. Effects of upregulation of Id3 in human lung adenocarcinoma cells on proliferation, apoptosis, mobility and tumorigenicity. Cancer Gene Ther. 2015. Epub 2015/09/19. 10.1038/cgt.2015.38 cgt201538 [pii]. . [DOI] [PubMed] [Google Scholar]
- 30. Tagne JB, Gupta S, Gower AC, Shen SS, Varma S, Lakshminarayanan M, et al. Genome-wide analyses of Nkx2-1 binding to transcriptional target genes uncover novel regulatory patterns conserved in lung development and tumors. PloS one. 2012;7(1):e29907 Epub 2012/01/14. 10.1371/journal.pone.0029907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Berger MF, Badis G, Gehrke AR, Talukder S, Philippakis AA, Pena-Castillo L, et al. Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences. Cell. 2008;133(7):1266–76. Epub 2008/07/01. 10.1016/j.cell.2008.05.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Minoo P, Hu L, Xing Y, Zhu NL, Chen H, Li M, et al. Physical and functional interactions between homeodomain NKX2.1 and winged helix/forkhead FOXA1 in lung epithelial cells. Mol Cell Biol. 2007;27(6):2155–65. Epub 2007/01/16. doi: MCB.01133-06 [pii] 10.1128/MCB.01133-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Amann J, Kalyankrishna S, Massion PP, Ohm JE, Girard L, Shigematsu H, et al. Aberrant epidermal growth factor receptor signaling and enhanced sensitivity to EGFR inhibitors in lung cancer. Cancer research. 2005;65(1):226–35. Epub 2005/01/25. doi: 65/1/226 [pii]. . [PubMed] [Google Scholar]
- 34. Conde E, Angulo B, Tang M, Morente M, Torres-Lanzas J, Lopez-Encuentra A, et al. Molecular context of the EGFR mutations: evidence for the activation of mTOR/S6K signaling. Clinical cancer research: an official journal of the American Association for Cancer Research. 2006;12(3 Pt 1):710–7. Epub 2006/02/10. doi: 12/3/710 [pii] 10.1158/1078-0432.CCR-05-1362 . [DOI] [PubMed] [Google Scholar]
- 35. Thomas RK, Baker AC, Debiasi RM, Winckler W, Laframboise T, Lin WM, et al. High-throughput oncogene mutation profiling in human cancer. Nature genetics. 2007;39(3):347–51. Epub 2007/02/13. doi: ng1975 [pii] 10.1038/ng1975 . [DOI] [PubMed] [Google Scholar]
- 36. Kubo T, Yamamoto H, Lockwood WW, Valencia I, Soh J, Peyton M, et al. MET gene amplification or EGFR mutation activate MET in lung cancers untreated with EGFR tyrosine kinase inhibitors. International journal of cancer Journal international du cancer. 2009;124(8):1778–84. Epub 2009/01/01. 10.1002/ijc.24150 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Chen G, Kronenberger P, Teugels E, Umelo IA, De Greve J. Targeting the epidermal growth factor receptor in non-small cell lung cancer cells: the effect of combining RNA interference with tyrosine kinase inhibitors or cetuximab. BMC Med. 2012;10:28 Epub 2012/03/23. 10.1186/1741-7015-10-28 1741-7015-10-28 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Sharma SV, Bell DW, Settleman J, Haber DA. Epidermal growth factor receptor mutations in lung cancer. Nature reviews Cancer. 2007;7(3):169–81. Epub 2007/02/24. 10.1038/nrc2088 . [DOI] [PubMed] [Google Scholar]
- 39. Haura EB, Cress WD, Chellappan S, Zheng Z, Bepler G. Antiapoptotic signaling pathways in non-small-cell lung cancer: biology and therapeutic strategies. Clin Lung Cancer. 2004;6(2):113–22. Epub 2004/10/13. doi: S1525-7304(11)70209-1 [pii] 10.3816/CLC.2004.n.025 . [DOI] [PubMed] [Google Scholar]
- 40. Pao W, Chmielecki J. Rational, biologically based treatment of EGFR-mutant non-small-cell lung cancer. Nature reviews Cancer. 2010;10(11):760–74. Epub 2010/10/23. 10.1038/nrc2947 nrc2947 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Finberg KE, Sequist LV, Joshi VA, Muzikansky A, Miller JM, Han M, et al. Mucinous differentiation correlates with absence of EGFR mutation and presence of KRAS mutation in lung adenocarcinomas with bronchioloalveolar features. J Mol Diagn. 2007;9(3):320–6. Epub 2007/06/27. doi: S1525-1578(10)60400-X [pii] 10.2353/jmoldx.2007.060182 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Yeh JE, Toniolo PA, Frank DA. Targeting transcription factors: promising new strategies for cancer therapy. Curr Opin Oncol. 2013;25(6):652–8. Epub 2013/09/21. 10.1097/01.cco.0000432528.88101.1a . [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The complete dataset of raw RNAseq and ChIPseq reads is available at the NCBI Sequence Read Archive (Accession SRP045118).