Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Jun 30.
Published in final edited form as: Cell Rep. 2017 Mar 14;18(11):2780–2794. doi: 10.1016/j.celrep.2017.02.033

Integrative Genomic Analysis of Cholangiocarcinoma Identifies Distinct IDH-Mutant Molecular Profiles

Farshad Farshidfar 1, Siyuan Zheng 2, Marie-Claude Gingras 3, Yulia Newton 4, Juliann Shih 5,6, A Gordon Robertson 7, Toshinori Hinoue 8, Katherine A Hoadley 9,10, Ewan A Gibb 7, Jason Roszik 2, Kyle R Covington 3, Chia-Chin Wu 2, Eve Shinbrot 3, Nicolas Stransky 11, Apurva Hegde 2, Ju Dong Yang 12, Ed Reznik 13, Sara Sadeghi 7, Chandra Sekhar Pedamallu 5,6, Akinyemi I Ojesina 14,15, Julian M Hess 5, J Todd Auman 9, Suhn K Rhie 16, Reanne Bowlby 7, Mitesh J Borad 17; The Cancer Genome Atlas Network, Andrew X Zhu 18, Josh M Stuart 4, Chris Sander 13, Rehan Akbani 2, Andrew D Cherniack 5,6, Vikram Deshpande 19, Taofic Mounajjed 12, Wai Chin Foo 2, Michael S Torbenson 12, David E Kleiner 20, Peter W Laird 8, David A Wheeler 3, Autumn J McRee 10, Oliver F Bathe 1, Jesper B Andersen 21,, Nabeel Bardeesy 19,, Lewis R Roberts 12,22,, Lawrence N Kwong 2,22,23,
PMCID: PMC5493145  NIHMSID: NIHMS866722  PMID: 28297679

Summary

Cholangiocarcinoma (CCA) is an aggressive malignancy of the bile ducts, with poor prognosis and limited treatment options. Here, we describe the integrated analysis of somatic mutations, RNA expression, copy number, and DNA methylation by The Cancer Genome Atlas of a set of predominantly intrahepatic CCA cases and propose a molecular classification scheme. We identified an IDH mutant-enriched subtype with distinct molecular features including low expression of chromatin modifiers, elevated expression of mitochondrial genes, and increased mitochondrial DNA copy number. Leveraging the multi-platform data, we observed that ARID1A exhibited DNA hypermethylation and decreased expression in the IDH mutant subtype. More broadly, we found that IDH mutations are associated with an expanded histological spectrum of liver tumors with molecular features that stratify with CCA. Our studies reveal insights into the molecular pathogenesis and heterogeneity of cholangiocarcinoma and provide classification information of potential therapeutic significance.

Graphical abstract

graphic file with name nihms866722u1.jpg

Introduction

Cholangiocarcinomas (CCAs) are a group of malignancies of the biliary epithelium (cholangiocytes), comprising invasive carcinomas that arise in the intrahepatic, perihilar, and extrahepatic biliary tree (Razumilava and Gores, 2014). Most patients with CCA present with advanced disease and have a median survival of less than 1 year despite treatment with current standard chemotherapy (Valle et al., 2010). Even patients who undergo apparently curative resection have poor outcomes due to a high rate of tumor recurrence (Razumilava and Gores, 2014). Although intrahepatic, perihilar, and extrahepatic CCAs share morphologic features and have traditionally been aggregated in clinical trials, it is now apparent that there are important differences in tumor biology and genetics among tumors from different anatomic sites (Chan-On et al., 2013; Churi et al., 2014). Further elucidation of molecular alterations in these heterogeneous tumors and discovery of meaningful subtypes within each anatomic group are important steps toward developing more rational, specific, and effective treatments (Kelley and Bardeesy, 2015).

Cholangiocarcinoma is the second most common type of primary liver tumor, and its incidence has been rising in the United States, from 0.44 per 100,000 in 1973 to 1.18 in 2012 (Saha et al., 2016). The actual incidence of the disease is likely much higher, as recent gene expression studies suggest that “carcinomas-of-unknown primary” identified in the liver most commonly originate from biliary epithelium (Varadhachary and Raber, 2014). Worldwide, the highest incidence of CCA is found in northeastern Thailand, where parasitic infection by liver flukes (Clonorchis sinensis and Opistorchis viverrini) leads to infestation of the biliary tree (Razumilava and Gores, 2014). In regions without liver fluke infestation, CCA disease etiologies include: primary sclerosing cholangitis, hepatitis B or C virus (HBV/HCV) infection, biliary stone disease, congenital biliary malformations, cirrhosis, and exposure to aromatic toxins (Razumilava and Gores, 2014). Lifestyle-related factors such as smoking, alcohol consumption, and diabetes also contribute to the risk of intrahepatic CCA (iCCA) (Palmer and Patel, 2012). Given the diversity of risk factors influencing the mutational spectrum and the distinct cellular origins of the CCA subtypes, there is still an incomplete appreciation of the mechanisms of biliary carcinogenesis.

Prior studies indicate that iCCAs are unusual among epithelial cancers in having a relatively high rate of missense mutations in the isocitrate dehydrogenase 1 and 2 (IDH1/IDH2) genes (Borger et al., 2012; Chan-On et al., 2013; Jiao et al., 2013; Kipp et al., 2012; Ross et al., 2014; Wang et al., 2013), which encode metabolic enzymes that interconvert isocitrate and alpha-ketoglutarate in central carbon metabolism. These mutations, which are also common in acute myeloid leukemia, low-grade glioma and glioblastoma, and chondrosarcoma, occur at defined hotspots and result in neomorphic enzyme activity, leading to production of high levels of the metabolite (R)-2-hydroxyglutarate (2HG) (Losman and Kaelin, 2013). 2HG does not participate in normal metabolic processes but instead interferes with the function of enzymes that utilize alpha-ketoglutarate as a co-enzyme, including histone and DNA demethylases, and inhibits the mitochondrial electron transport chain (Fu et al., 2015; Parker and Metallo, 2015). Additional recurrent mutations and fusions have been reported in CCAs involving the fibroblast growth factor receptor 2 (FGFR2) gene, as well as in KRAS, BRAF, TP53, and in genes encoding chromatin-modifying enzymes (Borad et al., 2014; Churi et al., 2014; Jiao et al., 2013; Nakamura et al., 2015; Ross et al., 2014; Sia et al., 2015). Given this complexity, it is imperative to generate a more integrative model of the molecular alterations in iCCA to better define the oncogenic circuitry and to derive a classification system that groups tumors into biologically meaningful subtypes that can be used to guide therapy.

In this study, we describe molecular features of 38 liver fluke-negative CCAs—predominantly intrahepatic and hepatitis negative—that stratify the disease into distinct groups. Most notably, we identify a class of CCAs with distinct transcriptomic, copy number, and methylation profiles that are highly enriched for IDH mutant samples. We also identify pathways and methylation patterns that we validate in external datasets and which could help lead to the development of more effective treatments. Finally, cross-platform comparisons of CCA with pancreatic cancer and hepatocellular carcinoma (HCC) further emphasize the presence of distinct tumor subsets.

Results

Samples

We analyzed 38 CCA samples that were predominantly from North America (89%), intrahepatic (84%), fluke-negative (100%), and HBV/HCV-negative (97%, as assessed by combined DNA and RNA sequencing, which also revealed no HBV integration sites), and had no prior exposure to chemotherapy or radiation (Tables 1 and S1). This relative overall sample uniformity minimizes known and potential sources of molecular heterogeneity in our sample set.

Table 1. Patient Characteristics.

Characteristics Number of Patients
Total number of patients 38
Median age at diagnosis (years) 66 (range 29–82)
Gender
Female 21
Male 17
ECOG PS
0 19
1 9
2 0
3 1
Unknown 9
Histologic Grade
G1 5
G2 21
G3 9
G4 2
Unknown 1
Resection status (R0/R1/Rx) 28/7/3
Tumor Stage
T1 19
T2 (T2a/T2b) 15 (2/5)
T3 4
Histologic Diagnosis
Intrahepatic 32
Extrahepatic/hilar 4
Mixed ICC/HCC 2
Lymph node status (N0/N1/Nx) 26/5/7
Metastatic disease (M0/M1/Mx) 30/4/4
Lymphovascular invasion (yes/no/unknown) 15/22/1
Perineural invasion (yes/no/unknown) 4/33/1
Elevated CA 19-9 (yes/no/unknown) 18/13/7
Race (white/Asian/black) 33/3/2
Country submitting tumor (USA/Canada/Italy/Brazil/Vietnam 30/4/2/1/1
Elevated CA 19-9 (yes/no/unknown) 18/13/7
Presence of fluke infection (Y/N) 0/38

ECOG, Eastern Cooperative Oncology Group.

The TCGA analysis pipeline used in this study consists of the following platforms: wholeexome sequencing; Affymetrix SNP 6.0 copy number array; RNA sequencing (RNA-seq), including microRNA (miRNA) and long noncoding RNA (IncRNA); DNA methylation; and reverse-phase protein array (RPPA) utilizing 192 antibodies. These datasets are annotated with relevant clinical information and careful histopathologic review by several experienced hepatobiliary pathologists (Table S1).

Gene-Level Mutations, Fusions, and Copy Number Alterations

We first annotated alterations to specific genes. Whole-exome sequencing identified 2,831 somatic mutations, of which 1,869 (66%) were non-silent coding mutations. Targeted-capture, deep sequencing validation of 43 selected genes confirmed 77 mutations and newly identified nine (see Supplemental Experimental Procedures and Table S2). The median mutation rate was 1.38 per megabase, with a median mutation count of 49 (Figure 1A; Table S1). Compared to other cancers assessed by TCGA (Lawrence et al., 2013), this CCA mutation rate is intermediate, and comparable to that of pancreatic ductal adenocarcinoma (PDAC).

Figure 1. Somatic Alterations in Cholangiocarcinoma.

Figure 1

(A and B) Significantly mutated genes identified using the MutSigCV algorithm, and additional genes with chromosomal alterations, hotspot mutations (red font), and possibly functional mutations, grouped by pathway. (A) TCGA analysis sample set (n = 38). (B) Additional sample set (n = 15). Left, mutation amount and percentage, plus epigenetic silencing for CDKN2A. Top, overall number of mutations per megabase. Bottom, mutation spectra signatures. Dashed yellow line in upper panel indicates median mutations/megabase.

Consistent with previous studies, we identified inactivating mutations in the tumor suppressor genes ARID1A, ARID1B, BAP1, PBRM1, TP53, STK11, and PTEN, and hotspot gain-of-function mutations in the oncogenes IDH1, IDH2, KRAS, BRAF, and PIK3CA (Figures 1A and S1A; Tables S1 and S3). Also consistent with prior reports, the IDH mutant alleles described in our cohort (IDH1R132C and IDH2R172K/S) are distinct from those found in glioma and acute myeloid leukemia (enriched for IDH1R132H and IDH2R140Q) (Cancer Genome Atlas Research Network, 2013; Brat et al., 2015). In two tumors, we identified a recurrent P216L mutation in the regulatory domain of ARAF adjacent to the functionally validated N217I mutation (Sia et al., 2015), which suggests an activated state. We also identified two frameshift deletions and one missense mutation S217F (Zou et al., 2014) in the albumin gene (ALB), one of the most significantly mutated genes in HCC (Schulze et al., 2015). Finally, we detected a single telomerase reverse transcriptase (TERT) promoter mutation, in a mixed HCC/iCCA sample. To further validate the observed mutations, we performed whole-exome sequencing and targeted deep validation on an independent set of 15 samples (Figure 1B). Although analyzed separately, these additional samples corroborated the mutations above and highlighted additional recurrent mutations in the BRCA2, MLL3, APC, NF1, and ELF3 tumor-suppressor genes.

An analysis of gene fusions from RNA-seq data identified five samples (13%) that expressed FGFR2 fusion transcripts; this prevalence is consistent with other studies (Arai et al., 2014; Churi et al., 2014; Goyal et al., 2016; Nakamura et al., 2015; Ross et al., 2014). Our cases included two with the known fusion partner BICC1 and three with the partners KIAA1598, FRK, and C10ORF118 (Table S3). Other than the FGFR2-FRK fusion, which resulted in loss of the FGFR2 kinase domain and retention of the FRK kinase domain, the rest of the fusions retained the kinase domain, consisting of FGFR2 exons 1–17 spliced in frame with the partner gene. We note that BICC1, KIAA1598, and C10ORF118 are located on chromosome 10 along with FGFR2. We also observed two missense mutations and one in-frame insertion in FGFR2.

Somatic copy number alterations (SCNAs) determined by analysis of high-density SNP data identified recurrent focal losses of CDKN2A and amplifications of CCND1. We further identified low-prevalence cases of focal SCNAs that have been reported in other cancers, including amplification of CDK4/MDM2 and homozygous focal deletion of QKI and SAV1 (Table S1; Figure S1B). The functional perturbation of these genes is supported by correlative gene expression data (Figure S1C). Epigenetic silencing of CDKN2A was identified in eight cases (21%) and was mutually exclusive with homozygous deletions and mutations (Figures 1, S1D, and S1E). Collectively, CDKN2A was mutated, deleted, or silenced in 47% of cancers, a higher rate than previously appreciated with single platform analyses.

Next, cross-comparing sequencing and copy number data, we found that all mutations in BAP1 and PBRM1 (both located on 3p21) were detected in tumors with 3p loss of heterozygosity, suggesting biallelic inactivation of these genes in near-diploid tumors. Cancer cell fractions were higher for the broad or arm-level loss of chromosome 3 than for BAP1 mutations, followed by PBRM1 mutations, suggesting that these events occur chronologically (3p loss, BAP1, PBRM1) in CCA development (Figures S1F and S1G). We also note that the NF2 splice site mutation in sample AA0S experienced loss of heterozygosity, suggesting bi-allelic loss of this Hippo pathway tumor suppressor gene.

Finally, using a previously defined mutational signature assessment (Covington et al., 2016) and non-negative matrix factorization, we identified seven enriched mutation signatures out of 21 total signatures. As observed across 31 tumor types analyzed to date (Covington et al., 2016), the most common signature was C > T/G > A substitutions at CpG islands (signature #6), followed by signature #1, characterized by AC > AN, AT > AN (Figure 1).

Filtering Normal Liver Genes Uncovers an IDH-Mutant-Enriched mRNA Subgroup

We next analyzed mRNA expression by unsupervised hierarchical clustering, selecting only the most variable 2% (i.e., 400) of genes. The two resultant clusters showed a large differential expression of the genes (Figure S2A) with strong enrichment for a liver-associated signature. However, the clusters did not correlate with any other molecular or clinical parameter. Given that most liver signature genes are expressed in normal liver at levels 1,000- to20,000-fold higher than in tumors, we considered that the liver signature expression in part reflected contamination by even a small amount of normal liver. Consistent with this hypothesis, histological analysis revealed a trend toward higher normal liver contamination in the liver-high cluster (Figure S2C), which was not picked up by DNA-based tumor purity estimates. This indicates that high-expressing stromal genes likely confound mRNA expression clustering, particularly when using only the top-most variable genes.

To reduce the statistical effects of the liver signature, 386 liver-specific genes derived from the GTEx (Genotype-Tissue Expression) normal tissue expression database (Lonsdale et al., 2013) (Table S4) were filtered out. The remaining 15,427 genes underwent principal component analysis (PCA) (Figures 2A and 2B), identifying three clusters. A subset of 1,150 genes was identified by orthogonal partial least-squares discriminant analysis (OPLSDA) as most strongly contributing to cluster separation (Figure 2A). Intriguingly, cluster 1 included all seven cases with an IDH1 or IDH2 hotspot mutation, while cluster 2 was enriched in extrahepatic or perihilar CCA, and cluster 3 contained all five FGFR2 fusions. This was validated by comparison with the previously generated microarray gene expression dataset GSE26566 (Andersen et al., 2012; Wang et al., 2013). Hierarchical clustering was performed for the 976 most strongly discriminant genes shared between datasets (Figure 2C). In this validation cohort of 40 samples, three main clusters were identified that resembled the TCGA dataset clusters. Notably, most IDH1/2 mutations (eight of ten) were located in the cluster that most strongly resembled the IDH-mutant-enriched TCGA cluster 1. Thus, removal of the liver signature unmasked transcriptional clusters that segregated samples in a biologically relevant manner.

Figure 2. mRNA Analysis of Cholangiocarcinoma Identified an IDH-Enriched Cluster.

Figure 2

(A and B) Principal component analysis of RNA-seq expression data of 15,272 genes after exclusion of 541 normal liver genes. The heatmap in (A) shows the most strongly discriminant 973 genes (shared between the TCGA and the GSE26566 dataset) that define the three clusters. (B) Three-dimensional PCA plot of TCGA CCA samples. (C) Hierarchical clustering analysis of 40 samples from the CCA microarray dataset GSE26566, using the same 973 genes as in (A). Genes in heatmaps (A) and (C) are shown in the same order.

The IDH Mutant Subgroup Is Enriched for Mitochondrial and Chromatin-Modifier Signatures

We next performed gene set enrichment analysis (GSEA) on the mRNA clusters and discovered an enriched expression of genes involved in mitochondrial structure and function in the IDH-mutant-enriched cluster 1. This included notable upregulation of genes encoding citric acid cycle enzymes, mitochondrial ribosomal proteins, electron transport chain components, and mitochondrial structural constituents, consistent with altered control of oxidative phosphorylation and mitochondrial biogenesis (Figure 3A; Tables S5 and S6) (hereafter collectively referred to as “mitochondrial gene expression”). High mitochondrial gene expression was significantly associated with IDH and PBRM1 mutant samples and low expression with FGFR2-fusion samples (Figure 3B and depicted as a condensed “mitochondrial score” in Figure 3D); these correlations were improved by removing low-purity (<0.65) samples, a possible confounding factor (Figures S3A and S3B). In keeping with potential functional relevance of the differential expression of mitochondrial genes, we identified a relatively higher mitochondrial copy number (Reznik et al., 2016) in IDH mutant samples and a lower number in FGFR2-fusion samples (Figure 3E).

Figure 3. IDH Mutant Cancers Are Enriched for a High Mitochondrial Signature and Mitochondrial DNA Count and a Low Chromatin Modifier Signature.

Figure 3

(A) GSEA analysis identified mitochondrial and chromatin modifier genes as significantly differentially expressed in the IDH-mutant-enriched mRNA cluster 1. Selected pathways are shown, omitting genesets that show a high degree of gene membership overlap with the displayed pathways. Full results are available in Table S6. NES, normalized enrichment score; size, geneset size.

(B and C) Heatmaps of the most significant mitochondrial and chromatin-modifier genes for TCGA (B) and GSE26566 (C). TCGA samples are filtered for high purity (>0.65); unfiltered results are shown in Figure S9.

(D and E) Quantification of mitochondrial signature (D) and mitochondrial DNA (E) count for different mutational subgroups, showing high-purity samples only. No subgroup by itself is significantly different from all other samples, indicating only enrichments and not exclusivity for high/low mitochondrial markers. Box and whisker plots show maximum and minimum bars.

(F) Pan-cancer correlation analysis of mitochondrial and chromatin modifier genes. For each cancer, the genes on the x and y axes are the same and in the same order. Red signifies high positive Spearman correlation values; green denotes high negative values for each gene-gene comparison. C, chromatin modifiers; M, mitochondrial genes. Genes are listed in Table S8.

Examination of the GSE26566 dataset (Andersen et al., 2012) provided an external validation of these findings, again identifying an enrichment of IDH mutants among the tumors with high expression of the mitochondrial gene signature (Figure 3C). This association between this signature and IDH mutations appears to be particular to CCA, since it was not observed upon analysis of TCGA datasets for glioblastoma, low-grade glioma, melanoma, or acute myeloid leukemia TCGA datasets (Figure S3C).

GSEA also identified chromatin modifier gene sets as significantly downregulated in the IDH mutant mRNA cluster (Figure 3A; Tables S5, S6, and S7. Notably, these included genes recurrently mutated in CCA—ARID1A, ARID1B, and PBRM—as well as genes whose protein products are known to be inhibited by IDH-mutant-generated 2HG, including TET2, TET3, KDM2A, and KDM5B (Xu et al., 2011). Expression of the chromatin modifier geneset anticorrelated with the mitochondrial geneset even when considered as a gradient (Figure S3D; Table S7). Querying of this association in multiple TCGA gene expression datasets demonstrated strikingly consistent anticorrelation of these two pathways across 23 of 25 cancer types (Figure 3F) as well as across and within normal tissues (Figure S3E) from the GTEx database for most genes. These results suggest that mitochondrial activity and chromatin modification are linked basic biological events that are also regulated by IDH hotspot mutations in CCA.

Cluster-of-Clusters Analysis Identifies Four Subgroups

We next enlisted the additional platforms (copy number, methylation, miRNA, IncRNA, and protein) into a clustering of the cluster assignments (COCAs) (Hoadley et al., 2014), which provides a way to distinguish sample subtypes by identifying patterns across platforms. We started by performing hierarchical clustering of each platform separately. First, copy number based on SNP array hybridization data (Figure 4A) revealed four genomic clusters. We note that this clustering is performed on select SCNAs determined by genome identification of significant targets in cancer (GISTIC) analysis to be significantly differentially altered, as using all data points results in overfitting. This approach therefore highlights samples that share loci that are likely undergoing positive selection in the tumor. Cluster 4 consisted entirely of tumors with high-level amplification of CCND1. Cluster 2 was characterized by enrichment of chromosomal deletions, particularly 6q, 9, and 14. Cluster 3 tumors contained mostly SCNAs that were found across the larger set of tumors (e.g., 1p loss and 1q gain), but on average had fewer arm-level deletions than cluster 2. Last, cluster 1 consisted of molecularly atypical tumors, including two genomically silent cases that were completely devoid of copy number alterations or recurrent CCA driver mutations (the low-purity extrahepatic W5-AAH2 and the 0.61-purity intrahepatic ZH-A8Y6).

Figure 4. Cluster-of-Clusters Analysis of CCAs.

Figure 4

(A) Unsupervised hierarchical clustering of copy number data. The clustering is performed on SCNAs that are determined by GISTIC analysis to be significantly altered, as shown in the lower heatmap.

(B) Unsupervised hierarchical clustering of DNA methylation data.

(C) The cluster-of-clusters analysis (COCA) was performed on the three platforms with the strongest degree of correlation (mRNA, copy number, and methyl-ation). Bottom, robustness scores indicating strength of cluster membership assignment for each sample. Selected information of interest is shown here; full clustering, genetic, clinical, and pathological data are available in Table S1.

Next, unsupervised clustering of samples using CpG sites that show cancer-specific DNA methylation changes identified four subgroups in our CCA cohort (Figure 4B). Tumors in cluster 1 showed minimal alterations in DNA methylation compared with normal liver, which is at least partially explained by low tumor purity for most of the samples. The remaining tumors had prominent DNA hypermethylation and were classified into three subgroups. All seven IDH mutant tumors were present in cluster 4, along with one IDH-wild-type case that exhibited a gene expression profile similar to that of IDH mutants (see Figure S4A). Surprisingly, on average, tumors in cluster 2 showed an even greater degree of DNA hypermethylation than did IDH mutant tumors (Figures 4B and S4B). Last, we note that tumors in clusters 2 and 3 had frequent mutation of genes encoding chromatin regulators, including PBRM1 and ARID1A (ten of 20).

We observed that the copy number and methylation clusters generally matched the mRNA clusters identified in Figure 3, suggesting the ability of the data to detect shared biological mechanisms. By contrast, hierarchical clustering of mature miRNAs, IncRNAs, and protein yielded clusters that were discordant with the other platforms. We believe this is mainly due to the far smaller number of informative features available for clustering (169, 101, and 192, respectively) given the sample size (Figure S4). As expected, COCA analysis using all six platforms gave a discordant pattern with no clinical correlates, even when the lower-sample number RPPA was removed. We therefore conducted the COCA analysis using only mRNA, copy number, and methylation to discern biologically coherent clusters. We optimized a four-cluster solution (see Experimental Procedures) that was not dominated by any one platform (Figure 4C).

We then correlated clinical data and molecular aberrations with the four COCA clusters (Figure 4C; Table S1). IDH hotspot mutations were present exclusively in COCA2 (p = 0.0004; “IDH COCA”), reflecting the mRNA and methylation specificity noted earlier, and identifying a correlation with copy number cluster 2 (“genomically unstable”). Patients with IDH COCA tumors were typically nonsmokers, and the tumors exhibited a lower frequency of lymphatic invasion and chromosome arm 8p gains (Figure 4C). Three of four distal or hilar tumors were in COCA1 (p = 0.003; “ECC COCA”), which exhibited the following characteristics: wild-type for FGFR2, IDH1/2, ARID1A, BAP1, and PBRM1; low methylation; and relative genomic silence for copy number alterations. COCA3 was enriched for samples with CCND1 amplification and with the most highly hypermethylated profile (methylation cluster 2; “METH2 COCA”). COCA4 (“METH3 COCA” contained eight of 12 cases with BAP1 mutations (p = 0.01) and all five FGFR2 fusion cases (p = 0.004). Survival analysis among the COCA clusters did not yield significant p values, possibly due to the small sample size. All clustering solutions for individual platforms and for COCAs, as well as key genetic, clinical, and pathologic data are available in Table S1. We posit that these COCAs identify biologically distinct CCA subtypes with potential clinical implications; however, we also acknowledge the limitations due to the sample number, and that validation of these subtypes awaits new comparable datasets and functional confirmation in model systems. Nevertheless, these results clearly highlight the molecular distinctness of IDH mutants and the power of integrated multiplatform analyses.

ARID1A Promoter Hypermethylation and Decreased Expression in IDH Mutants

To extend our analysis of the IDH mutant subgroup, we considered that the COCA classifications may help reveal new cross-platform insights. To this end, we asked whether IDH-mutant-specific DNA hypermethylation may target genes that show decreased expression in the IDH COCA subtype. We restricted the analysis to high-purity samples to avoid assessing gene expression changes that are primarily associated with contaminating stroma rather than with methylation. After intersecting IDH mutant hypermethylated loci with genes with decreased expression in the IDH COCA subtype and filtering for gene-specific anticorrelation between the two platforms, we identified a list of 24 genes whose expression is putatively regulated by IDH mutant hypermethylation (Figures 5A and 5B). To validate this list, we cross-referenced it with the matched CCA methylation and expression datasets GSE32079 and GSE26566, respectively (Andersen et al., 2012; Wang et al., 2013). GSE32079 uses the same methylation array platform as the TCGA dataset, allowing for direct comparison of probes. While all 24 genes once again showed IDH-mutant-specific hypermethylation (Figure 5C), only three genes, ARID1A, MARVELD1, and SLC1A5 were significantly downregulated (Figure 5D) in the “IDH-like” mRNA cluster (cf. Figure 2C). Given the bona fide tumor suppressor role of ARID1A in the liver, we explored the relationship between DNA methylation and expression further (Figure 5E). Of four IDH-wt samples in the IDH COCA, two were ARID1A mutant with low ARID1A expression. Moreover, the only IDH mutant to not show ARID1A hypermethylation, A95A, was ARID1A mutant with low expression (Figures 5A and 5E), suggesting that ARID1A mutation and IDH-induced hypermethylation are mutually exclusive due to redundancy. Analysis of publically available histone modification ChIP-seq data showed that the two hypermethylated ARID1A probes were located in the ARID1A promoter within the active transcription marks H3K27Ac and H3K4me3 (Figure 5F). Collectively, these data suggest that IDH mutations result in hypermethylation and silencing of ARID1A, and that impingement of ARID1A is a convergent feature of IDH COCA tumors.

Figure 5. ARID1A Is Hypermethylated and Has Low Expression in the IDH COCA.

Figure 5

(A and B) TCGA methylation (A) and RNA-seq (B) data for 24 genes (36 probes) that show both IDHmut-specific hypermethylation and downregulation in the IDH COCA subtype.

(C and D) Methylation (C) and microarray (D) data for the same 24 genes and 36 methylation probes as in (A and B), in the publically available datasets GEO: GSE32079 and GSE26566.

(E) Scatterplot of TCGA methylation and RNA expression values for ARID1A.

(F) Location of the two ARID1A hypermethylated probes within the ARID1A promoter. ENCODE histone marks for H3K27Ac and H3K4me3 are shown.

IncRNAs Associated with the Chromatin Modifier Signature

Given the centrality of the chromatin modifier signature to the IDH COCA subtype, we explored the remaining platforms to identify non-mRNA members of the signature. As IncRNAs are relatively understudied, we sought to identify IncRNAs that correlated with the mRNA-based chromatin modifier signature. To further limit the resulting set of 66 IncRNAs, we asked which ones also tracked with the chromatin modifier signature across other TCGA datasets (cf. Figure 3F). Across eight assessed cancers, we found that 21 of the IncRNAs correlated with the chromatin modifier signature in at least six cancers (Figure S5A; Table S7). Importantly, only an estimated eight of the 66 IncRNAs were expected to correlate by chance, making 21 a significant enrichment (p = 0.01, Fisher's exact test). These findings suggest potential functions for these 21 IncRNAs, which have not previously been studied.

Additionally, we identified IncRNA clusters that correlate with immune and liver mRNA signatures (Figures S5B and S5C) in the CCA dataset. To validate these, we determined the overlap with high-stringency immune- and liver-specific IncRNAs defined from the GTEx database (Lonsdale et al., 2013): for immune-associated IncRNAs, 34/48 CCA-derived IncRNAs overlapped with 190 GTEx IncRNAs (p = 2 × 10-13), while for liver-associated IncRNAs, 25/25 CCA-derived IncRNAs overlapped with 127 GTEx IncRNAs (p = 4 ×10-13). These clusters lend support to the biological fidelity and analytical utility of the IncRNA platform and provide starting IncRNA candidates when analyzing future samples. For miRNAs, we note that miRNA-194-5p is significantly upregulated in the IDH COCA subtype and negatively correlated with the chromatin modifier signature (Figures 4 and S5D–S5H). Notably, miRNA-194 has been implicated as a suppressor of invasion in liver cancer in vitro (Meng et al., 2010). The results of these analyses collectively demonstrate the robustness of the pan-cancer and cross-platform capacity of the TCGA.

Comparison with HCC and PDAC

We next compared liver, pancreatic, and biliary cancer across the standardized multiplatform TCGA datasets to determine their molecular relationships (Borad et al., 2014). First, to improve comparisons of mutational landscapes, we conducted a meta-analysis of six iCCA sequencing studies (Churi et al., 2014; Jiao et al., 2013; Nakamura et al., 2015; Ruzzenente et al., 2016; Zou et al., 2014) including this one, totaling 458 samples (Table S8). Whereas PDAC is dominated by KRAS (92%) and TP53 (70%) mutations, KRAS/NRAS and TP53 mutations comprise only 20% and 21% of iCCAs, respectively. HCC is characterized by TERT promoter (46%) and CTNNB1 (26%) mutations, neither of which are present in iCCA; otherwise HCC shares only three genes with iCCA that are mutated at >5% (TP53, BAP1, and ARID1A).

We next applied the Tumor Map algorithm, which generates “islands” of cancers based on similarity within chosen platforms (Davis et al., 2014). Incorporating mRNA expression, copy number, and methylation data (Figure 6A), this analysis separated most HCC, PDAC, and CCA samples into their respective cancer-type islands; however, seven of 38 (18%) CCA samples were embedded in the PDAC and HCC islands, and seven of 179 (4%) HCC samples were embedded in the CCA island, suggesting that some samples have discordant histopathologic and molecular profiles.

Figure 6. Cross-Cancer Analysis Comparing TCGA Cholangiocarcinoma, HCC, and Pancreatic Adenocarcinoma.

Figure 6

(A) Tumor map analysis incorporating mRNA, methylation, and copy number showing proximity of each sample.

(B) COCA across miRNA, copy number, DNA methylation, RPPA, and Tumor Map for the three cancer types. Unsupervised clustering was performed within each data type across a cohort of 292 samples from CCA, HCC, and PDAC to derive cross-tumor subtypes (miRNA n = 4; copy number (CN) n = 5; DNA methylation (DNAmeth) n = 7; RPPA n = 6; tumor map n = 7; see Figure S6 for individual platform cluster solutions). Individual classification subtypes were then used as input for pan-tumor COCA analysis identifying three COCA classes (first bar; K1, light green; K2, dark blue; K3, light blue). Second annotation bar denotes histology type – CCA, HCCC, PDAC. Third bar indicates the CCA-specific subtype classification (ECC, METH2, Meth3, and IDH, cf. Figure 5). Fourth bar notes IDH1 mutation status (red, R172 mutations; blue, R132 mutations; black, other mutations). Fifth bar indicates samples with FGFR2 fusions. Sixth bar indicates CCA score, a median value of the 600 most-enriched genes in CCA (see C). The bottom heatmap indicates sample membership for each of the individual classification subtypes (black, subtype member; white, not a subtype member; gray, missing data). Each row is labeled by platform and subtype number.

(C) Six hundred genes enriched in cholangiocarcinoma-like HCC.

To better understand these discordances, we illustrated cluster memberships for all 292 CCA/HCC/PDAC samples (Table S9) after clustering within each molecular platform (mRNA, miRNA, RPPA, copy number, and methylation; Figures S6A–S6F). First, we noted that most of our COCA1 CCA samples (distal/hilar CCA-enriched) clustered with PDAC, consistent with the related developmental origins of the extrahepatic and pancreatic ducts from the foregut endoderm. Second, we studied in greater depth the seven HCCs that mapped with CCA. These tumors shared several molecular features with CCA, including mRNA and miRNA expression patterns, DNA methylation, and to a lesser extent copy number (Figure 6B). Moreover, they lacked TERT promoter mutations, which are a hallmark of HCC but are absent in CCA. Strikingly, five of those seven samples harbored either hotspot IDH1/2 mutations (n = 4) or an FGFR2 fusion (n = 1), and they were the only cases in the HCC dataset with these mutations. Re-examination of their histology revealed that although regions of these seven cases fall within the spectrum of HCC, each of the five tumors with IDH1 or FGFR2 lesions had some features that have also been described in iCCA, including focal to diffuse glandular differentiation, abundant fibrotic stroma (desmoplasia), and in some areas, an anastomosing architecture (Figures S6G–S6L) (Bledsoe et al., 2015; Liau et al., 2014; Nakanuma et al., 2012). Consistent with this, these samples expressed bile duct (e.g., SOX9) and hepatocellular (e.g., HNF4A, HNF1A) markers at levels within that of iCCA (Figures 6C, S6M, and S6N). Analysis of the 600 genes that are most enriched in these tumors compared with standard HCC corroborated this close transcriptional similarity to CCA (Figure 6C). These data are notable in view of accumulating evidence that CCA and HCC lie along a spectrum of primary liver carcinomas, with intermediate subsets exhibiting overlapping phenotypes. The prominent enrichment of IDH mutations in molecularly CCA-like HCCs is consistent with previous findings that IDH mutations block liver progenitor cells from undergoing hepatocyte differentiation and shift them toward a cholangiocellular fate (Saha et al., 2014).

Discussion

Taking advantage of the molecular resolution provided by multiple genomic platforms, we identified distinct mRNA, DNA methylation, and copy number subgroups that together specify biologically relevant CCA classes. In particular, we highlight an IDH-mutant-enriched class whose samples share similar profiles across these three platforms. Notably, this class exhibits high expression of mitochondrial genes, including components of the citric acid cycle and electron transport chain, accompanied by relatively high mitochondrial DNA copy number, as well as low expression of chromatin modifier genes. This anticorrelation of mitochondrial and chromatin modifier signatures appears to be a basic biological link spanning nearly all TCGA cancers and normal GTEx tissues analyzed (Figures 2F and S9), warranting deeper mechanistic studies. The anticorrelation is consistent with the hypothesis that global changes in histone acetylation and DNA methylation rates affect mitochondrial metabolism via imbalances in available pools of acetyl and methyl moieties (Martinez-Pastor et al., 2013). Relevantly, IDH mutants hypermethylate and putatively silence the ARID1A promoter, which may contribute to the lowered chromatin modifier signature expression.

Moreover, we identify a group of liver tumors with an atypical histopathology and a highly CCA-like molecular profile that is enriched for IDH mutations, consistent with the emerging view that liver tumors comprise a continuous spectrum (Fan et al., 2012; Holczbauer et al., 2013; Marquardt et al., 2015; Sekiya and Suzuki, 2012). Given the molecular and partial histologic similarity to CCA, this potential subtype may be a distinct clinical entity and strongly warrants further study into its most beneficial classification. Furthermore, the complete lack of IDH mutations in otherwise standard HCC from the TCGA set (0/172) has implications about specific functions of mutant IDH in modulating liver cell identity and also underscores the benefit of combined molecular and histopathological diagnosis. Although previous studies also identified transcriptionally CCA-like HCC (Oishi et al., 2012; Seok et al., 2012; Woo et al., 2010), our results identify IDH and FGFR2 perturbations as associated drivers linked to methylation, miRNA, and copy number similarities. Together, these findings highlight the uniqueness of IDH-mutant-driven cancers and the importance of defective chromatin regulation in the pathogenesis of CCA.

Improved molecular classification of cholangiocarcinoma is urgently needed, as heterogeneity presents a serious challenge to clinical management. Unlike cancers in which a few predominant oncogenic loci converge on a pathway, such as KRAS in PDAC or the mostly mutually exclusive BRAF, NRAS, and NF1 in melanoma, CCA is marked by a heterogeneous set of often-overlapping, lower-penetrance driver genes across diverse signaling pathways. This intertumoral heterogeneity is further exacerbated by geographically distinct molecular profiles and is modified by the presence or absence of liver fluke infestation and/or viral hepatitis, and by the anatomic location of the cancer. As examples, (1) extrahepatic CCAs have more SMAD4 mutations than iCCAs (Churi et al., 2014; Ong et al., 2012); (2) a Chinese study found a much lower incidence of IDH (5%), PBRM1 (1%), and BAP1 (1%) mutations (Zou et al., 2014) in iCCAs; and (3) liver fluke- and/or viral hepatitis-positive cancers have a higher incidence of TP53 mutations and lower incidence of IDH mutations (Chan-On et al., 2013; Ong et al., 2012; Zou et al., 2014). In this study, we focused on intrahepatic and fluke- and hepatitis-negative CCA, which minimized heterogeneity and improved the ability to apply categorizing statistics. Although larger studies are needed to validate and identify further molecular subclasses, our results provide a proof of principle that subclasses of CCAs have distinct multi-level molecular characteristics that suggest potential therapeutic approaches.

In this regard, the enriched mitochondrial gene signature and coordinate increase in mitochondrial number in IDH mutant CCA is intriguing in light of prior work implicating mutant IDH in the impairment of multiple aspects of cell metabolism (Cuyàs et al., 2015; Grassian et al., 2014; Tateishi et al., 2015). These observations include 2HG-mediated disruption of components of the mitochondrial electron transport chain, and inhibition of reductive glutamine metabolism, a wild-type IDH1 -dependent process that is important for fatty acid synthesis in cells with dysfunctional mitochondria (Fu et al., 2015; Grassian et al., 2014; Leonardi et al., 2012). The IDH mutant iCCA profile might thus reflect an adaptive response to mitochondrial dysfunction and/or an increased reliance on mitochondrial activity for tumor growth. The decrease in expression of chromatin regulators is also notable given the potential widespread effects of 2HG on epigenetic states via inhibition of TET family cytosine oxygenases and Jumonji domain family histone demethylases (Xu et al., 2011). However, the significance of this signature is difficult to interpret, since the genes contributing to the signature spanned all classes of chromatin regulators as well as 21 newly associated IncRNAs. Nevertheless, the discovery of distinct molecular features of IDH mutant iCCA is noteworthy in light of early clinical data using IDH1 -mutant-specific inhibitor (Burris et al., 2015). While promising, these data suggest that single-agent treatment with these drugs may not be sufficient to produce durable responses or remissions. Thus, targeting aspects of metabolism (e.g., using inhibitors of oxidative phosphorylation) or of chromatin regulation are tentatively suggested by our genomic findings as avenues for future research. Notably, prior work has suggested that BAP1, PBRM1, and ARID1A deficiency all result in sensitivity to EZH2 inhibition across cancer types (Kim et al., 2015; LaFave et al., 2015); the association of ARID1A methylation with IDH mutation opens the question of whether EZH2 inhibition might also be effective in this subtype. Collectively, our findings reveal distinct molecular characteristics of IDH mutant cholangiocarcinoma, offering insights and valuable multi-omics data as a springboard for future basic and translational research into this deadly disease.

Experimental Procedures

Cholangiocarcinoma (CCA) tumors were collected and shipped to a central Biospecimen Core Resource (BCR) between August 15, 2013 and January 20, 2014. Qualifying tumor samples were obtained from patients who had received no prior chemotherapy or radiotherapy treatment for their disease. Specimens were shipped overnight from 12 Tissue Source Sites (TSSs) using a cryoport that maintained an average temperature of less than 180°C. TSSs contributing biospecimens included Barretos Cancer Hospital (Barretos, Brazil); Emory University (Atlanta, GA, USA); Garvan Institute of Medical Research (Darlinghurst, NSW, Australia); ILSbio, LLC (Chestertown, MD, USA); Mayo Clinic (Rochester, MN, USA); Sapienza University of Rome (Rome, Italy); Spectrum Health (Grand Rapids, MI, USA); University of Calgary Alberta Health Services (Calgary, AB, Canada); University of California - San Francisco (San Francisco, CA, USA); University of New Mexico (Albuquerque, NM, USA); University of North Carolina (Chapel Hill, NC, USA); Wake Forest University (Winston-Salem, NC, USA).

In addition to tumor samples, each frozen primary tumor specimen had a companion normal tissue specimen (blood or blood components, including DNA extracted at the TSS). Adjacent nontumor tissue was also submitted for a subset of cases (n = 20).

Cases were staged according to the American Joint Committee on Cancer (AJCC) staging system. Pathology quality control was performed on each tumor specimen and on adjacent normal tissue (where available) from a frozen section slide prepared either by the BCR or by the TSS. H&E-stained sections from each sample were made available on to a team of independent pathologists for review to: confirm consistency with CCA histology, confirm that the adjacent tissue specimen contained no tumor cells, and annotate various standard pathological parameters. Only tumor samples with ≥60% tumor nuclei, and %20% necrosis were submitted for nucleic acid extraction.

The data and analysis results can be explored through the TCGA Data Portal (https://gdc-portal.nci.nih.gov/projects/TCGA-CHOL), the Broad Institute GDAC FireBrowse portal (http://firebrowse.org/?cohort=CHOL), and Memorial Sloan Kettering Cancer Center cBioPortal (http://www.cbioportal.org/study.do?cancer_study_id=chol_tcga#summary). Detailed experimental procedures are included in the Supplemental Experimental Procedures.

Supplementary Material

Supplemental Experimental Procedures
Tables S1-S9.

Highlights.

  • IDH mutant CCAs have distinct mRNA, copy number, and DNA methylation features

  • IDH mutant CCAs display high mitochondrial and low chromatin modifier gene expression

  • IDH mutant CCAs methylate the ARID1A promoter and show low ARID1A expression

  • Other IDH mutant liver cancers show multiplatform similarities to CCA

In Brief.

Farshidfar et al. present The Cancer Genome Atlas (TCGA) marker analysis of cholangiocarcinoma. Through multiplatform analyses, they identify a distinct subtype enriched for IDH mutants. This subtype shows increased mitochondrial and decreased chromatin modifier gene expression, including potential epigenetic silencing of ARID1A. Other IDH-mutant liver cancers molecularly resemble cholangiocarcinoma.

Acknowledgments

We are grateful to all the patients and families who contributed to this study. We thank Ina Felau for project management. This work was supported by the following grants from NIH: U54 HG003273, U54 HG003067, U54 HG003079, U24 CA143799, U24 CA143835, U24 CA143840, U24 CA143843, U24 CA143845, U24 CA143848, U24 CA143858, U24 CA143866, U24 CA143867, U24 CA143882, U24 CA143883, U24 CA144025, P30 CA016672, R01 CA180778, R01 GM109031, U24 CA210990, and by the generosity of The Cholangiocarcinoma Foundation.

Footnotes

Supplemental Information: Supplemental Information includes Supplemental Experimental Procedures, six figures, and nine tables and can be found with this article online at http://dx.doi.org/10.1016/j.celrep.2017.02.033.

Consortia: Rehan Akbani, Loretta K. Allotey, Adrian Ally, Domenico Alvaro, Jesper B. Andersen, Elizabeth L.Appelbaum, Arshi Arora, J. Todd Auman, Miruna Balasundaram, Saianand Balu, Nabeel Bardeesy, Oliver F. Bathe, Stephen B. Baylin, Rameen Beroukhim, Mario Berrios, Tom Bodenheimer, Lori Boice, Moiz S. Bootwalla, Mitesh J. Borad, Jay Bowen, Reanne Bowlby, Maria Consiglia Bragazzi, Denise Brooks, Vincenzo Cardinale, Rebecca Carlsen, Guido Carpino, Andre L. Carvalho, Roongruedee Chaiteerakij, Vishal C. Chandan, Andrew D. Cherniack, Lynda Chin, Juok Cho, Gina Choe, Eric Chuah, Sudha Chudamani, Carrie Cibulskis, Matthew G. Cordes, Kyle R. Covington, Daniel Crain, Erin Curley, Agostino Maria De Rose, Timothy Defreitas, John A. Demchok, Vikram Deshpande, Noreen Dhalla, Li Ding, Kimberley Evason, Farshad Farshidfar, Ina Felau, Martin L. Ferguson, Wai Chin Foo, Antonio Franchitto, Scott Frazer, Catrina C. Fronick, Lucinda A. Fulton, Robert S. Fulton, Stacey B. Gabriel, Johanna Gardner, Julie M. Gastier-Foster, Eugenio Gaudio, Nils Gehlenborg, Giannicola Genovese, Mark Gerken, Gad Getz, Nasra H. Giama, Richard A. Gibbs, Marie-Claude Gingras, Felice Giuliante, Gian Luca Grazi, D. Neil Hayes, Apurva M. Hegde, David I. Heiman, Julian M. Hess, Toshinori Hinoue, Katherine A. Hoadley, Andrea Holbrook, Robert A. Holt, Alan P. Hoyle, Mei Huang, Carolyn M. Hutter, Stuart R. Jefferys, Steven J.M. Jones, Corbin D. Jones, Katayoon Kasaian, Robin K. Kelley, Jaegil Kim, David E. Kleiner, Jean-Pierre A. Kocher, Lawrence N. Kwong, Phillip H. Lai, Peter W. Laird, Michael S. Lawrence, Kristen M. Leraas, Tara M. Lichtenberg, Pei Lin, Wenbin Liu, Jia Liu, Laxmi Lolla, Yiling Lu, Yussanne Ma, David Mallery, Elaine R. Mardis, Marco A. Marra, Marcus M. Matsushita, Michael Mayo, Michael D. McLellan, Autumn J. McRee, Sam Meier, Shaowu Meng, Matthew Meyerson, Piotr A. Mieczkowski, Christopher A. Miller, Gordon B. Mills, Richard A. Moore, Scott Morris, Lisle E. Mose, Catherine D. Moser, Taofic Mounajjed, Andrew J. Mungall, Karen Mungall, Bradley A. Murray, Rashi Naresh, Yulia Newton, Michael S. Noble, Daniel R. O'Brien, Akinyemi I. Ojesina, Joel S. Parker, Tushar C. Patel, Joseph Paulauskis, Chandra Sekhar Pedamallu, Robert Penny, Charles M. Perou, Amy H. Perou, Todd Pihl, Amie J. Radenbaugh, Nilsa C. Ramirez, W. Kimryn Rathmell, Ed Reznik, Suhn K. Rhie, Jeffrey Roach, Lewis R. Roberts, A. Gordon Robertson, Sara Sadeghi, Gordon Saksena, Chris Sander, Jacqueline E. Schein, Heather K. Schmidt, Steven E. Schumacher, Candace Shelton, Troy Shelton, Ronglai Shen, Margi Sheth, Yan Shi, Juliann Shih, Eve Shinbrot, Rachna Shroff, Janae V. Simons, Payal Sipahimalani, Tara Skelly, Heidi J. Sofia, Matthew G. Soloway, Hubert Stoppler, Nicolas Stransky, Josh Stuart, Qiang Sun, Angela Tam, Donghui Tan, Roy Tarnuzzer, Nina Thiessen, Leigh B. Thorne, Michael S. Torbenson, David J. Van Den Berg, Umadevi Veluvolu, Roel G.W. Verhaak, Doug Voet, Yunhu Wan, Zhining Wang, John N. Weinstein, Daniel J. Weisenberger, David A. Wheeler, Richard K. Wilson, Lisa Wise, Tina Wong, Chia-Chin Wu, Ye Wu, Liu Xi, Ju Dong Yang, Liming Yang, Jean C. Zenklusen, Hailei Zhang, Jiashan (Julia) Zhang, Siyuan Zheng, and Erik Zmuda.

Author Contributions: The TCGA consortium contributed collectively to this study. Project activities were coordinated by the National Cancer Institute and National Human Genome Research Institute Project Teams. Biospecimens were provided by the Tissue Source Sites and processed by the Biospecimen Core Resource. Data generation and analyses were performed by the Genome Sequencing Center, Genome Characterization Centers, and Genome Data Analysis centers. Initial guidance in the project design was provided by the Disease Working Group. We also acknowledge the following TCGA investigators of the Analysis Working Group, who contributed substantially to the analysis and writing of this manuscript: Project Leadership and Co-Senior Authorship, L.N.K. and L.R.R.; Data Coordination, A.J.M.; Manuscript Coordination, J.B.A.; Analysis Coordination, J.D.Y.; mRNA Analysis, F.F., Y.N., J.R., J.T.A., O.F.B., J.B.A., N.B., and L.N.K.; COCA Analyses, S.Z. and K.A.H.; DNA Sequence Analysis, M.G., K.R.C., J.M.H., N.B., and D.A.W.; Copy-Number Analysis, J.S. and A.D.C.; miRNA Analysis, A.G.R. and R.B.; DNA Methylation Analysis, T.H., S.K.R., and P.W.L.; IncRNA Analysis, E.A.G. and A.G.R.; Fusion Analysis, C.C.W., E.S., and N.S.; Protein Analysis, A.H. and R.A.; mtDNA Analysis, E.R. and C.S.; Microbe Analysis, S.S., C.S.P., and A.I.O.; Pathology and Clinical Expertise, M.J.B., V.D., T.M., W.C.F., M.S.T., and D.E.K.

References

  1. Andersen JB, Spee B, Blechacz BR, Avital I, Komuta M, Barbour A, Conner EA, Gillen MC, Roskams T, Roberts LR, et al. Genomic and genetic characterization of cholangiocarcinoma identifies therapeutic targets for tyrosine kinase inhibitors. Gastroenterology. 2012;142:1021–1031. doi: 10.1053/j.gastro.2011.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arai Y, Totoki Y, Hosoda F, Shirota T, Hama N, Nakamura H, Ojima H, Furuta K, Shimada K, Okusaka T, et al. Fibroblast growth factor receptor 2 tyrosine kinase fusions define a unique molecular subtype of cholangiocarcinoma. Hepatology. 2014;59:1427–1434. doi: 10.1002/hep.26890. [DOI] [PubMed] [Google Scholar]
  3. Bledsoe JR, Shinagare SA, Deshpande V. Difficult diagnostic problems in pancreatobiliary neoplasia. Arch Pathol Lab Med. 2015;139:848–857. doi: 10.5858/arpa.2014-0205-RA. [DOI] [PubMed] [Google Scholar]
  4. Borad MJ, Champion MD, Egan JB, Liang WS, Fonseca R, Bryce AH, McCullough AE, Barrett MT, Hunt K, Patel MD, et al. Integrated genomic characterization reveals novel, therapeutically relevant drug targets in FGFR and EGFR pathways in sporadic intrahepatic cholangiocarcinoma. PLoS Genet. 2014;10:e1004135. doi: 10.1371/journal.pgen.1004135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Borger DR, Tanabe KK, Fan KC, Lopez HU, Fantin VR, Straley KS, Schenkein DP, Hezel AF, Ancukiewicz M, Liebman HM, et al. Frequent mutation of isocitrate dehydrogenase (IDH)1 and IDH2 in cholangiocarcinoma identified through broad-based tumor genotyping. Oncologist. 2012;17:72–79. doi: 10.1634/theoncologist.2011-0386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Brat DJ, Verhaak RG, Aldape KD, Yung WK, Salama SR, Cooper LA, Rheinbay E, Miller CR, Vitucci M, Morozova O, et al. Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. N Engl J Med. 2015;372:2481–2498. doi: 10.1056/NEJMoa1402121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Burris H, Mellinghoff I, Maher E, Wen P, Beeram M, Touat M, Faris J, Azad N, Cloughesy T, Gore L, et al. Abstract PL04-05: The first reported results of AG-120, a first-in-class, potent inhibitor of the IDH1 mutant protein, in a Phase I study of patients with advanced IDH1-mutant solid tumors, including gliomas. Mol Cancer Ther. 2015;14:PL04–PL05. [Google Scholar]
  8. Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N Engl J Med. 2013;368:2059–2074. doi: 10.1056/NEJMoa1301689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chan-On W, Nairismägi ML, Ong CK, Lim WK, Dima S, Pairojkul C, Lim KH, McPherson JR, Cutcutache I, Heng HL, et al. Exome sequencing identifies distinct mutational patterns in liver fluke-related and non-infection-related bile duct cancers. Nat Genet. 2013;45:1474–1478. doi: 10.1038/ng.2806. [DOI] [PubMed] [Google Scholar]
  10. Churi CR, Shroff R, Wang Y, Rashid A, Kang HC, Weatherly J, Zuo M, Zinner R, Hong D, Meric-Bernstam F, et al. Mutation profiling in cholangiocarcinoma: Prognostic and therapeutic implications. PLoS ONE. 2014;9:e115383. doi: 10.1371/journal.pone.0115383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Covington K, Shinbrot E, Wheeler DA. Mutation signatures reveal biological processes in human cancer. bioRxiv. 2016 http://dx.doi.org/10.1101/036541.
  12. Cuyàs E, Fernández-Arroyo S, Corominas-Faja B, Rodríguez-Gallego E, Bosch-Barrera J, Martin-Castillo B, Llorens RD, Joven J, Menendez JA. Oncometabolic mutation IDH1 R132H confers a metformin-hypersensitive phenotype. 2015;6:12279–12296. doi: 10.18632/oncotarget.3733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Davis CF, Ricketts CJ, Wang M, Yang L, Cherniack AD, Shen H, Buhay C, Kang H, Kim SC, Fahey CC, et al. The somatic genomic landscape of chromophobe renal cell carcinoma. Cancer Cell. 2014;26:319–330. doi: 10.1016/j.ccr.2014.07.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fan B, Malato Y, Calvisi DF, Naqvi S, Razumilava N, Ribback S, Gores GJ, Dombrowski F, Evert M, Chen X, Willenbring H. Cholangiocarcinomas can originate from hepatocytes in mice. J Clin Invest. 2012;122:2911–2915. doi: 10.1172/JCI63212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fu X, Chin RM, Vergnes L, Hwang H, Deng G, Xing Y, Pai MY, Li S, Ta L, Fazlollahi F, et al. 2-hydroxyglutarate inhibits ATP synthase and mTOR signaling. Cell Metab. 2015;22:508–515. doi: 10.1016/j.cmet.2015.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Goyal L, Saha SK, Liu LY, Siravegna G, Leshchiner I, Ahronian LG, Lennerz JK, Vu P, Deshpande V, Kambadakone A, et al. Polyclonal secondary FGFR2 mutations drive acquired resistance to FGFR inhibition in patients with FGFR2 fusion-positive cholangiocarcinoma. Cancer Discov. 2016 doi: 10.1158/2159-8290.CD-16-1000. Published online December 29, 2016. http://dx.doi.org/10.1158/2159-8290.CD-16-1000. [DOI] [PMC free article] [PubMed]
  17. Grassian AR, Parker SJ, Davidson SM, Divakaruni AS, Green CR, Zhang X, Slocum KL, Pu M, Lin F, Vickers C, et al. IDH1 mutations alter citric acid cycle metabolism and increase dependence on oxidative mitochondrial metabolism. Cancer Res. 2014;74:3317–3331. doi: 10.1158/0008-5472.CAN-14-0772-T. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hoadley KA, Yau C, Wolf DM, Cherniack AD, Tamborero D, Ng S, Leiserson MD, Niu B, McLellan MD, Uzunangelov V, et al. Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin. Cell. 2014;158:929–944. doi: 10.1016/j.cell.2014.06.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Holczbauer A, Factor VM, Andersen JB, Marquardt JU, Kleiner DE, Raggi C, Kitade M, Seo D, Akita H, Durkin ME, Thorgeirsson SS. Modeling pathogenesis of primary liver cancer in lineage-specific mouse cell types. Gastroenterology. 2013;145:221–231. doi: 10.1053/j.gastro.2013.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jiao Y, Pawlik TM, Anders RA, Selaru FM, Streppel MM, Lucas DJ, Niknafs N, Guthrie VB, Maitra A, Argani P, et al. Exome sequencing identifies frequent inactivating mutations in BAP1, ARID1A and PBRM1 in intrahepatic cholangiocarcinomas. Nat Genet. 2013;45:1470–1473. doi: 10.1038/ng.2813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kelley RK, Bardeesy N. Biliary tract cancers: Finding better ways to lump and split. J Clin Oncol. 2015;33:2588–2590. doi: 10.1200/JCO.2015.61.6953. [DOI] [PubMed] [Google Scholar]
  22. Kim KH, Kim W, Howard TP, Vazquez F, Tsherniak A, Wu JN, Wang W, Haswell JR, Walensky LD, Hahn WC, et al. SWI/SNF-mutant cancers depend on catalytic and non-catalytic activity of EZH2. Nat Med. 2015;21:1491–1496. doi: 10.1038/nm.3968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kipp BR, Voss JS, Kerr SE, Barr Fritcher EG, Graham RP, Zhang L, Highsmith WE, Zhang J, Roberts LR, Gores GJ, Halling KC. Isocitrate dehydrogenase 1 and 2 mutations in cholangiocarcinoma. Hum Pathol. 2012;43:1552–1558. doi: 10.1016/j.humpath.2011.12.007. [DOI] [PubMed] [Google Scholar]
  24. LaFave LM, Béguelin W, Koche R, Teater M, Spitzer B, Chramiec A, Papalexi E, Keller MD, Hricik T, Konstantinoff K, et al. Loss of BAP1 function leads to EZH2-dependent transformation. Nat Med. 2015;21:1344–1349. doi: 10.1038/nm.3947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, Carter SL, Stewart C, Mermel CH, Roberts SA, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499:214–218. doi: 10.1038/nature12213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Leonardi R, Subramanian C, Jackowski S, Rock CO. Cancer-associated isocitrate dehydrogenase mutations inactivate NADPH-dependent reductive carboxylation. J Biol Chem. 2012;287:14615–14620. doi: 10.1074/jbc.C112.353946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Liau JY, Tsai JH, Yuan RH, Chang CN, Lee HJ, Jeng YM. Morphological subclassification of intrahepatic cholangiocarcinoma: Etiological, clinicopathological, and molecular features. Modern Pathol. 2014;27:1163–1173. doi: 10.1038/modpathol.2013.241. [DOI] [PubMed] [Google Scholar]
  28. Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, Hasz R, Walters G, Garcia F, Young N, et al. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Losman JA, Kaelin WG., Jr What a difference a hydroxyl makes: Mutant IDH, (R)-2-hydroxyglutarate, and cancer. Genes Dev. 2013;27:836–852. doi: 10.1101/gad.217406.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Marquardt JU, Andersen JB, Thorgeirsson SS. Functional and genetic deconstruction of the cellular origin in liver cancer. Nat Rev Cancer. 2015;15:653–667. doi: 10.1038/nrc4017. [DOI] [PubMed] [Google Scholar]
  31. Martinez-Pastor B, Cosentino C, Mostoslavsky R. A tale of metabolites: The cross-talk between chromatin and energy metabolism. Cancer Discov. 2013;3:497–501. doi: 10.1158/2159-8290.CD-13-0059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Meng Z, Fu X, Chen X, Zeng S, Tian Y, Jove R, Xu R, Huang W. miR-194 is a marker of hepatic epithelial cells and suppresses metastasis of liver cancer cells in mice. Hepatology. 2010;52:2148–2157. doi: 10.1002/hep.23915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Nakamura H, Arai Y, Totoki Y, Shirota T, Elzawahry A, Kato M, Hama N, Hosoda F, Urushidate T, Ohashi S, et al. Genomic spectra of biliary tract cancer. Nat Genet. 2015;47:1003–1010. doi: 10.1038/ng.3375. [DOI] [PubMed] [Google Scholar]
  34. Nakanuma Y, Sato Y, Ikeda H, Harada K, Kobayashi M, Sano K, Uehara T, Yamamoto M, Ariizumi S, Park YN, et al. Intrahepatic cholangiocarcinoma with predominant “ductal plate malformation” pattern: A new subtype. Am J Surg Pathol. 2012;36:1629–1635. doi: 10.1097/PAS.0b013e31826e0249. [DOI] [PubMed] [Google Scholar]
  35. Oishi N, Kumar MR, Roessler S, Ji J, Forgues M, Budhu A, Zhao X, Andersen JB, Ye QH, Jia HL, et al. Transcriptomic profiling reveals hepatic stem-like gene signatures and interplay of miR-200c and epithelial-mesenchymal transition in intrahepatic cholangiocarcinoma. Hepatology. 2012;56:1792–1803. doi: 10.1002/hep.25890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Ong CK, Subimerb C, Pairojkul C, Wongkham S, Cutcutache I, Yu W, McPherson JR, Allen GE, Ng CC, Wong BH, et al. Exome sequencing of liver fluke-associated cholangiocarcinoma. Nat Genet. 2012;44:690–693. doi: 10.1038/ng.2273. [DOI] [PubMed] [Google Scholar]
  37. Palmer WC, Patel T. Are common factors involved in the pathogenesis of primary liver cancers? A meta-analysis of risk factors for intrahepatic cholangiocarcinoma. J Hepatol. 2012;57:69–76. doi: 10.1016/j.jhep.2012.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Parker SJ, Metallo CM. Metabolic consequences of oncogenic IDH mutations. Pharmacol Ther. 2015;152:54–62. doi: 10.1016/j.pharmthera.2015.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Razumilava N, Gores GJ. Cholangiocarcinoma. Lancet. 2014;383:2168–2179. doi: 10.1016/S0140-6736(13)61903-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Reznik E, Miller ML, Senbabaoglu Y, Riaz N, Sarungbam J, Tickoo SK, Al-Ahmadie HA, Lee W, Seshan VE, Hakimi AA, Sander C. Mitochondrial DNA copy number variation across human cancers. eLife. 2016;5:e10769. doi: 10.7554/eLife.10769. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Ross JS, Wang K, Gay L, Al-Rohil R, Rand JV, Jones DM, Lee HJ, Sheehan CE, Otto GA, Palmer G, et al. New routes to targeted therapy of intrahepatic cholangiocarcinomas revealed by next-generation sequencing. Oncologist. 2014;19:235–242. doi: 10.1634/theoncologist.2013-0352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ruzzenente A, Fassan M, Conci S, Simbolo M, Lawlor RT, Pedrazzani C, Capelli P, D'Onofrio M, Iacono C, Scarpa A, Guglielmi A. Cholangiocarcinoma heterogeneity revealed bymultigene mutational profiling: Clinical and prognostic relevance in surgically resected patients. Ann Surg Oncol. 2016;23:1699–1707. doi: 10.1245/s10434-015-5046-6. [DOI] [PubMed] [Google Scholar]
  43. Saha SK, Parachoniak CA, Ghanta KS, Fitamant J, Ross KN, Najem MS, Gurumurthy S, Akbay EA, Sia D, Cornella H, et al. Mutant IDH inhibits HNF-4α to block hepatocyte differentiation and promote biliary cancer. Nature. 2014;513:110–114. doi: 10.1038/nature13441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Saha SK, Zhu AX, Fuchs CS, Brooks GA. Forty-year trends in cholangiocarcinoma incidence in the U.S.: Intrahepatic disease on the rise. Oncologist. 2016;21:594–599. doi: 10.1634/theoncologist.2015-0446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Schulze K, Imbeaud S, Letouzé E, Alexandrov LB, Calderaro J, Rebouissou S, Couchy G, Meiller C, Shinde J, Soysouvanh F, et al. Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets. Nat Genet. 2015;47:505–511. doi: 10.1038/ng.3252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Sekiya S, Suzuki A. Intrahepatic cholangiocarcinoma can arise from Notch-mediated conversion of hepatocytes. J Clin Invest. 2012;122:3914–3918. doi: 10.1172/JCI63065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Seok JY, Na DC, Woo HG, Roncalli M, Kwon SM, Yoo JE, Ahn EY, Kim GI, Choi JS, Kim YB, Park YN. A fibrous stromal component in hepatocellular carcinoma reveals a cholangiocarcinoma-like gene expression trait and epithelial-mesenchymal transition. Hepatology. 2012;55:1776–1786. doi: 10.1002/hep.25570. [DOI] [PubMed] [Google Scholar]
  48. Sia D, Losic B, Moeini A, Cabellos L, Hao K, Revill K, Bonal D, Miltiadous O, Zhang Z, Hoshida Y, et al. Massive parallel sequencing uncovers actionable FGFR2-PPHLN1 fusion and ARAF mutations in intrahepatic cholangiocarcinoma. Nat Commun. 2015;6:6087. doi: 10.1038/ncomms7087. [DOI] [PubMed] [Google Scholar]
  49. Tateishi K, Wakimoto H, Iafrate AJ, Tanaka S, Loebel F, Lelic N, Wiederschain D, Bedel O, Deng G, Zhang B, et al. Extreme Vulnerability of IDH1 Mutant Cancers to NAD+ Depletion. Cancer Cell. 2015;28:773–784. doi: 10.1016/j.ccell.2015.11.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Valle J, Wasan H, Palmer DH, Cunningham D, Anthoney A, Maraveyas A, Madhusudan S, Iveson T, Hughes S, Pereira SP, et al. Cisplatin plus gemcitabine versus gemcitabine for biliary tract cancer. N Engl J Med. 2010;362:1273–1281. doi: 10.1056/NEJMoa0908721. [DOI] [PubMed] [Google Scholar]
  51. Varadhachary GR, Raber MN. Cancer of unknown primary site. N Engl J Med. 2014;371:757–765. doi: 10.1056/NEJMra1303917. [DOI] [PubMed] [Google Scholar]
  52. Wang P, Dong Q, Zhang C, Kuan PF, Liu Y, Jeck WR, Andersen JB, Jiang W, Savich GL, Tan TX, et al. Mutations in isocitrate dehydrogenase 1 and 2 occur frequently in intrahepatic cholangiocarcinomas and share hypermethylation targets with glioblastomas. Oncogene. 2013;32:3091–3100. doi: 10.1038/onc.2012.315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Woo HG, Lee JH, Yoon JH, Kim CY, Lee HS, Jang JJ, Yi NJ, Suh KS, Lee KU, Park ES, et al. Identification of a cholangiocarcinoma-like gene expression trait in hepatocellular carcinoma. Cancer Res. 2010;70:3034–3041. doi: 10.1158/0008-5472.CAN-09-2823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Xu W, Yang H, Liu Y, Yang Y, Wang P, Kim SH, Ito S, Yang C, Wang P, Xiao MT, et al. Oncometabolite 2-hydroxyglutarate is a competitive inhibitor of α-ketoglutarate-dependent dioxygenases. Cancer Cell. 2011;19:17–30. doi: 10.1016/j.ccr.2010.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Zou S, Li J, Zhou H, Frech C, Jiang X, Chu JS, Zhao X, Li Y, Li Q, Wang H, et al. Mutational landscape of intrahepatic cholangiocarcinoma. Nat Commun. 2014;5:5696. doi: 10.1038/ncomms6696. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Experimental Procedures
Tables S1-S9.

RESOURCES