Abstract
Background
Massive chromosomal aberrations are a signature of advanced cancer, although the factors promoting the pervasive incidence of these copy number alterations (CNAs) are poorly understood. Gatekeeper mutations, such as p53, contribute to aneuploidy, yet p53 mutant tumors do not always display CNAs. Uterine Corpus Endometrial Carcinoma (UCEC) offers a unique system to begin to evaluate why some cancers acquire high CNAs while others evolve another route to oncogenesis, since about half of p53 mutant UCEC tumors have a relatively flat CNA landscape and half have 20–90% of their genome altered in copy number.
Methods
We extracted copy number information from 68 UCEC genomes mutant in p53 by the GISTIC2 algorithm. GO term pathway analysis, via GOrilla, was used to identify suppressed pathways. Genes within these pathways were mapped for focal or wide distribution. Deletion hotspots were evaluated for temporal incidence.
Results
Multiple pathways contributed to the development of pervasive CNAs, including developmental, metabolic, immunological, cell adhesion and cadmium response pathways. Surprisingly, cadmium response pathway genes are predicted as the earliest loss events within these tumors: in particular, the metallothionein genes involved in heavy metal sequestration. Loss of cadmium response genes were associated with copy number changes and poorer prognosis, contrasting with 'copy number flat' tumors which instead exhibited substantive mutation.
Conclusion
Metallothioneins are lost early in the development of high CNA endometrial cancer, providing a potential mechanism and biological rationale for increased incidence of endometrial cancer with cadmium exposure. Developmental and metabolic pathways are altered later in tumor progression.
Introduction
Cancer genomes can evolve through acquisition of favorable mutations, copy number variations, or both. Determination of those mutations which recur on individual genes or even codons of individual amino acids within a gene has aided the development of targeted therapeutics, such as Vemurafenib to target BRAF V600E mutant tumors. The efficacy of targeting of mutations is improved for those mutations which arise early in tumor development (“trunk” mutations) and thus occupy the vast majority of tumor cells [1]. However, not all tumors have a clear trunk mutation or they have trunk mutations which cannot currently be targeted. For example, the mutational signature of serous ovarian cancers clustered away from 11 other cancer types due to the lack of any tumor suppressor or oncogene mutation other than p53 [2].
Copy number alterations (CNAs) are another route to oncogenesis. Some dramatic alterations, such as massive amplification of HER2 [3], have been clearly linked to tumor promoting roles. Yet again many tumors do not present such clearly oncogenic alterations of a single locus, but rather have much smaller alterations spread throughout their genome. Chromosome arms are allelically deleted or amplified based on their content of modest-effect tumor suppressors or oncogenes, respectively [4]. Mutation or loss of well-known tumor suppressors p53 [5, 6] or BRCA1/2 [7] can contribute to the initiation of widespread aneuploidy and CNAs, however p53 or BRCA mutant tumors do not always display a largely altered copy number landscape. There must be other genetic changes which enable certain tumors to acquire a larger burden of CNAs.
One cancer which allows an identification of these genetic changes which create a varied CNA profile is endometrial cancer. Endometrial cancer has been extensively sequenced by The Cancer Genome Atlas (TCGA) research consortium [8]. The majority of endometrial cancers are curable [9], however a class of Type II endometrial tumors is much more aggressive and less likely to respond to chemotherapy. The TCGA study found aggressive tumors could be identified by unbiased clustering of copy number variation within the tumors. Those tumors with the highest amount of copy number variation indicated the worst prognosis of patients studied in the TCGA cohort [8]. Interestingly, many of these aggressive, copy number variable tumors contained a p53 mutation, but many other p53 mutant tumors had a relatively flat copy number landscape. Since both CNA variable and CNA stable p53 mutant tumors were sequenced, we undertook a study to determine what changes could be contributing to the large copy number variation of aggressive tumors.
Here we investigated the genetic pathways altered by copy number between TP53 mutant uterine endometrial carcinoma tumors with high overall copy number diversity and those tumors with low copy number diversity. The enriched gene sets identified which pathways are involved in the creation of an endometrial tumor’s genetic diversity.
Results
Evaluation of low CNA vs. high CNA populations among TP53 mutant tumors
For these studies, we used the TCGA Uterine Corpus Endometrial Carcinoma dataset [8], containing 548 patient-tumor pairs. Among the 242 patient tumors with both copy number and mutational data, we focused on the 68 tumors bearing TP53 mutations (S1 Table). To evaluate specific enrichments among patients with high genomic copy number alterations (CNAs), these tumors were then parsed into those with higher than median CNAs and those with lower than median CNAs (Fig 1). We refer to tumors with p53 mutation and CNAs at high levels as “PMCH” and the tumors with p53 mutation and CNAs at low levels as “PMCL”.
CNA instability has been suggested to be caused by a loss of a tumor suppressor rather than a gain of an oncogene, such as p53 loss or BRCA1/2 loss. Our analysis focused on genes enriched for losses in PMCH tumors. Ploidy calls derived from the GISTIC2 algorithm [10] for individual gene gain/loss events were used for each gene in each tumor. Our standardized equation (see Materials and Methods) yielded scores for each gene which could be used to rank those genes most often lost in PMCH tumors relative to PMCL tumors. Since monoallelic loss of genes may contribute to tumor formation in a cumulative manner [4], this gene list was then analyzed for statistical enrichments in known molecular pathways. Genes enriched for losses were evaluated via GOrilla [11] for GO term pathway enrichment analysis.
Key pathways with enhanced depletion among the high CNA TP53 mutant tumors
A complete list of significant GO terms enriched for losses in PMCH tumors are available in S2 Table. For a detailed description of the statistics used, please refer to materials and methods (section: pathway analysis) and the publication describing the GOrilla statistical tool [11]. The key highlighted pathways are described below with altered genes briefly noted. Many genes are involved in multiple pathways. The interrelation of these pathways is represented in Fig 2.
Immune System
The somatic alteration patterns analyzed in this study are specific for the tumor and are not alterations in the peripheral blood. Nonetheless, the most significantly enriched pathway for deletions in endometrial tumor cells was that of the immune response. Genes involved in the natural killer cell (IFNW1, IFNA5, IFNA6, IFNA7, IFNA8, IFNK, IFNA1, IL12B, IFNA2, IFNA4, IFNA21, IFNA17, IFNA13, IFNA10, IFNA16, IFNA14, IFNE, IFNB1, and RAB27A), B cell (GAPT, CD180, PIK3R1, FNP1, CHRFAM7A, ONECUT1, and GPR97), and T cell response (in addition to the interferon genes above, IL4 and RORA) were significantly enriched for deletion/losses among high CNA tumors. Statistical enrichment was q< 3.4x10-11.
Cell-Cell Adhesion
The second most enriched pathway for deletions in high CNA TP53 mutant tumors was that of cell-cell adhesion. Suppressed genes included those in the protocadherin family of integrins (PCDH genes), FAT2 and DLL4 (which interact with NOTCH signaling), tyrosine kinases BLK and PEAK1, ECM2 (a female specific ECM protein), EGFL7 (an endothelial marker and migration regulator), FERMT2 (an ECM protein), B2M (a MHC I protein often overexpressed in cancer and which can promote metastasis), NEDD4 (an E3 ubiquitin ligase which negatively regulates PTEN levels), NEO1 (a FAK interaction protein), NPTN (neuroplastin), CD276 (an immune checkpoint protein), ITGAE (an E-cadherin receptor involved in autoimmunity), and PSTPIP1 (an autoinflammatory response protein). Statistical enrichment was q< 8.4x10-11
Development
The broad class of developmental genes were also prevalent among the high CNA mutant TP53 tumors. Nervous system development (KIF2A, SMN1, NAIP, ENC1, GFRA3, CSGALNACT1, GFRA2, APBA1, LPPR1, FKTN, GPSM1, CHRNA3), mitotic cytokinesis (SNX18, RASA1, KIF20A, NUSAP1, USP8, KIF23, SNX33), and general development genes (HEXB, VCAN, HAPLN1, NME5, PCSK5, BARX1, HSD17B3, ANP32B, FBN1, FOXB1, SMAD3, BBS4, STRA6, ACAN, HAPLN3) were enriched for deletions. Statistical enrichment was q< 7.5x10-6.
Metabolism
While less statistically significant, metabolism genes had the greatest overall incidence of losses of any pathway in this analysis. Specifically, metabolic GO terms enriched for deletion included Coenzyme A metabolism (MCCC2, HMGCR, and PPCDC), dsRNA metabolism (TMEM173, CARD9, GRIN2A), oxoacid metabolism (CKMT2, ACTO12, ABCA2, EFTA, ABAT), serine phosphorylation (FNIP2, TXN), nitrogen response (many genes), organic cyclic compound response (many genes), S-methylmethionine metabolism (BHMT and BHMT2), and hormone level regulation (CARTPT, FOXD1, GCNT4, PCSK1) including serotonin (HTR1A, RNF180). Statistical enrichment was q< 2.0x10-5.
Cadmium Response
The gene losses in the GO term “cadmium response” (CR) include a swath of genes regulated by heavy metal ions and distributed across the genome; some are in hot spots with other pathway deletions (Fig 3). These regions include 5q (SPARC deletion), 9q (GSN deletion), 15q (SORD, CYP11A1, and CYP1A2 deletions), and a region on 16q containing seven metallothionein genes (MT3, MT1E, MT1A, MT1F, MT1G, MT1H, and MT1X deletions). SPARC, also known as osteonectin, may be an adhesion suppressing gene and has been linked to cadmium by its specific downregulation as noted following oncogenic exposures of cadmium in bladder cancer [12]. It is expressed, however, following exposure to cadmium in tissues sensitive to the metal [13]. Gelsolin (GSN) is an actin cytoskeleton regulatory protein with severing and nucleating functions that is normally dependent on calcium [14], and translocates to the cytoskeleton following cadmium administration [15] to sever and disrupt cytoskeletal filaments [16, 17]. Sorbitol dehydrogenase (SORD) converts sorbitol to fructose. Its levels are upregulated by cadmium exposure [18], possibly due to the fact that the enzymatic activity is decreased by cadmium exposure [19]. The cytochromes Cyp11a1 and Cyp1a2 are involved in detoxification of numerous xenobiotics. The metallothionein (MT) proteins have a high affinity for cadmium and sequester the majority of cadmium in the body [20], including in endometrial cells [21]. The loss of 16q in CNA high uterine endometrial carcinomas induces a loss of a cluster of seven of these metallothionein genes, suggesting a substantial suppression in the cells’ ability to cope with heavy metal exposure. Statistical enrichment was q< 3.1x10-3.
Genes with CNA display both focal and widespread chromosomal distributions
Whole chromosome and chromosome arm losses are known, widely prevalent mechanisms for CNAs in cancer. These gross losses and gains are thought to encompass multiple tumor suppressors and oncogenes to cumulatively affect cell cycle dysregulation and other tumor formation characteristics [4]. In vivo studies have validated that knockdown of genes within gene clusters independently contribute to oncogenesis[22]. Even the most commonly inactivated tumor suppressor, TP53, may be commonly deleted due to neighboring tumor suppressors EIF5A and ALOX15B[23]. Since some biological pathways utilize many genes within a short region of the genome in arrayed gene sets, it is possible some of our GO term enrichments were found due to local clustering. To address this possibility, we mapped the genes enriched for loss in PMCH tumors to their chromosomal locations and graphed their distribution (Fig 3). We found that the pathways most enriched for gene loss spanned regions across multiple chromosomes, suggesting that the metabolic, developmental, cell-cell adhesion, and immune response pathways involve losses of more than just one chromosome or chromosome arm.
However, the CR pathway enriched by GO term analysis contained 7 out of 12 total genes in a focal array on cytoband 16q13. All these genes encode metallothionein proteins involved in the sequestering of cadmium and other xenobiotic heavy metals within the cell. At a first glance, we formed the hypothesis that because of this focal arrangement of genes, the CR pathway may be a false positive from the GO term enrichment analysis. However, there has been much debate in epidemiology regarding how exposure to cadmium influences endometrial cancer incidence. A 2008 study provocatively suggested cadmium contaminated vegetables and cereals increases endometrial cancer risk by 39% [24]. Follow-up by two prospective cohort studies suggested a positive trend of cadmium exposure as a risk factor for endometrial cancer in women with a BMI<25 in one [25], while the other found no association [26] possibly due to apparent difficulty in accurately estimating biological exposure to environmental cadmium. A meta-analysis of these studies and others concluded cadmium exposure is a risk factor in subsets of the population [27]. From a molecular biology standpoint, cadmium has been described as a mutagen in part due to its creation of reactive oxygen species [28], mismatch repair inhibition [29], and from its ability to cause double-stranded DNA breaks [30, 31]. Taking these previous findings into consideration, we decided not to rule out the CR pathway as a possible contributor to PMCH tumor formation and continued to incorporate these genes into subsequent analyses.
Early vs. Late Loss Events in Tumor Development
The relative prevalence of genetic alterations within a tumor sample can help indicate if an alteration occurred ‘early’ or ‘later’ during the development and adaptation of a tumor. Mutations, in particular, are widely used to determine “trunks” of the oncogenic development tree of a tumor; those mutations which are prevalent in near 100% of the tumor sample are more likely to have occurred early, and those of lower prevalence likely occur later in tumor development. In technical terms, mutation prevalence is calculated as a fraction of sequence reads; the smaller the fraction the fewer tumor cells are presumed to contain the mutation. Similarly, prevalence of copy number changes may be inferred from the log2 ratio of signal resulting from SNP array data targeting a gene: the higher the signal the fewer cells have acquired or retained the gene deletion. However, this can be confounded by the fact that it is difficult to determine if a loss is homozygous in half of the sample or heterozygous for the entirety of the sample. However, if alterations are mostly heterozygous, as determined by the GISTIC2 algorithm [10], then approximations of earlier tumorigenic changes can be calculated.
This assumption proved to be reasonable; all genes in Figs 2 and 3 enriched for losses had a frequency of homozygous deletions in <1% of tumors, with the exception of deletions on chromosome 8p. Chromosome 8p is already known to contain many tumor suppressors, such as DLC1 and FGL1 [22]. To find novel tumor suppressors, we therefore constructed (see Materials and Methods) a ranked intratumor temporal loss series of all genes enriched for losses by GO pathway analysis (S1 Fig). A consolidated view of these genes is presented in Fig 4.
The deep loss of region 16q13 encompasses the seven metallothionein genes as well as other potential tumor suppressors, such as mIR-138-2 [32], indicating that loss of this region appears to be an early event in PMCH tumors and likely contributes to the formation of further copy number variation. The next most present loss was 17p13, which incorporates just five genes enriched by GO pathway analysis: P2RX5 and P2RX1, encoding ligand gated Na+/Ca2+ ion channels regulated by purinergic signaling, TRPV1, a non-selective ion channel, ITGAE, an integrin involved in E-cadherin ligation, and C1QBP, a poly-phenotypic protein that is commonly associated with tumor progression [33] but also involved in increased glycolysis and the Warburg effect [34]. Interestingly, the P2X receptor activity is either augmented or blocked by bound cadmium, depending on the receptor, suggesting a possible interaction with the cadmium response [35, 36].
The majority of regions were lost at similar rates across cells in a tumor (Fig 4), suggesting once CNAs variation has been enabled, a number of pathways then enable tumor cells to undergo positive Darwinian selection. These regions were involved in all enriched aspects of PMCH tumors, including metabolism, development, and cell-cell interactions. Somewhat unexpectedly, some of the latest events in tumor progression incorporated caspase-3, an executioner caspase downstream of apoptosis initiation. Cadherin deletions on 5q31 (including 31 members of the protocadherin family) were the least prevalent losses in the tumor cell population, consistent with the idea that metastatic ability increases later in tumor development. Since the tumors analyzed were primary tumors extracted prior to therapy, one might have tested the prediction that cadherin deletions are much more enriched in metastatic tumor samples. However, these data were not available.
Effect of cadmium response gene loss is specific to CNAs, not mutations
Since metallothionein genes were the earliest loss events in our analysis, we characterized the capacity of CR gene loss to promote ongoing tumor mutagenesis. To address whether or not CR gene loss acts to alter mutation rates in addition to copy number alterations, we utilized the COSMIC database [37] to compare mutation types and rates between cadmium gene suppressed patients and those patients without CR gene loss in the TP53 mutant cohort.
We found that early loss of CR was associated with decreased mutagenesis relative to tumors that maintained CR genes. In fact, the tumors which did not exhibit losses of CR genes had far higher mutation rates across all mutation types (Fig 5A), consistent with POLE and microsatellite mutations driving tumorigenesis in many endometrial cancer patients[8]. The spectrum of mutations did not change in these cohorts, although there was a slight statistically significant enhancement of C>G mutations in tumors with CR gene loss (Fig 5B).
In contrast to the mutation rates observed, we confirmed the strong correlation between cadmium response gene loss and fraction of the genome which was altered in UCEC tumors. About 36% of the tumor genome had copy number changes in the CR gene loss TP53 mutant cohort, while only 13% was altered in TP53 mutant tumors without CR losses (p< 0.01 by Wilcoxon rank-sum test) (Fig 5C). Taken together, these results suggest loss of genomic regions containing CR genes most likely results in copy number instability rather than mutational instability. These changes are clinically significant; patients with losses in CR genes have nearly a doubled hazard ratio resulting in negative prognosis (Fig 5D). Such data fit the histopathological understanding that the serous subtype is generally more lethal. Accordingly: 71% of the PMCH cohort were annotated within the serous histological subtype, whereas only 47% of the PMCL cohort were serous, suggesting copy number variation contributes to the serous phenotype of p53 mutant endometrial tumors.
Discussion
Here we investigated the genomic cause for copy number variation in the most aggressive subtype of endometrial cancer. There was a divergence in TP53 mutant tumors: those which evolved primarily by acquiring mutations (PMCL cohort) and those which evolved in part by acquiring copy number changes (PMCH cohort). Our gene pathway analysis of regions enriched for deletions in PMCH tumors revealed that immunological, metabolic, developmental, adhesion and matrix signaling, and cadmium response pathways are enriched for deletions. The CR pathway loss, in particular the loss of metallothionein genes, occurred earlier in tumor evolution than any other regional loss. Tumors with genetically suppressed CR largely did not alter their mutational spectrum or increase mutational load, but rather increased the amount of copy number changes by 186% (Fig 5).
While we were expecting to find more canonical DNA maintenance pathways enriched in our GO term datasets, we were surprised to find CR genes as the earliest hit and a prognostic indicator of overall survival. Cadmium is a well-known carcinogen. The influence of cadmium on endometrial cancer incidence has been somewhat controversial; epidemiology studies have found correlations of increased incidence of endometrial cancer with high dietary exposure to cadmium in some [24, 25], but not other [26] prospective studies. A common suggestion in the cases which did not find a significant association (there was always a trend) is an inability to accurately measure cadmium exposure. Our results suggest that the patients with high copy number variation and poor prognosis have experienced an early evolutionary event in their tumor cells which have suppressed endometrial cells’ ability to sequester cadmium within metallothioneins. In addition, this provides a biological mechanism for how endometrial cancer and cadmium exposure may interact. Cadmium can induce aneuploidy in multiple model systems [38], including mouse oocytes [39]. It may do so by increasing the amount of reactive oxygen species, which in endometrial cancer would be further exacerbated by cytochrome deletions. Cells in the uterus with a loss in the CR pathway would selectively uptake cadmium without any sequestration from molecular machinery, allowing this carcinogen to wreak further havoc and initiate oncogenesis. While this link of metallothionein proteins to cadmium appears important, it should be noted that metallothioneins sequester a number of heavy metal ions which may impact endometrial biology [40, 41]. Metallothioneins also act to buffer zinc [42], impacting both enzymatic and transcriptional activities within the cell. [43]. Metallothioneins may also regulate oncogenesis in other cancer types such as serous ovarian cancer and breast cancer, which are deleted for metallothioneins in 68% and 59% of cases, respectively [44].
Since the analysis presented here is observational, it remains to be directly shown that polygenic pathway losses in UCEC are causal for a phenotype. There are, however, many documented cases where loss of a single allele of a single gene, without a mutation in the other allele, produces an increase in tumorigenesis. PTEN loss is present in many cancer types, and mouse studies have a demonstrable increase in spontaneous tumor formation [45]. BECN1, an autophagy initiation gene also involved in ploidy homeostasis [46], increases tumor formation when monoallelically deleted [47, 48]. The RNAi knockdown of multiple chromosome 8p tumor suppressor homologues in mice can incur oncogenesis [22]. From our study, we would predict loss of metallothionein genes to sensitize mice or human tissue to oncogenesis from cadmium exposure.
While loss of CR appeared earliest in the tumor, the remainder of the enriched pathways are in fractions of cells and may contribute to tumor presentation and micrometastasis. Interferons are normally expressed in cells infected by viruses or bacteria to recruit the immune system via MHC presentation to cytotoxic T cells [49]. A simultaneous deletion of 17 of these interferons occurs in PMCH tumors and may assist immune evasion. Metabolic and developmental alterations may allow for further growth of the tumor in hypoxic or physically constrained environments.
One advantage of monoallelic changes for tumor cells is an ability to retain oncogenes for later stages of tumor development, including metastasis. For example, loss of chromosome 9q34 removes tumor suppresser genes TSC1 (an mTOR pathway inhibitor [50]), NOTCH1 (a development and stemness regulator [51]), and GRIN1 (a calcium regulating tumor suppressor [52]). But deletion of 9q34 also removes ABL1, a potent oncogene when upregulated [53], as well as RXRA, an oncogene which can be activated by SUMOylation [54]. However, since both alleles are not deleted, these oncogenes are not completely removed and can be epigenetically or post-translationally up-regulated if necessary. This is analogous to the gene BECN1 and the regulation of autophagy. Beclin, the product of BECN1, promotes the formation of autophagosomes and recycling of cellular constituents. BECN1 is a haploinsufficient tumor suppressor [47, 48], yet once a tumor forms and expands, tumors require autophagy for essential metabolites, especially in hypoxic or otherwise nutrient-poor tumors [55, 56]. Examples like 9q34 show how a relatively short region of a single chromosome can contain many tumor suppressors and oncogenes, and each tumor regulates their proportional dosage to favor growth.
Challenges surrounding genomic cancer experiments are progressing from the initial simple difficulty of obtaining genetic information to those difficulties associated with interpretation and applying genomic information [57]. This study supports the idea that pathway analysis from the entire genome, rather than individual gene or chromosome arm analysis, contributes to our understanding of the pathogenesis of cancers. Future studies will be needed to determine causality regarding the potential tumor suppressor genes and pathways found here to be enriched for losses.
Materials and Methods
Patient cohort
The NCI Cancer Genome Atlas cohort was utilized for this study, due to its consistent data structure and large sample size [8]. Tumors incorporated into their study post-publication were included in this analysis, for a total of 548 patients, 242 of whom had both mutation and CNA data, and 68 of these with a TP53 mutation. These patients were defined as having a diagnosis of endometrioid adenocarcinoma or endometrioid serous carcinomas. All patients must have required surgery which extracted the tumor, and no patients at the time of surgery had detectable metastatic lesions. No prior chemotherapy or radiotherapy was allowed for incorporation into this dataset. Somatic alteration determination was made possible by paired sequencing of normal tissue from the patient, either from blood or from tissue >2cm from the tumor. We defined tumors with p53 mutation and CNAs at higher than median levels as “PMCH” and the tumors with p53 mutation and CNAs at lower than median levels as “PMCL”. Median age of patients with PMCH and PMCL tumors were both 68 years.
Mutation and copy number alteration (CNA) determination
All data were extracted and analyzed from The Cancer Genome Atlas (TCGA) consortium’s sequencing of Uterine Corpus Endometrial Carcinoma (UCEC) patient-tumor pairs, as maintained in the UCSC Cancer Genome Browser. The provisional dataset of 548 tumors, 242 of which had both CNA and mutation data, were downloaded and analyzed for this study. TP53 mutant tumors were identified as those annotated with a p53 mutation in cBioPortal [44], which were 68 tumors. The total fraction of the genome altered (Fig 5C) were downloaded from the curated values within the clinical characteristics in cBioPortal, as were survival data. Survival curves compared patients with 16q13, 9q34, or 15q24 losses (cadmium response regions), or those without these losses, independent of p53 status. TCGA copy number signals were generated on Affymetrix SNP 6.0 arrays and run through the Broad Institute pipeline [58] to determine CNVs, and we used these data as curated by the UCSC Cancer Genome Browser [59]. GISTIC2 copy number calls were used to assess allelic losses [10]. SNP data were input into the Integrated Genomics Viewer [60] to generate the loss/amplification panels of PMCH and PMCL tumors in Fig 1.
Assessment of enriched losses in PMCH tumors
We used the following equation to determine loss enrichment scores for a given gene α:
where gene scores are summed from all patient tumors within a genotype group PMCH or PMCL. αGISTIC scores are gene specific GISTIC2 [10] scores which range from: -2 (biallelic or complete loss), -1 (allelic loss), 0 (no change), 1 (allelic amplification), 2 (biallelic or more amplification). This strategy incorporated an equal weight against gains of tumor suppressors, since genes acting as dosage dependent tumor suppressors in the development of PMCH tumors should have a reduced frequency of gain events in conjunction with an increased frequency of loss events. Therefore, a gene with a high αLOSS score is a gene deleted in many more PMCH tumors than PMCL tumors, and a gene with a very low αLOSS score is a gene deleted more often in PMCL tumors or a gene amplified in PMCH tumors.
Pathway analysis
Ranked αLOSS scores were then input directly in GOrilla [11] and output GO terms are summarized in S1 Table. This strategy results in GO term hits for pathways which contain genes commonly deleted in PMCH tumors relative to PMCL tumors. Since we utilized a ranked gene list based on genes lost in decreasing enrichment in PMCH tumors, the hypergeometric score output from the algorithm was used. Only those GO terms with a false discovery rate adjusted q value of <0.05 were placed in S1 Table. GOrilla improves on some competitor GO term analyses from its ability to incorporate a user-based background gene set, rather than an arbitrary background list of annotated genes, in its calculations for enrichment. Our background set included all genes with copy number data throughout the TCGA UCEC study. GO terms were manually placed into the five overarching groups using annotated hierarchies from the Gene Ontology consortium [61]. For any gene within more than one enriched GO term, a literature search was performed to determine which pathway color is most relevant, although multiple-pathway annotations are retained in the network depiction in Fig 2 as multiple edges. Cytoscape was used to produce the graphical network display in Fig 2.
Genomic location mapping
Whole genome gene IDs were input into bioDBnet [62] and converted into Ensembl annotated chromosome locations. Centromere locations were downloaded from the UCSC human genome browser (hg19). Genes output from GOrilla analysis were then sliced from this dataset and graphed by Microsoft Excel.
Temporal assessment of chromosome region losses
Raw reference normalized SNP signal was downloaded from the UCSC Cancer Genome browser, sliced for those tumors and genes under evaluation, and compared to GISTIC2 copy number calls. All GISTIC2 calls of -1 in any tumor were included in the analysis. SNP values were used to determine timing of region loss using the following assumptions: only monoallelic, not biallelic, losses occurred (chromosome 8p did not fit this assumption and was excluded), the stronger the frequency of the variant the earlier the loss (as it is present in more cells within the sample), and variants of a log2 ratio of <0.2 were potential noise events and excluded from analysis. All genes resulting from GO term enrichment analysis, as depicted in Figs 2 and 3, were included in our analysis (S1 Fig). Histograms of each chromosome region in Fig 4 incorporate minimum SNP values of the queried region across all UCEC tumors with CNA data, and are mirrored for presentation purposes. Median SNP values of all GISTIC2 “-1” value regions are shown for comparison in Fig 4 (top panel). Overall, this approach mirrors the method used for mutational evolution of tumors by using the Cancer Cell Fraction within a sample [1].
Base-pair scale mutational analysis
For determination of transitions and transversions detected in whole-genome sequencing, we downloaded the COSMIC database [37] of cancer mutations and sliced the UCEC dataset for analysis. Mutation scores were summed for each individual tumor and mapped to TP53 mutation and/or Cadmium Response Loss cohorts, as described above. Tumors with a loss in any of the 12 genes enriched of loss in the Cadmium Response pathway were counted in the “CR” bar of Fig 5.
Supporting Information
Acknowledgments
We thank other members of the Stupack lab for their proofreading and intellectual discussion surrounding this manuscript.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by NCI grants CA107263 (DGS) and T32CA121938 (JD). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Lohr JG, Stojanov P, Carter SL, Cruz-Gordillo P, Lawrence MS, Auclair D, et al. Widespread genetic heterogeneity in multiple myeloma: implications for targeted therapy. Cancer Cell. 2014;25(1):91–101. 10.1016/j.ccr.2013.12.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kandoth C, McLellan MD, Vandin F, Ye K, Niu B, Lu C, et al. Mutational landscape and significance across 12 major cancer types. Nature. 2013;502(7471):333–9. 10.1038/nature12634 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Slamon DJ, Clark GM, Wong SG, Levin WJ, Ullrich A, McGuire WL. Human breast cancer: correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science. 1987;235(4785):177–82. . [DOI] [PubMed] [Google Scholar]
- 4.Davoli T, Xu AW, Mengwasser KE, Sack LM, Yoon JC, Park PJ, et al. Cumulative haploinsufficiency and triplosensitivity drive aneuploidy patterns and shape the cancer genome. Cell. 2013;155(4):948–62. 10.1016/j.cell.2013.10.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Li M, Fang X, Baker DJ, Guo L, Gao X, Wei Z, et al. The ATM-p53 pathway suppresses aneuploidy-induced tumorigenesis. Proc Natl Acad Sci U S A. 2010;107(32):14188–93. 10.1073/pnas.1005960107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Thompson SL, Compton DA. Proliferation of aneuploid human cells is limited by a p53-dependent mechanism. J Cell Biol. 2010;188(3):369–81. 10.1083/jcb.200905057 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Waddell N, Pajic M, Patch AM, Chang DK, Kassahn KS, Bailey P, et al. Whole genomes redefine the mutational landscape of pancreatic cancer. Nature. 2015;518(7540):495–501. 10.1038/nature14169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cancer Genome Atlas Research N, Kandoth C, Schultz N, Cherniack AD, Akbani R, Liu Y, et al. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497(7447):67–73. 10.1038/nature12113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Group SGOCPECW, Burke WM, Orr J, Leitao M, Salom E, Gehrig P, et al. Endometrial cancer: a review and current management strategies: part II. Gynecol Oncol. 2014;134(2):393–402. 10.1016/j.ygyno.2014.06.003 . [DOI] [PubMed] [Google Scholar]
- 10.Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 2011;12(4):R41 10.1186/gb-2011-12-4-r41 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics. 2009;10:48 10.1186/1471-2105-10-48 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Larson J, Yasmin T, Sens DA, Zhou XD, Sens MA, Garrett SH, et al. SPARC gene expression is repressed in human urothelial cells (UROtsa) exposed to or malignantly transformed by cadmium or arsenite. Toxicol Lett. 2010;199(2):166–72. 10.1016/j.toxlet.2010.08.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lee MJ, Saijoh K, Nestler EJ, Duman RS, Sumino K. Regional differences in expression of osteonectin mRNA after administration of cadmium to rats. Arch Toxicol. 1995;69(9):590–5. . [DOI] [PubMed] [Google Scholar]
- 14.Choe H, Burtnick LD, Mejillano M, Yin HL, Robinson RC, Choe S. The calcium activation of gelsolin: insights from the 3A structure of the G4-G6/actin complex. J Mol Biol. 2002;324(4):691–702. . [DOI] [PubMed] [Google Scholar]
- 15.Liu Y, Templeton DM. Involvement of CaMK-IIdelta and gelsolin in Cd(2+) -dependent cytoskeletal effects in mesangial cells. J Cell Physiol. 2013;228(1):78–86. 10.1002/jcp.24108 . [DOI] [PubMed] [Google Scholar]
- 16.Apostolova MD, Christova T, Templeton DM. Involvement of gelsolin in cadmium-induced disruption of the mesangial cell cytoskeleton. Toxicol Sci. 2006;89(2):465–74. 10.1093/toxsci/kfj035 . [DOI] [PubMed] [Google Scholar]
- 17.Wang Z, Templeton DM. Cellular factors mediate cadmium-dependent actin depolymerization. Toxicol Appl Pharmacol. 1996;139(1):115–21. 10.1006/taap.1996.0149 . [DOI] [PubMed] [Google Scholar]
- 18.Goering PL, Klaassen CD. Resistance to cadmium-induced hepatotoxicity in immature rats. Toxicol Appl Pharmacol. 1984;74(3):321–9. . [DOI] [PubMed] [Google Scholar]
- 19.Sadik NA. Effects of diallyl sulfide and zinc on testicular steroidogenesis in cadmium-treated male rats. J Biochem Mol Toxicol. 2008;22(5):345–53. 10.1002/jbt.20247 . [DOI] [PubMed] [Google Scholar]
- 20.Klaassen CD, Liu J, Diwan BA. Metallothionein protection of cadmium toxicity. Toxicol Appl Pharmacol. 2009;238(3):215–20. 10.1016/j.taap.2009.03.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kawano Y, Furukawa Y, Kawano Y, Abe W, Hirakawa T, Narahara H. Cadmium chloride induces the expression of metallothionein mRNA by endometrial stromal cells and amnion-derived (WISH) cells. Gynecol Obstet Invest. 2011;71(4):240–4. 10.1159/000319615 . [DOI] [PubMed] [Google Scholar]
- 22.Xue W, Kitzing T, Roessler S, Zuber J, Krasnitz A, Schultz N, et al. A cluster of cooperating tumor-suppressor gene candidates in chromosomal deletions. Proc Natl Acad Sci U S A. 2012;109(21):8212–7. 10.1073/pnas.1206062109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Liu Y, Chen C, Xu Z, Scuoppo C, Rillahan CD, Gao J, et al. Deletions linked to TP53 loss drive cancer through p53-independent mechanisms. Nature. 2016;531(7595):471–5. 10.1038/nature17157 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Akesson A, Julin B, Wolk A. Long-term dietary cadmium intake and postmenopausal endometrial cancer incidence: a population-based prospective cohort study. Cancer Res. 2008;68(15):6435–41. 10.1158/0008-5472.CAN-08-0329 . [DOI] [PubMed] [Google Scholar]
- 25.Eriksen KT, Halkjaer J, Sorensen M, Meliker JR, McElroy JA, Tjonneland A, et al. Dietary cadmium intake and risk of breast, endometrial and ovarian cancer in Danish postmenopausal women: a prospective cohort study. PLoS One. 2014;9(6):e100815 10.1371/journal.pone.0100815 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Adams SV, Quraishi SM, Shafer MM, Passarelli MN, Freney EP, Chlebowski RT, et al. Dietary cadmium exposure and risk of breast, endometrial, and ovarian cancer in the Women's Health Initiative. Environ Health Perspect. 2014;122(6):594–600. 10.1289/ehp.1307054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cho YA, Kim J, Woo HD, Kang M. Dietary cadmium intake and the risk of cancer: a meta-analysis. PLoS One. 2013;8(9):e75087 10.1371/journal.pone.0075087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hartwig A. Cadmium and cancer. Met Ions Life Sci. 2013;11:491–507. 10.1007/978-94-007-5179-8_15 . [DOI] [PubMed] [Google Scholar]
- 29.Jin YH, Clark AB, Slebos RJ, Al-Refai H, Taylor JA, Kunkel TA, et al. Cadmium is a mutagen that acts by inhibiting mismatch repair. Nat Genet. 2003;34(3):326–9. 10.1038/ng1172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Haldsrud R, Krokje A. Induction of DNA double-strand breaks in the H4IIE cell line exposed to environmentally relevant concentrations of copper, cadmium, and zinc, singly and in combinations. J Toxicol Environ Health A. 2009;72(3–4):155–63. 10.1080/15287390802538964 . [DOI] [PubMed] [Google Scholar]
- 31.Littlefield NA, Hass BS, James SJ, Poirier LA. Protective effect of magnesium on DNA strand breaks induced by nickel or cadmium. Cell Biol Toxicol. 1994;10(2):127–35. . [DOI] [PubMed] [Google Scholar]
- 32.Liu X, Jiang L, Wang A, Yu J, Shi F, Zhou X. MicroRNA-138 suppresses invasion and promotes apoptosis in head and neck squamous cell carcinoma cell lines. Cancer Lett. 2009;286(2):217–22. 10.1016/j.canlet.2009.05.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhang X, Zhang F, Guo L, Wang Y, Zhang P, Wang R, et al. Interactome analysis reveals that C1QBP (complement component 1, q subcomponent binding protein) is associated with cancer cell chemotaxis and metastasis. Mol Cell Proteomics. 2013;12(11):3199–209. 10.1074/mcp.M113.029413 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Fogal V, Richardson AD, Karmali PP, Scheffler IE, Smith JW, Ruoslahti E. Mitochondrial p32 protein is a critical regulator of tumor metabolism via maintenance of oxidative phosphorylation. Mol Cell Biol. 2010;30(6):1303–18. 10.1128/MCB.01101-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kracun S, Chaptal V, Abramson J, Khakh BS. Gated access to the pore of a P2X receptor: structural implications for closed-open transitions. J Biol Chem. 2010;285(13):10110–21. 10.1074/jbc.M109.089185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Coddou C, Lorca RA, Acuna-Castillo C, Grauso M, Rassendren F, Huidobro-Toro JP. Heavy metals modulate the activity of the purinergic P2X4 receptor. Toxicol Appl Pharmacol. 2005;202(2):121–31. 10.1016/j.taap.2004.06.015 . [DOI] [PubMed] [Google Scholar]
- 37.Forbes SA, Beare D, Gunasekaran P, Leung K, Bindal N, Boutselakis H, et al. COSMIC: exploring the world's knowledge of somatic mutations in human cancer. Nucleic Acids Res. 2015;43(Database issue):D805–11. 10.1093/nar/gku1075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Natarajan AT. An overview of the results of testing of known or suspected aneugens using mammalian cells in vitro. Mutat Res. 1993;287(1):113–8. . [DOI] [PubMed] [Google Scholar]
- 39.Shimada T, Watanabe T, Endo A. Potential mutagenicity of cadmium in mammalian oocytes. Mutat Res. 1976;40(4):389–95. . [DOI] [PubMed] [Google Scholar]
- 40.Guyot E, Solovyova Y, Tomkiewicz C, Leblanc A, Pierre S, El Balkhi S, et al. Determination of Heavy Metal Concentrations in Normal and Pathological Human Endometrial Biopsies and In Vitro Regulation of Gene Expression by Metals in the Ishikawa and Hec-1b Endometrial Cell Line. PLoS One. 2015;10(11):e0142590 10.1371/journal.pone.0142590 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Nordberg M, Nordberg GF. Toxicological aspects of metallothionein. Cell Mol Biol (Noisy-le-grand). 2000;46(2):451–63. . [PubMed] [Google Scholar]
- 42.Pinter TB, Stillman MJ. Putting the pieces into place: Properties of intact zinc metallothionein 1A determined from interaction of its isolated domains with carbonic anhydrase. Biochem J. 2015;471(3):347–56. 10.1042/BJ20150676 . [DOI] [PubMed] [Google Scholar]
- 43.Pinter TB, Stillman MJ. The zinc balance: competitive zinc metalation of carbonic anhydrase and metallothionein 1A. Biochemistry. 2014;53(39):6276–85. 10.1021/bi5008673 . [DOI] [PubMed] [Google Scholar]
- 44.Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6(269):pl1 10.1126/scisignal.2004088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Berger AH, Pandolfi PP. Haplo-insufficiency: a driving force in cancer. J Pathol. 2011;223(2):137–46. 10.1002/path.2800 . [DOI] [PubMed] [Google Scholar]
- 46.Mathew R, Kongara S, Beaudoin B, Karp CM, Bray K, Degenhardt K, et al. Autophagy suppresses tumor progression by limiting chromosomal instability. Genes Dev. 2007;21(11):1367–81. 10.1101/gad.1545107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yue Z, Jin S, Yang C, Levine AJ, Heintz N. Beclin 1, an autophagy gene essential for early embryonic development, is a haploinsufficient tumor suppressor. Proc Natl Acad Sci U S A. 2003;100(25):15077–82. 10.1073/pnas.2436255100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Qu X, Yu J, Bhagat G, Furuya N, Hibshoosh H, Troxel A, et al. Promotion of tumorigenesis by heterozygous disruption of the beclin 1 autophagy gene. J Clin Invest. 2003;112(12):1809–20. 10.1172/JCI20039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Strehl B, Seifert U, Kruger E, Heink S, Kuckelkorn U, Kloetzel PM. Interferon-gamma, the functional plasticity of the ubiquitin-proteasome system, and MHC class I antigen processing. Immunol Rev. 2005;207:19–30. 10.1111/j.0105-2896.2005.00308.x . [DOI] [PubMed] [Google Scholar]
- 50.Kladney RD, Cardiff RD, Kwiatkowski DJ, Chiang GG, Weber JD, Arbeit JM, et al. Tuberous sclerosis complex 1: an epithelial tumor suppressor essential to prevent spontaneous prostate cancer in aged mice. Cancer Res. 2010;70(21):8937–47. 10.1158/0008-5472.CAN-10-1646 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Rampias T, Vgenopoulou P, Avgeris M, Polyzos A, Stravodimos K, Valavanis C, et al. A new tumor suppressor role for the Notch pathway in bladder cancer. Nat Med. 2014;20(10):1199–205. 10.1038/nm.3678 . [DOI] [PubMed] [Google Scholar]
- 52.Prickett TD, Zerlanko BJ, Hill VK, Gartner JJ, Qutob N, Jiang J, et al. Somatic mutation of GRIN2A in malignant melanoma results in loss of tumor suppressor activity via aberrant NMDAR complex formation. J Invest Dermatol. 2014;134(9):2390–8. 10.1038/jid.2014.190 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Quintas-Cardama A, Cortes J. Molecular biology of bcr-abl1-positive chronic myeloid leukemia. Blood. 2009;113(8):1619–30. 10.1182/blood-2008-03-144790 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Zhu J, Nasr R, Peres L, Riaucoux-Lormiere F, Honore N, Berthier C, et al. RXR is an essential component of the oncogenic PML/RARA complex in vivo. Cancer Cell. 2007;12(1):23–35. 10.1016/j.ccr.2007.06.004 . [DOI] [PubMed] [Google Scholar]
- 55.White E. The role for autophagy in cancer. J Clin Invest. 2015;125(1):42–6. 10.1172/JCI73941 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rabinowitz JD, White E. Autophagy and metabolism. Science. 2010;330(6009):1344–8. Epub 2010/12/04. 10.1126/science.1193497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Adams JU. Genetics: Big hopes for big data. Nature. 2015;527(7578):S108–9. 10.1038/527S108a . [DOI] [PubMed] [Google Scholar]
- 58.McCarroll SA, Kuruvilla FG, Korn JM, Cawley S, Nemesh J, Wysoker A, et al. Integrated detection and population-genetic analysis of SNPs and copy number variation. Nat Genet. 2008;40(10):1166–74. 10.1038/ng.238 . [DOI] [PubMed] [Google Scholar]
- 59.Cline MS, Craft B, Swatloski T, Goldman M, Ma S, Haussler D, et al. Exploring TCGA Pan-Cancer data at the UCSC Cancer Genomics Browser. Sci Rep. 2013;3:2652 10.1038/srep02652 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24–6. 10.1038/nbt.1754 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–9. 10.1038/75556 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Mudunuri U, Che A, Yi M, Stephens RM. bioDBnet: the biological database network. Bioinformatics. 2009;25(4):555–6. 10.1093/bioinformatics/btn654 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.