Abstract
Oncodriver genes are usually identified when mutations recur in multiple tumours. Different drivers often converge in the activation or repression of key cancer-relevant pathways. However, as many pathways contain multiple members of the same gene family, individual mutations might be overlooked, as each family member would necessarily have a lower mutation frequency and thus not identified as significant in any one-gene-at-a-time analysis. Here, we looked for mutated, functional sequence positions in gene families that were mutually exclusive (in patients) with another gene in the same pathway, which identified both known and new candidate oncodrivers. For instance, many inactivating mutations in multiple G-protein (particularly Gi/o) coupled receptors, are mutually exclusive with Gαs oncogenic activating mutations, both of which ultimately enhance cAMP signalling. By integrating transcriptomics and interaction data, we show that the Gs pathway is upregulated in multiple cancer types, even those lacking known GNAS activating mutations. This suggests that cancer cells may develop alternative strategies to activate adenylate cyclase signalling in multiple cancer types. Our study provides a mechanistic interpretation for several rare somatic mutations in multi-gene oncodrivers, and offers possible explanations for known and potential off-label cancer treatments, suggesting new therapeutic opportunities.
Subject terms: Cancer genomics, Cancer genomics, Cell signalling, Cell signalling
Introduction
Cancer genome sequencing projects have revealed a growing list of genes and mutations driving tumor initiation and progression [1–5]. However, even when an oncodriver role is well established, it remains difficult to discriminate driver from passenger mutations, especially when they are rare [6]. This task is more daunting when sparse somatic mutations affect genes not previously linked to cancer by standard approaches based on positive selection. A recent systematic survey of somatic mutations under positive selection estimated that nearly half of driving events are found in genes not yet linked to cancer [7]. With genome sequencing entering clinical practice, more powerful approaches able to identify rare driver mutations are therefore required.
Analysis of mutated genes, in the context of pathways and networks [8] or three-dimensional structures [5, 9], has aided the detection of driver variants, also illuminating mechanisms adopted by cancer cells for tumor growth and spread. For example, frequently mutated oncodrivers rarely participate to the same interaction interface [10]: e.g., TP53 is mutated in many cancers, but its most common interaction partners are not. However, there are instances where mutations in different parts of one pathway appear to be responsible for a common cancer. For example, mutations affecting KRAS/NRAS and BRAF are often found mutually exclusively along the MAP kinase cascade pathway, particularly in melanoma, colorectal, lung and pancreatic cancers [11]. Analysis of cancer genomes has revealed many examples of mutually exclusive events concurring with specific cancer phenotypes [12].
Given the redundancy of many biological processes, where multiple proteins can modulate a common up- or downstream partner, it is possible that multiple genes could essentially replace mutations in a single common cancer gene. Indeed, several known oncodrivers have multiple regulators. For example, the phosphoinositide 3-kinase (PI3K) pathway, shows great redundancy in that downstream target PI3Ks can be activated by many upstream signals transduced by tyrosine kinases and G-protein coupled receptors [13].
Instances of this multi-gene oncodriver phenomenon would often be overlooked as mutations in multiple upstream genes necessarily have lower frequencies, and thus would not be identified as statistically significant in any one-gene-at-a-time analysis. Here we searched for such examples, by grouping all genes into protein domain families and looking for pairs of functionally linked families showing exclusivity of mutations in particular cancers (Fig. 1a). We identify several instances of this phenomenon, involving both oncogenes and tumor suppressors, including sparsely mutated, genes not previously linked to cancer.
Results
A network of mutually exclusive, variant-enriched protein family positions in cancer
We defined a set of interacting gene families as those sharing a common protein domain [14], and residing in the same pathway (from Reactome; [15]). In each pair of families, we then sought protein alignment positions that were mutually exclusive with regard to somatic, non-synonymous cancer variants (Methods; Fig. 1b), either in specific cancers or across all of them (pan-cancer). We considered 1.9 M simple somatic non-synonymous mutations, from 414 different cancer subtypes (from 37 primary tissues and 49 histologies). A total of 794k mutations (41% of the total) affect 282k domain positions from 5.7k human protein families (92% of the total). We defined significantly enriched domain positions according to a gene-level background model (to correct for genes with different mutational burden; Fig. S1 and Tables S1, S2), and retained pairs of positions displaying significant enrichment in a pathway-level background model (Tables S3, S4; see Methods). We chose this model, similar to one used in another recent study [16], and not models designed to detect positive selection in individual genes [3], as we were seeking positions across families: using other models would have filtered out many rare variants that were precisely what we sought. Changing the parameters of the analysis (i.e., mutation thresholds, set of pathways considered, background model) led only to moderate changes in the number of identified positions. The number of dubious genes (i.e., those more tolerant to mutations [17]) uncovered also does not vary using different parameters and it is moreover always much lower than when using the 1000 Genomes set (see Methods and Table S5).
It is possible that the mutually exclusive pairs we detect arise because mutations in both partner proteins in one sample would simply not be tolerated in living cells. While we cannot rule this out in all instances, evidence that this is not the case comes from the fact that we see co-occurrence of mutations, in at least one sample, for 56% of all hotspot pairs (i.e., in addition to the observed exclusivity). This suggests that the mutations are at least, in principle, tolerated together. Moreover, few of the pairs we identified involve interactions between well-established oncodrivers, but most often involve one oncodriver and a larger family of proteins.
Our pan-cancer analysis revealed 414 significantly mutated (Binomial q < 0.1) and mutually exclusive (Fischer’s exact q < 0.1) pairs, involving a total of 86 positions from 23 families in 55 pathways (Fig. 1c), affecting 9.1k (43% of the total) samples. The three most represented pathways were tyrosine kinase signalling, generic transcription, and GPCR downstream signalling (Fig. S2; Table S3). We found that domains with significant mutated positions have similar fractions of highly conserved positions compared to those with no mutations (p = 0.17; Table S6). Moreover, only a minority (22%) of the top 50 largest sequence families in the human proteome have at least one significantly mutated positions, ruling out a bias owing to family size (see Table S6).
The process of identifying variant-enriched, mutually exclusive positions also highlights functional positions, as they are enriched for sites that bind to other molecules compared to random selections [18] (Methods). Nearly half of mutually exclusive position pairs (171 out of 336 matched to an interaction interface) have similar predicted functional consequences in the sense of targeting protein, small-molecule or nucleic acid binding sites (Fig. S3). 9% (of 5188) gene-pairs share a specific protein and 39% share a small-molecule interaction partner, indicating a more precise functional overlap (Table S7, see below).
The 1055 genes linked in the network (Fig. 1c), involve several instances of small (even single member) families, which include relations between well-known cancer genes. Not every known oncodriver is found, which is expected since the method is very restrictive: requiring gene family pairs each to have a specific position enriched in variants in a mutually exclusive fashion within at least one common pathway. Nevertheless, some well-known examples are in this network. For example, KRAS p.G12D with BRAF p.V600E, and AKT1 p.E17K with PIK3CA p.E545K. The approach also revealed candidate multi-gene oncodrivers, namely where positional variants scattered across a large family show mutual exclusivity with positions in a single protein downstream.
Mutations in multiple transcription factors are mutually exclusive with TP53, representing potential tumor suppressors
A major part of the network (Fig. 1c) relates to tumor suppressor activities mediated by transcription factors, particularly TP53. Mutations in several genes with transcriptional regulatory functions (i.e., the left side of Fig. 1c) are mutually exclusive (e.g., zf-C2H2 or zf-C2H6 with either P53 or EGF domains). Several known oncodrivers involved in this functional subnetwork are genes with known tumor suppressor properties, such as TP53, PTEN, FBXW7, SMAD3 and SMAD4.
In addition, many of these relationships involve transcription factors with broadly similar functions, including zinc-fingers with positions that were found to be exclusive with TP53 counterparts. The majority of positions are residues involved in DNA-binding in both families, where the predicted effect [18] is a loss of binding, suggesting that one could indeed replace the other functionally (Fig. 2a, b). In support of this, several zinc-finger proteins show similarities in transcriptional targets with TP53, as identified in ChIP-seq/ChIP-chip data (Methods; Fig. 2c). These include growth promoting genes such as MYC, RELA, and STAT3 (Fig. 2d). Remarkably, zinc-fingers SP1, PRDM1, ZNF740 and YY1 all show overlaps of more than 20% with TP53 target genes (Table S8). Several have already been linked to cancer; for example SP1 is a prognostic factor for lower survival in gastric cancer [19], PRDM1 to be a tumor suppressor in lymphomas [20] and YY1 as an initiator of tumorigenesis in several malignancies [21]. Most of these relationships were statistically significant when considering the large pan-cancer dataset, though several are also seen in specific cancer tissues (Table S9), including, skin and large intestine.
Multiple GPCRs mutations are exclusive with G-proteins and other oncogenes
The other main component of the pan-cancer network (Fig. 1c, right) relates broadly to signal transduction and contains several known oncogenes (e.g., Ras-family members, tyrosine and serine-threonine kinases, and G proteins). This also involves several hundred GPCR genes that show positional exclusivity with G-protein hotspots. A major contributor to this is the DRY Arginine (R3.50, superscript denotes the Ballesteros/Weinstein numbering [22], see Methods), which is the most significantly enriched (q = 2 × 10−35; Table S1) position, and which shows mutual exclusivity with recurrent mutations of the heterotrimeric Gα family catalytic switch I (SWI) arginine (mainly accounted by GNAS p.R201G.hfs2.2, superscript denotes the Common Gα numbering [23]; Fisher exact q < 0.001; Table S3).
DRY is an important motif for Class A GPCR activation [24, 25], mediating intra-molecular polar contacts holding receptors inactive until ligand binding [25]. The arginine is recurrently mutated (Table S10), with a total of 94 class A GPCRs having at least one somatic mutation at this arginine in 153 unique samples for a total of 189 non-synonymous mutations (see Table S1). Among them, the most frequently mutated are HCRTR2, GPR174, P2RY12 and LPAR4. The majority of mutations at the G-alpha SWI Arginine position (Table S11) are found in GNAS (109 samples) with others found in GNAI2 (12), GNA13 (7), GNA15 (4) and GNAQ/11 (6). DRY arginine also displays mutual exclusivity with positions in genes involved in GPCR-mediated downstream signalling including many oncogenes: AKT1(2,3) p.E17K, PIK3CA p.E545K and p.E542K, RAC1 p. P29S, JAK2 p.V617F. (Fig. 1c and Table S3).
Several other GPCR positions also show exclusivity, including P7.50 from another highly conserved NPxxY motif, which shows exclusivity with the DRY arginine itself (i.e., intramolecular; Table S3); these motifs cooperate in receptor activation [25]. This proline, together with other positions on the cytosolic side of the GPCR structure (Fig. 3b), also shows mutual exclusivity with the G-alpha SWI arginine and positions from many other downstream oncogenes (Table S3). Moreover, we also found a tendency for mutual exclusivity of GPCR DRY Arginine and G-alpha SWI Arginine in specific tissues, including pancreas (q = 0.02; Table S4), large intestine (p = 0.02, q = 0.23), stomach (p = 0.13, q = 1) and skin (p = 0.25, q = 1) (Fig. S5).
The potential importance of these mutations is underscored by their association with poorer survival, either in specific cancers (DRY Arginine in skin melanoma; LogRank p = 0.03; Cox’s HR = 1.16; Fig. 3e), or R4.40 mutations in all cancers (LogRank p = 0.01; Cox’s HR = 1.03; Fig. S4; Tables S12–S14).
The GPCR mutations we find in these cancer samples differ greatly from naturally occurring variants in healthy individuals (Wilcoxon p = 0.001; see Fig. S6), which are most often found in Olfactory receptors [26]. Nevertheless, despite the stringent background models adopted, we do see mutual exclusivities involving olfactory receptors (Fig. 1c). Among them is OR51E2 (Table S3), which is ectopically expressed in melanocytes [27] as well as in primary melanoma and melanoma metastasis [28] and whose activation has been shown to elicit an onco-suppressive effect in prostate carcinoma [29]. Thus, the emerging information suggests that at least a subset of these mutations might have consequences in cancers.
We also found exclusivity between G-alpha positions and downstream signalling partners. In pancreatic tumors, for example, GNAS SWI arginine and p.V633M mutations in the downstream adenylate cyclase ADCY8 are mutually exclusive (Fig. S7 and Table S4) in addition to those affecting DRY Arginine of upstream GPCRs (Fig. S7 and Table S4). In adrenal gland adenomas GNAS SWI arginine and p.L207R mutations in the cAMP activated enzyme PRKACA are mutually exclusive (Table S4). Remarkably, samples with these GNAS or PRKACA mutations do not have any other known oncogene or signalling oncodriver mutated (Table S16).
Widespread dysregulation of GPCR-mediated signaling in cancer
G-alpha SWI arginine mutations are usually oncogenic, leading to increased signaling [30]. The mutual exclusivity would be tantalizingly explained by DRY mutations (or other mutually exclusive GPCR positions) also being activating. This is also an attractive idea as it is known that some mutations, particularly of the DRY motif (e.g., in ADRA1A [31] and AVPR2 [32]), can lead to constitutive activation. However, we found that these positions almost always lead to a loss of function (Fig. 3d), as it is known to be most often the case for mutations to this arginine [33]. Indeed, by experimentally testing the effects of observed DRY arginine mutations in seven representative receptors (selected according to the data in Fig. S5) and the previously described AVPR2 mutations by a TGFα shedding assay [34], we find that these positions almost always lead to a loss of ligand-induced function (Fig. 3d, Fig. S8 and Table S17). Moreover, while AVPR2 mutations, as expected [32], led to constitutive activity, none of the representative receptors with cancer mutations displayed constitutive activation as reported by cAMP assays (Fig. S9).
This seemingly counterintuitive result can be explained by the overall context of GPCR/G-protein mediated signalling. We combined mutation, gene expression and GPCR/G-protein coupling data (i.e., which G-protein is coupled to each GPCR) [35] to estimate the overall activities of the four main G-protein classes in each cancer (Fig. 4a; Methods). This revealed that many cancers, beyond those showing GNAS hotspot mutations, display a tendency for widespread up-regulation of Gs activity over the other G-proteins, which is particularly evident for Gi/o which never prevails in any investigated cancer type (Fig. 4a, top). Indeed, 71% of TCGA cancer types characterized by GNAS activating mutations also show overall higher activity of the Gs pathway, which is the most activated in 75% of the considered cancer types (Fig. 4a). Notably, these include all malignancies of the gastro-intestinal tract (i.e., esophagus, stomach, liver, colon, rectum) present in the differential expression panel. More specifically, while Gs activity is mainly accounted by higher GNAS levels (Fig. 4b), lower Gi/o levels are seemingly caused by a diminished expression of either Gi/o-proteins or their coupled receptors (Figs. S10, S11), which are more frequently affected by deleterious mutations (i.e., stop gains, frameshifts or non-synonymous mutations at highly conserved domain positions; see Methods) in multiple cancers (Fig. 4a; grey in middle panel). Overall, Gi/o coupled receptors are the class hit by the greatest number of deleterious mutations also when considering all cancers (Fig. 4c).
This immediately suggests an explanation for our observation, as deactivating Gi/o signalling and activating Gs signalling would both lead to an increase in adenylate cyclase activity and cAMP mediated signalling (Fig. 4e). Indeed, we found that cAMP assays for a selection of representative Gi/o coupling GPCRs (UTS2R and HCRTR2) showed that the DRY mutants poorly induced Gi activation as measured by loss of inhibitory effect of ligand-induced cAMP luminescent signals (Fig. 4d, Table S18). The much greater expression of GNAS relative to the other G protein genes in all cancers (Fig. 4b), would also imply that Gs signalling is prevailing (i.e., even in the absence of any GNAS activation) and hence needs to be strictly controlled.
To understand more in details the functional consequences of DRY arginine mutations, we performed differential expression analysis of R3.50 mutations in cancer types where it was found at highest recurrence (i.e., skin, gastric and uterine; see Methods). In melanoma 31% (146 out 462) of dysregulated genes in R3.50 mutated samples overlap with those of G-protein activating mutations (GNAQ p.Q209G.s3h2.3 and GNAI2 p.R179G.hfs2.2), indicating at least partial functional equivalence between these mutually exclusive mutation sets and moreover pointing to biological processes characteristic of more aggressive melanomas (e.g., thickening, loss of inhibitory endopeptidase activity [36], see Fig. S12a). Comparative differential expression analysis of R3.50 mutations in multiple cancers (i.e., skin, gastric and uterine) revealed a more context specific downstream effect of these mutations, with only 15 genes similarly dysregulated (Fig. S12b). Among these, we found GNAS-AS1, the antisense RNA for the GNAS locus, which has been reported to regulate its imprinting status [37], thus suggesting an additional layer of regulation of GNAS’s activity. Intriguingly, GNAS-AS1 downregulation by methylation has been associated to colorectal tumorigenesis [38] as well as to lung cancer susceptibility [39].
An integrated map for GPCR drugs (off-)label prescription
Activation of cAMP signalling via means other than by activating GNAS mutations suggests both an explanation for off-label drugs with potential actions against cancer, and the means to identify additional cancer targeting drugs (Table 1, Table S19; Fig. S13). For instance propranolol, an antagonist of the GNAS coupled β2 andrenergic receptor (ADRAB2), improves prognosis in thick skin melanoma [40] and abrogates EGFR inhibitor resistance acquisition in lung cancer [41]. This suggests that other such antagonists could also prove useful, for example of several other adrenergic receptors (Table 1). In contrast, agonists of Gi signalling could, in principle, produce similar effects, and interestingly, two drugs targeting Gi-receptors (Pasireotide targeting somatostatin receptors and adenosine analogs targeting Adenosine A3 receptor) are already in use based on antitumor effects (Table 1). These potential treatments are likely only relevant when cAMP signalling either drives the cancer or where it is linked to therapy resistance, as in melanomas [42], and where there is not overexpression of the GPCR of interest (which could indicate a different mechanism; e.g., DRD2 in gastric cancers [43]). To aid the search for new therapeutic candidates, we integrated data on GPCR coupling and known antagonists/agonists for all coupling groups (Table 1, Fig. S13).
Table 1.
GPCR | Cancers | Agonists | Evidence for therapeutic benefit in cancer | Indications |
---|---|---|---|---|
Gi/o-coupled receptors with high expression, with no change in expression in tumor compared to wild-type, and with approved agonists | ||||
ADORA3 | BLCA | 2 | Antiproliferative effects of adenosine or synthetic agonist in melanoma, prostate, colon and liver carcinomas or lymphoma [70, 71] | Coronary vasodilators, pharmacological stress testing |
HNSC | ||||
LUSC | ||||
PRAD | ||||
STAD | ||||
HTR1D | STAD | 20 | Serotonin analogues are inhibitors of breast cancer cell growth [72] | Agonists used for migrane treatment |
S1PR1 | ESCA | 1 | Fingolimod efficacy in in-vitro and in-vivo cancer models by inhibition of sphingosine kinase 1 [73] + | Immuno-modulating in Multiple Sclerosis |
READ | ||||
STAD | ||||
SSTR1 | ESCA | 1 | Pasireotide (somatostatin analogue) can inhibit non-functioning pituitary adenomas and neuroendocrine tumors [74, 75] | Treatment of Cushing’s disease |
KIRP | ||||
LIHC | ||||
STAD | ||||
UCEC | ||||
Gs-coupled receptors with no significant under expression and with approved antagonists | ||||
ADRA2A | ESCA | 11 | None found | Many indications for antagonists, including Parkinson’s, schizophrenia, psychosis, depression and erectile dysfunction |
HNSC | ||||
LUSC | ||||
READ | ||||
STAD | ||||
THCA | ||||
UCEC | ||||
ADRA2B | KICH | 13 | None found | As above |
KIRC | ||||
UCEC | ||||
ADRA2C | BLCA | 12 | None found | As above |
BRCA | ||||
ESCA | ||||
HNSC | ||||
LUSC | ||||
STAD | ||||
UCEC | ||||
ADRB1 | ESCA | 14 | None found | |
ADRB2 | HNSC | 15 | Propranolol suppresses pancreatic and breast cancers invasion, protects patients with skin melanoma from disease recurrence and death and avoids EGFR inhibitor resistance in lung cancer [40, 41, 76, 77] | Treatment of hypertension or irregular heart rate |
KIRC | ||||
KIRP | ||||
THCA | ||||
CNR1 | ESCA | 1 | Rimonabant inhibits human breast cancer cell proliferation [78] | Anorectic antiobesity (drug withdrawn) |
THCA | ||||
HTR7 | BLCA | 20 | None found | Many indications, including depression, psychosis and panic disorder |
All receptors show mean expression (RPKM) values > = 100. For Gi/o-linked GPCRs we sought only those that showed no significant fold-change when comparing tumors to wild-types (i.e., as overexpression likely indicates an oncogenic activity); for Gs-linked GPCRs we included all that were not significantly under-expressed. Cancer types in italic are those where GNAS activating mutations or Gi/o GPCR down-regulation/deactivation is observed. Agonist/antagonist classification has been derived from IUPHAR [35]. +Fingolimod (phosphorylated metabolite) is a functional antagonist for S1PR1. It first acts as an agonist, but induces degradation of S1PR1
Discussion
Understanding the role played by somatic mutations in cancer is critical for the interpretation of large datasets from high-throughput sequencing projects and the development of personalized therapies. Considering functional relationships, or biological context, can help identify novel driver mutations, even for less frequently mutated genes [44]. Previous studies used protein family evolutionary relationships to highlight positions significantly affected by somatic mutations [16] and suggested that sparse mutations affecting equivalent domain positions in known drivers might display similar downstream consequences [16]. Similarly, the usage of protein domain information was shown to improve the detection of deleterious variants of genetic diseases [45]. Here, we employed a similar approach to analyse non-synonymous mutations and, to further infer on their biological role, we pinpointed causal relationships by systematically analysing their mutual exclusivity along biological pathways.
Our pan-cancer network includes just 51 (5%) genes from the Cancer Gene Census. Missing genes are from protein families lacking either individual mutation-enriched positions, or mutual exclusivity. The majority of the genes are not currently in the census, and could be new (usually rare) oncodriver mutations. For example, a few zinc-finger genes have been classified as oncodrivers in the Census (e.g., ZNF311 [46]) and there is mounting evidence for their involvement in several cancer types, most often as tumor suppressors, by regulating the transcription of genes important for tumor progression [46].
We propose that novel inhibitory mutations on Gi/o-coupled receptors may converge to produce a similar outcome to GNAS activating mutations, resulting in increased cAMP signalling. Moreover, these events appear to be part of a wider dysregulation of GPCR mediated signalling in multiple cancers (Fig. 4a), which we predict to lead to GNAS overactivity, owing to its much higher expression than all other G-proteins (Fig. 4a, b). This is also supported by our analysis of non-synonymous mutations and copy number variants (CNVs) in genes involved in cAMP signalling (Fig. S14a). About a third of patients (9k of 28k) have such mutations, and as expected activating mutations and/or CNV gains prevail in genes increasing cAMP (i.e., Gαs subunits and Adenylyl cyclases), with CNV losses being more common in genes that lower cAMP (i.e., Gαi subunits and Phosphodiesterases), (Fig. S14b). Generally, many cancers evolve towards higher cAMP by a variety of different mechanisms.
Several GPCRs are involved in cancer progression, metastasis and therapy resistance [47]. GPCRs and their cognate Gα proteins are extensively mutated in cancer samples, though the functional consequences are not always clear and appear to be context dependent. For example, it is well established that GNAS p.R201G.hfs2.2 (on SWI) and GNAQ/11 p.Q209G.s3h2.3 (on SWII) activating mutations are oncogenic [30]. Recent experiments have begun to decipher the functional consequences of the GNAS p.R201C mutation in pancreatic ductal adenocarcinomas, where it modulates KRAS p.G12V initiated neoplasia [48, 49]. We also found SWI mutations in GNAI2 (p.R179), in skin melanoma and lymphomas, where it has oncogenic effects by upregulating ERK1/2 [50], and in GNA13 (p.R200; in bladder carcinoma), which is not yet fully understood.
Functionally equivalent mutations on GPCRs have been seen to replace those in Gα proteins. For example, mutually exclusive P2RY8, GNA13 and RHOA mutations have similar inactivating effects in B-cell derived lymphomas [51, 52]. Activating mutations of CYSLTR2 (p.L129Q) in uveal melanoma are mutually exclusive and functionally equivalent to mutations in GNAQ/11 (p.Q209) [53]. Compensatory mutations with opposite functional outcomes have been reported in pituitary tumors, where the effect of activating GNAS mutations can be replaced by inactivating mutations on Gi/o proteins [54] or on AIP [55].
Thus, widespread inactivating mutations in Gi/o-linked GPCRs similarly suggests that their restraining activity on GNAS is lost and, as part of the tumorigenic process, cAMP signalling is persistently activated and leads to tumor progression. It has been shown that a cAMP signalling network, involving upstream GPCRs, is responsible of MAP kinase inhibitors resistance in melanomas [42]. The observation that DRY arginine mutations are associated to lower survival in melanoma, suggests that these mutation events might indeed concur to these mechanisms.
Knowledge of rare oncodrivers, including several identified here, can impact cancer diagnostics and treatment. For example, of the 26k samples across all cancers lacking common TP53 mutations, 2.2k (7.8%) show mutations in the Zinc-finger genes and positions mentioned above, thus potentially providing a molecular mechanism for tumor suppressor activities that might have been overlooked. More tantalizingly, observation of deactivating mutations in Gi/o coupled GPCRs could indicate suitability for treatment of to be developed Gs inhibitors, or readily available inhibitors of Gs coupled receptors, including the recently proposed off label use of beta-blockers (e.g., propranolol) or by exploiting the wealth of agonists activating Gi/o coupled receptors. The search for these in the future must naturally consider the complete context of GPCR/G-protein signalling pathways in each cancer.
The synthesis of functional information with genetic variants can uncover new molecular insights into diseases, including new disease genes, and specific molecular mechanisms. This holds much promise for developing personalized diagnoses and therapies across many clinical subjects.
Materials and methods
Datasets
We extracted confirmed somatic, non-synonymous mutations from version 79 of COSMIC [56] genomes (http://cancer.sanger.ac.uk/cosmic). We mapped 22842 of 28089 (81%) of the associated Ensembl [57] transcripts to Uniprot [58] canonical isoforms, which left 1.5 M unique protein mutations (corresponding to a total of 4.6 M mutated alleles), from 21k unique samples from 414 different cancer types. We defined cancer types using the COSMIC classification system considering Primary tissue/Tissue sub-type1 and Primary histology/Histology sub-type1 specifications.
We used the Pfam [14] database as it has the widest coverage of sequence space and provides HMM profiles for easy alignment of query sequences to the pre-existing alignments. We identified Pfam-A families within the mapped protein sequences using HMMer [59], defining highly-conserved positions as those with one amino-acid recurring in 50% or more sequences. To identify each position within the alignment, we employed the Pfam consecutive numbering scheme and labelled them using the amino acids letter from the Hidden Markov Model (HMM) consensus sequence (highly conserved residues are upper case). For GPCRs, we additionally labelled positions through the Ballesteros/Weinstein scheme [22], using the consensus domain secondary structure from the HMM model to number residues in helix regions (see Table S20). For G-proteins, we employed the established Common Gα numbering [23].
Variant enrichment
To define domain positions within protein family alignments enriched in variants we computed the log of the observed number of variants divided by the expected (log odds). We computed the expected number by multiplying the frequency of total alleles (4.6 alleles) in the total proteome length (>11 million amino acids) times the number of domain instances in the proteome. We assessed the statistical significance of domain position enrichments, either in individual cancer types or pan-cancer, through a one-tailed binomial test, computing the prior probability by randomly shuffling of non-synonymous protein variants from each individual across the same protein. Shuffling did not produce sufficient numbers of all possible combinations we observed to generate separate distributions to calculate observed P-values, but was rather used to get an expected probability to use in the binomial calculation. We corrected P-values through the Benjamini-Hochberg or False Discovery Rate procedure [60] to give a corresponding Q-value (labelled q in the text). For individual cancers, we retained positions having 5 observed or 2.5 expected non-synonymous variants; for pan-cancer we considered positions with 20 observed or 5 expected. For both we defined significantly enriched positions as those with log odds > = 0 and q < = 0.01.
These thresholds were chosen heuristically, based on our experience, to narrow down the analysis to more interesting examples and to avoid potential false positives. Indeed, using no thresholds, led to an increase in the total number of identified positions, in both specific tissues and pan-cancer and in the absolute number of dubious genes (see Table S5a), defined as those more tolerant to mutations as assessed through the RVIS approach [17]. We considered only the top 2% of the most tolerant genes ranked through the RVIS score.
To correct for domain mutation rate, we randomly shuffled mutations found at domains within positions of the same domain and the same protein. Mutations found outside domain regions, were randomly shuffled within the remaining protein sequence portion. We similarly calculated the enrichment of mutations at domain positions by using this alternative background model.
Mutual exclusivity analysis of functionally related protein positions
We defined functionally interacting genes based on their membership to a subset of Reactome (http://reactome.org/) pathways [18], defined according to size criteria to represent specific functional units. We considered pathways with fewer than 200 proteins from the FrontPageItem list. Sub-pathways were included if their size was less than 300. For sub-pathways with >300 proteins, we considered their sub-pathways. The threshold for the second level is based on the assumption that the lower level pathways should be more akin to functional units than those in upper levels. This allowed us to define a set of intermediate level biological pathways, which we found to be a trade-off between coverage and lower inner redundancy.
We assessed the mutual exclusivity of domain positions in mutated family members participating to the same biological pathway either in individual cancer types or pan-cancer. We considered only significantly enriched domain positions (see previous section) mutated either in at least five unique samples for individual cancers, or 50 unique samples for the pan-cancer set. We assessed the mutual exclusivity for each domain position pair within a given pathway through a one tail Fischer’s exact test, correcting P-values through the FDR procedure (Q-values), and considering position pairs with q < = 0.1. We adopted a looser threshold for FDR correction, with respect to the enrichment analysis, as by default mutual exclusivity inspected through the Fisher’s exact is an extremely stringent test. For each comparison, we restricted our analysis only to unique samples having mutations to the same class of domain position (i.e., classifying mutations at either conserved or non-conserved positions within the same domain).
We evaluated the enrichment significance for domain positions pairs through a binomial test procedure similar to that used for individual positions, computing the prior probability by randomly shuffling non-synonymous protein variants within members of a particular pathway.
Repeating the same test by considering the entire set of lowest level pathways from Reactome hierarchy [15] led to a moderate decrease of identified positions with no change in the number of RVIS genes (see Table S5b).
Assessing biomolecular consequences of mutations
We predicted functional consequences of COSMIC missense mutations using Mechismo [18] (mechismo.russelllab.org), which matches protein sequence positions to positions within three-dimensional structures and identifies sites affecting known interactions with other proteins, DNA/RNA or small-molecules. We considered medium-high confidence predictions, which include known structures or homologs with > = 30% of sequence identity for protein-protein interactions, > = 35% for protein-chemicals and > = 41% for protein-DNA/RNA interactions (as defined by Mechismo based on a benchmark for the accuracy of perturbed interfaces). We considered any interaction evidence, including interactions from other species, or those coming from indirect experiments (e.g., affinity purifications or co-expression).
Analysis of transcription factor targets
We retrieved a list of transcription factors putative target genes from an R package (https://github.com/slowkow/tftargets), which is a collection of several gene regulatory network experiments (including Chip-Seq data [61–66]). We then assessed the pairwise similarity of transcription factors target genes with a Jaccard score.
Oncodriver co-occurrence and survival time analysis
For each retained domain position, we assessed the co-occurrence of mutations of known oncodrivers from the Cancer Genes Census (CGC) [1]. We considered simple non-synonymous mutations, copy number variants (CNVs) and structural rearrangements for a total of 24k unique samples, corresponding to the 82% of the total.
We considered separately oncogenes and tumor suppressor genes (TSG), and defined a third category, Signalling Oncodrivers, which we generated by mapping all CGC genes to the Reactome “Signal Transduction” (top hierarchy) pathway. We then manually check this list based on literature inspection to include additional candidates and/or remove genes with no role in signal transduction. Mutual exclusivity between domain positions and oncodriver alterations was done through a one tailed Fisher’s exact test, correcting P-values through the FDR procedure (Q-values), considering as significant those instances having a q < 0.1.
We collected information for 17323 donors from release 23 of ICGC (icgc.org), and matched these to the corresponding COSMIC sample. We considered vital status (alive/deceased), disease status (complete remission or not – i.e., partial remission, relapse, progression) and survival time. We used the LogRank test to identify patient groups affected by particular domain positional mutations and having a statistically significant (P < 0.05) difference in survival time. We generated Kaplan-Meier survival analysis plots for the most significant examples. We employed Cox’s proportional hazard model to predict hazard ratios and survival probability of patients affected by interface-perturbing mutations, employing age, sex and cancer type as covariates.
For all the clustering and statistical analysis we used python (www.python.org/) through scipy (www.scipy.org/), statsmodels (statsmodels.sourceforge.net/) and lifelines (lifelines.readthedocs.org/en/latest/) libraries.
TGFα shedding assay
Transforming growth factor-α (TGFα) shedding assay was performed as described previously [34] with minor modifications. HEK293A cells were seeded in a 6-well culture plate (Greiner Bio-One) at a density of 2 × 105 cells per well, with 2 ml of complete DMEM and cultured for 1 day. The cells were transfected with a mixture of plasmids encoding a codon-optimized alkaline phosphatase (AP)-tagged TGFα (AP-TGFα, 1 µg, pCAGGS vector) and a GPCR of interest (200 ng) combined with 5 µl of 10 µl of 1 mg ml−1 PEI solution and 95 µl of Opti-MEM® I Reduced Serum Medium. For some GPCRs, a plasmid encoding a chimeric Gαq/s subunit (100 ng, pCAGGS vector) was co-transfected. After 1-day incubation, the cells were trypsinized with 0.05% trypsin- and 0.53 mM EDTA-containing D-PBS, neutralized with the complete DMEM, collected in a 15-ml tube, centrifuged at 190 g for 5 min, and suspended in 6 ml of pre-warmed HEPES-HBSS. The cell suspension was left for 15-min to settle spontaneous AP-TGFα release caused by trypsinization. Afterward, the cells were centrifuged and suspended in 6 ml of HEPES-HBSS and seeded in a 96-well culture microplate (Greiner Bio-one) at a volume of 80 μl per well. The cell plate was placed in a CO2 incubator for 30 min and mixed with 10 µl of 10X titrated test compounds diluted in 0.01% BSA-containing HEPES-HBSS. After 1-h incubation, the cell plate was centrifuged at 190 g for 2 min, and 80 µl of conditioned media was transferred to an empty 96-well plate (conditioned media (CM) plate). AP reaction solution (10 mM p-nitrophenylphosphate (p-NPP, disodium salt, Wako Pure Chemicals), 120 mM Tris–HCl (pH 9.5), 40 mM NaCl, and 10 mM MgCl2) was dispensed into the cell plate and the CM plate at a volume of 80 µl per well. Absorbance at a wavelength of 405 nm were measured using a microplate reader (SpectraMax 340 PC384, Molecular Devices), before and after a 1 h incubation at room temperature. For each well measurement, change in the absorbance unit (Abs405) during 1 h incubation with p-NPP solution (∆Abs405) was used as a relative amount of AP-TGFα. Relative AP-TGFα in the CM plate was calculated by dividing ∆Abs405 in the CM plate by the total (the CM plate and the cell plate) ∆Abs405, followed by multiplication by a factor of 1.25 (80 µl transferred volume out of total 100 µl). Compound-induced AP-TGFα release was obtained by subtracting spontaneous AP-TGFα release (e.g., vehicle-treated relative AP-TGFα in the CM plate). The AP-TGFα release was fitted to a four-parameter sigmoidal concentration-response curve (Prism 7 software, GraphPad Prism).
cAMP Glosensor assay
An in-house-modified cAMP GlosensorTM assay was performed as follows. HEK293A cells (Thermo Fisher Scientific) were seeded in a 6-cm culture dish (Greiner Bio-One) at a density of 2 × 105 cells per well, with 4 ml of Dulbecco’s Modified Eagle Medium (DMEM), supplemented with 10% fetal calf serum (FCS) and 100 U ml−1 penicillin and 100 µg ml−1 streptomycin (complete DMEM), and cultured for 1 day. Transfection solution was prepared by combining 190 µl of Opti-MEM® I Reduced Serum Medium (Thermo Fisher Scientific), plasmids encoding pGlo-22F (2 µg, codon-optimized for human cell expression, Genscript, pCAGGS vector) and a GPCR of interest (400 ng, pCAGGS vector or pcDNA3.1 vector), and 10 µl of 1 mg ml−1 PEI solution (Polyethylenimine “Max”, (Mw 40,000), Polysciences). The resulting transfection solution was added to the cells. After 1-day culture, the transfected cells were detached with 1 ml of 0.53 mM EDTA-containing Dulbecco’s Phosphate-Buffered Saline (D-PBS), mixed with 2 ml of Hank’s Balanced Salt Solution (HBSS) containing 5 mM HEPES (pH 7.4) (HEPES-HBSS) and centrifuged at 190 g for 5 min. The cell pellet was suspended in 1.2 ml of 0.01% (w/v) bovine serum albumin (BSA, fatty-acid-free and protease-free grade, Serva)-containing HEPES-HBSS and the cell suspension was seeded in a 96-well half-are white microplate (Advanced TC, Greiner Bio-One) at a volume of 30 µl per well. The cells were loaded with D-luciferin (Wako Pure chemicals) at 10 µl of 8 mM solution and incubated at room temperature for 2 h in dark. After measurement of initial luminescent signals with a microplate luminometer (SpectraMax L, Molecular Devices), 10 µl of 5X titrated test compound diluted in 5 µM forskolin were manually added to the cell and incubated at room temperature for 10 min. Afterward, luminescent signals were measured and normalized to the initial counts. The luminescent signals were fitted to a four-parameter sigmoidal concentration-response curve using the Prism 7 software (GraphPad Prism) and the values for pEC50 (equal to −Log10 EC50 [M]) and Emax were calculated from the curve.
Differential expression analysis
We obtained raw read counts for each gene in all cancer types released from TCGA (The Cancer Genome Atlas) before 26th February 2016. To avoid unreliable results, we selected 16 cancer types with at least 10 pairs of tumor-normal tissue matches. We used Deseq2 [67] for differential gene expression analysis in each cancer type and adjusted p-values (padj) for multiple testing by the Benjamini & Hochberg method [60]. We considered only values with corrected p-values (padj or q) < 0.01.
We also performed differential expression analysis of samples carrying GPCR R3.50 or G-protein known activating mutations (i.e., on position G.hfs2.2 on SWI, corresponding to GNAS p.R201, and on G.s3h2.3 on SWII, corresponding to GNAQ/11 p.Q209). For both, we excluded from the control both normal tissue samples as well as samples carrying mutations of the second class (either GPCR or G-protein) to be compared.
We considered the top 3 TCGA cancer types carrying 7 M R3.50 mutations, i.e., skin melanoma (SKCM: 19 samples), uterus endometrial carcinoma (UCEC: 11 samples) and stomach adenocarcinoma (STAD: 10 samples). Only significant genes (q < 0.01) with an absolute Log Fold Change (LFC) > = 2 were retained. Gene enrichment analysis of the overlapping genes between 7TM R3.50 and Gα activating mutations in melanoma was performed through g:Profiler [68] and visualized through Enrichment map [69].
Integrative analysis of GPCR’s activity in cancer
We grouped GPCRs based on their coupling preferences by using primary and secondary coupling information available from the literature (http://www.guidetopharmacology.org/) [35]. We then estimated the activity for each G-protein family by combining differential expression data for G-protein and GPCRs according to available coupling information. We considered the following G-protein groups: Gs (GNAS, GNAL), Gi/o (GNAI1, GNAI2, GNAI3 GNAO1), Gq/11 (GNAQ, GNA14, GNA15) and G12/13 (GNA12, GNA13).
We estimated the activity (AG) of each G-protein family, in a given cancer type, according to the following equation:
1 |
where, m is the base mean of normalized counts for all samples (i.e., cancer and control) used for differential expression analysis, normalized for sequencing depth; s is a scaling factor corresponding to the Log Fold Change (LFC), when significant (i.e., padj < 0.01), alternatively it’s set to 1;
c is a coupling constant set to 1 if the coupling is reported in IUPHAR, elsewhere is 0; n is the number of G-protein for a given family.
For GPCRs, we calculated the fraction of samples containing likely deleterious mutations, i.e., stop gains, frame-shift insertions/deletions or non-synonymous mutations affecting highly conserved 7TM positions. For G-proteins, we calculated the fraction of samples affected by known specific oncogenic mutations.
The column of the matrix in Fig. 4a as well as the top dendrogram were derived from hierarchical, complete-linkage clustering of GPCR coupling group significant LFCs.
Supplementary information
Acknowledgments
Funding
FR, JCG, GS and RBR are supported by the Cell Networks Excellence initiative of the Germany Research Foundation (DFG). FR was supported by an Alexander Von Humboldt post-doctoral fellowship and a Michael J. Fox Foundation research grant. RBR is part of the DFG SFB/TPR186 Molecular Switches in the Spatio-Temporal Control of Cellular Signal Transmission, and a member of the BMBF German Network for Bioinfomatics Infrastructure (de.NBI). AI was funded by Japan Science and Technology Agency (JST) Grant Number JPMJPR1331, Japan Agency for Medical Research and Development (AMED) Grant Number JP17gm5910013 and Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Number 17K08264. JA received funding from AMED Grant Number JP17gm0710001. The authors thank Ayumi Inoue, Naoko Kuroda and Yuko Sugamura (Tohoku University) for plasmid construction and cell culture experiments. We also thank a reviewer for the helpful suggestion to explore the wider set of genes potentially affecting cAMP in cancer.
Author contributions
FR, AI, JSG, and RBR designed the study; FR performed the computational analysis; GS performed mutation clustering; JCGS generated the random models for statistical testing; FR, NS, and BF performed differential expression analysis; AI, FMNK, JA, AAV, and RS provided reagents and performed experimental assays; FR, JSG and RBR wrote the manuscript with input from all the authors
Code availability
Code and datasets to reproduce the analysis available at: http://www.russelllab.org/multigene-oncodriver.
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Footnotes
Lead contact: Francesco Raimondi
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Francesco Raimondi, Asuka Inoue
Deceased Bernd Fischer
Contributor Information
Francesco Raimondi, Email: francesco.raimondi@bioquant.uni-heidelberg.de.
Robert B. Russell, Phone: +49 6221 54 51 362, Email: robert.russell@bioquant.uni-heidelberg.de
Supplementary information
The online version of this article (10.1038/s41388-019-0895-2) contains supplementary material, which is available to authorized users.
References
- 1.Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, et al. A census of human cancer genes. Nat Rev Cancer. 2004;4:177–83. doi: 10.1038/nrc1299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kandoth C, McLellan MD, Vandin F, Ye K, Niu B, Lu C, et al. Mutational landscape and significance across 12 major cancer types. Nature. 2013;502:333–9. doi: 10.1038/nature12634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499:214–8. doi: 10.1038/nature12213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA, Jr., et al. Cancer genome landscapes. Science (80-) 2013;339:1546–58. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bailey MH, Tokheim C, Porta-Pardo E, Sengupta S, Bertrand D, Weerasinghe A, et al. Comprehensive characterization of cancer driver genes and mutations. Cell. 2018;173:371–85.e18. doi: 10.1016/j.cell.2018.02.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Marx V. Cancer: hunting rare somatic mutations. Nat Methods. 2016;134:2016. doi: 10.1038/nmeth.3803. [DOI] [PubMed] [Google Scholar]
- 7.Martincorena Iñigo, Raine Keiran M., Gerstung Moritz, Dawson Kevin J., Haase Kerstin, Van Loo Peter, Davies Helen, Stratton Michael R., Campbell Peter J. Universal Patterns of Selection in Cancer and Somatic Tissues. Cell. 2017;171(5):1029-1041.e21. doi: 10.1016/j.cell.2017.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Creixell P, Reimand J, Haider S, Wu G, Shibata T, Vazquez M, et al. Pathway and network analysis of cancer genomes. Nat Methods. 2015;2:1–6. doi: 10.1038/nmeth.3440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Porta-Pardo Eduard, Kamburov Atanas, Tamborero David, Pons Tirso, Grases Daniela, Valencia Alfonso, Lopez-Bigas Nuria, Getz Gad, Godzik Adam. Comparison of algorithms for the detection of cancer drivers at subgene resolution. Nature Methods. 2017;14(8):782–788. doi: 10.1038/nmeth.4364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Raimondi F, Singh G, Betts MJ, Apic G, Vukotic R, Andreone P, et al. Insights into cancer severity from biomolecular interaction mechanisms. Sci Rep. 2016;6:34490. doi: 10.1038/srep34490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pylayeva-Gupta Y, Grabocka E, Bar-Sagi D. RAS oncogenes: weaving a tumorigenic web. Nat Rev Cancer. 2011;11:761–74. doi: 10.1038/nrc3106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Leiserson MDM, Vandin F, Wu H-T, Dobson JR, Eldridge JV, Thomas JL, et al. Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes. Nat Genet. 2014;47:106–14. doi: 10.1038/ng.3168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu P, Cheng H, Roberts TM, Zhao JJ. Targeting the phosphoinositide 3-kinase pathway in cancer. Nat Rev Drug Discov. 2009;8:627–44. doi: 10.1038/nrd2926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44:D279–85. doi: 10.1093/nar/gkv1344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Fabregat A, Sidiropoulos K, Garapati P, Gillespie M, Hausmann K, Haw R, et al. The Reactome pathway Knowledgebase. Nucleic Acids Res. 2016;44:D481–7. doi: 10.1093/nar/gkv1351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Miller ML, Reznik E, Gauthier NP, Aksoy BA, Korkut A, Gao J, et al. Pan-cancer analysis of mutation hotspots in protein domains. Cell Syst. 2015;1:197–209. doi: 10.1016/j.cels.2015.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Petrovski S, Wang Q, Heinzen EL, Allen AS, Goldstein DB. Genic intolerance to functional variation and the interpretation of personal genomes. PLoS Genet. 2013;9:e1003709. doi: 10.1371/journal.pgen.1003709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Betts MJ, Lu Q, Jiang Y, Drusko A, Wichmann O, Utz M, et al. Mechismo: predicting the mechanistic impact of mutations and modifications on molecular interactions. Nucleic Acids Res. 2015;43:e10. doi: 10.1093/nar/gku1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wang L, Wei D, Huang S, Peng Z, Le X, Wu TT, et al. Transcription factor Sp1 expression is a significant predictor of survival in human gastric cancer. Clin Cancer Res. 2003;9:6371–80. [PubMed] [Google Scholar]
- 20.Küçük C, Iqbal J, Hu X, Gaulard P, De Leval L, Srivastava G, et al. PRDM1 is a tumor suppressor gene in natural killer cell malignancies. Proc Natl Acad Sci USA. 2011;108:20119–24. doi: 10.1073/pnas.1115128108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gordon S, Akopyan G, Garban H, Bonavida B. Transcription factor YY1: structure, function and therapeutic implications in cancer biology. Oncogene. 2006;25:1125–42. doi: 10.1038/sj.onc.1209080. [DOI] [PubMed] [Google Scholar]
- 22.Ballesteros JA, Weinstein H. [19] Integrated methods for the construction of three-dimensional models and computational probing of structure-function relations in G protein-coupled receptors. Methods Neurosci. 1995;25:366–428. doi: 10.1016/S1043-9471(05)80049-7. [DOI] [Google Scholar]
- 23.Flock T, Ravarani CNJ, Sun D, Venkatakrishnan AJ, Kayikci M, Tate CG, et al. Universal allosteric mechanism for Gα activation by GPCRs. Nature. 2015;524:173–9. doi: 10.1038/nature14663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Audet M, Bouvier M. Restructuring G-protein-coupled receptor activation. Cell. 2012;151:14–23. doi: 10.1016/j.cell.2012.09.003. [DOI] [PubMed] [Google Scholar]
- 25.Rosenbaum DM, Rasmussen SGF, Kobilka BK. The structure and function of G-protein-coupled receptors. Nature. 2009;459:356–63. doi: 10.1038/nature08144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Raimondi F, Betts MJ, Lu Q, Inoue A, Gutkind JS, Russell RB. Genetic variants affecting equivalent protein family positions reflect human diversity. Sci Rep. 2017;7:12771. doi: 10.1038/s41598-017-12971-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gelis L, Jovancevic N, Veitinger S, Mandal B, Arndt H-D, Neuhaus EM, et al. Functional characterization of the odorant receptor 51E2 in human melanocytes. J Biol Chem. 2016;291:17772–86. doi: 10.1074/jbc.M116.734517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gelis L, Jovancevic N, Bechara FG, Neuhaus EM, Hatt H. Functional expression of olfactory receptors in human primary melanoma and melanoma metastasis. Exp Dermatol. 2017;26:569–76. doi: 10.1111/exd.13316. [DOI] [PubMed] [Google Scholar]
- 29.Neuhaus EM, Zhang W, Gelis L, Deng Y, Noldus J, Hatt H. Activation of an olfactory receptor inhibits proliferation of prostate cancer cells. J Biol Chem. 2009;284:16218–25. doi: 10.1074/jbc.M109.012096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.O’Hayre M, Vázquez-Prado J, Kufareva I, Stawiski EW, Handel TM, Seshagiri S, et al. The emerging mutational landscape of G proteins and G-protein-coupled receptors in cancer. Nat Rev Cancer. 2013 doi: 10.1038/nrc3521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Scheer A, Costa T, Fanelli F, De Benedetti PG, Mhaouty-Kodja S, Abuin L, et al. Mutational analysis of the highly conserved arginine within the Glu/Asp-Arg-Tyr motif of the alpha(1b)-adrenergic receptor: effects on receptor isomerization and activation. Mol Pharmacol. 2000;57:219–31. [PubMed] [Google Scholar]
- 32.Rochdi MD, Vargas GA, Carpentier E, Oligny-Longpré G, Chen S, Kovoor A, et al. Functional characterization of vasopressin type 2 receptor substitutions (R137H/C/L) leading to nephrogenic diabetes insipidus and nephrogenic syndrome of inappropriate antidiuresis: implications for treatments. Mol Pharmacol. 2010;77:836–45. doi: 10.1124/mol.109.061804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Rovati G. Enrico, Capra Valérie, Neubig Richard R. The Highly Conserved DRY Motif of Class A G Protein-Coupled Receptors: Beyond the Ground State. Molecular Pharmacology. 2006;71(4):959–964. doi: 10.1124/mol.106.029470. [DOI] [PubMed] [Google Scholar]
- 34.Inoue Asuka, Ishiguro Jun, Kitamura Hajime, Arima Naoaki, Okutani Michiyo, Shuto Akira, Higashiyama Shigeki, Ohwada Tomohiko, Arai Hiroyuki, Makide Kumiko, Aoki Junken. TGFα shedding assay: an accurate and versatile method for detecting GPCR activation. Nature Methods. 2012;9(10):1021–1029. doi: 10.1038/nmeth.2172. [DOI] [PubMed] [Google Scholar]
- 35.Southan C, Sharman JL, Benson HE, Faccenda E, Pawson AJ, Alexander SPH, et al. The IUPHAR/BPS Guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000 ligands. Nucleic Acids Res. 2016;44:D1054–68. doi: 10.1093/nar/gkv1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Velazquez EF, Yancovitz M, Pavlick A, Berman R, Shapiro R, Bogunovic D, et al. Clinical relevance of neutral endopeptidase (NEP/CD10) in melanoma. J Transl Med. 2007;5:2. doi: 10.1186/1479-5876-5-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Turan S, Bastepe M. GNAS spectrum of disorders. Curr Osteoporos Rep. 2015;13:146–58. doi: 10.1007/s11914-015-0268-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Menigatti M, Staiano T, Manser CN, Bauerfeind P, Komljenovic A, Robinson M, et al. Epigenetic silencing of monoallelically methylated miRNA loci in precancerous colorectal lesions. Oncogenesis. 2013;2:e56. doi: 10.1038/oncsis.2013.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang Y, Xu G, Chen W, Pan Q, Huang K, Pan J, et al. Detection of long-chain non-encoding RNA differential expression in non-small cell lung cancer by microarray analysis and preliminary verification. Mol Med Rep. 2015;11:1925–32. doi: 10.3892/mmr.2014.2944. [DOI] [PubMed] [Google Scholar]
- 40.De Giorgi Vincenzo, Grazzini Marta, Benemei Silvia, Marchionni Niccolò, Botteri Edoardo, Pennacchioli Elisabetta, Geppetti Pierangelo, Gandini Sara. Propranolol for Off-label Treatment of Patients With Melanoma. JAMA Oncology. 2018;4(2):e172908. doi: 10.1001/jamaoncol.2017.2908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Nilsson MB, Sun H, Diao L, Tong P, Liu D, Li L, et al. Stress hormones promote EGFR inhibitor resistance in NSCLC: Implications for combinations with β-blockers. Sci Transl Med. 2017;9:eaao4307. doi: 10.1126/scitranslmed.aao4307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Johannessen CM, Johnson LA, Piccioni F, Townes A, Frederick DT, Donahue MK, et al. A melanocyte lineage program confers resistance to MAP kinase pathway inhibition. Nature. 2013;504:138–42. doi: 10.1038/nature12688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Mu J, Huang W, Tan Z, Li M, Zhang L, Ding Q, et al. Dopamine receptor D2 is correlated with gastric cancer prognosis. Oncol Lett. 2017;13:1223–7. doi: 10.3892/ol.2017.5573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.González‐Sánchez Juan Carlos, Raimondi Francesco, Russell Robert B. Cancer genetics meets biomolecular mechanism—bridging an age‐old gulf. FEBS Letters. 2018;592(4):463–474. doi: 10.1002/1873-3468.12988. [DOI] [PubMed] [Google Scholar]
- 45.Gussow AB, Petrovski S, Wang Q, Allen AS, Goldstein DB. The intolerance to functional genetic variation of protein domains predicts the localization of pathogenic mutations within genes. Genome Biol. 2016;17:9. 10.1186/s13059-016-0869-4. [DOI] [PMC free article] [PubMed]
- 46.Yu J, Liang QY, Wang J, Cheng Y, Wang S, Poon TCW, et al. Zinc-finger protein 331, a novel putative tumor suppressor, suppresses growth and invasiveness of gastric cancer. Oncogene. 2013;32:307–17. doi: 10.1038/onc.2012.54. [DOI] [PubMed] [Google Scholar]
- 47.Dorsam RT, Gutkind JS. G-protein-coupled receptors and cancer. Nat Rev Cancer. 2007;7:79–94. doi: 10.1038/nrc2069. [DOI] [PubMed] [Google Scholar]
- 48.Patra KC, Kato Y, Mizukami Y, Widholz S, Boukhali M, Revenco I, et al. Mutant GNAS drives pancreatic tumourigenesis by inducing PKA-mediated SIK suppression and reprogramming lipid metabolism. Nat Cell Biol. 2018;20:811–22. doi: 10.1038/s41556-018-0122-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ideno Noboru, Yamaguchi Hiroshi, Ghosh Bidyut, Gupta Sonal, Okumura Takashi, Steffen Dana J., Fisher Catherine G., Wood Laura D., Singhi Aatur D., Nakamura Masafumi, Gutkind J. Silvio, Maitra Anirban. GNASR201C Induces Pancreatic Cystic Neoplasms in Mice That Express Activated KRAS by Inhibiting YAP1 Signaling. Gastroenterology. 2018;155(5):1593-1607.e12. doi: 10.1053/j.gastro.2018.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Nairismägi M-L, Tan J, Lim JQ, Nagarajan S, Ng CCY, Rajasegaran V, et al. JAK-STAT and G-protein-coupled receptor signaling pathways are frequently altered in epitheliotropic intestinal T-cell lymphoma. Leukemia. 2016;30:1311–9. doi: 10.1038/leu.2016.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Muppidi JR, Schmitz R, Green JA, Xiao W, Larsen AB, Braun SE, et al. Loss of signalling via Gα13 in germinal centre B-cell-derived lymphoma. Nature. 2014;516:254–8. doi: 10.1038/nature13765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.O'Hayre M, Inoue A, Kufareva I, Wang Z, Mikelis C M, Drummond R A, Avino S, Finkel K, Kalim K W, DiPasquale G, Guo F, Aoki J, Zheng Y, Lionakis M S, Molinolo A A, Gutkind J S. Inactivating mutations in GNA13 and RHOA in Burkitt’s lymphoma and diffuse large B-cell lymphoma: a tumor suppressor function for the Gα13/RhoA axis in B cells. Oncogene. 2015;35(29):3771–3780. doi: 10.1038/onc.2015.442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Moore AR, Ceraudo E, Sher JJ, Guan Y, Shoushtari AN, Chang MT, et al. Recurrent activating mutations of G-protein-coupled receptor CYSLTR2 in uveal melanoma. Nat Genet. 2016;48:675–80. doi: 10.1038/ng.3549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Demir Hande, Donner Iikki, Kivipelto Leena, Kuismin Outi, Schalin-Jäntti Camilla, De Menis Ernesto, Karhu Auli. Mutation Analysis of Inhibitory Guanine Nucleotide Binding Protein Alpha (GNAI) Loci in Young and Familial Pituitary Adenomas. PLoS ONE. 2014;9(10):e109897. doi: 10.1371/journal.pone.0109897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Tuominen I, Heliövaara E, Raitila A, Rautiainen M-R, Mehine M, Katainen R, et al. AIP inactivation leads to pituitary tumorigenesis through defective Gαi-cAMP signaling. Oncogene. 2015;34:1174–84. doi: 10.1038/onc.2014.50. [DOI] [PubMed] [Google Scholar]
- 56.Forbes SA, Beare D, Boutselakis H, Bamford S, Bindal N, Tate J, et al. COSMIC: somatic cancer genetics at high-resolution. Nucleic Acids Res. 2017;45:D777–83. doi: 10.1093/nar/gkw1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Yates A, Akanni W, Amode MR, Barrell D, Billis K, Carvalho-Silva D, et al. Ensembl 2016. Nucleic Acids Res. 2016;44:D710–6. doi: 10.1093/nar/gkv1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.UniProt. The universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–69. doi: 10.1093/nar/gkw1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Eddy SR. A new generation of homology search tools based on probabilistic inference. Genome Inform. 2009;23:205–11. [PubMed] [Google Scholar]
- 60.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple. Testing. J. R. Stat. Soc. Ser. B. 1995;57:289–300. [Google Scholar]
- 61.Jiang C, Xuan Z, Zhao F, Zhang MQ. TRED: a transcriptional regulatory element database, new entries and other development. Nucleic Acids Res. 2007;35:D137–40. doi: 10.1093/nar/gkl1041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Zheng G, Tu K, Yang Q, Xiong Y, Wei C, Xie L, et al. ITFP: an integrated platform of mammalian transcription factors. Bioinformatics. 2008;24:2416–7. doi: 10.1093/bioinformatics/btn439. [DOI] [PubMed] [Google Scholar]
- 63.Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Neph S, Stergachis AB, Reynolds A, Sandstrom R, Borenstein E, Stamatoyannopoulos JA. Circuitry and dynamics of human transcription factor regulatory networks. Cell. 2012;150:1274–86. doi: 10.1016/j.cell.2012.04.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Han H, Shim H, Shin D, Shim JE, Ko Y, Shin J, et al. TRRUST: a reference database of human transcriptional regulatory interactions. Sci Rep. 2015;5:11432. doi: 10.1038/srep11432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Marbach D, Lamparter D, Quon G, Kellis M, Kutalik Z, Bergmann S. Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat Methods. 2016;13:366–70. doi: 10.1038/nmeth.3799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H, et al. g:Profiler—a web server for functional interpretation of gene lists (2016 update) Nucleic Acids Res. 2016;44:W83–9. doi: 10.1093/nar/gkw199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Merico D, Isserlin R, Stueker O, Emili A, Bader GD. Enrichment map: a network-based method for gene-set enrichment visualization and interpretation. PLoS ONE. 2010;5:e13984. doi: 10.1371/journal.pone.0013984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Fishman P, Bar-Yehuda S, Madi L, Cohn I. A3 adenosine receptor as a target for cancer therapy. Anticancer Drugs. 2002;13:437–43. doi: 10.1097/00001813-200206000-00001. [DOI] [PubMed] [Google Scholar]
- 71.Cohen S, Stemmer SM, Zozulya G, Ochaion A, Patoka R, Barer F, et al. CF102 an A3 adenosine receptor agonist mediates anti-tumor and anti-inflammatory effects in the liver. J Cell Physiol. 2011;226:2438–47. doi: 10.1002/jcp.22593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Jose J, Tavares CDJ, Ebelt ND, Lodi A, Edupuganti R, Xie X, et al. Serotonin analogues as inhibitors of breast cancer cell growth. ACS Med Chem Lett. 2017;8:1072–6. doi: 10.1021/acsmedchemlett.7b00282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.White C, Alshaker H, Cooper C, Winkler M, Pchejetski D. The emerging role of FTY720 (Fingolimod) in cancer treatment. Oncotarget. 2016;7:23106–27. doi: 10.18632/oncotarget.7145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Zatelli MC, Piccin D, Vignali C, Tagliati F, Ambrosio MR, Bondanelli M, et al. Pasireotide, a multiple somatostatin receptor subtypes ligand, reduces cell viability in non-functioning pituitary adenomas by inhibiting vascular endothelial growth factor secretion. Endocr Relat Cancer. 2007;14:91–102. doi: 10.1677/ERC-06-0026. [DOI] [PubMed] [Google Scholar]
- 75.Cives M, Kunz PL, Morse B, Coppola D, Schell MJ, Campos T, et al. Phase II clinical trial of pasireotide long-acting repeatable in patients with metastatic neuroendocrine tumors. Endocr Relat Cancer. 2014;22:1–9. doi: 10.1530/ERC-14-0360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Zhang D, Ma Q-Y, Hu H-T, Zhang M. β2-adrenergic antagonists suppress pancreatic cancer cell invasion by inhibiting CREB, NFκB and AP-1. Cancer Biol Ther. 2010;10:19–29. doi: 10.4161/cbt.10.1.11944. [DOI] [PubMed] [Google Scholar]
- 77.Choy C, Raytis Jl, Smith Dd, Duenas M, Neman J, Jandial R, et al. Inhibition of β2-adrenergic receptor reduces triple-negative breast cancer brain metastases: The potential benefit of perioperative β-blockade. Oncol Rep. 2016;35:3135–42. doi: 10.3892/or.2016.4710. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Sarnataro D, Pisanti S, Santoro A, Gazzerro P, Malfitano AM, Laezza C, et al. The cannabinoid CB1 receptor antagonist rimonabant (SR141716) inhibits human breast cancer cell proliferation through a lipid raft-mediated mechanism. Mol Pharmacol. 2006;70:1298–306. doi: 10.1124/mol.106.025601. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Code and datasets to reproduce the analysis available at: http://www.russelllab.org/multigene-oncodriver.