Significance
Cancer is highly correlated with somatic mutations. Understanding how mutation induces oncogenesis is an important task in cancer research. We proposed a strategy that combined network-based dynamics modeling with structural-based mutation analysis and molecular dynamics simulation to map cancer-related mutations to network dynamics changes via protein–protein interaction kinetics. This approach identifies the oncogenic role of mutations and subsequently determines the causal mechanism of mutation-induced oncogenesis. This work provides a framework for coupling mutation genotype to tumorigenesis phenotype. It is also a step toward seeking more effective anticancer drug targets in cellular pathways.
Keywords: mutation-induced oncogenesis, dynamic bifurcation, parameter sensitivity analysis, mutation enrichment, protein interaction kinetics
Abstract
It has been a consensus in cancer research that cancer is a disease caused primarily by genomic alterations, especially somatic mutations. However, the mechanism of mutation-induced oncogenesis is not fully understood. Here, we used the mitochondrial apoptotic pathway as a case study and performed a systematic analysis of integrating pathway dynamics with protein interaction kinetics to quantitatively investigate the causal molecular mechanism of mutation-induced oncogenesis. A mathematical model of the regulatory network was constructed to establish the functional role of dynamic bifurcation in the apoptotic process. The oncogenic mutation enrichment of each of the protein functional domains involved was found strongly correlated with the parameter sensitivity of the bifurcation point. We further dissected the causal mechanism underlying this correlation by evaluating the mutational influence on protein interaction kinetics using molecular dynamics simulation. We analyzed 29 matched mutant–wild-type and 16 matched SNP—wild-type protein systems. We found that the binding kinetics changes reflected by the changes of free energy changes induced by protein interaction mutations, which induce variations in the sensitive parameters of the bifurcation point, were a major cause of apoptosis pathway dysfunction, and mutations involved in sensitive interaction domains show high oncogenic potential. Our analysis provided a molecular basis for connecting protein mutations, protein interaction kinetics, network dynamics properties, and physiological function of a regulatory network. These insights provide a framework for coupling mutation genotype to tumorigenesis phenotype and help elucidate the logic of cancer initiation.
High-throughput whole-genome sequencing (1) and massively parallel technologies (2) have enabled cancer researchers to understand that cancer is a disease caused primarily by genomic alterations, especially somatic cell mutations (3, 4). This stimulated extensive research endeavors to identify the landscape of cancer-related mutations. In the last decade, studies of different tumor types using cancer genomic and pathology analysis indicate that oncogenic mutations are concentrated primarily in a few core regulatory pathways that govern cell phenotypic behaviors (5–8). These results led to the idea that “genetic aberrations alter normal cellular regulation and then drive tumor development” (9, 10). Therefore, mutation-induced oncogenesis should be explored from a network and system perspective.
A systematic interpretation of oncogenesis can be achieved in part by analyzing interactions among mutated genes and performing functional annotation of cancer-related proteins in functional pathways. A more quantitative approach is network modeling using ordinary differential equations (ODEs) and mathematical simulations. This well-established method has been successfully used to understand cellular dynamic functionalities and network regulatory principles, including the relationship between genetic mutations and oncogenesis (11–15). For example, Stites et al. (14) studied common oncogenic Ras mutations that affect Ras activation. They performed ODE model simulation to investigate the steady-state Ras concentration response to kinetic parameter changes and found a strong correlation between oncogenic mutations and Ras activation level. Chen et al. (15) recently analyzed the p53-induced apoptosis model and found that parameters significantly affecting the system’s bifurcation points corresponded with frequently mutated oncogenic genes. These studies demonstrate the suitability of applying systems-level analysis to understand oncogenesis. However, the limitations of previous analyses are obvious. Only pure network-based dynamics analyses were conducted, in which proteins were abstracted as theoretical nodes, structural details of interactions were ignored, functional processes of protein conformational changes were oversimplified, only correlations between mutations and dynamic properties were analyzed, and the causal molecular clues for the oncogenic function of cancer-related mutations remained unexplored.
In this study, we investigate the causal mechanisms of mutation-induced oncogenesis by integrating network-based dynamics modeling with structural-based mutation analysis and molecular dynamics simulation of protein interactions. This approach enables us to map cancer-related mutations to network dynamics changes via protein–protein interaction kinetics. We propose that cancer-related gene mutations may change relevant protein interaction kinetics in the network, which directly change the network’s model parameters. Subsequently, some of these disturbances may qualitatively change system dynamics behaviors, thereby disrupting normal cellular processes, and ultimately causing cancer.
We used the mitochondrial apoptotic pathway as a case study to validate our hypothesis, because dysfunction of this pathway may lead to many diseases, including cancer (16). After we constructed a mathematical model for the regulatory network based on mass action kinetics, we established the functional role of dynamic bifurcation in apoptosis and used parameter sensitivity analysis to examine the nonuniform effect of parameter variation on the bifurcation point using techniques developed during our previous work (15). We also investigated oncogenic mutations involved in the network by mapping them onto 3D protein structures and identified mutation enrichment for each protein functional domain. We found that cancer-related mutations also have a nonuniform distribution among different functional domains, and this distribution is strongly correlated with the nonuniformity of parameter sensitivity of the bifurcation point. We further dissected the causal mechanism underlying this correlation by evaluating mutational influence on protein interaction kinetics using molecular dynamics simulation. We found that the mutation-induced binding energy change is a major cause of parameter changes in the regulatory network model, and the kinetics changes of protein interactions that induce variations in the bifurcation point sensitive parameters are a major cause of apoptosis pathway dysfunction. We identified the oncogenic role of each mutation by combining the bifurcation point change direction due to parameter perturbation and the binding energy change direction due to mutation disturbance and found that mutations involved in sensitive interactions (corresponding to sensitive parameters of the bifurcation point) are most likely oncogenic.
Our approach, which couples pathway dynamics with protein interaction kinetics, provides a molecular basis for connections among protein mutations, protein interaction kinetics, network dynamics properties, and physiological function of a regulatory network. This work bridges the gap between genotype (mutations) and phenotype (tumorigenesis) and validates the hypothesis that mutation-induced abnormal changes in biological system dynamic properties may be a major cause of cancer initiation and development.
Results
Model Construction and Nonlinear Dynamics Analysis.
Apoptosis is a highly regulated process that is strongly dependent on mitochondrial function (16). We constructed a simplified apoptotic network focused primarily on the regulation of mitochondrial outer membrane permeabilization (MOMP), where cross-talk between intrinsic and extrinsic pathways occurs and multiple apoptotic signals culminate (17). MOMP is a primary cause of cell apoptosis; it leads to irreversible cell death by releasing proapoptotic factors such as cytochrome c and Smac from the mitochondrial intermembrane space and subsequently facilitating caspase activation (18). MOMP is controlled primarily through interactions among Bcl-2 protein family (19). Bcl-2 family members can be classified into two groups: pro- and antiapoptotic factors. The former contains the following three subtypes: the MOMP effectors Bax and Bak; Bax and Bak activators such as Bid and Bim; and sensitizers such as Puma and Noxa. Antiapoptotic members such as Bcl-2, Bcl-xL, and A1 can be considered as inhibitors of Bax and Bak association (20).
We performed extensive data survey and collection and constructed the MOMP core regulatory network triggered by extrinsic stimulus (Fig. 1). During apoptosis, initiator caspases such as caspase8 that have been activated by the death-inducing signaling complex (DISC; not shown in our network) cleave Bid into the active form tBid (21). Then, tBid promotes a series of conformational changes in Bax, resulting in its translocation from the cytosol onto the mitochondrial outer membrane (MOM) (22). Active Bax monomers incrementally form oligomers and create pores in MOM, consequently causing MOMP (23). The antiapoptotic proteins A1 and Bcl-2 bind tBid and Bax in the cytoplasm and on the membrane to inhibit effector activation (24), whereas the membrane-bound Bax dimer undergoes autoactivation to generate a positive feedback loop that enhances oligomer accumulation (25). Puma binds to both A1 and Bcl-2 to neutralize the inhibitors and release the effectors from the protein complex, and Noxa specifically binds to A1 (26).
Fig. 1 is a simplified network with compressions of redundancy (see Supporting Information for details). Compared with previous apoptotic models (15, 27), our model distinguishes the cytoplasmic and membrane binding of antiapoptotic proteins and elaborates the conformational changes and translocation processes of proapoptotic proteins. These processes are essential for triggering MOMP (28) and understanding mitochondrial apoptosis.
We used well-established methods for describing signaling pathways (29) and built a set of ODEs according to Fig. 1 to simulate the MOMP apoptotic network. The model parameters were chosen based on experimental data and reasonable estimates of biochemical constraints (Table S1). We conducted nonlinear dynamics analysis to study the system dynamics behavior. The model input and output was the caspase8 concentration and the Bax oligomer concentration, respectively. The analysis identified a saddle-node bifurcation of Bax oligomer concentration as a function of caspase8 concentration (Fig. 2, black line). There is a bistable regime in which two stable steady states of Bax oligomer concentration coexist (Fig. 2). The low and high branches in the bistable regime indicate that the apoptosis pathway is in the OFF and ON states, respectively. With increasing caspase8 levels, the system changes from a bistable state to a monostable state via the saddle-node bifurcation. When the system is in the OFF state, a sudden increase in Bax oligomer concentration will occur if the active caspase8 concentration increases beyond the bifurcation point. This kind of transition can be regarded as a life-to-death switch of a cell under apoptotic stimulus. Therefore, the location of the bifurcation point (i.e., critical concentration of caspase8) reflects the death threshold.
Table S1.
Parameter | Value | Description | Corresponding interaction | Source |
kdBax | 0.0001 | Degradation rate of activated Bax | bax*_deg | (27) |
kdtBid | 0.0001 | Degradation rate of tBid | tbid_deg | Same as kdBax |
k3 | 0.000015 | Activation rate of Bax by membrane binding tBid | tbidMOM_act_bax | (50) |
k16 | 0.00001 | Activation rate of Bax by membrane binding Bax dimer | baxMOM^2_act_bax | Estimated from k3 |
k2 | 0.0004 | Association rate of membrane binding tBid and Bcl2 | tbidMOM* bcl2 | (41) |
kf2 | 0.002 | Dissociation rate of membrane binding tBid and Bcl2 | tbidMOM − bcl2 | (41) |
k4 | 0.0002 | Association rate of membrane binding Bax and Bcl2 | baxMOM* bcl2 | (41) |
kf4 | 0.003 | Dissociation rate of membrane binding Bax and Bcl2 | baxMOM − bcl2 | (41) |
k12 | 0.0002 | Association rate of membrane binding Bax | baxMOM* baxMOM | Estimated from k4 |
kf12 | 0.003 | Dissociation rate of membrane binding Bax | baxMOM − baxMOM | Estimated from kf4 |
k14 | 0.0002 | Association rate of membrane binding Bax dimer | baxMOM^2* baxMOM^2 | Estimated from k4 |
kf14 | 0.003 | Dissociation rate of membrane binding Bax dimer | baxMOM^2 − baxMOM^2 | Estimated from kf4 |
k18 | 0.0005 | Association rate of membrane binding Puma and Bcl2 | pumaMOM* bcl2 | (41) |
kf18 | 0.002 | Dissociation rate of membrane binding Puma and Bcl2 | pumaMOM − bcl2 | (41) |
k5 | 0.002 | Production rate of Bid | bid_pro | Estimated from (ref. 27) |
kf5 | 0.0001 | Degradation rate of Bid | bid_deg | Same as kdBax |
k6 | 0.02 | Production rate of Bax | bax_pro | Estimated from (ref. 27) |
kf6 | 0.0001 | Degradation rate of Bax | bax_deg | Same as kdBax |
k7 | 0.008 | Production rate of Bcl2 | bcl2_pro | Estimated from (ref. 27) |
kf7 | 0.0001 | Degradation rate of Bcl2 | bcl2_deg | Same as kdBax |
k17 | 0.001 | Production rate of Puma | puma_pro | Estimated |
kf17 | 0.0001 | Degradation rate of Puma | puma_deg | Same as kdBax |
kf8 | 0.0001 | Degradation rate of membrane binding tBid | tbidMOM_deg | Same as kdBax |
kf9 | 0.0001 | Degradation rate of membrane binding Bax | baxMOM_deg | Same as kdBax |
kf10 | 0.0001 | Degradation rate of Bcl2·tBidMOM complex | bcl2·tbidMOM_deg | Same as kdBax |
kf11 | 0.0001 | Degradation rate of Bcl2·BaxMOM complex | bcl2·baxMOM_deg | Same as kdBax |
kf13 | 0.0001 | Degradation rate of Bax dimer | baxMOM·baxMOM_deg | Same as kdBax |
kf15 | 0.0001 | Degradation rate of Bax oligomer | baxMOM^2·baxMOM^2_deg | Same as kdBax |
kf19 | 0.0001 | Degradation rate of Bcl2·tBidMOM complex | bcl2·pumaMOM_deg | Same as kdBax |
kBidon | 0.0002 | Membrane translocation rate of tBid | tbid_onMOM | Estimated from cell size |
kBidoff | 0.00002 | Membrane separation rate of tBid | tbid_offMOM | Estimated from cell size |
kBaxon | 0.0002 | Membrane translocation rate of active Bax | bax*_onMOM | Same as kBidon |
kBaxoff | 0.00002 | Membrane separation rate of active Bax | bax*_offMOM | Same as kBidoff |
kPumaon | 0.0002 | Membrane translocation rate of Puma | puma_onMOM | Same as kBidon |
kPumaoff | 0.00002 | Membrane separation rate of Puma | puma_offMOM | Same as kBidoff |
kdPuma | 0.0001 | Degradation rate of membrane binding Puma | pumaMOM_deg | Same as kdBax |
k24 | 0.0004 | Association rate of A1 and active Bax | A1* bax* | (41) |
kf24 | 0.004 | Dissociation rate of A1 and active Bax | A1 − bax* | (41) |
k25 | 0.0005 | Association rate of A1 and Puma | A1* puma | (41) |
kf25 | 0.002 | Dissociation rate of A1 and Puma | A1 − puma | (41) |
k26 | 0.0002 | Association rate of A1 and tBid | A1* tbid | (41) |
kf26 | 0.001 | Dissociation rate of A1 and tBid | A1 - tbid | (41) |
k27 | 0.0002 | Association rate of A1 and Noxa | A1* noxa | (41) |
kf27 | 0.003 | Dissociation rate of A1 and Noxa | A1 - noxa | (41) |
kA1 | 0.003 | Production rate of A1 | A1_pro | Estimated from (ref. 27) |
kdA1 | 0.0001 | Degradation rate of A1 | A1_deg | Same as kdBax |
kdA1Bax | 0.0001 | Degradation rate of A1·Bax complex | A1·bax_deg | Same as kdBax |
kdAPuma | 0.0001 | Degradation rate of A1·Puma complex | A1·puma_deg | Same as kdBax |
kdA1tBid | 0.0001 | Degradation rate of A1·tBid complex | A1·tbid_deg | Same as kdBax |
kdA1Noxa | 0.0001 | Degradation rate of A1·Noxa complex | A1·noxa_deg | Same as kdBax |
kNoxa | 0.001 | Production rate of Noxa | Noxa_pro | Estimated from (ref. 27) |
kdNoxa | 0.0001 | Degradation rate of Noxa | Noxa_deg | Same as kdBax |
Beta | 2 | Constant of cell volume normalization | — | — |
k1 | 0.0000012 | Activation rate of Bid by caspase8 | — | Estimated from k3 |
C8 | Changing | Concentration of caspase8 | Control parameter | — |
Parameter Sensitivities.
Previous studies established that dynamic bifurcations manifest qualitative changes in biological systems and govern many biological functions (15, 30, 31). In our system, the saddle-node bifurcation provides the mechanism of the life-to-death switch. The location of the bifurcation point determines when a cell undergoes apoptosis; therefore, it determines the cell’s fate. Changes in certain parameters may lead to a large change at the location of the critical point, which can render the system dysfunctional. An example of such an effect in the MOMP network is presented in Fig. 2. Our simulation shows that a 30% reduction in the tBid–Bcl2 dissociation rate in the regulatory network significantly shifts the bifurcation point to the right from the original wild-type location, so that a much stronger caspase8 signal is required to switch on apoptosis. When this parameter is reduced by 70% or more, high levels of Bax oligomer can never be reached even under extremely strong apoptotic stimulus, so cell apoptosis is completely shut off. These cellular conditions provide a possible background for oncogenesis.
The nonuniform effect of parameter variation on dynamic properties has been reported in many biological systems (15, 32, 33). In our model, the effect of parameter changes on the bifurcation point location is not evenly distributed. To identify which parameters have a major impact on the bifurcation point location in our model, we conducted single-parameter sensitivity analysis to the bifurcation point by increasing or decreasing each of the 52 model parameters by 10% and recorded the percentage change of the bifurcation point. A sensitivity spectrum of all parameters was achieved in this way (Fig. 3A). Using the method of k-means clustering, we separated the parameters into the following four clusters according to their impact on the bifurcation point location: most sensitive, sensitive, insensitive, and most insensitive. A more systematic analysis of the model parameter sensitivity (change each parameter by 1%, 5%, 10%, and 15%) shows that the sensitivity patterns are approximately the same among the spectrums obtained through different parameter perturbation degree. Clustering analysis on each spectrum also shows the same cluster cutoff location between different sensitivity clusters (Fig. S1 A and B). A right shift of the bifurcation point suggests an increasing oncogenic potential. Therefore, the direction of parameter perturbation (increasing or decreasing) that leads to a right shift of the critical point is chosen (see the blue and orange bars in Fig. 3A). The direction change resulting from parameters in the most insensitive class is not discriminated, because their influences on the bifurcation point location can be neglected.
To map cancer-related mutations to network dynamic changes via protein interaction kinetics, we narrowed our study to the parameters of protein interactions in which the effect of genetic variation is due primarily to mutations in the corresponding functional domains. We selected a subset of the sensitivity spectrum from the parameter sensitivity spectrum of Fig. 3A in which only the parameters that represent specific protein interactions were chosen (Fig. 4A, Left). The subset includes the parameters of protein–protein binding and activation rate (Fig. 3B) and the parameters of membrane localization rate (Fig. 3C) but excludes the parameters of protein production and degradation rates (Fig. 3D). The latter can be affected by many mechanisms, such as mutations in promoter regions, transcription factors, chromosome translocation, and proteasome; these parameters cannot be explained simply by the kinetics of protein interactions. This selection allowed us to evaluate mutational influence on protein interaction kinetics using mutation domain mapping and molecular dynamics simulation, through which the parameter sensitivity of the bifurcation point and cancer-related mutations can be linked on a molecular basis.
Cancer-Related Mutation Enrichment.
To characterize the mutation pattern in our model and its association with the parameter sensitivity spectrum, we manually collected cancer-related mutations from the Catalogue of Somatic Mutations in Cancers (COSMIC) database (34) and from the literature (35). We focused primarily on missense mutations and ignored all SNPs nominated in the dbSNP database (36) (Table S2). Our analysis is different from previous studies that simply counted the total mutation number for each protein (15). For a given protein interaction (corresponding to the parameters in Fig. 4A, Left), we first distinguished functional and nonfunctional domains for involved proteins according to their structural information and then mapped all mutations to the functional and nonfunctional domains. Details regarding the basis and definition of functional and nonfunctional domains are given in Supporting Information and Table S3.
Table S2.
Gene | Transcript | Source | Mutation | Count | Amino acid position |
BID(195) | ENST00000317361(241) | COSMIC | C49Y | 1 | 5 |
G54D | 1 | 10 | |||
M143V | 1 | 99 | |||
S165L | 1 | 121 | |||
N170K | 1 | 126 | |||
Y184N | 1 | 140 | |||
K203N | 1 | 159 | |||
R214H | 1 | 170 | |||
R237K | 1 | 193 | |||
D141N | 1 | 97 | |||
BAX(192) | ENST00000345358(192) | COSMIC | G40E | 1 | 40 |
F93L | 1 | 93 | |||
L113P | 1 | 113 | |||
S118I | 1 | 118 | |||
L125M | 1 | 125 | |||
I133M | 1 | 133 | |||
L162I | 1 | 162 | |||
ENST00000293288(218) | G10D | 1 | 10 | ||
A24V | 1 | 24 | |||
W139C | 1 | 139 | |||
ENST00000345358(192) | K58N | 1 | 58 | ||
D68V | 1 | 68 | |||
(35) | F165L | 1 | 165 | ||
169 | 4 | 169 | |||
PUMA(BBC3)(193) | ENST00000439096 | COSMIC | T52P | 1 | 52 |
L179P | 1 | 179 | |||
P165L | 1 | 165 | |||
H185Y | 1 | 185 | |||
A187T | 1 | 187 | |||
BCL2(239) | BCL2 | COSMIC | A2T | 2 | 2 |
A2V | 1 | 2 | |||
H3D | 1 | 3 | |||
A4T | 1 | 4 | |||
A4P | 1 | 4 | |||
G5V | 1 | 5 | |||
R6I | 1 | 6 | |||
T7K | 1 | 7 | |||
T7R | 1 | 7 | |||
T7I | 2 | 7 | |||
Y9F | 1 | 9 | |||
N11Y | 1 | 11 | |||
N11K | 1 | 11 | |||
E13D | 2 | 13 | |||
I14M | 1 | 14 | |||
M16V | 1 | 16 | |||
M16K | 1 | 16 | |||
M16R | 1 | 16 | |||
K17M | 1 | 17 | |||
K17N | 1 | 17 | |||
K17N | 1 | 17 | |||
H20Q | 1 | 20 | |||
H20Q | 2 | 20 | |||
Y21H | 1 | 21 | |||
Y21C | 1 | 21 | |||
G27D | 1 | 27 | |||
Y28F | 1 | 28 | |||
D31N | 1 | 31 | |||
D31H | 1 | 31 | |||
A32T | 1 | 32 | |||
G33E | 1 | 33 | |||
D34G | 1 | 34 | |||
V35M | 1 | 35 | |||
A37V | 1 | 37 | |||
A43V | 1 | 43 | |||
A45S | 1 | 45 | |||
P46A | 2 | 46 | |||
G47D | 2 | 47 | |||
I48V | 1 | 48 | |||
I48F | 1 | 48 | |||
F49L | 2 | 49 | |||
F49S | 1 | 49 | |||
F49C | 1 | 49 | |||
S50P | 1 | 50 | |||
S51P | 1 | 51 | |||
S51Y | 1 | 51 | |||
Q52R | 1 | 52 | |||
P53S | 1 | 53 | |||
P57A | 1 | 57 | |||
P57S | 1 | 57 | |||
P59L | 2 | 59 | |||
A60T | 1 | 60 | |||
A60D | 1 | 60 | |||
A60G | 1 | 60 | |||
A60V | 4 | 60 | |||
A61T | 1 | 61 | |||
P65S | 1 | 65 | |||
V66I | 1 | 66 | |||
T69P | 1 | 69 | |||
T69A | 1 | 69 | |||
P71S | 1 | 71 | |||
T74I | 1 | 74 | |||
A76T | 1 | 76 | |||
A80V | 1 | 80 | |||
A85T | 1 | 85 | |||
L86F | 1 | 86 | |||
S87N | 2 | 87 | |||
S87R | 1 | 87 | |||
F104L | 1 | 104 | |||
F104L | 1 | 104 | |||
R107L | 1 | 107 | |||
Y108S | 1 | 108 | |||
M115I | 1 | 115 | |||
L119V | 1 | 119 | |||
T125S | 1 | 125 | |||
R129H | 1 | 129 | |||
A131G | 1 | 131 | |||
A131V | 4 | 131 | |||
T132S | 1 | 132 | |||
E135D | 1 | 135 | |||
N143S | 1 | 143 | |||
R146K | 2 | 146 | |||
V156L | 1 | 156 | |||
V156A | 1 | 156 | |||
V159A | 1 | 159 | |||
R164Q | 1 | 164 | |||
E165K | 1 | 165 | |||
S167A | 1 | 167 | |||
P168R | 1 | 168 | |||
L169Q | 1 | 169 | |||
N172S | 1 | 172 | |||
E179D | 1 | 179 | |||
T187I | 2 | 187 | |||
R207W | 1 | 207 | |||
A224S | 1 | 224 | |||
Bfl1/A1(175) | ENST0000267953 | COSMIC | C55R | 1 | 55 |
Q73H | 1 | 73 | |||
K116T | 1 | 116 | |||
Y120H | 1 | 120 | |||
A123V | 1 | 123 | |||
F125L | 1 | 125 | |||
I134R | 1 | 134 | |||
E141A | 1 | 141 | |||
G143A | 1 | 143 | |||
M155I | 1 | 155 | |||
T156A | 1 | 156 | |||
E159D | 1 | 159 | |||
T161S | 1 | 161 | |||
Noxa (54) | COSMIC | — | — | — |
Table S3.
Protein | Full length | BH4 | BH3 | BH1 | BH2 | TM | BH3-binding |
BAX | 192 | 20–37 | 59–73 | 98–118 | 150–164 | 169–192 | 66–67,69–70,76,79–80,90–91,94–95,98–99,106,108–109,111–112,116,120,164 |
BCL2 | 239 | 10–30 | 93–107 | 136–155 | 187–202 | 212–233 | 100–101,103–104,108,111–112,128–129,132–133,136–137,143,145–146,148–149,153,157,201 |
BID | 195 | — | 86–100 | — | — | 145–192 | — |
PUMA | 193 | — | 141–149 | — | — | 150–193 | — |
Bfl1/A1 | 175 | 10–20 | 37–45 | 77–96 | 132–146 | — | 40–41,43–44,48,51–52,69–70,73–74,77–78,85,87–88,90–91,95,99 |
NOXA | 54 | — | 29–37 | — | — | 41–50 | — |
Numbers in the table indicate the amino acid positions.
To quantify the cancer mutation distribution, we calculated the mutation enrichment for each of the functional domains (Table S4). Mutation enrichment for a functional domain is defined as , where and denote, respectively, the number of mutations and the amino acid length in the functional domain i; and denote, respectively, the number of mutations and the amino acid length in nonfunctional domains of the corresponding proteins. This definition of mutation enrichment tends to normalize different functional domain mutations in different proteins.
Table S4.
Protein | interaction | Functional domain region for interaction (amino acids) | Mi | Li | Mnon_fun | Lnon_fun | Mutation enrichment |
BID | Bid · A1 | 90–98 | 2 | 9 | 5 | 130 | 4.412698413 |
Bid · Bcl2 | 90–98 | 2 | 9 | 4.412698413 | |||
Bid_MOM | 145–192 | 3 | 48 | 1.390625 | |||
BAX | Bax · A1 | 57–82 | 2 | 26 | 3 | 40 | 1.015384615 |
Bax · Bax | 54–122 | 5 | 69 | 0.987318841 | |||
Bax_MOM | 168–192 | 4 | 25 | 1.485714286 | |||
Bax · Bcl2 | 57–82 | 2 | 26 | 1.015384615 | |||
(Bax)2 · (Bax)2 | 107–147 | 3 | 48 | 0.916666667 | |||
BCL2 | Bcl2 · Bid | 100–157 | 20 | 58 | 24 | 70 | 1.003134796 |
Bcl2 · Bax | 100–157 | 20 | 58 | 1.003134796 | |||
Bcl2 · Puma | 100–157 | 20 | 58 | 1.003134796 | |||
PUMA | Puma · A1 | 131–153 | 0 | 23 | 1 | 140 | 0 |
Puma · Bcl2 | 141–149 | 0 | 9 | 0 | |||
Puma_MOM | 150–192 | 4 | 43 | 3.404651163 | |||
A1 | A1 · Bax | 40–99 | 2 | 60 | 9 | 70 | 0.393939394 |
A1 · Puma | 40–99 | 2 | 60 | 0.393939394 | |||
A1 · Noxa | 40–99 | 2 | 60 | 0.393939394 | |||
A1 · Bid | 40–99 | 2 | 60 | 0.393939394 | |||
NOXA | Noxa · A1 | 29–37 | 0 | 9 | 0 | 35 | 0 |
The spectrum of the enrichment value is listed in Fig. 4A, Right. For each protein interaction (e.g., interaction of protein A and B), the enrichment value for a domain in protein A and a domain in protein B is evaluated separately. We also calculated the combined enrichment of protein A and protein B to characterize the whole interaction (see the definition of the combined enrichment in Supporting Information). Naturally, E = 1 can be considered as a separation of low- and high-density oncogenic mutations in a given protein interaction domain of a given protein; a domain with E > 1 indicates that more oncogenic mutations are concentrated in this protein interaction domain than in nonfunctional domains of the given protein. This domain mapping more accurately reflects the correspondence between cancer mutations and model parameters.
Comparison Between Parameter Sensitivity and Mutation Enrichment.
The sensitivity spectrum of the subset parameters can be clearly grouped into three parameter clusters: sensitive, insensitive, and most insensitive (Fig. 4A, Left). Each cluster covers distinct features. Most parameters in the first cluster (sensitive) correspond to interactions of apoptotic regulators on MOM, such as Bcl2, BaxMOM and tBidMOM, which are indispensable and dominant processes in the network. The parameters depicting the interactions, in which cytosolic reactants and some BH3 proteins with redundant functions participate, belong to the last two clusters (insensitive and most insensitive). Comparing Fig. 4A, Left and Right, we found good correlation between the parameter sensitivity of the bifurcation point and the mutation enrichment of each corresponding functional domain. Specifically, most of the sensitive parameters correspond to functional domains with high mutation enrichment, whereas the domains corresponding to insensitive parameters have relatively low mutation enrichment. A clear correlation between parameter sensitivity and combined mutation enrichment is shown in Fig. 4B. With increasing value of parameter sensitivity, the corresponding combined mutation enrichment generally tends to increase, which indicates that sensitive interactions tend to carry relatively greater numbers of mutations.
This correlation pattern is in agreement with most experimental results reported in the literature. For example, mitochondrial localization and insertion of Bax with its C-terminal transmembrane (TM) region (168−193 aa) (the corresponding parameter in our model is Bax*_onMOM) is a necessary step for MOMP and apoptotic induction (37). Schinzel et al. (38) proved that removal of Bax C-terminal or substitution of residue 168 in HeLa cells attenuated Bax localization at the mitochondria and apoptotic function. Gil et al. (39) reported that mutation of Bax residue 169, which was identified as a mutation hot spot in gastrointestinal cancer by Yamamoto et al. (35), reduced cell death by disrupting Bax membrane location. The loss-of-function mutations in Bax C-terminal TM region reduce the translocation rate of activated Bax. Our sensitivity analysis showed that Bax*_onMOM is indeed a sensitive parameter; a slight decrease in Bax*_onMOM shifts the bifurcation point to the right. Consistent with the result identifying Bax as a sensitive parameter, relatively high mutation enrichment was identified in the Bax C-terminal TM domain. Nevertheless, we found some exceptions. Sensitivity analysis showed that the parameters describing Puma membrane translocation are really insensitive, whereas the mutation enrichment of the corresponding domain (Puma_onMOM: 1.7) is much greater than that for Bax (Bax homodimerization, 0.99), whose dimerization is identified as a sensitive interaction. This might result from the complexity of Bax–Bax interactions. On one hand, multiconformational changes during Bax functioning may bring uncertainty and overlap into functional domains of homodimerization and oligomerization, which made it difficult to map mutations clearly. However, for Puma membrane translocation, the functional domain (C-terminal transmembrane domain) is more independent and clear. On the other hand, Bax has different protein moieties that participate in many more interactions than other apoptotic regulators, so the nonfunctional domain is much shorter. This may help to understand why mutation enrichments for Bax-involved interactions are approximately the same and close to 1 (Bax homodimerization, 0.99; Bax oligomerization, 0.92).
The results presented in Fig. 4 support the previous observation (15) that the distribution of oncogenic mutations and the parameter sensitivity are closely related. Our analysis clearly shows that mutations observed in tumor samples tend to be located in specific protein functional domains corresponding to the sensitive parameters of the bifurcation point.
Molecular Dynamics Simulation and Calculation.
To dissect the mechanism underlying the correlation between oncogenic mutation distribution and parameter sensitivity, we conducted molecular dynamics simulations to compute protein–protein interaction kinetics. We aimed to answer two questions. How do cancer mutations cause alterations in the parameter sensitivities? How is this effect coordinated with shifts in the bifurcation point?
The Gibbs free energy change accompanying a protein–protein interaction can be represented by its equilibrium constant KD, which can be linked to the kinetic parameters in the dynamic model [based on ]. If a mutation causes a change in the free energy change of a protein–protein interaction, then the equilibrium constant KD changes accordingly. Therefore, a logic chain connecting oncogenic mutations and parameter sensitivity can be established by mutation-induced change of free energy change in the protein–protein interaction. We propose that mutation-induced changes in free energy change are one of the major causes of parameter sensitivity perturbations. These perturbations may cause a right shift of the bifurcation point in Fig. 2, which predisposes the cell to a nonapoptotic state.
To obtain the value of between wild-type and mutant protein systems, we built eight pairs of wild type−protein–protein interaction models (heterodimers in Fig. 1) based on the structural information for the corresponding proteins. Using wild-type models, we constructed single-mutation models according to the collected cancer-related mutations in corresponding functional domains. We also mutated each structure according to missense SNPs involved in the corresponding functional domain as controls. Therefore, we built 26 cancer-related mutations, 16 SNPs, and 8 wild-type complexes (Supporting Information). The free energy calculations were achieved using molecular dynamics simulation and molecular mechanics/generalized Born surface area (MM/GBSA) methods (see Supporting Information for details). We did not perform calculations of the entropy contribution to binding due to resource limitation; therefore, strictly speaking, our result is the binding energy of the protein complex rather than the Gibbs free energy change (Table S5). However, this simplification is not a limitation because our goal is to obtain the binding energy change between wild-type proteins and those with single point mutations, which are very similar in structure. In most cases, the entropy difference can be ignored.
Table S5.
Protein complex | WT bax·bax | baxS118I | baxD68V | baxF93L | baxK58N | |
∆G of trajectory 1 | −193.1331 | −183.1156 | −154.0123 | −177.2145 | −198.092 | |
∆G of trajectory 2 | −194.0678 | −189.7092 | −151.1865 | −175.557 | −177.572 | |
∆G of trajectory 3 | −191.771 | −188.1804 | −177.8062 | −190.028 | −169.944 | |
Protein complex | WT bcl2·puma | bcl2R107L | bcl2A131V | bcl2A131G | ||
∆G of trajectory 1 | −106.9752 | −56.357 | −83.7189 | −96.0202 | ||
∆G of trajectory 2 | −116.1052 | −68.2611 | −105.6579 | −93.209 | ||
∆G of trajectory 3 | −108.2349 | −68.5743 | −82.2002 | −93.5993 | ||
Protein complex | WT bcl2·bid | bcl2R107L | bcl2A131V | bcl2A131G1 | bidD97N | bidM99V |
∆G of trajectory 1 | −58.5852 | −60.2434 | −75.2826 | −76.3556 | −76.507 | −67.1649 |
∆G of trajectory 2 | −69.4223 | −58.2829 | −72.5369 | −75.6281 | −70.8219 | −68.4519 |
∆G of trajectory 3 | −57.6667 | −59.4039 | −81.4395 | −91.0749 | −67.042 | −69.7464 |
Protein complex | WT bcl2·bax | bcl2R107L | bcl2A131V | bcl2A131G | baxD68V | baxK58N |
∆G of trajectory 1 | −93.5418 | −117.5711 | −93.0258 | −110.895 | −81.2849 | −97.2342 |
∆G of trajectory 2 | −95.9702 | −114.6753 | −100.4226 | −95.7451 | −93.5512 | −109 |
∆G of trajectory 3 | −91.0736 | −98.8869 | −97.2659 | −100.4194 | −78.81 | −129.729 |
Protein complex | WT A1·puma | A1C55R | A1Q73H | |||
∆G of trajectory 1 | −73.1556 | −63.411 | −77.0432 | |||
∆G of trajectory 2 | −76.7076 | −77.1824 | −76.7278 | |||
∆G of trajectory 3 | −82.5216 | −76.0922 | −75.5956 | |||
Protein complex | WT A1·noxa | A1C55R | A1Q73H | |||
∆G of trajectory 1 | −63.9573 | −98.7571 | −69.7059 | |||
∆G of trajectory 2 | −64.5644 | −85.3925 | −70.6175 | |||
∆G of trajectory 3 | −58.2384 | −87.3995 | −84.0773 | |||
Protein complex | WT A1·bax | A1C55R | A1Q73H | baxD68V1 | baxK58N | |
∆G of trajectory 1 | −64.3198 | −76.6675 | −94.6449 | −51.8972 | −71.4983 | |
∆G of trajectory 2 | −64.2426 | −71.0395 | −85.4226 | −55.4843 | −85.5556 | |
∆G of trajectory 3 | −70.9896 | −83.6728 | −109.554 | −62.7273 | −78.9105 | |
Protein complex | WT A1·bid | A1C55R | A1Q73H | bidD97N | bidM99V | |
∆G of trajectory 1 | −65.378 | −61.0498 | −64.8171 | −70.1003 | −55.2251 | |
∆G of trajectory 2 | −67.5395 | −66.2054 | −60.9238 | −79.5466 | −65.5695 | |
∆G of trajectory 3 | −73.2036 | −61.2107 | −54.434 | −67.6732 | −59.3677 | |
Protein complex | SNP bax·bax | baxR65L | baxR89Q | |||
∆G of trajectory 1 | −158.8016 | −180.3216 | ||||
∆G of trajectory 2 | −186.3847 | −182.5172 | ||||
∆G of trajectory 3 | −170.5732 | −182.0094 | ||||
Protein complex | SNP bcl2·puma | bcl2F124L | bcl2T122M | |||
∆G of trajectory 1 | −92.1748 | −83.5314 | ||||
∆G of trajectory 2 | −88.2984 | −101.1626 | ||||
∆G of trajectory 3 | −94.1378 | −88.0792 | ||||
Protein complex | SNP bcl2·bid | bcl2F124L | bcl2T122M | bidA93T | ||
∆G of trajectory 1 | −52.3385 | −59.4249 | −65.1373 | |||
∆G of trajectory 2 | −60.2012 | −68.1515 | −60.843 | |||
∆G of trajectory 3 | −56.6222 | −64.3837 | −74.2727 | |||
Protein complex | SNP bcl2·bax | bcl2F124L | bcl2T122M | baxR65L | ||
∆G of trajectory 1 | −100.7653 | −99.4888 | −99.1669 | |||
∆G of trajectory 2 | −99.6803 | −100.5259 | −98.7048 | |||
∆G of trajectory 3 | −100.8657 | −103.0382 | −83.408 | |||
Protein complex | SNP A1·puma | A1N58S | ||||
∆G of trajectory 1 | −86.1405 | |||||
∆G of trajectory 2 | −75.1161 | |||||
∆G of trajectory 3 | −75.8819 | |||||
Protein complex | SNP A1·noxa | A1N58S | ||||
∆G of trajectory 1 | −62.7148 | |||||
∆G of trajectory 2 | −74.0888 | |||||
∆G of trajectory 3 | −69.4577 | |||||
Protein complex | SNP A1·bax | A1N58S | baxR65L | |||
∆G of trajectory 1 | −86.4479 | −64.2426 | ||||
∆G of trajectory 2 | −81.2702 | −67.6373 | ||||
∆G of trajectory 3 | −80.6982 | −76.3291 | ||||
Protein complex | SNP A1·bid | A1N58S | bidA93T | |||
∆G of trajectory 1 | −76.2181 | −79.4234 | ||||
∆G of trajectory 2 | −71.1144 | −85.7599 | ||||
∆G of trajectory 3 | −62.8902 | −87.2638 |
The energy unit is kilocalories per mole.
The calculation results for matched mutant/SNP–wild type are displayed in Fig. 5A, Right. In view of the parameter sensitivity of the bifurcation point (Fig. 5A, Left), the eight interactions were classified into three groups: sensitive, insensitive, and most insensitive (see different color regions in the Fig. 5A, Right). Mutations and SNPs involved in the same interaction are aligned in the same line.
To identify the oncogenic role of each mutation, we classified all mutations and SNPs into three categories designated as Right, Wrong, and Ordinary according to the mutational effect on the consistency between binding energy change and parameter perturbation direction, which are colored green, red, and blue, respectively (Fig. 5A, Right). The Right mutation is a mutation that produces a significant binding energy change, perturbs (increase or decrease) a parameter, and subsequently induces a right shift in the bifurcation point. We designate these mutations as Right because the mutation shows an oncogenic function by shifting the bifurcation point to a higher death threshold, consistent with our theoretical prediction. The Wrong mutation also produces a significant binding energy change and perturbs a parameter, but it induces a left shift in the bifurcation point. The Ordinary mutation does not produce a significant binding energy change and has no effect on the bifurcation point.
Parameter sensitivity analysis of the bifurcation point identifies two types of parameter perturbation that can induce a right shift of the bifurcation point: those that increase KD (orange bar in Fig. 5A, Left), and those that decrease KD (blue bar in Fig. 5A, Left). Therefore, based on , in cases of increase KD, a Right mutation should satisfy ; otherwise it is a “Wrong” mutation. In cases of decrease KD, a Right mutation should satisfy , otherwise it is a Wrong mutation. For mutations involved in the most insensitive group, we only calculated the absolute value of owing to the undefinable sensitivity direction (Table S6).
Table S6.
Protein complex | WT Bax·Bax | Bax S118I | Bax D68V | Bax F93L | Bax K58N | |
Average ∆G | −192.9906333 | −187.002 | −161.0016667 | −180.6 | −181.3 | |
Variation | 1.155008711 | 3.451207 | 14.62157795 | 7.91984 | 14.557728 | |
∆∆G | −5.9889 | −31.98896663 | −12.391 | −11.69063 | ||
Abs(∆∆G) | 5.9889 | 31.98896663 | 12.3906 | 11.690633 | ||
Significance | 3.685792 | 24.1006733 | 7.85321 | 3.8342649 | ||
Protein complex | WT Bcl2·Puma | Bcl2 R107L | Bcl2 A131V | Bcl2 A131G | ||
Average ∆G | −110.4384333 | −64.3975 | −90.52566667 | −94.146 | ||
Variation | 4.947817131 | 6.965009 | 13.12687993 | 1.52293 | ||
∆∆G | −46.0869 | −19.95866633 | −16.338 | |||
Abs(∆∆G) | 46.08687 | 19.95866633 | 16.3383 | |||
Significance | 40.13043 | 10.92129071 | 13.1029 | |||
Protein complex | WT Bcl2·Bid | Bcl2 R107L | Bcl2 A131V | Bcl2 A131G1 | Bid D97N | Bid M99v |
Average ∆G | −61.8914 | −59.3101 | −76.41966667 | −81.02 | −71.45697 | −68.4544 |
Variation | 6.538100005 | 0.983613 | 4.55892116 | 8.7158 | 4.7643508 | 1.290751816 |
∆∆G | −2.58133 | 14.52826667 | 19.1281 | 9.5655667 | 6.563 | |
Abs(∆∆G) | 2.581333 | 14.52826667 | 19.1281 | 9.5655667 | 6.563 | |
Significance | −1.17952 | 8.979756084 | 11.5012 | 3.9143413 | 2.64857409 | |
Protein complex | WT Bcl2·Bax | Bcl2 R107L | Bcl2 A131V | Bcl2 A131G | Bax D68V | Bax K58N |
Average ∆G | −93.52853333 | −110.378 | −96.90476667 | −102.35 | −84.5487 | −111.9878667 |
Variation | 2.448326958 | 10.05616 | 3.71160013 | 7.75786 | 7.8939874 | 16.45229295 |
∆∆G | 16.84923 | 3.376233333 | 8.82463 | −8.979833 | 18.45933333 | |
Abs(∆∆G) | 16.84923 | 3.376233333 | 8.82463 | 8.9798333 | 18.45933333 | |
Significance | 10.59699 | 0.296269789 | 3.72154 | 3.8086762 | 9.00902338 | |
Protein complex | WT A1·Puma | A1 C55R | A1 Q73H | |||
Average ∆G | −77.4616 | −72.2285 | −76.45553333 | |||
Variation | 2.511643287 | 7.655639 | 0.761238 | |||
∆∆G | −5.23307 | −1.006066667 | ||||
Abs(∆∆G) | 5.233067 | 1.006066667 | ||||
Significance | 0.149426 | −0.630373977 | ||||
Protein complex | WT A1·Noxa | A1 C55R | A1 Q73H | |||
Average ∆G | −62.25336667 | −90.5164 | −74.80023333 | |||
Variation | 3.49028804 | 7.206891 | 8.047094388 | |||
∆∆G | 28.263 | 12.54686667 | ||||
Abs(∆∆G) | 28.263 | 12.54686667 | ||||
Significance | 22.91441 | 6.778175452 | ||||
Protein complex | WT A1·Bax | A1 C55R | A1 Q73H | Bax D68V | Bax K58N | |
Average ∆G | −66.51733333 | −77.1266 | −96.5405 | −56.703 | −78.6548 | |
Variation | 3.873288888 | 6.329151 | 12.17686704 | 5.51693 | 7.0321375 | |
∆∆G | 10.60927 | 30.02316667 | −9.8144 | 12.137467 | ||
Abs(∆∆G) | 10.60927 | 30.02316667 | 9.8144 | 12.137467 | ||
Significance | 5.508047 | 21.9980887 | 5.11929 | 6.6847535 | ||
Protein complex | WT A1·Bid | A1 C55R | A1 Q73H | Bid D97N | Bid M99V | |
Average ∆G | −68.70703333 | −62.822 | −60.0583 | −72.44 | −60.0541 | |
Variation | 4.041330771 | 2.931243 | 5.245379785 | 6.27297 | 5.2062474 | |
∆∆G | −5.88507 | −8.648733333 | 3.733 | −8.652933 | ||
Abs(∆∆G) | 5.885067 | 8.648733333 | 3.733 | 8.6529333 | ||
Significance | 2.39878 | 4.005378055 | −1.4242 | 4.0291443 | ||
Protein complex | SNP Bax·Bax | Bax R65L | Bax R89Q | |||
Average ∆G | −171.9198333 | −181.62 | ||||
Variation | 13.84077012 | 1.14943 | ||||
∆∆G | −21.07079997 | −11.375 | ||||
Abs(∆∆G) | 21.07079997 | 11.3746 | ||||
Significance | 13.57291055 | 10.2224 | ||||
Protein complex | SNP Bcl2·Puma | Bcl2 F124L | Bcl2 T122M | |||
Average ∆G | −91.537 | −90.924 | ||||
Variation | 2.971487796 | 9.15348 | ||||
∆∆G | −18.947333 | −19.56 | ||||
Abs(∆∆G) | 18.947333 | 19.5599 | ||||
Significance | 14.98765345 | 12.5093 | ||||
Protein complex | SNP Bcl2·Bid | Bcl2 F124L | Bcl2 T122M | Bid A93T | ||
Average ∆G | −56.3873 | −63.987 | −66.751 | |||
Variation | 3.936609751 | 4.37682 | 6.8587339 | |||
∆∆G | −5.5041 | 2.0953 | 4.8596 | |||
Abs(∆∆G) | 5.5041 | 2.0953 | 4.8596 | |||
Significance | 0.266745122 | −3.3622 | −1.838817 | |||
Protein complex | SNP Bcl2·Bax | Bcl2 F124L | Bcl2 T122M | Bax R65L | ||
Average ∆G | −100.4371 | −101.02 | −93.7599 | |||
Variation | 0.657327711 | 1.82508 | 8.9679852 | |||
∆∆G | 6.908566667 | 7.4891 | 0.2313667 | |||
Abs(∆∆G) | 6.908566667 | 7.4891 | 0.2313667 | |||
Significance | 5.355739332 | 5.3524 | −5.476789 | |||
Protein complex | SNP A1·Puma | A1 N58S | ||||
Average ∆G | −79.04616667 | |||||
Variation | 6.155792922 | |||||
∆∆G | 1.584566667 | |||||
Abs(∆∆G) | 1.584566667 | |||||
Significance | −2.749151438 | |||||
Protein complex | SNP A1·Noxa | A1 N58S | ||||
Average ∆G | −68.75376667 | |||||
Variation | 5.719581331 | |||||
∆∆G | 6.5004 | |||||
Abs(∆∆G) | 6.5004 | |||||
Significance | 1.895465314 | |||||
Protein complex | SNP A1·Bax | A1 N58S | Bax R65L | |||
Average ∆G | −82.80543333 | −69.403 | ||||
Variation | 3.16740723 | 6.23371 | ||||
∆∆G | 16.2881 | 2.88567 | ||||
Abs(∆∆G) | 16.2881 | 2.88567 | ||||
Significance | 12.76775194 | −2.1678 | ||||
Protein complex | SNP A1·Bid | A1 N58S | Bid A93T | |||
Average ∆G | −70.07423333 | −84.149 | ||||
Variation | 6.724558694 | 4.16103 | ||||
∆∆G | 1.3672 | 15.442 | ||||
Abs(∆∆G) | 1.3672 | 15.442 | ||||
Significance | −4.015744732 | 11.3408 |
The energy unit is kilocalories per mole.
Fig. 5A shows that the Right, Wrong, and Ordinary mutations/SNPs are not evenly distributed in different sensitivity groups. Statistical analyses of the Right, Wrong, and Ordinary mutation distributions of the first two sensitivity groups are shown in Fig. 5 B and C. Of 20 total mutations in the sensitive interactions, we found that 90% were Right mutations (Fig. 5B), which was significantly higher than that in insensitive interactions (37.5%) (P < 0.01, Fisher’s test). Fig. 5C, Left shows that the Right mutations in sensitive interactions (90%) were greater than those in SNPs (40%) (P < 0.05, Fisher’s test). This tendency was not observed in the insensitive group (Fig. 5C, Right). These results suggest that mutations in sensitive domains have high enrichment values and strong oncogenic function, whereas mutations in insensitive domains behave more like harmless SNPs.
To explore the mutational influence on wild-type binding energy in different sensitivity groups, we calculated the average relative change of binding energy for each nonordinary mutation in three sensitivity groups (Fig. 5D). The results showed a clearly increasing tendency of binding energy change from sensitive to most insensitive class. This suggests a distinct selection threshold for mutations in domains with different sensitivity. Sensitive parameters provide key regulatory positions in the regulatory network; a slight perturbation caused by a mutation will lead to qualitative changes in network behavior. Therefore, if we assume that each gene in the cell mutates more or less randomly (40), mutations occurring in sensitive domains tend to be selected out and reserved more easily than those in nonsensitive domains during the process of tumor evolution.
It is consistent with the experimental results that most of the involved mutations in sensitive interaction domains have high oncogenic potential. Some mutation sites in sensitive domains, including Bax, D68, and Bcl2 R107, are highly conserved and directly engaged in interactions with other Bcl2 family proteins (41). This suggests that severe disorders may arise in pathway function owing to variations in these sites. For example, Bax lacking the IGDE sequence (66−69 aa, including D68) failed to promote apoptosis in mammalian cells (42). Other work showed that Bax D68R could not dimerize with wild-type Bax in vitro, which is consistent with a previous computational simulation and our calculation for Bax D68V (43). These experimental results support our observation that mutations show their oncogenic function by affecting important protein interactions and thereby disrupting biological processes.
To show the mechanism of mutation-induced alterations in the binding energy, we used wild-type and mutant type D68V Bax dimer as an example for structural analysis. Previous research suggested that D68 and R109 tightly associate in the Bax dimer, and experiments demonstrated that the dimer cannot form with a mutation in either residue (43). Our computational analysis confirmed this result. In wild-type Bax dimer, D68 interacted with R109 by hydrogen bonding with strong electrostatic attraction, whereas this interaction disappeared when the negative charged amino acid aspartic acid was mutated to a hydrophobic amino acid valine (Fig. 6). As a result, the binding energy of Bax D68V dimer increased from wild-type −191 kcal/mol to −154 kcal/mol according to our calculation. The mutation Bax D68V dramatically altered the binding energy of Bax dimer toward the oncogenetic direction. Other structures of protein complexes with mutations highlighted were shown in Fig. S2.
Our calculation of the binding energy change between wild-type and mutant protein interactions provides some insight into the two questions we asked at the beginning of this section. The molecular mechanism responsible for the oncogenic effect of mutations lies in the changed binding kinetics of protein interactions (reflected by ∆∆G), and a consequent right shift of the bifurcation location. By exploring the underlying molecular mechanism, these results further support our hypothesis that interactions corresponding to the sensitive parameters have a crucial role in tumorigenesis, and mutations involved in sensitive interactions are most likely oncogenic.
Discussion
We successfully investigated the causal mechanism underlying the correlation between cancer mutation pattern of protein functional domains and parameter sensitivities of the system bifurcation by performing nonlinear dynamics and molecular-level analyses. We first conducted bifurcation analysis and parameter sensitivity analysis of the mitochondrial apoptotic pathway using a previously established method (15). Then, we separated functional and nonfunctional protein domains and mapped oncogenic mutations to the corresponding domains using structure information and established a correlation between parameter sensitivity of the bifurcation point and oncogenic mutation enrichment of functional domains. We further systematically studied the molecular mechanism underlying this correlation by evaluating the mutational effect on protein interaction kinetics using molecular dynamic simulation. We found that the binding energy changes induced by mutations are major determinants of parameter changes in the regulatory network model; combining the bifurcation point change direction due to parameter perturbation and the binding energy change direction due to mutation disturbance, we identified the oncogenic role of each mutation and found that mutations involved in sensitive interactions (corresponding to sensitive parameters of the bifurcation point) are most likely oncogenic. Finally, a clear vision of mutation-induced oncogenesis emerged after these analyses. The location of the saddle-node bifurcation point represents a MOMP threshold; if this threshold is exceeded, the system switches on apoptosis. Oncogenic mutations may increase this threshold by introducing significant changes in binding kinetics between key regulators. Consequently, the death switch is averted and cells can evade apoptosis. These conditions may enable oncogenesis.
The correlation between domain mutation enrichment spectrum and parameter sensitivity spectrum of the bifurcation point suggests that bifurcation point perturbation is a good measure to predict causal genes and mutations in cancer. Stites et al. (14) proposed a protein-level, perturbation-based analysis of unregulated Ras activation associated with cancer and successfully explained the correlation between Ras mutation properties and the known pathological mechanisms of Ras signaling activation. In our opinion, the selection of different dynamic features should not be considered as an inconsistency; the specific biological function to be embodied should be taken into account. For apoptosis, G1/S transition or any other checkpoint-like processes, such as epithelial-to-mesenchymal transition, induce bifurcation behaviors reflecting the qualitative change of a system. This is a good measure for describing the switch-like cell fate decision from a dynamic point of view. The steady-state level of certain protein species (reflecting protein concentration) is more precise for evaluating signal transduction pathways that require proper concentrations of active kinases for activation (44). Considering these aspects, our analysis also can be applied to any apoptotic response pathways that share similar interaction topology and key regulators, with a bifurcation behavior as the pathway functional property.
The parameters in network analysis include binding/unbinding kinetic rates. Therefore, binding energy, a reflection of the equilibrium constant KD, is not a direct indicator of network parameters. In this work, we assumed that the mutation effect on KD is due primarily to the change of koff (Fig. 5A, Left), considering that kon is described based on diffusion rules and long-range force, which will not be significantly influenced by single mutation, whereas koff is determined primarily by short-range interactions (ionic interactions, hydrogen bonds, and hydrophobic interaction) and is more likely to change due to amino acid variation (45). Our parameter sensitivity analysis also shows that even if the mutation has a significant effect on kon under certain conditions it also would not bring discordance to the original results (Supporting Information). It will be more precise if we can specify the mechanisms of kinetic parameter change. Tiwary et al. (46) recently developed a method to calculate koff of the interaction between a protein and its small molecule ligand. This method is based on metadynamics, which samples as many configurations as possible to rebuild the different binding phases at a large timescale. However, this method is not applicable for protein complex simulations with large sample space, and the accuracy is limited. Therefore, the method of calculating kinetic rates of protein–protein interactions by molecular dynamics simulation is at a very early stage. We hope this method will be fully developed in the future to further facilitate investigations such as the present study.
Our study selected a subset of parameters that represent specific protein interactions, including binding and membrane translocation, for analysis. The structural information of those selected protein interactions has been reported; therefore, we accurately mapped mutations to interaction domains and established the correspondence between mutations and model parameters. Further calculations also depended on structural information for the protein interactions. Some important factors were excluded from this analysis because sufficient information on the molecular mechanisms and interaction structures was not currently available. Nonexon mutations (47), transcription factor mutations (48), and proteasome mutations (49), which often affect protein expression levels and protein degradation processes and play important roles in cancer development, were out of the scope of our study, whereas we find some consistency between sensitivity of these kind of parameters and corresponding oncogenetic aberrations (Supporting Information). An analysis including those factors could provide a more complete picture of oncogenesis.
Materials and Methods
Equations for the Mitochondrial Apoptotic Network.
We compiled a set of ODEs (Supporting Information) to model the mitochondrial apoptotic network induced by extrinsic death signals.
Cancer Mutation, SNP, and Protein Structure Database.
All mutations in our analysis were obtained from COSMIC and literature. The missense SNPs were obtained from dbSNP. Some initial structure of wild-type protein complexes were obtained from Protein Data Bank (www.rcsb.org/pdb/home/home.do), and some were obtained by molecular replacement (Supporting Information).
Molecular Dynamics Simulation and MM/GBSA Method.
All molecular dynamics simulations were performed using Amber software (version 11). Single trajectories were used to calculate the binding energy and residue energy decomposition using the MM/GBSA algorithm in AmberTools12 (Supporting Information).
Models and ODEs
Rules to Translate the Regulatory Network to ODE Models.
Our model is formulated as a set of coupled ODEs,
where A, B, C,… represent reactants such as proteins and enzymes. Their dynamics are governed by the function f. These functions are derived according to the following rules:
-
i)
Binding reactions,
Reaction dynamics follow the law of mass action:
-
ii)
Enzyme catalysis,
Reaction dynamics follow Michaelis–Menten kinetics. We consider the linear stage of the reaction in our model,
where [P] = product concentration, [S] = substrate concentration, and [E] = enzyme concentration.
Mitochondrial Apoptotic Pathway Dynamics in Response to Extrinsic Death Signal.
Eqs. S1–S20 are ODEs that are translated according to the regulatory network of Fig. 1. They describe the dynamics of the mitochondrial apoptotic pathway in response to extrinsic death signal. Caspase8 (C8) is the control parameter. In the extrinsic mitochondrial apoptotic pathway, caspase8 is activated by DISK, which is recruited by dimerized death-receptor ligation (18). For simplicity, we do not include DISK formation here. Our model is relatively simple yet detailed, compared with the previous apoptotic model (50) that focuses on upstream MOMP signaling (regulation of death signaling transduction and downstream MOMP events) and caspase activation regulation. Our model focuses primarily on the MOMP event governed by the Bcl2 protein family. The model pathway we analyze includes a subset of Bcl2 proteins, and proteins with redundant function are compressed into one representative reactant. For example, Bax and Bak are the major effectors of apoptosis whose the oligomerization causes MOMP. Either Bax or Bak alone is sufficient to perform this function; therefore, we use Bax as the representative. Bcl2, Bcl-xL, and Mcl-1 are important antiapoptotic proteins that have prominent roles in cancer progression. All these proteins have Bcl2 homolog C-terminal membrane-anchoring domains and function primarily on the mitochondrial outer membrane. We use Bcl2 to represent the membrane-anchoring antiapoptotic proteins but distinguish the interactions of cytoplasmic antiapoptotic protein A1. We also elaborate the conformational changes and translocation processes of proapoptotic proteins. Cytoplasmic proteins and membrane-binding proteins are distinguished.
[S1] |
[S2] |
[S3] |
[S4] |
[S5] |
[S6] |
[S7] |
[S8] |
[S9] |
[S10] |
[S11] |
[S12] |
[S13] |
[S14] |
[S15] |
[S16] |
[S17] |
[S18] |
[S19] |
[S20] |
Parameters in the ODEs.
The initial conditions are all set to zero, indicating that the cell starts from a nonapoptosis state. The values, descriptions, corresponding interactions, and sources for all parameters are listed in Table S1. Parameter values are chosen with the same scale and are estimated from available experiments and/or previous models. The concentration unit is nanomolar and the time unit is seconds.
Parameter Sensitivity Analysis Using Different Degree of Perturbation.
To systematically detect the parameter sensitivity of our model, we conducted additional sets of parameter sensitivity analysis by extending the parameter perturbation degree from the original 10% to 1%, 5%, 10%, and 15%. It is sufficient to use these local parameter perturbations to simulate the influences of point mutations on the pathway because the mutational effects are often limited. The sensitivity spectrums obtained through 1%, 5%, 10%, and 15% parameter perturbation are presented in Fig. S1A. It shows that the sensitivity rank of each parameter and the sensitivity patterns are approximately the same among the spectrums. k-means clustering analysis on each spectrum shows the same cluster cutoff location between different sensitivity clusters. The number of clusters was set to four (most sensitive, sensitive, insensitive, and most insensitive), and the algorithm was the “kmeans” routine in MATLAB software (The MathWorks). This consistency among the sensitivity spectrums with different parameter perturbation degree indicates our model is robust, and the consistent cluster mode also implies a common intrinsic feature shared by parameters in the same cluster.
To eliminate influences of the preset cluster number on the cluster pattern, we set the cluster number to three, and checked the k-means clustering results. The result shows that the original first two clusters (most sensitive and sensitive) are sorted out again, and with the same cluster cutoff location (Fig. S1B). The original last two clusters (insensitive and most insensitive) are classified into the third cluster. The consistent cluster pattern with different cluster numbers also implies that the processes or interactions depicted by the parameters in the same cluster play an almost equally important role in network regulation.
Cancer-Related Mutation and Domain Definition
Cancer-Related Mutation.
We focus primarily on missense mutations in our network derived from the COSMIC database and the literature (35). COSMIC is a database of somatically acquired mutations found in human cancer. To enhance the cancer mutation annotation filters, COSMIC removed all known SNPs from new mutation uploads. Initially, these SNPs were defined by the 1000 Genomes Project and a panel of noncancer samples from Sanger CGP sequencing (cancer.sanger.ac.uk/cosmic/analyses). However, in the mutations collected from COSMIC, we can still find some SNPs that are defined in the dbSNP database. The dbSNP contains SNPs and other classes of minor genetic variation. To further purify the cancer-related mutations, and because most SNPs in dbSNP are considered to be non-disease-related (51), we excluded all SNPs nominated in the dbSNP database. Mutations in different functional transcripts of one gene that are observed in all cancer types are included. The amino acid positions are denoted by the sequence number of the analyzed structures. The provenance, mutation condition, and mutation count are listed in Table S2.
Definition of Functional and Nonfunctional Domains.
We identified the range of functional domains, including BH1, BH2, BH3, BH4, BH3-binding domains, and transmembrane domains, for all involved Bcl2 family proteins according to the National Center for Biotechnology Information (NCBI) database, Uniprot database, and literatures. Nonfunctional domains were defined as the remaining residues after excluding the functional range (BH1, BH2, BH3, BH4, BH3-binding domains, and functional domains for interaction) for each protein. The region and source of each functional domain are listed in Table S3.
Functional domains for corresponding interactions of each reactant were defined based on the corresponding structures. For BH3 peptides in the structures, the functional domains for interaction were defined as the peptide length, such as Bax BH3 peptide in Bax–Bcl2. For proapoptotic proteins Bcl2 and A1, the functional domains for interaction were defined as the contiguous sequence of corresponding BH3-binding domain according to the NCBI database and corresponding structures. Our calculations are completely based on the constructed structures. This definition of interaction domain unifies the mutation domain mapping and mutational effect on corresponding interactions. The regions of each functional domain for interaction of each corresponding protein are listed in Table S4.
Mutation Enrichment.
Mutation enrichment of each functional domain for interaction was quantified. The mutation number involved in the corresponding interaction functional domain and the value of mutation enrichment are listed in Table S4.
Combined Enrichment of Protein A and Protein B.
Consistent with the definition of the mutation enrichment for a functional domain, we defined the combined enrichment as , where and denote, respectively, the number of mutations and the amino acid length in the functional domain involved in interaction i of protein A; and denote, respectively, the number of mutations and the amino acid length in nonfunctional domains of protein A. Variables with “B” subscripts denote the corresponding meaning of protein B. For interactions with only one involved protein, the combined enrichment is calculated as the half-value of the existing enrichment to remedy the false high value of combined enrichment owing to the absence of the other participant. For interactions with zero mutation enrichment of one involved protein, we also defined the combined enrichment as the half-value of the nonzero enrichment. According to CE definition, zero mutation enrichment (for the interactions of Puma/Noxa with other antiapoptotic proteins, and = 0 will raise its value to a false high level, because these CE calculations actually just make the denominator a much smaller number (items concerning amino acid length rate) but with the numerator unchanged.
Consistency Between Most Sensitive Parameters and Oncogenetic Aberrations.
Fig. 3A shows that most of the parameters in the first cluster, most sensitive, represent the production and degradation of key MOMP regulators such as Bcl2 and Bax. This sensitivity pattern is in agreement with oncogenetic aberrations. For example, chromosomal translocation-induced Bcl-2 overexpression and other amplification cases have been observed in many cancer types (52, 53), so it is not surprising that the parameter describing Bcl-2 production stands out as the most sensitive one. Similarly, the frameshift mutations in a (G)8 mononucleotide tract of the Bax gene, which lead to synthesis of a truncated protein and reduced expression, commonly occur in colon and gastric cancers (54), suggesting that Bax inactivation during tumorigenesis may facilitate tumor progression by enhancing escape from apoptosis. A corresponding high sensitivity of the parameters describing Bax basal generation and degradation with an oncogenic shift of the crucial point was predicted by our analysis, consistent with experimental observation. Perturbations responsible for variations of the parameters reflecting production and degradation are not restricted to the above forms; mutations in the promoter region, in the corresponding transcription factors, and in the proteasome also affect these processes. Because the mechanisms for such parameter variation cannot be simply explained by the kinetics of protein interactions, we filtered out these types of parameters to evaluate mutational influence on protein interaction kinetics using mutation domain mapping and molecular dynamics simulation, through which the parameter sensitivity of the bifurcation point and cancer-related mutations can be linked on a molecular basis.
Molecular Dynamics Simulations and Calculation
Protein Interaction Pairs and Wild-Type Model Building.
Based on the network, we built eight pairs of protein–protein interaction models. The initial coordinates were obtained from the Protein Data Bank (www.rcsb.org). All of the built structures were subjected to optimization and subsequent molecular dynamics simulations.
Bax–Bax dimer.
The full-length structure of activated Bax–Bax dimer (and other Bcl-2 family protein interaction complexes) had not been resolved due to hydrophobic aggregation effects. Czabotar et al. (43) solved a crystal structure of the Bax homodimer, each containing helices α2 through α5 (residues 54−122, PDB ID code 4BDU). The crystal structure reasonably approximated the state when activated Bax formed a BH3-in-groove symmetrical dimer on the mitochondria membrane. Therefore, we choose this structure to represent the Bax–Bax dimer.
Bcl-2−Bax complex.
This complex was constructed by molecular replacement. We substituted the Bcl-2 (residues 3−34 and 92−207) in 1G5M for Bcl-2 in 2XA0 (residues 10−32 and 92−207), in which helix α2 of Bax (residues 57−82) binds to the BH3 binding domain of Bcl-2.
Bcl-2−tBid complex.
This complex was constructed by molecular replacement. We substituted the Bcl-2 (residues 3−34 and 92−207) in 1G5M for Mcl-1 in 2KBW, in which helix α2 of tBid (residues 76−107) binds to the BH3-binding domain of Mcl-1.
Bcl-2−Puma complex.
This complex was constructed by molecular replacement. We substituted the Bcl-2 (residues 3−34 and 92−207) in 1G5M for Bcl-xL in 2M04, in which helix α2 of Puma (residues 68−92) binds to the BH3-binding domain of Bcl-xL.
A1−Noxa complex.
3MQP was considered as the starting structure of the A1−Noxa complex, in which helix α2 of Noxa (residues 19−43) binds to the BH3-binding domain of A1 (residues 2−152).
A1–Bax, A1–tBid, and A1–Puma complexes.
These complexes were constructed by molecular replacement. We substituted Noxa (residues 19−43) in 3MQP for Bax, tBid, and Puma to form A1−Bax, A1−tBid, and A1−Puma complexes.
Model Building with Protein Interaction Pairs and Mutant SNP.
We hypothesized that perturbations in those parameters were caused by changes in binding energy, which result from cancer-related mutations. Therefore, we generate single mutations for each structure modeled above using mutation data obtained from the COSMIC database. We also mutated each structure using SNP as control. Owing to resource limitations, we did not test all collected mutations. We tested five mutations (if available) for each interaction, and two SNPs (if available) for each reactant. The rule for selecting a mutation is that mutations with high count would be selected first, mutations in conserved residues would be selected second, and the remaining mutations were selected randomly. All SNPs were selected randomly. Only one position is expected to mutate every time; therefore, the protein models were mutated directly in PyMol and were subjected to optimization and subsequent molecular dynamics simulations.
Molecular Dynamics Simulations.
All molecular dynamics simulations were performed using Amber software (version 11) with the Amber ff99SB force field in periodic boxes, which extended at least 8 Å away from any given protein atom. The systems were immersed into TIP3P octahedral water and counter ions were added to keep the systems neutral. Long-range electrostatic interactions were calculated using the PME algorithm, and bond length was maintained using the SHAKE algorithm.
We performed three steps to optimize the built structures. First, all atoms were fixed and each system was subjected to 2,000 frames (8 ns) for the steepest descent and 2,000 frames for conjugated gradient minimizations. Then, the protein backbone was frozen and other atoms were allowed to achieve minimization for 2,500 frames in each algorithm. Finally, the whole system was released, and 5,000 frames of each algorithm were run to achieve energy minimization. The final production run continued for 40 ns (Bax–Bax dimer) or 60 ns (other heterodimers) at 310 K in the NPT ensemble with 2-fs integration time, and the coordinates were stored every 4 ps for subsequent analysis.
Binding Energy Calculation.
We simulated five trajectories for each complex. A single trajectory was used to calculate the binding energy and residue energy decomposition using the MM/GBSA algorithm in AmberTools12. For all systems, the last 10 ns (2,500 frames) were subtracted to calculate the binding energy. Other analyses such as rmsd and secondary-structure analysis (DSSP) were performed using AmberTools12, and the visual display was achieved by PyMol.
ΔΔG Calculation.
Two odd trajectories (indicated by unstable rmsd) were discarded out of five trajectories. The remaining three trajectories were used to calculate the average binding energy. The average binding energy was considered as the representative binding energy for a given protein complex. The binding energy values for each trajectory are listed in Table S5. The variance of three binding energies for each complex was calculated.
The of each matched mutant/SNP–wild type was identified as the difference between the average binding energy of mutant/SNP and the average binding energy of wild type. To determine whether a mutant/SNP introduced a significant binding energy change to the wild type, we calculated the difference between the absolute value of and the absolute value of half of the sum of wild-type binding energy variance and mutant/SNP binding energy variance. If the difference was >0, which meant that the binding energy distribution of mutant/SNP type and wild type were separated, introduced by the mutation/SNP was identified as significant, and the mutation/SNP then would be classified as Right or Wrong and colored red or green according to the comparison of binding energy change and parameter perturbation. If the difference was <0, which meant that the binding energy distribution of mutant/SNP type and wild type were not separated, introduced by the mutation/SNP was identified as not significant, and the mutation/SNP then would be classified as Ordinary and colored blue. Detailed results of binding energy calculations are listed in Table S6.
Correspondence Between Parameter Sensitivity and ∆∆G Calculations.
The result of our calculation of the binding energy change between wild type and mutant type and the linkage to parameter perturbation functionality support the opinion that somatic mutations alter cellular regulatory circuit and cause cancer. Therein, KD links the macroscopic kinetic parameters and the microscopic quantity- binding energy. In our dynamic model, the protein–protein association and disassociation processes are strictly discriminated by the parameters kon and koff. Owing to technique restriction, the binding energy calculation can only reflect an equilibrium effect of those two processes, KD. When analyzing the correspondence between ∆∆G and the model parameters, we assume that the mutation effect on KD is primarily due to the change of koff. The reason for this simplification is that kon is generally described based on diffusion rules and long-range force, which will not be significantly influenced by single mutation, whereas koff is determined primarily by short-range interactions (ionic interactions, hydrogen bonds, and hydrophobic interaction) and is more likely to change due to amino acid variation (45).
Our results show that even if the mutation has a significant effect on kon under certain conditions, it also would not bring discordance to our original results. In our parameter sensitivity analysis, the change directions of kon and koff that can shift the bifurcation point are approximately symmetrical, and the sensitivity ranks of these paired parameters are very close; they always belong to the same sensitivity cluster. For example, the change directions of BaxMOM and Bcl2 binding and unbinding rate are opposite as indicated by the different bar colors (Fig. 4A, Left); they are included in the same sensitivity cluster and close to each other. This character can be found in other interactions. If the binding energy calculation indicates that the correct corresponding model-predicted KD change is increased according to the linking equation , either koff increase or kon decrease would satisfy this constraint. Therefore, using the sensitivity of kon instead of koff would not change the results of interaction classification and mutation classification in Fig. 5 or our main conclusions.
Counting Rules for the Mutation Sample Number.
A mutated protein may participate in several interactions and may have different effects on the corresponding interactions, such as Bcl2 A131G and Bax D68V. The number of mutations was counted following the rule that the number of interactions for a given mutation equals the number of samples of the given mutation. For example, Bcl2 A131G is involved in the interactions between Bcl2 and tBidMOM, Bcl2 and BaxMOM; therefore, mutation A131G will be counted twice as two mutation samples. A mutation that is observed in several independent tumor samples is naturally counted as the number of repeated samples. For example, Bcl2 A131V is observed in four independent tumor samples (Table S2), so the number of mutations involved in one interaction is four. Because mutation A131V is involved in two interactions, the total number of Bcl2 A131V mutation is eight.
Odds Ratio Calculation.
We separated all mutations and SNPs into two groups, Right and Non-Right, to quantify the distribution tendency of Right mutations in different categories, such as sensitive interaction and insensitive interaction, and oncogenic mutation and SNP. The number for Right and Non-Right mutations in category 1 is denoted by N1R and N1nR, respectively; the number for Right and Non-Right mutations in category 2 is denoted by N2R and N2nR, respectively.
Category | Right | Non-Right |
1 | N1R | N1nR |
2 | N2R | N2nR |
The odds ratio is calculated as
Structures of Other Mutant Types.
Molecular dynamics simulation and GBSA calculation identify the kinetic changes toward oncogenesis due to mutations. We have provided the mechanism of how one mutation, Bax D68V, disrupts Bax dimer interactions. For other analyzed mutations, they can be classified into two categories, on the interaction interface or not. For the former class, it is easy to understand that the mutations could directly result in changes in binding energy. Examples include Bax mutation S118I in the Bax dimer and Bcl2 mutation R107L in Bcl2–Bax heterodimer (see Fig. S2A for structure details and other examples). For the latter class (Fig. S2B), how a residue far away from the interaction interface contributes to protein interactions may not be immediately clear. However, increasing evidence indicates that even residues located 25 Å from the binding interface can strongly influence binding equilibrium (55), and this influence can propagate cooperatively through pathways of energetic coupling. Those residues may reside in a less stable region, which attains greater stability of binding.
Acknowledgments
We thank J. Chen, L. H. Lai, F. J. Chen, X. M. Ma, D. Q. Yu, Y. S. Cao, H. Y. Fang, Z. W. Xie, X. Zhao, B. Shao, J. Y. Xi, H. L. Wang, and F. T. Li for helpful discussions. This work was partially supported by National Natural Science Foundation of China Grants 11074009 and 81273436 and Chinese Ministry of Science and Technology Grants 2012AA02A702 and 2012AA020301.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1502126112/-/DCSupplemental.
References
- 1.Kandoth C, et al. Mutational landscape and significance across 12 major cancer types. Nature. 2013;502(7471):333–339. doi: 10.1038/nature12634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dahl F, et al. Multigene amplification and massively parallel sequencing for cancer mutation discovery. Proc Natl Acad Sci USA. 2007;104(22):9387–9392. doi: 10.1073/pnas.0702165104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Alexandrov LB, et al. Australian Pancreatic Cancer Genome Initiative; ICGC Breast Cancer Consortium; ICGC MMML-Seq Consortium; ICGC PedBrain Signatures of mutational processes in human cancer. Nature. 2013;500(7463):415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. 2009;458(7239):719–724. doi: 10.1038/nature07943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Jones S, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321(5897):1801–1806. doi: 10.1126/science.1164368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ding L, et al. Somatic mutations affect key pathways in lung adenocarcinoma. Nature. 2008;455(7216):1069–1075. doi: 10.1038/nature07423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cancer Genome Atlas Research Network Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008;455(7216):1061–1068. doi: 10.1038/nature07385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bild AH, Potti A, Nevins JR. Linking oncogenic pathways with therapeutic opportunities. Nat Rev Cancer. 2006;6(9):735–741. doi: 10.1038/nrc1976. [DOI] [PubMed] [Google Scholar]
- 9.Pe’er D, Hacohen N. Principles and strategies for developing network models in cancer. Cell. 2011;144(6):864–873. doi: 10.1016/j.cell.2011.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kreeger PK, Lauffenburger DA. Cancer systems biology: A network modeling perspective. Carcinogenesis. 2010;31(1):2–8. doi: 10.1093/carcin/bgp261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ferrell JE, Jr, Tsai TY, Yang Q. Modeling the cell cycle: Why do certain circuits oscillate? Cell. 2011;144(6):874–885. doi: 10.1016/j.cell.2011.03.006. [DOI] [PubMed] [Google Scholar]
- 12.Zhang XP, Liu F, Cheng Z, Wang W. Cell fate decision mediated by p53 pulses. Proc Natl Acad Sci USA. 2009;106(30):12245–12250. doi: 10.1073/pnas.0813088106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ma W, Trusina A, El-Samad H, Lim WA, Tang C. Defining network topologies that can achieve biochemical adaptation. Cell. 2009;138(4):760–773. doi: 10.1016/j.cell.2009.06.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Stites EC, Trampont PC, Ma Z, Ravichandran KS. Network analysis of oncogenic Ras activation in cancer. Science. 2007;318(5849):463–467. doi: 10.1126/science.1144642. [DOI] [PubMed] [Google Scholar]
- 15.Chen J, Yue H, Ouyang Q. Correlation between oncogenic mutations and parameter sensitivity of the apoptosis pathway model. PLOS Comput Biol. 2014;10(1):e1003451. doi: 10.1371/journal.pcbi.1003451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Tait SW, Green DR. Mitochondria and cell death: Outer membrane permeabilization and beyond. Nat Rev Mol Cell Biol. 2010;11(9):621–632. doi: 10.1038/nrm2952. [DOI] [PubMed] [Google Scholar]
- 17.Spencer SL, Sorger PK. Measuring and modeling apoptosis in single cells. Cell. 2011;144(6):926–939. doi: 10.1016/j.cell.2011.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chipuk JE, Bouchier-Hayes L, Green DR. Mitochondrial outer membrane permeabilization during apoptosis: The innocent bystander scenario. Cell Death Differ. 2006;13(8):1396–1402. doi: 10.1038/sj.cdd.4401963. [DOI] [PubMed] [Google Scholar]
- 19.Llambi F, Green DR. Apoptosis and oncogenesis: Give and take in the BCL-2 family. Curr Opin Genet Dev. 2011;21(1):12–20. doi: 10.1016/j.gde.2010.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Youle RJ, Strasser A. The BCL-2 protein family: Opposing activities that mediate cell death. Nat Rev Mol Cell Biol. 2008;9(1):47–59. doi: 10.1038/nrm2308. [DOI] [PubMed] [Google Scholar]
- 21.Li H, Zhu H, Xu CJ, Yuan J. Cleavage of BID by caspase 8 mediates the mitochondrial damage in the Fas pathway of apoptosis. Cell. 1998;94(4):491–501. doi: 10.1016/s0092-8674(00)81590-1. [DOI] [PubMed] [Google Scholar]
- 22.Lovell JF, et al. Membrane binding by tBid initiates an ordered series of events culminating in membrane permeabilization by Bax. Cell. 2008;135(6):1074–1084. doi: 10.1016/j.cell.2008.11.010. [DOI] [PubMed] [Google Scholar]
- 23.Kim H, et al. Stepwise activation of BAX and BAK by tBID, BIM, and PUMA initiates mitochondrial apoptosis. Mol Cell. 2009;36(3):487–499. doi: 10.1016/j.molcel.2009.09.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hsu Y-T, Wolter KG, Youle RJ. Cytosol-to-membrane redistribution of Bax and Bcl-X(L) during apoptosis. Proc Natl Acad Sci USA. 1997;94(8):3668–3672. doi: 10.1073/pnas.94.8.3668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Tan C, et al. Auto-activation of the apoptosis protein Bax increases mitochondrial membrane permeability and is inhibited by Bcl-2. J Biol Chem. 2006;281(21):14764–14775. doi: 10.1074/jbc.M602374200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Happo L, Strasser A, Cory S. BH3-only proteins in apoptosis at a glance. J Cell Sci. 2012;125(Pt 5):1081–1087. doi: 10.1242/jcs.090514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chen C, et al. Modeling of the role of a Bax-activation switch in the mitochondrial apoptosis decision. Biophys J. 2007;92(12):4304–4315. doi: 10.1529/biophysj.106.099606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Leber B, Lin J, Andrews DW. Embedded together: The life and death consequences of interaction of the Bcl-2 family with membranes. Apoptosis. 2007;12(5):897–911. doi: 10.1007/s10495-007-0746-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Aldridge BB, Burke JM, Lauffenburger DA, Sorger PK. Physicochemical modelling of cell signalling pathways. Nat Cell Biol. 2006;8(11):1195–1203. doi: 10.1038/ncb1497. [DOI] [PubMed] [Google Scholar]
- 30.Lu M, Jolly MK, Levine H, Onuchic JN, Ben-Jacob E. MicroRNA-based regulation of epithelial-hybrid-mesenchymal fate determination. Proc Natl Acad Sci USA. 2013;110(45):18144–18149. doi: 10.1073/pnas.1318192110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ferrell JE, Jr, et al. Simple, realistic models of complex biological processes: Positive feedback and bistability in a cell fate switch and a cell cycle oscillator. FEBS Lett. 2009;583(24):3999–4005. doi: 10.1016/j.febslet.2009.10.068. [DOI] [PubMed] [Google Scholar]
- 32.Lee YS, Liu OZ, Hwang HS, Knollmann BC, Sobie EA. Parameter sensitivity analysis of stochastic models provides insights into cardiac calcium sparks. Biophys J. 2013;104(5):1142–1150. doi: 10.1016/j.bpj.2012.12.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Yue H, et al. Insights into the behaviour of systems biology models from dynamic sensitivity and identifiability analysis: A case study of an NF-kappaB signalling pathway. Mol Biosyst. 2006;2(12):640–649. doi: 10.1039/b609442b. [DOI] [PubMed] [Google Scholar]
- 34.Forbes SA, et al. COSMIC: Exploring the world’s knowledge of somatic mutations in human cancer. Nucleic Acids Res. 2015;43(Database Issue):D805–D811. doi: 10.1093/nar/gku1075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yamamoto H, Sawai H, Perucho M. Frameshift somatic mutations in gastrointestinal cancer of the microsatellite mutator phenotype. Cancer Res. 1997;57(19):4420–4426. [PubMed] [Google Scholar]
- 36.Sherry ST, et al. dbSNP: The NCBI database of genetic variation. Nucleic Acids Res. 2001;29(1):308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Billen LP, Shamas-Din A, Andrews DW. Bid: A Bax-like BH3 protein. Oncogene. 2008;27(Suppl 1):S93–S104. doi: 10.1038/onc.2009.47. [DOI] [PubMed] [Google Scholar]
- 38.Schinzel A, et al. Conformational control of Bax localization and apoptotic activity by Pro168. J Cell Biol. 2004;164(7):1021–1032. doi: 10.1083/jcb.200309013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gil J, Yamamoto H, Zapata JM, Reed JC, Perucho M. Impairment of the proapoptotic activity of Bax by missense mutations found in gastrointestinal cancers. Cancer Res. 1999;59(9):2034–2037. [PubMed] [Google Scholar]
- 40.Vogelstein B, et al. Cancer genome landscapes. Science. 2013;339(6127):1546–1558. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ku B, Liang C, Jung JU, Oh BH. Evidence that inhibition of BAX activation by BCL-2 involves its tight and preferential interaction with the BH3 domain of BAX. Cell Res. 2011;21(4):627–641. doi: 10.1038/cr.2010.149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zha H, et al. Structure-function comparisons of the proapoptotic protein Bax in yeast and mammalian cells. Mol Cell Biol. 1996;16(11):6494–6508. doi: 10.1128/mcb.16.11.6494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Czabotar PE, et al. Bax crystal structures reveal how BH3 domains activate Bax and nucleate its oligomerization to induce apoptosis. Cell. 2013;152(3):519–531. doi: 10.1016/j.cell.2012.12.031. [DOI] [PubMed] [Google Scholar]
- 44.Dhillon AS, Hagan S, Rath O, Kolch W. MAP kinase signalling pathways in cancer. Oncogene. 2007;26(22):3279–3290. doi: 10.1038/sj.onc.1210421. [DOI] [PubMed] [Google Scholar]
- 45.Schreiber G. Kinetic studies of protein-protein interactions. Curr Opin Struct Biol. 2002;12(1):41–47. doi: 10.1016/s0959-440x(02)00287-7. [DOI] [PubMed] [Google Scholar]
- 46.Tiwary P, Limongelli V, Salvalaglio M, Parrinello M. Kinetics of protein-ligand unbinding: Predicting pathways, rates, and rate-limiting steps. Proc Natl Acad Sci USA. 2015;112(5):E386–E391. doi: 10.1073/pnas.1424461112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Fredriksson NJ, Ny L, Nilsson JA, Larsson E. Systematic analysis of noncoding somatic mutations and gene expression alterations across 14 tumor types. Nat Genet. 2014;46(12):1258–1263. doi: 10.1038/ng.3141. [DOI] [PubMed] [Google Scholar]
- 48.O’Farrell TJ, Ghosh P, Dobashi N, Sasaki CY, Longo DL. Comparison of the effect of mutant and wild-type p53 on global gene expression. Cancer Res. 2004;64(22):8199–8207. doi: 10.1158/0008-5472.CAN-03-3639. [DOI] [PubMed] [Google Scholar]
- 49.Tank EM, True HL. Disease-associated mutant ubiquitin causes proteasomal impairment and enhances the toxicity of protein aggregates. PLoS Genet. 2009;5(2):e1000382. doi: 10.1371/journal.pgen.1000382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Albeck JG, Burke JM, Spencer SL, Lauffenburger DA, Sorger PK. Modeling a snap-action, variable-delay switch controlling extrinsic cell death. PLoS Biol. 2008;6(12):2831–2852. doi: 10.1371/journal.pbio.0060299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.The 1000 Genomes Project Consortium A map of human genome variation from population-scale sequencing. Nature. 2010;467(7319):1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ikegaki N, Katsumata M, Minna J, Tsujimoto Y. Expression of bcl-2 in small cell lung carcinoma cells. Cancer Res. 1994;54(1):6–8. [PubMed] [Google Scholar]
- 53.Monni O, et al. BCL2 overexpression associated with chromosomal amplification in diffuse large B-cell lymphoma. Blood. 1997;90(3):1168–1174. [PubMed] [Google Scholar]
- 54.Rampino N, et al. Somatic frameshift mutations in the BAX gene in colon cancers of the microsatellite mutator phenotype. Science. 1997;275(5302):967–969. doi: 10.1126/science.275.5302.967. [DOI] [PubMed] [Google Scholar]
- 55.Gohlke H, Kiel C, Case DA. Insights into protein-protein binding by binding free energy calculation and free energy decomposition for the Ras-Raf and Ras-RalGDS complexes. J Mol Biol. 2003;330(4):891–913. doi: 10.1016/s0022-2836(03)00610-7. [DOI] [PubMed] [Google Scholar]