Abstract
The cyclin-dependent kinase 4 (CDK4)-cyclin D1 complex plays a crucial role in the transition from the G1 phase to S phase of the cell cycle. Among the CDKs, CDK4 is one of the genes most frequently affected by somatic genetic variations that are associated with various forms of cancer. Thus, because the abnormal function of the CDK4-cyclin D1 protein complex might play a vital role in causing cancer, CDK4 can be considered a genetically validated therapeutic target. In this study, we used a systematic, integrated computational approach to identify deleterious nsSNPs and predict their effects on protein-protein (CDK4-cyclin D1) and protein-ligand (CDK4-flavopiridol) interactions. This analysis resulted in the identification of possible inhibitors of mutant CDK4 proteins that bind the conformations induced by deleterious nsSNPs. Using computational prediction methods, we identified five nsSNPs as highly deleterious: R24C, Y180H, A205T, R210P, and R246C. From molecular docking and molecular dynamic studies, we observed that these deleterious nsSNPs affected CDK4-cyclin D1 and CDK4-flavopiridol interactions. Furthermore, in a virtual screening approach, the drug 5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDROXYPHENYL) _4H_CHROMEN_ 4_ONE displayed good binding affinity for proteins with the mutations R24C or R246C, the drug diosmin displayed good binding affinity for the protein with the mutation Y180H, and the drug rutin displayed good binding affinity for proteins with the mutations A205T and R210P. Overall, this computational investigation of the CDK4 gene highlights the link between genetic variation and biological phenomena in human cancer and aids in the discovery of molecularly targeted therapies for personalized treatment.
Introduction
Cyclin-dependent kinases (CDKs) drive cell cycle progression, control transcriptional regulation processes and maintain cell proliferation. Irregular entry into the cell cycle and uncontrolled cell proliferation are hallmarks of cancer [1]. Hence, it is not surprising that the dysregulation of CDKs might play a vital role in tumorigenesis. The CDK4-cyclin D1-p16 retinoblastoma protein (RB1) pathway (CDK4 pathway) promotes the G1-S cell cycle transition and is commonly dysregulated in most cancers. The CDK4-cyclin D1 complex acts as an essential regulator in the G1-S phase transition of the cell cycle process. The CDKs and cyclins that are most frequently affected by somatic nucleotide alterations in various cancers are CDK4 and cyclinD1. Thus, abnormality of the CDK4/cyclin D1 pathway plays a major role in oncogenesis; hence, CDK4 can be genetically tested as a valid molecular therapeutic target. In 1996, Bradley et al. observed the inhibition of CDK4 by the flavonoid compound flavopiridol in breast cancer cell lines [2]. Flavopiridol was the first drug identified as a potent tumour suppressor in several lung and breast cancer cell lines [3]. Various studies have highlighted that flavopiridol has the capability to prevent the proliferation of a broad range of cell lines, leukaemias, lymphomas and human tumours [4, 5]. Several clinical trials have been completed up to the phase II level with various regimens. To date, several inhibitors with varying selectivity for members of the CDK family have been identified. CDK4 inhibitors are considered the most attractive therapeutic targets because of their ability to control tumour growth with minimal toxicity.
In more than 90% of melanoma cases, genomic variations associated with CDK4 pathway activation are present, as determined in human and mouse models of melanoma [6, 7]. A common type of genetic variation in the human genome is the single nucleotide polymorphism (SNP) [8]. SNPs are the naturally occurring nucleotide variability in the human genome and play a significant role in the phenotypic variability that differentiates individuals within a given species. SNPs can occur in both coding and noncoding regions of the genome and generate polymorphic variation in expressed amino acid sequences that affects protein structure and function. In the coding region of the genome, SNPs are mainly classified into two types: synonymous and non-synonymous (nsSNPs). Nonsynonymous SNPs can change the physicochemical properties of a protein residue, thereby disturbing protein stability and dynamics, affecting normal interactions with other molecules, and hindering stable complex formation with binding partners [9–12].
Protein-protein interacting interfaces are usually referred to as binding hot spots of proteins. These regions are charged, structurally conserved and highly polar and are surrounded by hydrophobic residues, which are the residues that are mostly involved in the binding [13]. Deleterious genetic variation may affect the electrostatic nature of protein surfaces and introduce harmful effects, such as changes in stability or folding, altering binding partner specificity and affinity and changing protein function [13]. The identification of harmful nsSNPs helps uncover plausible molecular mechanisms underlying disease phenotypes and helps in the development of suitable inhibitors to target the mutant proteins. Individual nsSNPs are causation for significant changes in drug disposition and efficacy. Because nsSNPs can occur in drug binding proteins, they can affect treatment response or produce adverse effects.
In this study, we employed efficient computational prediction methods to identify deleterious nsSNPs in the CDK4 gene. We then evaluated the effects of deleterious CDK4 variants on CDK4-Cyclin D1 protein interactions and drug binding. Protein structure-based virtual screening analysis was performed to identify suitable inhibitors of mutant CDK4 proteins. Furthermore, atomic level studies were performed by molecular dynamics (MD) simulations to better understand the effects of deleterious variants on CDK4-Cyclin D1 complex formation and to check the binding efficacies of selected inhibitors for the mutant proteins.
Material and Methods
Datasets
The nsSNPs in the CDK4 gene were extracted from the dbSNP [14] and UniProt [15] databases. The amino acid length of CDK4 and cyclinD1 protein sequence is 306 and 271 residues respectively. For structure analysis, the CDK4-Cyclin D1 protein complex crystal structure was obtained from the PDB database (PDB ID: 2W96) [16]. For virtual screening, flavopiridol-like compounds were retrieved from the DrugBank database [17].
Deleterious nsSNP prediction tools
Different computational methods have been previously developed for the prediction of phenotypic effects of nsSNPs. In this study, three variation tolerance prediction methods, SIFT [18], PolyPhen 2 [19], and I-Mutant 3.0 [20], were used following the same protocol, in which nsSNPs are first labelled with amino acid properties according to the changes they may have on protein structure or function. The pathogenicity of the nsSNPs is decided based on the resultant vectors calculated using the individual tools. The prediction by each method is generally based on evolutionary information and a combination of protein structural and/or functional parameters and multiple sequence alignment-derived information. SIFT calculates a tolerance index score for a particular residue substitution. This algorithm first generates multiple sequence alignments with a large set of homologous amino acid sequences and predicts a tolerance index for each residue, ranging from zero to one. PolyPhen 2 predicts the possible effects of amino acid substitutions on protein structure and function using straightforward physical and evolutionary comparative considerations. PolyPhen 2 searches for multiple sequence alignments of homologous protein sequences, 3D protein structures and residue contact information from secondary structure databases. Based on this information, PolyPhen 2 calculates PSIC scores for each of the two variants and computes the differences between the PSIC scores. If the predicted score is higher for a particular substituted amino acid, that substitution is likely to have a higher functional effect on protein structure/function. The machine learning method I-Mutant 3.0 utilises support vector machines (SVMs) for classification. It is built on unsupervised classification using support vector machines and trained on the most comprehensive dataset derived from ProTherm for the prediction of protein stability changes resulting from nsSNPs. The energy difference between the native and the variant protein was calculated based on the Gibbs free energy value and the predicted free energy change is denoted by a DDG value.
Docking and virtual screening
AutoDock is one of the most widely accepted docking software programs available and requires a set of preparation steps for general screening [21]. Included in this process are the preparations of acceptable ligands and a receptor macromolecule, calculation of maps and creation of folders for each ligand.AutoDockVina1.1.2 is a new program for molecular docking and virtual screening and approximately two orders of magnitude faster than AutoDock4 [22]. AutoDockVina 1.1.2 was used for all dockings in this study. In general, the docking parameters for AutoDockVina were kept to their default values. In vina, the size of the docking grid was 63 Å×47 Å×40 Å, which encompassed all the native and mutant protein structure.For high throughput virtual screening, the VcPpt tool was used. VcPpt is an independently developed extension for AutoDockVina and is a software package for flexible protein-ligand docking built by the Biochem Lab solution. This package can perform high-throughput docking for a given compound with Vina, and the output is PDB files of ligands ranked based on their binding energies and positions.
Molecular dynamic simulations
Molecular dynamics simulations of native and mutant protein-protein and protein-ligand complexes were performed using Gromacs 5.0 software [23]. The force field used for the simulation was Gromos96 43a1 [24, 25]. The structures were solvated using a simple point charge (SPC) water molecules in a box with a dimension of 52.0 Å size. The number of water molecules added into the boxes differed and it depends on the concerned system. For protein-protein complexes43918, 43931, 43926, 43930, 43926 and 43932 water molecules were added to respective native, R24C, Y180H, A205T, R210P and R246C simulation boxes and for protein-ligand complexes43915, 43919, 43888, 43893, 51929 and 43918 water molecules were added to respective native, R24C, Y180H, A205T, R210P and R246C simulation boxes. At physiological pH, protein-protein and protein-ligand complexes were negatively charged, and counter ions (Na+) were added to make the simulation system neutral. Then, the system energy was minimised by utilising the steepest descent method. After minimisation, three different steps were employed in the MD simulation: namely, heating, equilibration, and production. An NPT ensemble (constant number of particles, pressure, and temperature) was performed for 50000 ps at 300 K followed by an NVT ensemble (constant number of particles, volume and temperature) that was performed at 300 K [26]. Then, the production of molecular dynamics simulation trajectories was performed at 300 K for 50 ns. The Linear Constraint Solver (LINCS) algorithm was used to constrain the covalent bonds [27]. The Particle Mesh Ewald (PME) method was used to calculate electrostatic interactions [28]. The cutoff radii for van der Waals and Coulomb interactions were fixed at 14.0 and 10.0 Å, respectively.
The trajectory potentials obtained from each MD simulation were thoroughly investigated by using GROMACS utilities [29]. The utilities g_rms, g_rmsf, g_hbond, g_mindist and g_sas were used to plot graphs. The g_rms program calculates the root mean square deviation (RMSD) for specified atoms in a protein molecule with respect to a reference structure by fitting the structure to least square level with the reference structure. The g_rmsf program computes the root mean square fluctuation (RMSF) (i.e. standard deviation) of C-alpha atomic position of a protein molecule after fitting to a reference structural frame. The g_hbond program calculates number of hydrogen bonds formed between two molecules based on simple geometric criteria. This program analyzes the possibilities for hydrogen bond formation between all possible acceptors (A) and donors (D). The most accepted geometrical distance for a hydrogen bond formation between molecules is <2.5 Å and between hydrogen and the acceptor and a donor-hydrogen-acceptor angle of between 90° and 180°. The program g_mindist calculates the minimum distance between the atoms of two different molecules during simulation time. It also calculates the number of contacts made between two molecules within a certain radius rmax. The g_sas computes total solvent accessible surface area, hydrophilic and hydrophobic interaction of protein molecules. To check whether the systems follow constant NVT or NPT ensembles, differences in total, potential, and kinetic energies, pressure and temperature were calculated as a function of simulation time throughout the simulation period. The numbers of H-bond formation and the minimum distance between protein-protein and protein-ligand complexes were calculated to explain the stability of the complexes. SASA analysis was performed to identify the solvent traceable area of a molecule, and all graphs were generated using the XM grace tool [30].
Principle compound analysis
Essential Dynamics (ED) method was used to calculate the eigenvectors and eigenvalues, and their projection along the first two principal components [31]. ED or the principle component analysis (PCA) is a method that reduces the complexity of the data and extracts the different modes involved in the motion of the protein molecule [31]. In the process of ED calculation, a covariance matrix was generated from the trajectories after the elimination of translational and rotational movements. By diagonalizing the covariance matrix, a set of eigenvectors and eigenvalues were generated. For each eigenvector, one corresponding eigenvalue produced explains the energetic contribution of each component to the motion. The protein molecular segments that are responsible for the most significant collective motions can be acknowledged through PCA. Backbone C-alpha bonds trajectories were obtained and analyzed by using g_covar and g_anaeig of GROMACS inbuilt tool.
Results
Prediction of deleterious nsSNPs using the SIFT, Polyphen2 and I-Mutant 3.0 programs
Out of 20 nsSNPs submitted to SIFT, 11 nsSNPs (corresponding to R24C, R24H, Y180H, A205T, R209C, R210P, R246C, P251H, R255H, V260E, and H296Y) were predicted to be deleterious, with SIFT scores <0.05 (Table 1). Of these, sixnsSNPs (R24C, R24H, Y180H, A205T, R209C, and R210P) were considered to be highly deleterious, with SIFT scores of 0.00 (Table 1). PolyPhen 2 predicted ninensSNPs that would deleteriously affect protein structure and function, and the remaining 12 nsSNPswere characterised as benign. Of these nine deleterious nsSNPs, one nsSNP (Y180H) was predicted to be highly deleterious, with a SIFT score of 1.000 (Table 1). SIFT and PolyPhen 2 has been shown to perform well in predicting deleterious nsSNPs compared to other computational tools [32]. To improve the accuracy of deleterious nsSNP prediction, along with SIFT and PolyPhen 2, we used I-Mutant 3 to predict the deleteriousness of nsSNPs that affect protein stability. All of the nsSNPs analysed in SIFT and PolyPhen 2.0 were also submitted to the I-Mutant 3.0 server. I-Mutant 3.0 identified 15 nsSNPs as deleterious. By comparing the predictions by all three tools, we identified five nsSNPs (R24C, Y180H, A205T, R210P, and R246C) to be highly deleterious (Fig 1) and subjected these to further structural and functional analysis.
Table 1. List of nsSNPs present in CDK4 gene and their predicted SIFT, PolyPhen2 and I-Mutant3 scores.
SNP ID | Amino acid variation | SIFT | PolyPhen2 | I-Mutant3 |
---|---|---|---|---|
rs11547328 | R24C | 0.00 | 0.737 | Disease |
rs104894340 | R24H | 0.00 | 0.324 | Disease |
rs199609381 | P26H | 0.15 | 0.015 | Disease |
rs376139539 | P40H | 0.05 | 0.174 | Disease |
rs144890720 | N41S | 0.15 | 0.002 | Disease |
rs200213586 | A65V | 0.25 | 0.603 | Disease |
rs3211612 | R82Q | 0.09 | 0.990 | Disease |
rs34386532 | R122H | 0.55 | 0.001 | Disease |
rs150281463 | V137L | 0.18 | 0.001 | Disease |
rs151103937 | V176I | 0.06 | 0.956 | Neutral |
rs375372343 | Y180H | 0.00 | 1.000 | Disease |
rs368013594 | A205T | 0.00 | 0.987 | Disease |
rs140644696 | R209C | 0.00 | 0.258 | Disease |
rs373619077 | R210P | 0.00 | 0.995 | Disease |
rs370258992 | R246C | 0.01 | 0.995 | Disease |
rs372604524 | R246H | 0.09 | 0.016 | Neutral |
rs143670820 | P251H | 0.01 | 0.755 | Neutral |
rs144657355 | R255H | 0.01 | 0.004 | Neutral |
rs201617914 | S259L | 0.18 | 0.004 | Neutral |
rs200215596 | V260E | 0.01 | 0.356 | Neutral |
rs2227954 | H296Y | 0.03 | 0.155 | Disease |
Highly deleterious by SIFT, Probably and possibly damaging by PolyPhen2 and disease/neutral by I-Mutant3 prediction were highlighted in bold
CDK4-Flavopiridol binding analysis in the occurrence of deleterious nsSNPs in CDK4 protein
Flavopiridol has been suggested to inhibit the in vitro kinase activity of the CDK4 protein [2]. Amino acid variation may affect the inhibitory action of flavopiridol on the CDK4 protein. Numbers of studies have been performed to find an ATP-competitive inhibitor that binds specifically to the CDK4 protein [33–37]. Additionally, further studies have attempted to optimise inhibitor binding and specificity for CDK4 using structure-based design methods [38–40]. In this study, we examined the binding affinity of flavopiridol with native and mutant CDK4 proteins using AutodockVina, a computational docking program. Before performing the docking analysis, ATP binding sites of the native CDK4 protein were identified. The amino acid residues present in the ATP binding clefts of CDK4 are ILE12, VAL20, ALA33, VAL77, PHE93, GLU94, HIS95, VAL96, GLN98, ASP99, THR102, GLU144, LEU147, ALA157 and ASP158.Computational docking analysis also indicated the inhibitory action of flavopiridol with CDK4, as observed in in vitro studies.e., the inhibitor flavopiridol binds exactly at the ATP binding site of the native CDK4 protein (S1 Fig). However, flavopiridol binds residues outside of the ATP-binding cleft in mutant CDK4 structures (S1 Fig). A change in the binding residues will indeed affect the complementarities between the mutant proteins and flavopiridol. Non-covalent interactions and shape complementarity are important factors for the maintenance of protein-ligand affinity. Non-covalent bonds, such as van der Waals contacts, electrostatic forces and hydrogen bonds, are the primary forces involved in protein-ligand interactions. Calculating the interaction energies of non-covalent bonds is vital to understanding the binding ability of the ligand molecule. The number of hydrogen bonds arising between the protein and ligand was computed using AutodockVina. The binding energies between the CDK4 (native and mutant) proteins and the inhibitor molecule flavopiridol were calculated (Table 2) to be -8.8 kcal/mol, -7.7 kcal/mol, -7.1 kcal/mol, -7.3 kcal/mol, -7.4 kcal/mol and -7.1 kcal/mol for the native, R24C, Y180H, A205T, R210P and R246C complexes, respectively. The binding energy of the native complex displayed the best interaction and complete inhibition by the flavopiridol compound. This docking analysis gives a “theoretical quantitative” assessment of the binding efficiencies of CDK4 native and mutant proteins with the cancer drug flavopiridol.
Table 2. Binding energies of CDK4 native and mutant proteins with drug flavopiridol.
Native & Mutant proteins | Binding Energy(Kcal/mol) | Number of H-Bonds | Residues forming H-Bonds with Ligand | Interacting Residues |
---|---|---|---|---|
Native | -8.8 | 3 | VAL14,LYS142 | ILE12, GLY13,VAL14, GLY15, VAL20,LYS35, ASP99, LYS142 |
R24C | -7.7 | 3 | VAL14,LYS142 | ILE12, GLY13,VAL14, GLY15, VAL20,LYS35, ASP99, LYS142 |
Y180H | -7.1 | 1 | ARG5 | ARG5,GLU92,LEU143,LYS147 |
A205T | -7.3 | 3 | VAL137,ARG163 | GLU56,GLU59,ARG163,TYR191,VAL137 |
R210P | -7.4 | 3 | VAL137, ARG163 | GLU56, LEU59,VAL137, ARG163,TRY191 |
R246C | -7.1 | 3 | VAL137, ARG163 | LEU23,ARG26,HIS68,ARG126, PHE130,ALA133 |
CDK4 mutant protein structures based virtual screening and docking studies
Nonsynonymous SNPs play a vital role in the diverse responses to therapeutic treatment in human populations, influencing efficacy and toxicity by affecting the drug-binding pocket of target proteins. Virtual screening is the fastest and most accurate method for identifying novel drug-like compounds on the basis of target structures [41, 42].It has an advantage over any de novo design method because retrieved hits can be easily obtained for biological testing. Docking is a computational method used to predict binding affinities between a target protein and a ligand. Docking follows a search pattern to identify appropriate confirmations and a score that measures the affinity of various conformations [43, 44]. For virtual screening, we retrieved 19 similar compounds, such as flavopiridol, from the DrugBank database (Table 3). Subsequently, docking analysis was performed between mutant CDK4 proteins (R24C, Y180H, A205T, R210P, and R246C) and the screened compounds (Table 4). Among the 19 compounds docked, R24C and R246C mutant proteins displayed good binding to the drug 5_7_DIHYDROXY_2_ (3_4_5_TRIHYDROXYPHENYL) _4H_CHROMEN_4_ONE,with a binding energy of -8.3 kcal/mol and -8.2 kcal/mol, forming four hydrogen bonds with R24C and R246C mutant proteins, respectively.This compound interacts with the ATP binding residues of both R24C (ILE12, VAL20, HIS95, VAL96, GLU144, and LEU147) and R246C (ILE12, HIS95, VAL96, GLU144 and LEU147) mutant protein structures (Fig 2A and 2E).Diosmin displayed a good affinity for the mutant protein structure Y180H and obtained, with a high binding energy of -7.7 kcal/mol. Diosmin formed three hydrogen bonds with Y180H and interacted with the ATP binding residue ALA33 (Fig 2B). Rutin displayed good binding with the mutant structures A205T and R210P and obtained the highest binding energy of -8.6 and -8.3 kcal/mol, respectively. Rutin formed four hydrogen bonds with A205T and six hydrogen bonds with R210P and interacted with the ATP binding residue ALA33 of the A205T mutant protein. Notably, no interaction was observed with the ATP binding residues of R210P (Fig 2C and 2D).
Table 3. List of virtually selected compounds similar to flavopiridol from DrugBank database.
Accession number | Name | Chemical formula | Class | Groups | Original uses |
---|---|---|---|---|---|
DB07024 | 2_(3_4_DIHYDROXYPHENYL)_8_(1_1_DIOXIDOISOTHIAZOLIDIN_2_YL)_3_HYDROXY_6_METHYL_4H_CHROMEN_4_ONE | C19H17NO7S | Flavonoids | Experimental | - |
DB07453 | 2_PHENYL_4H_BENZO[H]CHROMEN_4_ONE | C19H12O2 | Flavonoids | Experimental | - |
DB07776 | 2_PHENYL_4H_CHROMEN_4_ONE | C15H10O2 | Flavonoids | Experimental | - |
DB08230 | 3_7_3'_4'_TETRAHYDROXYFLAVONE | C15H10O7 | Flavonoids | Experimental | - |
DB08230 | 5_7_DIHYDROXY_2_(3_4_5_TRIHYDROXYPHENYL)_4H_CHROMEN_4_ONE | C15H10O7 | Flavonoids | Experimental | - |
DB06927 | 5_HYDROXY_2_(4_HYDROXYPHENYL)_1_BENZOFURAN_7_YL]ACETONITRILE | C16H11NO3 | Benzofurans | Experimental | - |
DB02205 | 6_(1_1_Dimethylallyl)_2_(1_Hydroxy_1_Methylethyl)_2_3_Dihydro_7h_Furo[3_2_G]Chromen_7_One | C19H22O4 | Coumadin’s and Derivatives | Experimental | - |
DB01838 | 6_4'_Dihydroxy_3_Methyl_3'_5'_Dibromoflavone | C16H10Br2O4 | Flavonoids | Experimental | - |
DB02593 | 7_8_Dihydroxy_1_Methoxy_3_Methyl_10_Oxo_4_10_Dihydro_1h_3h_Pyrano[4_3_B]Chromene_9_Carboxylic Acid | C15H14O8 | Benzopyrans | Experimental | - |
DB06732 | Beta_Naphthoflavone | C19H12O2 | Flavonoids | Experimental | Putative chemo preventive agent |
DB06726 | Bufuralol | C16H23NO2 | Benzofurans | Experimental, investigational | Non-selective adrenoceptor blocking agent |
DB04886 | Calanolide A | C22H26O5 | Coumarins and Derivatives | Investigational | Non-nucleoside reverse transcriptase inhibitor (NNRTI) |
DB08995 | Diosmin | C28H32O15 | Not Available | approved | Oral phlebotropic drug used to venous disease |
DB01547 | Drotebanol | C19H27NO4 | Morphinans | experimental, illicit | - |
DB01148 | Flavoxate | C24H25NO4 | Flavonoids | Approved | Used in various urinary syndromes and as an antispasmodic |
DB01852 | Kaempherol | C15H10O6 | Flavonoids | Experimental | - |
DB02375 | Myricetin | C15H10O8 | Flavonoids | Experimental | - |
DB04216 | Quercetin | C15H10O7 | Flavonoids | Experimental | Used as an antioxidant |
DB01698 | Rutin | C27H30O16 | Flavonoids | Experimental | Used to decrease capillary fragility |
Table 4. Virtual compounds with highest binding energy with CDK4 mutant proteins.
CDK4 Mutant Proteins | Binding Drug | Binding Energy (Kcal/mol) | Number of H-Bonds | Residues Forming H-Bonds with Ligand | Interacting Residues |
---|---|---|---|---|---|
R24C | 5_7_DIHYDROXY_2_(3_4_5_TRIHYDROXYPHENYL)_4H_CHROMEN_4_ONE | -8.3 | 4 | ALA16,HIS95,VAL96,LYS142 | ILE12, GLY15,ALA16, VAL20, LYS35,HIS95,VAL96,LYS142, GLU144,ASN145, LEU147 |
Y180H | Diosmin | -7.7 | 3 | ALA33, GLN291 | ALA33,PRO69,ARG122,ARG126, GLN291 |
A205T | Rutin | -8.6 | 4 | ALA33,HIS68,ARG126 | ARG26,ALA33,PHE66, HIS68,ARG126 |
R210P | Rutin | -8.3 | 6 | HIS27,ILE87, LYS149 | ARG5,HIS27,ILE87, ARG73, LEU94,LYS149 |
R246C | 5_7_DIHYDROXY_2_(3_4_5_TRIHYDROXYPHENYL)_4H_CHROMEN_4_ONE | -8.2 | 4 | ALA16,HIS95,VAL96,LYS142 | ILE12,GLY15,ALA16,LYS35,HIS95,VAL96,LYS142,GLU144,ASN145, LEU147 |
Molecular dynamics simulation studies of the CDK4-Cyclin D1 complex
To examine the structural and functional consequences of deleterious nsSNPs in the CDK4-Cyclin D1 complex, we analyzed the 50ns molecular dynamics simulation trajectories of native (CDK4-Cyclin D1) and mutant (R24C, Y180H, A205T, R210P and R246C) complexes.
Equilibration and Stability checking of native and mutant CDK4-Cyclin D1 complexes
The root mean square deviations (RMSDs) of protein backbone atoms of native and mutant CDK4-Cyclin D1 complexes were analysed. For all of the simulations, the energy minimised protein complexes were taken as a starting reference. To obtain all-atom levels in detail, MD simulations were repeated twice for native and mutant complexes for a time period of 50 ns. No significant changes were observed from the repeated MD simulation trajectories of native and mutant CDK4-Cyclin D1 complex structures. The simulated CDK4-Cyclin D1 complex backbone atoms were aligned with RMSDs of less than 3.5 A°.
After 35000 ps, all of the native and mutant protein complexes attained an equilibrated state. Native complexes obtained an RMSD ~0.3 nm in the equilibrated state, but the mutant complex Y180H displayed a different pattern with an RMSD value of ~0.5 to ~0.6 nm until the end of the simulation period. Mutant complexes R24C, A205T, R210P and R246C obtained an RMSD of~0.4 to ~0.5 nm in the equilibrated state (S2 Fig). Although minor deviations were observed during simulation periods, the stable trajectory formations in the equilibrium state provide a suitable basis for further analysis.
Calculation of amino acid fluctuations in native and mutant CDK4 proteins
The C-alpha root means square fluctuation (RMSF) values for each amino acid of CDK4 proteins were calculated. Graphs were plotted, for native and mutant proteins, from the values from the last 10ns of trajectories to obtain the amino acid mobility of each protein (S3 Fig). We found that all of the mutant proteins, R24C, Y180H, A205T, R210P, and R246C, had different fluctuation patterns from the native CDK4 protein. Mutation induced changes in the flexibility of the proteins, indicating that mutations may affect the function of the proteins by altering CDK4-Cyclin-D1 interactions.
Hydrogen bonding and minimum distance between CDK4 and Cyclin D1 in native and mutant complexes
Hydrogen bonds arise when a hydrogen atom covalently bound to a molecule interacts with an electronegative atom within the same molecule or in a different molecule. As they are responsible for maintaining the stability of protein structure, determining hydrogen bonds significantly reveals the stability of a protein or the stability between the proteins [45]. Therefore, assessing the number of hydrogen bonds between protein-protein interacting regions is essential to gain knowledge regarding the stability of the molecules. Fig 3 depicts the number of hydrogen bonds formed between CDK4-Cyclin D1 proteins in native and mutant states. Native complexes of CDK4-Cyclin D1protein exhibit an average of ~ 45 to ~80 hydrogen bonds throughout the last 10ns simulation period. The maximum number of hydrogen bonds formed between CDK4-Cyclin D1 in the mutant complexes R24C, Y180H, A205T, R210P and R246C were ~ 40 to ~65, ~ 40 to ~70, ~ 45 to ~75, ~ 40 to ~70, and ~ 40 to ~70, respectively, in the last 10 ns of the simulation period. The mutant protein complexes R24C, Y180H, A205T, R210P, and R246C had fewer hydrogen bonds than did the native protein complex. The decrease in the number of hydrogen bonds formed between mutant CDK4-Cyclin D1complexes indicates the deleterious effects of amino acid substitution and their ability to reduce the number of hydrogen bonds formed between CDK4 and Cyclin D1 proteins.
Furthermore, the minimum distance between CDK4 and the Cyclin D1 protein was computed for both the native and mutant complexes (Fig 4). The minimum distance between native CDK4 and Cyclin D1 was observed to be ~ 0.035 to ~0.065 nm in the last 10 ns of the simulation period. However, for the mutant complexes R24C, Y180H, A205T, R210P and R246C, increases in the distance were observed to be ~ 0.05 to ~0.07 nm, ~ 0.045 to ~0.07, ~ 0.045 to ~0.067, ~ 0.05 to ~ 0.07 and ~ 0.045 to ~0.067, respectively. From the minimum distance analysis, we infer that the distance between mutant CDK4-Cyclin D1 proteins is greater than in the native complex.
Effects of nsSNPs in the SASA of native and mutant CDK4 proteins
The protein surface that is traced by a solvent molecule is referred to as the solvent-accessible surface area (SASA). The solvation effect plays a significant role in maintaining protein stability and folding. Likewise, the solvation effect accompanies protein-protein interaction processes and protein structure modifications. This solvation effect can be measured using explicit solvent models; MD simulations use a sphere of water molecules [46]. SASA was calculated for both the native and mutant CDK4 proteins. From S4 Fig, it was observed that the native CDK4 protein had an SASA of ~18.5 nm2 to ~ 21.5 nm2 in the last 10ns simulation period but that the mutant structures R24C, Y180H, A205T, R210P, and R246Chad different SASAs, of ~19 nm2to ~21.5 nm2, ~19 nm2 to ~21.5 nm2, ~19 nm2 to ~22 nm2, ~18.5 nm2 to ~22 nm2and ~18.5 nm2 to ~21.5 nm2, respectively. Compared with the native protein, all five mutant proteins had different SASAs. The differences in the SASAs of the mutant proteins indicated that there might be a shift in the amino acid residues from the buried region to the accessible area or vice versa, and this may be the reason for the reduced affinity between the mutant CDK4 and Cyclin D1 proteins.
Principal component analysis
PCA was performed on all the six trajectories of CDK4-Cyclin D1 native and mutant complexes to examine the overall motion of the protein molecules. Diagonal covariance matrices were built over the C-alpha atoms of the protein for each trajectory. The eigenvalues obtained through the diagonalization of the covariance matrix elucidates the atomic contribution on the motion. Similarly, the eigenvectors explain a collective motion accomplished by the particles. The spectrum of the corresponding eigenvalues indicated the level of fluctuation and dynamic behaviour of protein molecule in the system and confined within the first two eigenvectors. The trace values for native, R24C, Y180H, A205T, R210P and R246C structure of CDK4-Cyclin D1 protein complex were found to be~22.216 nm2, ~26.786 nm2, ~32.264 nm2, ~36.569 nm2, ~27.987 nm2, and ~26.018 nm2 respectively (S5 Fig). All the mutant complexes showed high values suggesting an overall escalation in the flexibility than the native complex during the collective motion of the protein. The mutant protein complexes covered the larger region of conformational space than the native complex. From these projections, it was observed that clusters of mutants were well defined and was less stable compared to the native protein complex.
Molecular dynamics simulation studies of native and mutant CDK4 structures in complex with flavopiridol and virtually screened compounds
To verify whether the results obtained by the molecular docking analysis were robust or fortuitous, we performed molecular dynamics simulations with native and mutant (R24C, Y180H, A205T, R210P, and R246C) protein-ligand complexes CDK4-flavopiridol, R24C-5_7_DIHYDROXY_ 2_ (3_4_5_TRIHYDROXYPHENYL) _4H_CHROMEN_ 4_ONE, Y180H-Diosmin, A205T-Rutin, R210P-Rutin, and R246C-5_7_DIHYDROXY_2_ (3_4_5_TRIHYDROXYPHENYL) _4H_CHROMEN_ 4_ONE.The atomic RMSD of the backbone atoms of the protein-ligand complexes obtained from the trajectories and the initial structures was monitored during the simulation(S6 Fig). A sharp rise was observed in the first ~ 20000ps and the function remained stable for the rest of the simulation. The system reached a constant temperature. The total energy was fluctuated around an average energy with the system stabilising at a temperature of 310K and one atmospheric pressure. The RMSD of the molecular dynamics simulation was stable after ~2000ps at the equilibrium. The native and mutant protein-ligand trajectories attained stable RMSD values around ~30000ps, indicating a high binding affinity between the protein-ligand complexes, further enhancing the credibility of the docking results.
Furthermore, to provide insight into the binding affinity of the protein and ligand, hydrogen bond contributions were analysed in the well equilibrated simulation period of the last 10 ns. The total number of hydrogen bonds between CDK4-flavopiridol and mutant CDK4-virtually screened compounds were analysed (Fig 5). CDK4-flavopiridol had one to four hydrogen bonds in the last 10ns simulation period. The numbers of hydrogen bonds formed between R24C-5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDRO XYPHENYL) _4H_CHROMEN_ 4_ONE, Y180H-Diosmin, A205T-Rutin, R210P-Rutin and R246C-5_7_DIHYDROXY_2_ (3_4_5_TRIHYDROXYPHENYL) _4H_CHROMEN_ 4_ONEmutant protein-ligand complexes were 1 to 4, 1 to 4, 1 to 5, 1 to 6 and 1 to 4, respectively. The number of hydrogen bonds between the mutant proteins R24C, Y180H, and R246C and their respective inhibitors is similar to the CDK4-flavopiridol complex. The mutant proteins A205T and R210P had a similar number of hydrogen bonds with the inhibitor rutin during the simulation period. These results provide evidence that these selected compounds have high capacity to function as strong inhibitors for various CDK4 protein mutants.
The minimum distance between CDK4-flavopiridol and mutant CDK4-virtually screened compounds was analysed (Fig 6). The minimum distance between CDK4-flavopiridol was ~2.5 nm in the last 10ns simulation period. The minimum distance between R24C-5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDRO XYPHENYL) _4H_CHROMEN_ 4_ONE, Y180H-Diosmin, A205T-Rutin, R210P-Rutin, and R246C-5_7_DIHYDROXY_2_ (3_4_5_ TRIHYDROXYPHENYL) _4H_CHROMEN_ 4_ONE mutant protein-ligand complexes was observed as ~1.55 to ~2.25, ~1.55, ~1.25,~2.5 and ~2.25, respectively. The distance between the mutant proteins and their respective inhibitor is less than for the native CDK4-flavopiridol complex, indicating that the screened inhibitor has a good binding affinity for the mutant proteins.
Discussion
The endpoint of deleterious nsSNP identification should be protein structural and functional analysis. Proper understanding of protein conformational changes elucidates the mechanisms underlying disease phenotypes and aids in the identification of suitable drugs for structurally modified proteins. Deleterious nsSNPs cause protein structural and functional changes and are associated with diverse responses in drug efficacy in the human population. Therefore, these nsSNPs could occur in drug-binding target proteins and influence treatment outcomes. Rapid growth in the identification of SNPs makes it more difficult for experimental biologists to evaluate the biological significance of each SNP. However, computational, theoretical approaches can be used to identify and investigate the effects of deleterious polymorphisms that affect protein structure and function before undertaking further validation by experimental methods. Computational prediction methods such as SIFT, PolyPhen2 and I-Mutant 3.0 can screen for deleterious nsSNPs potentially involved in disease conditions. Drug targets for various diseases that are influenced by nsSNPs can be investigated using in silico methods, which has significant implications for specific target identification. Each computational pathogenicity prediction method follows a different strategy to estimate the effects of nsSNPs; hence, the prediction results are sometimes dissimilar. However, the positive predictions that overlap in the prediction methods should provide the greatest reliability to behave similarly. The variation in the identification of deleterious nsSNPs might be because of the differences in the adopted methodologies or the training datasets.
An in-depth knowledge of protein function could reveal the molecular mechanisms of the process that causes the disease condition. Deleterious nsSNPs in protein binding hot spots or the core regions of proteins can evidently disrupt protein interactions. The mapping of nsSNPs onto three-dimensional protein structures and analysing these changes at the structural level will help to find the exact point where they alter interactions with proteins. Three-dimensional models of mutant proteins are constructed by substituting the deleterious nsSNPs in the appropriate three-dimensional protein structures. Three-dimensional models are the simplest way to detect the types of harmful effects that deleterious nsSNPs can have on protein-protein and protein-ligand interactions. To obtain in-depth knowledge of protein structure and understand the adverse changes that these deleterious nsSNPs cause in CDK4 proteins, we extended our study by analysing the native and proposed mutant (protein-protein and protein-drug) structures at the atomic level by using a molecular dynamics simulation approach. Basic parameters such as RMSD, RMSF, hydrogen bond numbers, minimum distances, and SASA were analysed from the simulation trajectory values. Molecular stability and flexibility changes were observed from RMSD and RMSF. Stability is the fundamental property enhancing biomolecular function, activity, and regulation. Structural mutations affected buried residues in the protein core, causing changes in amino acid size, amino acid charge and hydrogen bond numbers [47]. Hydrogen bonds are the most important factor that creates a stable contact between a protein and its binding partner. Furthermore, number of studies have reported that protein interaction sites often have electrostatic complementarity with the charge distribution of the interacting partner [48–51]. Taken together, deleterious nsSNPs may affect the electrostatic charge distribution and alter the binding surfaces, causing the formation of more or fewer hydrogen bonds. Changes in protein stability, flexibility, hydrogen bonding, minimum distance between molecules and SASA have been shown to cause the loss of thermodynamic stability, as well as aberrant folding and aggregation of the protein [47]. Studies have indicated that many disease-related mutations lie in solvent-accessible sites, suggesting that the analysis of these mutations might also shed light on the mechanisms underlying disease conditions [52–57].
We performed a systematic computational analysis of theCDK4 gene to identify potentially deleterious nsSNPs and their structural and functional significance, through molecular docking and molecular dynamics approaches. To predict the significant effect of nsSNPs in theCDK4 gene, we applied three widely used computational methods, SIFT, PloyPhen2 and I-Mutant3, and each tool predicted 11, 9 and 15 nsSNPs as deleterious, respectively. Comparing the predictions made by all three methods, five amino acid variants (R24C, Y180H, A205T, R210P and R246C) of the CDK4protein were identified as highly deleterious, and all five variants were investigated structurally and functionally.For the structural analysis, we modelled the mutant structure with Spdbv software, after which energy minimization was performed for the native and mutant CDK4-cyclin D1 complexes using a steepest descent force field to check and repair the geometry of the modelled complex. Next, docking analysis was performed between the native and mutant structures of CDK4 with flavopiridol. Several factors enhancing the protein-ligand interactions were analysed. Binding energies between the mutant protein and ligand revealed a lower binding capability of the mutant structures to the drug flavopiridol. Notably, flavopiridol interacting residues differed in the mutant proteins compared with native CDK4 protein. Although the mutant structures R24C, A205T, R210P, and R246C obtained a similar number of hydrogen bonds with flavopiridol, the ligand failed to bind to the ATP-binding site of the mutant proteins. These results indicate that the structural changes in the CDK4protein occurred because of the substitution of deleterious amino acids. Another functionally significant property of the protein was explored through the docking of flavopiridol with CDK4, demonstrating the effect of fluctuation in protein-ligand complex formation. A change in the pattern of flexibility of the functional residues might alter the functional region and activity of mutant proteins. A decrease in overall flexibility and an increase in rigidity due to mutations may affect the binding properties of proteins. To identify a suitable inhibitor for mutant proteins, structure-based virtual screening and docking analysis were performed. A total of 19 compounds that were structurally similar to flavopiridol were retrieved from the DrugBank database. All of the compounds were individually docked with theR24C, Y180H, A205T, R210P and R246C mutant structures of the CDK4 protein. The inhibitor 5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDRO XYPHENYL) _4H_CHROMEN_ 4_ONE displayed good binding affinity and inhibition at the ATP-binding site of R24C and R246C.Diosmin displayed good binding affinity and interaction with the ATP-binding residue of Y180H. Rutin displayed good binding affinity and interaction with the A205T and R210P mutant proteins.
Furthermore, we performed 50ns MD simulation analysis on the native and mutant (R24C, Y180H, A205T, R210P, and R246C) protein-protein and protein-ligand complexes. MD simulation analysis provides in-depth knowledge regarding the protein-protein and protein-ligand interactions at the atomic level. Five fundamental parameters (RMSD, RMSF, hydrogen bond numbers, minimum distances and SASA) were examined during the last 10 ns of the CDK4-Cyclin D1 simulation trajectories for the native and mutant complexes. Molecular stability and flexibility changes were observed through RMSD and RMSF analysis. Stability is the primary property that enhances biomolecular function, activity, and regulation. The results obtained from the CDK4-Cyclin D1complex stability analysis concluded that all five mutant complexes had different RMSD values from the native complex. A larger deviation increases the stability of molecules and vice versa. A higher stability increases the rigidity of the protein, and a lower stability increases the flexibility of the protein. Henceforth, from the stability analyses of native and mutant CDK4-CyclinD1protein complexes, it was observed that the mutant complexes had different stabilities because of the substitution of deleterious amino acids. From the RMSF analyses, we observed changes in flexibility for all five mutant complexes. Based on the results of the RMSD and RMSF analyses, we confirmed that the substitution of amino acids adversely affected the stability and flexibility of CDK4-CyclinD1 mutant complexes. In addition to the different electrostatic forces, the hydrogen bonds across the protein-protein interacting interface act as the main contributor in maintaining the stability of the protein. Furthermore, the presence of deleterious polymorphisms might change the hydrogen bond formation between the molecules. Consequently, in all five mutant complexes less number of hydrogen bonds was observed between the CDK4-Cyclin D1 proteins. A decrease in the number of hydrogen bonds revealed that the binding stability of the CDK4-Cyclin D1 mutant complexes might be affected. The minimum distance between CDK4 and CyclinD1 in the protein-protein complexes was analysed for both the native and mutant CDK4-Cyclin D1 complexes. The distance between the CDK4 and Cyclin-D1 proteins increased in all five mutant complexes compared with the native complex. An increase in the distance might reduce the binding affinity between the CDK4-CyclinD1 mutant complexes. Furthermore, in the SASA analysis, with respect to the native protein, different areas of solvent accessible surface were observed in all five mutant CDK4 proteins. The differences in the accessible areas in mutant proteins might alter the probability of an interaction between CDK4 in Cyclin D1 protein complexes. Subsequent SASA analysis revealed that the presence of deleterious polymorphisms in the CDK4 protein might change the hydrophilic and hydrophobic areas of the mutant CDK4 proteins.
For CDK4-flavopiridol, R24C-5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDRO XYPHENYL) _4H_CHROMEN_ 4_ONE, Y180H-Diosmin, A205T-Rutin, R210P-Rutin and R246C-5_7_DIHYDROXY_2_ (3_4_ 5_ TRIHYDROXYPHENYL) _4H_CHROMEN_ 4_ONE, docked protein-ligand complexes were subjected to MD simulation for 50 ns to analyse RMSD, H-bonds, and minimum distances. In the molecular stability change analysis, all five protein-ligand complexes averaged lower RMSD values than those of the native protein-ligand complex. Hydrogen bond interactions between the protein and ligand served as the main contributor in maintaining the stable contact between the molecules. The substitution of deleterious nsSNPs might change the electrostatic charge distribution in proteins and affect the normal protein-ligand interactions. However, the virtually selected inhibitors exhibited good binding affinity for the mutant proteins, having the potential to maintain a stable number of hydrogen bonds during the simulation period. In all five mutant complexes (R24C, Y180H, A205T, R210P, and R246C), a similar number of H-bonds was observed between the proteins and their respective virtually selected ligand molecules. The distances between the mutant proteins and their respective virtually screened potential ligands were consistently lower than that measured for the native CDK4-flavopiridol complex. A reduction in this distance may increase the affinity of the mutant CDK4 proteins for their respective virtually screened compounds.
Conclusion
Alterations in protein structure are mainly caused by deleterious nsSNPs in the nucleotide sequence. The occurrence of deleterious nsSNPs in the genome is relatively rare but has a significant effect on protein structure and function. A number of computational methods have been designed to predict the functional effects of nsSNPs whether a particular nsSNP is a driver of carcinogenesis. In this study, by following an integrated computational approach, we identified and investigated the effects of harmful polymorphisms at both the structural and functional level for the CDK4 gene. Furthermore, this study highlights successful drug repositioning in mutant CDK4 proteins in the event of conformational changes attributable to deleterious nsSNPs. This information on drug repositioning can accelerate the development of cancer drugs, and this study of new cancer drug targets can potentially be used for future target identification. The identification of new drug targets and new uses for existing drugs hold promise for the future use of computational methods in cancer research. Overall, the methodology adopted in this study is extremely useful in the drug discovery process and will improve drug efficacy and safety profiles in cancer treatment.
Supporting Information
Acknowledgments
This work was supported by the Research Grants Council of Hong Kong [212111], [212613] and Faculty Research Grant of Hong Kong Baptist University [FRG/14-15/063] and [FRG2/13-14/056]. The authors thank VIT and Galgotias University.
Data Availability
All relevant data are within the paper and its Supporting Information file.
Funding Statement
This work was supported by the Research Grants Council of Hong Kong [212111], [212613] and Faculty Research Grant of Hong Kong Baptist University [FRG2/12-13/061] and [FRG2/13-14/056].
References
- 1. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144:646–74. 10.1016/j.cell.2011.02.013 [DOI] [PubMed] [Google Scholar]
- 2. Carlson BA, Dubay MM, Sausville EA, Brizuela L, Worland PJ (1996) Flavopiridol induces G1 arrest with inhibition of cyclin-D1ependent kinase (CDK) 2 and CDK4 in human breast carcinoma cells. Cancer Res 56(13):2973–8. [PubMed] [Google Scholar]
- 3. Kaur G, Stetler SM, Sebers S, Worland P, Sedlacek H, Myers C, et al. (1992) Growth inhibition with reversible cell cycle arrest of carcinoma cells by flavone L86-8275. J. Natl Cancer Inst 84:1736 [DOI] [PubMed] [Google Scholar]
- 4. Drees M, Dengler WA, Roth T, Labonte H, Mayo J, Malspeis L, et al. (1997) Flavopiridol (L86-8275): selective antitumor activity in vitro and activity in vivo for prostate carcinoma cells. Clin Cancer Res 3:273 [PubMed] [Google Scholar]
- 5. Parker BW, Kaur G, Nieves NW, Taimi M, Kohlhagen G, Shimizu T, et al. (1998) Early induction of apoptosis in hematopoietic cell lines after exposure to flavopiridol. Blood 91:458 [PubMed] [Google Scholar]
- 6. Walker GJ, Flores JF, Glendening JM, Lin AH, Markl ID,Fountain JW (1998) Virtually 100% of melanoma cell lines harbor alterations at the DNA level within CDKN2A, CDKN2B, or one of their downstream targets. Genes Chromosomes Cancer 22:157–63. [DOI] [PubMed] [Google Scholar]
- 7. Curtin JA, Fridlyand J, Kageshita T, Patel HN, Busam KJ, Kutzner H, et al. (2005) Distinct sets of genetic alterations in melanoma. N Engl J Med 353:2135–47. [DOI] [PubMed] [Google Scholar]
- 8. Huang RS, Ratain MJ (2009) Pharmacogenetics and pharmacogenomics of anticancer agents. CA Cancer J Clin 59(1):42–55. 10.3322/caac.20002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Collins FS, Brooks LD, Chakravarti A (1998) A DNA polymorphism discovery resource for research on human genetic variation. Genome Res 8:1229–123. [DOI] [PubMed] [Google Scholar]
- 10. Yue P, Moult J (2006) Identification and analysis of deleterious human SNPs. J Mol Bio 356:1263–1274. [DOI] [PubMed] [Google Scholar]
- 11. Stitziel NO, Binkowski TA, Tseng YY, Kasif S, Liang J (2004) TopoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association. Nucleic Acids Res 32:52–522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Uzun A, Leslin CM, Abyzov A, Ilyin V (2007) Structure SNP (StSNP): a web server for mapping and modeling nsSNPs on protein structures with linkage to metabolic pathways. Nucleic Acids Res 35:384–392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Ma B, Elkayam T, Wolfson H, Nussinov R (2003) Protein-protein interactions: structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proc Natl AcadSci 100:5772–5777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, et al. (2001) dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 29:308–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Yip YL, Lachenal N, Pillet V, Veuthey AL (2007) Retrieving mutation specific information for human proteins in UniProt/Swiss Prot Knowledgebase, Journal of Bioinformatics and Computational Biology 5:1215–31. [DOI] [PubMed] [Google Scholar]
- 16. Day PJ, Cleasby A, Tickle IJ, O'Reilly M, Coyle JE, Holding FP, at al. (2009) Crystal structure of human CDK4 in complex with a D-type cyclin. Proc Natl AcadSci USA 106(11):4166–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, et al. (2011) DrugBank 3.0: a comprehensive resource for 'omics' research on drugs. Nucleic Acids Res; 39 (Database issue):D1035–41. 10.1093/nar/gkq1126 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Ng PC, Henikoff S (2003) SIFT: predicting amino acid changes that affect protein function. Nucl Acids Res 31:3812–3814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. (2010) A method and server for predicting damaging missense mutations. Nat Methods 7: 248–249. 10.1038/nmeth0410-248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Capriotti E, Fariselli P, Rossi I, Casadio R (2008) A three state prediction of single point mutations on protein stability changes. BMC Bioinformatics 9:2–S6. 10.1186/1471-2105-9-S2-S6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK, Goodsell DS, et al. (2009) AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J ComputChem 30(16):2785–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Trott O, Olson AJ (2010) AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J ComputChem 31(2):455–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Hess B, Kutzner C, Van der Spoel D, Lindahl E (2008) GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. J. Chem. Theory Comput 4:435–447. [DOI] [PubMed] [Google Scholar]
- 24. Gunsteren WFV, Billeter SR, Eising AA, Hunenberger PH, Kruger PK, Mark AE, et al. Biomolecular simulation: The GROMOS96 manual and user guide. Verlag der Fachvereine, Zurich: 1996; 1–1024. [Google Scholar]
- 25. Oostenbrink C, Villa A, Mark AE, Gunsteren WFV. (2004) Abiomolecular force field based on the free enthalpy of hydration and solvation: the GROMOS force field parameter sets 53A5 and 53A6. J. Comput. Chem 25:1656–1676. [DOI] [PubMed] [Google Scholar]
- 26. Berendsen HJC, Postma JPM, Gunsteren WFV, Hermans J (1981) Interaction models for water in relation to protein hydration. Intermolecular Forces, Pullman B., ed. Dordrecht, the Netherlands: D. Riedel Publishing Company; 331–342. [Google Scholar]
- 27. Hess B, Bekker H, Berendsen HJC, Fraaije JGEM (1997) LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem 18:1463–1472. [Google Scholar]
- 28. Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG (1995) A smooth particle meshes Ewald method. J. Chem. Phys 103:8577–8593. [Google Scholar]
- 29. Van der spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJ (2005) GROMACS: fast flexible, and free. J. Comput. Chem 26:1710–1718. [DOI] [PubMed] [Google Scholar]
- 30. Turner PJ (2005) XMGRACE, Version 5.1.19 Central for costal and Land-Margin Research; Oregon Graduate Institute of Science and Technology, Beaverton, ORE, USA. [Google Scholar]
- 31. Amadei A, Linssen ABM, Berendsen HJC (1993) Essential dynamics of proteins. Proteins: Structure, Function and Genetics 17(4):412–425. [DOI] [PubMed] [Google Scholar]
- 32. Thusberg J, Vihinen M (2009) Pathogenic or not? and if so, then how? Studying the effects of missense mutations using bioinformatics methods. Hum. Mutat 30:703–714. 10.1002/humu.20938 [DOI] [PubMed] [Google Scholar]
- 33. Beattie JF, Breault GA, Ellston RPA, Green S, Jewsbury PJ, Midgley CJ, et al. (2003) Cyclin-D1ependent kinase 4 inhibitors as a treatment for cancer. Part 1: identification and optimisation of substituted 4, 6-Bis anilino pyrimidines. Bioorg. Med. Chem. Lett 13:2955–2960. [DOI] [PubMed] [Google Scholar]
- 34. Carini DJ, Kaltenbach RF, Liu J, Benfield PA, Boylan J, Boisclair M, et al. (2001) Identification of selective inhibitors of cyclin dependent kinase 4. Bioorg. Med. Chem. Lett 11:2209–2211. [DOI] [PubMed] [Google Scholar]
- 35. Honma T, Hayashi K, Aoyama T, Hashimoto N, Machida T, Fukasawa K, et al. (2001) Structure-based generation of a new class of potent Cdk4 inhibitors: new de novo design strategy and library design. J. Med. Chem 044:4615–4627. [DOI] [PubMed] [Google Scholar]
- 36. Honma T, Yoshizumi T, Hashimoto N, Hayashi K, Kawanishi N, Fukasawa K, et al. (2001) A novel approach for the development of selective Cdk4 inhibitors: library design based on locations of Cdk4 specific amino acid residues. J. Med. Chem 44:4628–4640. [DOI] [PubMed] [Google Scholar]
- 37. Soni R, Muller L, Furet P, Schoepfer J, Stephan C, Zumstein-Mecker S, et al. (2000) Inhibition of cyclin-D1ependent kinase 4 (Cdk4) by fascaplysin, a marine natural product. Biochem. Biophys. Res. Commun 275:877–884. [DOI] [PubMed] [Google Scholar]
- 38. Barvian M, Boschelli DH, Crossrow J, Dobrusin E, Fattaey A, Fritsch A, et al. (2000) Pyrido [2,3-d]pyrimidin-7-one inhibitors of cyclin-Dependent kinases. J. Med. Chem 43:4606–4616. [DOI] [PubMed] [Google Scholar]
- 39. Kubo A, Kazuhiko N, Varma RK, Conrad NK, Cheng JQ, Lee WC, et al. (1999) The p16 status of tumor cell lines identifies small molecule inhibitors specific for cyclin-D1ependent kinase 4. Clin. Cancer Res 5:4279–4286. [PubMed] [Google Scholar]
- 40. Nugiel DA, Vidwans A, Etzkorn AM, Rossi KA, Benfield PA, Burton CR, et al. (2002) Synthesis and evaluation of indenopyrazoles as cyclin-D1dependent kinase inhibitors 2. Probing the indeno ring substituent pattern. J. Med. Chem 45:5224–5232. [DOI] [PubMed] [Google Scholar]
- 41. Ripphausen P, Nisius B, Peltason L, Bajorath J. vadis Quo (2010) Virtual screening? A comprehensive survey of prospective applications. J Med Chem 53(24):8461–7. 10.1021/jm101020z [DOI] [PubMed] [Google Scholar]
- 42. Clark DE (2008) what has virtual screening ever done for drug discovery?. Expert Opin Drug Discov 3(8):841–51. 10.1517/17460441.3.8.841 [DOI] [PubMed] [Google Scholar]
- 43. Cheng T, Li Q, Zhou Z, Wang Y, Bryant SH (2012) Structure based virtual screening for drug discovery: a problem centric review. AAPS J 14(1):133–141. 10.1208/s12248-012-9322-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Meng XY, Zhang HX, Mezei M, Cui M (2011) Molecular docking: a powerful approach for structure based drug discovery. CurrComput Aided Drug Des 7(2):146–157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Gerlt JA, Kreevoy MM, Cleland WW, Frey PA (1997) Understanding enzymic catalysis: the importance of short, strong hydrogen bonds. ChemBiol 4:259–267. [DOI] [PubMed] [Google Scholar]
- 46. Becker O, Mackerell A, Roux B, Watanabe M. (2001) Computer simulation of biomolecular system: Theoritical and experimental, Computational Biochemistry & Biophysics, Marcel Dekker, New York. [Google Scholar]
- 47. Chen J, Shen B (2009) Computational Analysis of Amino Acid Mutation: A Proteome Wide Perspective. Current Proteomics 6:228–234. [Google Scholar]
- 48. Wilson C, Mau T, Weisgraber KH, Wardell MR, Mahley RW, Agard DA (1994) Salt bridge relay triggers defective LDL receptor binding by a mutant apolipoprotein. Structure 2:713–8. [DOI] [PubMed] [Google Scholar]
- 49. Cherfils J, Duquerroy S, Janin J (1991) Protein-protein recognition analyzed by docking simulations. Proteins 11:271–80. [DOI] [PubMed] [Google Scholar]
- 50. Novotny J, Sharp K (1992) Electrostatic fields in antibodies and antibody/antigen complexes. ProgBiophysMolBiol 58:203–24. [DOI] [PubMed] [Google Scholar]
- 51. Creighton TE (1993) Proteins: Structures and Molecular Properties, Freeman, San Francisco. [Google Scholar]
- 52. Steward RE, MacArthur MW, Laskowski RA, Thornton JM (2003) Molecular basis of inherited diseases: a structural perspective. TrendsGenet 19:505–13. [DOI] [PubMed] [Google Scholar]
- 53. Ye Y, Li Z, Godzik A (2006) Modeling and analyzing three dimensional structures of human disease proteins. Pac SympBiocomput 439:50. [PubMed] [Google Scholar]
- 54. George Priya Doss C, Nagasundaram N, Chakraborty Chiranjib, Chen L, Zhu H (2013) Extrapolating the effect of deleterious nsSNPs in the binding adaptability of flavopiridol with CDK7 protein: a molecular dynamics approach. Hum Genomics 7(1):10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Nagasundaram N, Doss CG (2013) Predicting the impact of single-nucleotide polymorphisms in CDK2-flavopiridol complex by molecular dynamics analysis. Cell BiochemBiophys 66(3):681–95. [DOI] [PubMed] [Google Scholar]
- 56. George Priya Doss C, Rajith B, Chakraborty Chiranjib, NagaSundaram N, Ali SK, Zhu H (2014) Structural signature of the G719S-T790M double mutation in the EGFR kinase domain and its response to inhibitors. Scientific Reports 4:5868 10.1038/srep05868 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. George Priya Doss C, Chakraborty Chiranjib, Haneef SA, Nagasundaram N, Chen L, Zhu H (2014) Evolutionary and structural based computational design to reveal the impact of deleterious missense mutations in Maturity onset diabetes of the young, type 2. Theranostics 4:366–385. 10.7150/thno.7473 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information file.