Abstract
The cytotoxic T-lymphocyte antigen-4 (CTLA4) is essential in controlling T cell activity within the immune system. Thus, uncovering the molecular dynamics of single nucleotide polymorphisms (SNPs) within the CTLA4 gene is critical. We identified the non-synonymous SNPs (nsSNPs), examined their impact on protein stability, and identified the protein sequences associated with them in the human CTLA4 gene. There were 3134 SNPs (rsIDs) in our study. Out of these, 186 missense variants (5.93%), 1491 intron variants (47.57%), and 91 synonymous variants (2.90%), while the remaining SNPs were unspecified. We utilized SIFT, PolyPhen-2, PROVEAN, and SNAP for identifying deleterious nsSNPs, and SNPs&GO, PhD SNP, and PANTHER for verifying risk nsSNPs in the CTLA4 gene. Following SIFT analysis, six nsSNPs were identified as deleterious and reporting second and third nsSNPs as probably damaging and one as benign, respectively. From upstream analysis, rs138279736, rs201778935, rs369567630, and rs376038796 were found to be deleterious, probably damaging, and disease associated. ConSurf predicted conservation scores for four nsSNPs, and Project Hope suggested that all mutations could disrupt protein interactions. Furthermore, mCSM and DynaMut2 analyses indicated a decrease in ΔΔG stability for the mutants. GeneMANIA and STRING networks highlighted correlations with CD86 and CD80 genes. Finally, MD simulation revealed consistent fluctuation in RMSD and RMSF, consequently Rg, hydrogen bonds, and PCA in the mutant proteins compared with wild-type, which might alter the functional and structural stability of CTLA4 protein. The current comprehensive study shows how various nsSNPs in the CTLA4 gene can modify the structural and functional characteristics of the protein, potentially influencing the pathogenesis of diseases in humans. Further, experimental studies are needed to analyze the effect of these nsSNPs on the susceptibility of pathological phenotype populations.
1. Introduction
Cytotoxic T-lymphocyte antigen 4 (CTLA4), also well-known as cluster of differentiation 152 (CD152), is a key immune regulatory protein which regulates T cell activation and immunological homoeostasis [1, 2]. The CTLA4 gene can inhibit T cell activation by attaching to cluster of differentiation 80 (CD80) and cluster of differentiation 86 (CD86) on antigen-presenting cells, avoiding excessive immunological activation and maintaining immune homoeostasis [3, 4]. This pathway is critical for autoimmunity prevention, inflammation control, and immunological tolerance regulation. The CTLA4 gene, which comprises four exons and resides on chromosome 2q33, encodes the CTLA4 protein. The CTLA4 gene exhibits a variety of genetic variants, including insertions, deletions, SNPs, and microsatellite repeats [5]. These variants may modify CTLA4 protein expression, structure, or function, altering immune control and perhaps having an impact on the onset of disease [6]. Association of CTLA4 gene polymorphisms have been found in Japanese patients with rheumatoid arthritis [7], South-Moroccan patients with type 1 diabetes (T1D) [8], Han-Chinese [9] and Asian [10] patients with breast cancer, as well as Iranian patients with gastric and colorectal cancers [11]. One study also found links between the CTLA4 gene SNPs and autoimmune disorders [12]. The relationship between CTLA4 gene SNPs and autoimmune thyroid illnesses like Graves’ disease and Hashimoto’s thyroiditis highlights the potential contribution of genetic variants to the immunological dysregulation seen in these ailments. Furthermore, links between CTLA4 gene polymorphisms and rheumatoid arthritis, T1D, systemic lupus erythematosus (SLE), and multiple sclerosis have been found, pointing to a shared genetic vulnerability for autoimmune etiology [13–18].
CTLA4 gene SNPs have been studied in relation to cancer in addition to autoimmune diseases [19]. For example, SNPs such as rs231775 (CT60), have been linked to elevated cancer risk and altered immunotherapy responses [20]. These results highlighted the importance of the CTLA4 gene in tumor immunity and suggests the potential role of genetic variation in the progression and prognosis of cancer. Additionally, the relationship between CTLA4 polymorphisms and infectious disorders has been studied. Certain CTLA4 gene polymorphisms have been linked to hepatitis B and C infections, tuberculosis, and other diseases, suggesting that they may have an impact on how the body reacts to infectious agents [21]. Furthermore, it has been shown that CTLA4 polymorphisms affect the risk and severity graft-versus-host disease (GVHD) [22], a side effect of stem cell transplantation, albeit further research is needed to fully empathize the underlying mechanisms. The precise mechanisms behind how particular CTLA4 polymorphisms contribute to disease risk and development are still completely unidentified. For the complex interactions between the immune system and disease development to be fully understood, it is essential to comprehend the functional and structural implications of these genetic variants.
Within the coding region, SNPs manifest in two forms: synonymous and non-synonymous SNPs (nsSNPs), with the latter type directly impacting protein sequences [23]. Those nsSNPs could change the stability, structure, and functions of the respective protein [24, 25]. The aim of this study was to use bioinformatics tools to predict nsSNPs in the CTLA4 gene that may be deleterious. Subsequently, the CTLA4 gene underwent comprehensive analysis to assess its pathogenicity.
2. Materials and methods
The entire process used in our investigation is outlined (Fig 1).
Fig 1. The complete workflow employed in this study.
2.1. Retrieval of protein sequence and nsSNPs in CTLA4 gene
The protein sequence (FASTA format) and nsSNP of CTLA4 gene were obtained using National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/) and dbSNP database (https://www.ncbi.nlm.nih.gov/snp/), respectively (accessed November, 2024). Since missense mutations can prevent DNA-transcription, leading to alterations in the protein expression, nsSNPs were subsequently subjected to analysis using various bioinformatic methods [26].
2.2. Identification of deleterious nsSNPs in CTLA4 gene
SIFT web-server (https://sift.bii.a-star.edu.sg/) was employed to identify deleterious non-synonymous SNPs (nsSNPs) in CTLA4 gene. SIFT assesses functional amino acids by presuming conservation of major amino acids, where changes at certain positions tend to be deleterious. SNPs with a probability score of ≤0.05 were classified as deleterious, while those with a score >0.05 were considered tolerated. The analysis involved inputting single amino acid substitutions and rsIDs of each SNP in the human CTLA4 gene [27].
PolyPhen-2 web-server (http://genetics.bwh.harvard.edu/pph2/) categorize and predict the functional effects of allele variations. It calculates the site-specific sequence conservation score of position-specific independent count (PSIC) and estimates the native and mutant variants differences. PolyPhen-2 classifies SNPs into three, 1) benign (0.00 to ≤0.45), 2) possibly damaging (>0.45 to ≤0.95), or 3) probably damaging (>0.95 to 1). The analysis entails inputting single amino acid changes and the FASTA sequence of the CTLA4 protein into PolyPhen-2 [28].
The PROVEAN web server (http://provean.jcvi.org) was employed to assess whether an amino acid alteration in protein sequences is harmful or neutral. It utilizes a delta alignment score derived from aligning homologous sequences with both the native and mutant protein sequences. Variants with a score of ≤ −2.5 are considered deleterious, while those with a score > −2.5 are deemed neutral. The input for PROVEAN includes the FASTA sequence of the CTLA4 protein and a list of SNPs [29].
SNAP, accessible at http://www.rostlab.org/services/SNAP, is a web-based tool utilized to assess the functional impact of amino acid substitutions in proteins using a neural network method. The tool rates each amino acid variation, identifying them as having either neutral or significant effects [30].
2.3. Verification of the risk nsSNPs
The SNPs&GO server (http://snps.biofold.org/snps-and-go/) was used to predict the effects of nsSNPs to evaluate the likelihood that each variant will be linked to a disease in humans. A prediction value of ≥0.5 indicates a variant’s potential involvement in disease, whereas a value <0.5 suggests a neutral impact. Additionally, this service combines its predictions with results from the PhD-SNP and PANTHER for comprehensive analysis [31].
2.4. Determination of protein structural stability using I-Mutant2.0 and MUpro server
I-mutant2.0 (http://folding.biofold.org/i-mutant/i-mutant2.0.html) was utilized to predict the potential effects of nsSNPs on protein stability. Following this analysis, Delta Delta G (ΔΔG) values were predicted. Protein stability rises with a ΔΔG score ≥0 kcal/mol and falls with a ΔΔG score <0 kcal/mol. The protein’s FASTA sequence, the location of the amino acid substitution, and the variant residue were entered for analysis [32].
The MUpro server (http://mupro.proteomics.ics.uci.edu/) is used to forecast how amino acid substitutions might lower or raise the stability of a respective protein. This server employs both neural networks and support vector machines (SVM) for its predictions. A confidence score <0 indicates a decrease in protein stability, while a score ≥0 suggests an increase [33].
2.5. Evolutionary conservation analysis by ConSurf
The ConSurf web server (http://consurf.tau.ac.il/) was employed to trace the evolutionary conservation of amino acid residues (wild type) and to pinpoint nsSNPs at particular positions. By generating a phylogenetic tree from the phylogenetic relationships among homologous sequences, this server calculates the evolutionary trajectories of amino acid sites within a protein [34].
2.6. Prediction of structural effects of CTLA4 mutants
The Project Hope web server (http://www.cmbi.ru.nl/hope/) was used to assess the structural and functional impact of point mutations. The HOPE server provides a 3D visualization of the mutated proteins, incorporating predictions from UniProt and DAS servers. Inputs included the protein sequence, wild type (WT), and mutated amino acids, with the results presented in text, graphical, and animated formats [35].
2.7. Comparative modelling of wild-type CTLA4 protein and their mutant structures
The mCSM (http://structure.bioc.cam.ac.uk/mcsm) is a machine learning method to predict the effects of missense mutations based on the structural characteristics of the corresponding protein. Here is the mCSM determination of the effects of missense variants on CTLA4 stability. The tools predict changes in protein folding free energy (ΔΔG) as a consequence of missense mutations and are classified into two categories (stabilizing and destabilizing) based on the ΔΔG [36]. The DynaMut2 web-server (https://biosig.lab.uq.edu.au/dynamut2/) was utilize to calculate the impact on the flexibility and stability of proteins. Using the normal mode analysis (NMA) approach, DynaMut2 was utilized to determine how the mutation affected protein stability and dynamics. The predicted Gibbs free energy (G) values of mutants <0 was categorized as destabilizing, and those >0 as stabilizing. Both single mutations and multiple mutations can be predicted by DynaMut2, as it utilizes a single mutation prediction feature for the corresponding protein. We inputted a list of mutations and the wild-type structure in PDB format [37].
2.8. Gene-gene, protein-protein interactions and pathway enrichment analysis
GeneMANIA (http://www.genemania.org) is a web-based tool that sifts through an extensive database to discover supplementary genes associated with a provided set of input genes. Examples of associated data include co-expression, pathways, colocalization, protein domain similarity, and protein-protein interactions. We searched GeneMANIA using CTLA4 as an input gene for Homo sapiens [38]. Protein-protein interactions are crucial for determining how proteins function and play specific roles in disease processes as proteins are a crucial component of molecular pathways. The Search Tool for the Retrieval of Interacting Genes (STRING) (https://string-db.org/) is a biological database and online tool used for interpreting the protein-protein interactions. A search conducted using the terms "CTLA4 gene" and "Homo sapiens" as input genes, using the highest confidence score of 0.999, we identified ten interacting proteins [39]. The publicly available Web-Gestalt (WEB-based Gene SeT AnaLysis Toolkit) (http://www.webgestalt.org/) was used to analyze the pathways enrichment analysis of the associated gene [40].
2.9. Molecular dynamics simulation analysis
A 100 ns MD simulation analysis were performed under the Linux framework in Schrodinger 2020–3 with “Desmond v6.3 Program” to assess the structural stability of the corresponding protein [41]. The three-site transferrable intermolecular potential (TIP3P) water model was used to analyze the MD simulation [42]. An orthorhombic box shape was used to maintain a specified volume and Na+ and Cl- were added to neutralize the whole system with a salt concentration of 0.15 M. An OPLS3e force field was applied [43]. Further the protein structure system minimized using a natural time and pressure (NPT) ensemble at a constant pressure of 101325 Pascal’s and a temperature of 300 K. The protein stability and dynamic properties were assessed using the RMSD (root means square deviation), RMSF (root means square fluctuation), Rg (radius of gyration), and hydrogen bonds. Principal Component Analysis (PCA) has been widely used to study the slow and functional movements of biomolecules [44]. Prior to performing PCA, the correlation matrix C needed to be calculated. These are the definitions of the parts Cij in the matrix, Cij = ⟨(xi−⟨xi⟩) (xj−⟨xj⟩)⟩ where xi and xj are instant coordinates of atoms and 〈xi〉 and 〈xj〉 are the average coordinate over the group. Principal Component Analysis (PCA) has been widely used to study the slow and functional movements of biomolecules [44]. Prior to performing PCA, the correlation matrix C needed to be calculated.
3. Results
3.1. Retrieval of protein sequence and nsSNPs in CTLA4 gene
Following dbSNP database, we have found a total of 3134 SNPs. Following these, about 186 missense variants (5.93%), 1491 intron variants (47.57%), and 91 synonymous variants (2.90%) were computed (Fig 2A). The protein sequence (UniProt ID: P16410) of CTLA4 gene has been retrieved from NCBI. In this study, missense variants were subjected to further computational analysis to explore the most deleterious nsSNPs.
Fig 2. Retrieved SNPs of CTLA4 gene and evolutionary conservation analysis.
(A) Pie chart representing total SNPs of CTLA4 gene. (B) Prediction of evolutionary conserved amino acid residues by ConSurf server. Conservation score is represented as the color-coding bars.
3.2. Identification of deleterious nsSNPs in CTLA4 gene
Following the analysis of 186 nsSNPs using SIFT, 18 nsSNPs were identified, with 12 being eliminated as they were deemed tolerated by SIFT. Among the remaining 6 nsSNPs, those with a tolerance index of ≤ 0.05 were predicted as deleterious. Polyphen-2 produced two scores, HumDiv and HumVar, for the 6 nsSNPs analyzed. HumDiv identified 2 nsSNPs as probably damaging with high confidence, 1 as possibly damaging, and 1 as benign. In contrast, HumVar classified 3 nsSNPs as probably damaging with high confidence and 1 as benign. Subsequent analysis with the PROVEAN program revealed that 3 out of the 4 nsSNPs in the CTLA4 gene were deleterious. Additionally, the SNAP program was employed to evaluate the impact of 4 nsSNPs on the protein sequence, identifying 3 as having non-neutral effects.
3.3. Verification of the risk nsSNPs
We employed the SNPs&GO, PhD-SNP, and PANTHER programs to forecast disease-related nsSNPs. Out of the 4 nsSNPs examined, SNPs&GO identified 4, PhD-SNP recognized 3, and PANTHER detected 3 nsSNPs linked to disease (Table 1). Upon comparing the outcomes from analyses of the 4 nsSNPs using computational tools including SIFT, Polyphen-2, PROVEAN, SNAP, SNPs&GO, PhD-SNP, and PANTHER, we observed a strong concordance among the 7 programs regarding nsSNPs rs138279736, rs201778935, rs369567630 and rs376038796.
Table 1. Screening of deleterious single nucleotide polymorphism (SNP) predicted by SIFT, Polyphen-2, PROVEAN, SNAP, SNPs&GO, PhD-SNP, and Panther.
| SNP Id | Allele | Variant | SIFT | Polyphen-2 | Provean | SNAP | SNPs&GO | PhD SNP | Panther | |
|---|---|---|---|---|---|---|---|---|---|---|
| HumDiv | HumVar | |||||||||
| rs138279736 | G/A | R8Q | D | poss | Pro | D | E | N | Dis | N |
| rs201778935 | C/T | R8W | D | pro | Pro | D | E | N | Dis | Dis |
| rs369567630 | C/T | P32S | D | benign | benign | D | N | Dis | Dis | Dis |
| rs376038796 | C/T | A86V | D | pro | Pro | N | E | Dis | N | Dis |
D: Deleterious, E: Effect N: neutral, Dis: Disease, Pro: Probably damaging (high confident), Poss: Possibly damaging (low confident).
3.4. Determination of protein structural stability I-Mutant2.0 and MUpro server
The structural stability of the protein was assessed using both I-Mutant2.0 and the MUpro server. These servers predict the structural impact of four selected nsSNPs by analyzing the change in free energy (ΔΔG) and reliability index (RI) following mutation. I-Mutant2.0 predicted the stability changes caused by the second nsSNP, while MUpro predicted those caused by the third nsSNP in the CTLA4 protein (Table 2).
Table 2. Characterization of the effect of deleterious SNPs on protein stability by I-mutant2.0 and MUpro.
| SNP ID | Variant | I-Mutant2.0 | MUpro | ||
|---|---|---|---|---|---|
| Stability | DDG (kcal/mol) | Prediction | Confidence score | ||
| rs138279736 | R8Q | Increase | 0.41 | Decrease | -1.0 |
| rs201778935 | R8W | Increase | 0.12 | Decrease | -1.0 |
| rs369567630 | P32S | Decrease | -0.82 | Decrease | -0.84 |
| rs376038796 | A86V | Decrease | -0.20 | Increase | 0.44 |
3.5. Evolutionary conservation analysis by ConSurf
Using the ConSurf tool, a color-coded conservation score was generated for the CTLA4 protein, highlighting highly conserved functional areas. Among the four most harmful SNPs, ConSurf predicted two amino acids (R8Q and R8W) with a conservation score of 4, which are exposed residue and rest two (P32S and A86V) with a score of 5 and 9; buried and structural residue respectively (Fig 2B). Highly conserved regions typically play a critical role in the biological function of the respected protein.
3.6. Prediction of structural effects of CTLA4 mutants
The Project Hope server assesses how mutations affect the structural characteristics of a protein, considering factors (size, charge, hydrophobicity, and spatial structure) in comparison to the wild type (WT). Maintaining protein stability depends critically on the size of amino acid residues. Alterations in residue size can impair folding, structural integrity, or functional interactions, resulting in protein failure. Among the four anticipated substitutions, the mutant residues R8Q and R8W were smaller than the wild-type residues, while the mutants P32S and A86V were larger. Consequently, mutating these residues may disrupt protein interactions (Table 3).
Table 3. The effect mutation on protein using Project Hope prediction.
| Rs Id | Variant | Wild Type | Mutant Type |
|---|---|---|---|
| rs138279736 | R8Q | Arginine | Glutamine |
| rs201778935 | R8W | Arginine | Tryptophan |
| rs369567630 | P32S | Proline | Serine |
| rs376038796 | A86V | Alanine | Valine |
3.7. Comparative modelling of WT CTLA4 protein and their mutant structures
We used mCSM to assess the impacts of missense variants on CTLA4 stability. The mCSM relies on graph-based signatures to predict the impacts of missense variants on protein stability. Here, all four variants destabilized the protein (Table 4). DynaMut2 was used to calculate the general dynamic traits of the highest deleterious nsSNPs including R8Q, R8W, P32S and A86V mutants (Table 5). DynaMut2 interprets the predictions for ΔΔG stability value among the WT and mutant CTLA4 protein. All mutants showed a decrease in the ΔΔG stability value compared to WT and were found to be responsible for destabilizing the protein. The free energy (ΔΔG) stability values for R8Q, R8W, P32S and A86V mutants were found -0.51, -0.34, -0.96 and -0.94 kcal/mol respectively.
Table 4. Alterations in stability of protein and interaction upon nsSNPs.
| SL | RsID | Variants | RSA (%) | DDG Value | Prediction |
|---|---|---|---|---|---|
| 01 | rs138279736 | R8Q | 100 | -0.007 | Destabilizing |
| 02 | rs201778935 | R8W | 100 | -0.369 | Destabilizing |
| 03 | rs369567630 | P32S | 74 | -0.806 | Destabilizing |
| 04 | rs376038796 | A86V | 19.2 | -0.498 | Destabilizing |
Table 5. Alterations in structural stability and molecular interactions due to nsSNPs.
| SL | RsID | Variant | ΔΔG | ΔΔGStability |
|---|---|---|---|---|
| 01 | rs138279736 | R8Q | -0.51 kcal/mol | Destabilising |
| 02 | rs201778935 | R8W | -0.34 kcal/mol | Destabilising |
| 03 | rs369567630 | P32S | -0.96 kcal/mol | Destabilising |
| 04 | rs376038796 | A86V | -0.94 kcal/mol | Destabilising |
3.8. Gene-gene, protein-protein interactions and pathway enrichment analysis
Protein interactions are fundamental in determining the resilience and flexibility of biological networks. Disruptions in these relationships may result in network malfunction, significantly impacting cellular and organismal health. The interaction network between genes associated with the CTLA4 gene demonstrated several correlations. Gene-gene interaction networks and functional analyses highlighted gene sets enriched within the CTLA4 target network. Various interactions were represented by different colors of network edges, including Physical Interactions, Co-expression, Co-localization, Predicted, Genetic Interactions, Pathway, and Shared protein domains. GeneMANIA constructed a composite gene-gene functional interaction network. It enhances our comprehension of the molecular mechanisms that elucidate observable phenotypes. The study identified an association between the CTLA4 gene and 20 other genes, with CD86, CD80, and adaptor related protein complex 2 subunit mu 1 (AP2M1) (Fig 3A and Table 6).
Fig 3. Gene-gene and protein-protein interactions analysis.
(A) GeneMANIA gene-gene interaction for CTLA4 Gene Different colors of the network edge indicate: co-expression, website prediction, pathway, physical interactions and co-localization. (B) CTLA4 interacts with a total of 10 different proteins. Colored nodes: query proteins and first shell of interactors. Empty nodes: proteins of unknown 3D structure. Filled nodes: 3D structure is known/predicted. Edges represent protein-protein associations.
Table 6. Gene functionally linked to CTLA4 identified using GeneMANIA.
| Gene | Description |
|---|---|
| CTLA4 | cytotoxic T-lymphocyte associated protein 4 |
| CD86 | CD86 molecule |
| CD80 | CD80 molecule |
| AP2M1 | adaptor related protein complex 2 subunit mu 1 |
| FOXP3 | forkhead box P3 |
| LRBA | LPS responsive beige-like anchor protein |
| FYN | FYN proto-oncogene, Src family tyrosine kinase |
| PTPN11 | protein tyrosine phosphatase non-receptor type 11 |
| STAT5B | signal transducer and activator of transcription 5B |
| CD28 | CD28 molecule |
| YES1 | YES proto-oncogene 1, Src family tyrosine kinase |
| JAK2 | Janus kinase 2 |
| NFIA | nuclear factor I A |
| STAT5A | signal transducer and activator of transcription 5A |
| LCK | LCK proto-oncogene, Src family tyrosine kinase |
| LYN | LYN proto-oncogene, Src family tyrosine kinase |
| NFATC2 | nuclear factor of activated T cells 2 |
| PTPN7 | protein tyrosine phosphatase non-receptor type 7 |
| BATF | basic leucine zipper ATF-like transcription factor |
| LAX1 | lymphocyte transmembrane adaptor 1 |
| CD5 | CD5 molecule |
Proteins collaborate and interact with others to facilitate cell signaling and various cellular functions. Consequently, variations in amino acids within a protein can impact other proteins within the network. The STRING tool predicted the functional partners of CTLA4 gene. CTLA4, also known as Cytotoxic T-lymphocyte protein 4, serves as a significant negative regulator of T-cell responses. It exhibits notably stronger affinity towards its natural B7 family ligands, CD80 and CD86, compared to the stimulatory coreceptor cluster of differentiation 28 (CD28). LTF emerged as the primary interacting partner of the T-lymphocyte activation antigen CD80 (Fig 3B).
The KEGG pathway enrichment analysis, based on the highest enrichment ratio and FDR value revealed that those gene regulate several metabolic pathways (S1 Table). According to this analysis, the most enriched pathways were PD-L1 expression and PD-1 checkpoint pathway in cancer, T cell receptor signaling pathway, Th17 cell differentiation, Cell adhesion molecules and Autoimmune thyroid disease (S1 Table).
3.9. Molecular dynamics simulation analysis
The structural stability of native and four mutant proteins were calculated (RMSD and RMSF) and compared to evaluate structural and functional change due to the mutation of the corresponding protein. Here, 100 ns MD simulation analysis were conducted to observe this deviation in an artificial environment. Mutants R8Q, R8W, P32S and A86V showed considerable fluctuations compare with the native protein CTLA4. The RMSD values for the native protein were observed ranging from 0.325 nm to 4.038 nm while 0.398 nm to 3.980 nm for the R8Q mutant, 0.364 nm to 3.628 nm for the R8W mutant, 0.323 nm to 3.968 nm for the P32S mutant, 0.317 nm to 4.436 nm for the A86V mutant. The average RMSD values of native CTLA4, and mutant R8Q, R8W, P32S and A86V are 3.103 nm, 2.987 nm, 2.759 nm, 3.164 nm, and 3.680 nm, respectively (Fig 4). Overall, the RMSD plot showed that the native protein was stable along the simulation run time and also demonstrated that the mutant protein has a major impact on the structural confirmation of the CTLA4 protein. The RMSF of protein residue was counted to observe how these changes impact the local flexibility. In addition, RMSF analysis demonstrated considerable fluctuations between native and mutant structures throughout the simulation run. The average RMSF values are 1.102 nm, 0.907 nm, 0.886 nm, 0.87 nm, 0.891 nm, and 1.060 nm correspondingly (Fig 5). The Rg measures how the atoms are distributed around the axis of a protein. It is an important indicator for predicting the structural activity of macromolecules and for assessing changes in the compactness of the protein structure. The native protein and four mutants R8Q, R8W, P32S and A86V revealed Rg ranges of 2.189 nm to 5.025 nm, 2.651 nm to 4.877 nm, 2.246 nm to 5.249 nm, 2.181 nm to 5.060 nm, and 1.982 nm to 5.189 nm, respectively. Average fluctuation of these mutants was 3.108 nm, 3.297 nm, 3.406 nm, 2.856 nm, and 2.583 nm, respectively (Fig 6). The mutated protein structures were unstable in 100 ns simulations with greater fluctuation differences from lowest to highest, suggesting that the mutated proteins significantly alter the conformation of the respective protein’s active site. The number of hydrogen bonds can help to characterize a protein. Therefore, hydrogen bond numbers were calculated from initial to final times during the 100 ns simulation run to observe each hydrogen bond. All the proteins formed several hydrogen bonds ranging between 128 to 184 occur simultaneously until 100 ns simulation time (Fig 7 and S2 Table). These results indicated that the native protein has a higher specificity while the mutated protein has a higher fluctuating range of hydrogen bonds denoted the structural instability. The PCA plots provide insights into the genetic variance and clustering patterns for four mutations compared to the wild-type. Each mutant SNP variant exhibits unique clustering patterns compared to the wild-type. The difference patterns in the PCA plots show that each mutation (R8W, R8Q, P32S, and A86V) might change structure or function in a way that is different from the wild-type (Fig 8).
Fig 4. RMSF analysis shows Cα atom fluctuations over 100 ns simulation.
The color scheme is as follows: native (blue), R8Q mutant (red), R8W mutant (green), P32S mutant (violet), and A86V mutant (orange). RMSF analysis shows Cα atom fluctuations over 100 ns simulation.
Fig 5. Root mean square fluctuation (RMSF) analysis reveals variations in Cα atom dynamics over 100 ns.
The color scheme is as follows: native (blue), R8Q mutant (red), R8W mutant (green), P32S mutant (violet), and A86V mutant (orange).
Fig 6. Radius of gyration (Rg) analysis shows structural stability differences in wild-type and mutant proteins.
It is represented as a time-dependent change during the simulation. The color scheme is as follows: native (blue), R8Q mutant (red), R8W mutant (green), P32S mutant (violet), and A86V mutant (orange).
Fig 7. Analysis of hydrogen bonds in wild type and mutant proteins over 100 ns simulation conducted.
The color scheme is as follows: native (blue), R8Q mutant (red), R8W mutant (green), P32S mutant (violet), and A86V mutant (orange).
Fig 8. Principal component analysis (PCA) of wild type and mutant proteins for molecular dynamics simulation trajectories.
4. Discussion
The human Cytotoxic T-lymphocyte antigen 4 (hg-CTLA4) serves as a crucial negative regulator in the immune system, particularly for regulatory T cells (Tregs), which play a role in suppressing T cell proliferation and differentiation. Constitutively expressed by Tregs, hg-CTLA4 is also induced in activated conventional T cells [1, 2, 45, 46]. Mutations in the CTLA4 gene have been associated with a range of clinical manifestations, encompassing different autoimmune conditions affecting specific organs, hypo-gammaglobulinemia, recurrent infections, and cancer [47]. Missense or non-synonymous mutations contribute to protein destabilization and can influence susceptibility to diseases as well as responses to drug treatments [48, 49]. Numerous SNPs within the CTLA4 gene have been identified and documented in the dbSNP database. Consequently, we conducted systematic and comprehensive bioinformatic analyses to uncover the functional and structural impact of nsSNPs within the human CTLA4 gene. Our aim was to understand how these mutations affect the gene’s functionality and its role in disease pathogenesis. Through in silico structural and functional analyses, we identified potential nsSNPs within the CTLA4 gene.
Seven web servers, namely SIFT, PolyPhen-2, PROVEAN, SNAP, SNPs&GO, PhD-SNP, and PANTHER, were utilized to identify the most detrimental nsSNPs. By comparing the scores from all servers, we determined that four nsSNPs R8Q, R8W, P32S, and A86V were considered deleterious, likely causing damage, affecting protein function, and associated with diseases [50]. These nsSNPs were then selected for further analysis using additional in silico tools. Stability is a vital property of corresponding protein that affects the function, activity, and regulation of protein. Stability changes occurred due to mutation of proteins which are involved in diseases [51]. Using the I-Mutant2.0 and MUpro programs, protein stability changes were observed. In P32S and A86V nsSNPs, and R8Q, R8W, and P32S nsSNPs were decreased protein stability using I-Mutant2.0 and MUpro programs respectively. Subsequently, the ConSurf tool generated a color-coded conservation score for the CTLA4 protein, highlighting regions of high conservation. Within the four most harmful nsSNPs, ConSurf identified two amino acids with a conservation score of 4, one with a score of 5, and one with a score of 9. These conserved residues typically play critical roles in biological function. When a highly conserved residue undergoes mutation, it tends to have more detrimental effects compared to a less conserved one [24].
Once more, HOPE offers 3D structural depictions of both mutated and wild-type (WT) proteins. Predicted substitutions from the HOPE server indicated that mutant residues R8Q and R8W were smaller than the wild residues, while P32S and A86V mutants were larger. This disparity could result in an unoccupied area within the protein’s core, potentially leading to the loss of hydrophobic interactions. Such vacancies might affect the protein’s function, properties, or reactivity [52]. Those variants were analyzed following mCSM, where ΔΔG < −0 kcal/mol were considered to reduce CTLA4 stability (destabilizing). Here, all four variants resulted in reduced protein stability (Table 4). The WT and mutant CTLA4 protein ΔΔG stability value for each protein structure was predicted using DynaMut2. The free energy (ΔΔG) stability values for R8Q, R8W, P32S and A86V mutants were found -0.51, -0.34, -0.96 and -0.94 kcal/mol respectively (Table 5). All mutants showed a decrease in the ΔΔG stability value compared to WT and were found to be responsible for destabilizing the protein. This difference in the free energy landscape explains how the mutation influences the protein’s stability [37].
GeneMANIA analysis showed that CTLA4 interacts significantly with 20 different proteins, which are particularly crucial in neurogenesis. Among these, CD86, CD80, and AP2M1 exhibit particularly significant interactions with CTLA4. CD86 is involved in regulating B-cell IgG1 production levels and signaling for self-regulation and cell-cell association or disassociation. Similarly, CD80, also known as the CD80 molecule, plays a role in T-cell activation and in regulating the activity of both normal and malignant B cells [53, 54]. Together with CD80, CD86 delivers costimulatory signals crucial for T cell activation and survival. AP2M1 facilitates autophagy-induced degradation of CLDN2 through endocytosis and interaction with LC3, consequently reducing intestinal epithelial tight junction permeability. The String tool was utilized to anticipate the close interactor proteins of CTLA4 (Fig 3B). Proteins operate in a concerted manner and interact with others to execute cell signaling and various cellular functions. CTLA4 exhibits notably stronger affinity towards its natural B7 family ligands, CD80 and CD86, compared to the stimulatory coreceptor CD28. LTF emerged as the primary interacting partner of the T-lymphocyte activation antigen CD80. The pathway enrichment analysis revealed that genes interacting with the CTLA4 gene were associated with disease and therefore, might play a significant regulatory network in the progression of disease pathogenesis [55]. In order to investigate the impact of alterations on the protein structure, 100 ns MD simulations were utilized for the above-mentioned variations. This analysis was conducted to observe this deviation in an artificial environment. Mutants R8Q, R8W, P32S and A86V showed considerable fluctuations compare with the native protein CTLA4. The RMSD of protein residue was counted to observe how these changes impact the local flexibility (Fig 4). In addition, RMSF analysis demonstrated considerable fluctuations between native and mutant structures throughout the simulation run (Fig 5). The protein structure is stabilized when the RMSD and RMSF values of a protein are within 0.1–0.3 nm [56]. The Rg plot displays against time for all mutant proteins as well as the wild type protein. It was discovered that the average Rg of the wild type protein was 3.108 nm, whereas the mutant was 3.297 nm, 3.406 nm, 2.856 nm, and 2.583 nm, respectively (Fig 6). Similarly, the increased number of hydrogen bonds causes the structural unsteadiness of the corresponding protein (Fig 7). The PCA analysis provide insights into the genetic variance and clustering patterns. The difference patterns in the PCA plots show that each mutation (R8W, R8Q, P32S, and A86V) might change structure or function in a way that is different from the wild-type. The PCA study reveals genetic differences between wild-type and each SNP mutation add discrete genetic diversity (Fig 8). These can direct next research on the possible consequences of any mutation on protein stability, function, or phenotype.
In Chinese Han population and Egyptian, rs231775 of CTLA4 gene has found involvement in hepatocellular carcinoma and a direct association with cancer [57, 58]. In Pakistan, these rs231775 are also associated with HCC [59]. In CTLA4 gene, rs221775 the variant is associated with multiple sclerosis susceptibility [60]. Similarly, rs11571317 and rs3087243 variants play role in breast cancer progression [61]. Genetic variants rs3087243 and rs231775 have an association with Graves’ disease in Chinese Han population [62]. Interestingly, three major high-risk SNPs, rs1553657429, rs1559591863, rs778534474 also have been found within CTLA4 gene [63]. Also, association of CTLA-4 gene polymorphisms were found in Japanese patients with rheumatoid arthritis [7], South Moroccan with type 1 diabetes [64], Chinese Han [9] and Asian populations [10], and Iranian patients with gastric and colorectal cancers [11].
The non-synonymous SNPs (nsSNPs) R8Q, R8W, P32S, and A86V (rs138279736, rs201778935, rs369567630, and rs376038796) have not yet been examined, underscoring the need for research that accounts for the ethnic variability in disease susceptibility across diverse populations. This study has identified SNPs within the CTLA4 gene that may potentially interfere with ligand-receptor interactions, suggesting a significant risk. Nevertheless, additional in-vitro and in-vivo experimentation, as well as genetic association studies in human population are needed to validate the impact of these nsSNPs on the pathological phenotypes and drive future progress about genetic polymorphism research.
5. Conclusion
Currently, in silico methodologies are gaining prominence as a crucial strategy for identifying SNPs associated with diseases. In the present work, a thorough investigation of the CTLA4 gene was conducted using various computational tools to explore how non-synonymous SNPs (nsSNPs) impact the protein’s structure and functionality. The study identified 3134 SNPs within the CTLA4 gene, encompassing 186 missense variants (5.93%), 1,491 intron variants (47.57%), and 91 synonymous variants (2.90%). Four specific nsSNPs within the CTLA4 gene rs138279736, rs201778935, rs369567630, and rs376038796 were highlighted as potentially harmful, likely to be damaging, and associated with diseases. Further analysis using I-Mutant2.0 and MUpro suggested that the stability of the CTLA4 protein could be altered by these nsSNPs. Additionally, ConSurf analysis indicated high conservation across most regions of the protein, suggesting that these mutations may significantly alter the protein’s physicochemical characteristics, such as size, charge, and hydrophobicity. These changes, in turn, could impact the protein’s stability and function, potentially leading to disease.
The results of this study suggest that the R8Q, R8W, P32S, and A86V mutations may contribute to the destabilization of the protein, which has also been confirmed by MD simulation. Finally, PCA study also reveals the genetic differences between wild-type and four SNP due to mutation. However, it is important to acknowledge that these conclusions are based on in silico analysis, and the potential link between these nsSNPs and disease susceptibility across diverse populations requires further experimental validation.
Supporting information
(DOCX)
(XLSX)
Acknowledgments
AKM Munzurul Hasan is the recipient of University Graduate Scholarships (UGS) and Graduate Teaching Fellowships (GTF) at the University of Saskatchewan.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was funded by the NSERC Discovery Grant awarded to D.P.C. and a grant (Project Number: RSP2024R357) from King Saud University, Riyadh, Saudi Arabia, for molecular dynamics simulations.
References
- 1.Walunas TL, Lenschow DJ, Bakker CY, Linsley PS, Freeman GJ, Green JM, et al. CTLA-4 can function as a negative regulator of T cell activation. Immunity. 1994;1: 405–413. doi: 10.1016/1074-7613(94)90071-x [DOI] [PubMed] [Google Scholar]
- 2.Regulation of CTLA-4 expression during T cell activation—PubMed. [cited 27 Dec 2023]. Available: https://pubmed.ncbi.nlm.nih.gov/8666782/
- 3.Perkins D, Wang Z, Donovan C, He H, Mark D, Guan G, et al. Regulation of CTLA-4 expression during T cell activation. The Journal of Immunology. 1996;156: 4154–4159. doi: 10.4049/JIMMUNOL.156.11.4154 [DOI] [PubMed] [Google Scholar]
- 4.Boise LH, Minn AJ, Noel PJ, June CH, Accavitti MA, Lindsten T, et al. CD28 costimulation can promote T cell survival by enhancing the expression of Bcl-xL. Immunity. 1995;3: 87–98. doi: 10.1016/1074-7613(95)90161-2 [DOI] [PubMed] [Google Scholar]
- 5.Ling V, Wu PW, Finnerty HF, Sharpe AH, Gray GS, Collins M. Complete Sequence Determination of the Mouse and Human CTLA4 Gene Loci: Cross-Species DNA Sequence Similarity beyond Exon Borders. Genomics. 1999;60: 341–355. doi: 10.1006/geno.1999.5930 [DOI] [PubMed] [Google Scholar]
- 6.Lindsten T, Lee KP, Harris ES, Petryniak B, Craighead N, Reynolds PJ, et al. Characterization of CTLA-4 structure and expression on human T cells. The Journal of Immunology. 1993;151: 3489–3499. doi: 10.4049/JIMMUNOL.151.7.3489 [DOI] [PubMed] [Google Scholar]
- 7.Yanagawa T, Gomi K, Nakao EI, Inada S. CTLA-4 gene polymorphism in Japanese patients with rheumatoid arthritis. J Rheumatol. 2000;27: 2740–2742. Available: https://europepmc.org/article/med/11128657 [PubMed] [Google Scholar]
- 8.Bouqbis L, Izaabel H, Akhayat O, Pérez-Lezaun A, Calafell F, Bertranpetit J, et al. Association of the CTLA4 promoter region (-1661G allele) with type 1 diabetes in the South Moroccan population. Genes Immun. 2003;4: 132–137. doi: 10.1038/sj.gene.6363933 [DOI] [PubMed] [Google Scholar]
- 9.Wang L, Li D, Fu Z, Li H, Jiang W, Li D. Association of CTLA-4 gene polymorphisms with sporadic breast cancer in Chinese Han population. BMC Cancer. 2007;7: 173. doi: 10.1186/1471-2407-7-173 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Dai Z, Tian T, Wang M, Liu X, Lin S, Yang P, et al. CTLA-4 polymorphisms associate with breast cancer susceptibility in Asians: A meta-analysis. PeerJ. 2017;2017. doi: 10.7717/peerj.2815 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hadinia A, Hossieni SV, Erfani N, Saberi-Firozi M, Fattahi MJ, Ghaderi A. CTLA-4 gene promoter and exon 1 polymorphisms in Iranian patients with gastric and colorectal cancers. J Gastroenterol Hepatol. 2007;22: 2283–2287. doi: 10.1111/j.1440-1746.2007.04862.x [DOI] [PubMed] [Google Scholar]
- 12.Hossen MM, Ma Y, Yin Z, Xia Y, Du J, Huang JY, et al. Current understanding of CTLA-4: from mechanism to autoimmune diseases. Front Immunol. 2023;14: 1198365. doi: 10.3389/fimmu.2023.1198365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bednarczuk T, Hiromatsu Y, Fukutani T, Jazdzewski K, Miskiewicz P, Osikowska M, et al. Association of cytotoxic T-lymphocyte-associated antigen-4 (CTLA-4) gene polymorphism and non-genetic factors with Graves’ ophthalmopathy in European and Japanese populations. Eur J Endocrinol. 2003;148: 13–18. doi: 10.1530/eje.0.1480013 [DOI] [PubMed] [Google Scholar]
- 14.Chistiakov DA, Turakulov RI. REVIEW CTLA-4 and its role in autoimmune thyroid disease. J Mol Endocrinol. 2003. Available: http://www.endocrinology.org [DOI] [PubMed] [Google Scholar]
- 15.Mochizuki M, Amemiya S, Kobayashi K, Kobayashi K, Shimura Y, Ishihara T, et al. Association of the CTLA-4 Gene 49 A/G Polymorphism With Type 1 Diabetes and Autoimmune Thyroid Disease in Japanese Children. Diabetes Care. 2003;26: 843–847. doi: 10.2337/diacare.26.3.843 [DOI] [PubMed] [Google Scholar]
- 16.Zhao J, Chen Y, Xu Z, Yang W, Zhu Z, Song Y, et al. Increased circulating follicular regulatory T cells in Hashimoto’s thyroiditis. Autoimmunity. 2018;51: 345–351. doi: 10.1080/08916934.2018.1516759 [DOI] [PubMed] [Google Scholar]
- 17.Liu W, Yang Z, Chen Y, Yang H, Wan X, Zhou X, et al. The Association Between CTLA-4, CD80/86, and CD28 Gene Polymorphisms and Rheumatoid Arthritis: An Original Study and Meta-Analysis. Front Med (Lausanne). 2021;8: 598076. doi: 10.3389/fmed.2021.598076 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liu J, Zhang HX. CTLA-4 Polymorphisms and Systemic Lupus Erythematosus: A Comprehensive Meta-Analysis. https://home.liebertpub.com/gtmb. 2013;17: 226–231. doi: 10.1089/GTMB.2012.0302 [DOI] [PMC free article] [PubMed]
- 19.Zhao Y, Yang W, Huang Y, Cui R, Li X, Li B. Evolving Roles for Targeting CTLA-4 in Cancer Immunotherapy. Cell Physiol Biochem. 2018;47: 721–734. doi: 10.1159/000490025 [DOI] [PubMed] [Google Scholar]
- 20.Hu SW, Pu D, Xia XY, Guo BX, Zhang CL, Jain N. CTLA-4 rs5742909 polymorphism and cervical cancer risk: A meta-analysis. Medicine. 2020;99: E19433. doi: 10.1097/MD.0000000000019433 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mohammad Alizadeh AH, Hajilooi M, Ranjbar M, Fallahian F, Mousavi SM. Cytotoxic T-lymphocyte antigen 4 gene polymorphisms and susceptibility to chronic hepatitis B. World J Gastroenterol. 2006;12: 630–635. doi: 10.3748/wjg.v12.i4.630 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Karabon L, Markiewicz M, Partyka A, Pawlak-Adamska E, Tomkiewicz A, Dzierzak-Mietla M, et al. A CT60G>A polymorphism in the CTLA-4 gene of the recipient may confer susceptibility to acute graft versus host disease after allogeneic hematopoietic stem cell transplantation. Immunogenetics. 2015;67: 295–304. doi: 10.1007/S00251-015-0840-7/TABLES/6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zhao Y, Wang K, Wang WL, Yin TT, Dong WQ, Xu CJ. A high-throughput SNP discovery strategy for RNA-seq data. BMC Genomics. 2019;20: 1–10. doi: 10.1186/S12864-019-5533-4/FIGURES/2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bappy MNI, Roy A, Rabbi MGR, Jahan N, Chowdhury FA, Hoque SF, et al. Scrutinizing Deleterious Nonsynonymous SNPs and Their Effect on Human POLD1 Gene. Genet Res (Camb). 2022;2022. doi: 10.1155/2022/1740768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kamal MM, Mia MS, Faruque MO, Rabby MG, Islam MN, Talukder MEK, et al. In silico functional, structural and pathogenicity analysis of missense single nucleotide polymorphisms in human MCM6 gene. Scientific Reports 2024 14:1. 2024;14: 1–18. doi: 10.1038/s41598-024-62299-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Zhang Z, Miteva MA, Wang L, Alexov E. Analyzing effects of naturally occurring missense mutations. Comput Math Methods Med. 2012;2012. doi: 10.1155/2012/805827 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40: W452–W457. doi: 10.1093/nar/gks539 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Adzhubei I, Jordan DM, Sunyaev SR. Predicting Functional Effect of Human Missense Mutations Using PolyPhen-2. Curr Protoc Hum Genet. 2013;76: 7.20.1–7.20.41. doi: 10.1002/0471142905.hg0720s76 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31: 2745–2747. doi: 10.1093/bioinformatics/btv195 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bromberg Y, Rost B. SNAP: predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res. 2007;35: 3823–3835. doi: 10.1093/nar/gkm238 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Capriotti E, Calabrese R, Fariselli P, Martelli PL, Altman RB, Casadio R. WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation. BMC Genomics. 2013;14 Suppl 3: 1–7. doi: 10.1186/1471-2164-14-S3-S6/FIGURES/2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Capriotti E, Fariselli P, Casadio R. I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Res. 2005;33: W306–W310. doi: 10.1093/nar/gki375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Worth CL, Preissner R, Blundell TL. SDM—a server for predicting effects of mutations on protein stability and malfunction. Nucleic Acids Res. 2011;39: W215–W222. doi: 10.1093/nar/gkr363 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ashkenazy H, Erez E, Martz E, Pupko T, Ben-Tal N. ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res. 2010;38: W529–W533. doi: 10.1093/nar/gkq399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Venselaar H, te Beek TAH, Kuipers RKP, Hekkelman ML, Vriend G. Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces. BMC Bioinformatics. 2010;11: 1–10. doi: 10.1186/1471-2105-11-548/FIGURES/5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pires DEV, Ascher DB, Blundell TL. mCSM: predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics. 2014;30: 335–342. doi: 10.1093/bioinformatics/btt691 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Rodrigues CH, Pires DE, Ascher DB, David Ascher CB, eduau unimelb. DynaMut2: Assessing changes in stability and flexibility upon single and multiple point missense mutations. Protein Science. 2021;30: 60–69. doi: 10.1002/PRO.3942 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zuberi K, Franz M, Rodriguez H, Montojo J, Lopes CT, Bader GD, et al. GeneMANIA Prediction Server 2013 Update. Nucleic Acids Res. 2013;41: W115–W122. doi: 10.1093/nar/gkt533 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47: D607–D613. doi: 10.1093/nar/gky1131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47: W199–W205. doi: 10.1093/nar/gkz401 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Imon RR, Kabir Talukder ME, Akhter S, Islam MS, Ahammad F, Anis-Ul-Haque KM, et al. Natural defense against multi-drug resistant Pseudomonas aeruginosa: Cassia occidentalis L. in vitro and in silico antibacterial activity. RSC Adv. 2023;13: 28773–28784. doi: 10.1039/d3ra03923d [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Mark P, Nilsson L. Structure and Dynamics of the TIP3P, SPC, and SPC/E Water Models at 298 K. Journal of Physical Chemistry A. 2001;105: 9954–9960. doi: 10.1021/JP003020W [DOI] [Google Scholar]
- 43.Roos K, Wu C, Damm W, Reboul M, Stevenson JM, Lu C, et al. OPLS3e: Extending Force Field Coverage for Drug-Like Small Molecules. J Chem Theory Comput. 2019;15: 1863–1874. doi: 10.1021/acs.jctc.8b01026 [DOI] [PubMed] [Google Scholar]
- 44.Bahar I, Atilgan AR, Demirel MC, Erman B. Vibrational Dynamics of Folded Proteins: Significance of Slow and Fast Motions in Relation to Function and Stability. Phys Rev Lett. 1998;80: 2733. doi: 10.1103/PhysRevLett.80.2733 [DOI] [Google Scholar]
- 45.Wing K, Onishi Y, Prieto-Martin P, Yamaguchi T, Miyara M, Fehervari Z, et al. CTLA-4 control over Foxp3+ regulatory T cell function. Science (1979). 2008;322: 271–275. doi: 10.1126/science.1160062 [DOI] [PubMed] [Google Scholar]
- 46.Rowshanravan B, Halliday N, Sansom DM. CTLA-4: a moving target in immunotherapy. Blood. 2018;131: 58–67. doi: 10.1182/blood-2017-06-741033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Schwab C, Gabrysch A, Olbrich P, Patiño V, Warnatz K, Wolff D, et al. Phenotype, penetrance, and treatment of 133 cytotoxic T-lymphocyte antigen 4–insufficient subjects. Journal of Allergy and Clinical Immunology. 2018;142: 1932–1946. doi: 10.1016/j.jaci.2018.02.055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Stefl S, Nishi H, Petukh M, Panchenko AR, Alexov E. Molecular mechanisms of disease-causing missense mutations. J Mol Biol. 2013;425: 3919–3936. doi: 10.1016/j.jmb.2013.07.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kamal MM, Teeya ST, Rahman MM, Talukder MEK, Sarmin S, Wani TA, et al. Prediction and assessment of deleterious and disease causing nonsynonymous single nucleotide polymorphisms (nsSNPs) in human FOXP4 gene: An in-silico study. Heliyon. 2024;10. doi: 10.1016/J.HELIYON.2024.E32791/ASSET/4A1B4DD9-6B74-4BCE-8CD8-B48E56CDF993/MAIN.ASSETS/FX11_LRG.JPG [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Kamal MM, Islam MN, Rabby MG, Zahid MA, Hasan MM. In Silico Functional and Structural Analysis of Non-synonymous Single Nucleotide Polymorphisms (nsSNPs) in Human Paired Box 4 Gene. Biochem Genet. 2023; 1–24. doi: 10.1007/S10528-023-10589-1/METRICS [DOI] [PubMed] [Google Scholar]
- 51.Khan S, Vihinen M. Performance of protein stability predictors. Hum Mutat. 2010;31: 675–684. doi: 10.1002/humu.21242 [DOI] [PubMed] [Google Scholar]
- 52.Strnad O, Kozlıková B, Šustr V, Sochor J. Real-time visualization and exploration of protein empty space with varying parameters. Int J Adv Life Sci. 2013;5. [Google Scholar]
- 53.Mir MA. Introduction to Costimulation and Costimulatory Molecules. Developing Costimulatory Molecules for Immunotherapy of Diseases. 2015; 1–43. doi: [DOI] [Google Scholar]
- 54.Mir MA. Concept of Reverse Costimulation and Its Role in Diseases. Developing Costimulatory Molecules for Immunotherapy of Diseases. Elsevier; 2015. pp. 45–81. doi: [DOI] [Google Scholar]
- 55.Islam MN, Rabby MG, Hossen MM, Kamal MM, Zahid MA, Syduzzaman M, et al. In silico functional and pathway analysis of risk genes and SNPs for type 2 diabetes in Asian population. PLoS One. 2022;17: e0268826. doi: 10.1371/journal.pone.0268826 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Alam R, Rahman Imon R, Enamul M, Talukder K, Akhter S, Hossain MA, et al. GC-MS analysis of phytoconstituents from Ruellia prostrata and Senna tora and identification of potential anti-viral activity against SARS-CoV-2 †. 2021. [cited 21 Mar 2024]. doi: 10.1039/d1ra06842c [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Yang J, Liu J, Chen Y, Tang W, Liu C, Sun Y, et al. Association of CTLA-4 tagging polymorphisms and haplotypes with hepatocellular carcinoma risk: A case-control study. Medicine. 2019;98. doi: 10.1097/MD.0000000000016266 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.El-Said HH, Ghanayem NM, Badr EA, El-Fert AY, Gaballah AK. Cytotoxic T-lymphocyte antigen-4 gene polymorphisms in hepatocellular carcinoma patients in Egypt. Menoufia Medical Journal. 2014;27: 372. doi: 10.4103/1110-2098.141711 [DOI] [Google Scholar]
- 59.Shabbir M, Badshah Y, Khan K, Trembley JH, Rizwan A, Faraz F, et al. Association of CTLA-4 and IL-4 polymorphisms in viral induced liver cancer. BMC Cancer. 2022;22. doi: 10.1186/S12885-022-09633-X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Haibing X, Xu C, Jifu C, Wenshuang Z, Ling L, Yuzhen C, et al. Correlation between CTLA-4 gene rs221775A>G single nucleotide polymorphism and multiple sclerosis susceptibility. A meta-analysis. Open Medicine (Poland). 2016;11: 264–269. doi: 10.1515/MED-2016-0052/PDF [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Goske M, Ramachander VRV, Komaravalli PL, Rahman PF, Rao C, Jahan P. CTLA-4 Genetic Variants (rs11571317 and rs3087243): Role in Susceptibility and Progression of Breast Cancer. World J Oncol. 2017;8: 162. doi: 10.14740/wjon1046w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Tu Y, Fan G, Dai Y, Zeng T, Xiao F, Chen L, et al. Association between rs3087243 and rs231775 polymorphism within the cytotoxic T-lymphocyte antigen 4 gene and Graves’ disease: a case/control study combined with meta-analyses. Oncotarget. 2017;8: 110614–110624. doi: 10.18632/oncotarget.22702 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Irfan M, Iqbal T, Hashmi S, Ghani U, Bhatti A. Insilico prediction and functional analysis of nonsynonymous SNPs in human CTLA4 gene. Scientific Reports 2022 12:1. 2022;12: 1–11. doi: 10.1038/s41598-022-24699-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Bouqbis L, Izaabel H, Akhayat O, Pérez-Lezaun A, Calafell F, Bertranpetit J, et al. Association of the CTLA4 promoter region (−1661G allele) with type 1 diabetes in the South Moroccan population. Genes & Immunity 2003 4:2. 2003;4: 132–137. doi: 10.1038/sj.gene.6363933 [DOI] [PubMed] [Google Scholar]








