Abstract
Background
Although the process of reclassification of a variant of uncertain significance can be complex, they are commonly detected through molecular testing. It often takes years before enough clinical data are acquired, and it can be costly and time‐consuming to perform functional analysis of a single variant. It is important that other tools are developed to aid in clarifying how a specific genetic variant impacts a protein's function, and ultimately the health of the patient.
Methods
Two more newly characterized, suspected pathogenic variants in NBN and PTEN were analyzed through personalized protein modeling. Comparisons between the wild‐type and altered protein were studied using simulations, genomic exome analysis, and clinic study.
Results
Modeling of the new NBN and PTEN protein structures suggested loss of essential domains important for normal enzymatic function for these personalized genomic examples which matched the clinical findings.
Conclusion
The defects detected through modeling were consistent with the expected clinical effect. Personalized protein modeling is another tool for determination of correct variant classification, which can become further useful through construction of deposition archive.
Keywords: computational screening, hereditary cancer, protein modeling, variant classification
1. INTRODUCTION
Of the numerous individuals who complete clinical genetic testing, a significant portion will receive an inconclusive result (Altschul et al., 1997). Variants of uncertain significance are genetic alterations with an unclear impact on gene and downstream protein function. One‐third of individuals tested with a 25 gene hereditary cancer panel will receive at least one Variant(s) of Unknown Significance (VUS) (Altschul et al., 1997), and the number of uncertain results discovered generally increases with number of genes tested (Amendola et al., 2016). It is not always clear though whether a specific variant should be classified as (likely) pathogenic, uncertain significance or (likely) benign. Guidelines have been put forward by the American College of Medical Genetics and Genomics and the Association for Molecular Pathology to aid in the classification of variants (Caulfield, 2011), but there can still be disagreement amongst laboratories (Caulfield & Devkota, 2012). It often takes a long period of time for enough clinical data to accumulate to allow for confident reclassification of a VUS to pathogenic or benign (Caulfield, Devkota, & Rollins, 2011; Caulfield & Medina‐Franco, 2011), and functional studies can be costly and time consuming. Computational and predictive data, like protein modeling, can aid geneticists in this task. Personalized molecular modeling and structure‐based analytics from comparison of VUS and wild‐type represent an expanding toolkit at the physicians practice.
2. CASE PRESENTATION
The first variant, previously discussed in a case report (Garcia, 2014), presents as a frameshift variant of NBN (OMIM No. 602667), c.698_701delAACA, which was classified by the original testing laboratory as pathogenic. The patient had a history of contralateral breast cancers, with a diagnosis of left breast ductal carcinoma in situ at age 62 and right breast invasive ductal carcinoma at age 64. Her medical history was also significant for melanoma on the lateral corner of the left eyelid, melanoma on the left posterior calf at age 40 and squamous cell carcinoma of the skin at age 65. Her family history included her mother with breast cancer at age 69, a brother with renal cancer at age 59, a sister with melanoma in her 50's and a maternal half‐brother with small bowel lymphoma. While no heterozygous individuals had been reported previously, this NBN variant had been seen previously in homozygous individuals with Nijmegen breakage syndrome 1 (Gass et al., 2017).
The second genetic change studied was a pathogenic deletion of exon 8 in the PTEN gene (OMIM No. 601728). The patient presented with a recent diagnosis of endometrioid cancer of the ovaries with synchronous endometrial intraepithelial neoplasia and an endocervical adenomcarcinoma at age 37. Her past medical history included a partial thyroidectomy for premalignant lesion at age 15, a thyroidectomy for premalignant lesion at age 34, and a premalignant melanoma at age 31. She also had a history of multiple colon polyps, pathology unavailable, discovered around age 22 and 35, as well as, a ganglioneuroma that was discovered at her most recent colonoscopy at age 37. She was adopted, and her family history was unknown. Genetic testing identified a deletion of exon 8 in PTEN which was originally classified as a VUS and then upgraded to pathogenic by the testing laboratory some weeks later. Physical examination identified macrocephaly, consistent with PTEN‐related disease. She was referred to dermatology and found to have a sclerotic fibroma/storiform collagenoma. Her son subsequently tested positive for the deletion. He had no history of cancers or tumors, but did have a history of macrocephaly, premature birth at 29 weeks and early developmental delay that had largely resolved. As this PTEN variant and the pathogenic NBN variant were less well characterized with regards to hereditary cancer syndromes, they were good examples to test with protein modeling.
3. MATERIALS AND METHODS
Computer assisted modeling was completed using a sequence of human protein nibrin (NBN) taken from the NCBI Reference Accession Sequence: NP_002476: version NP_002476.2 (GenBank: AF069291.1), which is encoded for a 754 amino acid sequence. The required PTEN sequence was also taken from the NCBI Reference Accession Sequence: NM_000314: version NM_000314.6 (GenBank: AF067844.1), which encodes a 403 amino acid sequence.
Monte Carlo (MC) simulations were performed on the altered proteins to observe local and regional changes for 754 amino acids for the NBN protein and 403 amino acids for the PTEN protein. For the NBN protein, frameshift begins at K233, and subsequently leads to a deletion construct, such that we have S233, I234, R235, N236 and then deletion of the remaining 518 residues dubbed “K233SIRN(X)”. The p.del(D268‐K342) variant was introduced to the PTEN protein. The X‐ray refinement for Monte Carlo was built using the YASARA SSP/PSSM method (Hooft, Sander, Scharf, & Vriend, 1996; Hooft, Vriend, Sander, & Abola, 1996; Humphrey, Dalke, & Schulten, 1996; King & Sternberg, 1996; Krieger et al., 2009; Laskowski, MacArthur, Moss, & Thornton, 1993). Relaxation of the structure was completed via the YASARA/Amber force field using YASARA knowledge‐based refinement protocol (Lopez‐Vallejo et al., 2011). Schrodinger's LC‐MOD Monte Carlo‐based module and minimization finalized the models, generating initial model and mutant protein (Hooft, Sander, Scharf, & Vriend, 1996; Hooft, Vriend, Sander, & Abola, 1996; Humphrey et al., 1996; Krieger et al., 2009; Murray, 2011). Conformational sampling was done through Monte Carlo dynamics searching (LCMOD‐MC) (Qiu & Elber, 2006; Rehm et al., 2013; Reumers et al., 2005; Richards et al., 2015). The systems then proceeded to the MC search criteria (Qiu & Elber, 2006; Rehm et al., 2013; Reumers et al., 2005; Richards et al., 2015). MC helped study any VUS and possible effects on DNA binding or processing.
The full‐length structure for the wild‐type (WT) NBN protein was 754 amino acids (11,919 atoms) and 236 amino acids (3,710 atoms) for the K233SIRN(X) variant. For the PTEN protein, dimeric complexes of 161 amino acids (5,017 atoms) of the full‐length wild‐type model and 91 amino acids (2,970 atoms) for the p.del(D268‐K342) variant were reviewed.
4. RESULTS
Both of these proteins, NBN and PTEN, form internal domain interactions that are important to function, and the structural modeling drawn from X‐ray structural datasets demonstrated a strong effect from the respective variants.
The wild‐type was compared to the NBN truncation variant K233SIRN(X), and the stability of the object from energetic calculations for ΔG per amino acid is lower for the K233SIRN(X) variant. WT and K233SIRN(X) were 580.98 and 146.79 kcal/mol*Å2 respectively (Caulfield, 2011; Caulfield & Devkota, 2012; Caulfield & Medina‐Franco, 2011; Lopez‐Vallejo et al., 2011; Reumers et al., 2005; Schymkowitz et al., 2005).24 Per amino acid, they measured 0.77 kcal/mol*Å2 for the WT and 0.62 kcal/mol*Å2 for K233SIRN(X). Interestingly, the K233SIRN(X) protein fragment had better total object stability when not examined at a per amino acid distribution of energies. Nibrin is involved in DNA damage repair and aids with histone‐related protein recruitment; better object stability may make the K233SIRN(X) variant better for N‐terminus binding over the WT protein. However, the huge deletion of 69.1% of the NBN protein for K233SIRN(X) results in massive defect in the protein's function, which eliminates binding partners. The actual residue mutation, p.K233S, induces a +0.28808 kcal/mol*Å2 change in free energy, which is disruptive to the local region. However, given the additional missense mutations from I234‐N236, functional impact is much greater. The molecular model for the full structure and its deletion form are given (Figure 1b–d) (Caulfield, 2011; Caulfield & Devkota, 2012; Caulfield & Medina‐Franco, 2011; Caulfield et al., 2011; Lopez‐Vallejo et al., 2011).24‐31
Examination of the domain regions lost due to the NBN truncation consists of residues within the SP binding region (residues E111‐H238) and mTOR, MAPKAP1, RICTOR binding sites (residues I221‐T402) (Figure 1a). Nuclear localization signal is lost (P461‐E467) and the EEXXXDDL motif is gone (E736‐L743) (Figure 1a). The electrostatic distribution map (ESM) and portions for the deleted regions were also determined (Supporting information Figure S1).
For WT versus the PTEN deletion p.del(D268‐K342), the stability of the object from energetic calculations for ΔG per amino acid to is higher for the deletion. WT measured 164.68 kcal/mol*Å2, and p.del(D268‐K342) measured 214.53 kcal/mol*Å2 (Qiu & Elber, 2006; Rehm et al., 2013; Reumers et al., 2005; Schymkowitz et al., 2005; Tung et al., 2016; Varon et al., 1998). This object stability does indicate the new structure is deleterious to function based on visual inspection of domain interactions, but also on the energetics output provided by the foldx algorithm (Varon et al., 1998). However, the large deletion of 18.4% of the protein with p.del(D268‐K342) results in a rather impressive defect in the protein's function due to location at the domain interface (Figure 2a–b) (Qiu & Elber, 2006; Rehm et al., 2013; Reumers et al., 2005; Richards et al., 2015; Schymkowitz et al., 2005; Varon et al., 1998).
Examination of the domain interaction zone in that region of PTEN, which consists of residues E91, N94, I168, P169, S170, R173, Y174, Y177, Y180, Y188, R189, P190, L220, K221, N276, F279, and T286, identifies helix‐helix and helix‐beta sheet interactions (Figure 2a). Specific hydrophobic interactions between the helices involve I168, P169, P190, L220, N276, and F279, and hydrogen bonding (Figure 3a). We also examined the ESM for PTEN (Supporting information Figure S1).
5. DISCUSSION
As it is justifiable to immediately think that frameshift and other truncating variants can cause nonsense mediated decay and no protein product, previous studies have shown that this is not always the case. Tanzarella et al. 2003 in his paper described two NBN variants (835del4 and 900del25), which were translated into a form of Nibrin with smaller molecular weight (Tanzanella, 2003 #2). Using established techniques, the energetics data and modeling reveal conclusive defects in the NBN protein and the PTEN protein (Qiu & Elber, 2006; Rehm et al., 2013; Reumers et al., 2005; Richards et al., 2015; Schymkowitz et al., 2005; Tung et al., 2016; Varon et al., 1998). The models for the NBN structure and PTEN structure show loss of essential domains interactions for proper function. In both cases, the energetic calculations structure show in great detail how the deletion construct has been perturbed from the wild‐type protein in a manner that may put the native protein at a disadvantage when competing for N‐terminus binding partners; thus promoting loss of function or decreased activity. The structural models of these two variants concur with previous clinical data. We are continuing to build a database for all NBN and PTEN VUSs for future use to the clinical community as an online publicly available resource, which will become available after we have added sufficient number of cases. Protein modeling should be considered as another tool in the difficult task of variant classification.
CONFLICT OF INTEREST
The authors have no conflicts of interest to declare.
Supporting information
ACKNOWLEDGMENTS
We thank the Center for Individualized Medicine at Mayo Clinic for funding this research project. We also extend thanks to the National Institutes of Health LRP for funding assistance to T.R.C.
Macklin S, Mohammed A, Jackson J, Hines SL, Atwal PS, Caulfield T. Personalized molecular modeling for pinpointing associations of protein dysfunction and variants associated with hereditary cancer syndromes. Mol Genet Genomic Med. 2018;6:805–810. 10.1002/mgg3.447
REFERENCES
- Altschul, S. F. , Madden, T. L. , Schaffer, A. A. , Zhang, J. , Zhang, Z. , Miller, W. & Lipman, D. J. . (1997). Gapped BLAST and PSI‐BLAST: A new generation of protein database search programs. Nucleic Acids Research, 25, 3389–3402. 10.1093/nar/25.17.3389 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amendola, L. M. , Jarvik, G. P. , Leo, M. C. , McLaughlin, H. M. , Akkari, Y. , Amaral, M. D. , … Rehm, H. L. (2016). Performance of ACMG‐AMP Variant‐ interpretation guidelines among nine laboratories in the clinical sequencing exploratory research consortium. American Journal of Human Genetics, 98, 1067–1076. 10.1016/j.ajhg.2016.03.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caulfield, T. R. (2011). Inter‐ring rotation of apolipoprotein A‐I protein monomers for the double‐belt model using biased molecular dynamics. Journal of Molecular Graphics and Modelling, 29, 1006–1014. 10.1016/j.jmgm.2011.04.005 [DOI] [PubMed] [Google Scholar]
- Caulfield, T. , & Devkota, B. (2012). Motion of transfer RNA from the A/T state into the A‐site using docking and simulations. Proteins, 80, 2489–2500. 10.1002/prot.24131 [DOI] [PubMed] [Google Scholar]
- Caulfield, T. R. , Devkota, B. , & Rollins, G. C. (2011). Examinations of tRNA Range of Motion Using Simulations of Cryo‐EM Microscopy and X‐Ray Data. J Biophys, 2011, 1–11. 10.1155/2011/219515 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caulfield, T. , & Medina‐Franco, J. L. (2011). Molecular dynamics simulations of human DNA methyltransferase 3B with selective inhibitor nanaomycin A. Journal of Structural Biology, 176, 185–191. 10.1016/j.jsb.2011.07.015 [DOI] [PubMed] [Google Scholar]
- Garcia, C. (2014). Comparison of risk management strategies between women testing positive for a BRCA variant of unknown significance and women with known BRCA deleterious mutations. Genetics in Medicine, 16, 896–902. 10.1038/gim.2014.48 [DOI] [PubMed] [Google Scholar]
- Gass, J. , Jackson, J. , Macklin, S. , Blackburn, P. , Hines, S. , & Atwal, P. S. (2017). A case of contralateral breast cancer and skin cancer associated with NBN heterozygous pathogenic variant c.698_701delAACA. Familial Cancer, 16, 551–553. 10.1007/s10689-017-9982-0 [DOI] [PubMed] [Google Scholar]
- Hooft, R. W. , Sander, C. , Scharf, M. , & Vriend, G. (1996a). The PDBFINDER database: A summary of PDB, DSSP and HSSP information with added value. Computer Applications in the Biosciences, 12, 525–529. [DOI] [PubMed] [Google Scholar]
- Hooft, R. W. , Vriend, G. , Sander, C. , & Abola, E. E. (1996b). Errors in protein structures. Nature, 381, 272 10.1038/381272a0 [DOI] [PubMed] [Google Scholar]
- Humphrey, W. , Dalke, A. , & Schulten, K. (1996). VMD: Visual molecular dynamics. Journal of Molecular Graphics, 14(33–8), 27–28. [DOI] [PubMed] [Google Scholar]
- King, R. D. , & Sternberg, M. J. (1996). Identification and application of the concepts important for accurate and reliable protein secondary structure prediction. Protein Science, 5, 2298–2310. 10.1002/pro.5560051116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krieger, E. , Joo, K. , Lee, J. , Lee, J. , Raman, S. , Thompson, J. , … Karplus, K. (2009). Improving physical realism, stereochemistry, and side‐chain accuracy in homology modeling: Four approaches that performed well in CASP8. Proteins, 77, 114–122. 10.1002/prot.22570 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laskowski, R. A. , MacArthur, M. W. , Moss, D. S. , & Thornton, J. M. (1993). Procheck ‐ a Program to Check the Stereochemical Quality of Protein Structures. J Appl Cryst, 26, 283–291. 10.1107/S0021889892009944 [DOI] [Google Scholar]
- Lopez‐Vallejo, F. , Caulfield, T. , Martinez‐Mayorga, K. , Giulianotti, M. A. , Nefzi, A. , Houghten, R. A. & Medina‐Franco, J. L. (2011). Integrating virtual screening and combinatorial chemistry for accelerated drug discovery. Comb Chem High Throughput Screen, 14, 475–487. 10.2174/138620711795767866 [DOI] [PubMed] [Google Scholar]
- Murray, M. L. (2011). Follow up of carriers of BRCA1 and BRCA2 variants of unknown significance: Variant reclassification and surgical decisions. Genetics in Medicine, 13, 998–1005. 10.1097/GIM.0b013e318226fc15 [DOI] [PubMed] [Google Scholar]
- Qiu, J. , & Elber, R. (2006). SSALN: An alignment algorithm using structure‐dependent substitution matrices and gap penalties learned from structurally aligned protein pairs. Proteins, 62, 881–891. [DOI] [PubMed] [Google Scholar]
- Rehm, H. L. , for the Working Group of the American College of Medical Genetics and Genomics Laboratory Quality Assurance Committee , Bale, S. J. , Bayrak‐Toydemir, P. , Berg, J. S. , Brown, K. K. , … Lyon, E. (2013). ACMG clinical laboratory standards for next‐generation sequencing. Genetics in Medicine, 15, 733–747. 10.1038/gim.2013.92 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reumers, J. , Schymkowitz, J. , Ferkinghoff‐Borg, J. , Stricher, F. , Serrano, L. & Rousseau, F. (2005). SNPeffect: A database mapping molecular phenotypic effects of human non‐synonymous coding SNPs. Nucleic Acids Research, 33(Database issue):D527–D532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richards, S. , on behalf of the ACMG Laboratory Quality Assurance Committee , Aziz, N. , Bale, S. , Bick, D. , Das, S. , … Rehm, H. L. (2015). Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine, 17, 405–423. 10.1038/gim.2015.30 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schymkowitz, J. W. , Rousseau, F. , Martins, I. C. , Ferkinghoff‐Borg, J. , Stricher, F. , & Serrano, L. (2005). Prediction of water and metal binding sites and their affinities by using the Fold‐X force field. Proceedings of the National Academy of Sciences of the United States of America, 102, 10147–10152. 10.1073/pnas.0501980102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanzanella, C. , Antoccia, A. , Spadoni, E. , di Masi, A. , Pecile, V. , Demori, E. , … Maraschio, P. (2003). Chromosome instability and nibrin protein variants in NBS heterozygotes. Eur J Hum Genet, 11(4), 297–303. https://doi:10.1038/sj.ejhg.5200962 [DOI] [PubMed] [Google Scholar]
- Tung, N. , Lin, N. U. , Kidd, J. , Allen, B. A. , Singh, N. , Wenstrup, R. J. , … Garber, J. E. (2016). Frequency of germline mutations in 25 cancer susceptibility genes in a sequential series of patients with breast cancer. Journal of Clinical Oncology, 34, 1460–1468. 10.1200/JCO.2015.65.0747 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varon, R. , Vissinga, C. , Platzer, M. , Cerosaletti, K. M. , Chrzanowska, K. H. , Saar, K. , … Reis, A. (1998). Nibrin, a novel DNA double‐ strand break repair protein, is mutated in Nijmegen breakage syndrome. Cell, 93, 467–476. 10.1016/S0092-8674(00)81174-5 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.