Abstract
Somatic activating GNAS mutations cause McCune-Albright syndrome (MAS). Owing to low mutation abundance, mutant-specific enrichment procedures, such as the peptide nucleic acid (PNA) method, are required to detect mutations in peripheral blood. Next generation sequencing (NGS) can analyze millions of PCR amplicons independently, and is therefore expected to detect low-abundance GNAS mutations quantitatively. In the present study, we aimed to develop an NGS-based method to detect low-abundance somatic GNAS mutations. PCR amplicons encompassing exons 8 and 9 of GNAS, in which most activating mutations occur, were sequenced on the MiSeq instrument. As expected, our NGS-based method could sequence the GNAS locus with very high read depth (approximately 100,000) and a low error rate. A serial dilution study using cloned mutant and wildtype DNA samples showed a linear correlation between dilution and measured mutation abundance, indicating that quantification of the mutation is reliable. Using the serially diluted samples, we determined the detection limits of three mutation detection methods (the PNA method, NGS, and combinatory use of PNA and NGS [PNA-NGS]). The lowest detectable mutation abundance was 1% for the PNA method, 0.03% for NGS and 0.01% for PNA-NGS. Finally, we analyzed 16 MAS patient-derived leukocytic DNA samples with the three methods and compared their mutation detection rates. The mutation detection rates of the PNA method, NGS and PNA-NGS in the 16 patient-derived peripheral blood samples were 56%, 63% and 75%, respectively. In conclusion, NGS can detect somatic activating GNAS mutations quantitatively and sensitively from peripheral blood samples. At present, the PNA-NGS method is likely the most sensitive method to detect low-abundance GNAS mutations.
Introduction
The rapid emergence of next generation sequencing (NGS) is revolutionizing medical sciences. NGS now allows clinical investigators to analyze transcriptome, exome and genome from small amounts of DNA/RNA. NGS is also available for ultra-deep sequencing of PCR amplicons, microRNA and microbiomes. NGS-based approaches have brought remarkable advances in a broad range of medical research areas, such as studies of rare Mendelian disorders [1] and surveillance of infectious disease outbreaks [2]. NGS has also provided a wealth of new information for cancer genomics, owing in part to the ultra-deep amplicon sequencing of cancerous and precancerous cells [3]. Because NGS can analyze millions of DNA fragments simultaneously and independently, low abundance mutations of oncogenes have now become readily detectable. However, in contrast to advances in understanding of somatic mutations associated with cancer, knowledge about somatic mutations causing benign congenital disorders remains very limited.
McCune-Albright syndrome (MAS; OMIM #174800) is a rare congenital disorder hallmarked by osseous fibrous dysplasia, café-au-lait skin pigmentation and various forms of endocrine hyperfunction, e.g., peripheral precocious puberty, Cushing syndrome and functional pituitary adenoma [4], [5]. MAS is caused by activating mutations of GNAS, which encodes the stimulatory G-protein alpha subunit [6]. Mutations are exclusively present in the somatic mosaic state, probably because the nonmosaic state leads to early embryonic lethality. Clinical manifestations of MAS are highly variable in all three lesions, presumably due to variability of mutation abundance among affected tissues.
In MAS patients, mutation abundance is generally low in unaffected tissues. Thus, mutations in peripheral blood leukocytes (PBL) cannot be detected by standard PCR-based Sanger sequencing, while mutations in affected lesions (e.g., surgical bone specimens) can. Based on the fact that the vast majority of activating GNAS mutations occur at the Arg201 residue, Candeliere et al. developed a method for selective enrichment of Arg201 GNAS mutations using a series of nested PCR and restriction enzyme digestion steps [7]. Subsequently, a second enrichment method was developed that uses a peptide nucleic acid (PNA) probe, which forms hybrids with wildtype DNA and inhibits its PCR amplification [8]. Mutation detection rates from PBL samples with these two methods are typically around 50% [9]–[11]. Of interest, the mutation detection rate increases up to 90% when a DNA sample derived from the affected lesion is available [10]. This implies that the diagnostic performance of the two methods on PBL samples is still inadequate.
In the present study, we developed a novel NGS-based method that can detect low-abundance GNAS mutations quantitatively and sensitively. We compared diagnostic performance of the NGS-based method with that of the PNA method, by a serial dilution study and a mutation detection study using 16 MAS patient-derived PBL samples.
Materials and Methods
PCR with or without the PNA probe
An overview of the mutation detection methods is shown in Figure 1. All DNA samples used in the study were extracted from PBL with the Gentra Puregene Blood Kit (Qiagen, Hilden, Germany). A partial region of the GNAS locus (chr20:57484398-57484647; hg19), which includes nucleotides 598 to 711 (numbered from the first ATG codon), was PCR-amplified with or without the PNA probe. The PCR mixture (final volume 20 µL) contained 100 ng genomic DNA, 0.25 mM dNTPs, primers (0.25 µM each), and 1 U Herculase II Fusion DNA Polymerase in reaction buffer (Agilent Technologies, Santa Clara, CA), with or without 30 µM PNA probe (Panagene Inc., Daejeon, Korea). The PCR conditions were as follows: initial denaturation at 98°C for 30 s; 35 cycles at 98°C for 10 s (denaturation), 68°C for 60 s (hybridization), 55°C for 30 s (annealing) and 72°C for 30 s (extension); and a final extension at 72°C for 5 min. The sequences of the PNA and primers were as follows: PNA Gly-NH2-CGC TGC CGT GTC-HAc; sense primer 5′-CTA CAC GAC GCT CTT CCG ATC TGT TTC AGG ACC TGC TTC GC-3′; and antisense primer 5′-GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC TCA CAG CAT CCT ACC GTT GAA-3′ (adaptor sequences used in the Illumina platform are underlined). The PCR products were purified with the Agencourt AMPure XP Bead system (Beckman Coulter Genomics, Essex, UK), and were subjected to both Sanger sequencing and NGS.
Mutation detection by Sanger sequencing and NGS
For Sanger sequencing, we used the BigDye Dideoxy Sequence Kit (Life Technologies, Carlsbad, CA) and the ABI3130xl sequencer (Life Technologies). Presence/absence of mutations was judged based on visual inspection of each sequence chromatogram.
For NGS, we performed 15 cycles of a second PCR using diluted first PCR products to add the attachment sites (P5 and P7) and the index sequence, which are used in Illumina multiplexed sequencing. The PCR mixture (final volume 20 µL) contained 1 µL purified first PCR product (diluted 1:20 with pure water), 0.25 mM dNTPs, primers (0.25 µM each), and 1 U Herculase II Fusion DNA Polymerase in reaction buffer. The PCR conditions were as follows: initial denaturation at 98°C for 30 s; 15 cycles at 98°C for 10 s, 55°C for 20 s and 72°C for 30 s; and a final extension at 72°C for 5 min. The sequences of the primers were as follows: sense primer 5′-AAT GAT ACG GCG ACC ACC GAG ATC TAC ACT CTT TCC CTA CAC GAC GCT CTT CCG ATC T-3′; and antisense primer 5′-CAA GCA GAA GAC GGC ATA CGA GAT NNN NNN GTG ACT GGA GTT CAG ACG TGT-3′ (P7 and P5 attachment sites are underlined; NNN NNN in the antisense primer denotes the index-specific sequence). The second PCR products were purified with the AMPure system and quantified with the Qubit dsDNA HS Assay Kit (Life Technologies). In each NGS run, 16 samples were multiplexed per pool. To improve base call accuracy, an equimolar quantity of the PhiX control (Illumina, San Diego, CA) was added to the pool.
Pooled samples were paired-end sequenced on the MiSeq instrument with at least 30 cycles and an index read. Base calling, read filtering and demultiplexing were performed according to the standard Illumina processing pipeline. Sequence reads were mapped to the GNAS genomic sequence with Bowtie [12]. We used SAMtools to calculate read depth and nucleotide frequencies for each position of the amplicons [13]. The filtering threshold was set to Q35, which corresponds to a probability of an incorrect base call of about 1 in 3160. For each experiment, three control PBL DNA samples were analyzed to define the experiment-specific reference upper limit of the variant call (a z-score of 2.5 or more was defined as positive). All DNA samples were amplified and sequenced twice.
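The two numerical rules in this paragraph (the Phred quality threshold and the z-score cutoff against control samples) can be illustrated with a minimal Python sketch. It is not the pipeline used in the study: the function names and the control frequencies are hypothetical, and the use of the sample standard deviation of the three controls is an assumption, since the text does not state how the z-score was computed.

```python
import statistics
from typing import Sequence


def phred_to_error_probability(q: float) -> float:
    """Convert a Phred quality score Q to an error probability, 10^(-Q/10)."""
    return 10 ** (-q / 10)


def z_score(sample_freq: float, control_freqs: Sequence[float]) -> float:
    """z-score of a sample's variant frequency relative to control samples.

    Assumption: the sample standard deviation (n - 1 denominator) of the
    controls is used; the source does not specify this.
    """
    mean = statistics.mean(control_freqs)
    sd = statistics.stdev(control_freqs)
    return (sample_freq - mean) / sd


def is_positive(sample_freq: float, control_freqs: Sequence[float],
                threshold: float = 2.5) -> bool:
    """Call a variant positive when its z-score is at least the threshold."""
    return z_score(sample_freq, control_freqs) >= threshold


# Q35 corresponds to roughly one incorrect call per 3162 bases,
# which the text rounds to 1 in 3160.
error_probability = phred_to_error_probability(35)
bases_per_error = round(1 / error_probability)

# Hypothetical variant frequencies (%) at one position in three controls.
controls = [0.010, 0.012, 0.008]
high_sample_call = is_positive(0.030, controls)
low_sample_call = is_positive(0.012, controls)
```

With these illustrative controls, a sample at 0.030% lies well above the cutoff, whereas a sample at 0.012% falls within control-level noise.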
Serial dilution of cloned mutant DNAs
A first PCR product generated from a patient carrying the R201H mutation was cloned into the pCR2.1-TOPO vector (Life Technologies). We prepared wildtype and mutant DNA (1 ng/µL each) by diluting sequence-verified plasmids. Then, we diluted cloned mutant DNA into cloned wildtype DNA to 1/10 [relative mutation abundance (RMA), 10%], 1/100 (1%), 1/333 (0.3%), 1/1,000 (0.1%), 1/3,333 (0.03%) and 1/10,000 (0.01%). Serially diluted DNA samples were subjected to the sequence analyses described above.
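The arithmetic behind the dilution series can be sketched as follows. This is an illustrative calculation only: it assumes that each dilution factor is the fraction of mutant DNA in the total mixture (as the RMA values in the text imply) and that a read depth of 100,000 is representative; the function names are hypothetical.

```python
# Dilution factors used in the serial dilution study (mutant : total).
DILUTION_FACTORS = [10, 100, 333, 1_000, 3_333, 10_000]


def expected_rma_percent(dilution_factor: int) -> float:
    """Expected relative mutation abundance (%) for a 1/dilution_factor mix."""
    return 100.0 / dilution_factor


def expected_mutant_reads(rma_percent: float, depth: int = 100_000) -> float:
    """Expected number of mutant reads at a given read depth.

    Assumption: a depth of 100,000 reads per sample.
    """
    return depth * rma_percent / 100.0


for factor in DILUTION_FACTORS:
    rma = expected_rma_percent(factor)
    reads = expected_mutant_reads(rma)
    print(f"1/{factor:>6,}: RMA = {rma:.3f}%, about {reads:.0f} mutant reads")
```

Under these assumptions the lowest dilution (0.01%) corresponds to only about ten mutant reads per 100,000, which shows why both high read depth and a low per-base error rate matter at this end of the series.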
Clinical samples
In a comparative mutation detection study, 16 PBL genome samples derived from MAS patients (6 boys and 10 girls) were used. The 16 patients had the classic form of MAS with two or three features of the triad (osseous fibrous dysplasia, café-au-lait skin pigmentation and endocrine hyperfunction) (Table 1). Fibrous dysplasia, café-au-lait skin pigmentation and endocrine hyperfunction were observed in 14 (88%), 12 (75%) and 14 (88%) patients, respectively. Eight subjects (50%) had all three features. Observed endocrine hyperfunction included peripheral precocious puberty (N = 9), functional thyroid adenoma (N = 3), functional pituitary adenoma (N = 2) and Cushing syndrome (N = 2) (Table 1).
Table 1. Characteristics of the study subjects.
ID | Sex | FD | Skin lesion | Endocrine hyperfunction | Relative mutation abundance (%) | PNA | NGS | PNA-NGS |
1 | F | Present | Present | Peripheral PP | 12.4 | R201C | R201C | R201C |
2 | M | Present | Present | Functional pituitary adenoma*, Functional thyroid adenoma | 4.2 | R201C | R201C | R201C |
3 | M | Present | Absent | Cushing syndrome | 3.4 | R201H | R201H | R201H |
4 | F | Absent | Present | Peripheral PP | 2.9 | N.D. | R201C | R201C |
5 | F | Present | Present | Peripheral PP | 1.4 | R201H | R201H | R201H |
6 | M | Present | Present | Absent | 0.81 | R201H | R201H | R201H |
7 | M | Present | Present | Cushing syndrome | 0.67 | R201H | R201H | R201H |
8 | F | Present | Present | Peripheral PP, Functional thyroid adenoma | 0.55 | R201H | R201H | R201H |
9 | M | Present | Present | Functional thyroid adenoma | 0.28 | R201H | R201H | R201H |
10 | M | Present | Absent | Pituitary adenoma** | 0.26 | R201C | R201C | R201C |
11 | F | Present | Present | Peripheral PP | <0.03 | N.D. | N.D. | R201C |
12 | F | Present | Present | Peripheral PP | <0.03 | N.D. | N.D. | R201H |
13 | F | Absent | Present | Peripheral PP, Functional thyroid adenoma | <0.03 | N.D. | N.D. | N.D. |
14 | F | Present | Absent | Peripheral PP | <0.03 | N.D. | N.D. | N.D. |
15 | F | Present | Absent | Peripheral PP | <0.03 | N.D. | N.D. | N.D. |
16 | F | Present | Present | Absent | <0.03 | N.D. | N.D. | N.D. |
Abbreviations: FD, osseous fibrous dysplasia; MAS, McCune-Albright syndrome; N.D., not detected; NGS, next generation sequencing; PNA, the peptide nucleic acid method; PNA-NGS, combinatory use of PNA and NGS; PP, precocious puberty
*Hyperprolactinemia and GH-producing adenoma
**GH-producing adenoma
Ethics statement
The study was approved by the Institutional Review Boards of Asahikawa Medical University and Keio University School of Medicine. Written informed consent for molecular studies was obtained from the subjects or their parents.
Results
GNAS amplicon sequencing by NGS
We designed chimeric primer pairs, containing both locus-specific and adapter sequences, to generate PCR amplicons that can be sequenced directly on the Illumina MiSeq platform. The amplicon covers two known sites of activating GNAS mutations (i.e., Arg201 and Gln227 [14]), and would therefore be expected to capture most mutations causing MAS. This experimental design allowed us to generate very high read depth per sample (approximately 100,000) with a low error rate (mean±SD, 0.011±0.005%) (data not shown).
Quantitative detection of a GNAS mutation
To test the ability of the NGS-based mutation detection to provide quantitative data, we conducted a serial dilution study using cloned plasmid DNA samples (wildtype or R201H). We serially diluted mutant DNA into wildtype DNA, and measured mutant abundance with NGS. As expected, a linear correlation between true mutant abundance and measured relative mutation abundance (NGS-measured RMA; defined as the proportion of sequence reads containing the mutation) was observed down to 0.01% (Figure 2), indicating reliable quantification.
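The definition of NGS-measured RMA and the linearity check can be sketched in Python as below. The mutant read counts are hypothetical values chosen for illustration, not data from Figure 2, and assessing linearity with a Pearson correlation on log10-transformed values is an assumption about how such a check could be done, not a description of the authors' analysis.

```python
import math
from typing import Sequence


def rma_percent(mutant_reads: int, total_reads: int) -> float:
    """NGS-measured RMA: percentage of reads that carry the mutation."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return 100.0 * mutant_reads / total_reads


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Expected RMA (%) of the dilution series described in Materials and Methods.
expected = [10.0, 1.0, 0.3, 0.1, 0.03, 0.01]

# Hypothetical mutant read counts at a depth of 100,000 reads per sample.
hypothetical_mutant_reads = [9800, 1050, 290, 110, 28, 12]
measured = [rma_percent(m, 100_000) for m in hypothetical_mutant_reads]

# Correlate on a log10 scale because the series spans three orders of magnitude.
r = pearson_r([math.log10(e) for e in expected],
              [math.log10(m) for m in measured])
print(f"Pearson r (log10 scale) = {r:.4f}")
```

A log scale is used here so that the highest dilution points are not dominated by the 10% sample; on a linear scale the low-abundance points would contribute almost nothing to the fit.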
Mutation abundance in PBL of MAS patients
To define the distribution of NGS-measured RMA in PBL among MAS patients, we analyzed 16 patient-derived PBL samples. Ten of the 16 genomes had a GNAS mutation (R201H, N = 6; R201C, N = 4) with an NGS-measured RMA above 0.03% (Table 1). NGS-measured RMA ranged from 0.3% to 12.4% (median, 1.1%), which was consistent with the fact that we could not detect those mutations by conventional Sanger sequencing without the PNA probe (data not shown). It is also consistent with the previous observation that conventional Sanger sequencing cannot detect GNAS mutations in most MAS patient-derived PBL samples [7]. The distributions of NGS-measured RMA were similar between the eight patients having all three MAS features and the remaining eight with two features (P = 0.5, Wilcoxon rank sum test).
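The summary statistics quoted above can be recomputed from the ten quantified RMA values listed in Table 1. The short Python sketch below does only that; it does not attempt the Wilcoxon rank sum test, because the six samples reported as <0.03% have no exact values to rank.

```python
import statistics

# NGS-measured RMA (%) for the ten mutation-positive patients in Table 1,
# keyed by patient ID.
RMA_BY_PATIENT = {
    1: 12.4, 2: 4.2, 3: 3.4, 4: 2.9, 5: 1.4,
    6: 0.81, 7: 0.67, 8: 0.55, 9: 0.28, 10: 0.26,
}

values = sorted(RMA_BY_PATIENT.values())
rma_min = values[0]
rma_max = values[-1]
rma_median = statistics.median(values)

print(f"range: {rma_min}% to {rma_max}%, median: {rma_median:.2f}%")
```

The unrounded minimum is 0.26% and the median is about 1.1%, in line with the rounded figures given in the text.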
Comparison of mutation detection methods
Finally, we compared the diagnostic performance of three mutation detection methods: the PNA method, NGS, and combinatory use of PNA and NGS (PNA-NGS). We assessed the detection limits using the serially diluted samples, and found that NGS could detect the mutation down to 0.03%, while the PNA method alone could detect it down to 1% (Figure 2). The PNA-NGS method had the lowest detection limit, which was 0.01% (Figure 2). We also performed a comparative mutation detection study using the 16 patient-derived PBL genome samples. The PNA method identified mutations in nine of the 16 patients (Table 1). All of these mutations were also detected by NGS and PNA-NGS. Among the seven patients with a negative result by the PNA method, NGS detected one mutation carrier, and PNA-NGS revealed two further mutation carriers (Table 1). Collectively, the mutation detection rates of PNA, NGS and PNA-NGS were 56%, 63% and 75%, respectively.
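The detection rates and the fold change in detection limit follow directly from Table 1 and the limits stated above. The Python sketch below tabulates them; the data structure and function name are illustrative choices, with `None` standing for a result of N.D.

```python
from typing import Dict, Optional, Tuple

METHODS = ("PNA", "NGS", "PNA-NGS")
Calls = Tuple[Optional[str], Optional[str], Optional[str]]

# Per-patient calls from Table 1, in the order (PNA, NGS, PNA-NGS).
TABLE_1_CALLS: Dict[int, Calls] = {
    1: ("R201C", "R201C", "R201C"),
    2: ("R201C", "R201C", "R201C"),
    3: ("R201H", "R201H", "R201H"),
    4: (None, "R201C", "R201C"),
    5: ("R201H", "R201H", "R201H"),
    6: ("R201H", "R201H", "R201H"),
    7: ("R201H", "R201H", "R201H"),
    8: ("R201H", "R201H", "R201H"),
    9: ("R201H", "R201H", "R201H"),
    10: ("R201C", "R201C", "R201C"),
    11: (None, None, "R201C"),
    12: (None, None, "R201H"),
    13: (None, None, None),
    14: (None, None, None),
    15: (None, None, None),
    16: (None, None, None),
}

# Detection limits (% RMA) from the serial dilution study.
DETECTION_LIMIT_PERCENT = {"PNA": 1.0, "NGS": 0.03, "PNA-NGS": 0.01}


def detection_rates(calls: Dict[int, Calls]) -> Dict[str, float]:
    """Percentage of patients with a detected mutation, per method."""
    n_patients = len(calls)
    rates = {}
    for index, method in enumerate(METHODS):
        detected = sum(1 for row in calls.values() if row[index] is not None)
        rates[method] = 100.0 * detected / n_patients
    return rates


rates = detection_rates(TABLE_1_CALLS)
fold_improvement = (DETECTION_LIMIT_PERCENT["PNA"]
                    / DETECTION_LIMIT_PERCENT["PNA-NGS"])

for method in METHODS:
    print(f"{method}: {rates[method]:.2f}%")
print(f"PNA-NGS vs PNA detection limit: {fold_improvement:.0f}-fold lower")
```

The unrounded rates are 56.25%, 62.5% and 75%, corresponding to 9, 10 and 12 of 16 patients.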
Discussion
Detecting low-abundance somatic mutations with next generation amplicon sequencing is becoming a robust analytic tool in cancer genomics. In the present study, we demonstrate that this approach is also effective in diagnosis of a benign disorder due to low-abundance somatic mutations, as shown in megalencephaly syndromes very recently [15].
The quantitative nature of NGS allowed us to investigate the distribution of RMA in patient-derived PBL samples. We showed for the first time that NGS-measured RMA in PBL is strikingly variable among MAS patients. NGS-measured RMA in PBL did not correlate with disease severity, as defined by the number of clinical features, suggesting that RMA in PBL and RMA in affected lesions are not correlated. Similar results have been observed in syndromes due to activation of AKT signaling (Proteus syndrome [16] and megalencephaly syndromes [15]), and this may therefore be a general feature of congenital syndromes due to somatic activating mutations.
We verified the diagnostic performance of three mutation detection methods (PNA, NGS and PNA-NGS) using a serial dilution study and a comparative mutation detection study. In both studies, the PNA-NGS method appeared to be the most sensitive. The combination of NGS and PNA resulted in a 100-fold decrease in the assay detection limit as compared with the PNA method alone (from 1% to 0.01%). This improvement should contribute to more accurate molecular diagnosis of MAS.
In the present study, 56% of MAS patients were positive for a GNAS mutation by the PNA method. Similar mutation detection rates with the nested PCR method or the PNA method have been reported [9]–[11]. Of clinical importance, we could detect GNAS mutations by the PNA-NGS method in three of the seven 'PNA-negative' MAS patients. This is probably due not only to the ultra-deep sequencing of NGS, but also to the nature of the mutation detection methods (qualitative vs quantitative), because a patient with relatively high RMA (Patient 4 in Table 1) was missed by PNA. Considering that the mutation detection rate is more than 90% when affected tissue is available [10], we believe that the improvement of mutation detection by the PNA-NGS approach is due to an increase in true positives, although we cannot discriminate true positives from false positives in the present study. Future studies using paired PBL and affected tissue samples that have discordant test results (e.g., negative in PBL but positive in affected tissue) will clarify the true diagnostic performance of the PNA-NGS approach.
In summary, we successfully developed an NGS-based mutation detection method for MAS, allowing quantitative and sensitive molecular diagnosis. The PNA-NGS method achieved a 100-fold decrease in the assay detection limit as compared with the PNA method. Our study exemplifies the utility of NGS-based approaches to diagnose congenital disorders due to low-abundance somatic mutations from peripheral blood.
Acknowledgments
We would like to thank the following physicians for sending us clinical samples of MAS patients: Hamajima T (Aichi Children's Health and Medical Center), Hori N (Sanokousei General Hospital), Kamasaki H (Sapporo Medical University School of Medicine), Kamimaki T (Shizuoka City Shimizu Hospital), Kawada Y (Kyushu Rousai Hospital), Motomura K (Nagasaki University), Mukai T (Asahikawa-Kosei General Hospital), Naiki Y (National Center for Child Health and Development), Okada S (Hiroshima University Graduate School of Biomedical Sciences), Suwa T (Gifu University), Tajima T (Hokkaido University School of Medicine), and Tokuhiro E (Odawara Municipal Hospital). We also thank Prof. Takao Takahashi for fruitful discussion.
Funding Statement
This work was supported by the Grant-in-Aid for Young Scientists (B) (24791087) from the Japan Society for the Promotion of Science, and the Health Science Research Grant for Research on Applying Health Technology [Jitsuyoka (Nanbyo)-Ippan-014] from the Ministry of Health, Labour and Welfare, Japan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
1. Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, et al. (2009) Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461: 272–276.
2. Mellmann A, Harmsen D, Cummings CA, Zentz EB, Leopold SR, et al. (2011) Prospective genomic characterization of the German enterohemorrhagic Escherichia coli O104:H4 outbreak by rapid next generation sequencing technology. PLoS One 6: e22751.
3. Thomas RK, Nickerson E, Simons JF, Janne PA, Tengs T, et al. (2006) Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing. Nat Med 12: 852–855.
4. McCune DJ (1936) Osteitis fibrosa cystica; the case of a nine year old girl who also exhibits precocious puberty, multiple pigmentation of the skin and hyperthyroidism. Am J Dis Child 52: 743–744.
5. Albright F, Butler AM, Hampton AO, Smith P (1937) Syndrome characterized by osteitis fibrosa disseminata, areas of pigmentation and endocrine dysfunction, with precocious puberty in females: report of five cases. N Engl J Med 216: 727–746.
6. Weinstein LS, Shenker A, Gejman PV, Merino MJ, Friedman E, et al. (1991) Activating mutations of the stimulatory G protein in the McCune-Albright syndrome. N Engl J Med 325: 1688–1695.
7. Candeliere GA, Roughley PJ, Glorieux FH (1997) Polymerase chain reaction-based technique for the selective enrichment and analysis of mosaic arg201 mutations in G alpha s from patients with fibrous dysplasia of bone. Bone 21: 201–206.
8. Bianco P, Riminucci M, Majolagbe A, Kuznetsov SA, Collins MT, et al. (2000) Mutations of the GNAS1 gene, stromal cell dysfunction, and osteomalacic changes in non-McCune-Albright fibrous dysplasia of bone. J Bone Miner Res 15: 120–128.
9. Hannon TS, Noonan K, Steinmetz R, Eugster EA, Levine MA, et al. (2003) Is McCune-Albright syndrome overlooked in subjects with fibrous dysplasia of bone? J Pediatr 142: 532–538.
10. Lumbroso S, Paris F, Sultan C, European Collaborative Study (2004) Activating Gsalpha mutations: analysis of 113 patients with signs of McCune-Albright syndrome: a European Collaborative Study. J Clin Endocrinol Metab 89: 2107–2113.
11. Kalfa N, Philibert P, Audran F, Ecochard A, Hannon T, et al. (2006) Searching for somatic mutations in McCune-Albright syndrome: a comparative study of the peptidic nucleic acid versus the nested PCR method based on 148 DNA samples. Eur J Endocrinol 155: 839–843.
12. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25.
13. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics 25: 2078–2079.
14. Lee SE, Lee EH, Park H, Sung JY, Lee HW, et al. (2012) The diagnostic utility of the GNAS mutation in patients with fibrous dysplasia: meta-analysis of 168 sporadic cases. Hum Pathol 43: 1234–1242.
15. Riviere JB, Mirzaa GM, O'Roak BJ, Beddaoui M, Alcantara D, et al. (2012) De novo germline and postzygotic mutations in AKT3, PIK3R2 and PIK3CA cause a spectrum of related megalencephaly syndromes. Nat Genet 44: 934–940.
16. Lindhurst MJ, Sapp JC, Teer JK, Johnston JJ, Finn EM, et al. (2011) A mosaic activating mutation in AKT1 associated with the Proteus syndrome. N Engl J Med 365: 611–619.