Abstract
A new MALDI-TOF based mini-sequencing assay termed VSET was developed for genotyping of SNPs. In this assay, specific fragments of genomic DNA containing the SNP site(s) are first amplified, followed by mini-sequencing in the presence of three ddNTPs and the fourth nucleotide in the deoxy form. In this way, the primer is extended by only one base from one allele, while it is typically extended by two bases from another allele. The products are then analyzed using MALDI-TOF mass spectrometry. The genotype of the SNP site is identified based on the number of nucleotides added. This assay has been examined using both synthetic and genomic DNA samples. In addition, multiplexed assays were successfully performed to genotype four SNP sites in a single tube. The main aspect of this assay is that it can overcome the key problems associated with the currently used mini-sequencing methods. First, it significantly reduces the stringent high-resolution and extensive desalting requirements that are essential to the pinpoint assay. Second, it avoids the long extension problem associated with the PROBE assay.
INTRODUCTION
Currently, there is a critical need to develop high-throughput, low-cost, accurate methods for genotyping of single nucleotide polymorphisms (SNPs) (1). The matrix-assisted-laser-desorption-ionization time-of-flight (MALDI-TOF) mass spectrometric based technique represents a new approach to SNP analysis and several methods based on MALDI-TOF have been proposed (2–25). Among them, the PINPOINT assay is particularly promising due to its potential to identify multiple SNPs in a single tube (2–5). In this assay, a primer is annealed to the targeted DNA immediately upstream of the SNP site and is extended by a single base in the presence of all four ddNTPs. The base at the SNP site is identified by the mass added onto the primer. However, this method is limited by two problems. One problem is resolution, as the mass difference between nucleotides can be as little as 9 Da (the difference between A and T), and this renders it difficult to unambiguously identify the added nucleotide. It was reported that the A/T heterozygote samples were poorly resolved and unassignable due to the small 9 Da mass difference between ddT and ddA (6). The second problem is the salt effect that renders it more difficult to accurately measure the peak masses due to the presence of Na, Mg and K adducts.
Smith and co-workers developed a variation approach to the PINPOINT assay (6,7) in which the mass-tagged dideoxynucleotides were employed in place of the unmodified dideoxynucleotides to overcome the resolution and salt problems. Becker and co-workers used another method to overcome the resolution problem (8,9), in which a cleavable site is added to the primer that can be cleaved after extension to produce low-mass markers that can be identified in a more accurate manner.
PROBE is another mini-sequencing method in which the primer is extended in the presence of three dNTPs and the fourth nucleotide in the dideoxy form (10–15). In this way, the primer is extended by the addition of the ddNTP from one allele, while it is extended through the addition of dNTPs until the ddNTP is incorporated from another allele and the number of dNTPs added depends on the template sequence. This creates several potential problems for high-level multiplex genotyping since the extended products from one of the alleles could be many bases away from its primer. First, the long extension products may overlap with those produced from other SNP sites; second, the long extension products may not be detectable if extension is not efficient; and third, the presence of double charged and dimer ions can be a problem since they may overlap with singly-charged monomer ions.
Herein, we report an alternative mini-sequencing method in which the primer is extended in the presence of three ddNTPs and the fourth nucleotide in the deoxy form. In this way, the primer is extended by one of the ddNTPs from one allele, while it is normally extended by the addition of one dNTP, and then terminated by a ddNTP from another allele. Because this approach tends to produce very short extension products, it is termed as VSET (very short extension). The main aspect of this method is that it can overcome key problems associated with the currently used methods. First, VSET greatly relaxes the stringent high-resolution and extensive desalting requirements that are essential to PINPOINT. Second, VSET avoids the long extension problem associating with PROBE.
MATERIALS AND METHODS
A MALDI-TOF based mini-sequencing approach typically consists of four major steps. They are: (i) extraction of genomic DNA from blood or tissue samples; (ii) PCR amplification of specific fragments of genomic DNA containing the SNP site(s); (iii) mini-sequencing of PCR products; and (iv) analysis of mini-sequencing products using MALDI-TOF, respectively. In addition, PCR products are treated prior to mini-sequencing to remove the unreacted PCR primers and dNTPs, and the mini-sequencing products will be desalted.
Materials
Shrimp alkaline phosphatase, Exonuclease I and ThermoSequenase were purchased from Amersham Pharmacia Biotech (Piscataway, NJ). Taq DNA polymerase was obtained from Promega (Madison, WI). The synthetic DNA templates and primers used in PCR and mini-sequencing were obtained from either Sigma Genosys (Woodlands, TX) or Research Genetics (Huntsville, AL).
Templates and PCR amplification
The model SNP systems used in this work are synthetic DNA containing the variation site at the second base of codon 12 of k-ras. The human genomic DNA sample was used for the genotyping of other SNP sites. PCR was performed using a thermal cycler (PCR-2400, PE Applied Biosystems, Foster City, CA). The 20 µl mixture solutions were prepared with the buffer provided by the manufacturer. Thereafter, the PCR products were treated by alkaline phosphatase and exonuclease I to destroy residual dNTPs and PCR primers. One unit of alkaline phosphatase and exonuclease I were added to the PCR tube, respectively. Then, the tube was incubated at 37°C for 30 min, followed by heating the tube at 80°C for 15 min to deactivate alkaline phosphatase and exonuclease I.
Mini-sequencing and ethanol precipitation desalting
Mini-sequencing was typically performed in 20 µl using the same thermal cycler. The mixture solutions were prepared with the buffer provided by the manufacturer. 5 pmol of primers were used for all mini-sequencing reactions. Thereafter, 2 µl of the ammonium acetate solution (10 M) was added to the mini-sequencing reaction tube. After incubation of several minutes at room temperature, 4–6 volumes of ethanol were added, followed by incubating the tube in a freezer at –20°C for ~30 min. Then, the tube was centrifuged, followed by removing the supernatant solution. The precipitated DNA was washed with 70% cooled ethanol solution. Finally, 5 µl of water was added to re-dissolve the precipitated DNA.
MALDI-TOF analysis
The MALDI sample was prepared by mixing 0.5 µl of the purified mini-sequencing products and 0.5 µl matrix (saturated 3-hydropicolinic acid in a 1:1:2 mixture of water, acetonitrile and 0.1 M ammonium citrate). The sample was dried and then analyzed using MALDI-TOF (Dynamo, Thermo Bioanalysis, Santa Fe, NM). The negative ion mode was used to collect all spectra.
RESULTS AND DISCUSSION
A biallelic SNP marker yields three different genotypes. For example, an A/G marker could produce the genotype of A homozygote, G homozygote and A/G heterozygote, respectively. Figure 1 displays a schematic representation of the VSET method for genotyping of a model SNP (A/G) system. The primer is annealed to the target DNA immediately upstream of the SNP site and is extended in the presence of three ddNTPs (ddATP, ddGTP and ddCTP) and the fourth nucleotide in the deoxy form (dTTP). As shown in Figure 1, the G homozygous sample produces one-base extension products, while the A homozygous sample yields two-base extension products. As a result, the A/G heterozygous sample gives both one- and two-base extension products. The genotype of an individual is identified on the basis of the primer extension pattern.
Figure 1.
A schematic representation of the VSET method for genotyping of a model SNP (A/G) system.
First, we examined this assay with model SNP systems, which were synthetic oligonucleotides resembling the sequence of k-ras gene (GenBank ID: M54968). The SNP site is the second base of codon 12, at which the nucleotide was A, C, G and T, respectively. The mini-sequencing primer is 5′-tcaaggcactcttgcctacgcca-3′, as shown in Figure 1. Heterozygous samples were created by mixing equimolar two oligonucleotides that were different only at the second base of codon 12. Figure 2a–c displays the result of typing three different genotypes arising from A and G targets in the presence of ddATP, ddCTP, ddGTP and dTTP. As expected, the homozygous G target yields one-base extension products by the addition of ddC, while the homozygous A target generated two-base extension products by the addition of dT and ddC. In contrast, the heterozygous A/G target produced both one and two-base extension products.
Figure 2.
MALDI-TOF mass spectra of the results arising from genotyping the model systems. (a) A target; (b) G target; (c) A/G target; (d) C target; (e) T target; (f) C/T target. Note that the minor peaks labeled with an asterisk were due to unterminated products.
Figure 2d–f display the result of typing three different genotypes of the C and T target samples, in the presence of ddTTP, ddGTP, ddATP and dGTP. As expected, the homozygous T samples produced only one-base extension products by the addition of ddA, while the homozygous C sample yielded only two-base extension products by adding dG and ddC. In contrast, both one- and two-base extension products were produced if the heterozygous C/T target samples were used, as seen from Figure 2f. Comparing the expected extension patterns with Figure 2, it was seen that all six different targets were correctly identified.
It should be noted that the minor peaks labeled with an asterisk in Figure 2a and d were due to unterminated products. In other words, the primers were extended by the addition of a dNTP, but they were not completely terminated by a ddNTP. As seen from Figure 2c and f, two extension products yielded from a heterozygote sample would have similar peak intensities and thereby, the genotype (homozygote or heterozygote) of a sample can be easily identified simply by comparing the peak intensity of products. For example, because the intensity of the peak labeled with an asterisk in Figure 2a was significantly weaker than that of another extended products, we can easily, therefore, conclude that the product corresponding to this minor peak was not terminated by a ddNTP and that the sample typed was homozygous.
Next, we examined the feasibility of VSET in genotyping of SNPs with double-stranded DNA amplified from PCR as the DNA templates. Figure 3 displays the result of typing three different genotypes associating with an A/T model system. Single-strand synthetic model templates (either homozygous or heterozygous targets of A/T) were first amplified by PCR, then mini-sequencing was performed in the presence of ddCTP, ddTTP, ddGTP and dATP, respectively. As a result, the A target is expected to produce one-base extension products, while the T target yields two-base extension products. The A/T heterozygous sample produces both one- and two-base extension products. As seen from Figure 3, the primers were clearly extended according to the genotype and three genotypes of A/T were unambiguously identified.
Figure 3.
MALDI-TOF mass spectra of the results arising from genotyping the PCR products. (a) A target; (b) T target; and (c) T/A target.
It should be noted that the primers are typically extended by either one or two bases in VSET. If the template sequence contains a run of the nucleotide that is the same as the base in the SNP site, and it immediately follows the SNP site, the primer may be extended by more than two bases. In this case, several measures can be taken. First, the combination of ddNTPs and dNTP should be alternated and this change will limit extension to either one or two bases. Second, a new primer can be designed to target another strand. Third, the same reaction conditions are utilized and the primer is allowed to extend by three or four bases as long as the extension products will not lead to the overlapping problem.
After the feasibility study, we evaluated the performance of VSET using human genomic DNA samples which were kindly provided by Professor Chakravarti of Case Western Reserve University. A total of five different SNP sites were examined and their ID/SNP sites in GenBank were AJ243297/17274, AJ243297/20297, AJ243297/20547, AJ243297/25187 and AJ243297/29366, respectively. We found that VSET was able to correctly identify the genotype of all the samples we have examined thus far. In addition, we have also successfully used VSET to identify the G1691 → A mutation of the Factor VLeiden gene in 27 people. This part of our study will be the subject of future publication.
For comparison, both PINPOINT and PROBE were also evaluated using these SNP samples. The results of genotyping an individual of A/T heterozygote (AJ243297/29366) using VSET, PINPOINT and PROBE were displayed in Figure 4a, b and c, respectively. A 540 bp DNA fragment containing this SNP site was first amplified with PCR, followed by mini-sequencing. All three mini-sequencing methods utilized the same mini-sequencing primer 5′-taagaaaggttgcatgatt-3′. The partial sequence of the template containing the SNP site is 5′-tcttgagaatgatttttttttaatcatgcaacctttcctta-3′, the base labeled by the bold letter is the SNP site.
Figure 4.
MALDI-TOF mass spectra of the results arising from genotyping an individual of A/T heterozygote using (a) VSET; (b) PINPOINT; and (c) PROBE, respectively.
For VSET, mini sequencing was performed in the presence of ddATP, ddCTP, ddGTP and dTTP. For PINPOINT, mini-sequencing was performed in the presence of all four ddNTPs. For PROBE, mini-sequencing was performed in the presence of dATP, dCTP, dGTP and ddTTP. All other conditions and the DNA sample used were also the same. Two extension products were generated from PINPOINT, where one allele led to one-base extension by adding ddA, while another resulted in one-base extension by adding ddT. It was seen from Figure 4b that these two extension products were poorly resolved and unassignable due to the small mass difference of 9 Da between ddA and ddT. In contrast, the A/T genotype was easily and unambiguously identified based on Figure 4a since two extension products obtained from VSET differs by one base.
In PROBE, the mini-sequencing primers were extended by 1 and 10 bases, respectively, for this A/T heterozygous sample. It was seen from Figure 4c that the peak corresponding to 10-base extension products was barely seen and that its intensity was more than seven times weaker than that corresponding to one-base extension products. The weaker ion intensity for long extension products was also seen in another work (12). In contrast, two extension products obtained from VSET yield comparable ion intensities. The weaker intensity for long extension products can be caused by two factors. First, the extension efficiency may be poorer for longer extension. Second, MALDI-TOF tends to have poorer detection sensitivity for larger oligonucleotides (26). One may argue that an increase of the polymerase concentration would improve extension, but this significantly increases costs and reduces the specificity of mini-sequencing.
Finally, we studied the feasibility of using VSET for multiplex genotyping. Four SNP sites (AJ243297/17247, AJ243297/20297, AJ243297/20547 and AJ243297/25187) were examined. For AJ243297/20297 and AJ243297/25187, the sense primers were designed to target the anti-sense strand templates, while for AJ/243297/19274 and AJ24397/20254, the anti-sense primers were designed to target the sense strand templates. As a result, the targeted bases at all four SNP sites was G/A. In this experiment, the PCR amplification was first performed separately for each sample, then these double-stranded PCR products were pooled together for mini-sequencing. In addition, mini-sequencing was performed in 30 µl.
A combination of ddATP, ddCTP, ddGTP and dTTP was used in mini-sequencing. The sequence of primers and their expected extension products are listed in Table 1. As will be discussed later, the ideal size of the primers for multiplexed genotyping is 15–27 bases in length. Therefore, the four primers used in this test were 15, 20, 24 and 27 bases in length, respectively. The portion of the primer sequence underlined in Table 1 acted as mass tags and the remaining sequences targeted the templates. Figure 5 displays the result of the 4-fold multiplexed VSET assay. It was seen that all primers were extended according to the expected extension pattern and therefore the genotypes of all four SNP sites were unambiguously identified based on Figure 5. The minor peaks labeled with an asterisk in Figure 5 were due to impurities present in the primers provided by the manufacturers. It should be noted that these primers were not purified when they were ordered and that they can be removed using HPLC and PAGE purification.
Table 1. Mini-sequencing primers and their expected extension products in 4-fold multiplexed VSET assays.
Primers | GenBank ID/ | Mass of | Sequence (5′→3′) | Genotype | Expected extension |
---|---|---|---|---|---|
SNP site | primer (Da) | ||||
P1 | AJ243297/19274 | 4772 | AGACTCCCTGCCCCA | A/G | A (dTddG) G (ddC) |
P2 | AJ243297/20297 | 5923 | TTTCACACCCTGACCCACCA | G | G (ddC) |
P3 | AJ243297/20547 | 7124 | CTGTAGGAAAAAATAGCACGAGT | A/G | A (dTddG) G (ddC) |
P4 | AJ243297/25187 | 8500 | TTTTTTTTTTCTGAGCACAGTGGCCCA | A/G | A (dTddG) G (ddC) |
Figure 5.
MALDI-TOF mass spectra of the results arising from genotyping four different SNP sites in a single tube. Note that the peaks labeled with an asterisk were due to the impurities present with the primers.
In this work, we reported an alternative MALDI-TOF based mini-sequencing method for typing SNPs. Compared with PINPOINT, VSET greatly relaxes the stringent mass resolution requirements since the bases at a SNP site are identified based on the number of nucleotides added. As a result, the salt effect on the assignment of extended products is also minimized and therefore, simple and cost-effective desalting methods can be utilized. In this work, mini-sequencing products were exclusively purified using ethanol precipitation. Compared with other methods, ethanol purification costs virtually nothing, therefore significantly reduces the costs associated with DNA purification.
Compared with PROBE, VSET leads to shorter extension. Extension can be terminated by three of the four ddNTPs in VSET, while extension can only be terminated by one ddNTP in PROBE. Therefore, VSET statistically produces much smaller extension products. As shown in Figure 5, long extension could lead to a major problem for multiplex genotyping since the extension efficiency varies significantly with the SNP sites and samples. In other words, when multiple SNP sites were genotyped in a single tube, some of the SNP samples will be well extended, while others may be poorly extended. As a result, long extension products may not be observable if extension is poor and, therefore, a heterozygous genotype could be mistakenly assigned as homozygote. Other problems associated with long extension include overlapping of long extension products with the products from other SNP sites; overlapping of unterminated extension products with other extension products; and overlapping double-charged and dimer ions with singly-charged monomer ions.
Finally, we discuss the capability of VSET for multiplex genotyping of SNPs. The peaks of doubly-charged or dimer ions of a given oligonucleotide should not overlap with the peaks of singly-charged monomer ions of other oligonucleotides if the oligonucleotides are 15–29 bases in length. In addition, extension efficiency and specificity with the primers of these sizes are fairly good. Hence, 15–27 bases may be the ideal lengths for the primers of multiplexed VSET assays. Because each primer and its extension products generally cover three bases, five different sets (15, 18, 21, 24 and 27mers) of oligonucleotides can be used as the primers. The primers of different size can be created with a 3′ portion of the sequence complementary to the target and a 5′ portion which is not complementary to the target (3). The 5′ portion of the sequence serves as mass tag. In addition, three primers can be created from each set of oligonucleotides as long as the masses of the unextended and extended primers are sufficiently different (the average mass difference is ~100 Da). Therefore as many as 15 primers (3 × 5) can be used to genotype 15 SNP sites in a single tube. This throughput is comparable to that achieved with the PINPOINT method (3–5). Currently, we are working to apply VSET to high-level multiplex genotyping of SNPs.
Acknowledgments
ACKNOWLEDGEMENTS
The authors would like to thank Prof. A. Chakravarti of Case Western Reserve University for kindly providing the human genomic DNA samples and helpful discussion during the course of this work. The financial support by HG01437 and HG01815 is acknowledged.
REFERENCES
- 1. (1999) Nature Genet. (Suppl.), 21, 1–60. [Google Scholar]
- 2.Haff L.A. and Smirnov,I.P. (1997) Genome Res., 7, 378–388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Haff L.A. and Smirnov,I.P (1997) Nucleic Acids Res., 25, 3749–3750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ross P., Hall,L., Smirnov,I.P. and Haff,L. (1998) Nat. Biotechnol., 16, 1347–1351. [DOI] [PubMed] [Google Scholar]
- 5.Ross P., Smirnov,I.P., Hall,L. and Haff,L. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 6.Fei Z., Ono,T. and Smith,L.M. (1998) Nucleic Acids Res., 26, 2827–2828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Fei Z., Ono,T. and Smith,L.M. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 8.Bulter J.M., Royer,S., Hunter,J., Lin,H. and Becker,C.H. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 9.Li J., Butler,J.M., Tan,Y., Lin,H., Royer,S., Ohler,L., Shaler,T.A., Hunter,J.M., Pollart,D.J., Monforte,J.A. and Becker,C.H. (1999) Electrophoresis, 20, 1258–1265. [DOI] [PubMed] [Google Scholar]
- 10.Little D.P., Braun,A., Darnhofer-Demar,B. and Koster,H. (1997) Eur. J. Clin. Chem. Clin. Biochem., 35, 545–548. [PubMed] [Google Scholar]
- 11.Little D.P., Braun,A., Darnhofer-Demar,B., Frilling,A., Li,Y., McIver,R.T.,Jr and Koster,H. (1997) J. Mol. Med., 75, 745–750. [DOI] [PubMed] [Google Scholar]
- 12.Braun A., Little,D.P. and Koster,H. (1997) Clin. Chem., 43, 1151–1158. [PubMed] [Google Scholar]
- 13.Hihhins G.S., Little,D.P. and Koster,H. (1997) Biotechniques, 23, 710–714. [DOI] [PubMed] [Google Scholar]
- 14.Little D.P., Braun,A., O’Donnell,M.J. and Koster,H. (1997) Nature Med., 3, 1413–1416. [DOI] [PubMed] [Google Scholar]
- 15.Braun A., Little,D., Reuter,D., Muller-Mysok,B. and Koster,H. (1997) Genomics, 46, 18–23. [DOI] [PubMed] [Google Scholar]
- 16.Griffin T.J., Tang,W. and Smith,L.M. (1997) Nat. Biotechnol., 15, 1368–1372. [DOI] [PubMed] [Google Scholar]
- 17.Ross P., Lee,K. and Belgrader,P. (1997) Anal. Chem., 69, 4197–4202. [DOI] [PubMed] [Google Scholar]
- 18.Jing-Baucom P., Girard,H.E., Buttler,J. and Belgrader,P. (1997) Anal. Chem., 69, 4894–4898. [DOI] [PubMed] [Google Scholar]
- 19.Wang B., Olson,J., Belenky,A., Stanton,V. and Dykes,C. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 20.Gut I., Lechner,D., Berlin,K., Sauer,S., Kostrzewa,M., Frohlich,T., Thomas,W. and Lehrach,H. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 21.Kucharzak R., Berlin,K., Lehrach,H. and Gut,I. (1999) The Abstract for the 47thASMS Conference on Mass Spectrometry and Allied Topics. Dollars, TX.
- 22.Griffin T.J., Hall,J.G., Prudent,J.R. and Smith,L.M. (1999) Proc. Natl Acad. Sci. USA, 96, 6301–6306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tang K., Fu,D., Julien,D., Cantor,C. and Koster,H. (1999) Proc. Natl Acad. Sci. USA, 96, 10016–10020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Guo B.C. (1999) Anal. Chem., 71, 333R–337R. [DOI] [PubMed] [Google Scholar]
- 25.Chen X., Fei,Z., Smith,L.M., Bradbury,E.M. and Majidi,V. (1999) Anal. Chem., 71, 3118–3128. [DOI] [PubMed] [Google Scholar]
- 26.Fitzgerald M., Parr,G. and Smith,L.M. (1993) Rapid Commun. Mass Spectrom., 7, 895–897. [Google Scholar]