Two recent reviews, one in Nature Reviews Genetics from Bansal et al. (Statistical analysis strategies for association studies involving rare variants Nature Reviews Genetics 11, 773-785 (2010))1 and one elsewhere2, examined the emerging area of rare variant association studies. These reviews nicely describe the progression from association studies for common SNPs towards those for rare variants. We would like to add to these discussions a strategy that has been used by several groups for rare variant case-control association studies, developed independently of genome wide SNP association studies (GWAS) and largely confined to cancer genetics. We refer to this strategy as case-control mutation screening (CCMS).
Ideas contributing to CCMS are as follows. First, linkage analysis shows that evidence from many, individually very rare, sequence variants at the same locus can be combined3. Second, clinical testing of susceptibility genes such as BRCA1 and BRCA2 has shown that testing can be based on sequencing rather than genotyping. Third, the integrated evaluation of unclassified variants in BRCA1 and BRCA2 has shown that in silico assessment of rare variants — currently, rare missense substitutions (rMSs) — can be used to grade variants on the basis of predicted severity without attempting to dichotomize as deleterious versus neutral4. Finally, lessons from GWAS tell us that well-powered CCMS studies will be large, usually multi-center and often multi-ethnic, and therefore must be analyzed by statistical methods that allow for covariates.
The development of CCMS can be traced through the efforts of the genetics community to understand the contribution of heterozygous sequence variation in ATM to risk of breast cancer (Table 1). Analysis of ATM CCMS data started with a case-control study that employed a cohort allelic sums test limited to protein truncating variants plus variants that clearly damage splice junctions (T+SJVs)5, Analyses progressed to a two-pronged strategy of analyzing the pool of ATM T+SJVs in one logistic regression and the pool of rMSs in a second logistic regression6. The subtlety in this latter approach lies in combining all of the rMSs into a single categorical variable that incorporates prior information, such as sequence conservation, and grades the severity of rMSs from likely harmful to likely benign4,6. This variable is easily assessed in a logistic regression test for trend, thus minimizing the multiple testing problem while accommodating epidemiologic covariates. We believe that this form of CCMS, augmented by steadily improving statistical methods7,8, will be useful for identifying genes that harbor variants conferring intermediate risk, especially those in which most pathogenic variants are rare and either reduce or ablate function.
Table 1.
Key CCMS and related analyses of association of rare ATM variants with breast cancer
| Study | Year of publication | Key elements |
|---|---|---|
| FitzGeral et al.5 | 1997 | CCMS study, CAST test of T+SJVs. |
| Gatti et al.9 | 1999 | Theoretical study predicting relative importance of ATM missense substitutions. |
| Sommer et al.10 | 2003 | CCMS study incorporating prediction of missense substitution severity. |
| Renwick et al.11 | 2006 | Well-powered CCMS study, combined CAST of T+SJVs and pathogenic rMSs. |
| Tavtigian et al.6 | 2009 | CCMS meta-analysis, combined CAST of T+SJVs and graded rMS trend test. |
| Bernstein et al.12 | 2010 | Case-case analysis, supports importance of “predicted damaging” rMSs. |
CAST, cohort allelic sums test. T+SJVs, protein truncating plus splice junction variants. rMSs, rare missense substitutions.
Going forward, improving the accuracy and scope of methods of predicting sequence variant severity is a key goal. To this end, the Critical Assessment of Genomic Interpretation community exercise (http://genomeinterpretation.org/) will illuminate the capabilities of current approaches and inform their further development. An important additional issue is that methods for predicting gene dysfunction must be sufficiently transparent that other researchers can readily replicate predictions and judge the effects of hidden multiple testing, introduced by prediction of sequence variant severity, on CCMS data analysis.
Acknowledgments
This work was supported by NIH R01 CA121245.
References
- 1.Bansal V, Libiger O, Torkamani A, Schork NJ. Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet. 2010;11:773–785. doi: 10.1038/nrg2867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Asimit J, Zeggini E. Rare Variant Association Analysis Methods for Complex Traits. Annual Review of Genetics. 2010;44:293–308. doi: 10.1146/annurev-genet-102209-163421. [DOI] [PubMed] [Google Scholar]
- 3.Terwilliger JD, Ott J. Handbook of Human Genetic Linkage. Johns Hopkins University Press; Baltimore and London: 1994. p. 235. [Google Scholar]
- 4.Tavtigian SV, Byrnes GB, Goldgar DE, Thomas A. Classification of rare missense substitutions, using risk surfaces, with genetic- and molecular-epidemiology applications. Hum Mutat. 2008;29:1342–1354. doi: 10.1002/humu.20896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.FitzGerald MG, et al. Heterozygous ATM mutations do not contribute to early onset of breast cancer. Nat Genet. 1997;15:307–310. doi: 10.1038/ng0397-307. [DOI] [PubMed] [Google Scholar]
- 6.Tavtigian SV, et al. Rare, evolutionarily unlikely missense substitutions in ATM confer increased risk of breast cancer. Am J Hum Genet. 2009;85:427–446. doi: 10.1016/j.ajhg.2009.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Le Calvez-Kelm F, et al. Rare, evolutionarily unlikely missense substitutions in CHEK2 contribute to breast cancer susceptibility: results from a breast CFR case-control mutation screening study. Breast Cancer Res. 2011 doi: 10.1186/bcr2810. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Price AL, et al. Pooled association tests for rare variants in exon-resequencing studies. Am J Hum Genet. 2010;86:832–838. doi: 10.1016/j.ajhg.2010.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gatti RA, Tward A, Concannon P. Cancer risk in ATM heterozygotes: a model of phenotypic and mechanistic differences between missense and truncating mutations. Mol Genet Metab. 1999;68:419–423. doi: 10.1006/mgme.1999.2942. [DOI] [PubMed] [Google Scholar]
- 10.Sommer SS, et al. ATM missense mutations are frequent in patients with breast cancer. Cancer Genet Cytogenet. 2003;145:115–120. doi: 10.1016/s0165-4608(03)00119-5. [DOI] [PubMed] [Google Scholar]
- 11.Renwick A, et al. ATM mutations that cause ataxia-telangiectasia are breast cancer susceptibility alleles. Nat Genet. 2006;38:873–875. doi: 10.1038/ng1837. [DOI] [PubMed] [Google Scholar]
- 12.Bernstein JL, et al. Radiation exposure, the ATM Gene, and contralateral breast cancer in the women’s environmental cancer and radiation epidemiology study. J Natl Cancer Inst. 2010;102:475–483. doi: 10.1093/jnci/djq055. [DOI] [PMC free article] [PubMed] [Google Scholar]
