Abstract
Recombination is an important evolutionary factor in many organisms, including humans, and understanding its effects is a central task facing geneticists. Detecting past recombination events is therefore valuable; this article introduces statistics that give a lower bound on the number of recombination events in the history of a sample, on the basis of the patterns of variation in the sample DNA. Lower bounds are the appropriate target because many recombination events in the history are typically undetectable, so the true number of historical recombinations cannot be obtained. The statistics can be calculated quickly by computer and improve upon the earlier bound of Hudson and Kaplan (1985). A method is developed to combine bounds on local regions of the data into a more powerful overall bound. The method can accommodate different models of recombination occurrence. The approach gives recombination event bounds between all pairs of sites, which helps identify regions with more detectable recombinations, and these bounds can be viewed graphically. Under coalescent simulations, across a wide range of parameter values, one of the new bounds improves substantially on the earlier method (by up to a factor of 2) in the expected number of recombination events detected. The method is applied to data from a region within the lipoprotein lipase gene, where it substantially increases the amount of detected recombination. Further, the detected recombination events cluster strongly in an area near the center of the region. A program implementing these statistics, which was used for this article, is available from http://www.stats.ox.ac.uk/mathgen/programs.html.
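For orientation, the earlier Hudson and Kaplan (1985) bound that the abstract uses as its baseline can be sketched in a few lines. The sketch below is an illustration under stated assumptions, not the program distributed with the article and not the new statistics themselves: it assumes biallelic sites coded as "0" and "1" under an infinite-sites model, and the function names `four_gamete_incompatible` and `hudson_kaplan_bound` are hypothetical. A pair of sites showing all four gametes implies at least one recombination event between them, and the bound is the largest number of such intervals that do not overlap, found here with a simple dynamic program over sites.

```python
def four_gamete_incompatible(haplotypes, i, j):
    """Return True if sites i and j carry all four gametes 00, 01, 10, 11.

    Assumes every site is biallelic (alleles coded "0" and "1").
    """
    gametes = {(h[i], h[j]) for h in haplotypes}
    return len(gametes) == 4


def hudson_kaplan_bound(haplotypes):
    """Lower bound on recombination events from the four-gamete test.

    best[j] holds the largest number of non-overlapping incompatible
    intervals that end at or before site j. Intervals (i, k) and (k, j)
    share only an endpoint, so both can be counted.
    """
    n_sites = len(haplotypes[0])
    best = [0] * n_sites
    for j in range(1, n_sites):
        best[j] = best[j - 1]  # option: no interval ends exactly at site j
        for i in range(j):
            if four_gamete_incompatible(haplotypes, i, j):
                # one event in (i, j) plus the best count up to site i
                best[j] = max(best[j], best[i] + 1)
    return best[-1]


sample = ["000", "011", "101", "110"]
print(hudson_kaplan_bound(sample))  # 2
```

In this toy sample every pair of sites shows all four gametes, so the intervals between sites 1 and 2 and between sites 2 and 3 each require an event, giving a bound of 2. The same recursion accepts any local bound in place of the 0-or-1 four-gamete indicator, which is one plausible way to combine local bounds; whether it matches the article's combination method in detail is an assumption here.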
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Clark A. G., Weiss K. M., Nickerson D. A., Taylor S. L., Buchanan A., Stengård J., Salomaa V., Vartiainen E., Perola M., Boerwinkle E. Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am J Hum Genet. 1998 Aug;63(2):595–612. doi: 10.1086/301977.
- Fearnhead P., Donnelly P. Estimating recombination rates from population genetic data. Genetics. 2001 Nov;159(3):1299–1318. doi: 10.1093/genetics/159.3.1299.
- Griffiths R. C., Marjoram P. Ancestral inference from samples of DNA sequences with recombination. J Comput Biol. 1996 Winter;3(4):479–502. doi: 10.1089/cmb.1996.3.479.
- Hein J. Reconstructing evolution of sequences subject to recombination using parsimony. Math Biosci. 1990 Mar;98(2):185–200. doi: 10.1016/0025-5564(90)90123-g.
- Hudson R. R., Kaplan N. L. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics. 1985 Sep;111(1):147–164. doi: 10.1093/genetics/111.1.147.
- Kuhner M. K., Yamato J., Felsenstein J. Maximum likelihood estimation of recombination rates from population data. Genetics. 2000 Nov;156(3):1393–1401. doi: 10.1093/genetics/156.3.1393.
- Nickerson D. A., Taylor S. L., Weiss K. M., Clark A. G., Hutchinson R. G., Stengård J., Salomaa V., Vartiainen E., Boerwinkle E., Sing C. F. DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene. Nat Genet. 1998 Jul;19(3):233–240. doi: 10.1038/907.
- Nielsen R. Estimation of population parameters and recombination rates from single nucleotide polymorphisms. Genetics. 2000 Feb;154(2):931–942. doi: 10.1093/genetics/154.2.931.
- Przeworski M., Wall J. D. Why is there so little intragenic linkage disequilibrium in humans? Genet Res. 2001 Apr;77(2):143–151. doi: 10.1017/s0016672301004967.
- Templeton A. R., Clark A. G., Weiss K. M., Nickerson D. A., Boerwinkle E., Sing C. F. Recombinational and mutational hotspots within the human lipoprotein lipase gene. Am J Hum Genet. 2000 Jan;66(1):69–83. doi: 10.1086/302699.
- Wall J. D. A comparison of estimators of the population recombination rate. Mol Biol Evol. 2000 Jan;17(1):156–163. doi: 10.1093/oxfordjournals.molbev.a026228.