Abstract
We present here a framework for the study of molecular variation within a single species. Information on DNA haplotype divergence is incorporated into an analysis of variance format, derived from a matrix of squared-distances among all pairs of haplotypes. This analysis of molecular variance (AMOVA) produces estimates of variance components and F-statistic analogs, designated here as φ-statistics, reflecting the correlation of haplotypic diversity at different levels of hierarchical subdivision. The method is flexible enough to accommodate several alternative input matrices, corresponding to different types of molecular data, as well as different types of evolutionary assumptions, without modifying the basic structure of the analysis. The significance of the variance components and φ-statistics is tested using a permutational approach, eliminating the normality assumption that is conventional for analysis of variance but inappropriate for molecular data. Application of AMOVA to human mitochondrial DNA haplotype data shows that population subdivisions are better resolved when some measure of molecular differences among haplotypes is introduced into the analysis. At the intraspecific level, however, the additional information provided by knowing the exact phylogenetic relations among haplotypes or by a nonlinear translation of restriction-site change into nucleotide diversity does not significantly modify the inferred population genetic structure. Monte Carlo studies show that site sampling does not fundamentally affect the significance of the molecular variance components. The AMOVA treatment is easily extended in several different directions and it constitutes a coherent and flexible framework for the statistical analysis of molecular data.
Full Text
The Full Text of this article is available as a PDF (4.0 MB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Anderson S., Bankier A. T., Barrell B. G., de Bruijn M. H., Coulson A. R., Drouin J., Eperon I. C., Nierlich D. P., Roe B. A., Sanger F. Sequence and organization of the human mitochondrial genome. Nature. 1981 Apr 9;290(5806):457–465. doi: 10.1038/290457a0. [DOI] [PubMed] [Google Scholar]
- Barbujani G., Sokal R. R. Genetic population structure of Italy. II. Physical and cultural barriers to gene flow. Am J Hum Genet. 1991 Feb;48(2):398–411. [PMC free article] [PubMed] [Google Scholar]
- Barbujani G., Sokal R. R. Zones of sharp genetic change in Europe are also linguistic boundaries. Proc Natl Acad Sci U S A. 1990 Mar;87(5):1816–1819. doi: 10.1073/pnas.87.5.1816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Birky C. W., Jr, Fuerst P., Maruyama T. Organelle gene diversity under migration, mutation, and drift: equilibrium expectations, approach to equilibrium, effects of heteroplasmic cells, and comparison to nuclear genes. Genetics. 1989 Mar;121(3):613–627. doi: 10.1093/genetics/121.3.613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonné-Tamir B., Johnson M. J., Natali A., Wallace D. C., Cavalli-Sforza L. L. Human mitochondrial DNA types in two Israeli populations--a comparative study at the DNA level. Am J Hum Genet. 1986 Mar;38(3):341–351. [PMC free article] [PubMed] [Google Scholar]
- Brega A., Gardella R., Semino O., Morpurgo G., Astaldi Ricotti G. B., Wallace D. C., Santachiara Benerecetti A. S. Genetic studies on the Tharu population of Nepal: restriction endonuclease polymorphisms of mitochondrial DNA. Am J Hum Genet. 1986 Oct;39(4):502–512. [PMC free article] [PubMed] [Google Scholar]
- Brown W. M., George M., Jr, Wilson A. C. Rapid evolution of animal mitochondrial DNA. Proc Natl Acad Sci U S A. 1979 Apr;76(4):1967–1971. doi: 10.1073/pnas.76.4.1967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown W. M., Prager E. M., Wang A., Wilson A. C. Mitochondrial DNA sequences of primates: tempo and mode of evolution. J Mol Evol. 1982;18(4):225–239. doi: 10.1007/BF01734101. [DOI] [PubMed] [Google Scholar]
- Cann R. L., Stoneking M., Wilson A. C. Mitochondrial DNA and human evolution. Nature. 1987 Jan 1;325(6099):31–36. doi: 10.1038/325031a0. [DOI] [PubMed] [Google Scholar]
- Cockerham C. C. Analyses of gene frequencies. Genetics. 1973 Aug;74(4):679–700. doi: 10.1093/genetics/74.4.679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ewens W. J., Spielman R. S., Harris H. Estimation of genetic variation at the DNA level from restriction endonuclease data. Proc Natl Acad Sci U S A. 1981 Jun;78(6):3748–3750. doi: 10.1073/pnas.78.6.3748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Excoffier L. Evolution of human mitochondrial DNA: evidence for departure from a pure neutral model of populations at equilibrium. J Mol Evol. 1990 Feb;30(2):125–139. doi: 10.1007/BF02099939. [DOI] [PubMed] [Google Scholar]
- Felsenstein J. Phylogenies from molecular sequences: inference and reliability. Annu Rev Genet. 1988;22:521–565. doi: 10.1146/annurev.ge.22.120188.002513. [DOI] [PubMed] [Google Scholar]
- Giles R. E., Blanc H., Cann H. M., Wallace D. C. Maternal inheritance of human mitochondrial DNA. Proc Natl Acad Sci U S A. 1980 Nov;77(11):6715–6719. doi: 10.1073/pnas.77.11.6715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gyllensten U., Wharton D., Josefsson A., Wilson A. C. Paternal inheritance of mitochondrial DNA in mice. Nature. 1991 Jul 18;352(6332):255–257. doi: 10.1038/352255a0. [DOI] [PubMed] [Google Scholar]
- Johnson M. J., Wallace D. C., Ferris S. D., Rattazzi M. C., Cavalli-Sforza L. L. Radiation of human mitochondria DNA types analyzed by restriction endonuclease cleavage patterns. J Mol Evol. 1983;19(3-4):255–271. doi: 10.1007/BF02099973. [DOI] [PubMed] [Google Scholar]
- Long J. C., Smouse P. E., Wood J. W. The allelic correlation structure of Gainj- and Kalam-speaking people. II. The genetic distance between population subdivisions. Genetics. 1987 Oct;117(2):273–283. doi: 10.1093/genetics/117.2.273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Long J. C. The allelic correlation structure of Gainj- and Kalam-speaking people. I. The estimation and interpretation of Wright's F-statistics. Genetics. 1986 Mar;112(3):629–647. doi: 10.1093/genetics/112.3.629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lynch M., Crease T. J. The analysis of population survey data on DNA sequence variation. Mol Biol Evol. 1990 Jul;7(4):377–394. doi: 10.1093/oxfordjournals.molbev.a040607. [DOI] [PubMed] [Google Scholar]
- Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967 Feb;27(2):209–220. [PubMed] [Google Scholar]
- Nei M. Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci U S A. 1973 Dec;70(12):3321–3323. doi: 10.1073/pnas.70.12.3321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei M. F-statistics and analysis of gene diversity in subdivided populations. Ann Hum Genet. 1977 Oct;41(2):225–233. doi: 10.1111/j.1469-1809.1977.tb01918.x. [DOI] [PubMed] [Google Scholar]
- Nei M., Miller J. C. A simple method for estimating average number of nucleotide substitutions within and between populations from restriction data. Genetics. 1990 Aug;125(4):873–879. doi: 10.1093/genetics/125.4.873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei M., Tajima F. DNA polymorphism detectable by restriction endonucleases. Genetics. 1981 Jan;97(1):145–163. doi: 10.1093/genetics/97.1.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei M., Tajima F. Maximum likelihood estimation of the number of nucleotide substitutions from restriction sites data. Genetics. 1983 Sep;105(1):207–217. doi: 10.1093/genetics/105.1.207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reynolds J., Weir B. S., Cockerham C. C. Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genetics. 1983 Nov;105(3):767–779. doi: 10.1093/genetics/105.3.767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richardson R. H., Smouse P. E., Richardson M. E. Patterns of molecular variation. II. associations of electrophoretic mobility and larval substrate within species of the Drosophila mulleri complex. Genetics. 1977 Jan;85(1):141–154. doi: 10.1093/genetics/85.1.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schurr T. G., Ballinger S. W., Gan Y. Y., Hodge J. A., Merriwether D. A., Lawrence D. N., Knowler W. C., Weiss K. M., Wallace D. C. Amerindian mitochondrial DNAs have rare Asian mutations at high frequencies, suggesting they derived from four primary maternal lineages. Am J Hum Genet. 1990 Mar;46(3):613–623. [PMC free article] [PubMed] [Google Scholar]
- Scozzari R., Torroni A., Semino O., Sirugo G., Brega A., Santachiara-Benerecetti A. S. Genetic studies on the Senegal population. I. Mitochondrial DNA polymorphisms. Am J Hum Genet. 1988 Oct;43(4):534–544. [PMC free article] [PubMed] [Google Scholar]
- Semino O., Torroni A., Scozzari R., Brega A., De Benedictis G., Santachiara Benerecetti A. S. Mitochondrial DNA polymorphisms in Italy. III. Population data from Sicily: a possible quantitation of maternal African ancestry. Ann Hum Genet. 1989 May;53(Pt 2):193–202. doi: 10.1111/j.1469-1809.1989.tb01784.x. [DOI] [PubMed] [Google Scholar]
- Slatkin M. The average number of sites separating DNA sequences drawn from a subdivided population. Theor Popul Biol. 1987 Aug;32(1):42–49. doi: 10.1016/0040-5809(87)90038-4. [DOI] [PubMed] [Google Scholar]
- Stoneking M., Jorde L. B., Bhatia K., Wilson A. C. Geographic variation in human mitochondrial DNA from Papua New Guinea. Genetics. 1990 Mar;124(3):717–733. doi: 10.1093/genetics/124.3.717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vilkki J., Savontaus M. L., Nikoskelainen E. K. Human mitochondrial DNA types in Finland. Hum Genet. 1988 Dec;80(4):317–321. doi: 10.1007/BF00273643. [DOI] [PubMed] [Google Scholar]
- Wallace D. C., Garrison K., Knowler W. C. Dramatic founder effects in Amerindian mitochondrial DNAs. Am J Phys Anthropol. 1985 Oct;68(2):149–155. doi: 10.1002/ajpa.1330680202. [DOI] [PubMed] [Google Scholar]
- Watterson G. A. On the number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975 Apr;7(2):256–276. doi: 10.1016/0040-5809(75)90020-9. [DOI] [PubMed] [Google Scholar]