Genetics. 2004 Aug;167(4):1915–1928. doi: 10.1534/genetics.103.015693

Significance tests and weighted values for AFLP similarities, based on Arabidopsis in silico AFLP fragment length distributions.

Wim J. M. Koopman, Gerrit Gort
PMCID: PMC1471014  PMID: 15342529

Abstract

Many AFLP studies include relatively unrelated genotypes that contribute noise rather than signal to data sets. We developed (1) estimates of expected AFLP similarities between unrelated genotypes, (2) significance tests for AFLP similarities, enabling the detection of unrelated genotypes, and (3) weighted similarity coefficients that incorporate band position information. Detecting unrelated genotypes and using weighted similarity coefficients will make the analysis of AFLP data sets more informative and more reliable. Test statistics and weighted coefficients were developed for total numbers of shared bands and for Dice, Jaccard, Nei and Li, and simple matching (dis)similarity coefficients. Theoretical and in silico AFLP fragment length distributions (FLDs) were examined as a basis for the tests. The in silico AFLP FLD based on the Arabidopsis thaliana genome sequence was the most appropriate for angiosperms. The G + C content of the selective nucleotides in the in silico AFLP procedure significantly influenced the FLD. Therefore, separate test statistics were calculated for AFLP procedures with high, average, and low G + C contents in the selective nucleotides. The test statistics are generally applicable to angiosperms with a G + C content of approximately 35–40%, but represent conservative estimates for genotypes with higher G + C contents. For the latter, test statistics based on a rice genome sequence are more appropriate.
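
For readers less familiar with the coefficients named in the abstract, the following is a minimal sketch of the standard unweighted Dice, Jaccard, and simple matching similarities for two band profiles scored as presence/absence (1/0) at the same band positions. It does not reproduce the significance tests or the weighted coefficients developed in the article. The function names and the example profiles are hypothetical, chosen only for illustration. The Nei and Li similarity is commonly written in the same form as Dice, so it is not listed separately here.

```python
def band_counts(x, y):
    """Return (a, b, c, d): bands shared, only in x, only in y, absent in both."""
    if len(x) != len(y):
        raise ValueError("profiles must score the same band positions")
    a = sum(1 for i, j in zip(x, y) if i and j)
    b = sum(1 for i, j in zip(x, y) if i and not j)
    c = sum(1 for i, j in zip(x, y) if j and not i)
    d = len(x) - a - b - c
    return a, b, c, d


def dice(x, y):
    """Dice similarity: 2a / (2a + b + c). Shared absences are ignored."""
    a, b, c, _ = band_counts(x, y)
    total = 2 * a + b + c
    return 2 * a / total if total else float("nan")


def jaccard(x, y):
    """Jaccard similarity: a / (a + b + c). Shared absences are ignored."""
    a, b, c, _ = band_counts(x, y)
    total = a + b + c
    return a / total if total else float("nan")


def simple_matching(x, y):
    """Simple matching similarity: (a + d) / (a + b + c + d)."""
    a, b, c, d = band_counts(x, y)
    total = a + b + c + d
    return (a + d) / total if total else float("nan")


# Hypothetical profiles for two genotypes scored at eight band positions.
genotype_1 = [1, 1, 0, 1, 0, 0, 1, 0]
genotype_2 = [1, 0, 0, 1, 1, 0, 1, 0]

print(band_counts(genotype_1, genotype_2))
print(dice(genotype_1, genotype_2))
print(jaccard(genotype_1, genotype_2))
print(simple_matching(genotype_1, genotype_2))
```

Dice and Jaccard use only bands present in at least one profile, whereas simple matching also counts shared absences, which is why the coefficients can rank the same pair of genotypes differently.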

Articles from Genetics are provided here courtesy of Oxford University Press
