Nucleic Acids Research. 1984 Jan 11;12(1 Pt 1):203–213. doi: 10.1093/nar/12.1part1.203

Use of statistical criteria for screening potential homologies in nucleic acid sequences.

M Kanehisa
PMCID: PMC320997  PMID: 6694901

Abstract

We previously proposed a simple formula for assessing the statistical significance of homologous segments found when comparing two nucleic acid sequences (Goad and Kanehisa, Nucleic Acids Res. 10, 247-263, 1982). This paper clarifies the basic assumptions of the formula and examines its reliability by Monte Carlo calculations. The results were satisfactory for random sequences. The formula is a useful measure for screening potentially interesting homologies, and it can be implemented in any search algorithm. Examples of the screening procedure are given using the graphic display version of the Goad-Kanehisa algorithm.
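The abstract does not reproduce the formula itself, so the following is only a minimal sketch of the general idea of checking a significance estimate against Monte Carlo runs on random sequences. It is not the Goad-Kanehisa formula: it assumes exact, ungapped matches and uniform base composition, and every function name in it (`longest_common_run`, `expected_exact_matches`, `monte_carlo_tail`) is hypothetical. The analytic quantity used for comparison is the standard expected number of position pairs that start a run of k consecutive matches, (n-k+1)(m-k+1)p^k.

```python
import random


def longest_common_run(a, b):
    """Length of the longest exact (ungapped, mismatch-free) segment shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the matching run that ended one position earlier in both sequences.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best:
                    best = cur[j]
        prev = cur
    return best


def expected_exact_matches(n, m, k, p=0.25):
    """Expected number of (i, j) start pairs giving k consecutive matches.

    Assumption: independent positions with per-position match probability p
    (0.25 for equally frequent bases). Illustrative only.
    """
    return max(0, n - k + 1) * max(0, m - k + 1) * p ** k


def random_sequence(length, rng, alphabet="ACGT"):
    """Random sequence with equally likely letters (an assumption of this sketch)."""
    return "".join(rng.choice(alphabet) for _ in range(length))


def monte_carlo_tail(n, m, k, trials=200, seed=0):
    """Fraction of random sequence pairs whose longest shared run is at least k."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        a = random_sequence(n, rng)
        b = random_sequence(m, rng)
        if longest_common_run(a, b) >= k:
            hits += 1
    return hits / trials
```

Calling `monte_carlo_tail(200, 200, 10)` alongside `expected_exact_matches(200, 200, 10)` gives a simulated frequency and an analytic expectation that can be compared. When the expectation is well below 1 it serves as an approximate upper bound on the chance of seeing such a match in unrelated sequences, which is the kind of threshold a screening step might use.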

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Brutlag D. L., Clayton J., Friedland P., Kedes L. H. SEQ: a nucleotide sequence analysis and recombination system. Nucleic Acids Res. 1982 Jan 11;10(1):279–294. doi: 10.1093/nar/10.1.279.
  2. Goad W. B., Kanehisa M. I. Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries. Nucleic Acids Res. 1982 Jan 11;10(1):247–263. doi: 10.1093/nar/10.1.247.
  3. Kanehisa M. I. Los Alamos sequence analysis package for nucleic acids and proteins. Nucleic Acids Res. 1982 Jan 11;10(1):183–196. doi: 10.1093/nar/10.1.183.
  4. Korn L. J., Queen C. L., Wegman M. N. Computer analysis of nucleic acid regulatory sequences. Proc Natl Acad Sci U S A. 1977 Oct;74(10):4401–4405. doi: 10.1073/pnas.74.10.4401.
  5. Maizel J. V., Jr, Lenk R. P. Enhanced graphic matrix analysis of nucleic acid and protein sequences. Proc Natl Acad Sci U S A. 1981 Dec;78(12):7665–7669. doi: 10.1073/pnas.78.12.7665.
  6. Needleman S. B., Wunsch C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970 Mar;48(3):443–453. doi: 10.1016/0022-2836(70)90057-4.
  7. Pustell J., Kafatos F. C. A high speed, high capacity homology matrix: zooming through SV40 and polyoma. Nucleic Acids Res. 1982 Aug 11;10(15):4765–4782. doi: 10.1093/nar/10.15.4765.
  8. Smith T. F., Waterman M. S. Identification of common molecular subsequences. J Mol Biol. 1981 Mar 25;147(1):195–197. doi: 10.1016/0022-2836(81)90087-5.
  9. Soeda E., Arrand J. R., Smolar N., Walsh J. E., Griffin B. E. Coding potential and regulatory signals of the polyoma virus genome. Nature. 1980 Jan 31;283(5746):445–453. doi: 10.1038/283445a0.
  10. Staden R. An interactive graphics program for comparing and aligning nucleic acid and amino acid sequences. Nucleic Acids Res. 1982 May 11;10(9):2951–2961. doi: 10.1093/nar/10.9.2951.
  11. Wilbur W. J., Lipman D. J. Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A. 1983 Feb;80(3):726–730. doi: 10.1073/pnas.80.3.726.

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
