Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 1982 Jan 11;10(1):247–263. doi: 10.1093/nar/10.1.247

Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

W B Goad, M I Kanehisa
PMCID: PMC326131  PMID: 6801626

Abstract

We present an algorithm--a generalization of the Needleman-Wunsch-Sellers algorithm--which finds within longer sequences all subsequences that resemble one another locally. The probability that so close a resemblance would occur by chance alone is calculated and used to classify these local homologies according to statistical significance. Repeats and inverted repeats may also be found. Results for both random and biological nucleic acid sequences are presented. Fourteen complete genomes are analyzed for dyad symmetries.

Full text

PDF
247

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Beck E., Sommer R., Auerswald E. A., Kurz C., Zink B., Osterburg G., Schaller H., Sugimoto K., Sugisaki H., Okamoto T. Nucleotide sequence of bacteriophage fd DNA. Nucleic Acids Res. 1978 Dec;5(12):4495–4503. doi: 10.1093/nar/5.12.4495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Fiers W., Contreras R., Duerinck F., Haegeman G., Iserentant D., Merregaert J., Min Jou W., Molemans F., Raeymaekers A., Van den Berghe A. Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene. Nature. 1976 Apr 8;260(5551):500–507. doi: 10.1038/260500a0. [DOI] [PubMed] [Google Scholar]
  3. Fiers W., Contreras R., Haegemann G., Rogiers R., Van de Voorde A., Van Heuverswyn H., Van Herreweghe J., Volckaert G., Ysebaert M. Complete nucleotide sequence of SV40 DNA. Nature. 1978 May 11;273(5658):113–120. doi: 10.1038/273113a0. [DOI] [PubMed] [Google Scholar]
  4. Franck A., Guilley H., Jonard G., Richards K., Hirth L. Nucleotide sequence of cauliflower mosaic virus DNA. Cell. 1980 Aug;21(1):285–294. doi: 10.1016/0092-8674(80)90136-1. [DOI] [PubMed] [Google Scholar]
  5. Galibert F., Mandart E., Fitoussi F., Tiollais P., Charnay P. Nucleotide sequence of the hepatitis B virus genome (subtype ayw) cloned in E. coli. Nature. 1979 Oct 25;281(5733):646–650. doi: 10.1038/281646a0. [DOI] [PubMed] [Google Scholar]
  6. Godson G. N., Barrell B. G., Staden R., Fiddes J. C. Nucleotide sequence of bacteriophage G4 DNA. Nature. 1978 Nov 16;276(5685):236–247. doi: 10.1038/276236a0. [DOI] [PubMed] [Google Scholar]
  7. Hartley J. L., Donelson J. E. Nucleotide sequence of the yeast plasmid. Nature. 1980 Aug 28;286(5776):860–865. doi: 10.1038/286860a0. [DOI] [PubMed] [Google Scholar]
  8. Heffron F., McCarthy B. J., Ohtsubo H., Ohtsubo E. DNA sequence analysis of the transposon Tn3: three genes and three sites involved in transposition of Tn3. Cell. 1979 Dec;18(4):1153–1163. doi: 10.1016/0092-8674(79)90228-9. [DOI] [PubMed] [Google Scholar]
  9. Korn L. J., Queen C. L., Wegman M. N. Computer analysis of nucleic acid regulatory sequences. Proc Natl Acad Sci U S A. 1977 Oct;74(10):4401–4405. doi: 10.1073/pnas.74.10.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Max E. E., Maizel J. V., Jr, Leder P. The nucleotide sequence of a 5.5-kilobase DNA segment containing the mouse kappa immunoglobulin J and C region genes. J Biol Chem. 1981 May 25;256(10):5116–5120. [PubMed] [Google Scholar]
  11. Needleman S. B., Wunsch C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970 Mar;48(3):443–453. doi: 10.1016/0022-2836(70)90057-4. [DOI] [PubMed] [Google Scholar]
  12. Sanger F., Coulson A. R., Friedmann T., Air G. M., Barrell B. G., Brown N. L., Fiddes J. C., Hutchison C. A., 3rd, Slocombe P. M., Smith M. The nucleotide sequence of bacteriophage phiX174. J Mol Biol. 1978 Oct 25;125(2):225–246. doi: 10.1016/0022-2836(78)90346-7. [DOI] [PubMed] [Google Scholar]
  13. Sankoff D. Matching sequences under deletion-insertion constraints. Proc Natl Acad Sci U S A. 1972 Jan;69(1):4–6. doi: 10.1073/pnas.69.1.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Sellers P. H. Pattern recognition in genetic sequences. Proc Natl Acad Sci U S A. 1979 Jul;76(7):3041–3041. doi: 10.1073/pnas.76.7.3041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Soeda E., Arrand J. R., Smolar N., Walsh J. E., Griffin B. E. Coding potential and regulatory signals of the polyoma virus genome. Nature. 1980 Jan 31;283(5746):445–453. doi: 10.1038/283445a0. [DOI] [PubMed] [Google Scholar]
  16. Sutcliffe J. G. Complete nucleotide sequence of the Escherichia coli plasmid pBR322. Cold Spring Harb Symp Quant Biol. 1979;43(Pt 1):77–90. doi: 10.1101/sqb.1979.043.01.013. [DOI] [PubMed] [Google Scholar]
  17. Tinoco I., Jr, Uhlenbeck O. C., Levine M. D. Estimation of secondary structure in ribonucleic acids. Nature. 1971 Apr 9;230(5293):362–367. doi: 10.1038/230362a0. [DOI] [PubMed] [Google Scholar]
  18. Valenzuela P., Quiroga M., Zaldivar J., Rutter W. J., Kirschner M. W., Cleveland D. W. Nucleotide and corresponding amino acid sequences encoded by alpha and beta tubulin mRNAs. Nature. 1981 Feb 19;289(5799):650–655. doi: 10.1038/289650a0. [DOI] [PubMed] [Google Scholar]
  19. Yang R. C., Wu R. BK virus DNA: complete nucleotide sequence of a human tumor virus. Science. 1979 Oct 26;206(4417):456–462. doi: 10.1126/science.228391. [DOI] [PubMed] [Google Scholar]
  20. van Wezenbeek P. M., Hulsebos T. J., Schoenmakers J. G. Nucleotide sequence of the filamentous bacteriophage M13 DNA genome: comparison with phage fd. Gene. 1980 Oct;11(1-2):129–148. doi: 10.1016/0378-1119(80)90093-1. [DOI] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES