Abstract
We present an algorithm--a generalization of the Needleman-Wunsch-Sellers algorithm--which finds within longer sequences all subsequences that resemble one another locally. The probability that so close a resemblance would occur by chance alone is calculated and used to classify these local homologies according to statistical significance. Repeats and inverted repeats may also be found. Results for both random and biological nucleic acid sequences are presented. Fourteen complete genomes are analyzed for dyad symmetries.
Full text
PDF
















Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Beck E., Sommer R., Auerswald E. A., Kurz C., Zink B., Osterburg G., Schaller H., Sugimoto K., Sugisaki H., Okamoto T. Nucleotide sequence of bacteriophage fd DNA. Nucleic Acids Res. 1978 Dec;5(12):4495–4503. doi: 10.1093/nar/5.12.4495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiers W., Contreras R., Duerinck F., Haegeman G., Iserentant D., Merregaert J., Min Jou W., Molemans F., Raeymaekers A., Van den Berghe A. Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene. Nature. 1976 Apr 8;260(5551):500–507. doi: 10.1038/260500a0. [DOI] [PubMed] [Google Scholar]
- Fiers W., Contreras R., Haegemann G., Rogiers R., Van de Voorde A., Van Heuverswyn H., Van Herreweghe J., Volckaert G., Ysebaert M. Complete nucleotide sequence of SV40 DNA. Nature. 1978 May 11;273(5658):113–120. doi: 10.1038/273113a0. [DOI] [PubMed] [Google Scholar]
- Franck A., Guilley H., Jonard G., Richards K., Hirth L. Nucleotide sequence of cauliflower mosaic virus DNA. Cell. 1980 Aug;21(1):285–294. doi: 10.1016/0092-8674(80)90136-1. [DOI] [PubMed] [Google Scholar]
- Galibert F., Mandart E., Fitoussi F., Tiollais P., Charnay P. Nucleotide sequence of the hepatitis B virus genome (subtype ayw) cloned in E. coli. Nature. 1979 Oct 25;281(5733):646–650. doi: 10.1038/281646a0. [DOI] [PubMed] [Google Scholar]
- Godson G. N., Barrell B. G., Staden R., Fiddes J. C. Nucleotide sequence of bacteriophage G4 DNA. Nature. 1978 Nov 16;276(5685):236–247. doi: 10.1038/276236a0. [DOI] [PubMed] [Google Scholar]
- Hartley J. L., Donelson J. E. Nucleotide sequence of the yeast plasmid. Nature. 1980 Aug 28;286(5776):860–865. doi: 10.1038/286860a0. [DOI] [PubMed] [Google Scholar]
- Heffron F., McCarthy B. J., Ohtsubo H., Ohtsubo E. DNA sequence analysis of the transposon Tn3: three genes and three sites involved in transposition of Tn3. Cell. 1979 Dec;18(4):1153–1163. doi: 10.1016/0092-8674(79)90228-9. [DOI] [PubMed] [Google Scholar]
- Korn L. J., Queen C. L., Wegman M. N. Computer analysis of nucleic acid regulatory sequences. Proc Natl Acad Sci U S A. 1977 Oct;74(10):4401–4405. doi: 10.1073/pnas.74.10.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Max E. E., Maizel J. V., Jr, Leder P. The nucleotide sequence of a 5.5-kilobase DNA segment containing the mouse kappa immunoglobulin J and C region genes. J Biol Chem. 1981 May 25;256(10):5116–5120. [PubMed] [Google Scholar]
- Needleman S. B., Wunsch C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970 Mar;48(3):443–453. doi: 10.1016/0022-2836(70)90057-4. [DOI] [PubMed] [Google Scholar]
- Sanger F., Coulson A. R., Friedmann T., Air G. M., Barrell B. G., Brown N. L., Fiddes J. C., Hutchison C. A., 3rd, Slocombe P. M., Smith M. The nucleotide sequence of bacteriophage phiX174. J Mol Biol. 1978 Oct 25;125(2):225–246. doi: 10.1016/0022-2836(78)90346-7. [DOI] [PubMed] [Google Scholar]
- Sankoff D. Matching sequences under deletion-insertion constraints. Proc Natl Acad Sci U S A. 1972 Jan;69(1):4–6. doi: 10.1073/pnas.69.1.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sellers P. H. Pattern recognition in genetic sequences. Proc Natl Acad Sci U S A. 1979 Jul;76(7):3041–3041. doi: 10.1073/pnas.76.7.3041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soeda E., Arrand J. R., Smolar N., Walsh J. E., Griffin B. E. Coding potential and regulatory signals of the polyoma virus genome. Nature. 1980 Jan 31;283(5746):445–453. doi: 10.1038/283445a0. [DOI] [PubMed] [Google Scholar]
- Sutcliffe J. G. Complete nucleotide sequence of the Escherichia coli plasmid pBR322. Cold Spring Harb Symp Quant Biol. 1979;43(Pt 1):77–90. doi: 10.1101/sqb.1979.043.01.013. [DOI] [PubMed] [Google Scholar]
- Tinoco I., Jr, Uhlenbeck O. C., Levine M. D. Estimation of secondary structure in ribonucleic acids. Nature. 1971 Apr 9;230(5293):362–367. doi: 10.1038/230362a0. [DOI] [PubMed] [Google Scholar]
- Valenzuela P., Quiroga M., Zaldivar J., Rutter W. J., Kirschner M. W., Cleveland D. W. Nucleotide and corresponding amino acid sequences encoded by alpha and beta tubulin mRNAs. Nature. 1981 Feb 19;289(5799):650–655. doi: 10.1038/289650a0. [DOI] [PubMed] [Google Scholar]
- Yang R. C., Wu R. BK virus DNA: complete nucleotide sequence of a human tumor virus. Science. 1979 Oct 26;206(4417):456–462. doi: 10.1126/science.228391. [DOI] [PubMed] [Google Scholar]
- van Wezenbeek P. M., Hulsebos T. J., Schoenmakers J. G. Nucleotide sequence of the filamentous bacteriophage M13 DNA genome: comparison with phage fd. Gene. 1980 Oct;11(1-2):129–148. doi: 10.1016/0378-1119(80)90093-1. [DOI] [PubMed] [Google Scholar]
