Abstract
A new type of search algorithm was developed to find biological information inherent in nucleic acid sequences. The algorithm is of the pattern-match type and is based on the fact that genetic information is often a function of a predictable statistical occurrence of the four bases within parts of the sequence. The search algorithm compares the known statistical pattern of bases in, for example, a promoter with an unknown sequence and calculates the statistical significance of the match at every position in the unknown sequence. The program was tested on 54 published prokaryotic promoters: 44 could be found with 1 false answer, or 49 with 4 false answers. The program was also applied to plasmid pBR322. All promoters functioning in an in vitro transcription system (tet, anti-tet, p4, bla and ori) were found, except the so-called p5 promoter. A search for donor and acceptor sites was performed in a human HLA genomic sequence that contains six introns. Five of the possible six donor and acceptor sites were found.
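The general idea of scoring a known base-frequency pattern against every position of an unknown sequence can be sketched as follows. This is only an illustration and not the authors' program: the abstract does not state which statistic was used, so the sketch assumes a log-odds position weight matrix and a simple z-score over all windows as a stand-in for "statistical significance". The example sites, the test sequence, and the function names `build_log_odds`, `scan` and `z_scores` are all hypothetical.

```python
import math

BASES = "ACGT"

def build_log_odds(sites, pseudocount=1.0, background=0.25):
    """Column-wise log-odds scores from equal-length aligned example sites.

    Assumes a uniform background base frequency (an assumption, not from the paper).
    """
    length = len(sites[0])
    matrix = []
    for i in range(length):
        counts = {b: pseudocount for b in BASES}
        for site in sites:
            counts[site[i]] += 1
        total = sum(counts.values())
        matrix.append({b: math.log2(counts[b] / total / background) for b in BASES})
    return matrix

def scan(sequence, matrix):
    """Score every window of the sequence; returns a list of (position, score)."""
    width = len(matrix)
    return [
        (pos, sum(matrix[i][sequence[pos + i]] for i in range(width)))
        for pos in range(len(sequence) - width + 1)
    ]

def z_scores(hits):
    """Express each window score in standard deviations from the mean of all windows."""
    scores = [score for _, score in hits]
    mean = sum(scores) / len(scores)
    variance = sum((s - mean) ** 2 for s in scores) / len(scores)
    sd = math.sqrt(variance) or 1.0  # avoid division by zero if all scores are equal
    return [(pos, (score - mean) / sd) for pos, score in hits]

# Hypothetical aligned example sites, invented for illustration only.
sites = ["TATAAT", "TATGAT", "TACAAT", "TATACT", "GATAAT"]
matrix = build_log_odds(sites)

# Hypothetical "unknown" sequence containing one strong match.
sequence = "GCGCGCTATAATGCGCGC"
best = max(z_scores(scan(sequence, matrix)), key=lambda hit: hit[1])
print(best)
```

In a real search, a threshold on the score would trade found sites against false answers, which is the kind of trade-off the promoter test results describe.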