Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 1994 Jun 11;22(11):2158–2165. doi: 10.1093/nar/22.11.2158

Analysis of E.coli promoter structures using neural networks.

I Mahadevan 1, I Ghosh 1
PMCID: PMC308136  PMID: 8029027

Abstract

Backpropagation neural network is trained to identify E.coli promoters of all spacing classes (15 to 21). A three module approach is employed wherein the first neural net module predicts the consensus boxes, the second module aligns the promoters to a length of 65 bases and the third neural net module predicts the entire sequence of 65 bases taking care of the possible interdependencies between the bases in the promoters. The networks were trained with 106 promoters and random sequences which were 60% AT rich and tested on 126 promoters (Bacterial, Mutant and Phage promoters). The network was 98% successful in promoter recognition and 90.2% successful in non-promoter recognition when tested on 5000 randomly generated sequences. The network was further trained with 11 mutated non-promoters and 8 mutated promoters of the p22ant promoter. The testing set with 7 mutated promoters and 13 mutated non-promoters of p22ant were identified. The network was upgraded using total 1665 data of promoters and non-promoters to identify any promoter sequences in the gene sequences. The network identified the locations of P1, P2 and P3 promoters in the pBR322 plasmid. A search for the start codon, Ribosomal Binding Site and the stop codon by a string search procedure has also been added to find the possible promoters that can yield protein products. The network was also successfully tested on a synthetic plasmid pWM528.

Full text

PDF
2164

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Brosius J., Cate R. L., Perlmutter A. P. Precise location of two promoters for the beta-lactamase gene of pBR322. S1 mapping of ribonucleic acid isolated from Escherichia coli or synthesized in vitro. J Biol Chem. 1982 Aug 10;257(15):9205–9210. [PubMed] [Google Scholar]
  2. Cardon L. R., Stormo G. D. Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments. J Mol Biol. 1992 Jan 5;223(1):159–170. doi: 10.1016/0022-2836(92)90723-w. [DOI] [PubMed] [Google Scholar]
  3. Demeler B., Zhou G. W. Neural network optimization for E. coli promoter prediction. Nucleic Acids Res. 1991 Apr 11;19(7):1593–1599. doi: 10.1093/nar/19.7.1593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Deuschle U., Kammerer W., Gentz R., Bujard H. Promoters of Escherichia coli: a hierarchy of in vivo strength indicates alternate structures. EMBO J. 1986 Nov;5(11):2987–2994. doi: 10.1002/j.1460-2075.1986.tb04596.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Hawley D. K., McClure W. R. Compilation and analysis of Escherichia coli promoter DNA sequences. Nucleic Acids Res. 1983 Apr 25;11(8):2237–2255. doi: 10.1093/nar/11.8.2237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Lisser S., Margalit H. Compilation of E. coli mRNA promoter sequences. Nucleic Acids Res. 1993 Apr 11;21(7):1507–1516. doi: 10.1093/nar/21.7.1507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Mandecki W., Hayden M. A., Shallcross M. A., Stotland E. A totally synthetic plasmid for general cloning, gene expression and mutagenesis in Escherichia coli. Gene. 1990 Sep 28;94(1):103–107. doi: 10.1016/0378-1119(90)90474-6. [DOI] [PubMed] [Google Scholar]
  8. Moyle H., Waldburger C., Susskind M. M. Hierarchies of base pair preferences in the P22 ant promoter. J Bacteriol. 1991 Mar;173(6):1944–1950. doi: 10.1128/jb.173.6.1944-1950.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Mulligan M. E., Hawley D. K., Entriken R., McClure W. R. Escherichia coli promoter sequences predict in vitro RNA polymerase selectivity. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):789–800. doi: 10.1093/nar/12.1part2.789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. O'Neill M. C. Escherichia coli promoters. I. Consensus as it relates to spacing class, specificity, repeat substructure, and three-dimensional organization. J Biol Chem. 1989 Apr 5;264(10):5522–5530. [PubMed] [Google Scholar]
  11. Stüber D., Bujard H. Organization of transcriptional signals in plasmids pBR322 and pACYC184. Proc Natl Acad Sci U S A. 1981 Jan;78(1):167–171. doi: 10.1073/pnas.78.1.167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Youderian P., Bouvier S., Susskind M. M. Sequence determinants of promoter activity. Cell. 1982 Oct;30(3):843–853. doi: 10.1016/0092-8674(82)90289-6. [DOI] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES