Abstract
We have compiled the DNA sequence data forEscherichia coliavailable from the GenBank and EMBL data libraries and independently from the literature. We provide the most definitive version of the ECDEscherichia colidatabase now exclusively via the World Wide Web System: http://susi.bio.uni-giessen.de/usr/local/www/ html/ecdc.html . Our database encloses an assembled set of contiguous sequences. Each of these contigs compiles all available sequence information, including those derived from a variety of elder sequences. The organisation of the database allows precise physical location of each individual gene or regulatory region, even taking into consideration discrepancies in nomenclature. The WWW program allows to branch into the original EMBL and SWISSPROT datafiles. A number of links to other WWW servers is provided. A FASTA and BLAST search may be performed online. Besides the WWW format a flat file version may be obtained via ftp. The ftp version may also be obtained from the EMBL data library as part of the CD-ROM issue of the EMBL sequence database, which is released and updated every 3 months. After deletion of all detected overlaps a total of 3 588 706 individual bp has been determined up to the end of September 1996. This corresponds to a total of 77.09% of the entire E.coli chromosome consisting of approximately 4655 kb. About 479 kb (10.3%) are additionally available from Kyoto (Japan). Another 94 kb (2%) are available, but mapping has not been confirmed. Thus the total may have reached 89.4%.
Full Text
The Full Text of this article is available as a PDF (44.1 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Berlyn M. B., Letovsky S. Genome-related datasets within the E. coli Genetic Stock Center database. Nucleic Acids Res. 1992 Dec 11;20(23):6143–6151. doi: 10.1093/nar/20.23.6143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Birkenbihl R. P., Vielmetter W. Complete maps of IS1, IS2, IS3, IS4, IS5, IS30 and IS150 locations in Escherichia coli K12. Mol Gen Genet. 1989 Dec;220(1):147–153. doi: 10.1007/BF00260869. [DOI] [PubMed] [Google Scholar]
- Birkenbihl R. P., Vielmetter W. Cosmid-derived map of E. coli strain BHB2600 in comparison to the map of strain W3110. Nucleic Acids Res. 1989 Jul 11;17(13):5057–5069. doi: 10.1093/nar/17.13.5057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blattner F. R., Burland V., Plunkett G., 3rd, Sofia H. J., Daniels D. L. Analysis of the Escherichia coli genome. IV. DNA sequence of the region from 89.2 to 92.8 minutes. Nucleic Acids Res. 1993 Nov 25;21(23):5408–5417. doi: 10.1093/nar/21.23.5408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Borodovsky M., Koonin E. V., Rudd K. E. New genes in old sequence: a strategy for finding genes in the bacterial genome. Trends Biochem Sci. 1994 Aug;19(8):309–313. doi: 10.1016/0968-0004(94)90067-1. [DOI] [PubMed] [Google Scholar]
- Borodovsky M., Rudd K. E., Koonin E. V. Intrinsic and extrinsic approaches for detecting genes in a bacterial genome. Nucleic Acids Res. 1994 Nov 11;22(22):4756–4767. doi: 10.1093/nar/22.22.4756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burland V., Plunkett G., 3rd, Daniels D. L., Blattner F. R. DNA sequence and analysis of 136 kilobases of the Escherichia coli genome: organizational symmetry around the origin of replication. Genomics. 1993 Jun;16(3):551–561. doi: 10.1006/geno.1993.1230. [DOI] [PubMed] [Google Scholar]
- Burland V., Plunkett G., 3rd, Sofia H. J., Daniels D. L., Blattner F. R. Analysis of the Escherichia coli genome VI: DNA sequence of the region from 92.8 through 100 minutes. Nucleic Acids Res. 1995 Jun 25;23(12):2105–2119. doi: 10.1093/nar/23.12.2105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daniels D. L., Plunkett G., 3rd, Burland V., Blattner F. R. Analysis of the Escherichia coli genome: DNA sequence of the region from 84.5 to 86.5 minutes. Science. 1992 Aug 7;257(5071):771–778. doi: 10.1126/science.1379743. [DOI] [PubMed] [Google Scholar]
- Fujita N., Mori H., Yura T., Ishihama A. Systematic sequencing of the Escherichia coli genome: analysis of the 2.4-4.1 min (110,917-193,643 bp) region. Nucleic Acids Res. 1994 May 11;22(9):1637–1639. doi: 10.1093/nar/22.9.1637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karp P. D., Riley M., Paley S. M., Pelligrini-Toole A. EcoCyc: an encyclopedia of Escherichia coli genes and metabolism. Nucleic Acids Res. 1996 Jan 1;24(1):32–39. doi: 10.1093/nar/24.1.32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaufmann A., Stierhof Y. D., Henning U. New outer membrane-associated protease of Escherichia coli K-12. J Bacteriol. 1994 Jan;176(2):359–367. doi: 10.1128/jb.176.2.359-367.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kohara Y., Akiyama K., Isono K. The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library. Cell. 1987 Jul 31;50(3):495–508. doi: 10.1016/0092-8674(87)90503-4. [DOI] [PubMed] [Google Scholar]
- Kröger M. Compilation of DNA sequences of Escherichia coli. Nucleic Acids Res. 1989;17 (Suppl):r283–r309. doi: 10.1093/nar/17.suppl.r283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kröger M., Wahl R. Compilation of DNA sequences of Escherichia coli K12 (ECD and ECDC; update 1995). Nucleic Acids Res. 1996 Jan 1;24(1):29–31. doi: 10.1093/nar/24.1.29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kröger M., Wahl R., Rice P. Compilation of DNA sequences of Escherichia coli (update 1990). Nucleic Acids Res. 1990 Apr 25;18 (Suppl):2549–2587. doi: 10.1093/nar/18.suppl.2549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kröger M., Wahl R., Rice P. Compilation of DNA sequences of Escherichia coli (update 1991). Nucleic Acids Res. 1991 Apr 25;19 (Suppl):2023–2043. doi: 10.1093/nar/19.suppl.2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kröger M., Wahl R., Rice P. Compilation of DNA sequences of Escherichia coli (update 1993). Nucleic Acids Res. 1993 Jul 1;21(13):2973–3000. doi: 10.1093/nar/21.13.2973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kröger M., Wahl R., Schachtel G., Rice P. Compilation of DNA sequences of Escherichia coli (update 1992). Nucleic Acids Res. 1992 May 11;20 (Suppl):2119–2144. doi: 10.1093/nar/20.suppl.2119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kunisawa T., Nakamura M., Watanabe H., Otsuka J., Tsugita A., Yeh L. S., George D. G., Barker W. C. Escherichia coli K12 genomic database. Protein Seq Data Anal. 1990 Jun;3(2):157–162. [PubMed] [Google Scholar]
- Médigue C., Bouché J. P., Hénaut A., Danchin A. Mapping of sequenced genes (700 kbp) in the restriction map of the Escherichia coli chromosome. Mol Microbiol. 1990 Feb;4(2):169–187. doi: 10.1111/j.1365-2958.1990.tb00585.x. [DOI] [PubMed] [Google Scholar]
- Médigue C., Hénaut A., Danchin A. Escherichia coli molecular genetic map (1000 kbp): update I. Mol Microbiol. 1990 Sep;4(9):1443–1454. [PubMed] [Google Scholar]
- Médigue C., Viari A., Hénaut A., Danchin A. Colibri: a functional data base for the Escherichia coli genome. Microbiol Rev. 1993 Sep;57(3):623–654. doi: 10.1128/mr.57.3.623-654.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Médigue C., Viari A., Hénaut A., Danchin A. Escherichia coli molecular genetic map (1500 kbp): update II. Mol Microbiol. 1991 Nov;5(11):2629–2640. doi: 10.1111/j.1365-2958.1991.tb01972.x. [DOI] [PubMed] [Google Scholar]
- Plunkett G., 3rd, Burland V., Daniels D. L., Blattner F. R. Analysis of the Escherichia coli genome. III. DNA sequence of the region from 87.2 to 89.2 minutes. Nucleic Acids Res. 1993 Jul 25;21(15):3391–3398. doi: 10.1093/nar/21.15.3391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riley M., Space D. B. Genes and proteins of Escherichia coli (GenProtEc). Nucleic Acids Res. 1996 Jan 1;24(1):40–40. doi: 10.1093/nar/24.1.40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rodriguez-Tomé P., Stoehr P. J., Cameron G. N., Flores T. P. The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res. 1996 Jan 1;24(1):6–12. doi: 10.1093/nar/24.1.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rudd K. E., Miller W., Ostell J., Benson D. A. Alignment of Escherichia coli K12 DNA sequences to a genomic restriction map. Nucleic Acids Res. 1990 Jan 25;18(2):313–321. doi: 10.1093/nar/18.2.313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sofia H. J., Burland V., Daniels D. L., Plunkett G., 3rd, Blattner F. R. Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes. Nucleic Acids Res. 1994 Jul 11;22(13):2576–2586. doi: 10.1093/nar/22.13.2576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Umeda M., Ohtsubo E. Mapping of insertion elements IS1, IS2 and IS3 on the Escherichia coli K-12 chromosome. Role of the insertion elements in formation of Hfrs and F' factors and in rearrangement of bacterial chromosomes. J Mol Biol. 1989 Aug 20;208(4):601–614. doi: 10.1016/0022-2836(89)90151-4. [DOI] [PubMed] [Google Scholar]
- Wahl R., Kröger M. ECDC--a totally integrated and interactively usable genetic map of Escherichia coli K12. Microbiol Res. 1995 Mar;150(1):7–61. doi: 10.1016/S0944-5013(11)80034-0. [DOI] [PubMed] [Google Scholar]
- Wahl R., Rice P., Rice C. M., Kröger M. ECD--a totally integrated database of Escherichia coli K12. Nucleic Acids Res. 1994 Sep;22(17):3450–3455. doi: 10.1093/nar/22.17.3450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watanabe H., Kunisawa T. Computer-assisted analysis of chromosomal locations and transcriptional directions of Escherichia coli genes. Protein Seq Data Anal. 1990 Jun;3(2):149–156. [PubMed] [Google Scholar]
- Yura T., Mori H., Nagai H., Nagata T., Ishihama A., Fujita N., Isono K., Mizobuchi K., Nakata A. Systematic sequencing of the Escherichia coli genome: analysis of the 0-2.4 min region. Nucleic Acids Res. 1992 Jul 11;20(13):3305–3308. doi: 10.1093/nar/20.13.3305. [DOI] [PMC free article] [PubMed] [Google Scholar]
