Abstract
Lactobacillus johnsonii is a member of the acidophilus group of lactobacilli. Because of their probiotic properties, including attachment to epithelial cells, immunomodulation, and competitive exclusion of pathogens, representatives of this group are being intensively studied. Here we report the complete annotated genome sequence of Lactobacillus johnsonii FI9785, a strain which prevents the colonization of specific-pathogen-free chicks by Clostridium perfringens.
The competitive exclusion law formulated by Gause (5) states that two species that compete for the exact same resources cannot stably coexist. In recent times, the exploitation of this concept has gained renewed interest in the agricultural sector. Modern animal production methods rely on stringent hygiene measures, vaccination, and until recently the use of antimicrobial agents in order to control pathogens. A new regulation (1831/2003/EC) preventing the regular use of antibiotic growth promoters previously frequently included in feeds has led to an increase in the occurrence of endemic diseases. Clostridium perfringens, which causes necrotic enteritis in poultry, has emerged as one of the beneficiaries of this policy change (16). Lactobacillus johnsonii FI9785 has been shown to reduce the extent to which C. perfringens colonizes and persists in specific-pathogen-free chicks (8). In order to interrogate the genome sequence with regard to the strain's exclusion properties and to enable the use of “omics”-type techniques for further analyses of these properties, the complete genome sequence of this strain was determined using 454 pyrosequencing technology (454 Life Sciences, Branford, CT).
The initial draft assembly provided by 454 Life Sciences was based on 708,584 pyrosequencing reads with an average read length of 108 nucleotides which assembled into 45 contigs. The genome sequence of L. johnsonii NCC533 (12) was used to order these contigs into large scaffolds. Standard PCR followed by primer walk sequencing on the resulting products was used to close the gaps in combination with inverse PCR for gaps for which no linkage information could be obtained. In the case of one large gap (10 kb), an integration/rescue cloning strategy, followed by transposon-assisted sequencing of the rescued fragment, was used to close the respective gap. Sequence assembly was carried out using the Phred/Phrap (4) software program in conjunction with the Staden software package (14). The predicted error rate of the finished sequence is 1 × 10−6.
The finished L. johnsonii FI9785 sequence was annotated using the GenDB 2.2 annotation tool (11). Open reading frame (ORF) sequences were determined using the REGANOR5 software program (10), based on the combined ORF predictions of CRITICA (2) and Glimmer (13). Putative ribosomal binding sites and tRNA genes were identified using the RBSfinder (15) and tRNAscan-SE (9) software programs, respectively. An automatic functional annotation was computed based on different analyses, followed by a manual annotation of each predicted gene. Eighteen similarity searches (1) were performed against different databases, including the nonredundant database provided by National Center for Biotechnology Information, SWISS-PROT and TrEMBL, KEGG, Pfam, TIGRFAM, and InterPro. Additionally, SignalP (3), helix-turn-helix, and TMHMM (7) were applied. Predicted ORFs were manually reviewed, and alterations were made on the basis of the presence of potential ribosomal binding sites, sequence alignments, and available literature data. The L. johnsonii FI9785 genome is composed of one circular chromosome of 1,755,993 bp with a GC content of 34.49% and two circular plasmids, p9785S (3,471 bp) (6) and p9785L (25,652 bp), with GC contents of 35.81% and 30.38%, respectively. A total of 1,710 ORFs were identified on the chromosome, of which the majority (77.6%) are encoded on the leading strand of chromosome replication. The small and large plasmids carry 3 and 25 ORFs, respectively. The chromosome contains 4 rRNA gene operons and 53 tRNA genes, as well as 1 complete prophage genome.
Nucleotide sequence accession numbers.
Genome information for the chromosome and the two plasmids of L. johnsonii has been deposited in the EMBL/GenBank databases with accession numbers FN298497, AY862141, and FN357112.
Acknowledgments
We acknowledge funding through a Core Strategic Grant from the Biotechnology and Biological Sciences Research Council (BBSRC).
Footnotes
Published ahead of print on 18 September 2009.
REFERENCES
- 1.Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Badger, J. H., and G. J. Olsen. 1999. CRITICA: coding region identification tool invoking comparative analysis. Mol. Biol. Evol. 16:512-524. [DOI] [PubMed] [Google Scholar]
- 3.Bendtsen, J. D., H. Nielsen, G. von Heijne, and S. Brunak. 2004. Improved prediction of signal peptides: SignalP 3.0. J. Mol. Biol. 340:783-795. [DOI] [PubMed] [Google Scholar]
- 4.Ewing, B., and P. Green. 1998. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8:186-194. [PubMed] [Google Scholar]
- 5.Gause, G. F. 1934. The struggle for existence. Williams and Wilkins, Baltimore, MD.
- 6.Horn, N., U. Wegmann, A. Narbad, and M. J. Gasson. 2005. Characterisation of a novel plasmid p9785S from Lactobacillus johnsonii FI9785. Plasmid 54:176-183. [DOI] [PubMed] [Google Scholar]
- 7.Krogh, A., B. Larsson, G. von Heijne, and E. L. Sonnhammer. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305:567-580. [DOI] [PubMed] [Google Scholar]
- 8.La Ragione, R. M., A. Narbad, M. J. Gasson, and M. J. Woodward. 2004. In vivo characterization of Lactobacillus johnsonii FI9785 for use as a defined competitive exclusion agent against bacterial pathogens in poultry. Lett. Appl. Microbiol. 38:197-205. [DOI] [PubMed] [Google Scholar]
- 9.Lowe, T. M., and S. R. Eddy. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955-964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.McHardy, A. C., A. Goesmann, A. Puhler, and F. Meyer. 2004. Development of joint application strategies for two microbial gene finders. Bioinformatics 20:1622-1631. [DOI] [PubMed] [Google Scholar]
- 11.Meyer, F., A. Goesmann, A. C. McHardy, D. Bartels, T. Bekel, J. Clausen, J. Kalinowski, B. Linke, O. Rupp, R. Giegerich, and A. Puhler. 2003. GenDB—an open source genome annotation system for prokaryote genomes. Nucleic Acids Res. 31:2187-2195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pridmore, R. D., B. Berger, F. Desiere, D. Vilanova, C. Barretto, A. C. Pittet, M. C. Zwahlen, M. Rouvet, E. Altermann, R. Barrangou, B. Mollet, A. Mercenier, T. Klaenhammer, F. Arigoni, and M. A. Schell. 2004. The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533. Proc. Natl. Acad. Sci. USA 101:2512-2517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Salzberg, S. L., A. L. Delcher, S. Kasif, and O. White. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res. 26:544-548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Staden, R., K. F. Beal, and J. K. Bonfield. 2000. The Staden package, 1998. Methods Mol. Biol. 132:115-130. [DOI] [PubMed] [Google Scholar]
- 15.Suzek, B. E., M. D. Ermolaeva, M. Schreiber, and S. L. Salzberg. 2001. A probabilistic method for identifying start codons in bacterial genomes. Bioinformatics 17:1123-1130. [DOI] [PubMed] [Google Scholar]
- 16.Van Immerseel, F., J. De Buck, F. Pasmans, G. Huyghebaert, F. Haesebrouck, and R. Ducatelle. 2004. Clostridium perfringens in poultry: an emerging threat for animal and public health. Avian Pathol. 33:537-549. [DOI] [PubMed] [Google Scholar]