We report the complete genome sequence of Bifidobacterium longum subsp. longum JCM7052, isolated from human feces in Japan. This strain has the capability of growing on and utilizing gum arabic as an energy source. The complete genome is 2,273,627 bp long, with 1,929 protein-coding genes and 59.9 mol% G+C content.
ABSTRACT
We report the complete genome sequence of Bifidobacterium longum subsp. longum JCM7052, isolated from human feces in Japan. This strain has the capability of growing on and utilizing gum arabic as an energy source. The complete genome is 2,273,627 bp long, with 1,929 protein-coding genes and 59.9 mol% G+C content.
ANNOUNCEMENT
Bifidobacteria are Gram-positive, non-spore-forming, multimorphogenic rod-shaped, and strictly anaerobic bacteria. Some species of the genus Bifidobacterium are used as probiotics, which are defined as live microorganisms that, when administered in adequate amounts, confer a health benefit on the host (1). Among these species, Bifidobacterium longum subsp. longum is a stable and dominant population in the human gastrointestinal tract, and the genes in its genome vary depending on the habitat, such as the types and quantity of carbohydrates ingested by the host (2). In the genus Bifidobacterium, some strains of B. longum subsp. longum and B. adolescentis have been shown to ferment gum arabic as the carbohydrate source (3). We observed that B. longum subsp. longum JCM7052, which was isolated from the feces of an adult human by Mitsuoka (4), can grow on gum arabic (5).
B. longum subsp. longum JCM7052 was grown under anaerobic conditions in Bifidobacterium medium (5), and DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method (6). A sequence library was prepared with the TruSeq DNA sample prep kit (Illumina; FC-121-1001) as described in the manufacturer’s instructions. Sequencing was conducted using a next-generation sequencer, the Illumina HiSeq 2000 (San Diego, CA), and generated 100-bp paired-end reads. A total of 116,426,984 reads and 11,759 Mb of sequence data (∼2,500× coverage) was obtained after a quality check of the sequence reads, performed using CASAVA version 1.8.1, a software provided by Illumina. A total of 36 contigs were assembled using Velvet version 1.2.10 (7), and GapCloser version 1.10 was used to close the gaps that emerged during the scaffolding process, conducted using SOAPdenovo (8). The contigs were placed in order compared with the genome sequence of B. longum subsp. longum JCM1217 (GenBank accession number AP010888) using Mauve (9, 10). Since each contig contained a part of the IS21 insertion sequence (IS) family or rRNA operon in its terminal regions, the primers for PCR were designated referring to the sequences adjacent to the IS and rRNA operon (Table 1), and PCR amplification was performed (Bio-Rad T100 thermal cycler). The amplicons obtained were subjected to sequencing by means of primer walking on an Applied Biosystems 3730 DNA sequencer using a BigDye Terminator version 3.1 cycle sequencing kit (Applied Biosystems, Foster City, CA). The amplicon sequences were assembled into ordered contigs by visual examination. Finally, a circular sequence was obtained. This sequence was annotated using the DDBJ Fast Annotation and Submission Tool (DFAST) (11), and some products were identified using a BLAST (12) search of DDBJ service analytical tools.
TABLE 1.
PCR primer pairs used for gap closing between contigsa
| Target contig | Forward primer | Reverse primer |
|---|---|---|
| NODE4-10 | 5′-GCGAGACGATAGTCGAGCCATAAGG-3′ | 5′-CCTATCGGTCGATGATATGACCGCC-3′ |
| NODE10-6 | 5′-CCTCGCTATAGTGCTCGCCCTGCTC-3′ | 5′-GTTTCATCGACGCCGACCCGCTCAT-3′ |
| NODE6-7 | 5′-GGTGTCGTGTCTGGCCACCAATACG-3′ | 5′-TCTGCGGAATTCAGCGCGACATGCC-3′ |
| NODE7-5 | 5′-CAGATGGACAGCGCTGGCAGTCACA-3′ | 5′-GGAGAAGTTCATGCCGGACAGCAGC-3′ |
| NODE5-3 | 5′-TCCAAGTTCGACGACATGGAGCCGG-3′ | 5′-AAAGCCTGGGATTGAGCCCGTTGGC-3′ |
| NODE3-1 | 5′-GAGACGCGAAAGGACTCATGGTGGC-3′ | 5′-ACAGGAGGTTCGCGTTCTTGTCCGG-3′ |
| NODE1-9 | 5′-GCCTACAGAGCGGGTCATTACGAGC-3′ | 5′-AGGTACGGGTACGAACGCTCCAACG-3′ |
| NODE9-8 | 5′-GGGCCGACAAGCTGAAAACTCTCGC-3′ | 5′-AAGCCAGTTTCTGGCTCCCCTCTGG-3′ |
| NODE8-2 | 5′-GGTCGCGTTGTCCATCACGATCACG-3′ | 5′-CATGCAGGTCGATTTCGGCATGGCC-3′ |
| NODE2-4 | 5′-GGAGGCCAACATCGTGTCCATCGAC-3′ | 5′-CGTGACGCCATCGACGGATTGGAGA-3′ |
| NODE6-11 | 5′-CCAAGAGGTTTCTGGCTCCCCTCTT-3′ | 5′-GGGCGAAGCCTGTCGTTTCAACGAT-3′ |
| NODE9-2 | 5′-TCGATGATGAGAAGTCGGGCCTTGC-3′ | 5′-CATGCAGGTCGATTTCGGCATGGCC-3′ |
PCR was done under the following conditions: 95°C for 5 min followed by 30 cycles at 94°C for 30 sec, 45°C to 55°C for 30 sec, and 72°C for 3 to 6 min.
The complete genome of B. longum subsp. longum JCM7052 is 2,273,627 bp long and has a G+C content of 59.9 mol%. The genome contains 1,929 putative proteins and carries 4 rRNA operons, 56 tRNAs, and 1 transfer-messenger (tmRNA). Eight gene clusters found in the genome of strain JCM7052 were not found in B. longum subsp. longum JCM1217, which is a strain that cannot assimilate gum arabic. These genes may provide useful insights into the physiological properties of strain JCM7052.
Data availability.
This whole-genome project has been deposited in DDBJ/ENA/GenBank under accession number AP022379. The BioProject accession number is PRJDB9197. The raw sequence reads have been deposited in the DRA under accession number DRA011367.
ACKNOWLEDGMENT
This study was supported by the education fund of Yukiyoshi Gakuen.
REFERENCES
- 1.FAO/WHO. 2006. Probiotics in food: health and nutritional properties and guidelines for evaluation. FAO Food and Nutrition Paper 85. FAO/WHO, Rome, Italy. http://www.fao.org/3/a-a0512e.pdf.
- 2.Odamaki T, Bottacini F, Kato K, Mitsuyama E, Yoshida K, Horigome A, Xiao J-Z, van Sinderen D. 2018. Genomic diversity and distribution of Bifidobacterium longum subsp. longum across the human lifespan. Sci Rep 8:85. doi: 10.1038/s41598-017-18391-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Crociani F, Alessandrini A, Mucci MM, Biavati B. 1994. Degradation of complex carbohydrates by Bifidobacterium spp. Int J Food Microbiol 24:199–210. doi: 10.1016/0168-1605(94)90119-8. [DOI] [PubMed] [Google Scholar]
- 4.Mitsuoka T. 1969. Vergleichende Untersuchungen uber die Bifidobakterien aus dem Verdauungstrakt von Menschen und Tieren (Zugleich die Beschreibung von B. thermophilum nov. spec. und B. pseudolongum nov. spec.). Zentralbl Bakteriol Orig A210:52–64. [PubMed] [Google Scholar]
- 5.Saishin N, Yamamoto I. 2009. α-Galactosidase purified from Bifidobacterium longum JCM 7052 grown on gum arabic. J Biol Macromol 9:71–80. http://www.jsb.gr.jp/jbm/2009/0902_4.pdf. [Google Scholar]
- 6.Wilson K. 2001. Preparation of genomic DNA from bacteria. Curr Protoc Mol Biol 56:2.4.1–2.4.5. doi: 10.1002/0471142727.mb0204s56. [DOI] [PubMed] [Google Scholar]
- 7.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Darling ACE, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14:1394–1403. doi: 10.1101/gr.2289704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5:e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tanizawa Y, Fujisawa T, Nakamura Y. 2018. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics 34:1037–1039. doi: 10.1093/bioinformatics/btx713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Altschul SF, Wootton JC, Gertz EM, Agarwala R, Morgulis A, Schaffer AA, Yu Y-K. 2005. Protein database searches using compositionally adjusted substitution matrices. FEBS J 272:5101–5109. doi: 10.1111/j.1742-4658.2005.04945.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This whole-genome project has been deposited in DDBJ/ENA/GenBank under accession number AP022379. The BioProject accession number is PRJDB9197. The raw sequence reads have been deposited in the DRA under accession number DRA011367.
