Equol is an intestinal bacterial metabolite derived from the isoflavone daidzein and has beneficial health effects. Most equol producers belong to the family Coriobacteriaceae, which includes species such as Adlercreutzia equolifaciens. Here, we report the draft genome sequence of A. equolifaciens IPLA 37004, a human isolate that does not produce equol.
ABSTRACT
Equol is an intestinal bacterial metabolite derived from the isoflavone daidzein and has beneficial health effects. Most equol producers belong to the family Coriobacteriaceae, which includes species such as Adlercreutzia equolifaciens. Here, we report the draft genome sequence of A. equolifaciens IPLA 37004, a human isolate that does not produce equol.
ANNOUNCEMENT
Among isoflavones and their metabolites, the bacterial daidzein-derived compound (S)-(−)-equol has the strongest estrogenic activity (1) and antioxidant action (2), by which it can positively influence human health (3). However, only 30 to 60% of humans harbor equol-producing microbes in their intestines. Most equol-producing bacteria belong to the family Coriobacteriaceae of the phylum Actinobacteria, which includes species such as Adlercreutzia equolifaciens, Asaccharobacter celatus, Enterorhabdus mucosicola, Slackia isoflavoniconvertens, and Slackia equolifaciens (3). However, whether equol production is a family-, species-, or strain-specific characteristic is not yet known.
Strain IPLA 37004 was isolated from fecal samples from a woman with an equol-producing microbiota, using agar plates of Gifu anaerobic medium (GAM) (Nissui) supplemented with 5 g liter−1 arginine (GAM-Arg; Merck) and cultivation under strict anaerobic conditions. Sequencing and sequence comparison of the 16S rRNA gene of IPLA 37004 identified the strain as A. equolifaciens. Surprisingly, the strain did not produce equol when cultured in GAM-Arg with isoflavones, daidzein, or dihydrodaidzein. To get insights regarding its non-equol-producing phenotype, the genome of A. equolifaciens IPLA 37004 was sequenced, and its sequence was analyzed.
After growth of the strain in GAM-Arg, total genomic DNA from IPLA 37004 was extracted and purified using the DNeasy blood and tissue kit (Qiagen), as suggested by the manufacturer. For genome sequencing, a library was constructed using SPRIworks fragment library system I (Beckman Coulter), according to the manufacturer’s instructions. Paired-end sequencing (2 × 150-bp reads) was performed at Eurofins Genomics on a HiSeq sequencer (Illumina) (sequence mode, NovaSeq 6000; S2 flow cell; paired-end 150-bp reads; Xp flowchart). The average genome coverage approached 100×, yielding a total of 5,568,276 bp of adapter-free and quality-filtered reads after the use of Trimmomatic (4) and FastQC (5) software packages. De novo assembly of quality-filtered nontrimmed reads in contigs was accomplished using SPAdes v3.6.2 (6), setting the parameters with a k value of 125 or 127 and only-assembler. Then, the NCBI Prokaryotic Genome Annotation Pipeline (7) was used to predict and annotate the open reading frames. Whole-genome comparisons were achieved by using the multiple genome alignment software package Mauve v2.3.1 (8). Finally, BLAST v2.10.0 searches (BLASTp suite, NCBI) were used to determine homology between proteins of different strains.
The genome of A. equolifaciens IPLA 37004 consisted of 2,664,741 bp in 169 contigs longer than 200 bp, with a G+C content of 63.7%. The maximum contig length was 325,998 bp, and the N50 and N90 values were 101,827 and 25,297 bp, respectively. A total of 2,310 genes were identified in the IPLA 37004 genome, including 2,251 coding sequences, of which 2,198 encoded complete proteins and 53 were identified as pseudogenes. In addition, 59 genes encoded RNA molecules, including 8 rRNAs, 48 tRNAs, and 3 noncoding RNAs. Genes encoding reductases homologous to those of A. equolifaciens DSM 19450T involved in equol formation (9) were not detected in the IPLA 37004 genome. Whole-genome comparison of DSM 19450T and IPLA 37004 identified shared flanking genes upstream and downstream of the equol locus in A. equolifaciens DSM 19450T. The deduced amino acids of the shared genes showed an identity range of 80 to 99%. The gene cluster of DSM 19450T encompassed a region of about 11 kbp, which was absent in IPLA 37004.
Data availability.
The genome sequence of A. equolifaciens IPLA 37004 was deposited in GenBank under accession number VJNE00000000. The fastq files of the raw reads were deposited in the NCBI SRA under accession number SRR9657807.
ACKNOWLEDGMENTS
This study was supported by projects from MINECO (grant AGL-2014-57820-R) and Asturias Principality (grant IDI/2018/000114).
REFERENCES
- 1.Selvaraj V, Zakroczymski MA, Naaz A, Mukai M, Ju YH, Doerge DR, Katzenellenbogen JA, Helferich WG, Cooke PS. 2004. Estrogenicity of the isoflavone metabolite equol on reproductive and non-reproductive organs in mice. Biol Reprod 71:966–972. doi: 10.1095/biolreprod.104.029512. [DOI] [PubMed] [Google Scholar]
- 2.Choi EJ, Kim GH. 2014. The antioxidant activity of daidzein metabolites, O-desmethylangolensin and equol, in HepG2 cells. Mol Med Rep 9:328–332. doi: 10.3892/mmr.2013.1752. [DOI] [PubMed] [Google Scholar]
- 3.Mayo B, Vázquez L, Flórez AB. 2019. Equol: a bacterial metabolite from the daidzein isoflavone and its presumed beneficial health effects. Nutrients 11:2231. doi: 10.3390/nu11092231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
- 6.Bankevich A, Nurk S, Antipov D, Gurevich A, Dvorkin M, Kulikov AS, Lesin V, Nikolenko S, Pham S, Prjibelski A, Pyshkin A, Sirotkin A, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Haft DH, DiCuccio M, Badretdin A, Brover V, Chetvernin V, O'Neill K, Li W, Chitsaz F, Derbyshire MK, Gonzales NR, Gwadz M, Lu F, Marchler GH, Song JS, Thanki N, Yamashita RA, Zheng C, Thibaud-Nissen F, Geer LY, Marchler-Bauer A, Pruitt KD. 2018. RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res 46:D851–D860. doi: 10.1093/nar/gkx1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Darling AC, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14:1394–1403. doi: 10.1101/gr.2289704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Toh H, Oshima K, Suzuki T, Hattori M, Morita H. 2013. Complete genome sequence of the equol-producing bacterium Adlercreutzia equolifaciens DSM 19450T. Genome Announc 1:e00742-13. doi: 10.1128/genomeA.00742-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence of A. equolifaciens IPLA 37004 was deposited in GenBank under accession number VJNE00000000. The fastq files of the raw reads were deposited in the NCBI SRA under accession number SRR9657807.