Abstract
Background
Different feeding regimens in infancy alter the gastrointestinal (gut) microbial environment. The fecal microbiota in turn influences gastrointestinal homeostasis including metabolism, immune function, and extra-/intra-intestinal signaling. Advances in next generation sequencing (NGS) have enhanced our ability to study the gut microbiome of breast-fed (BF) and formula-fed (FF) infants with a data-driven hypothesis approach.
Methods
Next generation sequencing libraries were constructed from fecal samples of BF (n=24) and FF (n=10) infants and sequenced on an Illumina HiSeq 2500. Taxonomic classification of the NGS data was performed using the Sunbeam/Kraken pipeline and a functional analysis at the gene level was performed using publicly available algorithms, including BLAST, and custom scripts. Differentially represented genera, genes, and NCBI Clusters of Orthologous Genes (COG) were determined between cohorts using count data and R (statistical packages edgeR and DESeq2).
Results
Thirty-nine genera were found to be differentially represented between the BF and FF cohorts (FDR ≤ 0.01) including Parabacteroides, Enterococcus, Haemophilus, Gardnerella, and Staphylococcus. A Welch t-test of the Shannon diversity index for BF and FF samples approached significance (p=0.061). Bray-Curtis and Jaccard distance analyses demonstrated clustering and overlap in each analysis. Sixty COGs were significantly overrepresented and those most significantly represented in BF vs. FF samples showed dichotomy of categories representing gene functions. Over 1,700 genes were found to be differentially represented (abundance) between the BF and FF cohorts.
Conclusions
Fecal samples analyzed from BF and FF infants demonstrated differences in microbiota genera. The BF cohort showed a greater presence of the beneficial genus Bifidobacterium. Several genes were identified as present at different abundances between cohorts, indicating differences in functional pathways such as cellular defense mechanisms and carbohydrate metabolism influenced by feeding. Confirmation of gene level NGS data via PCR and electrophoresis analysis revealed distinct differences in gene abundances associated with important biologic pathways.
Keywords: metagenomics, next generation sequencing, gut microbiome, whole genome, breast-feeding, infants
1 Introduction
Early dietary content is an important consideration in the long-term development of immunologic, metabolic, and many chronic disorders (Singhal and Lanigan, 2007; Cox et al., 2014; Rodriguez et al., 2015; Clapp et al., 2017; Davis et al., 2017; Davis et al., 2020; Turroni et al., 2020; Sarkar et al., 2021). Analyzing the infant fecal microbiome to understand the effects on the gastrointestinal (gut) microbiota conferred by early feeding/diet could help elucidate the mechanism underlying the development of these phenotypes. Next generation sequencing (NGS) is a technique that enables deep probing of both the meta-taxonomy of the gut flora and the metagenomic signature of the microbiome. Dietary differences may have a long-lasting effect on the gut microbiome by impacting the composition and biological functions of the organisms present. This study seeks to contrast taxonomic variation between breast-fed (BF) and formula-fed (FF) infants at the genus level and to identify orthologous gene clusters that may be in differential abundance between the cohorts. Additionally, the study attempts to characterize the metagenome composition and differences between BF and FF cohorts.
Previous studies, including work from our laboratory, demonstrate differences in the gut microbiome of BF versus FF infants (Lee et al., 2015; Schwartz et al., 2012; Bäckhed et al., 2015; Baumann-Dudenhoeffer et al., 2018; Stewart et al., 2018; Di Guglielmo et al., 2019). These studies highlight species diversity between differently fed infants using a 16S ribosomal RNA analysis (Schwartz et al., 2012; Bäckhed et al., 2015; Lee et al., 2015) and identify key genera abundance dissimilarities between FF and BF very young infants. Key findings include a significant predominance of the Bifidobacterium genus in BF infants and more abundant Enterococcus and Escherichia genera in FF samples. Additional metagenomic analysis using a shotgun approach, versus a 16S ribosomal RNA approach, indicates a diversity of gut microbiota at both the genera and gene level (Baumann-Dudenhoeffer et al., 2018; Stewart et al., 2018; Di Guglielmo et al., 2019).
Formula feeding influences the persistence of a more diverse, but not necessarily beneficial, gut microbiota (Davis et al., 2020). Prior work in our laboratory demonstrated differential levels of bacterial genes in each cohort, showing seven genes with relatively lower abundance in the FF infants contrasted with 364 genes with higher relative abundance. The most notable change in gene abundance is the lack of one gene (CRISPR-Cas9) in FF fecal samples. CRISPR-Cas9 is a key component of bacterial cellular defense mechanisms for protecting against both pathogenic mutations and antibiotic resistance. Overall, the specific genes in question suggest a biological explanation for gut microbiota acting differently on the intestinal epithelium in the development of pathogenic strains, drug resistance, and biofilm formation.
Other studies demonstrate that different short-chain fatty acids dominate in BF versus FF infant fecal samples (Fukuda et al., 2012; den Besten et al., 2013; Cox et al., 2014). These metabolic differences may reflect the gut microbiota composition that generates these small molecules and the downstream effects on host gastrointestinal, immunologic, and neurologic functions. Further analysis of these microbiome differences is required to expand our understanding of how diet early in life impacts the gut microbiota and microbiome.
Research on the infant microbiome has demonstrated both pliability as well as susceptibility to external influence (Singhal and Lanigan, 2007; Cox et al., 2014; Davis et al., 2020; Sarkar et al., 2021). Particular microbiota species are known to be dominant in BF infants (Pannaraj et al., 2017). Studies of the genus Bifidobacterium suggest a critical symbiotic role of human milk oligosaccharides in breast milk and the genus of organisms that metabolize them (Sela et al., 2008; Bode, 2012; Ruiz-Moyano et al., 2013; Garrido et al., 2015; Lewis et al., 2015). The long-term health implications of microbiome changes may not be subtle. If a more pro-inflammatory or pro-pathogenic environment is fostered in the gut of young infants due to dietary influences that change micro-organisms, gene expression, and carbohydrate metabolism, individuals may develop increased immune disorders (Schwartz et al., 2012), the need for broad-spectrum antibiotics (Taft et al., 2018; Casaburi et al., 2019), and gastrointestinal and neurologic disorders (Clapp et al., 2017; Kang et al., 2017; Kang et al., 2019). Clinical care of young patients could be directed to protect the beneficial gut microflora and focus efforts on influencing it early in life when the microbiome is still adaptable; long-term benefits on adolescent and adult health may follow (Singhal and Lanigan, 2007; Cox et al., 2014; Davis et al., 2020; Sarkar et al., 2021).
To better understand the importance of both taxonomy and differential gene abundance, the present study employs whole-genome fecal metagenomic next-generation sequencing and a computational pipeline previously used in our laboratory (Di Guglielmo et al., 2019). The goal of this manuscript is to expand the prior analysis in size and interpretation of the metagenomics data and create a refined and more accurate picture of the metataxonomic profile of the cohorts studied. Each Clusters of Orthologous Genes (COG) pattern that emerges from the analysis of gene abundance can suggest functional roles. Coupling taxonomic differences with gene abundance as a proxy for functional and biological significance allows hypotheses to be formed for testing future interventions that could modify the gut microbiota in a beneficial way for the health of patients.
2 Materials and Methods
2.1 Subject Enrollment
The study was approved by the Nemours Institutional Review Board #1458092 and #822736. Parental permission was obtained from each infant’s parent or guardian. The FASTQ data will be deposited online at SRA (or equivalent). Thirty-four healthy term infants between 5 days and 100 days of age who were exclusively breast fed (n=24; age range 5-95 days) or formula fed (n=10; age range 10-100 days) were recruited. Infants were excluded if they had any other sources of nutrition, dietary restrictions (e.g., hypoallergenic formula), consumed higher density formula (>20 calories/ounce), had exposure to antibiotics, or had any gastrointestinal infection or disease that affected the integrity of the intestinal mucosa. Fecal samples and clinical data on infants were collected, including demographic information, maternal and paternal age (years) at infant’s birth, maternal and paternal height and weight, delivery method, maternal antibiotic use (breast-feeding mothers only), and maternal over the counter or prescription medications taken during pregnancy.
2.2 Sample Collection
Soiled diapers were sampled within 12 hours of defecation. Stool was collected by application of two duplicate swabs (Copan Diagnostics, Murrieta, CA) for metagenomics sequencing. The containers were placed immediately into a dry ice ethanol bath and then transferred to a -80°C freezer until processing.
2.3 DNA Extraction and Sequencing
DNA extraction and sequencing were completed at the Microbiome Center at the Children’s Hospital of Philadelphia. DNA was extracted from samples using the DNeasy PowerSoil kit according to the manufacturer’s instructions (Qiagen, Germantown, MD). Libraries were generated from 1 ng of DNA using the NexteraXT kit (Illumina, San Diego, CA) and sequenced on the Illumina HiSeq 2500 using 2x125bp chemistry in high output mode. Extraction controls (no template) and DNA free water were included to empirically assess environmental and reagent contamination. Laboratory-generated mock communities consisting of DNA from Vibrio campbellii, Cryptococcus diffluens, and Lambda phage were included as positive controls.
2.4 Bioinformatics Analysis
Microbiome NGS library analysis was performed using the same pipeline as our previous study (Di Guglielmo et al., 2019) with a few modifications. Briefly, the “QC” part of the Sunbeam pipeline (Clarke et al., 2019) was used to remove adapters, human, and PhiX contamination before the Sunbeam “Classify” portion used a Kraken1 (Wood and Salzberg, 2014) database built on October 23, 2018 to classify the decontaminated reads. Trimmed mean of M-values normalization and statistical testing were performed with edgeR (Robinson et al., 2010) and DESeq2 (Love et al., 2014) to calculate statistically significant differentially represented genera.
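The FDR thresholds reported for edgeR and DESeq2 can be illustrated with a minimal sketch of the Benjamini-Hochberg adjustment, which is the default multiple-testing correction in both packages; whether the defaults were kept in this study is an assumption. The function name `benjamini_hochberg` and the p-values below are hypothetical and are not taken from the study data.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    m = len(p_values)
    # Walk from the largest p-value to the smallest so that a running minimum
    # enforces monotonicity of the adjusted values.
    order = sorted(range(m), key=lambda i: p_values[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for position, idx in enumerate(order):
        rank = m - position  # 1-based rank in ascending order
        candidate = p_values[idx] * m / rank
        running_min = min(running_min, candidate)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw p-values for five genera (illustration only).
raw_p = [0.001, 0.008, 0.039, 0.041, 0.60]
fdr = benjamini_hochberg(raw_p)

# A genus would be called differentially represented at FDR <= 0.01.
significant = [q <= 0.01 for q in fdr]
```

In the study, a genus was reported only when it passed the threshold in both edgeR and DESeq2, which corresponds to intersecting two such lists of calls.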
Shannon diversity indexes were calculated via the VEGAN R package (Dixon, 2003). Metagenome construction was done via MEGAHIT (Li et al., 2015) using concatenated decontaminated FASTQ files as inputs. Prodigal (Hyatt et al., 2010) and NCBI COGs (Tatusov et al., 1997) were used for gene prediction and annotation. STAR (Dobin et al., 2013) was used to map individual samples’ decontaminated FASTQ files to the metagenome. RSEM (Li and Dewey, 2011) was used to count the number of reads mapping to unique genes, and a custom script was used to count the number of reads mapping to unique National Center for Biotechnology Information COGs. The edgeR and DESeq2 packages were used to calculate statistically significant differentially represented genes and COGs between cohorts. Heatmaps were generated using the pheatmap R package. NGS read depths are listed in Supplemental Table 1 .
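As a rough illustration of the Shannon diversity index computed per sample, the sketch below uses the natural logarithm, which is the default base in the vegan `diversity` function; the function name `shannon_index` and the example counts are hypothetical.

```python
import math


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln(p_i)) over taxa with non-zero counts."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one positive value")
    h = 0.0
    for c in counts:
        if c > 0:
            p = c / total
            h -= p * math.log(p)
    return h


# Hypothetical genus-level read counts for two samples (illustration only).
even_sample = [10, 10, 10, 10]   # four equally abundant genera
dominated_sample = [40, 0, 0, 0]  # a single genus accounts for all reads

h_even = shannon_index(even_sample)
h_dominated = shannon_index(dominated_sample)
```

An even community gives the maximum value for its number of genera (ln 4 here), while a sample dominated by one genus gives zero.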
2.5 Direct PCR Validation
Specific genes that were more abundant or less abundant in either cohort, or that mapped to specific COGs having statistically significant differences in representation between cohorts, were subjected to validation using primers specific for each gene. The PCRs were performed using Takara (Takara Bio Inc., Shiga, Japan) 50X Titanium Taq DNA polymerase. Each 25 µl PCR reaction contained 2.5 µl 10X Takara Taq buffer (S1793), 0.5 µl 10 mM dNTP mix (Sigma cat# D7295), 1 µl 10 µM oligos (IDT), 0.25 µl 50X Takara Titanium Taq DNA polymerase (S1792), 1 µl DNA (extracted from fecal samples), and 19.75 µl H2O. The PCR conditions were 5 minutes at 95°C for 1 cycle; 35 cycles of 30 seconds at 95°C, 30 seconds at 66°C, and 30 seconds at 72°C; and 7 minutes at 72°C for 1 cycle. Five microliters of each PCR were run on a 3% NuSieve agarose gel in 1X TAE buffer and visualized using ethidium bromide staining. NGS reads for the target genes are displayed in Supplemental Table 2; qPCR primers used for the target genes are listed in Supplemental Table 3.
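The reaction recipe and cycling program can be checked with simple arithmetic. The sketch below sums the listed component volumes, scales them into a master mix, and totals the programmed hold times, reading the three 30-second steps as one 35-cycle loop. The helper `master_mix` and its 10% pipetting overage are illustrative assumptions, not part of the published protocol.

```python
# Per-reaction volumes in microliters, as listed in the text.
REACTION_UL = {
    "10X Taq buffer": 2.5,
    "10 mM dNTP mix": 0.5,
    "10 uM oligos": 1.0,
    "50X Titanium Taq": 0.25,
    "template DNA": 1.0,
    "water": 19.75,
}

# The components should add up to the stated 25 ul reaction volume.
total_ul = sum(REACTION_UL.values())


def master_mix(n_reactions, overage=0.10):
    """Scale every non-template component for n reactions.

    The default 10% overage is an assumption for illustration only.
    """
    scale = n_reactions * (1 + overage)
    return {
        name: round(volume * scale, 2)
        for name, volume in REACTION_UL.items()
        if name != "template DNA"  # template is added to each tube separately
    }


# Programmed hold time only (ramp times are instrument dependent and ignored).
hold_seconds = 5 * 60 + 35 * (30 + 30 + 30) + 7 * 60
hold_minutes = hold_seconds / 60
```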
3 Results
3.1 Patient Demographics
Thirty-four subjects were enrolled, and duplicate fecal samples were processed for each subject. Table 1 details demographic information about subjects. Twenty-four infants were exclusively breast fed and 10 were exclusively formula fed. No subjects were exposed to antibiotics. There were no statistical differences noted in the demographic data ( Table 1 ) between the BF and FF cohort except for delivery method (p-value <0.01).
Table 1.
Characteristic | Breast Fed (n = 24) | Formula Fed (n = 10) | P |
---|---|---|---|
Sex, Female | 54% | 20% | 0.07 |
Age, days (mean, SD) | 48.4, 32.4 | 53.7, 24.3 | 0.59 |
Age, days (median, IQR) | 37.5, 63 | 54, 18.8 | |
Race, Caucasian | 67% | 70% | 0.85 |
Ethnicity, Non-Hispanic | 79% | 80% | 0.96 |
Delivery method, SVD | 79% | 30% | <0.01 |
Birth weight, grams (mean, SD) | 3317, 407 | 3335, 221 | 0.87 |
Enrollment weight, grams (mean, SD) | 4519, 996 | 4554, 921 | 0.92 |
Maternal age, years (mean, SD) | 30.9, 4.7 | 31.1, 6.2 | 0.92 |
Paternal age, years (mean, SD) | 32.6, 6.1 | 33.4, 7.8 | 0.78 |
Maternal BMI, kg/m2 (mean, SD) | 27.6, 7.5 | 26.6, 3.4 | 0.54 |
Maternal pre-pregnancy BMI, kg/m2 (mean, SD) | 26.4, 7.9 | 26.3, 5.1 | 0.97 |
Paternal BMI, kg/m2 (mean, SD) | 27.7, 8.2 | 28.7, 7.3 | 0.75 |
P values for categorical variables were calculated using the chi-squared test; p values for numerical variables were calculated using Student’s t-test. SD, standard deviation; IQR, interquartile range; SVD, spontaneous vaginal delivery; BMI, body mass index.
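The categorical comparisons in Table 1 can be sketched as a Pearson chi-squared test on a 2x2 table. The counts below are back-calculated from the reported percentages (for example, 79% of 24 taken as 19 infants) and are therefore an assumption, as is the absence of a continuity correction; the function name `chi2_2x2` is hypothetical.

```python
import math


def chi2_2x2(a, b, c, d):
    """Pearson chi-squared test (no continuity correction) for [[a, b], [c, d]].

    Returns (statistic, p_value) with one degree of freedom.
    """
    observed = ((a, b), (c, d))
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    n = a + b + c + d
    statistic = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed[i][j] - expected) ** 2 / expected
    # Survival function of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Delivery method: assumed 19 of 24 BF and 3 of 10 FF infants born by SVD.
delivery_stat, delivery_p = chi2_2x2(19, 5, 3, 7)

# Sex: assumed 13 of 24 BF and 2 of 10 FF infants female.
sex_stat, sex_p = chi2_2x2(13, 11, 2, 8)
```

Under these assumed counts, the delivery-method p-value falls below 0.01 and the sex p-value rounds to 0.07, in line with the values reported in Table 1.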
3.2 Metagenomic Sequencing Beta-Diversity and Genera Analysis
The genera abundance was analyzed per cohort, FF and BF, and plotted as relative % abundance ( Supplemental Figure 1A ). There were similarities noted in presence/absence of genera, including Bacteroides, Klebsiella, Bifidobacterium, Escherichia, and Veillonella; however, differences in abundances were noted between the cohorts. Genera abundances were also plotted per sample ( Supplemental Figure 1B ) and differences within a cohort were noted. There were consistent patterns between cohorts, with seven of the 20 most abundant genera ( Figure 1A ) having abundance differences that were statistically significant (asterisks). The distribution of the Shannon diversity index was examined per each cohort and plotted as box-whisker plots ( Figure 1B ), and a wider distribution was noted in the BF cohort compared with the FF cohort, but the comparison did not reach significance (Welch’s t-test p-value = 0.0613). In total, 39 genera exhibited differences in abundance that were statistically significant between the FF and BF cohorts (p-value < 0.05 with an FDR ≤ 0.01 for both edgeR and DESeq2) ( Supplemental Table 4 ). Consistent with our previous metagenomics work studying the metagenome of FF versus BF infants (Di Guglielmo et al., 2019), Parabacteroides, Haemophilus, Enterococcus, Staphylococcus, and Phietavirus were differentially represented between the cohorts ( Supplemental Table 4 , bold). To determine consistency of the bio-replicates in the BF and FF cohorts, a Bray-Curtis dissimilarity and Jaccard distance principal coordinate analysis were conducted ( Supplemental Figure 2 ). The Bray-Curtis plot ( Supplemental Figure 2A ) demonstrated consistency between the bio-replicates (circle and triangles) using species abundance data, and the Jaccard distance plot ( Supplemental Figure 2B ) demonstrated consistency between bio-replicates (circle and triangles) using binary (plus/minus) species data.
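The two distance measures behind the principal coordinate analyses differ in what they use: Bray-Curtis works on abundances, whereas Jaccard works on presence/absence only. A minimal sketch follows; the function names and the two example abundance profiles are hypothetical.

```python
def bray_curtis(u, v):
    """Bray-Curtis dissimilarity between two abundance vectors of equal length."""
    numerator = sum(abs(a - b) for a, b in zip(u, v))
    denominator = sum(a + b for a, b in zip(u, v))
    return numerator / denominator if denominator else 0.0


def jaccard_distance(u, v):
    """Jaccard distance computed on binary presence/absence of each taxon."""
    present_u = {i for i, a in enumerate(u) if a > 0}
    present_v = {i for i, b in enumerate(v) if b > 0}
    union = present_u | present_v
    if not union:
        return 0.0
    return 1 - len(present_u & present_v) / len(union)


# Hypothetical species abundances for two samples (illustration only).
sample_1 = [6, 0, 4]
sample_2 = [2, 2, 0]

bc = bray_curtis(sample_1, sample_2)      # uses abundance differences
jd = jaccard_distance(sample_1, sample_2)  # uses shared presence only
```

A matrix of such pairwise distances across all samples is the input to a principal coordinate analysis.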
3.3 Gene Level Analysis
To determine potential functional differences in the metagenomes between the FF and BF cohorts, a gene level analysis was conducted by creating a co-assembly of all the sequencing data that was then utilized to map and annotate individual level sample data. In total, 1,734 genes (annotated via Prodigal) were identified as statistically different in abundance (count data) between the FF and BF cohorts ( Supplemental Table 5 ). Genes that were higher in abundance in FF samples included functional annotations (NCBI COG) such as DNA segregation ATPase, NADPH ubiquinone oxidoreductase subunit 4, DNA topoisomerase IA, and a sugar phosphate permease ( Supplemental Table 5 , bold). Genes that were higher in abundance in BF samples included functional annotations such as retron-type reverse transcriptase, type IV secretory pathway (relaxase), beta-galactosidase/beta-glucuronidase, and OmpR family ( Supplemental Table 5 , underline).
To determine if a higher-level analysis of the COGs would improve the interpretability of the functional annotations, COGs were collapsed/rolled up based on their functional hierarchy. The abundance differences of 60 COGs were identified as statistically significant when comparing the FF cohort with the BF cohort ( Table 2 ). Using COG count level data, genes and COGs were clustered and visualized using a heatmap approach ( Supplemental Figure 3 ).
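The roll-up of gene-level counts to COGs, and of COGs to functional categories, can be sketched as a two-step aggregation. This is not the authors' custom script: the gene identifiers and read counts are hypothetical, and only the COG identifiers and categories are taken from Table 2.

```python
from collections import defaultdict


def roll_up(counts, mapping):
    """Sum counts over the group that each key maps to; unmapped keys are skipped."""
    totals = defaultdict(int)
    for key, count in counts.items():
        group = mapping.get(key)
        if group is not None:
            totals[group] += count
    return dict(totals)


# Hypothetical per-gene read counts for one sample (illustration only).
gene_counts = {"gene_a": 12, "gene_b": 30, "gene_c": 7, "gene_d": 5, "gene_e": 3}

# Hypothetical gene-to-COG annotation; gene_e has no COG assignment.
gene_to_cog = {
    "gene_a": "COG0738",
    "gene_b": "COG0738",
    "gene_c": "COG0362",
    "gene_d": "COG3549",
}

# COG-to-category assignments as listed in Table 2.
cog_to_category = {
    "COG0738": "Carbohydrate transport and metabolism",
    "COG0362": "Carbohydrate transport and metabolism",
    "COG3549": "Defense mechanisms",
}

cog_counts = roll_up(gene_counts, gene_to_cog)
category_counts = roll_up(cog_counts, cog_to_category)
```

Repeating this for each sample yields the COG-level count matrix that a differential abundance test would take as input.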
Table 2.
COG | Description | Category | logFC | logCPM | FDR-edgeR | Padj- DESeq2 |
---|---|---|---|---|---|---|
COG3549 | Plasmid maintenance system killer protein | Defense mechanisms | -6.303 | 4.665 | 1.47E-03 | 1.52E-03 |
COG3914 | Predicted O-linked N-acetylglucosamine transferase, SPINDLY family | Posttranslational modification, protein turnover, chaperones | -5.142 | 4.459 | 3.90E-04 | 3.93E-08 |
COG4115 | Toxin component of the Txe-Axe toxin-antitoxin module, Txe/YoeB family | Defense mechanisms | -4.391 | 9.353 | 6.97E-04 | 1.33E-06 |
COG5527 | Protein involved in initiation of plasmid replication | Mobilome: prophages, transposons | -4.302 | 12.060 | 2.95E-04 | 5.59E-08 |
COG3256 | Nitric oxide reductase large subunit | Inorganic ion transport and metabolism | -3.775 | 4.241 | 3.05E-03 | 1.53E-05 |
COG5314 | Conjugal transfer/entry exclusion protein | Mobilome: prophages, transposons | -3.698 | 7.229 | 6.07E-03 | 2.46E-04 |
COG4292 | Low temperature requirement protein LtrA (function unknown) | Function unknown | -3.635 | 3.580 | 1.15E-03 | 3.74E-06 |
COG4132 | ABC-type uncharacterized transport system, permease component | General function prediction only | -3.528 | 7.289 | 4.75E-03 | 2.48E-04 |
COG4413 | Urea transporter | Amino acid transport and metabolism | -3.527 | 4.633 | 8.87E-03 | 3.16E-04 |
COG4146 | Uncharacterized membrane permease YidK, sodium:solute symporter family | General function prediction only | -2.663 | 7.585 | 4.26E-04 | 1.62E-06 |
COG3542 | Predicted sugar epimerase, cupin superfamily | General function prediction only | -2.640 | 4.961 | 8.87E-03 | 2.19E-04 |
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons | -2.579 | 7.767 | 6.32E-03 | 1.97E-04 |
COG5520 | O-Glycosyl hydrolase | Cell wall/membrane/envelope biogenesis | -2.201 | 9.260 | 6.32E-03 | 5.67E-04 |
COG4372 | Uncharacterized conserved protein, contains DUF3084 domain | Function unknown | -1.888 | 10.155 | 1.24E-03 | 5.57E-05 |
COG0728 | Peptidoglycan biosynthesis protein MviN/MurJ, putative lipid II flippase | Cell wall/membrane/envelope biogenesis | -1.621 | 9.419 | 8.71E-03 | 9.85E-04 |
COG1004 | UDP-glucose 6-dehydrogenase | Cell wall/membrane/envelope biogenesis | -1.322 | 9.261 | 5.24E-03 | 6.10E-04 |
COG0627 | S-formylglutathione hydrolase FrmB | Defense mechanisms | -1.241 | 8.546 | 3.17E-03 | 1.24E-04 |
COG0362 | 6-phosphogluconate dehydrogenase | Carbohydrate transport and metabolism | -1.118 | 9.309 | 5.19E-03 | 6.79E-04 |
COG0657 | Acetyl esterase/lipase | Lipid transport and metabolism | -1.051 | 10.130 | 7.05E-03 | 6.64E-04 |
COG0110 | Acetyltransferase (isoleucine patch superfamily) | General function prediction only | -0.885 | 9.518 | 6.32E-03 | 1.92E-04 |
COG0738 | Fucose permease | Carbohydrate transport and metabolism | -0.780 | 10.289 | 6.64E-03 | 1.53E-04 |
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis | 0.964 | 9.148 | 4.33E-03 | 7.24E-03 |
COG3887 | c-di-AMP phosphodiesterase, consists of a GGDEF-like and DHH domains | Signal transduction mechanisms | 1.201 | 7.785 | 8.87E-03 | 6.00E-03 |
COG3857 | ATP-dependent helicase/DNAse subunit B | Replication, recombination and repair | 1.239 | 8.504 | 4.71E-03 | 3.88E-03 |
COG0301 | Adenylyl- and sulfurtransferase ThiI, participates in tRNA 4-thiouridine and thiamine biosynthesis | Coenzyme transport and metabolism | 1.298 | 6.853 | 7.05E-03 | 5.84E-03 |
COG3290 | Sensor histidine kinase regulating citrate/malate metabolism | Signal transduction mechanisms | 1.377 | 9.331 | 1.60E-03 | 2.26E-03 |
COG4932 | Uncharacterized surface anchored protein | Function unknown | 1.412 | 11.054 | 8.60E-04 | 1.20E-03 |
COG0825 | Acetyl-CoA carboxylase alpha subunit | Lipid transport and metabolism | 1.423 | 6.612 | 3.75E-03 | 2.32E-03 |
COG1199 | Rad3-related DNA helicase | Replication, recombination and repair | 1.442 | 8.417 | 3.61E-03 | 6.21E-03 |
COG2357 | ppGpp synthetase catalytic domain (RelA/SpoT-type nucleotidyltranferase) | Nucleotide transport and metabolism | 1.444 | 6.705 | 3.05E-03 | 1.50E-03 |
COG4720 | Uncharacterized membrane protein | Function unknown | 1.509 | 6.663 | 1.12E-03 | 3.33E-04 |
COG1638 | TRAP-type C4-dicarboxylate transport system, periplasmic component | Carbohydrate transport and metabolism | 1.572 | 8.267 | 3.90E-04 | 7.41E-04 |
COG4109 | Predicted transcriptional regulator containing CBS domains | Transcription | 1.808 | 5.657 | 8.92E-04 | 3.78E-04 |
COG3688 | Predicted RNA-binding protein containing a PIN domain | General function prediction only | 1.824 | 6.121 | 6.09E-04 | 1.22E-04 |
COG1307 | Fatty acid-binding protein DegV (function unknown) | Lipid transport and metabolism | 1.856 | 7.624 | 9.16E-05 | 6.39E-05 |
COG3331 | Penicillin-binding protein-related factor A, putative recombinase | General function prediction only | 1.864 | 4.516 | 4.87E-04 | 1.18E-04 |
COG1001 | Adenine deaminase | Nucleotide transport and metabolism | 1.889 | 7.376 | 3.10E-03 | 3.36E-03 |
COG4753 | Two-component response regulator, YesN/AraC family, consists of REC and AraC-type DNA-binding domains | Transcription | 1.900 | 7.971 | 3.17E-03 | 4.88E-03 |
COG4709 | Uncharacterized membrane protein | Function unknown | 1.913 | 4.692 | 1.88E-03 | 9.03E-04 |
COG2179 | Predicted phosphohydrolase YqeG, HAD superfamily | General function prediction only | 1.935 | 4.166 | 6.40E-04 | 2.00E-04 |
COG1671 | Uncharacterized conserved protein YaiI, UPF0178 family | Function unknown | 1.975 | 5.369 | 4.75E-03 | 6.48E-03 |
COG1344 | Flagellin and related hook-associated protein FlgL | Cell motility | 2.007 | 8.124 | 3.95E-03 | 9.53E-03 |
COG0727 | Fe-S-cluster containing protein | General function prediction only | 2.009 | 5.909 | 6.19E-04 | 3.68E-04 |
COG1345 | Flagellar capping protein FliD | Cell motility | 2.022 | 7.356 | 3.75E-03 | 5.88E-03 |
COG4640 | Uncharacterized membrane protein YvbJ | Function unknown | 2.153 | 5.735 | 2.94E-03 | 2.87E-03 |
COG4717 | Uncharacterized protein YhaN | Function unknown | 2.199 | 5.526 | 1.97E-03 | 1.73E-03 |
COG3760 | Uncharacterized protein | Function unknown | 2.206 | 4.658 | 6.19E-04 | 5.54E-04 |
COG4862 | Negative regulator of genetic competence, sporulation and motility | Transcription | 2.301 | 4.990 | 3.05E-04 | 1.29E-04 |
COG2607 | Predicted ATPase, AAA+ superfamily | General function prediction only | 2.381 | 5.081 | 3.17E-03 | 4.87E-03 |
COG4728 | Uncharacterized protein | Function unknown | 2.486 | 3.669 | 4.87E-04 | 1.97E-04 |
COG3108 | Uncharacterized conserved protein YcbK, DUF882 family | Function unknown | 2.545 | 6.373 | 3.05E-04 | 6.34E-04 |
COG1775 | Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB | Secondary metabolites biosynthesis, transport and catabolism | 2.592 | 6.286 | 4.54E-04 | 6.34E-04 |
COG4509 | Uncharacterized protein | Function unknown | 2.602 | 6.946 | 3.26E-05 | 1.31E-05 |
COG1645 | Uncharacterized Zn-finger containing protein, UPF0148 family | General function prediction only | 2.617 | 3.892 | 2.95E-04 | 1.62E-04 |
COG4478 | Uncharacterized membrane protein | Function unknown | 2.657 | 4.231 | 5.77E-05 | 1.53E-05 |
COG1036 | Archaeal flavoprotein | Energy production and conversion | 2.765 | 3.623 | 3.05E-04 | 2.45E-04 |
COG4769 | Uncharacterized membrane protein | Function unknown | 2.999 | 3.649 | 3.94E-05 | 2.44E-05 |
COG4805 | Uncharacterized conserved protein, DUF885 family | Function unknown | 3.142 | 5.282 | 2.84E-03 | 5.28E-03 |
COG4223 | Uncharacterized conserved protein | Function unknown | 3.541 | 4.149 | 1.46E-03 | 5.42E-03 |
COG4939 | Major membrane immunogen, membrane-anchored lipoprotein | Function unknown | 4.122 | 2.791 | 6.29E-06 | 5.08E-05 |
The abundance differences of 60 COGs that were statistically significant when comparing the formula-fed cohort with the breast-fed cohort are listed with COG category and an “example gene” description. Highlighted COGs represent the top 10 most abundant COGs in breast-fed (light gray) and formula-fed (dark gray) samples. LogFC values are relative to breast-fed abundance. Negative logFC indicates a COG was n-fold lower in formula-fed compared with breast-fed samples; positive logFC indicates a COG was n-fold higher in formula-fed compared with breast-fed samples.
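To read the logFC column as a fold difference, the value can be exponentiated. The sketch below assumes the logFC values are base-2 logarithms, as edgeR reports them; the function name `fold_change` is hypothetical, and the two logFC values are taken from the first and last rows of Table 2.

```python
def fold_change(log_fc):
    """Convert a log2 fold change into (fold magnitude, direction relative to BF)."""
    magnitude = 2 ** abs(log_fc)
    if log_fc > 0:
        direction = "higher in FF"
    elif log_fc < 0:
        direction = "lower in FF"
    else:
        direction = "unchanged"
    return magnitude, direction


# COG3549 (plasmid maintenance system killer protein), logFC = -6.303
cog3549_fold, cog3549_direction = fold_change(-6.303)

# COG4939 (major membrane immunogen), logFC = 4.122
cog4939_fold, cog4939_direction = fold_change(4.122)
```

Under the log2 assumption, COG3549 is roughly 79-fold lower and COG4939 roughly 17-fold higher in formula-fed samples.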
A total of 21 COGs showed greater abundance in the BF group and 39 exhibited higher abundance in the FF group ( Figure 2 ). Patterns of differences between the two cohorts for a variety of COGs were observed. Four COG categories were significantly overrepresented in BF samples: amino acid transport and metabolism, defense mechanisms, mobilome, and inorganic ion transport and metabolism. For FF samples, five categories were significantly overrepresented: cell motility; nucleotide transport and metabolism; replication, recombination, and repair; signal transduction mechanisms; and transcription.
The COGs that were overrepresented in each cohort were further analyzed using directed PCR to validate presence/absence of specific genes within each identified COG ( Figures 3A, B ). Specific primers were designed to amplify 11 additional genes within the COG categories that were suggested as having the most variance/difference between cohorts (defense mechanisms, carbohydrate metabolism, signal transduction, and mobilome). Some genes were completely absent in the samples tested, as expected; others were amplified in both cohorts, though with incongruence between amplification and the NGS raw reads ( Figures 3A, B ). The PCRs are end-point assays and are representative of differences between cohorts.
3.3.1 Specific Genes
To evaluate and validate the shotgun metagenomics data, we conducted polymerase chain reaction (PCR) followed by gel electrophoresis on the candidate markers identified. Four samples were selected from the BF cohort and four samples from the FF cohort based on sample availability for validation. The number of NGS reads supporting the abundance for a given gene and sample is displayed under the gel band. Figure 3A focuses on five genes identified in the shotgun metagenomics analysis that are related to defense mechanisms, carbohydrate metabolism, or signal transduction pathways. As demonstrated in the gel pictures, the amplification patterns correlated with the abundances predicted from the NGS read counts. Gene ID 156409 was the only gene in which there was a perfect absence (FF) and presence (BF) between cohorts. Consistent with other metagenomic publications (Baumann-Dudenhoeffer et al., 2018), it is not uncommon to detect variability within a cohort, even when statistically significant differences are noted at the population level (BF vs FF).
Figure 3B focuses on six genes identified in the shotgun metagenomics analysis that are related to signal transduction pathways, mobilome, and carbohydrate metabolism. As with the genes analyzed in Figure 3A, the amplification patterns correlated with the abundances predicted from the NGS read counts. Supplemental Table 2 contains all the analyzed subjects’ shotgun metagenomics NGS read count data.
4 Discussion
The metagenomic analysis presented here demonstrates a robust and useful tool for analyzing fecal microbiomes early in life. In the present study, a larger cohort expands the sensitivity of the analysis from prior work (Di Guglielmo et al., 2019) and allows a more in-depth analysis of COG and gene differences in the gut microbiome between the two cohorts, FF and BF, of young infants. Understanding whether these differences are biologically significant, and whether they are permanent through childhood, remains a goal of both this study and a future longitudinal study. The Bray-Curtis PCoA clustering suggests separation between cohorts based on species abundance, further reinforcing that infant feeding, even at early ages, influences both the differences and the similarities in the gut microbiota.
4.1 Metataxonomic Trends and Differences
The diversity and differentiation between cohorts are represented, both in aggregate and by individual subject, in Supplemental Figure 1. A wider distribution of diversity is seen in the BF cohort; however, the index distribution values are lower compared with the FF cohort. While some subjects stand out as unique, the overall trend represents diversity differentiation, with the FF cohort having a greater diversity (overall Shannon diversity index approaching statistical significance). Specifically, one subject in the FF cohort had a very high abundance of Bacteroides, which was almost uniformly seen in the BF cohort with increased abundance, likely skewing the Shannon diversity index. A greater diversity in FF infants has been associated with poorer health outcomes and dysbiosis and is contrasted by a lower diversity in BF infants (Schwartz et al., 2012; Savage et al., 2018; Davis et al., 2020).
Thirty-nine genera are differentially represented between cohorts, with a pattern similar to that in our prior work (Di Guglielmo et al., 2019). Of note, between our previously reported cohorts and our expanded cohort, the genera with statistically significant differences between cohorts (Figure 1A, Supplemental Table 4) are consistent: Parabacteroides, Haemophilus, Clostridioides, and Staphylococcus. In the expanded cohort, Bifidobacterium, Enterococcus, and Lachnoclostridium are also statistically significantly different between cohorts. Breast-fed infants have greater relative abundance of Bifidobacterium, a gram-positive bacterium of phylum Actinobacteria, used in many probiotics. As expected, the Bifidobacterium genus is highly abundant in the BF cohort, consistent with prior reports (Henrick et al., 2018; Karav et al., 2018; Henrick et al., 2019). Bifidobacterium is strongly associated with breast milk feeding and is therefore expected to be found in the infant gut. Lachnoclostridium, a genus in the phylum Firmicutes of unknown pathogenic potential, is more abundant in the FF cohort. The greater abundance of Enterococcus in the FF cohort may also reflect a microbial shift toward greater pathogenic potential in the gut of these infants. The trend bears further investigation in terms of both gene abundance and metabolic output to determine clinical and biological significance for these infants (Fukuda et al., 2012; den Besten et al., 2013). Caesarean section delivery was more frequent in the FF infants, and the mode of delivery is known to influence the infant microbiota/microbiome (Rutayisire et al., 2016; Korpela, 2021). In the cohort genera metataxonomic analysis (Figure 1A), data show that Bifidobacterium is comparable between cohorts, while Klebsiella and Bacteroides are not. Escherichia was similar between cohorts, while Veillonella was not. While differences between delivery modes have been studied, the results in this cohort are not completely aligned with prior studies.
Whole-genome metagenomics offers a more in-depth analysis that can help ascertain whether the influence of delivery mode on taxonomy alone is enough to cause the cohort differences.
4.2 Metagenomic Observations and Implications
Sixty COGs were significantly more abundant in either BF or FF infants ( Table 2 ). Examining those COGs that are significantly overrepresented in either FF or BF cohorts ( Figure 2 ) reveals some notable patterns. There are COGs with specific functions that are absent or present in each of the cohorts. Should these COGs and the protein domains they represent result in the loss or gain of protein function, the abundance differences may influence metabolically active, beneficial, or pathologic proteins within the gut. It is not yet clear if these changes are permanent or whether they contribute to intra- or extra-intestinal disorders in infancy and childhood.
Using the difference in abundance of these 60 COGs as informed by the heatmap ( Supplemental Figure 3 ), we focused on four COGs: carbohydrate transport and metabolism; defense mechanisms; mobilome—prophages, transposons; and signal transduction mechanisms ( Figure 2 ). In our prior work, we observed some genes that were present only, or predominantly, in one cohort vs. the other, namely CRISPR-Cas9 and carboxypeptidase (Di Guglielmo et al., 2019).
For the current study, specific genes from each COG show patterns similar to the NGS read counts and align with the abundance data (Figure 3). Five out of five genes more abundant in the BF samples analyzed are entirely or almost entirely absent from the FF samples. Four genes (309214, 266471, 114703, 145511) that are more abundant in FF infants are completely absent in the BF samples analyzed; two genes (316412, 345373) that are more abundant in FF infants are mostly absent from the BF samples, being detected in two BF samples and not detected in two others (Figure 3; Supplemental Table 2 contains the supporting NGS read count data). Of the genes completely absent in our samples from either cohort, the pattern indicates that defense mechanism genes are absent (two of three) in FF samples while a carbohydrate metabolism gene, two signal transduction genes, and a mobilome gene are absent in some BF samples. For the defense mechanism genes, this indicates that FF infant microbiota may lack the ability to thwart DNA-level changes that could confer greater susceptibility to mutative changes. The implication is the potential in FF infants for the introduction of pathologic or inflammatory proteins affecting gut immune stability and overall homeostasis (Fallani et al., 2010; Guaraldi and Salvatori, 2012; Schwartz et al., 2012; Turroni et al., 2020; Sarkar et al., 2021). For the BF infant microbiota, some of the genes in these categories may be associated with reduced propensity for metabolic shifts that confer dysbiosis or altered signal transduction. Regarding mobilome genes, a loss or gain of horizontal gene transfer through mobile genetic elements (Jørgensen et al., 2015; Mancino et al., 2019; Carr et al., 2021), related to selection pressure-driven changes, may facilitate or impede antibiotic resistance in the differently fed cohorts.
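The distinction drawn above between genes that are "completely absent" and "mostly absent" in a cohort can be made explicit by tallying, for each gene, the fraction of samples in which it is detected. The sketch below is one way to do this and is not the analysis code used in this study. The gene labels, the read counts, and the detection threshold of one read are all hypothetical assumptions and do not come from Supplemental Table 2.

```python
def detection_fraction(read_counts, min_reads=1):
    """Fraction of samples in which a gene has at least `min_reads` reads."""
    if not read_counts:
        raise ValueError("read_counts must not be empty")
    detected = sum(1 for c in read_counts if c >= min_reads)
    return detected / len(read_counts)

# Hypothetical per-sample read counts (illustration only, not study data).
counts = {
    "gene_X": {"BF": [120, 85, 40, 230], "FF": [0, 0, 0, 0]},   # absent in FF
    "gene_Y": {"BF": [0, 0, 15, 9], "FF": [60, 75, 110, 48]},   # mostly absent in BF
}

# For every gene, the share of samples per cohort with the gene detected.
summary = {
    gene: {cohort: detection_fraction(vals) for cohort, vals in by_cohort.items()}
    for gene, by_cohort in counts.items()
}

for gene, fractions in summary.items():
    print(gene, fractions)
```

A detection fraction of 0 corresponds to "completely absent" in that cohort, while an intermediate value such as 0.5 corresponds to a gene detected in only some samples.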
Based on the function of these genes, and what they contribute to in terms of either signaling, carbohydrate metabolism, or health of the gut bacteria themselves, we may be able to examine impact on metabolic (obesity), immunologic (antibiotic-resistance), and other long-term health issues. Does a more dysbiotic young infant gut lead to cellular stress or inflammation that interferes with gut signaling? This is a hypothesis that warrants further study. The gut may normalize over time based on later infancy and toddlerhood diets such that there is no permanent impact. Conversely, these microbial influences and changes may be more permanent (Turroni et al., 2020; Sarkar et al., 2021) because of the overabundance or underabundance of certain key organisms, the genes they express, and the proteins those genes encode. Ongoing work in our laboratory aims to demonstrate short-chain fatty acid differences that exist between cohorts as well as longitudinal taxonomic and genera/COG/gene abundance differences (Fukuda et al., 2012; den Besten et al., 2013).
4.3 Limitations
There are both strengths and limitations to the present study. Shotgun metagenomics enables a non-biased approach that can yield gene-level data, unlike 16S methods. This type of method requires extensive bioinformatic pipelines (Di Guglielmo et al., 2019) and generates data that enable gene-level analysis that can be utilized for functional interpretation, which is not possible with the traditional 16S approach. The present study has a larger sample size compared with our previous report. Limitations include that our FF cohort is less than half the size of our BF cohort, and thus outliers with different taxonomy or metagenomes/COG could affect both statistical significance and definitiveness of conclusions. The CRISPR-Cas9 gene from our prior work was detected in samples in this study but the abundance difference was not statistically significant, reflecting the smaller FF cohort size. Of note, this gene is known to be variable in different taxonomies (Crawley et al., 2018). Yet some differences are striking enough to be worth reporting and pursuing further. We cannot control for the maternal or environmental microbiome and their impact/influence on even very young infants, as we see no clear pattern in presence/absence of genera in individual subjects’ samples regardless of sex and age at sample collection or delivery method. We acknowledge that delivery method was statistically different between cohorts (Table 1), which may present a confounding variable. In prior studies with small cohorts, Caesarean section delivery was either an exclusion criterion, with no data presented to make a comparison (Lee et al., 2015), or not mentioned at all (Schwartz et al., 2012). We note that in our study, within a cohort, there was variability in the relative abundances of genera between samples from babies born via vaginal delivery and between samples from babies born via caesarean section delivery.
Nonetheless, the data do show that there are notable summative differences in the cohorts. In a potential future study with larger sample size, a covariate regression analysis could be completed to ascertain the role of delivery; however, in our study our sample size is not large enough to make definitive statements about the impact of delivery. Our group has banked additional samples from infants not enrolled in this study who had the same dichotomous feeding method but did not meet all inclusion criteria. Using the analysis pipeline, and looking for similar patterns of abundance, diversity, variance, and COG overrepresentation, we may be able to elucidate patterns for each cohort beyond the strict criteria used for this study. Longitudinal microbiome data from additional collected samples on these patients will allow us to answer questions about whether these early changes and differences are temporary or permanent. Finally, metabolic data on these samples, currently under analysis, will demonstrate, potentially, unique characteristics by subject and cohort.
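One possible form of the covariate regression mentioned above is sketched here in the notation of the negative binomial model underlying DESeq2 (Love et al., 2014). The indicator variables for feeding and delivery and the coefficient labels are illustrative assumptions; no such model was fitted in this study.

```latex
K_{ij} \sim \mathrm{NB}\left(\mu_{ij}, \alpha_i\right), \qquad
\mu_{ij} = s_j \, q_{ij}, \qquad
\log_2 q_{ij} = \beta_{i0}
  + \beta_{i}^{\mathrm{feed}} \, x_{j}^{\mathrm{feed}}
  + \beta_{i}^{\mathrm{del}} \, x_{j}^{\mathrm{del}}
```

Here $K_{ij}$ is the read count for feature $i$ (genus, COG, or gene) in sample $j$, $s_j$ is a sample-specific size factor, $\alpha_i$ is the feature-specific dispersion, and $x_{j}^{\mathrm{feed}}$ and $x_{j}^{\mathrm{del}}$ are 0/1 indicators for feeding method and delivery mode. Under this assumed design, $\beta_{i}^{\mathrm{feed}}$ would estimate the log2 fold change attributable to feeding after adjusting for delivery mode.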
The present study demonstrates key differences between BF and FF cohorts in both taxonomic signature of the gut microbiota and metagenome. Several defense mechanism genes are virtually absent in the FF cohort, as confirmed by PCR validation. Gut bacteria in these infants, if more susceptible to phage, virus, and other transformative changes due to defense mechanism gene absence, could lead to pathogenicity and/or specific dysbiotic characteristics such as antibiotic resistance. With these changes can come inflammation and other cellular stressors that alter the ability of both microbiota, and host, to maintain homeostasis. Regardless of whether the gut microbiome signature normalizes later in life, the impact early on of gene abundance (or absence) as a proxy for function cannot be ignored. This research may lead to solutions for restoring gut health in FF infants, or those exposed to antibiotics at an early age, with the intent to boost the abundance of bacteria (Underwood et al., 2013; Karav et al., 2018; Henrick et al., 2019) that return specific proteins and their function to the gut.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA542703 and https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA789149.
Ethics Statement
The studies involving human participants were reviewed and approved by Nemours Institutional Review Board, Nemours Children’s Health, Wilmington, Delaware. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.
Author Contributions
MD, KF, and EC contributed substantially to the conception or design of the work and the acquisition, analysis, or interpretation of data for the work. MD, KF, AR, and EC drafted the work and revised it critically for important intellectual content. MD, KF, AR, and EC provided approval for publication of the content and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
Funding
Research Excellence program grant from the National Institute of General Medical Sciences of the National Institutes of Health under grant number P30 GM114736 (PI: Shaffer) and by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under grant number U54 GM104941 (PI: Binder-Macleod).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
The authors would like to acknowledge Brittni Deadrick for assistance in collecting samples and Marlee Goins and the Nemours Biobank for assistance in processing samples. Lisa Mattei and the Children’s Hospital of Philadelphia Microbiome Core are acknowledged for metagenomics assistance.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2022.816601/full#supplementary-material
Abbreviations
COG, Clusters of Orthologous Genes; BF, breast-fed; FF, formula-fed; NGS, next generation sequencing.
References
- Bäckhed F., Roswall J., Peng Y., Feng Q., Jia H., Kovatcheva-Datchary P., et al. (2015). Dynamics and Stabilization of the Human Gut Microbiome During the First Year of Life. Cell Host Microbe 17, 690–703. doi: 10.1016/j.chom.2015.04.004
- Baumann-Dudenhoeffer A. M., D’Souza A. W., Tarr P. I., Warner B. B., Dantas G. (2018). Infant Diet and Maternal Gestational Weight Gain Predict Early Metabolic Maturation of Gut Microbiomes. Nat. Med. 24, 1822–1829. doi: 10.1038/s41591-018-0216-2
- Bode L. (2012). Human Milk Oligosaccharides: Every Baby Needs a Sugar Mama. Glycobiology 22, 1147–1162. doi: 10.1093/glycob/cws074
- Carr V. R., Shkoporov A., Hill C., Mullany P., Moyes D. L. (2021). Probing the Mobilome: Discoveries in the Dynamic Microbiome. Trends Microbiol. 29, 158–170. doi: 10.1016/j.tim.2020.05.003
- Casaburi G., Duar R. M., Vance D. P., Mitchell R., Contreras L., Frese S. A., et al. (2019). Early-Life Gut Microbiome Modulation Reduces the Abundance of Antibiotic-Resistant Bacteria. Antimicrob. Resist. Infect. Control 8, 131. doi: 10.1186/s13756-019-0583-6
- Clapp M., Aurora N., Herrera L., Bhatia M., Wilen E., Wakefield S. (2017). Gut Microbiota’s Effect on Mental Health: The Gut-Brain Axis. Clin. Pract. 7, 987. doi: 10.4081/cp.2017.987
- Clarke E. L., Taylor L. J., Zhao C., Connell A., Lee J. J., Fett B., et al. (2019). Sunbeam: An Extensible Pipeline for Analyzing Metagenomic Sequencing Experiments. Microbiome 7, 46. doi: 10.1186/s40168-019-0658-x
- Cox L. M., Yamanishi S., Sohn J., Alekseyenko A. V., Leung J. M., Cho I., et al. (2014). Altering the Intestinal Microbiota During a Critical Developmental Window has Lasting Metabolic Consequences. Cell 158, 705–721. doi: 10.1016/j.cell.2014.05.052
- Crawley A. B., Henriksen E. D., Stout E., Brandt K., Barrangou R. (2018). Characterizing the Activity of Abundant, Diverse and Active CRISPR-Cas Systems in Lactobacilli. Sci. Rep. 8, 11544. doi: 10.1038/s41598-018-29746-3
- Davis E. C., Dinsmoor A. M., Wang M., Donovan S. M. (2020). Microbiome Composition in Pediatric Populations From Birth to Adolescence: Impact of Diet and Prebiotic and Probiotic Interventions. Dig Dis. Sci. 65, 706–722. doi: 10.1007/s10620-020-06092-x
- Davis E. C., Wang M., Donovan S. M. (2017). The Role of Early Life Nutrition in the Establishment of Gastrointestinal Microbial Composition and Function. Gut Microbes 8, 143–171. doi: 10.1080/19490976.2016.1278104
- den Besten G., van Eunen K., Groen A. K., Venema K., Reijngoud D. J., Bakker B. M. (2013). The Role of Short-Chain Fatty Acids in the Interplay Between Diet, Gut Microbiota, and Host Energy Metabolism. J. Lipid Res. 54, 2325–2340. doi: 10.1194/jlr.R036012
- Di Guglielmo M. D., Franke K., Cox C., Crowgey E. L. (2019). Whole Genome Metagenomic Analysis of the Gut Microbiome of Differently Fed Infants Identifies Differences in Microbial Composition and Functional Genes, Including an Absent CRISPR/Cas9 Gene in the Formula-Fed Cohort. Hum. Microb. J. 12, 100057. doi: 10.1016/j.humic.2019.100057
- Dixon P. (2003). VEGAN, a Package of R Functions for Community Ecology. J. Vegetation Sci. 14, 927–930. doi: 10.1111/j.1654-1103.2003.tb02228.x
- Dobin A., Davis C. A., Schlesinger F., Drenkow J., Zaleski C., Jha S., et al. (2013). STAR: Ultrafast Universal RNA-Seq Aligner. Bioinformatics 29, 15–21. doi: 10.1093/bioinformatics/bts635
- Fallani M., Young D., Scott J., Norin E., Amarri S., Adam R., et al. (2010). Intestinal Microbiota of 6-Week-Old Infants Across Europe: Geographic Influence Beyond Delivery Mode, Breast-Feeding, and Antibiotics. J. Pediatr. Gastroenterol. Nutr. 51, 77–84. doi: 10.1097/MPG.0b013e3181d1b11e
- Fukuda S., Toh H., Taylor T. D., Ohno H., Hattori M. (2012). Acetate-Producing Bifidobacteria Protect the Host From Enteropathogenic Infection via Carbohydrate Transporters. Gut Microbes 3, 449–454. doi: 10.4161/gmic.21214
- Garrido D., Ruiz-Moyano S., Lemay D. G., Sela D. A., German J. B., Mills D. A. (2015). Comparative Transcriptomics Reveals Key Differences in the Response to Milk Oligosaccharides of Infant Gut-Associated Bifidobacteria. Sci. Rep. 5, 13517. doi: 10.1038/srep13517
- Guaraldi F., Salvatori G. (2012). Effect of Breast and Formula Feeding on Gut Microbiota Shaping in Newborns. Front. Cell. Infect. Microbiol. 2, 94. doi: 10.3389/fcimb.2012.00094
- Henrick B. M., Chew S., Casaburi G., Brown H. K., Frese S. A., Zhou Y., et al. (2019). Colonization by B. Infantis EVC001 Modulates Enteric Inflammation in Exclusively Breastfed Infants. Pediatr. Res. 86, 749–757. doi: 10.1038/s41390-019-0533-2
- Henrick B. M., Hutton A. A., Palumbo M. C., Casaburi G., Mitchell R., Underwood M. A., et al. (2018). Elevated Fecal pH Indicates a Profound Change in the Breastfed Infant Gut Microbiome Due to Reduction of Bifidobacterium Over the Past Century. MSphere 3, e00041–e00018. doi: 10.1128/mSphere.00041-18
- Hyatt D., Chen G. L., LoCascio P. F., Land M. L., Larimer F. W., Hauser L. J. (2010). Prodigal Prokaryotic Gene Recognition and Translation Initiation Site Identification. BMC Bioinf. 11, 119. doi: 10.1186/1471-2105-11-119
- Jørgensen T. S., Kiil A. S., Hansen M. A., Sørensen S. J., Hansen L. H. (2015). Current Strategies for Mobilome Research. Front. Microbiol. 5, 1–6. doi: 10.3389/fmicb.2014.00750
- Kang D. W., Adams J. B., Coleman D. M., Pollard E. L., Maldonado J., McDonough-Means S., et al. (2019). Long-Term Benefit of Microbiota Transfer Therapy on Autism Symptoms and Gut Microbiota. Sci. Rep. 9, 5821. doi: 10.1038/s41598-019-42183-0
- Kang D. W., Adams J. B., Gregory A. C., Borody T., Chittick L., Fasano A., et al. (2017). Microbiota Transfer Therapy Alters Gut Ecosystem and Improves Gastrointestinal and Autism Symptoms: An Open-Label Study. Microbiome 5, 10. doi: 10.1186/s40168-016-0225-7
- Karav S., Casaburi G., Frese S. A. (2018). Reduced Colonic Mucin Degradation in Breastfed Infants Colonized by Bifidobacterium Longum Subsp. Infantis EVC001. FEBS Open Bio 8, 1649–1657. doi: 10.1002/2211-5463.12516
- Korpela K. (2021). Impact of Delivery Mode on Infant Gut Microbiota. Ann. Nutr. Metab. 77 (suppl 3), 11–19. doi: 10.1159/000518498
- Lee S. A., Lim J. Y., Kim B. S., Cho S. J., Kim N. Y., Kim O. B., et al. (2015). Comparison of the Gut Microbiota Profile in Breast-Fed and Formula-Fed Korean Infants Using Pyrosequencing. Nutr. Res. Pract. 9, 242–248. doi: 10.4162/nrp.2015.9.3.242
- Lewis Z. T., Totten S. M., Smilowitz J. T., Popovic M., Parker E., Lemay D. G., et al. (2015). Maternal Fucosyltransferase 2 Status Affects the Gut Bifidobacterial Communities of Breastfed Infants. Microbiome 3, 13. doi: 10.1186/s40168-015-0071-z
- Li B., Dewey C. N. (2011). RSEM: Accurate Transcript Quantification From RNA-Seq Data With or Without a Reference Genome. BMC Bioinf. 12, 323. doi: 10.1186/1471-2105-12-323
- Li D., Liu C. M., Luo R., Sadakane K., Lam T. W. (2015). MEGAHIT: An Ultra-Fast Single-Node Solution for Large and Complex Metagenomics Assembly via Succinct De Bruijn Graph. Bioinformatics 31, 1674–1676. doi: 10.1093/bioinformatics/btv033
- Love M. I., Huber W., Anders S. (2014). Moderated Estimation of Fold Change and Dispersion for RNA-Seq Data With DESeq2. Genome Biol. 15, 550. doi: 10.1186/s13059-014-0550-8
- Mancino W., Lugli G. A., van Sinderen D., Ventura M., Turroni F. (2019). Mobilome and Resistome Reconstruction From Genomes Belonging to Members of the Bifidobacterium Genus. Microorganisms 7, 638–652. doi: 10.3390/microorganisms7120638
- Pannaraj P. S., Li F., Cerini C., Bender J. M., Yang S., Rollie A., et al. (2017). Association Between Breast Milk Bacterial Communities and Establishment and Development of the Infant Gut Microbiome. JAMA Pediatr. 171, 647–654. doi: 10.1001/jamapediatrics.2017.0378
- Robinson M. D., McCarthy D. J., Smyth G. K. (2010). edgeR: A Bioconductor Package for Differential Expression Analysis of Digital Gene Expression Data. Bioinformatics 26, 139–140. doi: 10.1093/bioinformatics/btp616
- Rodriguez J. M., Murphy K., Stanton C., Ross R. P., Kober O. I., Juge N., et al. (2015). The Composition of the Gut Microbiota Throughout Life, With an Emphasis on Early Life. Microb. Ecol. Health Dis. 26, 26050. doi: 10.3402/mehd.v26.26050
- Ruiz-Moyano S., Totten S. M., Garrido D. A., Smilowitz J. T., German J. B., Lebrilla C. B., et al. (2013). Variation in Consumption of Human Milk Oligosaccharides by Infant Gut-Associated Strains of Bifidobacterium Breve. Appl. Environ. Microbiol. 79, 6040–6049. doi: 10.1128/AEM.01843-13
- Rutayisire E., Huang K., Liu Y., Tao F. (2016). The Mode of Delivery Affects the Diversity and Colonization Pattern of the Gut Microbiota During the First Year of Infants’ Life: A Systematic Review. BMC Gastroenterol. 16, 86. doi: 10.1186/s12876-016-0498-0
- Sarkar A., Yoo J. Y., Valeria Ozorio Dutra S., Morgan K. H., Groer M. (2021). The Association Between Early-Life Gut Microbiota and Long-Term Health and Diseases. J. Clin. Med. 10, 459. doi: 10.3390/jcm10030459
- Savage J. H., Lee-Sarwar K. A., Sordillo J. E., Lange N. E., Zhou Y., O’Connor G. T., et al. (2018). Diet During Pregnancy and Infancy and the Infant Intestinal Microbiome. J. Pediatr. 203, 47–54.e44. doi: 10.1016/j.jpeds.2018.07.066
- Schwartz S., Friedberg I., Ivanov I. V., Davidson L. A., Goldsby J. S., Dahl D. B., et al. (2012). A Metagenomic Study of Diet-Dependent Interaction Between Gut Microbiota and Host in Infants Reveals Differences in Immune Response. Genome Biol. 13, r32. doi: 10.1186/gb-2012-13-4-r32
- Sela D. A., Chapmanc J., Adeuya A., Kim J. H., Chen F., Whitehead T. R., et al. (2008). The Genome Sequence of Bifidobacterium Longum Subsp. Infantis Reveals Adaptations for Milk Utilization Within the Infant Microbiome. Proc. Natl. Acad. Sci. U. S. A. 105, 18964–18969. doi: 10.1073/pnas.0809584105
- Singhal A., Lanigan J. (2007). Breastfeeding, Early Growth and Later Obesity. Obes. Rev. 8 Suppl 1, 51–54. doi: 10.1111/j.1467-789X.2007.00318.x
- Stewart C. J., Ajami N. J., O’Brien J. L., Hutchinson D. S., Smith D. P., Wong M. C., et al. (2018). Temporal Development of the Gut Microbiome in Early Childhood From the TEDDY Study. Nature 562, 583–602. doi: 10.1038/s41586-018-0617-x
- Taft D. H., Liu J., Maldonado-Gomez M. X., Akre S., Huda M. N., Ahmad S. M., et al. (2018). Bifidobacterial Dominance of the Gut in Early Life and Acquisition of Antimicrobial Resistance. mSphere 3, e00441. doi: 10.1128/mSphere.00441-18
- Tatusov R. L., Koonin E. V., Lipman D. J. (1997). A Genomic Perspective on Protein Families. Science 278, 631–637. doi: 10.1126/science.278.5338.631
- Turroni F., Milani C., Duranti S., Lugli G. A., Bernasconi S., Margolles A., et al. (2020). The Infant Gut Microbiome as a Microbial Organ Influencing Host Well-Being. Ital J. Pediatr. 46, 16. doi: 10.1186/s13052-020-0781-0
- Underwood M. A., Kalanetra K. M., Bokulich N. A., Lewis Z. T., Mirmiran M., Tancredi D. J., et al. (2013). A Comparison of Two Probiotic Strains of Bifidobacteria in Premature Infants. J. Pediatr. 163, 1585–1591.e9. doi: 10.1016/j.jpeds.2013.07.017
- Wood D. E., Salzberg S. L. (2014). Kraken: Ultrafast Metagenomic Sequence Classification Using Exact Alignments. Genome Biol. 15, R46. doi: 10.1186/gb-2014-15-3-r46