Fig. 4 ∣. MEDI food abundances across infants and adults.

a, Fraction of samples with at least one detected food read across different age groups. b, Relative abundance of food-derived reads in a cohort of 447 infants. The blue line denotes the smoothing spline of the observed reads; the light blue area denotes the 95% confidence interval of the mean spline curve. Orange dots denote samples with less than 95% overall abundance mapped to bacteria (that is, low bacterial biomass). Grey shaded area denotes the interquartile area of the onset of solid food intake across infants. c, Energy content per standardized portion size (100 g) per sample in adults and infants. Shown are only samples with detected food items (n = 196 for infants and n = 359 for adults). Asterisk denotes significance under a Welch t-test: *P = 0.024. d, Macronutrient content per standardized portion size in infants and adults. Shown are only samples with detected food items (n = 196 for infants and n = 359 for adults). Asterisk denotes significance under a two-sided Welch t-test: *P = 0.015. In c and d, boxplots show 25%, 50% and 75% quantiles; the centre denotes the median and whiskers extend to the smallest and largest data points within 1.5 interquartile ranges. e, One-sided Mantel permutation test statistics for beta diversity agreement between MEDI-predicted food abundances, FFQs and microbial species abundances (Bray–Curtis distances; see Methods). Correlation between pairwise distance measures is indicated by r; Mantel test P value is shown. f, Comparison of relative food group abundances with paired diet frequency questionnaire data from infants. RPM, reads per million. Circles denote the mean; error bars, s.d. (n = 447). Pt-test indicates the P value of a two-sided Welch t-test of log-transformed relative abundances; Plogit denotes the P value of a logistic regression of food occurrence against food frequency strata. Axis labels are common across both plots in this panel. g, Comparison of MEDI-predicted relative food group abundances with diet frequency questionnaires in adults. Circles denote the mean; error bars, s.d. (only samples with paired FFQs, n = 361), Plm indicates the ANOVA P value of a regression of log-transformed relative abundances; Plogit denotes the P value of a logistic regression of food occurrence against food frequency strata. Axis labels are common across all plots in this panel.