Figure 2. Gut microbiome taxonomic profiles of healthy and non-healthy individuals inform a Lasso-penalized logistic regression classification model.
(a) Principal component analysis (PCA) of gut microbiome profiles reveals significant differences in the distribution of healthy (disease-free) (blue, n = 5547) and non-healthy (diseased) (red, n = 2522) groups (P < 0.05, PERMANOVA). Ellipses represent 95% confidence regions. The top 10 PC1 and PC2 loading vector magnitudes are shown. (b) Coefficient values for the Lasso-penalized logistic regression model. The model includes 49 taxa with positive coefficients, 3105 taxa with zero coefficients, and 46 taxa with negative coefficients.