Figure 2.
t-SNE visualization of the normalized microbe-gene co-mentioning matrix derived from PubMed queries for the microbes from the following four phyla: Firmicutes, Proteobacteria, Bacteroidetes, and Actinobacteria. The t-SNE was applied to the TF-IDF normalized (see “Methods”) microbe-gene co-mentioning matrix to calculate the 2-D coordinates for individual microbial genus or species. Each dot in the scatter plot represents a microbial genus or species, which is colored by their respective phylum based on Greengenes taxonomy.