Table 4. The 10 most overrepresented (Ov) new families from the set of over 180 curated novel families identified in this work.
Family ID | Family description | g | n | G | N | Ov/Ex/Es | Most advanced PSI target (id, center, status) |
PB004588 | No hypothesis about function | 105 | 0 | 14 | 0 | 666.89/7.00/0.22 | 390317, JCSG, Diffraction-quality Crystals |
PB064361 | Contains putative lipoproteins | 60 | 0 | 13 | 0 | 381.08/4.29/0.20 | N/A |
PB012771 | No hypothesis about function | 92 | 1 | 21 | 1 | 292.16/3.68/0.32 | NYSGXRC-T1444, NYSGXRC, Work Stopped |
PB008694 | Contains conserved hypothetical proteins found in conjugative transposon TraH | 40 | 0 | 13 | 0 | 254.05/2.86/0.20 | 390153, JCSG, Diffraction-quality Crystals |
HGC00311 | No hypothesis about function | 35 | 0 | 16 | 0 | 222.30/2.06/0.25 | N/A |
PB023339 | No hypothesis about function | 32 | 0 | 13 | 0 | 203.24/2.29/0.20 | NYSGXRC-12097b, NYSGXRC, Native diffraction data |
HGC00150 | No hypothesis about function | 31 | 0 | 15 | 0 | 196.89/1.94/0.23 | N/A |
PB029229 | No hypothesis about function | 28 | 0 | 13 | 0 | 177.84/2.00/0.20 | 393207, JCSG, Crystallized |
PB048420 | No hypothesis about function | 27 | 0 | 18 | 0 | 171.49/1.42/0.28 | N/A |
PB047024 | Remote homology to HD domain (PF01966) | 24 | 0 | 22 | 0 | 152.43/1.04/0.34 | N/A |
The exact definition of the Ov category is given in the Methods section. Columns provide numerical values for: g (total number of representatives in genomes of human gut microbiome microbes), n (total number of representatives in genomes of microbes not associated with the human gut microbiome), G (number of microbes from the human gut microbiome with at least one representative of a family) and N (number of microbes not associated with the human gut microbiome with at least one representative of a family).