Table 1.
Data inputs for an ad hoc committee naming uncultivated SAGs and MAGs
Category | Parameters for data quality recommendations |
Genome quality | Percent completion |
Percent contamination | |
Presence of 5S, 16S 23S rRNA gene (and level of completeness) | |
Number of tRNA genes | |
Assembly quality | N50 (defined as the length of the shortest contig in the set of largest contigs that together constitute at least half of the total assembly size) |
Number of contigs | |
Naming conventions | Status of Candidatus and future nomenclature (for example, superscript u, c or e designating uncultivated, Candidatus or environmental microorganisms, respectively) |
Description requirements | Phenotype or metabolic prediction based on DNA sequence |
Ecological and biogeographic consideration | |
Additional metadata (including guanine and cytosine content, genome size and number of protein coding genes) | |
Validation of MAG and SAG nomenclature | Potential for informational system of classification, validation and DOI assignment |