Skip to main content
. 2023 Feb 6;8(1):e01066-22. doi: 10.1128/msystems.01066-22

TABLE 1.

Summary information for the two selected published data sets, including the number and percentage of operational taxa assigned to the core by each method testeda

Characteristic Data set
Human Microbiome Project Arabidopsis thaliana microbiome
Total taxa 11,752 14,890
Total reads 1,893,867 1,770,731
Total samples 319 288
NCBI accession no. HM16STR ERP001384
Sequencing platform Illumina 454
Method Taxa assigned to core, n (%)
Abundance-based 1,108 (9.42) 1,245 (8.36)
Occupancy-based 204 (1.73) 1,134 (7.61) 
Abundance and occupancy-based 204 (1.73) 907 (6.09)
Hard cutoffs of abundance and occupancy 554 (4.71) 181 (1.21)
Not assigned to the core by any method 10,642 (90.55) 13,590 (91.26)
Unique taxa assigned to the core by any method 1,110 (9.44) 1,300 (8.73)
a

The Arabidopsis thaliana data set was generated by Lundberg et al. (10) and only utilizes rhizosphere samples from the M21 site. The human microbiome data set was generated by the Human Microbiome Consortium (53) and includes only fecal samples.