Table 1.
Misidentification as unidentified ASVs (NA) and ASVs assigned to other genera, the Shannon diversity and richness of bacterial genera as well as the residual sum of squares (RSS) as discrepancy of the sequenced community composition and the structure of the mock community on genus level
| Community | Misidentification | Shannon diversity | Richness | RSS to Mock | |||||
|---|---|---|---|---|---|---|---|---|---|
| NA | Other genera | Mock | Raw | GCN | Mock | Raw | Raw | GCN | |
| Mock-12 | 0.0003 | 0.0461 | 2.0736 | 0.3926 | 0.5765 | 11 | 13 | 1.3138 | 1.3274 |
| Mock-13 | 0.0005 | 0.0057 | 2.8216 | 2.6968 | 2.7120 | 18 | 32 | 0.5198 | 0.5258 |
| Mock-14 | 0.0003 | 0.0103 | 2.8216 | 2.7039 | 2.7456 | 18 | 35 | 0.5245 | 0.4991 |
| Mock-15 | 0.0001 | 0.0038 | 2.8216 | 2.6950 | 2.6591 | 18 | 30 | 0.5447 | 0.5823 |
| Mock-16 | 0.0853 | 0.0913 | 3.7543 | 3.1574 | 3.0887 | 46 | 54 | 0.8441 | 0.9053 |
| Mock-18 | 0.0019 | 0.0000 | 2.7081 | 2.6027 | 2.4329 | 15 | 15 | 0.3089 | 0.5965 |
| Mock-19 | 0.3074 | 0.0000 | 2.3581 | 2.4581 | 2.1697 | 15 | 15 | 0.8829 | 1.1353 |
| Mock-20 | 0.0000 | 0.0001 | 2.7616 | 2.5335 | 2.4519 | 17 | 17 | 0.5107 | 0.5766 |
| Mock-21 | 0.0000 | 0.0000 | 1.6901 | 1.5246 | 1.5091 | 17 | 14 | 0.4041 | 0.3534 |
| Mock-22 | 0.0004 | 0.0292 | 2.7616 | 2.7212 | 2.7024 | 17 | 20 | 0.2978 | 0.4075 |
| Mock-23 | 0.0008 | 0.0017 | 1.6901 | 1.7205 | 1.7214 | 17 | 20 | 0.1938 | 0.1566 |
The mock community (Mock) was compared with the 16S amplicon sequencing data without (raw) or with gene copy normalization (GCN). Mock-12 was removed from the analysis due to the low Shannon diversity of the sequencing data, accounting for only 20% of the real diversity (shown in red). The best fit of the sequencing data with or without GCN using RSS compared with the mock composition is shown in bold