Table 1.
Parameters | Rio | Bahia |
---|---|---|
MG-RAST ID | 44852183 | 44852193 |
No. of sequences | 494,201 | 503,019 |
Avg. length (bp) | 327 ± 110* | 328 ± 111* |
Total length (bp) | 161,854,245 | 165,387,021 |
Predicted proteins† | 488,630 | 450,087 |
Assigned reads | 242,868 | 261,482 |
LCA¥ | | |
Bacteria (%) | 206,733 (94.3) | 273,078 (97.36) |
Archaea (%) | 10,041 (4.6) | 4,623 (1.65) |
Eukarya (%) | 1,559 (0.7) | 1,018 (0.36) |
Viruses (%) | 297 (0.1) | 29 (0.01) |
Unclassified (%) | 649 (0.3) | 1,740 (0.62) |
MEGAN | | |
No. of sequences | 494,201 | 503,019 |
Assigned reads | 293,374 | 340,668 |
Bacteria (%) | 252,395 (93.6) | 313,633 (97.3) |
Archaea (%) | 13,173 (4.88) | 5,686 (1.76) |
Eukarya (%) | 3,295 (1.22) | 2,805 (0.87) |
Viruses (%) | 554 (0.2) | 57 (0.01) |
Unclassified (%) | 269 (0.1) | 182 (0.06) |
*After duplicate removal, splitting and trimming of sequence reads.
†Predicted protein-coding regions assigned an annotation using at least one of the protein databases (M5NR) on the MG-RAST server.
¥Lowest common ancestor (LCA) using a maximum e-value cutoff of 1e-5, a minimum identity of 60%, and a minimum alignment length cutoff of 15.
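
The percentages in parentheses appear to be each category's share of the summed domain-level counts in the same column, not of the "Assigned reads" row. The following minimal sketch checks this for the Rio LCA column. The counts are copied from the table; the function name `category_percentages` and the rounding to one decimal place are illustrative assumptions, not part of the original analysis.

```python
# Read counts per category, copied from the Rio LCA column of Table 1.
rio_lca = {
    "Bacteria": 206_733,
    "Archaea": 10_041,
    "Eukarya": 1_559,
    "Viruses": 297,
    "Unclassified": 649,
}


def category_percentages(counts, ndigits=1):
    """Return each category's share of the summed counts, in percent.

    Hypothetical helper: the denominator is the sum of the listed
    categories, which is an assumption inferred from the table values.
    """
    total = sum(counts.values())
    return {name: round(100 * n / total, ndigits) for name, n in counts.items()}


rio_pct = category_percentages(rio_lca)
for name, pct in rio_pct.items():
    print(f"{name}: {pct}%")
```

Under this assumption the computed shares match the values reported for Rio (94.3, 4.6, 0.7, 0.1 and 0.3); the other columns can be checked the same way by substituting their counts.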