TABLE 1.
Database or domain | Avg log10 E value | Median log10 E value | No. of sequences | % of totala |
---|---|---|---|---|
GenBank nt or nr database | −21.5 | −13.7 | 2,195 | 39 |
Archaea | −11.4 | −8.2 | 25 | 0.4 (1) |
Bacteria | −17.1 | −10.1 | 1,031 | 18 (47) |
Eukaryote | −8.1 | −4.5 | 149 | 2.6 (7) |
Virus | −28.4 | −21.2 | 962 | 17 (44) |
Environmental databaseb | −24.5 | −17.1 | 1,731 | 31 |
No homology | NAc | NA | 1,715 | 30 |
Total | NA | NA | 5,641 | 100 |
For domains the value in parentheses is the percentage of the 2,195 sequences displaying similarity to nt or nr database sequences.
The sequence was similar to at least one environmental database sequence (env-nt, env-nr, or viral metagenome) but showed no similarity to the nt or nr database.
NA, not applicable.