2017 Oct 18;11(10):e0005984. doi: 10.1371/journal.pntd.0005984

Table 3. CDS and predicted protein annotations using publicly available databases.

Public Database                             Annotation Summary
BLASTp x nr                                 140,484 CDSs (72.3%)
                                            49,518 unique protein identities
BLASTn x nt                                 128,028 CDSs (65.9%)
                                            26,708 unique nt identities
InterProScan                                137,778 (70.9%)
Gene Ontology (GO)                          50,870 CDSs (26.2%)
  Unique Molecular Function                 3,246
  Unique Cellular Component                 1,618
  Unique Biological Process                 8,282
KEGG                                        145,197 CDSs (74.7%)
  Unique KEGG orthologous groups            3,824
  Unique KEGG pathways                      387
  Unique KEGG classes                       46
  Unique KEGG categories                    6
    Cellular Processes                      13,845
    Environmental Information Processing    16,093
    Genetic Information Processing          13,722
    Human Diseases                          32,748
    Metabolism                              41,022
    Organismal Systems                      27,767
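
The table gives counts and rounded percentages but does not state the total number of CDSs behind them. As a rough consistency check, not part of the original analysis, the sketch below back-calculates the range of totals that each count and percentage pair allows, assuming every percentage was rounded to one decimal place against the same total. It also sums the six KEGG category counts for comparison with the KEGG row. The variable and function names are illustrative only.

```python
# Counts and rounded percentages copied from Table 3.
rows = {
    "BLASTp x nr": (140_484, 72.3),
    "BLASTn x nt": (128_028, 65.9),
    "InterProScan": (137_778, 70.9),
    "Gene Ontology (GO)": (50_870, 26.2),
    "KEGG": (145_197, 74.7),
}

# Per-category counts listed under "Unique KEGG categories".
kegg_categories = {
    "Cellular Processes": 13_845,
    "Environmental Information Processing": 16_093,
    "Genetic Information Processing": 13_722,
    "Human Diseases": 32_748,
    "Metabolism": 41_022,
    "Organismal Systems": 27_767,
}


def implied_total_range(count, pct, half_step=0.05):
    """Range of totals for which count/total would round to pct.

    Assumes pct was rounded to one decimal place (hence +/- 0.05).
    """
    low = count / ((pct + half_step) / 100)
    high = count / ((pct - half_step) / 100)
    return low, high


ranges = {name: implied_total_range(c, p) for name, (c, p) in rows.items()}

# A single shared total exists only if all per-row ranges overlap.
shared_low = max(low for low, _ in ranges.values())
shared_high = min(high for _, high in ranges.values())
consistent = shared_low <= shared_high

kegg_category_sum = sum(kegg_categories.values())

for name, (low, high) in ranges.items():
    print(f"{name}: implied total between {low:,.0f} and {high:,.0f}")
print(f"Shared range: {shared_low:,.0f} to {shared_high:,.0f} (consistent: {consistent})")
print(f"Sum of KEGG category counts: {kegg_category_sum:,}")
```

Under these assumptions the per-row ranges overlap, so the percentages are compatible with one shared total of somewhere between about 194,200 and 194,500 CDSs, and the six category counts add up to the 145,197 CDSs reported in the KEGG row.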