2014 Dec 11;8(12):e3392. doi: 10.1371/journal.pntd.0003392

Table 1. Summary of the nucleotide sequence data for EgPSCs prior to and following assembly, with detailed bioinformatic annotation and analyses.

Raw reads 330188
Unigenes (average length; min-max length) 26514 (510.5; 150–3357)
Containing an open reading frame (%) 19576 (73.8)
With homologues in (%)
   E. granulosus 17732 (66.9)
   E. multilocularis 17861 (67.4)
   Caenorhabditis elegans 8946 (33.7)
   Clonorchis sinensis 2540 (20.6)
   Schistosoma mansoni 2159 (17.5)
   Schistosoma japonicum 1485 (12.1)
   Escherichia coli 159 (1.3)
Returning STRING results (%) 3188 (12.0)
Returning NCBI NR results (%) 12408 (46.8)
 Gene Ontology (%) 5846 (22.0)
 Number of biological process terms (level 2) 24
  Cellular component 20
  Molecular function 14
 Returning a KOBAS result (%) 5657 (21.3)
 Number of predicted biological pathways 306