Table 2. Classification of putative CDSs in chromosome I.
Total predicted protein-coding CDSs | 369 |
Hypothetical proteins and proteins of unknown function | 277 |
Proteins conserved in other species | 177 |
Proteins conserved in T.brucei onlya | 42 |
Proteins previously characterised in T.brucei | 37 |
Proteins with annotated function | 55 |
Gene ontology process assignmentsb | 156 |
Cell organization and biogenesis | 18 |
Metabolism | 8 |
Other cell growth and/or maintenance | 4 |
Immune-evasion | 3 |
Interpro matchesc | 111 |
Leucine-rich repeat, IPR001611 | 5 |
G-protein βWD-40 repeat, IPR001680 | 3 |
AAA ATPase, IPR003593 | 3 |
SAM (and some other nucleotide) binding motif IPR000051 | 3 |
Tubulin/FtsZ protein, IPR003008 | 3 |
Zn-finger, RING, IPR001841 | 3 |
CDSs with polymorphisms (chrIa versus chrIb) | 119 |
Total number of non-conservative or complex polymorphismsd | 73 |
aFamilies 1–4, 7, 8, 13, 15–17 in Table 3, plus intact VSG genes.
bNumber of GO process terms assigned (see also Fig. 1), each category contains multiple terms, each CDS may have more than one assignment, GO process categories not assigned to chrI are not listed.
cSix most abundant domains (all domain annotations are available at http://www.genedb.org), non-redundant hits only, i.e. one hit of each interpro domain is counted per CDS and only one copy of repeated genes was used in the analysis.
dConservative amino acid substitutions were defined as those that score greater than zero using a BLOSUM 62 substitution matrix (56), complex polymorphisms include multiple substitutions and insertions or deletions but do not include insertions or deletions of three nucleotides.