Table 3. Characteristics of gene families on chromosome Ia.
Number | Product description | Distribution | % identity (protein) | % identity (nucleotide) | Conserved 3′ region | Comments | Gene identifiers (no. copies) |
---|---|---|---|---|---|---|---|
1 | RHS | Clustered | 20–99% | 34–99% | Yes | Different 3′UTR associated with different subtypeb | Tb.1.70,120,180,220,420 (5) |
2 | LRRP1 | Clustered | 93–95% | 93–95% | Yes | Interspersed with 3 | Tb.1.290,370,480 (3) |
3 | Unknown function | Clustered | 75–95% | 93–95% | Yes | Interspersed with LRRP1 | Tb.1.280,380,470 (3) |
4 | Unknown function | Tandem | 23% | 45% | No | Tb.1.1470,1500 (2) | |
5 | Phosphate permease | Tandem | 97% | 97% | Yes | Tb.1.580,600 (2) | |
6 | PGK | Tandem | 65–89% | 63–91% | No | Tb.1.700,710,720 (3) | |
7 | Unknown function | Dispersed | 99% | 98% | Yes | Tb.1.1040,1650 (2) | |
8 | ESAG2 | Dispersed | 68–82% | 76–87% | Yes | 2 conserved 3′UTRs, 1 divergent | Tb.1.2040,4890,5100 (3) |
9 | Calpain-like | Clustered | 6–80% | NDc | No | Tb.1.2100,2110,2120,2150,2160,2230,2260 (7) | |
10 | β-tubulin | Tandem | >99% | >99% | Yes | Interspersed with α-tubulin | Tb.1.2330,2350,2370,2390 (ND) |
11 | α-tubulin | Tandem | >99% | >99% | Yes | Interspersed with β-tubulin | Tb.1.2340,2360,2380,2400 (ND) |
12 | Histone H3 | Tandem | 100% | >99% | Yes | Interspersed with 13 | Tb.1.2430,2450,2470,2490,2510,2530,2550 (7) |
13 | Unknown function | Tandem | 100% | >99% | Yes | Interspersed with histone-H3 | Tb.1.2440,2460,2480,2500,2520,2540,2560 (7) |
14 | Pteridine transporter | Tandem | 100% | >99% | Yes | Tb.1.2820,2850,2880 (3) | |
15 | Unknown function | Tandem | 51–99% | 60–99% | Yes | Last CDS of cluster is divergent in CDS and 3′UTR | Tb.1.4540,4560,4580,4600,4630,4650 (6) |
16 | ESAG1 | Clustered | 55–84% | 76–89% | No | 3 conserved 3′UTRs, 2 divergent | Tb.1.4870,4910,5120,5200,5240 (5) |
17 | ESAG11 | Clustered | 88% | 91% | Yes | Tb.1.4900,5110 (2) | |
18 | ESAG9, putative | Clustered | 30% | 48% | Yes | Tb.1.5080, 5220 (2) |
Clustered, confined to specific region of chrI but not arrayed in defined repeat units, genes may be on either strand, i.e. inverted relative to other copies. Tandem, within a defined repeat unit in close tandem arrays, repeat units never inverted relative to other copies. Dispersed, located in distant regions of the chromosome, may be inverted. ND, not determined.
aExcludes VSGs.
bRHS subtypes described in Bringaud et al. (39).
cNucleotide sequences too divergent for accurate alignment.