Table 2. Entamoeba histolytica protein families showing high association with repetitive elements.
Family ID* | Protein family name | Number of associated elements | Number of genes in Family | Percentage of Association |
238 | hypothetical protein | 5 | 5 | 100% |
133 | hypothetical protein | 7 | 7 | 100% |
64 | hypothetical protein | 10 | 10 | 100% |
145 | hypothetical protein | 5 | 6 | 83% |
52 | hypothetical protein | 4 | 5 | 80% |
236 | cystein protease family | 4 | 5 | 80% |
66 | hypothetical protein | 6 | 8 | 75% |
42 | hypothetical protein, conserved | 11 | 15 | 73% |
157 | Gal/Gal/Nac lectin complex family | 4 | 6 | 66% |
87/29/274 | AIG1 family protein | 18 | 29 | 62% |
77 | regulator of nonsense transcripts family | 6 | 10 | 60% |
93 | hypothetical protein | 5 | 9 | 55% |
111 | hypothetical protein | 4 | 8 | 50% |
15 | hypothetical protein | 12 | 29 | 41% |
67 | hypothetical protein | 4 | 11 | 36% |
2 | BspA-like family protein | 41 | 114 | 35% |
12 | HSP 70 family | 11 | 31 | 35% |
63 | peroxiredoxin family protein | 4 | 12 | 33% |
54 | hypothetical protein | 4 | 13 | 30% |
41 | cystein protease family | 4 | 14 | 28% |
32 | DEAD/DEAH-box helicase family protein | 5 | 18 | 27% |
9 | kinase family protein | 9 | 39 | 23% |
19 | zinc-finger domain containing protein | 6 | 26 | 23% |
8 | hypothetical protein | 9 | 38 | 23% |
5 | hypothetical protein, conserved | 13 | 61 | 21% |
24 | kinase family protein | 4 | 20 | 20% |
13 | LRR repeat containing protein | 5 | 29 | 17% |
*Only families with at least five proteins and showing more than 15% association are shown.