Table 1.
Analyzed data | 4A-3A | 4A-3B | Total |
---|---|---|---|
ESTs | 3,269 | 2,656 | 5,925 |
Clones | 1,839 | 1,403 | 3,242 |
Average insert size, bp | 1,500 | ||
Average EST length, bp | 375 | ||
EST clusters | 4,122 | ||
Clone clusters | 2,380 | ||
Homologous clone clusters | 1,118 | ||
Identical to Anopheles | 27 | ||
Potential immunity genes | 38 |
Numbers of sequenced cDNA clones and generated ESTs from the libraries constructed from cell lines 4A-3A and 4A-3B. The average insert size was calculated for 100 cDNA clones from each library, and EST length was calculated from the total set of 5,925 ESTs. ESTs with 97% or greater identity over a 100-bp region were clustered together forming 4,122 EST clusters. Clusters including the 5′ and 3′ end sequences of the same clone were grouped together forming 2,380 clone clusters, each potentially representing an individual gene. One or more ESTs of 1,118 clone clusters had a significant blastx E value (<10−4) to other proteins in a nonredundant swissprot and sptrembl database (24); a small number of these seem to be chimeric. A total of 27 clone clusters had protein sequences identical to those of A. gambiae genes in the database, and 38 clone clusters were similar to genes known to play potential roles in innate immunity.