Table 1. Summary of sequencing data.
Category | Number | Match length (bases): average ± SD | Match length (bases): range | Match length (bases): median | Ambiguity (%): average ± SD | Ambiguity (%): range | Ambiguity (%): median |
Stocks received | 1189 | n/a | n/a | n/a | n/a | n/a | n/a |
Failed to grow | 56 | n/a | n/a | n/a | n/a | n/a | n/a |
No sequence read | 10 | n/a | n/a | n/a | n/a | n/a | n/a |
Correct (pre-screened)ᵃ | 613 | 383 ± 73 | 54–549 | 399 | 2.0 ± 2.5 | 0–17.0 | 1.2 |
Correct (post-screen) | 126 | 363 ± 96 | 94–500 | 389 | 2.0 ± 2.0 | 0–11.3 | 1.4 |
Correct (isolated from contaminated stocks) | 84 | 372 ± 79 | 75–467 | 396 | 1.6 ± 2.1 | 0–14.3 | 1.2 |
Incorrect (pre-screened) | 152 | | | | | | |
Accession no. in UniGene cluster | 97 | 386 ± 56 | 186–490 | 399 | 2.4 ± 2.2 | 0–9.3 | 1.7 |
Accession no. not in UniGene cluster | 15 | 380 ± 100 | 182–476 | 427 | 1.7 ± 2.1 | 0.5–8.6 | 0.9 |
No significant identity | 40 | n/d | n/d | n/d | n/d | n/d | n/d |
Incorrect (colony isolation)ᵇ | 332 | | | | | | |
Accession no. in UniGene cluster | 247 | 369 ± 76 | 111–525 | 383 | 1.7 ± 1.6 | 0–9.0 | 1.2 |
Accession no. not in UniGene cluster | 61 | 395 ± 85 | 149–546 | 424 | 1.4 ± 1.6 | 0–7.0 | 0.8 |
No significant identity | 24 | n/d | n/d | n/d | n/d | n/d | n/d |
Total clones identifiedᶜ | 1243 | 378 ± 76 | 54–549 | 394 | 1.9 ± 2.2 | 0–17.0 | 1.2 |
Total clonesᵈ | 1303 | n/a | n/a | n/a | n/a | n/a | n/a |
For the 1189 stock cultures ordered, the number of entities in each category is noted. A sequence is defined as correct if it has identity to the published sequence for the desired clone; incorrect sequences do not have identity to the desired clone. Where possible, incorrect sequences were assigned GenBank accession numbers by comparison to the murine sequences in dbEST. Match length (the length of the region of identity between derived and published sequence) and percent ambiguity (the number of ambiguous base calls divided by the total number of bases over the region of identity) are each presented as average ± standard deviation, range and median. n/a, not applicable; n/d, not determined.
ᵃ Pre-screened, derived from those stocks which passed agarose gel electrophoretic pre-screening.
ᵇ Colony isolation, derived from those stocks which failed pre-screening.
ᶜ Includes correct clones, and incorrect clones which could be assigned a GenBank accession number cluster. This number is higher than the number of stocks ordered, due to the isolation of more than one incorrect clone from several stock cultures.
ᵈ Number of clones examined, inclusive of clones which could not be identified.
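
The percent ambiguity defined in the legend, and the "total clones identified" count described in the third footnote, can be illustrated with a minimal Python sketch. The function and variable names are hypothetical, and the sketch assumes that any base call other than A, C, G or T (for example N) counts as ambiguous; the source does not state how ambiguous calls were encoded. The counts are copied from the table rows.

```python
# Assumption: unambiguous base calls are A, C, G and T; anything else is ambiguous.
UNAMBIGUOUS_BASES = set("ACGT")


def percent_ambiguity(region_of_identity: str) -> float:
    """Ambiguous base calls divided by total bases in the region, as a percentage."""
    if not region_of_identity:
        raise ValueError("region of identity must contain at least one base")
    ambiguous = sum(
        1 for base in region_of_identity.upper() if base not in UNAMBIGUOUS_BASES
    )
    return 100.0 * ambiguous / len(region_of_identity)


# Counts copied from Table 1: correct clones plus incorrect clones that
# could be assigned an accession number (hypothetical dictionary keys).
identified_counts = {
    "correct_pre_screened": 613,
    "correct_post_screen": 126,
    "correct_from_contaminated_stocks": 84,
    "incorrect_pre_screened_in_unigene": 97,
    "incorrect_pre_screened_not_in_unigene": 15,
    "incorrect_colony_isolation_in_unigene": 247,
    "incorrect_colony_isolation_not_in_unigene": 61,
}
total_identified = sum(identified_counts.values())

if __name__ == "__main__":
    # A made-up 10-base region with two ambiguous calls.
    print(percent_ambiguity("ACGTNNACGT"))
    print(total_identified)
```

The "No significant identity" rows are left out of the sum because those clones could not be assigned an accession number.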