Table 3.
%n | m | Expected number of new genes in an additional sample of size m | Probability of discovering a new gene at the (n + m + 1)-th read |
---|---|---|---|
Naegleria aerobic | |||
50 | 480 | 162 ∈ (138 , 188) | 0.318 ∈ (0.307 , 0.329) |
100 | 959 | 307 ∈ (271 , 345) | 0.290 ∈ (0.277 , 0.303) |
150 | 1438 | 441 ∈ (394 , 488) | 0.270 ∈ (0.257 , 0.282) |
200 | 1918 | 566 ∈ (510 , 624) | 0.254 ∈ (0.241 , 0.267) |
250 | 2398 | 685 ∈ (619 , 751) | 0.242 ∈ (0.229 , 0.255) |
300 | 2877 | 798 ∈ (725 , 873) | 0.231 ∈ (0.219 , 0.244) |
Naegleria anaerobic | |||
50 | 484 | 231 ∈ (206 , 258) | 0.450 ∈ (0.440 , 0.461) |
100 | 969 | 440 ∈ (402 , 478) | 0.412 ∈ (0.400 , 0.424) |
150 | 1454 | 632 ∈ (583 , 683) | 0.384 ∈ (0.371 , 0.397) |
200 | 1938 | 812 ∈ (753 , 873) | 0.362 ∈ (0.349 , 0.375) |
250 | 2422 | 983 ∈ (915 , 1053) | 0.344 ∈ (0.332 , 0.357) |
300 | 2907 | 1146 ∈ (1069 , 1225) | 0.330 ∈ (0.317 , 0.342) |
Naeglaria aerobic and anaerobic libraries: the first column provides the size of the additional sample in % of the size of the initial sample, the second the actual size of the additional survey, the third presents the expected number of new genes and the fourth the discovery probability. The estimates in the third and fourth column are accompanied by the 95% highest posterior density intervals.