Table 1.
Consensus-making statistics for five dilutions of a D. magna gDNA-Seq library.
Dataset* | Dilution (amol of DNA used as input) | ||||||
---|---|---|---|---|---|---|---|
10a | 50a | 100a | 150a | 1f | Pooled** | ||
Full | Input read pairs (×106) | 51.7 | 88.4 | 72.9 | 115.5 | 107.1 | 435.6 |
Total DCS | 655072 | 902770 | 422378 | 493156 | 37860 | 2511236 | |
Total DCS bases (×106) | 77.0 | 114.0 | 45.0 | 57.0 | 46.0 | 339.0 | |
Filtered DCS | 148771 | 171002 | 152820 | 153884 | 9663 | 636140 | |
(% of total DCS) | (22.7) | (18.9) | (36.2) | (31.2) | (25.5) | (25.3) | |
Filtered DCS bases (×106) | 11.1 | 15.3 | 12.3 | 13.7 | 1.0 | 53.4 | |
(% of total DCS bases) | (14.4) | (13.4) | (27.3) | (24.0) | (2.2) | (17.8) | |
Avg filtered DCS length (bp) | 74 | 89 | 80 | 89 | 99 | 83 | |
Filtered DCS bases in masked genome (×106) | 4.79 | 6.71 | 5.30 | 6.00 | 0.39 | 23.19 | |
Downsampled | Input read pairs (×106) | 50.0 | 50.0 | 50.0 | nd | 50.0 | nd |
Total DCS | 644041 | 308193 | 191903 | nd | 8002 | nd | |
Total DCS bases (×106) | 42.8 | 24.2 | 14.0 | nd | 0.6 | nd | |
Filtered DCS | 147195 | 100136 | 76179 | nd | 1846 | nd | |
(% of total DCS) | (22.9) | (32.5) | (39.7) | nd | (23.1) | nd | |
Filtered DCS bases (×106) | 10.9 | 9.5 | 6.2 | nd | 0.2 | nd | |
(% of total DCS bases) | (25.5) | (39.4) | (44.3) | nd | (28.9) | nd | |
Avg filtered DCS length (bp) | 74 | 95 | 81 | nd | 100 | nd | |
Filtered DCS bases in masked genome (×106) | 4.71 | 4.15 | 2.64 | nd | 0.07 | nd | |
Peak read family size | 8 | 2 | 2 | nd | 2 | nd |
*Dataset: statistics are shown for the analysis pipeline run on the full set of read pairs generated per library (“Full”) or on libraries downsampled to 50 M read pairs each (“Downsampled”).
**Pooled: Reads from all five diluted libraries were combined and analyzed together (full datasets only). The pooled dataset was used to identify the final set of 14 putative somatic mutations described below.
nd, no data.