Table 1. Table describing the rates of L1Hs insertion validation by binned average number of sequencing reads. The percentage of successfully validated novel non-ref L1Hs per read bin was used to calculate the number of predicted true positive novel non-ref L1Hs per bin. The calculated total true positive per read bin is the sum of the predicted true positive novel non-ref L1Hs and the total number of detected known non-ref and ref L1Hs per bin. The calculated percent true positive per bin is calculated as the total true positive for bin, divided by the sum of the number of detected novel non-ref L1Hs, number of detected known non-ref L1Hs and number of detected ref L1Hs. The cumulative percent true positive is calculated as the percentage of true positive L1Hs insertions having at least the average number of sequencing reads for that bin and above.
novel non-ref L1Hs | known non-ref L1Hs | ref L1Hs | Calculated total true positive per bin | Calculated % true positive per bin | Cumulative % true positive (per bin and above) | |||||
---|---|---|---|---|---|---|---|---|---|---|
Avg. # of reads | Successfully validated | Validations attempted | % Validated | total # detected per bin | Predicted true positive | total # detected per bin | total # detected per bin | |||
≥1,000 | 32 | 44 | 72.7 | 460 | 334 | 227 | 465 | 1026 | 89.1 | 89.1 |
500-999 | 16 | 41 | 39.0 | 186 | 73 | 43 | 90 | 206 | 64.6 | 83.8 |
250-499 | 13 | 40 | 32.5 | 320 | 104 | 28 | 67 | 199 | 48.0 | 75.9 |
100-249 | 6 | 36 | 16.7 | 876 | 146 | 34 | 77 | 257 | 26.0 | 58.8 |
Table 1. Validation of putative Ta subfamily L1Hs insertions.