Table 3. Variant graph statistics.
Connectivity statistics are shown for variant graphs constructed from various simulated mate-pair (# kb, MP) and Hi-C read datasets. Graph constructed from all Hi-C data are compared to those constructed using only Hi-C read pairs with inserts over 1 kb. The Hi-C variant graphs are highly connected in contrast to the mate-pair graphs that have both lower connectedness and lower rates of variants occurring in the same connected components.
Num. reads | Max | Avg. | % Same c.c | |
---|---|---|---|---|
5 kb, MP | 10,287,315 | 71 | 14.81 | 6.21 |
10 kb, MP | 7,681,515 | 96 | 24.45 | 16.6 |
20 kb, MP | 4,871,227 | 94 | 27.58 | 32.38 |
40 kb, MP | 4,257,896 | 111 | 37.19 | 100 |
Hi-C (all) | 16,429,505 | 10 | 5.11 | 97.77 |
Hi-C (>1 kb) | 111,525 | 11 | 5.47 | 94.75 |