Table 4.
Features of long-read sequence datasets for completing MnatCMV genome sequences
|
Virus |
Corresponding SR dataset* |
Reads (no.) |
Reads mapping to (no.) |
Differences‡ (no.) |
||
|---|---|---|---|---|---|---|
|
Original |
Trimmed |
SR genome |
LR† genome |
|||
|
MnatCMV1 |
Mnat36A E2-3 |
307 535 |
307 049 |
38 326 |
38 302 |
152 |
|
MnatCMV2 |
Mnat2A C3-3 |
252 363 |
251 971 |
26 728 |
25 483 |
169 |
|
MnatCMV3 |
Mnat35A A3-2 |
703 074 |
701 775 |
102 749 |
97 912 |
88 |
*See Table 2. SR, short-read.
†LR, long-read.
‡Between the genome sequences determined from SR data with reiterated regions derived from LR data (GenBank accessions OP429138.1, OP429139.1 and OP429140.1) and the genome sequences assembled from LR data alone.