Table 3. Assessment of proteome completeness (%) using DOGMA and BUSCO approaches.
Approach/domain | DF HQ | DF All | PT HQC | PT HQ All | PL HQC | PL HQ All | PA MC | PA HC |
---|---|---|---|---|---|---|---|---|
DOGMA | ||||||||
CDA Found1 | 464 | 774 | 149 | 344 | 525 | 741 | 580 | 550 |
CDA Found2 | 272 | 471 | 97 | 229 | 280 | 431 | 141 | 287 |
CDA Found3 | 172 | 291 | 40 | 115 | 123 | 242 | 29 | 188 |
Total Found CDA | 908 | 1536 | 286 | 688 | 928 | 1414 | 750 | 1025 |
Total % completeness | 45 | 76 | 14 | 34 | 46 | 70 | 37 | 51 |
BUSCO | ||||||||
Complete | 299 | 523 | 216 | 321 | 466 | 593 | 107 | 455 |
Single | 184 | 355 | 161 | 236 | 383 | 468 | 76 | 318 |
Multi | 115 | 168 | 55 | 85 | 83 | 125 | 31 | 137 |
Fragment | 203 | 283 | 144 | 193 | 148 | 188 | 242 | 230 |
Missing | 938 | 634 | 1080 | 926 | 826 | 659 | 1091 | 755 |
DOGMA is the 965 single-domain CDAs and 1052 multiple-domain CDAs (Conserved Domain Arrangements) across eukaryotes and BUSCO is the Benchmarking Universal Single-Copy Orthologs. Explanation of headings: DF, Douglas-fir; PT, Pinus taeda; PL, Pinus lambertiana; PA, Picea abies; HQ, High Quality; HQC, High-Quality Complete; MC, Medium Content; HC, High Content.