Table 1. Genome data for idSarSubv1.1.
| Project accession data | |
|---|---|
| Assembly identifier | idSarSubv1.1 |
| Species | Sarcophaga subvicina |
| Specimen | idSarSubv1 |
| NCBI taxonomy ID | 236850 |
| BioProject | PRJEB51465 |
| BioSample ID | SAMEA7746447 |
| Isolate information | male, thorax tissue (genomic DNA), head tissue (Hi-C) |
| Assembly metrics * | |
| Base pair QV | 52.9 (Benchmark: ≥50) |
| k-mer completeness | 99.99% (Benchmark: ≥95%) |
| BUSCO ** | C:99.2%[S:98.5%,D:0.7%],F:0.2%,M:0.6%,n:3285 (Benchmark: C ≥ 95%) |
| Percentage of assembly mapped to
chromosomes |
95.91% (Benchmark: ≥95%) |
| Sex chromosomes | X chromosome identified (Benchmark: localised homologous pairs) |
| Organelles | Mitochondrion genome assembled (Benchmark: complete single alleles) |
| Raw data accessions | |
| PacificBiosciences SEQUEL II | ERR9284049, ERR9284050 |
| 10X Genomics Illumina | ERR9248453–ERR9248456 |
| Hi-C Illumina | ERR9248452 |
| Genome assembly | |
| Assembly accession | GCA_936449025.1 |
| Accession of alternate haplotype | GCA_936440885.1 |
| Span (Mb) | 714.2 |
| Number of contigs | 445 |
| Contig N50 length (Mb) | 102.9 |
| Number of scaffolds | 274 |
| Scaffold N50 length (Mb) | 122.7 |
| Longest scaffold (Mb) | 159.5 |
| Genome annotation | |
| Number of protein-coding genes | 16,793 |
* Assembly metric benchmarks are adapted from column VGP-2020 of “Table 1: Proposed standards and metrics for defining genome assembly quality” from ( Rhie et al., 2021).
** BUSCO scores based on the diptera_odb10 BUSCO set using v5.3.2. C = complete [S = single copy, D = duplicated], F = fragmented, M = missing, n = number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/idSarSubv1.1/dataset/CAKZFR01/busco.