Skip to main content
. 2024 Sep 4;8:65. Originally published 2023 Feb 8. [Version 2] doi: 10.12688/wellcomeopenres.18717.2

Table 1. Genome data for idSarSubv1.1.

Project accession data
Assembly identifier idSarSubv1.1
Species Sarcophaga subvicina
Specimen idSarSubv1
NCBI taxonomy ID 236850
BioProject PRJEB51465
BioSample ID SAMEA7746447
Isolate information male, thorax tissue (genomic DNA), head tissue (Hi-C)
Assembly metrics *
Base pair QV 52.9 (Benchmark: ≥50)
k-mer completeness 99.99% (Benchmark: ≥95%)
BUSCO ** C:99.2%[S:98.5%,D:0.7%],F:0.2%,M:0.6%,n:3285 (Benchmark: C ≥ 95%)
Percentage of assembly mapped to
chromosomes
95.91% (Benchmark: ≥95%)
Sex chromosomes X chromosome identified (Benchmark: localised homologous pairs)
Organelles Mitochondrion genome assembled (Benchmark: complete single alleles)
Raw data accessions
PacificBiosciences SEQUEL II ERR9284049, ERR9284050
10X Genomics Illumina ERR9248453–ERR9248456
Hi-C Illumina ERR9248452
Genome assembly
Assembly accession GCA_936449025.1
Accession of alternate haplotype GCA_936440885.1
Span (Mb) 714.2
Number of contigs 445
Contig N50 length (Mb) 102.9
Number of scaffolds 274
Scaffold N50 length (Mb) 122.7
Longest scaffold (Mb) 159.5
Genome annotation
Number of protein-coding genes 16,793

* Assembly metric benchmarks are adapted from column VGP-2020 of “Table 1: Proposed standards and metrics for defining genome assembly quality” from ( Rhie et al., 2021).

** BUSCO scores based on the diptera_odb10 BUSCO set using v5.3.2. C = complete [S = single copy, D = duplicated], F = fragmented, M = missing, n = number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/idSarSubv1.1/dataset/CAKZFR01/busco.