Table 1.
Overview of all data files/data sets
| Label | Name of data file/data set | File types (file extension) | Data repository and identifier (DOI or accession number) |
|---|---|---|---|
| Data file 1 | Raw short MGI sequencing reads | Fastq file (.fastq) | NCBI Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRR25056966 [33] |
| Data file 2 | Raw long HiFi sequencing reads | Fastq file (.fastq) | NCBI Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRR25056964 [34] |
| Data file 3 | Raw Hi-C sequencing reads | Fastq file (.fastq) | NCBI Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRR25056965 [35] |
| Data file 4 | Raw RNA-seq reads for seven tissues | Fastq file (.fastq) | NCBI Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRR25056967, https://identifiers.org/ncbi/insdc.sra:SRR25056968, https://identifiers.org/ncbi/insdc.sra:SRR25056969, https://identifiers.org/ncbi/insdc.sra:SRR25056970, https://identifiers.org/ncbi/insdc.sra:SRR25056971 |
| Data file 5 | Supplementary material for the genome | PDF file (.pdf) | Figshare, 10.6084/m9.figshare.23651262 [37] |
| Data file 6 | Assembled genome | Fasta file (.fasta) | NCBI GenBank, https://identifiers.org/ncbi/insdc.gca:GCA_030549335.1 [38] |
| Data file 7 | Predicted genes | GFF3 file (.gff) | Figshare, 10.6084/m9.figshare.23635401 [39] |
| Data file 8 | Predicted gene coding sequences (CDS) | CDS file (.cds) | Figshare, 10.6084/m9.figshare.23635401 [39] |
| Data file 9 | Predicted gene protein sequences | Protein file (.pep) | Figshare, 10.6084/m9.figshare.23635401 [39] |
| Data file 10 | Gene annotation using KEGG, GO, InterPro, Swiss-Prot, NR, and KOG databases | Annotation file (.html) | Figshare, 10.6084/m9.figshare.23635401 [39] |