Table 1.
Genome assembly and annotation details for the species used for OpenSpliceAI training and transfer learning in this study. For each species, the table lists the assembly name, the GenBank accession number, the FTP site for assembly and annotation downloads, and the annotation release date.
| Species | Assembly name | GenBank accession | Download link | Annotation release date |
|---|---|---|---|---|
| Homo sapiens | GRCh38.p14 | GCA_000001405.29 | https://ftp.ncbi.nlm.nih.gov/genomes/all/annotation_releases/9606/GCF_000001405.40-RS_2023_03/ | 21-March-2023 |
| Mus musculus | GRCm39 | GCA_000001635.9 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/635/GCF_000001635.27_GRCm39/ | 08-February-2024 |
| Apis mellifera | Amel_HAv3.1 | GCA_003254395.2 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/003/254/395/GCF_003254395.2_Amel_HAv3.1/ | 30-September-2022 |
| Arabidopsis thaliana | TAIR10.1 | GCA_000001735.2 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/735/GCF_000001735.4_TAIR10.1/ | 16-June-2023 |
| Danio rerio | GRCz11 | GCA_000002035.4 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/002/035/GCF_000002035.6_GRCz11/ | 15-August-2024 |
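
The download links in the table point to directories rather than to individual files. As a minimal sketch, the helper below derives the likely genomic FASTA and GFF3 file URLs from one of the `genomes/all/GCF/...` directory links. It assumes NCBI's usual naming convention, in which files inside an assembly directory are prefixed with the directory name (for example `<prefix>_genomic.fna.gz`). The function name `genomic_file_urls` is hypothetical and not part of OpenSpliceAI. The Homo sapiens link uses a different `annotation_releases` layout, so the sketch rejects it rather than guessing.

```python
from urllib.parse import urlparse


def genomic_file_urls(directory_url: str) -> dict:
    """Derive likely FASTA and GFF3 URLs from an NCBI 'genomes/all' directory URL.

    Assumption: files in the directory are named '<directory name>_genomic.<ext>',
    which is the usual NCBI layout for GCF/GCA assembly directories.
    """
    path = urlparse(directory_url).path.rstrip("/")
    # Only handle the per-assembly layout; other layouts (such as
    # 'annotation_releases') are not covered by this sketch.
    if "/genomes/all/GCF/" not in path and "/genomes/all/GCA/" not in path:
        raise ValueError(f"Unsupported directory layout: {directory_url}")

    # The last path component, e.g. 'GCF_000001635.27_GRCm39'.
    prefix = path.rsplit("/", 1)[-1]
    base = directory_url.rstrip("/")
    return {
        "fasta": f"{base}/{prefix}_genomic.fna.gz",
        "gff": f"{base}/{prefix}_genomic.gff.gz",
    }


if __name__ == "__main__":
    mouse = "https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/635/GCF_000001635.27_GRCm39/"
    for kind, url in genomic_file_urls(mouse).items():
        print(kind, url)
```

The sketch only builds URLs and does not download anything; check the directory listing to confirm that the files exist before relying on the derived names.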