Table 1. Overview of the BDGP D. melanogaster Release 6 genome assembly.
Current release | Dmel_Release_6 |
Data provider | BDGP |
Collaborators | DHGP, BCM-HGSC, Celera Genomics |
Sequenced strain | iso-1 |
Date released | 21-JUL-2014 (FlyBase, Dmel annotation version R6.01) |
25-JUL-2014 (GenBank, RefSeq) | |
NCBI accessions | Release 6 plus ISO1 MT |
Assembly: GCA_000001215.4 | |
RefSeq: GCF_000001215.4 | |
BioProject: PRJNA13812 | |
Assembly statistics | • Total sequence length = 143 726 002 bp. |
• Total gap length = 1 152 978 bp. | |
• Total number of scaffolds = 1870. | |
• Seven chromosome arms (plus mitochondrial genomea): X, 2L, 2R, 3L, 3R, 4 and Y. | |
• The vast majority of sequence, 137.6 Mb, resides on the seven chromosome arms. | |
• 1862 ‘unlocalized’ minor scaffolds, of which 884 have been mapped cytologically or genetically to a chromosome region: X, 2CEN, 3CEN, Y, XY and rRNA. | |
Major changes relative to Release 5 | • Release 6 is 4.2 Mb larger. |
• Total gap length decreased by 1.5 Mb. | |
• The majority of new sequence added to the chromosome arm scaffolds is in the heterochromatic regions, 10.0 Mb of which derives from the BDGP Release 5 scaffolds XHet, 2LHet, 2RHet, 3LHet, 3RHet and U. | |
• The chromosome Y scaffold is vastly improved and 10 times larger at 3.1 Mb. | |
• Most remaining gaps are in the heterochromatic regions of the assembly. | |
• 1862 minor scaffolds replace Release 5 concatenated pseudoscaffolds (e.g. U). | |
• 48 minor scaffolds have been modified and improved from Release 5; their names indicate their mapping (2Cen_mapped_Scaffold_10_D1684). The remaining 1814 ‘unmodified’ minor scaffolds have numeric identifiers like 2110000… | |
• All fragmented gene annotations from Release 5 have been resolved, largely as a result of improvements to the Y and 3R scaffolds. |
aThe reference genome assembly update in Dmel R6.01 (FB2014_04) was for the nuclear genome only, maintaining the old mitochondrial genome assembly, a composite of sequences from various D. melanogaster strains (GenBank U37541.1, RefSeq NC_001709.1). With FlyBase update FB2015_01, the mitochondrial reference genome assembly was also updated, replacing the previous assembly with one derived exclusively from the iso-1 reference strain (GenBank KJ947872.2; RefSeq NC_024511.2).