Sci Rep. 2021 Jun 29;11:13460. doi: 10.1038/s41598-021-92601-5

Table 2.

Summary annotation statistics for male and female assemblies.

Statistic                           Male              Female#
Repeat content                      23.55%            23.41%
Number of protein-coding genes      27,175            28,988
Median gene length (bp)             7,368             6,721
Number of transcripts               50,133            51,844
Number of exons                     303,132           307,753
Number of coding exons              284,414           288,788
Coding GC content                   52.67%            52.57%
Median UTR length (bp)              1,231             1,222
Median intron length (bp)           388               371
Exons/transcript                    11.88             11.53
Transcripts/gene                    1.84              1.79
Multi-exonic transcripts            0.956             0.941
Gene density (genes/Mb)             45.026            47.679
Functionally annotated transcripts  36,130 (72.1%)    35,999 (69.4%)
Unique genes                        3,806 (14%)       4,643 (16%)
Non-coding RNAs                     21,123            23,822

The annotation pipeline is described in more detail in the “Supplementary method”.

#Sequence deposited in figshare (https://doi.org/10.6084/m9.figshare.12472100.v1).
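
As a quick consistency check, the ratio and percentage rows of Table 2 can be recomputed from the raw counts. The sketch below is illustrative only and is not part of the authors' pipeline. It assumes that transcripts/gene is the number of transcripts divided by the number of protein-coding genes, that the functionally annotated percentage is relative to all transcripts, and that the unique-gene percentage is relative to protein-coding genes. The implied assembly length additionally assumes gene density means protein-coding genes per Mb of assembly. The dictionary keys and the `derived` helper are hypothetical names.

```python
# Counts copied from Table 2 for the male and female assemblies.
table = {
    "male": {"genes": 27_175, "transcripts": 50_133,
             "annotated": 36_130, "unique": 3_806, "density": 45.026},
    "female": {"genes": 28_988, "transcripts": 51_844,
               "annotated": 35_999, "unique": 4_643, "density": 47.679},
}


def derived(row):
    """Recompute the ratio rows of the table from the raw counts."""
    return {
        # Assumption: ratio of all transcripts to protein-coding genes.
        "transcripts_per_gene": round(row["transcripts"] / row["genes"], 2),
        # Assumption: percentage taken over all transcripts.
        "annotated_pct": round(100 * row["annotated"] / row["transcripts"], 1),
        # Assumption: percentage taken over protein-coding genes.
        "unique_pct": round(100 * row["unique"] / row["genes"]),
        # Assumption: density = genes / assembly length in Mb,
        # so the implied length is genes / density.
        "implied_assembly_mb": round(row["genes"] / row["density"], 1),
    }


for sex, row in table.items():
    print(sex, derived(row))
```

Under these assumptions the recomputed transcripts/gene and percentage values agree with the published rows, which supports reading the female exons/transcript entry as a decimal value.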