Skip to main content
. 2020 Nov 16;49(D1):D92–D96. doi: 10.1093/nar/gkaa1023

Table 1.

GenBank divisions

Division Description Base pairsa
WGS Whole genome shotgun data 8 841 649 410 652
TSA Transcriptome shotgun data 381 148 464 834
PLN Plants 269 438 877 546
BCT Bacteria 98 827 135 660
VRT Other vertebrates 63 565 835 430
EST Expressed sequence tags 43 301 109 577
TLS Targeted Loci Studies 27 825 059 498
HTG High-throughput genomic 27 781 778 663
PAT Patent sequences 26 452 787 091
GSS Genome survey sequences 26 378 695 300
MAM Other mammals 20 844 388 122
INV Invertebrates 19 759 935 222
ROD Rodents 12 090 011 771
PRI Primates 8 767 435 622
SYN Synthetic 7 932 542 985
ENV Environmental samples 6 755 612 180
VRL Viruses 5 824 026 918
PHG Phages 782 571 323
HTC High-throughput cDNA 733 210 026
STS Sequence tagged sites 640 923 137
UNA Unannotated 679 302
TOTAL All GenBank sequences 9 890 500 490 859

aRelease 239 (8/2020).