Skip to main content
. 2017 Jul 10;5:e3558. doi: 10.7717/peerj.3558

Table 1. Statistics for Megahit contigs, recruitment to data-rich-contigs, and relative abundance of draft genome results for each sample.

TARA sample site Size fraction (Girus, Bacteria, or Protist) Depth (Surface or DCMa) No. of reads No. of initial Megahit assembly N50c (bp; initial Megahit assembly) Longest initial Megahit assembly (bp) Recruitment (% data-rich-contigs) Relative abundancec of draft genomes (%) Relative abundancec of ten most abundant genomes (% )
TARA007 Girus DCM 178,519,830 1,318,470 828 220,754 72.84 14.64 6.35
TARA007 Girus Surface 221,166,612 1,308,847 861 211,946 81.74 14.83 6.12
TARA007 Protist DCM 744,458,992 4,667,618 654 188,635 19.45 8.60 3.18
TARA007 Protist Surface 265,432,098 2,590,120 564 18,444 25.58 1.57 0.61
TARA009 Girus DCM 416,553,274 2,796,841 831 1,643,839 69.48 14.16 6.32
TARA009 Girus Surface 489,617,426 1,787,467 929 1,142,851 68.85 12.29 4.76
TARA009 Protist DCM 329,036,110 1,938,636 613 95,724 22.07 13.35 4.20
TARA009 Protist Surface 370,813,078 1,700,350 588 292,050 22.53 15.97 6.17
TARA018 Bacteria DCM 408,021,182 2,520,645 840 1,573,060 76.22 11.49 3.18
TARA018 Bacteria Surface 414,976,308 2,604,031 816 2,086,508 75.80 11.03 3.02
TARA023 Bacteria DCM 147,400,552 1,273,576 830 213,456 76.08 13.29 4.09
TARA023 Bacteria Surface 149,566,010 1,237,617 825 134,179 75.98 13.82 4.01
TARA023 Protist DCM 508,610,652 2,707,801 734 336,689 28.23 25.07 7.83
TARA023 Protist Surface 397,044,232 2,246,571 593 397,140 23.00 25.16 10.31
TARA025 Bacteria DCM 386,627,816 2,516,865 806 388,546 69.77 14.55 5.35
TARA025 Bacteria Surface 457,560,422 2,326,838 857 330,773 75.57 10.99 3.18
TARA030 Bacteria DCM 346,837,034 1,968,945 1,097 508,775 80.16 10.31 2.57
TARA030 Bacteria Surface 478,785,582 1,639,697 1,194 204,976 77.70 7.26 2.64
TARA030 Protist DCM 426,896,616 1,620,343 616 478,892 15.12 17.83 5.13
TARA030 Protist Surface 430,029,974 1,838,588 628 287,782 22.36 17.60 6.73

Notes.

a

DCM—deep chlorophyll maximum.

b

N50—length of DNA sequence above which 50% of the total is contained.

c

Relative abundance—determined using the reads recruited data-rich-contigs.