Table 1. Statistics for Megahit contigs, recruitment to data-rich-contigs, and relative abundance of draft genome results for each sample.
| TARA sample site | Size fraction (Girus, Bacteria, or Protist) | Depth (Surface or DCMa) | No. of reads | No. of initial Megahit assembly | N50c (bp; initial Megahit assembly) | Longest initial Megahit assembly (bp) | Recruitment (% data-rich-contigs) | Relative abundancec of draft genomes (%) | Relative abundancec of ten most abundant genomes (% ) |
|---|---|---|---|---|---|---|---|---|---|
| TARA007 | Girus | DCM | 178,519,830 | 1,318,470 | 828 | 220,754 | 72.84 | 14.64 | 6.35 |
| TARA007 | Girus | Surface | 221,166,612 | 1,308,847 | 861 | 211,946 | 81.74 | 14.83 | 6.12 |
| TARA007 | Protist | DCM | 744,458,992 | 4,667,618 | 654 | 188,635 | 19.45 | 8.60 | 3.18 |
| TARA007 | Protist | Surface | 265,432,098 | 2,590,120 | 564 | 18,444 | 25.58 | 1.57 | 0.61 |
| TARA009 | Girus | DCM | 416,553,274 | 2,796,841 | 831 | 1,643,839 | 69.48 | 14.16 | 6.32 |
| TARA009 | Girus | Surface | 489,617,426 | 1,787,467 | 929 | 1,142,851 | 68.85 | 12.29 | 4.76 |
| TARA009 | Protist | DCM | 329,036,110 | 1,938,636 | 613 | 95,724 | 22.07 | 13.35 | 4.20 |
| TARA009 | Protist | Surface | 370,813,078 | 1,700,350 | 588 | 292,050 | 22.53 | 15.97 | 6.17 |
| TARA018 | Bacteria | DCM | 408,021,182 | 2,520,645 | 840 | 1,573,060 | 76.22 | 11.49 | 3.18 |
| TARA018 | Bacteria | Surface | 414,976,308 | 2,604,031 | 816 | 2,086,508 | 75.80 | 11.03 | 3.02 |
| TARA023 | Bacteria | DCM | 147,400,552 | 1,273,576 | 830 | 213,456 | 76.08 | 13.29 | 4.09 |
| TARA023 | Bacteria | Surface | 149,566,010 | 1,237,617 | 825 | 134,179 | 75.98 | 13.82 | 4.01 |
| TARA023 | Protist | DCM | 508,610,652 | 2,707,801 | 734 | 336,689 | 28.23 | 25.07 | 7.83 |
| TARA023 | Protist | Surface | 397,044,232 | 2,246,571 | 593 | 397,140 | 23.00 | 25.16 | 10.31 |
| TARA025 | Bacteria | DCM | 386,627,816 | 2,516,865 | 806 | 388,546 | 69.77 | 14.55 | 5.35 |
| TARA025 | Bacteria | Surface | 457,560,422 | 2,326,838 | 857 | 330,773 | 75.57 | 10.99 | 3.18 |
| TARA030 | Bacteria | DCM | 346,837,034 | 1,968,945 | 1,097 | 508,775 | 80.16 | 10.31 | 2.57 |
| TARA030 | Bacteria | Surface | 478,785,582 | 1,639,697 | 1,194 | 204,976 | 77.70 | 7.26 | 2.64 |
| TARA030 | Protist | DCM | 426,896,616 | 1,620,343 | 616 | 478,892 | 15.12 | 17.83 | 5.13 |
| TARA030 | Protist | Surface | 430,029,974 | 1,838,588 | 628 | 287,782 | 22.36 | 17.60 | 6.73 |
Notes.
DCM—deep chlorophyll maximum.
N50—length of DNA sequence above which 50% of the total is contained.
Relative abundance—determined using the reads recruited data-rich-contigs.