(A) Genome size, (B) level of BUSCO completion (n = 255 BUSCOs), (C) number of genes, and (D) boxplots of gene length, with the mean gene length shown as a blue dot, for the MAGs and the reference diatoms C. tenuissimus (C.t.), P. tricornutum (P.t.), and T. pseudonana (T.p.). Only assembly scaffolds were available for C. tenuissimus, so its gene number and gene length could not be assessed. (E, F) PCA of gene and genome metrics of the MAGs, shaded by geographical origin (blue: Arctic Ocean; purple: Mediterranean Sea; orange: Pacific South Eastern Ocean; green: Pacific South Western Ocean; black: Southern Ocean). See Tables B and C in S1 Table for raw values. BUSCO, Benchmarking Universal Single-Copy Orthologs; MAG, metagenome-assembled genome; PCA, principal component analysis.