2023 Mar 13;14:1384. doi: 10.1038/s41467-023-36988-x

Fig. 6. Larger marine prokaryotic genomes harbor elongated and highly redundant coding gene sequences.

Box-and-whisker plots (a–c) show estimated average gene length (AGL), percent guanine and cytosine (% GC), and number of unique genes per megabase pair (Mbp) of coding sequence length (UGPCL) in metagenomic assemblies from the global Malaspina expedition (n = 81). Depth ranges and sample counts for each layer are as follows: epipelagic (EPI, ~3–200 m; n = 23), mesopelagic (MES, > 200–1000 m; n = 32), and bathypelagic (BAT, > 1000–4000 m; n = 26). Boxplots show medians as horizontal lines and interquartile ranges as boxes; whiskers extend to a maximum of 1.5 times the interquartile range. Mean values are shown as white diamonds. Values at the top indicate the significant adjusted P-values of the unpaired two-sided Wilcoxon test with Benjamini-Hochberg correction. (d) Relationship between genetic (AGL, % GC, and UGPCL) and environmental (temperature, depth, and salinity) factors in the sampled global ocean microbiomes (n = 81). Values indicate Spearman correlation coefficients (one-tailed). Asterisks mark non-significant relationships (P > 0.01). Source data are provided as a Source Data file.
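The Benjamini-Hochberg correction named in the caption can be illustrated with a minimal Python sketch. It is not the authors' code. The function name `benjamini_hochberg` and the three raw P-values are hypothetical, and the sketch assumes the tests are pairwise comparisons between the three depth layers, which the caption does not state explicitly.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Visit p-values from largest to smallest so a running minimum
    # enforces monotonicity of the adjusted values.
    order = sorted(range(m), key=lambda i: p_values[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0  # adjusted p-values are capped at 1
    for k, idx in enumerate(order):
        rank = m - k  # 1-based rank of this p-value in ascending order
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw p-values for three pairwise layer comparisons
# (illustrative numbers only, not taken from the figure).
raw = {"EPI vs MES": 0.01, "EPI vs BAT": 0.04, "MES vs BAT": 0.03}
adjusted = dict(zip(raw, benjamini_hochberg(list(raw.values()))))

for comparison, p_adj in adjusted.items():
    print(f"{comparison}: adjusted P = {p_adj:.3f}")
```

Each raw P-value is scaled by the number of tests divided by its ascending rank, and the running minimum keeps the adjusted values ordered the same way as the raw ones.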