ABSTRACT
Lineage-wise physiological activities of plankton communities in the ocean are important but challenging to characterize. Here, we conducted whole-assemblage metatranscriptomic profiling at continental shelf and slope sites in the South China Sea to investigate carbon fixation potential in different lineages. RuBisCO expression, the proxy of Calvin carbon fixation (CCF) potential, was mainly contributed by Bacillariophyta, Chlorophyta, Cyanobacteria, and Haptophyta, which was differentially affected by environmental factors among lineages. CCF potential exhibited positive or negative correlations with phagotrophy gene expression, suggesting phagotrophy possibly enhances or complements CCF. Our data also reveal significant non-Calvin carbon fixation (NCF) potential, as indicated by the active expression of genes in all five currently recognized NCF pathways, mainly contributed by Flavobacteriales, Alteromonadales, and Oceanospirillales. Furthermore, in Flavobacteriales, Alteromonadales, Pelagibacterales, and Rhodobacterales, NCF potential was positively correlated with proton-pump rhodopsin (PPR) expression, suggesting that NCF might be energetically supported by PPR. The novel insights into the lineage-differential potential of carbon fixation, widespread mixotrophy, and PPR as an energy source for NCF lay a methodological and informational foundation for further research to understand carbon fixation and the trophic landscape in the ocean.
IMPORTANCE
Marine plankton plays an important role in global carbon cycling and climate regulation. Phytoplankton and cyanobacteria fix CO2 to produce organic compounds using solar energy and mainly by the Calvin cycle, whereas autotrophic bacteria and archaea may fix CO2 by non-Calvin cycle carbon fixation pathways. How active individual lineages are in carbon fixation and mixotrophy, and what energy source bacteria may employ in non-Calvin carbon fixation, in a natural plankton assemblage are poorly understood and underexplored. Using metatranscriptomics, we studied carbon fixation in marine plankton with lineage resolution in tropical marginal shelf and slope areas. Based on the sequencing results, we characterized the carbon fixation potential of different lineages and assessed Calvin- and non-Calvin- carbon fixation activities and energy sources. Data revealed a high number of unigenes (4.4 million), lineage-dependent differential potentials of Calvin carbon fixation and responses to environmental conditions, major contributors of non-Calvin carbon fixation, and their potential energy source.
KEYWORDS: whole-assemblage metatranscriptomics (WAME), phytoplankton, microbial, carbon fixation, proton-pump rhodopsin, dinoflagellates
INTRODUCTION
The functional contribution of individual lineages to a marine plankton community is challenging to characterize and has been underexplored. Traditional taxonomic composition analysis provides community structure but does not offer information on the physiological performance of each taxon. Physiological measurements, on the other hand, reveal the functional activities of the community but cannot resolve the contribution of individual taxa. Metatranscriptomic analysis can fill this gap because expressed genes in the resulting data set can be traced back to the source lineage of organisms, whereas the expression levels of the genes (proxies of the potential activities of the encoded functions) can be quantified and compared between lineages. Thanks to the rapid growth of genomic and transcriptomic databases and the increasing accessibility of metatranscriptome sequencing and bioinformatic analyses, metabolic profiles in dinoflagellates (1–3), diatoms (4), raphidophytes (5), and other lineages (4) in the natural marine environment have increasingly been documented. The metatranscriptomic approach has enabled the interrogation of the potential to acquire energy and nutrients, defend against grazing and microbial attacks, and perform sexual reproduction of individual phytoplankton species in the community, with which to understand the controlling drivers of regime shifts in the community (6–12). Most of these studies have focused on one or a few species in the communities, and cross-kingdom or domain comparative insights on multi-metabolic pathways have only begun to emerge and are still relatively limited. Carradec and colleagues (13) reported the first global ocean atlas of eukaryotic genes based on the Tara Oceans expedition. Recently, a study in the tropical Pacific revealed that dinoflagellates occupy an important niche by employing numerous metabolic strategies and play a dual role in carbon transformation (14). A metatranscriptomic study on a dinoflagellate bloom characterized the differential metabolic potential of phytoplankton, zooplankton, bacteria, and viruses and harnessed insights into how the bloom dynamics of Karenia longicanalis was modulated by bottom-up (nutrients and energy) and top-down (grazing and microbial attacks) processes (12). More whole-assemblage (covering eukaryotes and prokaryotes) studies are needed, especially focusing on lineage-specific contributions to carbon fixation.
Calvin carbon fixation (CCF) is usually dominated by cyanobacteria in the oligotrophic ocean and by diatoms and other eukaryotes in the coastal waters (15, 16). Non-Calvin carbon fixation (NCF) by bacteria has been anecdotally documented or discussed but has not become a focused topic of research (17–19). NCF involves five metabolic pathways and has been suggested to potentially contribute as high as >30% of total oceanic carbon fixation as it can be performed both at night (dark C fixation) and under light (20). Although NCF was increasingly reported for marine bacterial isolates in recent years (17, 18, 21), lineage-specific activities of NCF and their contribution to total carbon fixation are still poorly understood. Besides, extensive laboratory studies indicate that many planktonic eukaryotes are capable of mixotrophy, using different degrees of phagotrophy combined with complementary degrees of photoautotrophy (22, 23). A recent study in the tropical Pacific indicates that phagotrophy of dinoflagellates is important in euphotic and the mesopelagic zones (14). How active individual lineages are in carbon fixation and mixotrophy, and what energy source bacteria may employ in NCF in natural plankton assemblages are poorly understood and underexplored.
In this study, we conducted a whole-assemblage metatranscriptomic (WAME) study for a continental shelf station and a continental slope station in the South China Sea (SCS) to gain a comprehensive understanding of lineage-specific functional contributions. SCS is a subtropical marginal sea that connects the Tibetan Plateau and the Western Pacific Warm Pool. There are both terrestrial input from the plateau and exchange from the warm ocean currents (24, 25). The coastal waters (within the continental shelf) are influenced by several major rivers, which bring in high nutrient loads (26, 27). The basin of the SCS (continental slope and beyond), however, is generally oligotrophic, even though it receives nutrient inputs from the seasonally fluctuated Kuroshio current, upwelling, and monsoons (28, 29). Its carbon biogeochemistry has been extensively investigated, and parts of the ocean are found to be carbon sinks, whereas other parts are carbon sources, at least seasonally (30). By cataloging the functional (i.e., protein-coding) gene repertoire, characterizing metabolic pathways in different lineages, comparing the potentials of CCF, characterizing phagotrophic activities, and assessing NCF activity and energy sources, we aim to shed light on the metabolic dynamics and carbon cycling in the diverse plankton assemblages of the SCS. Our data set reveals a vast number of unigenes (4.4 million) and provides insights into the lineage-specific activities of carbon fixation and the potential sources of energy for different lineages.
RESULTS
Retrieved genes and their functional and taxonomic distributions
The 20 samples from which WAME was analyzed were collected from two to three different depths in picoplankton (0.2–3 µm) and nano-/micro-plankton (3–200 µm) organismal size fractions, at two stations, hence corresponding to 10 different conditions, with two biological replicates for each condition (Fig. 1). Physical and chemical parameters are shown in Table 1, showing a temperature range of 18.61°C–30.25°C, salinity of 33.62–34.78, N-nutrients 0.16–9.11 µM, and P-nutrient 0.01–0.72 µM. WAME sequencing and bioinformatics of these samples yielded a total of 881 Gb of raw reads, which were assembled into 4,499,414 unigenes, with N50 of 372 bp and a maximum length of 71,092 bp (Table 2). Rarefaction analysis indicated that our sequencing effort was close to saturation to recover all expressed genes (Fig. S1). Of these unigenes, 37.23% matched genes that have functional annotations in the databases (Table S1). These unigenes were contributed by 26,269 taxa of organisms (Table S2). It is important to note that the data and the metabolic profiles reported here only represent a snapshot of the potentially very dynamic and heterogeneous plankton communities. The actual numbers of unigenes and contributing lineages as well as their metabolic profiles remain to be portrayed with more spatially and temporally intensive sampling schemes in future research. Nevertheless, the insights gained from the limited data are still worth discussing.
Fig 1.

Study sites and sampling schemes. A continental shelf site (C6) and a continental slope site (C9) were sampled for the depths of surface (SUR) and deep chlorophyll maximum (DCM), and for C9, also the 150-m depth defined as the bottom of the euphotic zone (BOT). Station C6 (117.46°E, 22.13°N) is on the continental shelf, with a depth of 77 m, therefore, had no samples for 150 m. Station C9 (117.99°E, 21.69°N) is on the continental slope, with a depth of 1,369 m. For each sample, 40–50 L of water was collected and serially filtered onto 0.2–3 and 3–200 µm size fractions.
TABLE 1.
Physical and chemical parameters
| C6S_1 | C6S_2 | C6D_1 | C6D_2 | C9S_1 | C9S_2 | C9D_1 | C9D_2 | C9B_1 | C9B_2 | |
|---|---|---|---|---|---|---|---|---|---|---|
| NO2− + NO3− (μm) | 0.3985 | 0.272 | 1.2915 | 0.281 | 0.607 | 0.156 | 0.584 | 2.291 | 9.108 | 8.246 |
| PO4− (μm) | 0.058 | 0.053 | 0.538 | 0.087 | 0.011 | 0.021 | 0.131 | 0.242 | 0.717 | 0.648 |
| SiO32− (μm) | 3.0145 | 2.065 | 2.7505 | 1.557 | 1.527 | 1.497 | 2.702 | 3.61 | 8.641 | 7.769 |
| Temperature (°C) | 29.26 | 28.5935 | 27.0803 | 27.0055 | 30.2508 | 29.8893 | 24.5048 | 26.2942 | 18.61 | 18.9146 |
| Salinity | 33.62 | 34.2943 | 34.1409 | 34.4909 | 34.0557 | 33.7776 | 34.6454 | 34.7504 | 34.7678 | 34.7839 |
| Depth (m) | 3 | 3 | 35 | 35 | 3 | 3 | 75 | 75 | 150 | 150 |
| Photosynthetically active radiation (μmol m−2 s−1) | 55.8697 | 55.8697 | 4.007 | 4.007 | 59.0434 | 59.0434 | 0.7454 | 0.7454 | 0.0221 | 0.0221 |
TABLE 2.
Statistics of non-redundant gene sets
| Statistics | |
|---|---|
| Total number (#) | 4,449,414 |
| Total length (bp) | 1,669,935,941 |
| Gap (N) (bp) | 0 |
| Average length (bp) | 375.32 |
| N50 length (bp) | 372 |
| N90 length (bp) | 226 |
| Maximum length (bp) | 71,092 |
| Minimum length (bp) | 200 |
| GC content | 44.99% |
CCF in different plankton groups and environmental effects
RuBisCO gene expression is the rate-limiting enzyme of photosynthetic carbon fixation and thus was used as a proxy to compare the potential of CCF in plankton. Among the samples, the relative CCF potential in the deep chlorophyll maximum (DCM) layer was higher than the surface layer (SUR) and bottom of the photic zone (BOT) at both stations, with the DCM of C6 station (shelf) being the highest of all sample sources. The relative CCF potential in the continental shelf area was higher than that in the continental slope area with the same water layer, and that in the SUR layer of C9 the station (slope) was the lowest of all sample sources (Fig. 2A).
Fig 2.
Relative contributions of major lineages to the CCF transcript pool in different water layers. (A) Comparison at supergroup level; number in the inner circle depicts the expression level (transcripts per kilobase of exon model per million mapped reads) of RuBisCO gene. (B) Comparison at order level.
At the supergroup level, the main contributors of CCF transcript pool were Bacillariophyta, Haptophyta, Cyanobacteria, Chlorophyta, and non-diatom Stramenopiles (O_Stramenopiles) (Fig. 2A). In the SUR layer at C6 station, Proteobacteria and Other Eukaryota (O_Eukaryota) contributed a small proportion of CCF transcripts, while other supergroups have similar proportions, although Haptophyta contributed slightly more than other supergroups. In the DCM layer of the shelf, the dominant CCF transcript contributor was diatoms (Bacillariophyta), which accounted for 68.60% of the total CCF potential. In the SUR layer of the slope, the main CCF transcript contributors were O_Stramenopiles, Cyanobacteria, and Haptophyta, which accounted for 36.46%, 27.15%, and 16.43% of total CCF potential, respectively. In this study, cyanobacteria contribution was the greatest in the slope_SUR environment. Bacillariophyta also contributed substantially to the CCF of the BOT layer at slope, accounting for 30.23%, and other stramenopile phyla overwhelmingly dominated the total CCF potential. For CCF-contributing Proteobacteria, only one species named Azoarcus kh32c was represented in the WAME data set in the entire study, and its RuBisCO expression was detected only in the BOT layer of the slope, contributing 0.04% to the total CCF potential.
For Bacillariophyta, the relative contributions to CCF potential in the surface layer and DCM layer in the shelf region were 11.24% and 68.60%, respectively. In the slope region, the relative contributions in the surface layer, DCM layer, and BOT were 5.88%, 15.95%, and 30.23%, respectively. For Cyanobacteria, the relative contributions to CCF potential in the surface layer and DCM layer in the shelf region were 13.34% and 0.74%, respectively. While in the slope region, the relative contributions to CCF potential in the surface layer, DCM layer, and BOT were 27.15%, 5.87%, and 1.23%, respectively. For Dinophyceae, the relative contributions to CCF potential in the surface layer and DCM layer in the shelf region were 18.37% and 3.30%, respectively. In the slope region, the relative contributions in the surface layer, DCM layer, and BOT were 6.00%, 1.36%, and 0.50%, respectively.
In general, at both study sites, the relative transcript contribution of Bacillariophyta to the CCF of plankton increased with depth, while the relative transcript contribution of Cyanobacteria and Dinophyta decreased with the increase of depth. At the same water depth, the proportion of CCF of Bacillariophyta and Dinophyta was higher at shelf than slope, while the proportion of CCF of Cyanobacteria and O_Stramenopiles was higher at slope than shelf. We also estimated the CCF contribution of major plankton at the order level. In total, no fewer than 117 orders of plankton were detected to actively express the genes involved in CCF, among which the top 10 plankton orders contributed 54.96%–76.84% of all measured transcripts in each sample. The main orders active in CCF were Bacillariales, Pelagomonadales, Prymnesiales, Mamiellales, and Naviculales (Fig. 2B). The proportion of Gymnodiniales, Dinophyta, and cyanobacteria subsection I in CCF decreased with the increase of depth, and the proportion of Naviculales in CCF increased with the increase of depth.
Furthermore, we examined the relationship of the CCF community structure and environmental factors of different planktons at the supergroup and order levels, respectively. At the supergroup level, the CCF transcript contribution of Bacillariophyta was negatively correlated with the concentration of P in seawater (Fig. 3A). The CCF transcript contribution of Dinophyta was positively correlated with temperature and negatively correlated with salinity and depth. At the order level, the CCF transcript contribution of Gymnodiniales, Isochrysidales, and Naviculales was significantly related to salinity, depth, and photosynthetically active radiation, respectively, while the contribution of Pelagomonadales to CCF transcript pool was positively correlated to N and Si concentrations and temperature and is negatively correlated to depth (Fig. 3B). Based on the gene expression data of light-harvesting proteins and nutrient transporters (Fig. S2), CCF potential in Suessiales, Mamiellales, and Prymnesiales was positively correlated with light energy harvesting and uptake of N and P nutrients. CCF potential of Bacillariales was positively correlated with the acquisition of light energy, P, and Si but not with N nutrient.
Fig 3.
The relationship between CCF potential and environmental factors of major supergroups (A) and orders (B) of plankton. The thickness of the line represents the correlation between environmental factors and taxon abundance, and the color of the lines represents the significance level of the correlation (see color scale and thickness scale on the upper and middle right). Fill color strength of the squares represents the correlation of selected major taxa, from orange (negative interaction), green, to gray (positive interaction) (see the color scale on the lower right). Asterisks indicate significance (*P < 0.05 and ***P < 0.01).
Correlation between CCF and endocytosis gene expressions
Many protists are mixotrophic, acquiring nutrients through phagocytosis either to support obligate photosynthesis or to promote proliferation with the aid of facultative photosynthesis (22, 23). For these two contrasting trophic groups of protists, a positive or negative correlation between the expression of endocytosis genes and that of carbon fixation genes, respectively, is expected. We analyzed the correlation between the expression of core genes in the carbon fixation pathway and the core genes of endocytosis (indicators of pinocytosis/phagotrophy) in each major CCF-active lineage. We found that the expression of the CCF core genes in Bacillariophyta, Chlorophyta, and Haptophyta was positively correlated with the expression of their endocytosis core genes. The correlation was negative in Dinophyta (Fig. 4). Although the overall two pathways show a specific relationship, it is possible that some of the individual genes in them deviate the overall pattern. For instance, within Chlorophyta, K00026 seems to have a slight negative relationship with all pinocytosis genes, while in Haptophyta, K00889 also has a negative relationship with all carbon fixation and other phagocytosis genes (Fig. 4).
Fig 4.
Relationship between the expression of core photosynthesis genes and that of core endocytosis (pinocytosis/phagocytosis) genes in major phytoplankton phyla. The relationships are shown in red frames. Fill color of the squares depicts the sign and degree of correlation according to the scale bar on the right. (A) Chlorophyta; (B) Bacillariophyta; (C) Haptophyta; and (D) Dinophyta. *P < 0.05. KO numbers represent genes, whose names can be found in KEGG (Table S5).
NCF in different plankton groups and environmental effects
NCF was characterized using the expression profiles of the currently recognized five NCF pathways (reductive citric acid cycle, Wood-Ljungdahl pathway, 3-hydroxypropionate bicycle, hydroxypropionate-hydroxybutyrate cycle, and dicarboxylate-hydroxybutyrate cycle). The results of our analyses showed that at both stations NCF was contributed by Proteobacteria, Bacteroidetes, Archaea, and other lineages of bacteria (O_Bacteria). The highest relative expression of NCF genes was in the surface layer of station C6, and the lowest was in the DCM layer at station C9 (Fig. S3A). At the order level, 45 orders of prokaryotes were found to express at least 15 of the NCF pathway genes, among which the main five orders were Flavobacteria, Alteromonales, Oceanospirillales, Pelagibacterales, and Rhodobacterales (Fig. 5). Flavobacteriales contributed more than half of the NCF transcripts on the surface of the C6 station, and its contribution to the NCF transcript pool at the two stations decreased with increasing depth, while the NCF transcript contribution of Alteromonadales, Oceanospirillales, Cytophagales, and Nitrospinales increased with increasing depth (Fig. 5).
Fig 5.

Relative expressional contribution to NCF of different orders of NCF bacteria in five water masses.
In the correlation analysis between NCF transcript contribution and environmental factors of the 10 major orders, we found that the NCF transcript contribution of Miscellaneous Alphaproteobacteria was positively correlated with temperature, depth, and N and Si concentrations. Nitrospinales’ NCF transcript contribution was positively correlated with salinity and PAR, while Cytophagales and Oceanospirillales’ NCF transcript contributions were affected by salinity. In addition, Pelagibacterales’ contribution to the NCF transcript pool appeared to be negatively affected by Si concentration (Fig. 6).
Fig 6.

Relationship between relative contribution (%) of NCF transcript pool by 10 major orders of NCF bacteria and environmental factors. The thickness of the line represents the correlation between environmental factors and taxon abundance, and the color of the lines represents the significance level of the correlation (see the color scale and thickness scale on the upper and middle right). Fill color strength of the squares represents the correlation of selected major taxa, from orange (negative interaction), green, to gray (positive interaction) (see the color scale on the lower right). Asterisks indicate significance (*P < 0.05; **0.05 < P < 0.01; and ***P < 0.01).
In addition, by calibrating with size-fraction RNA quantities from the same water sample, we compared the NCF transcript abundances of two size samples in each water layer and found that both large- and small-sized groups of plankton contributed approximately equally (Fig. S3B). However, the NCF transcript abundances of plankton in small-sized samples in the SUR layer of the C6 station and the DCM layer of the C9 station were slightly higher than that of large-sized samples, and the NCF transcript abundances of large-sized samples in other water depths were higher than that of small-sized samples (Fig. S3B).
The profile of rhodopsin and its relationship with NCF potential
A high diversity of proton-pump rhodopsin (PPR) was found to be actively expressed. In total, 1,774 rhodopsin unigenes were detected, which were mainly contributed by Proteobacteria, Dinophyta, and O_Bacteria. At the order level, Flavobacteriales and Pelagibacterales were the two most dominant orders that contributed to the expression of rhodopsins (Fig. S4). Among the top six orders of prokaryotes with the highest PPR expression levels, four were carbon fixers involved in NCF: Flavobacteriales, Alteromonadales, Pelagibacterales, and Rhodobacterales. Statistical analysis showed that the expression of PPR was positively correlated with the expression of carbon fixation genes for these four groups of bacteria, with R2 values ranging from 0.68 to 0.90 (Fig. 7). Furthermore, we used the same method to analyze 187 samples collected from 66 stations in Tara Oceans database (31) and found that the transcript abundance of NCF of Flavobacteria in these samples was significantly correlated with its PPR expression level (Fig. 8).
Fig 7.
The correlation between the relative expression of NCF genes and the expression of rhodopsin genes in four major orders of NCF bacteria.
Fig 8.

Correlation between the relative gene expression of NCF and the gene expression of rhodopsin in Flavobacteriales in the global ocean. (A) The atlas of NCF gene expression [log2 (TPM + 1)] and rhodopsin gene expression (TPM). (B) Linear correlation between the expression level of Flavobacteriales NCF and rhodopsin.
DISCUSSION
In this study, we used whole-assemblage metatranscriptome (WAME) sequencing to be able to compare prokaryotes with eukaryotes for common metabolic profiles or physiological functions. Furthermore, using RNA quantity-based calibration, data from separately sequenced size fractions from the same water sample could be integrated to reconstruct the assemblage gene expression profile. This approach enables the estimation of the metabolic contribution of each size group to the overall community in various physiological functions, as represented by the genes expressed in both size groups. For example, it allows for the comparison of free-living bacteria (in the 0.2–3 µm fraction) versus particle-associated (PA) bacteria (in the 3–200 µm fraction) in terms of NCF.
Using this integrative WAME approach, we obtained novel insights into the relative contributions of different lineages to CCF and NCF, environmental effects, and a potential energy source. As is true for any molecular ecological data, the relationship between gene expression and the metabolic activity supported by the encoded proteins may not be linear and could vary significantly between different organisms. Thus, we caution that the lineage-differential metabolic potentials still require experimental verification in the future. There are some potential methods that could be employed: (i) enzymatic assays to measure the activity of specific metabolic pathways or enzymes of interest. This can help validate the metabolic potentials inferred from gene expression data; (ii) stable isotope probing to trace the incorporation of specific isotopically labeled substrates into biomass. This can provide direct evidence of the metabolic activity and substrate utilization of different lineages in the community; and (iii) metabolomics analysis to measure the changes in metabolite profiles and concentrations. By comparing the metabolomes of different lineages, it is possible to validate their metabolic potentials and identify the key metabolites associated with specific functions. Nevertheless, insights obtained from metatranscriptomic studies have been proven highly valuable. For instance, this type of work has led to the discovery of PPR in dinoflagellates (1) and diatoms (7) as a source of energy to cope with nutrient limitation and trypsin in diatoms as a regulator of nitrogen-phosphorus stoichiometric homeostasis (32, 33). The results obtained in the present study shed light on how carbon fixation might vary between different lineages and might be differentially influenced by biotic and abiotic factors, providing bases on which to construct testable hypotheses for further research in the future.
Some hypotheses that can be derived from the results obtained in this study include (i) about biotic factors that influence the relative contribution of different lineages to carbon fixation, for example, it can be hypothesized that specific lineages of free-living (FL) bacteria (in the 0.2–3 µm fraction) may contribute more to NCF than particle-associated bacteria (in the 3–200 µm fraction) do due to the differences in their ecological niches and interactions with other organisms; (ii) abiotic factors that affect the metabolic activity and carbon fixation in different lineages, for instance, it can be hypothesized that environmental conditions such as nutrient availability, temperature, or light intensity may influence the relative contribution of certain lineages to the CCF and NCF pathways; (iii) about the potential that specific lineages may possess unique adaptations for efficient carbon fixation, for example, it can be hypothesized that certain eukaryotic lineages might have evolved specific molecular mechanisms, such as the presence of PPR, to cope with nutrient limitation and enhance their carbon fixation abilities; (iv) about the potential that interactions between different lineages influence carbon fixation dynamics, it can be hypothesized that the presence of specific lineages or the interactions between different organisms in a community may have a positive or negative effect on the carbon fixation rates of the community or the relative contribution of different lineages.
CCF transcript pool was differentially contributed by different lineages and affected by environmental factors
For CCF, the transcript level of CCF in the continental shelf area was higher than that in the slope area (Fig. 2). This suggests a potentially higher CCF activity in the shelf area, which might be attributed to higher nutrient concentrations there due to greater influences from more nutrient input than in the slope area (26, 34). Indeed, the CCF transcript abundance of the major CCF groups was in most cases positively correlated with nutrient concentrations (Fig. S2).
At the supergroup level, CCF potential was mainly contributed by Bacillariophyta, Cyanophyta, Chlorophyta, Haptophyta, and O_Stramenopiles in our samples overall. Northern SCS summer phytoplankton communities have been found to be dominated by Cyanobacteria, Bacillariophyta, Haptophyta, and Chlorophyta (35, 36). This is similar to the taxonomic profile of CCF plankton in our study. The CCF transcript contribution of plankton was, at least in part, correlated with their abundance, as is often found to be the case (37, 38). In the photic zone, the contribution of Bacillariophyta and Dinophyta to the total CCF of plankton was higher in the shelf area than in the slope area and higher at greater depths. On the contrary, the contribution of Cyanobacteria to the whole plankton’s CCF potential in the shelf area was lower than that in the slope area and decreased with the increase of depth. This trend is consistent with the general understanding that cyanobacteria are predominant primary producers in the major ocean basins (offshore), whereas diatoms and dinoflagellates dominate nearshore waters, due to contrasting nutrient availabilities.
The major CCF contributors at the taxonomic level of the order were generally similar between the shelf and slope areas, including Bacillariales, Pelagomonadales, Prymnesiales, Mamiellales, and Naviculales, but without any cyanobacterial order, suggesting cyanobacterial contributors to CCF were more spread out taxonomically. Despite the shelf versus slope similarity, the CCF transcript abundance of plankton in different orders varied with geographical location and depth. Bacillariales was most transcriptionally active in the shelf DCM layer, Mamiellales was most active in the slope DCM layer, and Dinophysales was most active in the SUR layer. This pattern might reflect niche differentiation among these orders, particularly due to their differential potential to acquire light energy and nutrients. For instance, diatoms favor high inorganic nutrients (39), whereas Mamiellales species prefer to be distributed in the subsurface layer with high chlorophyll a concentration (40). Dinoflagellates are more competitive in nutrient-poor environments due to propensity for mixotrophy, and they are versatile in scavenging nutrients (22). In addition, dinoflagellates possess PPR for efficiently harvesting light energy (1), which might help prevent photoinhibition that often occurs on sea surface phytoplankton (41). Indeed, dinoflagellates were actively expressing rhodopsin (Table S3) and phagotrophy genes in the present study.
Taken together, our results indicate that diverse taxa of both cyanobacteria and eukaryotic phytoplankton actively contributed to primary production in our study area, with no taxa being predominant in photosynthetic carbon fixation. Furthermore, the CCF potential of these contributing taxa was differentially influenced by the common environmental conditions such as nutrient and light availability. In addition, dinoflagellate CCF potential might, in part, be supported by phagotrophy as a source of nutrients and PPR as a supplemental source of energy. For instance, in laboratory experiments, Prorocentrum shikokuense actively upregulated the phagocytosis pathway under deficiency of inorganic nitrogen nutrients (22) and upregulated rhodopsin under phosphorus limitation (42) and during bloom outbreaks in phosphorus-poor waters (8).
NCF by both free-living and particle-associated microbes
In the present study, all the five NCF pathways were represented in the whole-assemblage metatranscriptomic data set, indicating that there were five different ways of bacterial NCF in the euphotic zone of the northern South China Sea. Based on the average expression level of complete pathway genes, we found that Flavobacteriales, Alteromonadales, Oceanospirillales, and Pelagibacterales were the dominant NCF contributors in the northern SCS for NCF (Fig. 5). Generally, prokaryotes in small-sized samples are FL, while prokaryotes in large-sized samples are PA, although some FL species might have been retained on the >3 µm fraction due to clogging during filtration. Studies in multiple sea areas have shown that the diversity and abundance of PA bacteria are generally higher than that of FL bacteria (43, 44). In this study, FL and PA prokaryotes contributed nearly equally except in C6_DCM and C9_SUR, where PA prokaryotes’ contribution was higher than FL prokaryotes. Our data suggest that the PA prokaryotes may contribute significantly to non-Calvin microbial carbon fixation in SCS, highlighting their potential importance in this process. Particles might provide a concentrated supply of substrates to support carbon fixation. Our findings in this study demonstrate the potential of non-photosynthetic bacteria to contribute to ocean production, which should be further investigated in other sea areas and verified in laboratory experiments.
Lineage-differential effects of environmental factors on CCF and NCF
Marine CCF is well known to be affected by environmental factors such as light cycle and light intensity (45, 46), salinity (47), temperature (48), N-, P-, and Si-nutrients (22, 42, 49), and trace metals (50). However, the effects differ between species, and it is challenging to investigate lineage-differential effects in a natural community in situ using ecological or physiological methods. In this study, we examined the relationship between the lineage-specific carbon fixation potential (gene expression) and environmental factors at supergroup and order levels and found differential responses of different lineages to environmental factors. The relative expressional contributions of cyanobacteria to CCF were found to be negatively correlated with the phosphate concentration (Fig. 2 and 3), which was probably because cyanobacteria are highly capable of utilizing dissolved organophosphate (51, 52). This is consistent with the previous observation that the abundance of cyanobacteria is positively correlated with the N:P ratio (53). The dinoflagellate proportion of the CCF transcript pool was positively correlated with temperature and significantly negatively correlated with salinity (Fig. 2 and 3). Dinoflagellates are generally known to favor warmer temperatures and have been reported to prefer lower salinity and higher temperature conditions (54). Lower salinities are often linked to freshwater input that brings in organic and inorganic nutrients (55).
Regarding NCF, the relative expressional contribution of Cytophagales, Nitrospinales, and Oceanospirillales was significantly positively correlated with salinity. This suggests that these three orders might be more NCF active in higher salinity environments than other major orders of NCF contributors. In contrast, some orders of bacteria did not show a clear trend in NCF transcript abundance in relationship to environmental factors. Therefore, the NCF potential of different species from the same order can respond differently to environmental factors. As NCF has been understudied, more research is required to unravel how salinity and other environmental factors influence this type of carbon fixation.
In sum, our data in concert indicate that the responses of carbon fixation to environmental factors may be different between phytoplankton and non-photosynthetic bacteria, as well as between taxa within each group of organisms. When species coverage of the genome and transcriptome databases increases in the future, the relationship between carbon fixation activity and environmental factors can be studied at a finer taxonomic level, and more trends of NCF with environmental factors might emerge.
Role of endocytosis in supporting CCF
Endocytosis is usually divided into three types: phagocytosis, pinocytosis, and receptor-mediated endocytosis due to the different sizes of the material and the mechanism (56). It is increasingly recognized that many phytoplankton are mixotrophic (hence named mixoplankton) (57–59). Diatoms can take in iron ions in seawater through pinocytosis and even capture silicon to support their growth (60–62). Many dinoflagellates can perform phagocytosis to cope with the lack of nutrients in the environment (8, 22). Viruses and commensal microorganisms enter the cells of protists through the endocytic pathway of the protistan hosts (63). Therefore, the endocytosis of these CCF microorganisms might serve as a supplementary nutritional mechanism to support their photosynthesis and carbon fixation or compensate for reduced photosynthetic carbon fixation due to nutrient deficiency. In these two scenarios, a positive and a negative correlation, respectively, is expected for carbon fixation gene expression and endocytosis gene expression. In this study, we analyzed the relationship between photosynthesis and endocytosis of Bacillariophyta, Dinophyta, Haptophyta, and Chlorophyta. The results revealed a positive correlation between the core genes of endocytosis and the core genes of photosynthesis in Bacillariophyta, Haptophyta, and Chlorophyta (Fig. 4). This suggests that these diatoms might be taking in iron and other nutrients through pinocytosis to support their photosynthesis (61). Besides, through phagocytosis, the haptophytes and chlorophytes might be obtaining essential growth factors to promote the photosynthetic growth of the CCF population (64). In sharp contrast, a negative correlation was found in dinoflagellates between the expression of phagocytosis core genes and that of photosynthesis core genes (Fig. 4). This is probably because many dinoflagellates are mixotrophic and use phagotrophy to compensate for the reduction of photosynthesis due to nutrient limitation in the subtropical ocean. Although the overall two pathways show a specific relationship, it is possible that some of the individual genes in them deviate the overall pattern. For instance, within Chlorophyta, K00026 seems to have a slight negative relationship with all pinocytosis genes, while in Haptophyta, K00889 also has a negative relationship with all carbon fixation and other phagocytosis genes. Malate dehydrogenase (K00026) in the carbon fixation pathway of Chlorophyta showed a slightly negative correlation with pinocytosis pathway genes. Malate dehydrogenase is involved in the process of energy metabolism, while pinocytosis requires energy expenditure. The negative correlation of the gene expression of CCF and pinocytosis may reflect the cellular allocation of energy utilization. When cells require more energy, gene expression of pinocytosis may be suppressed, while malate dehydrogenase expression is increased to improve energy supply. This may be the way of energy metabolism regulation in Chlorophyta, which warrants further investigation in the future. In haptophytes, K00889 encodes 1-phosphatidylinositol-4-phosphate 5-kinase (PIP5K), and its negative correlation with carbon fixation and other pinocytosis genes might be because carbon fixation and pinocytosis require energy and resources from the cell. PIP5K may affect the allocation of these limited resources within the cell, leading to negative correlations between its expression and the genes involved in carbon fixation or pinocytosis.
To obtain support from a wider seascape, we analyzed the correlations for chlorophytes and dinoflagellates using the metatranscriptomic data from the Tara Oceans expedition (https://tara-oceans.mio.osupytheas.fr/ocean-gene-atlas/). We found a similar positive correlation for chlorophytes between the expression of core genes in CCF and endocytosis (Fig. S5A). For dinoflagellates, however, the result indicated a positive correlation instead of a negative one (Fig. S5B). The disparity between our data and the Tara Oceans data for dinoflagellates is probably not so surprising, again because the global ocean sampling in the latter might have covered more photosynthetic species than our subtropical smaller geographic range. Alternatively, the wider range of environmental conditions in the Tara Oceans expedition may mask the relationships within a smaller range of environmental conditions in the northern South China Sea.
As the relationship between endocytosis and photosynthetic carbon fixation is still underexplored, further research is needed. In particular, direct evidence needs to come from laboratory experiments on isolates of these species or fieldwork using a combination of microscopic, isotope tracing, and omics analyses.
Rhodopsin as a potential energy source for NCF in bacteria
As discussed earlier, studies in recent years have revealed that many prokaryotes in the ocean are engaged in carbon fixation activities through five NCF pathways (65). However, the source of energy to support NCF remains elusive. While there is chemical energy from organic carbon metabolism or the energy from the oxidation of inorganic compounds (66), another potential source is light energy harnessed by PPR. This structurally very simple and hence highly efficient machinery includes all-trans retinal coupled with an opsin protein and absorbs light energy from the blue-green waveband to pump protons outside of the cell membrane, thus creating a proton gradient to drive the synthesis of ATP (67, 68). PPR is therefore a photosystem-independent solar energy converting mechanism. PPR is widespread in marine bacteria, in which the energy harnessed facilitates the survival and growth of the bacteria in nutrient-limited environments (69, 70). Therefore, rhodopsin expression was analyzed for the four NCF bacterial orders with the highest relative PPR expression: Flavobacteriales, Alteromonadales, Pelagibacterales, and Rhodobacterales. The results showed that the four orders with the highest NCF potential were also the four orders with the highest rhodopsin gene expression, and the rhodopsin expression level of these four orders exhibited significantly positive correlations with their NCF gene expression (Fig. 7). To explore whether the same correlations occur in other parts of the global ocean, we mined the metatranscriptome database of the Tara Oceans expedition. Indeed, the same result was found in Flavobacterium (Fig. 8). Based on all these results, it is likely that rhodopsin is a potential energy source for NCF in some lineages of NCF-active bacteria.
Conclusion
There has been an increase in large-scale studies using meta-omics to understand the metabolic performance of microorganisms in the ocean (13), but relatively few have dealt with lineage-specific carbon fixation, compared CCF with NCF, and examined their respective relationships with environmental effects. This is particularly true for the South China Sea, a large subtropical continental marginal sea. Our first attempt to catalog functional genes in the China Seas documented over 4.4 million unigenes. This will be useful baseline data for future research on plankton metabolic pathways in situ in this area and a potentially valuable resource for research in other parts of the global ocean. Previous metatranscriptomic studies have tended to focus on either eukaryotes or prokaryotes, and our whole-assemblage metatranscriptomics approach includes both groups. This allows us to make a comparative analysis of the contributions of prokaryotes and eukaryotes to common physiological processes at the level of gene expression and better understand the interaction between eukaryotes and prokaryotes. At the transcriptomic level, we have assessed the contribution of the major carbon-fixing plankters in different water layers in the shelf and slope areas of the northern South China Sea. The results indicate that Bacillariales, Pelagomonadales, Prymnesiales, Mamiellales, and Naviculales were the major contributors of the CCF transcript pool, while Flavobacteriales, Alteromonadales, Oceanospirillales, and Rhodobacterales were most transcriptionally active in NCF. CCF and NCF potential capacities were higher at the shelf and differentially affected by environmental factors depending on lineages. Furthermore, our data suggest that endocytosis may promote CCF in diatoms, chlorophytes, and haptophytes, while it may complement CCF in dinoflagellates. NCF in Flavobacteriales, Alteromonadales, Pelagibacterales, and Rhodobacterales appears to be energetically supported by PPR. Testable hypotheses can be developed based on our findings for verification or further inquiries of related questions. Moreover, this study sheds light on the physiological characteristics of different taxa of planktonic microorganisms in a subtropical marginal sea and provides a reference for future studies on the response of plankton to environmental factors in different geographical locations.
MATERIALS AND METHODS
Field sampling
During a research cruise in the period of 6–12 August 2016, 20 samples were collected at C6 (117.46 °E, 22.13 °N) and C9 (117.99 °E, 22.69 °N) stations from the northern South China Sea, separated into the picoplankton/nanoplankton (0.2–3 μm) and microplankton (3–200 μm) organismal size fractions. Stations C6 (77 m deep) and C9 (1,369 m deep) were located on the shelf and slope of the northern South China Sea, respectively. Sampling targeted three depth layers, which encompassed distinct physicochemical conditions: surface, deep chlorophyll maximum, and bottom of photic zone layers. For each sample, 40–50 L seawater from more than three Niskin bottles (there are 12 Niskin bottles on a CTD rosette, and the volume of each Niskin bottle is 12 L) was first filtered through 200 µm blotting cloth to remove large planktons. Then, the seawater was serially filtered through two size fractions (0.2–3 and 3–200 μm) using 142 mm polycarbonate membranes (Millipore, Billerica, MA, USA), each sample taking about 20 min. Finally, the sample-carrying filters were divided into four quarters: one being stored in a 2 mL tube containing lysis buffer (0.1 M EDTA and 1% SDS) for DNA work and the other in Trizol reagent (Molecular Research Center, Inc., USA) for RNA work. All samples were immediately frozen in liquid nitrogen onboard until processing upon return to the laboratory. Two replicate samples were collected from each condition; a total of 20 samples were collected (Fig. 1).
Measurements of physical and chemical parameters
Temperature, salinity, and depth were measured using a CTD profiler (SBE 17plus V2, Sea-Bird Scientific, USA). A series of nutrients (NO3−+ NO2−, PO43−, and SiO32-) were measured using a continuous flow analyzer, and all measurements were performed on triplicate samples. The light intensity of each water layer was calculated based on remote sensing data and a reported computational algorithm (71).
Sample processing and metatranscriptomic sequencing
RNA was extracted as previously described (32). Shortly, each of the samples was mixed with a 1:1 mixture of 0.5 and 0.1 mm-diameter zirconia/silica beads (Biospec, USA) and beat at a rate of 6 m/s on a FastPrep-24 bead mill (MP Biomedicals, USA) for three times to ensure complete cell breakage. Each bead beating step lasted for 60 s, and then the centrifuge tubes were placed on ice for a 3-min cold bath, before proceeding to the next bead beating step. RNA was extracted following the TRI Reagent protocol coupled with the Direct-zol RNA columns, essentially as reported previously (1). DNase 1 was used to remove DNA from the total RNA. RNA concentration was measured using a NanoDrop ND-2000 Spectrophotometer, and integrity was assessed using RNA 6000 Nano LabChip Kit in microcapillary electrophoresis (Agilent 2100 Bioanalyzer, Agilent Technologies, Australia). Samples with an RNA integrity number (RIN) ≥ 6.0 were used for metatranscriptome sequencing, and all of our samples in this study satisfied RIN ≥ 6.0. One microgram RNA from each sample was subject to ribosomal RNA removal using a Ribo-Zero rRNA Removal Kit (Plant Leaf), a Ribo-Zero rRNA Removal Kit (Plant Seed/Root) (Illumina, San Diego, CA, USA), and a TIANSeq rRNA Depletion Kit (Bacteria) (TIANGEN Biotech, China) for whole-assemblage (prokaryotes + eukaryotes) metatranscriptome sequencing. mRNA (i.e., rRNA-depleted RNA) was then fragmented with First Strand Synthesis Reaction Buffer and Random Primer Mix (2×) at 94°C for 10 min, and first-strand cDNA was synthesized using ProtoScript II Reverse Transcriptase, and the second-strand cDNA was synthesized using Second Strand Synthesis Enzyme Mix. The double-stranded cDNA was purified, end repaired, and ligated to adaptors. Fragments of about 400 bp (with an approximate insert size of 250 bp) were selected and sequenced on the Illumina HiSeq 4000 instrument (Illumina, San Diego, CA, USA). Each sample had two or three technical replicates and totally produced 881 Gb of raw data; each sequencing yielded 55,123,252–147,337,262 reads (Table S4), with a total of 5,667,140,490 reads.
Transcriptome analysis and gene expression quantification
Raw reads were processed by removing adaptors, reads with >5% ambiguous bases (N), and low-quality reads (>20% bases with quality value < 20) using SOAPnuke software (v1.5.6) (72) with parameter setting -| 20 -q 0.2-n 0.05 -Q 2. Reads from each sample contained <10% ribosomal RNA, which were discarded. For the remaining clean reads, we used Trinity (v2.0.6) (73) to perform de novo assembly for each sample, then used Tgicl (v2.0.6) (74) to remove redundancy for all samples of C6 and C9, respectively. The parameters of Trinity were as follows: (i) group-pairs_distance 500, (ii) min_contig length 200, (iii) min_kmer_cov 2, (iv) min-glue 2, (v) bfly_opts -V 5, (vi) edge-thr = 0.1, (vii) stderr, and (viii) SS_lib_type RF. The parameters of Tgicl were as follows: (i) a minimum of 95% identity between the contigs, (ii) a minimum of 35 overlapping bases, (iii) a minimum score of 35, and (iv) a maximum of 20 unmatched overhanging bases at the sequence ends. The unigene sets from all samples were clustered by using cd-hit (v4.8.1) (75) (-c 0.95 -aS 0.9) to generate the final unigene set (Unigene) for downstream analysis, and the sequence similarity was set at 95%. We used BLASTN (76) for NT annotation, blastx (76) or Di for NR, KOG, KEGG, and Swissprot annotation, blast2go (77) and NR annotation results for GO annotation, and InterProScan 5 (78) for InterPro annotation. The thresholds of BLASTN, blastx, blast2go, and interprocan5 were all e value < 10−5, and the top1 was selected as the annotation result. Bowtie2 (79) was used to match clean reads to Unigene, and then Salmon v0.9.1 (80) was used to calculate the gene expression levels of each sample. In the subsequent analysis, we excluded unigenes whose TPM (transcripts per kilobase of exon model per million mapped reads) was less than 0.1 across all 20 samples.
Identification of key genes in different samples
Key genes in carbon fixation were identified from the WAME data set based on functional annotations. The expression contributions of these genes were visualized using Circos (http://mkweb.bcgsc.ca/tableviewer/visualize/).
Standardization of gene expression in specific taxonomic groups
While comparing gene expression differences in the same taxonomic group between sample groups, gene expression at the taxonomic group level re-normalized as transcripts of the specific gene from the specific taxonomic group per million of total mapped reads of the taxonomic group was used.
Statistical analysis
We analyzed correlations for genes with non-zero expression in at least 6 out of our 20 samples. Correlation between the expression of CCF core genes and endocytosis core genes was visualized in R using packages ggplot (81). Pearson correlation was analyzed between the expression of NCF and that of PPR in bacterial lineages in which PPR expression was high. Mantel tests were conducted to determine the correlations between the gene expression and environmental factors using the “ggcor” package.
Quantification of the relative activity of carbon fixation
Based on the annotations of the KO database, we used the TPM value of the RuBisCO gene to represent the CCF activity of the lineage (supergroup) (82). Besides, we counted the sum of the TPM values of all unigenes on the NCF pathway for individual lineages of interest and calculated the average TPM value of each gene in the pathway. The average value was used to represent the NCF activity of the lineage (supergroup). The greater the TPM value, the greater the activity of carbon fixation.
Calibration for size fraction data set to enable comparison
Large (3–200 µm) and small (0.2–3 µm) size fractions were sequenced separately, making it impossible to directly compare the contributions of these two size groups to the community in various metabolic processes at the transcriptional level. To overcome the problem, we multiplied TPM of a functional gene in a size fraction with the total RNA extracted from the size fraction sample, then divided the product by the total RNA extracted from both size fractions from the same volume of water sample, i.e., TPMsmall × RNAsmall per L/(RNAsmall per L + RNAlarge per L) and TPMlarge × RNAlarge per L/(RNAsmall per L + RNAlarge per L). These allow the estimation of the contribution of a small-sized plankton or a large-sized plankton to the whole assemblage in the function represented by the gene (e.g., RuBisCO).
Search for rhodopsin expressed in the global ocean
To understand the expression of rhodopsin in the global ocean, we examined their occurrence in the Tara Oceans metatranscriptome data set. The rhodopsin domain (Pfam ID: PF01036) identified from the rhodopsin protein was used as a query to search in the data sets. For the Tara Oceans data set from all major oceans except the Arctic during 2009–2013, hmmsearch was carried out using the MATOU-v1 catalog with a threshold of 1e−5 to study their expression and biogeography.
ACKNOWLEDGMENTS
We thank Bangqin Huang, Xin Liu, Yanping Zhong, and Chentao Guo of Xiamen University for help with sampling and logistic support. We are also indebted to the crew and participants of the Yanping 2 research cruise for assistance in sampling. In addition, we thank Minglei Ma and Xin Lin for participating in various discussions of the study.
This work was supported by the National Key Research and Development Program of China grant 2016YFA0601202, the Marine S&T Fund of Shandong Province for Pilot National Laboratory for Marine Science and Technology (Qingdao) (No. 2018SDKJ0406-3), the Science and Technology Project of Qingdao West Coast District (No. 2020-3-3) to D.L., the Science Foundation of Donghai Laboratory (Grant number: DH-2022KF0209), and Science and Technology Plan Projects of Guangdong Province (No. 2021B1212050025).
Contributor Information
Senjie Lin, Email: senjie.lin@uconn.edu.
Erik F. Y. Hom, University of Mississippi, University, Mississippi, USA
DATA AVAILABILITY
The data that support the findings of this study have been deposited in CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) (https://db.cngb.org/cnsa/) under the accession number CNP0001483. The spreadsheet of the final unigene set alongside their taxonomic and functional annotations, and gene expression levels of each sample in terms of raw read counts, has been uploaded to our laboratory website (http://sampgr.org.cn/index.php/download) and is available for public download.
SUPPLEMENTAL MATERIAL
The following material is available online at https://doi.org/10.1128/spectrum.02177-23.
Fig. S1 to S5.
Tables S1 to S5.
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.
REFERENCES
- 1. Lin S, Zhang H, Zhuang Y, Tran B, Gill J. 2010. Spliced leader-based metatranscriptomic analyses lead to recognition of hidden genomic features in dinoflagellates. Proc Natl Acad Sci U S A 107:20033–20038. doi: 10.1073/pnas.1007246107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Cooper ED, Bentlage B, Gibbons TR, Bachvaroff TR, Delwiche CF. 2014. Metatranscriptome profiling of a harmful algal bloom. Harmful Algae 37:75–83. doi: 10.1016/j.hal.2014.04.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Zhuang Y, Zhang H, Hannick L, Lin S. 2015. Metatranscriptome profiling reveals versatile N-nutrient utilization, CO2 limitation, oxidative stress, and active toxin production in an Alexandrium fundyense bloom. Harmful Algae 42:60–70. doi: 10.1016/j.hal.2014.12.006 [DOI] [Google Scholar]
- 4. Alexander H, Jenkins BD, Rynearson TA, Dyhrman ST. 2015. Metatranscriptome analyses indicate resource partitioning between diatoms in the field. Proc Natl Acad Sci U S A 112:E2182–90. doi: 10.1073/pnas.1421993112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Ji N, Lin L, Li L, Yu L, Zhang Y, Luo H, Li M, Shi X, Wang D-Z, Lin S. 2018. Metatranscriptome analysis reveals environmental and diel regulation of a Heterosigma akashiwo (raphidophyceae) bloom. Environ Microbiol 20:1078–1094. doi: 10.1111/1462-2920.14045 [DOI] [PubMed] [Google Scholar]
- 6. Wohlrab S, Falcke JM, Lin S, Zhang H, Neuhaus S, Elferink S, Voss D, Zielinski O, John U. 2018. Metatranscriptome profiling indicates size-dependent differentiation in plastic and conserved community traits and functional diversification in dinoflagellate communities. Front Mar Sci 5. doi: 10.3389/fmars.2018.00358 [DOI] [Google Scholar]
- 7. Marchetti A, Schruth DM, Durkin CA, Parker MS, Kodner RB, Berthiaume CT, Morales R, Allen AE, Armbrust EV. 2012. Comparative metatranscriptomics identifies molecular bases for the physiological responses of phytoplankton to varying iron availability. Proc Natl Acad Sci U S A 109:E317–E325. doi: 10.1073/pnas.1118408109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Zhang S-F, Yuan C-J, Chen Y, Lin L, Wang D-Z. 2019. Transcriptomic response to changing ambient phosphorus in the marine dinoflagellate Prorocentrum donghaiense. Sci Total Environ 692:1037–1047. doi: 10.1016/j.scitotenv.2019.07.291 [DOI] [PubMed] [Google Scholar]
- 9. Bertrand EM, McCrow JP, Moustafa A, Zheng H, McQuaid JB, Delmont TO, Post AF, Sipler RE, Spackeen JL, Xu K, Bronk DA, Hutchins DA, Allen AE. 2015. Phytoplankton-bacterial interactions mediate micronutrient colimitation at the coastal Antarctic sea ice edge. Proc Natl Acad Sci U S A 112:9938–9943. doi: 10.1073/pnas.1501615112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Kolody BC, McCrow JP, Allen LZ, Aylward FO, Fontanez KM, Moustafa A, Moniruzzaman M, Chavez FP, Scholin CA, Allen EE, Worden AZ, Delong EF, Allen AE. 2019. Diel transcriptional response of a California current plankton microbiome to light, low iron, and enduring viral infection. ISME J 13:2817–2833. doi: 10.1038/s41396-019-0472-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Kolody BC, Smith SR, Zeigler Allen L, McCrow JP, Moustafa A, Shi D, Hopkinson BM, Morel FMM, Ward BB, Allen AE. 2022. Nitrogen and iron availability drive metabolic remodeling and natural selection of diverse phytoplankton during experimental upwelling. mSystems 7:e0072922. doi: 10.1128/msystems.00729-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Yu L, Li T, Li H, Ma M, Li L, Lin S. 2023. In situ molecular ecological analyses illuminate distinct factors regulating formation and demise of a harmful dinoflagellate bloom. Microbiol Spectr 11:e0515722. doi: 10.1128/spectrum.05157-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Carradec Q, Pelletier E, Da Silva C, Alberti A, Seeleuthner Y, Blanc-Mathieu R, Lima-Mendez G, Rocha F, Tirichine L, Labadie K, et al. 2018. A global ocean atlas of eukaryotic genes. Nat Commun 9:373. doi: 10.1038/s41467-017-02342-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Cohen NR, McIlvin MR, Moran DM, Held NA, Saunders JK, Hawco NJ, Brosnahan M, DiTullio GR, Lamborg C, McCrow JP, Dupont CL, Allen AE, Saito MA. 2021. Dinoflagellates alter their carbon and nutrient metabolic strategies across environmental gradients in the central Pacific Ocean. Nat Microbiol 6:173–186. doi: 10.1038/s41564-020-00814-7 [DOI] [PubMed] [Google Scholar]
- 15. Zehr JP, Kudela RM. 2009. Photosynthesis in the open ocean. Science 326:945–946. doi: 10.1126/science.1181277 [DOI] [PubMed] [Google Scholar]
- 16. Strzepek RF, Harrison PJ. 2004. Photosynthetic architecture differs in coastal and oceanic diatoms. Nature 431:689–692. doi: 10.1038/nature02954 [DOI] [PubMed] [Google Scholar]
- 17. Wang Z, Guo F, Liu L, Zhang T. 2014. Evidence of carbon fixation pathway in a bacterium from candidate phylum SBR1093 revealed with genomic analysis. PLoS One 9:e109571. doi: 10.1371/journal.pone.0109571 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Pachiadaki MG, Sintes E, Bergauer K, Brown JM, Record NR, Swan BK, Mathyer ME, Hallam SJ, Lopez-Garcia P, Takaki Y, Nunoura T, Woyke T, Herndl GJ, Stepanauskas R. 2017. Major role of nitrite-oxidizing bacteria in dark ocean carbon fixation. Science 358:1046–1051. doi: 10.1126/science.aan8260 [DOI] [PubMed] [Google Scholar]
- 19. Huang J-M, Baker BJ, Li J-T, Wang Y. 2019. New microbial lineages capable of carbon fixation and nutrient cycling in deep-sea sediments of the northern South China Sea. Appl Environ Microbiol 85:e00523-19. doi: 10.1128/AEM.00523-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Baltar F, Herndl GJ. 2019. Ideas and perspectives: is dark carbon fixation relevant for oceanic primary production estimates? Biogeosciences 16:3793–3799. doi: 10.5194/bg-16-3793-2019 [DOI] [Google Scholar]
- 21. Sheik CS, Jain S, Dick GJ. 2014. Metabolic flexibility of enigmatic SAR324 revealed through metagenomics and metatranscriptomics. Environ Microbiol 16:304–317. doi: 10.1111/1462-2920.12165 [DOI] [PubMed] [Google Scholar]
- 22. Li H, Li L, Yu L, Yang X, Shi X, Wang J, Li J, Lin S. 2021. Transcriptome profiling reveals versatile dissolved organic nitrogen utilization, mixotrophy, and N conservation in the dinoflagellate Prorocentrum shikokuense under N deficiency. Sci Total Environ 763:143013. doi: 10.1016/j.scitotenv.2020.143013 [DOI] [PubMed] [Google Scholar]
- 23. Chan Y-F, Chiang K-P, Ku Y, Gong G-C. 2019. Abiotic and biotic factors affecting the ingestion rates of mixotrophic nanoflagellates (Haptophyta). Microb Ecol 77:607–615. doi: 10.1007/s00248-018-1249-2 [DOI] [PubMed] [Google Scholar]
- 24. Jiwarungrueangkul T, Liu Z, Zhao Y. 2019. Terrigenous sediment input responding to sea level change and East Asian monsoon evolution since the last deglaciation in the southern South China Sea. Global planetary change 174:127–137. doi: 10.1016/j.gloplacha.2019.01.011 [DOI] [Google Scholar]
- 25. Wu Q, Colin C, Liu Z, Douville E, Dubois-Dauphin Q, Frank N. 2015. New insights into hydrological exchange between the South China Sea and the Western Pacific Ocean based on the Nd isotopic composition of seawater. Deep Sea Res Part II Top Stud Oceanogr 122:25–40. doi: 10.1016/j.dsr2.2015.11.005 [DOI] [Google Scholar]
- 26. Cai W-J, Dai M, Wang Y, Zhai W, Huang T, Chen S, Zhang F, Chen Z, Wang Z. 2004. The biogeochemistry of inorganic carbon and nutrients in the Pearl River estuary and the adjacent Northern South China Sea. Cont Shelf Res 24:1301–1319. doi: 10.1016/j.csr.2004.04.005 [DOI] [Google Scholar]
- 27. Yin K, Harrison PJ, Broom M, Chung CH. 2011. Ratio of nitrogen to phosphorus in the Pearl River and effects on the estuarine coastal waters: nutrient management strategy in Hong Kong. Phys Chem Earth 36:411–419. doi: 10.1016/j.pce.2010.04.014 [DOI] [Google Scholar]
- 28. Zhang J, Li H, Xuan J, Wu Z, Yang Z, Wiesner MG, Chen J. 2019. Enhancement of mesopelagic sinking particle fluxes due to upwelling, aerosol deposition, and monsoonal influences in the northwestern South China Sea. JGR Oceans 124:99–112. doi: 10.1029/2018JC014704 [DOI] [Google Scholar]
- 29. Zhou L, Tan Y, Huang L, Hu Z, Ke Z. 2015. Seasonal and size-dependent variations in the phytoplankton growth and microzooplankton grazing in the southern South China Sea under the influence of the East Asian monsoon. Biogeosciences 12:6809–6822. doi: 10.5194/bg-12-6809-2015 [DOI] [Google Scholar]
- 30. Dai M, Cao Z, Guo X, Zhai W, Liu Z, Yin Z, Xu Y, Gan J, Hu J, Du C. 2013. Why are some marginal seas sources of atmospheric CO2? Geophys Res Lett 40:2154–2158. doi: 10.1002/grl.50390 [DOI] [Google Scholar]
- 31. Sunagawa S, Coelho LP, Chaffron S, Kultima JR, Labadie K, Salazar G, Djahanschiri B, Zeller G, Mende DR, Alberti A, et al. 2015. Structure and function of the global ocean microbiome. Science 348:1261359. doi: 10.1126/science.1261359 [DOI] [PubMed] [Google Scholar]
- 32. Zhang Y, Lin X, Shi X, Lin L, Luo H, Li L, Lin S. 2019. Metatranscriptomic signatures associated with phytoplankton regime shift from diatom dominance to a dinoflagellate bloom. Front Microbiol 10:590. doi: 10.3389/fmicb.2019.00590 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. You Y, Sun X, Ma M, He J, Li L, Porto FW, Lin S. 2022. Trypsin is a coordinate regulator of N and P nutrients in marine phytoplankton. Nat Commun 13:4022. doi: 10.1038/s41467-022-31802-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Ladd C, Stabeno P, Cokelet ED. 2005. A note on cross-shelf exchange in the northern Gulf of Alaska. Deep Sea Res Part II Top Stud Oceanogr 52:667–679. doi: 10.1016/j.dsr2.2004.12.022 [DOI] [Google Scholar]
- 35. Wang Z, Zhao J, Zhang Y, Cao Y. 2009. Phytoplankton community structure and environmental parameters in aquaculture areas of Daya Bay, South China Sea. J Environ Sci (China) 21:1268–1275. doi: 10.1016/s1001-0742(08)62414-6 [DOI] [PubMed] [Google Scholar]
- 36. Huang B, Hu J, Xu H, Cao Z, Wang D. 2010. Phytoplankton community at warm eddies in the northern South China Sea in winter 2003/2004. Deep Sea Res Part II Top Stud Oceanogr 57:1792–1798. doi: 10.1016/j.dsr2.2010.04.005 [DOI] [Google Scholar]
- 37. Tilman D, Isbell F, Cowles JM. 2014. Biodiversity and ecosystem functioning. Annu Rev Ecol Evol Syst 45:471–493. doi: 10.1146/annurev-ecolsys-120213-091917 [DOI] [Google Scholar]
- 38. Nurfadillah N, Tarina S, Miswar E, Dewiyanti I, Agustina S. 2019. Relationship of primary productivity and phytoplankton abundance in Muara Kuala Raja, Bireuen district, Aceh, p 012024. IOP Publishing [Google Scholar]
- 39. Reynolds CS. 2006. The ecology of phytoplankton. Cambridge University Press, Cambridge.doi: 10.1017/CBO9780511542145 [DOI] [Google Scholar]
- 40. Choi DH, An SM, Chun S, Yang EC, Selph KE, Lee CM, Noh JH. 2016. Dynamic changes in the composition of photosynthetic picoeukaryotes in the northwestern Pacific Ocean revealed by high-throughput tag sequencing of plastid 16S rRNA genes. FEMS Microbiol Ecol 92:fiv170. doi: 10.1093/femsec/fiv170 [DOI] [PubMed] [Google Scholar]
- 41. Mosharov SA, Stupnikova AN, Zyulyaeva YA. 2014. Phytoplankton photoadaptation to photoinhibition in the tropical part of the Atlantic ocean. Dokl Biol Sci 454:26–28. doi: 10.1134/S0012496614010037 [DOI] [PubMed] [Google Scholar]
- 42. Shi X, Lin X, Li L, Li M, Palenik B, Lin S. 2017. Transcriptomic and microRNAomic profiling reveals multi-faceted mechanisms to cope with phosphate stress in a dinoflagellate. ISME J 11:2209–2218. doi: 10.1038/ismej.2017.81 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Suzuki S, Kaneko R, Kodama T, Hashihama F, Suwa S, Tanita I, Furuya K, Hamasaki K. 2017. Comparison of community structures between particle-associated and free-living prokaryotes in tropical and subtropical Pacific Ocean surface waters. J Oceanogr 73:383–395. doi: 10.1007/s10872-016-0410-0 [DOI] [Google Scholar]
- 44. Rieck A, Herlemann DPR, Jürgens K, Grossart H-P. 2015. Particle-associated differ from free-living bacteria in surface waters of the Baltic Sea. Front Microbiol 6:1297. doi: 10.3389/fmicb.2015.01297 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Jacob-Lopes E, Scoparo CHGLacerda LMCFFranco TT. 2009. Effect of light cycles (night/day) on CO2 fixation and biomass production by microalgae in photobioreactors. Chem Eng Process Process Intensif 48:306–310. doi: 10.1016/j.cep.2008.04.007 [DOI] [Google Scholar]
- 46. Harun I, Yahya L, Chik MN, Kadir NNA, Pang M. 2014. Effects of natural light dilution on microalgae growth. IJCEA 5:112–116. doi: 10.7763/IJCEA.2014.V5.362 [DOI] [Google Scholar]
- 47. Sudhir P, Murthy SDS. 2004. Effects of salt stress on basic processes of photosynthesis. Photosynt 42:481–486. doi: 10.1007/S11099-005-0001-6 [DOI] [Google Scholar]
- 48. Watanabe Y, Nishihara GN, Tokunaga S, Terada R. 2014. Effect of irradiance and temperature on the photosynthesis of a cultivated red alga, Pyropia tenera (= Porphyra tenera), at the southern limit of distribution in Japan. Phycological res 62:187–196. doi: 10.1111/pre.12053 [DOI] [Google Scholar]
- 49. Kristiansen S, Hoell EE. 2002. The importance of silicon for marine production, p 25–28. In Sustainable increase of marine harvesting: fundamental mechanisms and new concepts: proceedings of the 1st Maricult conference held in Trondheim, Norway, 25–28 June 2000. Springer, Dordrecht, Netherlands. [Google Scholar]
- 50. Li T, Lin X, Yu L, Lin S, Rodriguez IBHo TY. 2020. RNA-seq profiling of Fugacium kawagutii reveals strong responses in metabolic processes and symbiosis potential to deficiencies of iron and other trace metals. Sci Total Environ 705:135767. doi: 10.1016/j.scitotenv.2019.135767 [DOI] [PubMed] [Google Scholar]
- 51. Letscher RT, Moore JK. 2015. Preferential remineralization of dissolved organic phosphorus and non‐redfield DOM dynamics in the global ocean: impacts on marine productivity, nitrogen fixation, and carbon export. Global Biogeochem Cycles 29:325–340. doi: 10.1002/2014GB004904 [DOI] [Google Scholar]
- 52. Dyhrman ST, Chappell PD, Haley ST, Moffett JW, Orchard ED, Waterbury JB, Webb EA. 2006. Phosphonate utilization by the globally important marine diazotroph Trichodesmium. Nature 439:68–71. doi: 10.1038/nature04203 [DOI] [PubMed] [Google Scholar]
- 53. Tian C, Pei H, Hu W, Xie J. 2012. Variation of cyanobacteria with different environmental conditions in Nansi Lake, China. J Environ Sci 24:1394–1402. doi: 10.1016/S1001-0742(11)60964-9 [DOI] [PubMed] [Google Scholar]
- 54. Tsuchiya K, Kuwahara VS, Yoshiki TM, Nakajima R, Shimode S, Kikuchi T, Toda T. 2017. Response of phytoplankton and enhanced biogeochemical activity to an episodic typhoon event in the coastal waters of Japan. Estuar Coast Shelf Sci 194:30–39. doi: 10.1016/j.ecss.2017.05.019 [DOI] [Google Scholar]
- 55. Bharathi MD, Sarma VVSS, Ramaneswari K. 2018. Intra-annual variations in phytoplankton biomass and its composition in the tropical estuary: influence of river discharge. Mar Pollut Bull 129:14–25. doi: 10.1016/j.marpolbul.2018.02.007 [DOI] [PubMed] [Google Scholar]
- 56. Wileman T, Harding C, Stahl P. 1985. Receptor-mediated endocytosis. Biochem J 232:1–14. doi: 10.1042/bj2320001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Jeong HJ, Yoo YD, Park JY, Song JY, Kim ST, Lee SH, Kim KY, Yih WH. 2005. Feeding by phototrophic red-tide dinoflagellates: five species newly revealed and six species previously known to be mixotrophic. Aquat Microb Ecol 40:133–150. doi: 10.3354/ame040133 [DOI] [Google Scholar]
- 58. Burkholder JM, Glibert PM, Skelton HM. 2008. Mixotrophy, a major mode of nutrition for harmful algal species in eutrophic waters. Harmful Algae 8:77–93. doi: 10.1016/j.hal.2008.08.010 [DOI] [Google Scholar]
- 59. Stoecker DK, Hansen PJ, Caron DA, Mitra A. 2017. Mixotrophy in the marine plankton. Ann Rev Mar Sci 9:311–335. doi: 10.1146/annurev-marine-010816-060617 [DOI] [PubMed] [Google Scholar]
- 60. Thamatrakoln K, Kustka AB. 2009. When to say when: can excessive drinking explain silicon uptake in diatoms? Bioessays 31:322–327. doi: 10.1002/bies.200800185 [DOI] [PubMed] [Google Scholar]
- 61. Kazamia E, Sutak R, Paz-Yepes J, Dorrell RG, Vieira FRJ, Mach J, Morrissey J, Leon S, Lam F, Pelletier E, Camadro J-M, Bowler C, Lesuisse E. 2018. Endocytosis-mediated siderophore uptake as a strategy for Fe acquisition in diatoms. Sci Adv 4:eaar4536. doi: 10.1126/sciadv.aar4536 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Annenkov VV, Gordon R, Zelinskiy SN, Danilovtseva EN. 2020. The probable mechanism for silicon capture by diatom algae: assimilation of polycarbonic acids with diatoms-is endocytosis a key stage in building of siliceous frustules? J Phycol 56:1729–1737. doi: 10.1111/jpy.13062 [DOI] [PubMed] [Google Scholar]
- 63. Mukherjee S, Ghosh RN, Maxfield FR. 1997. Endocytosis. Physiol Rev 77:759–803. doi: 10.1152/physrev.1997.77.3.759 [DOI] [PubMed] [Google Scholar]
- 64. Stoecker DK. 1998. Conceptual models of mixotrophy in planktonic protists and some ecological and evolutionary implications. Eur J Protistol 34:281–290. doi: 10.1016/S0932-4739(98)80055-2 [DOI] [Google Scholar]
- 65. Kumar M, Sundaram S, Gnansounou E, Larroche C, Thakur IS. 2018. Carbon dioxide capture, storage and production of biofuel and biomaterials by bacteria: a review. Bioresour Technol 247:1059–1068. doi: 10.1016/j.biortech.2017.09.050 [DOI] [PubMed] [Google Scholar]
- 66. Berg IA, Kockelkorn D, Ramos-Vera WH, Say RF, Zarzycki J, Hügler M, Alber BE, Fuchs G. 2010. Autotrophic carbon fixation in archaea. Nat Rev Microbiol 8:447–460. doi: 10.1038/nrmicro2365 [DOI] [PubMed] [Google Scholar]
- 67. Béjà O, Aravind L, Koonin EV, Suzuki MT, Hadd A, Nguyen LP, Jovanovich SB, Gates CM, Feldman RA, Spudich JL, Spudich EN, DeLong EF. 2000. Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science 289:1902–1906. doi: 10.1126/science.289.5486.1902 [DOI] [PubMed] [Google Scholar]
- 68. Choi AR, Shi L, Brown LS, Jung K-H. 2014. Cyanobacterial light-driven proton pump, Gloeobacter rhodopsin: complementarity between rhodopsin-based energy production and photosynthesis. PLoS One 9:e110643. doi: 10.1371/journal.pone.0110643 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Ernst OP, Lodowski DT, Elstner M, Hegemann P, Brown LS, Kandori H. 2014. Microbial and animal rhodopsins: structures, functions, and molecular mechanisms. Chem Rev 114:126–163. doi: 10.1021/cr4003769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Govorunova EG, Sineshchekov OA, Li H, Spudich JL. 2017. Microbial rhodopsins: diversity, mechanisms, and optogenetic applications. Annu Rev Biochem 86:845–872. doi: 10.1146/annurev-biochem-101910-144233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Lee Z, Du K, Arnone R, Liew S, Penta B. 2005. Penetration of solar radiation in the upper ocean: a numerical model for oceanic and coastal waters. J Geophys Res 110. doi: 10.1029/2004JC002780 [DOI] [Google Scholar]
- 72. Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, Zhang X, Wang J, Yang H, Fang L, Chen Q. 2018. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:1–6. doi: 10.1093/gigascience/gix120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, MacManes MD, Ott M, Orvis J, Pochet N, Strozzi F, Weeks N, Westerman R, William T, Dewey CN, Henschel R, LeDuc RD, Friedman N, Regev A. 2013. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc 8:1494–1512. doi: 10.1038/nprot.2013.084 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J. 2003. TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19:651–652. doi: 10.1093/bioinformatics/btg034 [DOI] [PubMed] [Google Scholar]
- 75. Fu L, Niu B, Zhu Z, Wu S, Li W. 2012. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28:3150–3152. doi: 10.1093/bioinformatics/bts565 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
- 77. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. 2005. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21:3674–3676. doi: 10.1093/bioinformatics/bti610 [DOI] [PubMed] [Google Scholar]
- 78. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. 2005. InterProScan: protein domains identifier. Nucleic Acids Res 33:W116–20. doi: 10.1093/nar/gki442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. doi: 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. 2017. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 14:417–419. doi: 10.1038/nmeth.4197 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Wickham H. 2006. An implementation of the grammar of graphics in R: ggplot, p 182 [Google Scholar]
- 82. John DE, Wang ZA, Liu X, Byrne RH, Corredor JE, López JM, Cabrera A, Bronk DA, Tabita FR, Paul JH. 2007. Phytoplankton carbon fixation gene (RuBisCO) transcripts and air-sea CO2 flux in the Mississippi river plume. ISME J 1:517–531. doi: 10.1038/ismej.2007.70 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Fig. S1 to S5.
Tables S1 to S5.
Data Availability Statement
The data that support the findings of this study have been deposited in CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) (https://db.cngb.org/cnsa/) under the accession number CNP0001483. The spreadsheet of the final unigene set alongside their taxonomic and functional annotations, and gene expression levels of each sample in terms of raw read counts, has been uploaded to our laboratory website (http://sampgr.org.cn/index.php/download) and is available for public download.




