ABSTRACT
The Amazon River basin sustains dramatic hydrochemical gradients defined by three water types: white, clear, and black waters. In black water, important loads of allochthonous humic dissolved organic matter (DOM) result from the bacterioplankton degradation of plant lignin. However, the bacterial taxa involved in this process remain unknown, since Amazonian bacterioplankton has been poorly studied. Its characterization could lead to a better understanding of the carbon cycle in one of the Earth’s most productive hydrological systems. Our study characterized the taxonomic structure and functions of Amazonian bacterioplankton to better understand the interplay between this community and humic DOM. We conducted a field sampling campaign comprising 15 sites distributed across the three main Amazonian water types (representing a gradient of humic DOM), and a 16S rRNA metabarcoding analysis based on bacterioplankton DNA and RNA extracts. Bacterioplankton functions were inferred using 16S rRNA data in combination with a tailored functional database from 90 Amazonian basin shotgun metagenomes from the literature. We discovered that the relative abundances of fluorescent DOM fractions (humic-, fulvic-, and protein-like) were major drivers of bacterioplankton structure. We identified 36 genera for which the relative abundance was significantly correlated with humic DOM. The strongest correlations were found in the Polynucleobacter, Methylobacterium, and Acinetobacter genera, three low abundant but omnipresent taxa that possessed several genes involved in the main steps of the β-aryl ether enzymatic degradation pathway of diaryl humic DOM residues. Overall, this study identified key taxa with DOM degradation genomic potential, the involvement of which in allochthonous Amazonian carbon transformation and sequestration merits further investigation.
IMPORTANCE The Amazon basin discharge carries an important load of terrestrially derived dissolved organic matter (DOM) to the ocean. The bacterioplankton from this basin potentially plays important roles in transforming this allochthonous carbon, which has consequences on marine primary productivity and global carbon sequestration. However, the structure and function of Amazonian bacterioplanktonic communities remain poorly studied, and their interactions with DOM are unresolved. In this study, we (i) sampled bacterioplankton in all the main Amazon tributaries, (ii) combined information from the taxonomic structure and functional repertory of Amazonian bacterioplankton communities to understand their dynamics, (iii) identified the main physicochemical parameters shaping bacterioplanktonic communities among a set of >30 measured environmental parameters, and (iv) characterized how bacterioplankton structure varies according to the relative abundance of humic compounds, a by-product from the bacterial degradation process of allochthonous DOM.
KEYWORDS: Acinetobacter, Methylobacterium, Polynucleobacter, bacterioplankton, carbon cycle, dissolved organic carbon, dissolved organic matter, humic acids, microbiome
INTRODUCTION
The Amazon basin occupies almost 38% of continental South America (1) and holds 12% to 20% of the planet’s liquid freshwater (2). Its discharge carries a significant load of terrestrially derived nutrients to the ocean, which have global consequences on marine primary productivity and global carbon sequestration (3, 4). The Amazon River basin sustains dramatic hydrochemical and ecological gradients that impose physiological constraints upon its aquatic communities (5–9). Its three major tributaries, the Rio Solimões, Rio Negro, and Rio Tapajos, represent distinct water “types” or “colors” that harbor contrasting physicochemical profiles (10). The white water from the Rio Solimões has an Andean origin, is eutrophic (nutrient- and ion-rich), turbid, and has a circumneutral pH (10–13). The crystalline “clear water” from the Rio Tapajos has a circumneutral pH, low conductivity, and a reduced amount of suspended material associated with its pre-Cambrian rock origin draining the Brazilian shield. Last, the black water of the Rio Negro stems from the craton born drainage of the Guyana shield (14) and largely contrasts with the aforementioned tributaries; it is oligotrophic (nutrient- and ion-poor) and contains a high quantity of dissolved organic matter (DOM), typically 8 to 12 mg C/liter (15). Black water DOM is enriched in chromophoric dissolved organic matter (CDOM), a fraction of DOM that absorbs light. While the measure of fluorescent CDOM (or fluorescent dissolved organic matter [FDOM]) can be used to characterize the nature of DOM in a system, FDOM only represents a fraction of the total DOM, varying between <10% and 70% depending on the environment (16, 17). FDOM from the Rio Negro has a distinctive allochthonous origin, in comparison with FDOM from the Rio Solimões or Rio Tapajos (18).
Overall, the Amazon basin has very low rates of phytoplankton production (19), suggesting that terrestrial allochthonous DOM is an important carbon source for bacterial growth (20, 21). This is especially true in the Rio Negro’s black water, where most of the DOM is a by-product of the lignin degradation process associated with plant decomposition on the riverbed, fueled by the important release of plant material during the seasonal forest flooding (10, 22, 23). The DOM from the Rio Negro is characterized by a complex mixture of humic aromatic compounds, which mostly originate from the first step of the lignin degradation process, the microbial oxidation of lignin-derived compounds (Fig. 1) (22, 24). While fungi have been particularly studied for their involvement in this process in the past, some studies are starting to unravel the lignin and DOM degradation machinery of bacterioplankton (25, 26).
Despite its relevance for global-scale elemental cycling and primary production processes, there is a limited understanding of the taxonomical and functional structure of the Amazon River bacterioplankton. A few studies have focused on these bacterial communities; however, most of them did not sample in different water types (21–29). Several of the aforementioned studies also included a limited number of habitats sampled (e.g., only one site in reference 23) or were focused on the dynamics of bacterioplankton in the plume downstream of the Amazonian River (25–27) rather than the communities in upstream black water systems per se. A previous study (21) found that the genera Ramlibacter, Planktophila, Methylopumilus, Limnohabitans, and Polynucleobacter were enriched in DOM degradation pathways along the Amazon River; however, the abundance of these genera in accordance with humic DOM and whether they are transcriptionally active or not remain unknown.
In this study, we aimed to identify the most important environmental variables shaping the Amazonian bacterioplankton community structure and inferred functional profile. Given that bacterioplankton communities have been shown to be involved in the transformation of DOM (30–32), we hypothesized that the optical characteristics of DOM in particular FDOM, such as humic-like materials, would be one of the most important drivers of bacterioplankton communities. In particular, we expected that the relative abundance of humic FDOM would significantly affect bacterioplankton communities as the leaching of protons from humic molecules is responsible for acidifying Amazonian black water ecosystems (pH 2.8 to 5), making them physiologically challenging environments that shape the structure of resident aquatic communities (33, 34). Second, we aimed to better understand potential interactions between humic FDOM and the Amazonian bacterioplankton. We hypothesized that the taxa correlated with the abundance of humic FDOM would possess genes potentially involved in its pathways of degradation. To achieve these objectives, we performed a 16S rRNA metabarcoding analysis based on bacterioplankton DNA and RNA extracts to characterize the taxonomic structure of global bacterioplankton (from DNA extracts) and transcriptionally active bacterioplankton (from RNA extracts). In parallel, we assembled 90 Amazonian basin shotgun metagenomes from the literature, to build a tailored functional database used to infer the bacterioplankton functions from the 16S rRNA data.
RESULTS
The Amazonian bacterioplankton showed a rich abundance of Proteobacteria, Actinobacteria, and Cyanobacteria (Fig. S1). While Shannon α-diversity did not significantly differ between water types (P > 0.05, mean Shannon diversity between 6 and 7 for all water types) (Fig. S2 to S4), β-diversity analyses showed that bacterioplankton communities significantly clustered according to water type in distance-based redundancy analyses (RDAs) based on the taxonomic structure and inferred functions of these communities (RDA, P < 0.001, F = 2.95 to 20.4) (Fig. 2). These RDAs (Fig. 2) suggest that white and clear water communities were similar and differed from black water communities. This result was confirmed by the error rates from the confusion matrix of the random forest classification; the rates of misclassification of clear water samples (0.08 to 0.58) were always higher between clear and white than between clear and black water samples (0 to 0.31) (Table S1). The RDAs in Fig. 2a and b show that a higher proportion of the total variance is explained by axes 1 and 2 for the global bacterioplankton (27.31%) than for the transcriptionally active bacterioplankton (13.01%).
The environmental parameters that significantly influenced the taxonomic structure of global bacterioplankton, transcriptionally active bacterioplankton, and the inferred functional repertory of bacterioplankton were not identical. The taxonomic structure of global bacterioplankton from black water was mostly driven by the concentration of Cd2+, Co2+, humic dissolved organic carbon (DOC), fluvic DOC, total DOC, and levels of SAC340, but in white and clear waters, it was associated with the concentration of Mg2+, Na+, and the levels of pH, conductivity, and the absorbance ratio at 254 nm and 365 (Abs254/365). The taxonomic structure of transcriptionally active bacterioplankton in black water was driven by the concentration of Co2+, Pb2+, and Cd2+, but in white and clear waters, the main drivers were the concentrations of Ca2+, Mg2+, K+, protein-like DOC, fluvic DOC, and the level of Abs254/365. The inferred functional repertory from black water was associated by the concentrations of Fe3+, Mn2+, and Cd2+, while in white and clear waters, it was affected by the concentrations of Ca2+ and silicate and the levels of pH and Abs254/365. Overall, the taxonomic structures and inferred functional repertory were driven by several parameters associated with the relative abundance of the different FDOM components (i.e., the relative abundance of humic, fluvic, and protein-like DOC, in addition to the SAC340 and Abs254/365 ratios), which appear in red in the RDAs of Fig. 2.
Permutational multivariate analysis of variance (PERMANOVA) analyses (999 permutations) have shown that water type was significantly associated with the taxonomic structure of global bacterioplankton (F = 8.5, df.res = 82, R2 = 0.17, P < 0.001), transcriptionally active bacterioplankton (F = 2.9, df.res = 82, R2 = 0.06, P < 0.001), and the inferred functional repertory (F = 11.8, df.res = 82, R2 = 0.22, P < 0.001). Analyses of variance (ANOVAs) performed on overall RDA solutions were also consistently significant (Fig. 2). Betadisper permutests (1,000 permutations) showed homogenous variance between groups for the inferred functions data set (F = 0.73, df.res = 167, P = 0.45). They have shown heterogenous variance for the taxonomic structure of global (F = 13.23, df.res = 82, P < 0.001) and transcriptionally active bacterioplankton (F = 4.79, df.res = 82, P = 0.01); however, they also showed in both cases that the largest variance occurred in the group with the largest number of samples (i.e., white water). In this situation, PERMANOVA tests are known to be overly conservative, especially with unbalanced designs (35), but still showed a significant signal between water types in our case (Fig. 2; Fig. S3). In addition, PERMANOVA tests suggested that communities’ composition from water bodies characterized by different water residence time (lake versus river systems) significantly differed (all P values < 0.02) according to the Ecosystem variable from Table 1 (Table S2 and the supplemental results and discussion in the supplemental material).
TABLE 1.
Site no. | Site name | Water color | GPS |
|||
---|---|---|---|---|---|---|
South | West | Ecosystem | Sampling time | |||
1 | Rio Negro – Barcelos | Black | 0°50′50.8′′S | 62°57′40.3′′W | River | November 2018 |
2 | Rio Negro – Santo Alberto | Black | 1°23′29.8′′S | 61°59′35.3′W | River | October 2019 |
3 | Rio Negro – Anavilhanas | Black | 2°41′46.1′′S | 60°46′33.3′W | River | October 2018 |
4 | Lago do Cemeterio | Black | 3°02′16.6′′S | 60°32′42.7′′W | Lake | October 2019 |
5 | Lago Téfé | Black | 3°27′55.2′′S | 64°53′13.2′′W | Lake | November 2019 |
6 | Rio Branco | White | 1°19′05.7′′S | 61°52′34.7′′W | River | October 2019 |
7 | Lago Janauari | White | 3°12′03.4′′S | 60°03′10.1′′W | Lake | October 2018 |
8 | Lago Catalão | White | 3°09′56.4′′S | 59°54′38.4′′W | Lake | October 2018 |
9 | Lago Janauaca | White | 3°23′37.5′S | 60°19′52.6′′W | Lake | November 2018 |
10 | Rio Manacapuru | White | 3°16′16.9′′S | 60°42′03.2′′W | River | November 2018 |
11 | Lago Téfé-Solimões | White | 3°21′07.4′′S | 64°40′21.4′′W | Lake | November 2019 |
12 | Lago des Pirates | White | 3°15′19.2′′S | 64°41′44.3′W | Lake | November 2019 |
13 | Balbina Reservoir | Clear | 1°50′55.9′′S | 59°34′59.5′W | Reservoir | October 2018 |
14 | Rio Tapajós | Clear | 2°18′57.8′′S | 55°00′45.0′′W | River | October 2019 |
15 | Rio Curua-Una | Clear | 2°48′19.1′′S | 54°17′52.2′′W | River | November 2018 |
We implemented a machine-learning random forest algorithm to identify which taxa best discriminated different water colors. The out-of-bag error rate (node error rates in the trees of classification) was only 1.18% when considering the taxonomic structure of global bacterioplankton, 27.06% for transcriptionally active bacterioplankton, and 23.53% for inferred functions (Table S1). For global bacterioplankton, the amplicon sequence variants (ASVs) that best discriminated water colors were from the Proteobacteria, Alphaproteobacteria, Actinobacteria, Gammaproteobacteria, and Polynucleobacter clades (ASVs were annotated to the best taxonomic resolution possible). Black water samples were mostly characterized by an enrichment of Gammaproteobacteria, Actinobacteria, and Polynucleobacter (Fig. 3). For transcriptionally active bacterioplankton, discriminant ASVs were taxonomically more diverse and represented members of Bradyrhizobium, Actinobacteria, Ralstonia, Delftia, Geobacillus, Commamonadaceae, Alphaproteobacteria, Burkholderiales, Caulobacteraceae, and Polynucleobacter. Black water samples were mostly characterized by an increased abundance of Alphaproteobacteria, Burkholdariales, Caulobacteraceae, Actinobacteria, and Polynucleobacter. ASVs from the clade Polynucleobacter possessed the highest taxonomic resolution and were consistently associated with black water samples in global and transcriptionally active communities (Fig. 3).
The relative abundance of the different FDOM components was specific to each water type of the Amazon basin (Fig. 4a; Table 2; Fig. S5). The parallel factor analysis (PARAFAC) model showed that the FDOM was composed of three main fractions: the humic-like, fulvic-like, and protein-like fractions (Fig. 4b to d). Black water sites contained higher concentrations of DOC, with FDOM profiles significantly enriched (P = 0.006, t = 3.24, df = 13) in the humic-like fraction of greater aromaticity and molecular weight. White water sites contained FDOM characterized by a high content of fulvic-like components, while clear water sites contained more protein-like FDOM of low aromaticity and molecular weight (see Table 2 for raw data; Fig. 4; FEEM scans on Fig. S5).
TABLE 2.
Site no. | Water color | DOC concn | SAC340 | SUVA254 | Abs254/365 | Humic FDOM (%) | Fulvic FDOM (%) | Protein FDOM (%) |
---|---|---|---|---|---|---|---|---|
1 | Black | 10.9 | 39.5 | 4.5 | 3.8 | 56.7 | 29.5 | 13.8 |
2 | Black | 11.7 | 33.5 | 3.7 | 3.6 | 60.3 | 32.6 | 7.1 |
3 | Black | 11.4 | 30.5 | 3.6 | 3.8 | 47.2 | 30.3 | 22.5 |
4 | Black | 9.8 | 18.9 | 2.4 | 3.8 | 52.3 | 41.0 | 6.7 |
5 | Black | 7.1 | 29.1 | 3.4 | 4.0 | 54.2 | 37.3 | 8.5 |
6 | White | 6.0 | 19.1 | 2.2 | 4.3 | 50.8 | 39.7 | 9.5 |
7 | White | 7.1 | 19.1 | 1.4 | 2.2 | 34.7 | 36.2 | 29.1 |
8 | White | 9.1 | 11.7 | 2.1 | 6.4 | 37.5 | 45.6 | 16.8 |
9 | White | 5.7 | 20.0 | 2.6 | 4.2 | 50.6 | 40.4 | 9.0 |
10 | White | 8.0 | 22.1 | 3.0 | 4.6 | 46.2 | 41.8 | 12.0 |
11 | White | 5.7 | 20.1 | 2.6 | 3.8 | 49.0 | 39.3 | 11.7 |
12 | White | 6.5 | 14.2 | 2.2 | 4.7 | 43.9 | 45.3 | 10.8 |
13 | Clear | 4.9 | 6.1 | 1.2 | 7.1 | 30.6 | 42.3 | 27.1 |
14 | Clear | 2.7 | 8.7 | 1.9 | 5.0 | 44.8 | 38.3 | 16.9 |
15 | Clear | 4.6 | 11.7 | 1.9 | 5.3 | 35.1 | 45.0 | 20.0 |
SAC340 and SUVA254 are the specific absorbance coefficients index of relative DOM aromaticity (the higher the values, the more aromatic is the DOM). Abs254/365 is the index of molecular weight (the lower the value, the higher the molecular weight of the DOM). DOC, dissolved organic carbon; DOC concn, DOC concentration in mg/liter; DOM, dissolved organic matter; FDOM, fluorescent dissolved organic matter.
The humic-like fraction of FDOM was associated with the taxonomic structure of global and transcriptionally active bacterioplankton communities within the different waters of the Amazon basin, as the humic FDOM relative abundance isolines respected the natural clustering of the communities on the redundancy analyses of Fig. 5a and b. A coabundance Spearman correlation network-based approach enabled us to identify which taxonomic groups were significantly (Bonferroni-corrected P < 0.05) correlated with humic FDOM (Fig. 5c and d): Acetobacteraceae, Polynucleobacter, Methylocystis, and several unidentified Beta- and Gammaproteobacteria. The results for global and transcriptionally active bacterioplankton showed different correlation profiles. Indeed, for global bacterioplankton, the results suggested a direct correlation between humic FDOM concentration and 44 taxa. In contrast, for transcriptionally active communities, the influence seemed to be mostly indirect, since there were only three taxa directly correlated with humic FDOM relative abundance. However, these three taxa (two Polynucleobacter and one Acetobacteraceae ASVs) were key in the overall community transcriptionally active community; they were important interaction hubs, as their activity was strongly correlated with 93 other taxa.
We recomputed the Spearman correlation analysis (between humic FDOM and bacterial ASV) after agglomerating all ASVs at the genus level to ensure compatibility with the metagenome reference database. Several taxa that were significantly associated with humic FDOM at the ASV level were also associated with humic FDOM at the genus level (e.g., Polynucleobacter and Methylocystis). We investigated the presence/absence of the enzymes known to be part of the humic FDOM degradation pathways (Table S3) in the subset of all genera in which abundance correlated with humic FDOM (Fig. 6) and found that three genera were significantly correlated with humic FDOM degradation pathways: Polynucleobacter, Methylobacterium, and Acinetobacter. When detected, significant Spearman correlations varied between 0.30 and 0.51. These three genera possessed enzymes involved in all four main steps of the degradation of humic compounds.
Four main results suggest that among all taxa, the genus Polynucleobacter, which had a relatively low abundance (mean of 0.05% to 2.25%) in global and transcriptionally active bacterioplankton (Fig. S6), showed the strongest association to humic FDOM. First, the random forest analysis has shown that Polynucleobacter was one of the best taxa to discriminate the taxonomic structure of black water samples (rich in humic FDOM) from white/clear water samples (poor in humic FDOM), in both global and transcriptionally active bacterioplankton (Fig. 3). Second, Polynucleobacter ASVs abundances were significantly correlated with humic FDOM concentrations (Bonferroni-corrected P value < 0.05) in global and transcriptionally active bacterioplankton communities (in which they represented two of three ASVs correlated with humic FDOM) (Fig. 5c and d). Third, when correlation analyses were performed at the genus level, Polynucleobacter was the genus of which the relative abundance showed the strongest correlation (Spearman correlation = 0.30 to 0.51) to the presence of humic FDOM degradation pathways (Fig. 6). Fourth, the inferred functional repertory of Amazonian Polynucleobacter suggested that this group had the genetic potential to be involved in all four main degradation steps of humic compounds (Fig. 6): initial oxidation, funneling, O-demethylation, and ring cleavage pathways.
DISCUSSION
Water type shapes the structure of bacterial communities and their inferred functional potential.
Water type has been shown to be a major driver of the diversity, composition, and population genomics of eukaryotic biological communities in Amazonia. This has been shown in a vast array of species, including teleosts (36–39), phyto and zooplankton (40), and periphyton communities (41). Our results showed that the Amazonian bacterioplankton was similar in composition to what has been previously reported (22), with an important influence of water type on the taxonomic structure of global and transcriptionally active bacterioplankton communities and on their functional profiles (Fig. 2). At the taxonomic and functional levels, we showed that among the environmental parameters that were the most associated with community clustering (DOC quantity and type, pH, conductivity, and concentrations of Cd2+, Co2+, Mg2+, Na+, Ca2+, K+, Pb2+, Fe3+, Mn2+, and silicates), several are known to be the main parameters driving differences between water types. For instance, higher concentrations of total DOC and overall relative enrichment in humic FDOM are known to be strongly associated with black waters (18, 42). Interestingly, we observed that the clustering of functional repertory according to water types (Fig. 2c) was not as clear as structural or transcriptional activity profile clustering (Fig. 2a and b). This result might be associated with the fact that several housekeeping genes are shared by all bacterial members, thus reducing intersite variability when considering the functional repertories. However, based on the significant PERMANOVA results, it still appears that there were significant water type-specific functional profiles.
Humic FDOM and bacterioplankton communities.
Dissolved organic carbon forms the very basis of the majority of aquatic food webs and is an important food source to heterotrophs within river systems (43, 44). The bioavailability of DOC to bacterioplankton depends on the type of DOC present. It has been suggested that allochthonous humic-like DOC may be more bioavailable to bacteria than lower molecular weight DOC (45) (although research on saltwater ecosystems has shown different results [46]) and that some bacteria prefer terrestrially derived DOC over autochthonous protein-like DOC derived from algae and/or bacteria (47). Previous research has shown that bacteria are able to breakdown humic DOC, supporting the idea that this component is bioavailable to some bacterial species (45, 47). Our analysis of the FDOM fractions from the 15 sites showed that black waters have higher concentrations of DOC but also show distinct FDOM profiles comprising a significant enrichment in the humic fraction characterized by higher SAC340 and SUVA254 scores (Table 2; Fig. 4). These results support previous findings (18, 42) suggesting that naturally acidic waters show a unique FDOM signature compared to circumneutral and groundwater-fed systems. Multivariate correlation analyses also suggested that the relative abundance of humic FDOM is an important factor shaping the taxonomic structure of global and transcriptionally active bacterioplankton (Fig. 2 and 5). Furthermore, analyses at the functional level showed that there are several genera in the Amazonian bacterioplankton community of which the abundance correlates with humic FDOM and that possess genes associated with its degradation processes (Fig. 6).
Overall, in our data set, the genus Polynucleobacter has shown the strongest correlation to the relative abundance of humic FDOM (Fig. 6). The genus Polynucleobacter mostly comprises free-living aquatic bacteria and is omnipresent in freshwater lakes and ponds worldwide (48), including in several Amazonian streams (23, 24). Several studies have detected a strong correlation between the abundance of this genus and DOC concentrations (48–50). Furthermore, another study has shown that Polynucleobacter subclusters show ecological niche separation in accordance with DOM optical characteristics (51). Polynucleobacter phylotypes respond quickly to an enhanced availability of DOM (52), and some reports suggest that Polynucleobacter can feed on it (49, 50). However, experiments conducted on a population of Polynucleobacter from a humic temperate pond indicated that these bacteria mostly live as chemoorganotrophs by utilizing low-molecular-weight substrates derived from the photooxidation of humic substances (53), as suggested in other studies (50, 54). The high growth of Polynucleobacter phylotypes (55) could be favored by the chemical mineralization of low-molecular-weight substances such as acetate, a typical photolysis product (54). Overall, the genomic potential to degrade humic compounds or humic by-products appears to be strain specific in Polynucleobacter. Indeed, Hahn and colleagues (53) did not detect genes involved in humic substances degradation (i.e., mono- and dioxygenases) but found a pathway encoding the degradation of humic compounds in another strain of Polynucleobacter asymbioticus (56).
In our study, it is unlikely that the ASVs identified as Polynucleobacter originated from only one species; for instance, a recent study detected an important diversity of this genus: 60 to 90 species-like Polynucleobacter operational taxonomic units (OTUs) were detected in temperate rivers (57). Until now, the implication of members of this clade in the degradation of humic compounds in Amazonia has yet to be investigated. Although we did not measure specific humic degradation rates in this study, the set of genes that was detected in Amazonian Polynucleobacter (Fig. 6) suggests that they possess the genomic potential to be involved in the degradation of humic substances such as humic acids or their by-products via a derivative of the β-aryl ether degradation pathway for diaryl residues (58).
In addition to Polynucleobacter, our results suggest that Methylobacterium and Acinetobacter are also strongly correlated with the relative abundance of humic FDOM. They also show that these clades possess genes coding for enzymes involved in all the main steps of humic degradation (Fig. 6). Like Polynucleobacter, these genera possess the genomic potential to degrade humic substances via a derivative of the β-aryl ether degradation pathway for diaryl residues. Methylobacterium could also profit from by-products of the humic degradation; methanol, one of the main carbon sources for methylotrophic bacteria like Methylobacterium (59), is produced during the demethylation of humic substances (60). Several studies have documented the humic substances degradation potential of Acinetobacter (61, 62). The set of genes detected in this genus suggests that the humic compounds’ O-demethylation (63) and ring cleavage (64) differ from those of Polynucleobacter and Methylobacterium (see more details in the supplemental material under “Pathways of humic compounds’ degradation”). Interestingly, while Polynucleobacter is exclusively aquatic, Methylobacterium and Acinetobacter are often found in humic soils (65, 66) and could potentially be derived from terrestrial organic matter that leached into the riverine water.
Since not all enzymes required for the complete degradation of humic compounds were detected in the aforementioned genera, further studies are needed to decipher whether and exactly how these bacteria degrade humic DOC and to determine whether they are able to perform this process alone, with associations with other members of the bacterioplankton community, or with nonbacterial microbes. Although not tested here, functional compartmentalization of the community for humic degradation has already been documented (67) and could potentially involve interkingdom relationships via alternative pathways that remain to be discovered (reviewed in references 58 and 68). Finally, further studies should also assess whether bacterial DOC degradation in Amazonia is dependent on a coupling with physical photodegradation processes, as suggested by Nalven et al. (69).
Conclusions.
The results from our study show that the most important environmental factors affecting the Amazonian bacterioplankton communities within the three different water types of the Amazon basin rely on the relative abundance of the different FDOM fractions detected, especially the enrichment in humic FDOM characteristic of black water environments. Among the taxa mostly associated with the different water types, ASVs assigned to the genera Polynucleobacter, Methylobacterium, and Acinetobacter particularly stood out, as their relative abundance in global and transcriptionally active bacterioplankton was strongly associated with black water environments and correlated well with the relative abundance of humic FDOM. The inferred functions of these genera suggest that they possess genes coding for enzymes implicated in the main degradation steps of humic compounds, indicating that the role of these taxa in carbon cycling within the Amazonian basin merits further investigation.
MATERIALS AND METHODS
Sampling and processing.
Water samples were collected from 15 sites in the Brazilian Amazon basin in October and November of 2018 and 2019 (sampling times indicated in Table 1). The 15 sites were distributed over an area of >300,000 km2 along the Rio Negro, Rio Solimões, and Rio Tapajos watersheds, the three major tributaries of the Amazon River (10). GPS coordinates and a map of all sites are found in Table 1 and Fig. 7, respectively. The 15 sites include five black water, seven white water, and three clear water sites. Six replicate water samples were collected per site. Surface water samples were taken at a depth of 30 cm in 2-liter Nalgene (Thermo Fisher Scientific, Waltham, MA) bottles. Filtration was performed as previously described (70) through 0.22-μm-pore size polyethersulfone Sterivex filters (catalog no. SVGP01050, Millipore, Burlington, MA) less than 30 min after collection. Immediately upon collection, the filters were stored in 2 mL of nucleic acid preservation (NAP) buffer. NAP buffer, which preserves DNA and RNA integrity at room temperature, contains EDTA disodium salt dihydrate, sodium citrate trisodium salt dihydrate, and ammonium sulfate (71, 72). The samples were then stored in −80°C until processing. Before DNA/RNA extraction, the Sterivex filter casings were opened and processed according as previously described (70) using sterile instruments, and the filter membranes were stored in TRIzol (catalog no. 15596026, Thermo Fisher Scientific). DNA and RNA extractions were performed according to the manufacturer’s instructions for TRIzol without modification. Four blank controls (sterile filters stored in the NAP buffer) were also processed identically to all samples for DNA/RNA extractions.
Environmental variables.
A total of 34 environmental variables commonly characterized in limnological studies (73) and associated with the physicochemical differences between different water colors (10) were measured in this study (Table 2; Table S4 to S6). Temperature (°C), conductivity (μS), pH, and dissolved oxygen (%) were measured directly on-site using a YSI professional plus series multimeter (YSI Inc./Xylem Inc., Yellow Springs, OH). The concentration of DOC, dissolved metals, nutrients, free ions, and chlorophyll a were measured at the laboratory, according to the techniques described below.
Chlorophyll a and phaeopigments.
Three replicates of 250 mL of water per site were filtered using a Masterflex Easy-Load II peristaltic pump from Cole-Parmer (catalog no. HV-77200-62, Montreal, Quebec, Canada) through 0.45-μm-pore size glass fiber filters, which were then immediately stored at −80°C for chlorophyll a quantification (74). Chlorophyll a was extracted after a 24 h of incubation of filters in acetone at −20°C before measuring the absorbance on a Turner Designs (San Jose, CA) fluorimeter model 10 AU (catalog no. 1100-100). Chlorophyll and phaeopigment concentrations were then calculated according to the method described previously (74, 75).
Nutrients.
For nutrients (NO2−, NO3−, and silicates) analysis, three replicates/site of 12 mL of water were filtered through 0.45-μm-pore size glass fiber filters and stored in sterile and acid-washed (1 M HCl) 15-mL Falcon tubes (Fisher Scientific, Hampton, NH). A total of 24 μL of HgCl2 solution (1 g/100 mL) was added for conservation, before measurement on a Bran and Luebbe III (AA3) nutrient autoanalyzer as previously described (76).
(i) Ionic composition. Water samples for determination of ionic composition (Na+, Ca2+, Mg2+, and K+) were analyzed using flame atomic absorption spectroscopy (PerkinElmer model 3100, catalog no. 63929-1, PerkinElmer Inc., Woodbridge, Ontario, Canada). Cl− was measured using the colorimetric method described previously (77). Hardness was calculated from the Ca2+ and Mg2+ concentrations.
(ii) Dissolved organic carbon and dissolved metals. For analysis of dissolved metals and DOC, we filtered samples through Millipore polyvinylidene difluoride (PVDF) 0.45-mm Sartorius filters (Sartorius, Germany). Metal suites were measured by inductively coupled plasma mass spectroscopy (ICP-MS). Quality assurance and quality control (QA/QC) consisted of analysis of method blanks, laboratory duplicates, and matrix spikes conducted using certified standards. Total suspended solids was determined using gravimetric analysis as outlined in Environmental Protection Agency Method 160.2 (78). DOC was analyzed as nonpurgeable organic carbon using a total carbon analyzer (Apollo 9000 combustion TOC analyzer: Teledyne Tekmar, Mason, OH). The TOC machine was calibrated using primary standard grade potassium hydrogen phthalate (KHP), and QA/QC KHP standards were run every 10 samples. Fluorescence excitation emission (FEEM) and absorbance scans were performed using a quartz cuvette in an Aqualog fluorimeter (HORIBA Scientific, Piscataway, NJ) to determine FDOM components and characteristics. FEEM scans along with simultaneous absorbance measurements were conducted on all samples, with excitation wavelengths in 2-nm steps between 250 and 450 nm, and emission wavelengths of 250 to 620 nm. The absorbance of a blank of ultrapure water was run before each sample and automatically subtracted from each sample, with inner filter effects and first and second order Rayleigh and Raman scatter also removed. The FEEMs were analyzed in MATLAB R2014b (MathWorks, Inc.) and modeled using PARAFAC (PLS-toolbox in MATLAB; Eigenvectors Research Inc., Manson, WA). The PARAFAC model was validated following the previously described recommendations (79). No clear consistent patterns and peaks were visible in the residual plots, core consistency was 99%, and split-half analysis results were consistent with the model. Relative DOC aromaticity and molecular weight were determined using specific UV absorbance at 254 nm (SUVA254) and at 350 nm (SAC340) along with Abs254/365. The raw data of the excitation emission and the absorbance scans are available on OSF (https://osf.io/dz6vf).
The water type (Table 1) of each site was determined based on the physicochemical profiles of the sampled environments, which mostly differed in terms of pH, DOC quantity, FDOM optical characteristics, and conductivity. Black waters are usually acidic due to a high load of humic DOC and also have a very low conductivity (15). White waters have a circumneutral pH, and they usually have lower overall DOC concentrations (enriched in fulvic-like DOC) but high conductivity levels. Clear waters have a slightly acidic pH, with low DOC concentrations (enriched in protein-like DOC) and low conductivity (10). The water types are also easily recognized by their appearance: black waters are rich in tannins that stain the water like tea, white waters have a milky appearance due to the very high load of suspended sediments, while clear waters are transparent. Our measurements of environmental parameters confirmed the a priori knowledge of the water types of these sampling sites, based on the literature.
16S rRNA sequence analysis.
The taxonomic structure of global and of transcriptionally active bacterial communities were assessed using 16S rRNA approaches, respectively, conducted on DNA and RNA extracts. Retrotranscription of the RNA extractions was done using the qScript cDNA synthesis kit (catalog no. 95048-100) from QuantaBio (Beverly, MA) according to the manufacturer’s instructions. Then, the fragment V3-V4 (~500 bp) of the 16S rRNA gene was amplified from DNA and cDNA extracts by two PCRs. The first PCR was performed with primers specific to the V3-V4 region of the 16S rRNA gene (primers 347F and 803R) (80), which were tailed on the 5′ end with part of the Illumina TruSeq adaptors (oligonucleotide sequences; Illumina, Inc.). The following oligonucleotide sequences were used for amplification for the first PCR (actual primer sequences are in bold, and the rest corresponds to the adapter sequence): forward (347F), 5′-ACACTCTTTCCCTACACGACGCTCTTCCGATCTGGAGGCAGCAGTRRGGAAT-3′; and reverse primer (803R), 5′-GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTCTACCRGGGTATCTAATCC-3′. Then, a second PCR was performed to attach remaining adaptor sequence (the regions that anneal to the flowcell and library specific barcodes): generic forward second PCR primer, 5′-AATGATACGGCGACCACCGAGATCTACAC[index1]ACACTCTTTCCCTACACGAC-3′; and generic reverse second PCR primer, 5′-CAAGCAGAAGACGGCATACGAGAT[index2]GTGACTGGAGTTCAGACGTGT-3′.
The PCRs were performed according to the manufacturer’s instructions of the Qiagen Multiplex PCR kit (catalog no. 206143, Hilden, Germany) using an annealing temperature of 60°C and 30 amplification cycles. Amplified DNA was purified with AMPure beads (catalog no. A63880; Beckman Coulter, Pasadena, CA), according to the manufacturer’s instructions, to eliminate primers, dimers, proteins, and phenols. Post-PCR DNA concentration and quality were assessed on a Qubit instrument (Thermo Fisher Scientific) and by electrophoresis on 2% agarose gels. After purification, multiplex paired-end sequencing was performed on Illumina MiSeq by the Plateforme d’Analyses Génomiques at the Institut de Biologie Intégrative et des Systèmes of Université Laval.
After sequencing, 24,341,734 sequences were obtained (mean of 135,232 sequences/sample). DADA2 (81) was used for ASV picking. Quality control of reads was done with the filterAndTrim function using the following parameters: 290 for the forward read truncation length, 270 for the reverse read truncation length, 2 as the phred score threshold for total read removal, and a maximum expected error of 2 for forward reads and 3 for reverse reads. The filtered reads were then fed to the error rate learning, dereplication, and ASV inference steps (with default settings) using the functions learnErrors, derepFastq, and DADA, which are all from the DADA2 pipeline (81). The merging of sequence pairs was done using mergePairs (with default settings) also from DADA2. Chimeric sequences were removed using the removeBimeraDenovo function (default settings) with the “consensus” method parameter. Sequenced PCR negative controls were used to remove ASVs identified as potential cross contaminants using the isContaminant function from the “decontam” package in R with a default threshold of 0.4. Taxonomic annotation of amplicon sequence variants (ASVs) was performed by using blastn matches against NCBI “16S Refseq Nucleotide” database (November 2020). As the NCBI database for 16S sequences is updated more frequently than other sources (Silva, Greengenes, etc.), it matched our requirements for exhaustive information about lesser-known taxa while minimizing ambiguous annotations. Matches above 99% identity were assigned the reported taxonomic identity. Sequences with no matches above the identity threshold were assigned taxonomy using a lowest common ancestor method generated on the top 50 matches using blastn (E value cutoff of 1e−4 with default parameters). This method is closely inspired from the Lowest Common Ancestor (LCA) algorithm implemented in MEGAN (82). An analysis of Shannon diversity according to sampling depth for each sample can be found in Fig. S2 and S4. ASV tables, metadata files, and taxonomy information were incorporated into phyloseq objects (83) (phyloseq v. 1.32.0) before downstream analyses.
Shotgun metagenome functional database.
We built a reference metagenomic database to infer the functional profile of the microbial communities previously characterized using the 16S rRNA approach. The reference database was built from 90 high quality metagenomes from the Amazon River sampled in previous studies (21, 22, 24, 27, 29) that included samples from the upper and lower parts of the basin, as well as the Amazon River plume. These shotgun metagenomes were previously compiled in the Amazon River basin Microbial nonredundant Gene Catalogue (AMnrGC) by Santos et al. (21). Details on the environments where these metagenomes were collected are provided as supplemental data by Santos et al. (21). The shotgun metagenomes (see accession numbers under Data availability) were fetched from the NCBI Sequence reads archive (SRA). Trimmomatic unpaired filtered reads (default parameters) were assembled using the megahit assembler in a coassembly with the large metagenome (meta-large) parameter preset, a k-min of 27 and with iterative increments of k (k-step) of 10 (exact parameters: –presets meta-large –k-min 27 –k-step 10 –t 20 –m 400e9). A total of 3,495,176,470 reads were assembled into 21,731,098 contigs (smallest contig size: 200 bases, largest: 489,715 bases, N50: 681 bases, average contig size: 648 bases). Taxonomic annotation of the assembled contigs was performed by using blastn (November 19, 2020) matches against the NCBI “nt” database using an E value cutoff of 1e−4 (other parameters were at default values). Matches above 99% identity were assigned the reported taxonomic identity. Sequences with no matches above the identity threshold were assigned taxonomy using a lowest common ancestor method generated on the top 50 blastn matches obtained. This method is closely inspired from the LCA algorithm implemented in MEGAN (82). Functional annotation of contigs was made using the following steps. First, the predicted proteins were determined from the contig nucleotide sequences using ORFM (default parameters: –m 96). A total of 233,744,501 proteins were predicted from open reading frames (ORFs). Then, the predicted protein sequences were annotated using Diamond’s implementations for blastp (with the –sensitive parameter) against the SwissProt-UniProt database (November 24, 2020) in order to obtain gene ontology (GO) and KEGG ontology (KO) information. Finally, a database combining the taxonomic and functional information was made to be used as a reference for subsequent steps. The functional repertory (i.e., list of all KEGG pathways) in each sample was inferred from this database, at the most precise KEGG orthology level. Then, a table comprising the abundance of all pathways in each sample was produced and could be handled in the same way as standard ASV tables for downstream analyses. The functional reference database constructed is freely and publicly available on OSF (https://osf.io/dz6vf/). Details on the shotgun metagenomic database construction can be found in the flowchart of Fig. S8.
Statistical analysis.
First, we aimed to understand to what extent environmental conditions drive the phylogenetic structure of global and transcriptionally active bacterial communities, and the inferred functional repertory (i.e., the list of all KEGG pathways) of these Amazonian bacterioplankton communities. We first computed the Shannon H diversity index using plot_richness from R phyloseq package (Fig. S3) and described the relative abundance of the different bacterial phyla in stacked bar plots from ggplot2 (Fig. S1) in each water type. Then, we computed distance-based redundancy analyses (RDA) using capscale from vegan R package (84) to summarize the variation in the response variables (i.e., bacterioplankton communities) explained by environmental parameters (explanatory variables) (Fig. 2). Relative abundance (sum normalization) tables of ASVs were used as inputs for RDA analyses. We checked for multicollinearity by measuring variance inflation factors (VIF) using vif.cca from vegan (85) and then by computing stepwise variable selection using ordistep from vegan (86). The proportion of the variance explained by each variable was checked using envfit, also from vegan (86). Only environmental parameters with VIF < 10 and selected by ordistep were kept for RDAs (Fig. 2).
Then, we aimed to identify which taxa were associated with the variations between water types. We used a machine-learning approach to identify the taxa that best discriminated the different water types. To do so, we implemented Breiman’s random forest algorithm for classification (87) using randomForest (88) with ntree = 500. This algorithm splits data in a training and a test set; the training set is used to construct consensus trees of classification via bootstrapping, and the test set (≈37% of the samples) is then used to estimate the node error rates in the trees of classification (i.e., the out-of-bag estimate of error). We isolated the 40 ASVs responsible for the most important mean decrease in GINI coefficient (measure of node purity) with significant P values following Bonferroni correction. These ASVs comprised the 40 taxa that best discriminated the different water types (with the lowest classification error rate) in the random forest tests. The relative abundance of these taxa in global and transcriptionally active bacterial communities was represented on heat maps (Fig. 3). The same approach was used to detect the inferred functions that best discriminated water types (Fig. S7).
We then characterized the FDOM (i.e., assessed the presence of its different humic-, fulvic-, and protein-like fractions) in the sampled environments using principal components analysis (PCA) (Fig. 4a), PARAFAC model components (Fig. 4b), and FEEM scans (Fig. S5). To test our hypothesis concerning the potential role of humic-like materials in structuring bacterioplankton communities, we assessed the correlation between the relative abundance of humic FDOM and the structures of global and transcriptionally active bacterioplankton communities. We first fitted the humic FDOM relative abundance on RDAs (Fig. 5a and b). Then, we assessed the Spearman correlations between the relative abundances of humic FDOM and bacterial ASVs (Fig. 5c and d). Correlations > 0.5 with P value < 0.05 (after Bonferroni correction) were plotted on Cytoscape v.3.7.1. Then, we used the same approach (i.e., Spearman correlations) at a higher taxonomic level, to identify bacterial genera of which the abundance correlated significantly with humic FDOM (Fig. 6). Based on these correlation results, we also investigated whether these genera possessed pathways involved in the degradation of humic DOM. To do so, we first identified in the literature (21, 58, 68) the potential enzymes currently known to be involved in humic FDOM degradation in bacteria (see list in Table S3). Then, we searched for the presence of these enzymes in the inferred functions of the genera correlated with the relative abundance of humic FDOM. We plotted these genera and their functional pathways using a heat map (Fig. 6).
Data availability.
The data sets generated and analyzed during the current study can be found in the Sequence Read Archive (SRA) repository under BioProjectIDs PRJNA736442 and PRJNA736450. The accession numbers of the 90 metagenomes used to build the custom database are SRR1182511, SRR1182512, SRR1183643, SRR1183650, SRR1185413, SRR1185414, SRR1186214, SRR1199270, SRR1199271, SRR1199272, SRR1202081, SRR1202089, SRR1202090, SRR1202091, SRR1202095, SRR1204580, SRR1204581, SRR1205250, SRR1205251, SRR1205252, SRR1205253, SRR1209976, SRR1209977, SRR1209978, SRR1514963, SRR1515032, SRR1518285, SRR1522964, SRR1522971, SRR1522973, SRR1522974, SRR1786279, SRR1786281, SRR1786608, SRR1786616, SRR1787940, SRR1787943, SRR1788318, SRR1790487, SRR1790489, SRR1790644, SRR1790646, SRR1790647, SRR1790676, SRR1790678, SRR1790679, SRR1790680, SRR1792674, SRR1792852, SRR1793861, SRR1793862, SRR1796116, SRR1796118, SRR1796234, SRR1796236, SRR4831644, SRR4831645, SRR4831646, SRR4831647, SRR4831648, SRR4831649, SRR4831650, SRR4831651, SRR4831652, SRR4831653, SRR4831654, SRR4831655, SRR4831656, SRR4831657, SRR4831658, SRR4831659, SRR4831660, SRR4831661, SRR4831662, SRR4831663, SRR4831664, SRR4831665, SRR4831666, SRR4831667, SRR4833053, SRR4833055, SRR4833056, SRR4833057, SRR4833059, SRR4833060, SRR4833062, SRR4833064, SRR4833067, SRR4833073, SRR4833077, SRR4833080, SRR4833081, SRR4833084, SRR4833086, SRR4833087, SRR4833089, SRR5123271, SRR5123272, SRR5123273, SRR5123274, SRR5123275, SRR5123276, and SRR5123277. The scripts used for the 16S DNA/RNA sequence analysis, the input files including all metadata, the functional inference database, and the raw EEMS and absorbance scans data are all freely available on the Open Science Network platform (https://osf.io/dz6vf/). The main script for statistical analysis, an RData file containing the phyloseq objects, and the main KEGG table used were all uploaded as supplemental material.
Supplementary Material
ACKNOWLEDGMENTS
This work was supported by the National Geographic Society, Natural Sciences and Engineering Research Council of Canada (NSERC), MITACS, and Ressources Aquatiques Québec through travel and field work grants (F.-É.S.). This study was also supported in part by a NSERC Discovery grant (N.D.), the Instituto Nacional de Ciência e Tecnologia de Adaptações da Biota Aquática da Amazônia (INCT ADAPTA) project (A.L.V.), a Canada-Brazil Awards Joint Research Project (N.D. and A.L.V.), and by funds from Conselho Nacional de Desenvolvimento Cientifico e Technologico (CNPq), Fundação de Amparo à Pesquisa do Estado do Amazonas (FAPEAM), and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES).
We thank Thiago Nascimento, Reginaldo Oliveira, and Nazaré Paula for technical support with field work logistics. We thank Roxanne Dhommée for support in the molecular biology laboratory work.
We declare no conflict of interest.
F.-É.S., A.L.V., and N.D. designed the study; F.-É.S., N.L., A.H., and N.D. performed field sampling; F.-É.S., N.L., and P.-L.M. conducted RNA extractions and prepared 16S rRNA libraries; F.-É.S., N.L., and S.B. conducted the bioinformatical and statistical analyses; F.-É.S. wrote the manuscript; all authors revised the manuscript.
Footnotes
Supplemental material is available online only.
Contributor Information
François-Étienne Sylvain, Email: francois-etienne.sylvain.1@ulaval.ca.
Eva C. Sonnenschein, Swansea University
Elias Broman, Stockholms Universitet.
Pengfa Li, Nanjing Agricultural University.
REFERENCES
- 1.Mikhailov VN. 2010. Water and sediment runoff at the Amazon River mouth. Water Resour 37:145–159. doi: 10.1134/S009780781002003X. [DOI] [Google Scholar]
- 2.Maretti CC, Riveros S, Hofstede JC, Oliveira R, Charity D, Granizo S, Alvarez T, Valdujo C, Thompson PC. 2014. State of the Amazon: ecological representation in protected areas and indigenous territories. WWF Living Amazon (Global) Initiative, Brasília, Brazil. [Google Scholar]
- 3.Richey JE, Nobre C, Deser C. 1989. Amazon river discharge and climate variability—1903 to 1985. Science 246:101–103. doi: 10.1126/science.246.4926.101. [DOI] [PubMed] [Google Scholar]
- 4.Subramaniam A, Yager PL, Carpenter EJ, Mahaffey C, Björkman K, Cooley S, Kustka AB, Montoya JP, Sañudo-Wilhelmy SA, Shipe R, Capone DG. 2008. Amazon River enhances diazotrophy and carbon sequestration in the tropical North Atlantic Ocean. Proc Natl Acad Sci USA 105:10460–10465. doi: 10.1073/pnas.0710279105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Henderson PA, Crampton WRG. 1997. A comparison of fish diversity and abundance between nutrient-rich and nutrient-poor lakes in the Upper Amazon. J Trop Ecol 13:175–198. doi: 10.1017/S0266467400010403. [DOI] [Google Scholar]
- 6.Junk WJ, Soares MG, Carvalho FM. 1983. Distribution of fish species in a lake of the Amazon River floodplain near Manaus (Lago Camaleão), with special reference to extreme oxygen conditions. Amazoniana 7:397–431. [Google Scholar]
- 7.Petry P, Bayley PB, Markle DF. 2003. Relationships between fish assemblages, macrophytes and environmental gradients in the Amazon River floodplain. J Fish Biol 63:547–579. doi: 10.1046/j.1095-8649.2003.00169.x. [DOI] [Google Scholar]
- 8.Rodriguez MA, Lewis WM. 1997. Structure of fish assemblages along environmental gradients in floodplain lakes of the Orinoco River. Ecol Monogr 67:109–128. doi: 10.2307/2963507. [DOI] [Google Scholar]
- 9.Saint-Paul U, Zuanon J, Correa MAV, Garcia M, Fabré NN, Berger U, Junk WK. 2000. Fish communities in central Amazonian white- and blackwater floodplains. Env Biol Fishes 57:235–250. doi: 10.1023/A:1007699130333. [DOI] [Google Scholar]
- 10.Sioli H. 1984. The Amazon limnology and landscape ecology of a mighty tropical river and its basin. Springer, Dordrecht, The Netherlands. [Google Scholar]
- 11.Gaillardet J, Dupre B, Allegre CJ, Negrel P. 1997. Chemical and physical denudation in the Amazon River basin. Chem Geol 142:141–173. doi: 10.1016/S0009-2541(97)00074-0. [DOI] [Google Scholar]
- 12.Dal Pont G, Domingos FXV, Fernandes-de-Castilho M, Val AL. 2017. Potential of the biotic ligand model (BLM) to predict copper toxicity in the white-water of the Solimões-Amazon river. Bull Environ Contam Toxicol 98:27–32. doi: 10.1007/s00128-016-1986-1. [DOI] [PubMed] [Google Scholar]
- 13.Val AL, Almeida-Val VMF. 1995. Fishes of the Amazon and Their Environment. Springer-Verlag; Berlin Heidelberg. [Google Scholar]
- 14.Hoorn C, Wesselingh FP, ter Steege H, Bermudez MA, Mora A, Sevink J, Sanmartín I, Sanchez-Meseguer A, Anderson CL, Figueiredo JP, Jaramillo C, Riff D, Negri FR, Hooghiemstra H, Lundberg J, Stadler T, Särkinen T, Antonelli A. 2010. Amazonia through time: Andean uplift, climate change, landscape evolution, and biodiversity. Science 330:927–931. doi: 10.1126/science.1194585. [DOI] [PubMed] [Google Scholar]
- 15.Duarte RM, Smith DS, Val AL, Wood CM. 2016. Dissolved organic carbon from the upper Rio Negro protects zebrafish (Danio rerio) against ionoregulatory disturbances caused by low pH exposure. Sci Rep 6:20377. doi: 10.1038/srep20377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Coble PG. 2007. Marine optical biogeochemistry: the chemistry of ocean color. Chem Rev 107:402–418. doi: 10.1021/cr050350+. [DOI] [PubMed] [Google Scholar]
- 17.Spencer RGM, Aiken GR, Butler KD, Dornblaser MM, Striegl RG, Hernes PJ. 2009. Utilizing chromophoric dissolved organic matter measurements to derive export and reactivity of dissolved organic carbon exported to the Arctic Ocean: a case study of the Yukon River, Alaska. Geophys Res Lett 36:L06401. doi: 10.1029/2008GL036831. [DOI] [Google Scholar]
- 18.Holland A, Wood CM, Smith DS, Correia TG, Val AL. 2017. Nickel toxicity to cardinal tetra (Paracheirodon axelrodi) differs seasonally and among the black, white and clear river waters of the Amazon basin. Water Res 123:21–29. doi: 10.1016/j.watres.2017.06.044. [DOI] [PubMed] [Google Scholar]
- 19.Wissmar RC, Richey JE, Stallard RF, Edmond JM. 1981. Plankton metabolism and carbon processes in the Amazon river, its tributaries, and floodplain waters, Peru-Brazil, May-June 1977. Ecology 62:1622–1633. doi: 10.2307/1941517. [DOI] [Google Scholar]
- 20.Mayorga E, Aufdenkampe AK, Masiello CA, Krusche AV, Hedges JI, Quay PD, Richey JE, Brown TA. 2005. Young organic matter as a source of carbon dioxide outgassing from Amazonian rivers. Nature 436:538–541. doi: 10.1038/nature03880. [DOI] [PubMed] [Google Scholar]
- 21.Santos CD, Sarmento H, de Miranda FP, Henrique-Silva F, Logares R. 2020. Uncovering the genomic potential of the Amazon River microbiome to degrade rainforest organic matter. Microbiome 8:151. doi: 10.1186/s40168-020-00930-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Santos-Júnior CD, Kishi LT, Toyama D, Soares-Costa A, Oliveira TC, de Miranda FP, Henrique-Silva F. 2017. Metagenome sequencing of prokaryotic microbiota collected from rivers in the upper Amazon basin. Genome Announc 5:e01450-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ghai R, Rodŕíguez-Valera F, McMahon KD, Toyama D, Rinke R, Cristina Souza de Oliveira T, Wagner Garcia J, Pellon de Miranda F, Henrique-Silva F. 2011. Metagenomics of the water column in the pristine upper course of the Amazon River. PLoS One 6:e23785. doi: 10.1371/journal.pone.0023785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Santos CD, Toyama D, de Oliveira TCS, de Miranda FP, Henrique-Silva F. 2019. Flood season microbiota from the Amazon basin lakes: analysis with metagenome sequencing. Microbiol Res Ann 8:e00229-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Satinsky BM, Smith CB, Sharma S, Landa M, Medeiros PM, Coles VJ, Yager PL, Crump BC, Moran MA. 2017. Expression patterns of elemental cycling genes in the Amazon River plume. ISME J 11:1852–1864. doi: 10.1038/ismej.2017.46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Satinsky BM, Smith CB, Sharma S, Ward ND, Krusche AV, Richey JE, Yager PL, Crump BC, Moran MA. 2017. Patterns of bacterial and archaeal gene expression through the lower Amazon River. Front Mar Sci 4. doi: 10.3389/fmars.2017.00253. [DOI] [Google Scholar]
- 27.Satinsky BM, Zielinski BL, Doherty M, Smith CB, Sharma S, Paul JH, Crump BC, Moran MA. 2014. The Amazon continuum dataset: quantitative metagenomic and metatranscriptomic inventories of the Amazon River plume, June 2010. Microbiome 2:17. doi: 10.1186/2049-2618-2-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Sylvain FÉ, Holland A, Bouslama S, Audet-Gilbert É, Lavoie C, Val AL, Derome N. 2020. Fish skin and gut microbiomes show contrasting signatures of host species and habitat. Appl Environ Microbiol 86:e00789-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Toyama D, Kishi LT, Santos-Júnior CD, Soares-Costa A, de Oliveira TC, de Miranda FP, Henrique-Silva F. 2016. Metagenomics analysis of microorganisms in freshwater lakes of the Amazon basin. Microbiol Res Ann 4:e01440-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kritzberg E, Langenheder S, Lindström ES. 2006. Influence of dissolved organic matter source on lake bacterioplankton structure and function: implication for seasonal dynamics of community composition. FEMS Microbiol Ecol 56:406–417. doi: 10.1111/j.1574-6941.2006.00084.x. [DOI] [PubMed] [Google Scholar]
- 31.Gómez-Consarnau L, Lindh M, Gasol JM, Pinhassi J. 2012. Structuring of bacterioplankton communities by specific dissolved organic compounds. Environ Microbiol 14:2361–2378. doi: 10.1111/j.1462-2920.2012.02804.x. [DOI] [PubMed] [Google Scholar]
- 32.Figueroa D, Rowe OF, Paczkowska J, Legrand C, Andersson A. 2016. Allochthonous carbon—a major driver of bacterioplankton production in the subarctic northern Baltic Sea. Microb Ecol 71:789–801. doi: 10.1007/s00248-015-0714-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Araujo JDA, Ghelfi A, Val AL. 2017. Triportheus albus cope, 1872 in the blackwater, clearwater, and whitewater of the Amazon: a case of phenotypic plasticity? Front Genet 8:114. doi: 10.3389/fgene.2017.00114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Morris C, Val AL, Brauner CJ, Wood CM. 2021. The physiology of fish in acidic waters rich in dissolved organic carbon, with specific reference to the Amazon basin: ionoregulation, acid-base regulation, ammonia excretion, and metal toxicity. J Exp Zool A Ecol Integr Physiol 335:843–863. doi: 10.1002/jez.2468. [DOI] [PubMed] [Google Scholar]
- 35.Anderson MJ, Walsh DCI. 2013. PERMANOVA, ANOSIM, and the Mantel test in the face of heterogeneous dispersions: what null hypothesis are you testing? Ecol Monogr 83:557–574. doi: 10.1890/12-2010.1. [DOI] [Google Scholar]
- 36.Beheregaray LB, Cooke GM, Chao NL, Landguth EL. 2015. Ecological speciation in the tropics: insights from comparative genetic studies in Amazonia. Front Genet 5:477. doi: 10.3389/fgene.2014.00477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Bogotá-Gregory JD, Lima FCT, Correa SB, Silva-Oliveira C, Jenkins DG, Ribeiro FR, Lovejoy NR, Reis RE, Crampton WGR. 2020. Biogeochemical water type influences community composition, species richness, and biomass in megadiverse Amazonian fish assemblages. Sci Rep 10:15349. doi: 10.1038/s41598-020-72349-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cooke GM, Landguth EL, Beheregaray LB. 2014. Riverscape genetics identifies replicated ecological divergence across an Amazonian ecotone. Evolution 68:1947–1960. doi: 10.1111/evo.12410. [DOI] [PubMed] [Google Scholar]
- 39.Van der Sleen A, Albert JS. 2018. Field guide to the fishes of the Amazon. In Orinoco and Guianas. Oxford University Press, Oxford, UK. [Google Scholar]
- 40.Sahoo PK, Guimarães JTF, Souza-Filho PWM, Bozelli RL, de Araujo LR, de Souza Menezes R, Lopes PM, da Silva MS, Rodrigues TM, da Costa MF, Dall’Agnol R. 2017. Limnological characteristics and planktonic diversity of five tropical upland lakes from Brazilian Amazon. Int J Limnol 53:467–483. doi: 10.1051/limn/2017026. [DOI] [Google Scholar]
- 41.Putz R. 1997. Periphyton communities in Amazonian black- and whitewater habitats: community structure, biomass and productivity. Aquatic Science 59:74–93. doi: 10.1007/BF02522552. [DOI] [Google Scholar]
- 42.Holland A, Stauber J, Wood CM, Trenfield M, Jolley DF. 2018. Dissolved organic matter signatures vary between naturally acidic, circumneutral and groundwater-fed freshwaters in Australia. Water Res 137:184–192. doi: 10.1016/j.watres.2018.02.043. [DOI] [PubMed] [Google Scholar]
- 43.Holland A, Duivenvoorden LJ, Kinnear SHW. 2012. Naturally acidic waterways: conceptual food webs for better management and understanding of ecological functioning. Aquatic Conserv Mar Freshw Ecosyst 22:836–847. doi: 10.1002/aqc.2267. [DOI] [Google Scholar]
- 44.Holland A, McInerney PJ, Shackleton ME, Rees GN, Bond NR, Silvester E. 2020. Dissolved organic matter and metabolic dynamics in dryland lowland rivers. Spectrochim Acta Part A Mol Biomol Spect 229:117871. doi: 10.1016/j.saa.2019.117871. [DOI] [PubMed] [Google Scholar]
- 45.Yamashita Y, Fichot CG, Shen Y, Jaffe R, Benner R. 2015. Linkages among fluorescent dissolved organic matter, dissolved amino acids and lignin-derived phenols in a river-influenced ocean margin. Front Mar Sci 2. doi: 10.3389/fmars.2015.00092. [DOI] [Google Scholar]
- 46.Benner R, Amon RMW. 2015. The size-reactivity continuum of major bioelements in the ocean. Annu Rev Mar Sci 7:185–205. doi: 10.1146/annurev-marine-010213-135126. [DOI] [PubMed] [Google Scholar]
- 47.Roiha T, Peura S, Cusson M, Rautio M. 2016. Allochthonous carbon is a major regulator to bacterial growth and community composition in subarctic freshwaters. Sci Rep 6:34456. doi: 10.1038/srep34456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Jezberova J, Jezbera J, Brandt U, Lindstrom ES, Langenheder S, Hahn MW. 2010. Ubiquity of Polynucleobacter necessarius ssp. asymbioticus in lentic freshwater habitats of a heterogenous 2000 km2 area. Environ Microbiol 12:658–669. doi: 10.1111/j.1462-2920.2009.02106.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Broman E, Asmala E, Carstensen J, Pinhassi J, Dopson M. 2019. Distinct coastal microbiome populations associated with autochthonous- and allochthonous-like dissolved organic matter. Front Microb 10:2579. doi: 10.3389/fmicb.2019.02579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Watanabe K, Komatsu N, Ishii Y, Negishi M. 2009. Effective isolation of bacterioplankton genus Polynucleobacter from freshwater environments grown on photochemically degraded dissolved organic matter. FEMS Microbiol Ecol 67:57–68. doi: 10.1111/j.1574-6941.2008.00606.x. [DOI] [PubMed] [Google Scholar]
- 51.Watanabe K, Komatsu N, Kitamura T, Ishii Y, Park HD, Miyata R, Noda N, Sekiguchi Y, Satou T, Watanabe M, Yamamura S, Imai A, Hayashi S. 2012. Ecological niche separation in the Polynucleobacter subclusters linked to quality of dissolved organic matter: a demonstration using a high sensitivity cultivation-based approach. Environ Microbiol 14:2511–2525. doi: 10.1111/j.1462-2920.2012.02815.x. [DOI] [PubMed] [Google Scholar]
- 52.Burkert U, Warnecke F, Babenzien D, Zwirnmann E, Pernthaler J. 2003. Members of a readily enriched betaproteobacterial clade are common in surface waters of a humic lake. Appl Environ Microbiol 69:6550–6559. doi: 10.1128/AEM.69.11.6550-6559.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hahn MW, Scheuerl T, Jezberová J, Koll U, Jezbera J, Šimek K, Vannini C, Petroni G, Wu QL. 2012. The passive yet successful way of planktonic life: genomic and experimental analysis of the ecology of a free-living Polynucleobacter population. PLoS One 7:e32772. doi: 10.1371/journal.pone.0032772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Buck U, Grossart HP, Amann R, Pernthaler J. 2009. Substrate incorporation patterns of bacterioplankton populations in stratified and mixed waters of a humic lake. Environ Microbiol 11:1854–1865. doi: 10.1111/j.1462-2920.2009.01910.x. [DOI] [PubMed] [Google Scholar]
- 55.Grossart HP, Jezbera J, Hornak K, Hutalle KML, Buck U, Simek K. 2008. Top-down and bottom-up induced shifts in bacterial abundance, production and community composition in an experimentally divided humic lake. Environ Microbiol 10:635–652. doi: 10.1111/j.1462-2920.2007.01487.x. [DOI] [PubMed] [Google Scholar]
- 56.Hoetzinger M, Schmidt J, Jezberová J, Koll U, Hahn MW. 2017. Microdiversification of a pelagic Polynucleobacter species is mainly driven by acquisition of genomic islands from a partially interspecific gene pool. Appl Env Microbiol 83:e02266-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Hahn MW, Huemer A, Pitt A, Hoetzinger M. 2021. Opening a next-generation black box: ecological trends for hundreds of species-like taxa uncovered within a single bacterial >99% 16S rRNA operational taxonomic unit. Mol Ecol Resour 21:2471–2485. doi: 10.1111/1755-0998.13444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kamimura N, Takahashi K, Mori K, Araki T, Fujita M, Higuchi Y, Masai E. 2017. Bacterial catabolism of lignin-derived aromatics: new findings in a recent decade: update on bacterial lignin catabolism. Environ Microbiol Rep 9:679–705. doi: 10.1111/1758-2229.12597. [DOI] [PubMed] [Google Scholar]
- 59.Stein LY. 2018. Proteobacterial methanotrophs, methylotrophs, and nitrogen, p 57–66. In Kalyuzhnaya M, Xing XH (ed), Methane biocatalysis: paving the way to sustainability. Springer, Cham. [Google Scholar]
- 60.Janusz G, Pawlik A, Sulej J, Swiderska-Burek U, Jarosz-Wilkolazka A, Paszczynski A. 2017. Lignin degradation: microorganisms, enzymes involved, genomes analysis and evolution. FEMS Microbiol Rev 41:941–962. doi: 10.1093/femsre/fux049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Tikhonov VV, Yakushev AV, Zavgorodnyaya YA, Byzov BA, Demin VV. 2010. Effects of humic acids on the growth of bacteria. Eurasian Soil Sc 43:305–313. doi: 10.1134/S1064229310030087. [DOI] [Google Scholar]
- 62.Oya S, Tonegawa S, Nakagawa H, Habe H, Furuya T. 2022. Isolation and characterization of microorganisms capable of cleaving the ether bond of 2-phenoxyacetophenone. Sci Rep 12:2874. doi: 10.1038/s41598-022-06816-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Segura A, Bünz PV, D’Argenio DA, Ornston LN. 1999. Genetic analysis of a chromosomal region containing vanA and vanB, genes required for conversion of either ferulate or vanillate to protocatechuate in Acinetobacter. J Bacteriol 181:3494–3504. doi: 10.1128/JB.181.11.3494-3504.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Vetting MW, D’Argenio DA, Ornston LN, Ohlendorf DH. 2000. Structure of Acinetobacter strain ADP1 protocatechuate 3,4-dioxygenase at 2.2 A resolution: implications for the mechanism of an intradiol dioxygenase. Biochemistry 39:7943–7955. doi: 10.1021/bi000151e. [DOI] [PubMed] [Google Scholar]
- 65.Jan U, Feiwen R, Masood J, Chun SC. 2020. Characterization of soil microorganism from humus and indigenous microorganism amendments. Mycobiology 48:392–398. doi: 10.1080/12298093.2020.1816154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Ahmad S, Daur I, Al-Solaimani SG, Mahmood S, Bakhashwain AA, Madkour MH, Yasir M. 2016. Effect of rhizobacteria inoculation and humic acid application on canola (Brassica napus L.) crop. Pak J Bot 48:2109–2120. [Google Scholar]
- 67.Gilmore SP, Lankiewicz TS, Wilken SE, Brown JL, Sexton JA, Henske JK, Theodorou MK, Valentine DL, O’Malley MA. 2019. Top-down enrichment guides in formation of synthetic microbial consortia for biomass degradation. ACS Synth Biol 8:2174–2185. doi: 10.1021/acssynbio.9b00271. [DOI] [PubMed] [Google Scholar]
- 68.de Gonzalo G, Colpa DI, Habib MHM, Fraaije MW. 2016. Bacterial enzymes involved in lignin degradation. J Biotechnol 236:110–119. doi: 10.1016/j.jbiotec.2016.08.011. [DOI] [PubMed] [Google Scholar]
- 69.Nalven SG, Ward CP, Payet JP, Cory RM, Kling GW, Sharpton TJ, Sullivan CM, Crump BC. 2020. Experimental metatranscriptomics reveals the costs and benefits of dissolved organic matter photo-alteration for freshwater microbes. Environ Microbiol 22:3505–3521. doi: 10.1111/1462-2920.15121. [DOI] [PubMed] [Google Scholar]
- 70.Cruaud P, Vigneron A, Fradette MS, Charette SJ, Rodriguez MJ, Dorea CC, Culley AI. 2017. Open the SterivexTM casing: an easy and effective way to improve DNA extraction yields. Limnol Oceanogr Methods 15:1015–1020. doi: 10.1002/lom3.10221. [DOI] [Google Scholar]
- 71.Camacho-Sanchez M, Burraco P, Gomez-Mestre I, Leonard JA. 2013. Preservation of RNA and DNA from mammal samples under field conditions. Mol Ecol Resour 13:663–673. doi: 10.1111/1755-0998.12108. [DOI] [PubMed] [Google Scholar]
- 72.Menke S, Gillingham MAF, Wilhelm K, Sommer S. 2017. Home-made cost effective preservation buffer is a better alternative to commercial preservation methods for microbiome research. Front Microbiol 8:102. doi: 10.3389/fmicb.2017.00102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Cohen RS, Gray DK, Vucic JM, Murdoch AD, Sharma S. 2021. Environmental variables associated with littoral macroinvertebrate community composition in Arctic lakes. Can J Fish Aquat Sci 78:110–123. doi: 10.1139/cjfas-2020-0065. [DOI] [Google Scholar]
- 74.Parsons R, Maita Y, Lalli M. 1984. A manual of chemical and biological methods for seawater analysis, p 173. Pergamon Press, Oxford. [Google Scholar]
- 75.Holm-Hansen O, Lorenzen CJ, Holmes RW, Strickland JDH. 1965. Fluorometric determination of chlorophyll. ICES J Mar Sci 30:3–15. doi: 10.1093/icesjms/30.1.3. [DOI] [Google Scholar]
- 76.Hansen HP, Koroleff F, Grasshoff K, Kremling K, Ehrhardt M. 1999. Determination of nutrients, p 161–228. In Methods of seawater analyses. Wiley-VCH, Weinheim, Germany. [Google Scholar]
- 77.Clarke FE. 1950. Determination of chloride in water improved colorimetric and titrimetric methods. Anal Chem 22:553–555. doi: 10.1021/ac60040a011. [DOI] [Google Scholar]
- 78.U.S. Environmental Protection Agency. 1983. Methods for chemical analysis of water and wastes, EPA-600/4-79-020, USEPA, method 160.2.
- 79.Murphy KR, Stedmon CA, Graeber D, Bro R. 2013. Fluorescence spectroscopy and multi-way techniques: PARAFAC. Anal Methods 5:6557–6566. doi: 10.1039/c3ay41160e. [DOI] [Google Scholar]
- 80.Nossa CW, Oberdorf WE, Yang L, Aas JA, Paster BJ, Desantis TZ, Brodie EL, Malamud D, Poles MA, Pei Z. 2010. Design of 16S rRNA gene primers for 454 pyrosequencing of the human foregut microbiome. World J Gastroenterol 16:4135–4144. doi: 10.3748/wjg.v16.i33.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. 2016. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods 13:581–583. doi: 10.1038/nmeth.3869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, Ruscheweyh HJ, Tappu R. 2016. MEGAN community edition—interactive exploration and analysis of large-scale microbiome sequencing data. PLoS Comput Biol 12:e1004957. doi: 10.1371/journal.pcbi.1004957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.McMurdie PJ, Holmes S. 2013. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One 8:e61217. doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Legendre P, Anderson MJ. 1999. Distance-based redundancy analysis: testing multispecies responses in multifactorial ecological experiments. Ecol Monogr 69:1–24. doi: 10.1890/0012-9615(1999)069[0001:DBRATM]2.0.CO;2. [DOI] [Google Scholar]
- 85.Blanchet FG, Legendre P, Borcard D. 2008. Forward selection of explanatory variables. Ecology 89:2623–2632. doi: 10.1890/07-0986.1. [DOI] [PubMed] [Google Scholar]
- 86.Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHM, Wagner H. 2019. vegan: community ecology package. R package version 2.5- 6. https://CRAN.R-project.org/package=vegan. Accessed 15 March 2021.
- 87.Breiman L. 2001. Random forests. Mach Learn 45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
- 88.Liaw A, Wiener M. 2002. Classification and regression by randomForest. R News 2:18–22. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data sets generated and analyzed during the current study can be found in the Sequence Read Archive (SRA) repository under BioProjectIDs PRJNA736442 and PRJNA736450. The accession numbers of the 90 metagenomes used to build the custom database are SRR1182511, SRR1182512, SRR1183643, SRR1183650, SRR1185413, SRR1185414, SRR1186214, SRR1199270, SRR1199271, SRR1199272, SRR1202081, SRR1202089, SRR1202090, SRR1202091, SRR1202095, SRR1204580, SRR1204581, SRR1205250, SRR1205251, SRR1205252, SRR1205253, SRR1209976, SRR1209977, SRR1209978, SRR1514963, SRR1515032, SRR1518285, SRR1522964, SRR1522971, SRR1522973, SRR1522974, SRR1786279, SRR1786281, SRR1786608, SRR1786616, SRR1787940, SRR1787943, SRR1788318, SRR1790487, SRR1790489, SRR1790644, SRR1790646, SRR1790647, SRR1790676, SRR1790678, SRR1790679, SRR1790680, SRR1792674, SRR1792852, SRR1793861, SRR1793862, SRR1796116, SRR1796118, SRR1796234, SRR1796236, SRR4831644, SRR4831645, SRR4831646, SRR4831647, SRR4831648, SRR4831649, SRR4831650, SRR4831651, SRR4831652, SRR4831653, SRR4831654, SRR4831655, SRR4831656, SRR4831657, SRR4831658, SRR4831659, SRR4831660, SRR4831661, SRR4831662, SRR4831663, SRR4831664, SRR4831665, SRR4831666, SRR4831667, SRR4833053, SRR4833055, SRR4833056, SRR4833057, SRR4833059, SRR4833060, SRR4833062, SRR4833064, SRR4833067, SRR4833073, SRR4833077, SRR4833080, SRR4833081, SRR4833084, SRR4833086, SRR4833087, SRR4833089, SRR5123271, SRR5123272, SRR5123273, SRR5123274, SRR5123275, SRR5123276, and SRR5123277. The scripts used for the 16S DNA/RNA sequence analysis, the input files including all metadata, the functional inference database, and the raw EEMS and absorbance scans data are all freely available on the Open Science Network platform (https://osf.io/dz6vf/). The main script for statistical analysis, an RData file containing the phyloseq objects, and the main KEGG table used were all uploaded as supplemental material.