Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2017 Aug 24;7:9343. doi: 10.1038/s41598-017-09303-0

The impact of immigration on microbial community composition in full-scale anaerobic digesters

Rasmus H Kirkegaard 1, Simon J McIlroy 1, Jannie M Kristensen 1, Marta Nierychlo 1, Søren M Karst 1, Morten S Dueholm 1, Mads Albertsen 1, Per H Nielsen 1,
PMCID: PMC5571154  PMID: 28839166

Abstract

Anaerobic digestion is widely applied to treat organic waste at wastewater treatment plants. Characterisation of the underlying microbiology represents a source of information to develop strategies for improved operation. Hence, we investigated microbial communities of thirty-two full-scale anaerobic digesters over a six-year period using 16S rRNA gene amplicon sequencing. Sampling of the sludge fed into these systems revealed that several of the most abundant populations were likely inactive and immigrating with the influent. This observation indicates that a failure to consider immigration will interfere with correlation analysis and give an inaccurate picture of the growing microbial community. Furthermore, several abundant OTUs could not be classified to genus level with commonly applied taxonomies, making inference of their function unreliable and comparison to other studies problematic. As such, the existing MiDAS taxonomy was updated to include these abundant phylotypes. The communities of individual digesters surveyed were remarkably similar – with only 300 OTUs representing 80% of the total reads across all plants, and 15% of these identified as non-growing and possibly inactive immigrating microbes. By identifying abundant and growing taxa in anaerobic digestion, this study paves the way for targeted characterisation of the process-important organisms towards an in-depth understanding of the microbiology.

Introduction

Biogas production from the anaerobic digestion of organic waste is increasingly being implemented as an alternative renewable energy source. This change is driven by the need for clean energy as well as improved economy of wastewater treatment plants by making them into net energy producers1. Methane gas production from organics is mediated by the tightly coupled synergistic activities of complex microbial communities and is essentially covered by four sequential stages: hydrolysis, fermentation, acetogenesis and methanogenesis. The anaerobic digestion process is generally robust, but occasionally reactors experience operational problems such as foaming events and periods of low efficiency or failure24. A better understanding of the underlying microbiology will facilitate optimisation of the biological processes, and consequently, the microbiology has been widely studied using various approaches with both lab-scale and full-scale systems57.

Understanding the ecology of anaerobic digesters, and how it relates to system function, first requires the identification of the active and abundant microorganisms and subsequent linkage of their identity to their functional roles8. Several 16S rRNA gene amplicon based studies have shown that there appears to be a set of abundant microorganisms, common to similarly operated anaerobic digesters, that are stably present over time6, 7, 9, 10. This is also known for other biological processes, such as wastewater treatment plants11 and the human digestive system12. Furthermore, other studies have revealed process temperature, substrate composition, and ammonia concentrations as important factors in the shaping of the microbial community composition. However, in anaerobic digesters a large part of the observable microbial community might originate from dead or inactive cells arriving with the influent biomass from which DNA persists. Hence, the observed microbial community dynamics will not truly reflect the changes in process performance or stability. This can lead to spurious correlations and false conclusions11. In an attempt to mitigate the problem of DNA from inactive cells influencing microbial analysis, molecular techniques have been developed to remove or bind the extracellular DNA prior to cell lysis13, 14. However, the complex matrix of anaerobic digester sludge samples will likely lead to problems with unwanted chemical reactions and limited penetration of the light used in the process14. Hence, an alternative approach is to monitor the microbial composition in the influent to identify organisms whose abundance is likely maintained by immigration11, 1517.

Associating phylogeny with function is essential for understanding the ecology of these systems. However, a substantial proportion (67%9 to 73%7) of sequences obtained in previous 16S rRNA gene amplicon surveys of anaerobic digesters were not classified to the genus level with the commonly applied taxonomies: such as SILVA, RDP and Greengenes1820. Furthermore, biases associated with DNA extraction, primer coverage and differences in the taxonomy applied for classification21, 22, greatly hampers cross-study comparisons. Hence, only by using well-defined standard methods and the same curated database for taxonomic classification across the field, it is possible to make meaningful cross-study comparisons and robust biological conclusions23. Standardisation has been established for activated sludge from wastewater treatment plants with the MiDAS protocols21 and the curated MiDAS taxonomy22, but is currently lacking for anaerobic digestion.

Another approach to study community composition in anaerobic digesters is metagenomic sequencing2428. However, this approach is currently hampered by the limited number of genomes in the reference databases, which results in a poor classification of reads and contigs27, and there is thus a pressing need for populating the genome databases with the relevant genomes before such an approach is truly meaningful28, 29.

The aim of this study was to identify the abundant and growing organisms in full-scale anaerobic digester systems, fed waste activated sludge, using 16S rRNA gene amplicon sequencing. The survey included 32 Danish full-scale reactors located at 20 wastewater treatment plants over a six-year period (>300 samples), including both mesophilic and thermophilic reactors, and represents the most comprehensive study of full-scale systems to date. Comparison of read abundances in the digester sludge, and the corresponding influent primary and surplus sludge, was used to identify immigrating populations and to provide an assessment of the growing populations in the anaerobic digesters. Furthermore, having identified the abundant populations present in the anaerobic digesters, we have performed a manual curation of the SILVA taxonomy for the most abundant operational taxonomic units (OTUs), many of which were poorly classified with existing databases. By providing genus level classifications for all abundant taxa, researchers in the field will be able to link the identity with the accumulated knowledge regarding their population dynamics and ecophysiology. The comprehensive list of the microorganisms enriched in anaerobic digesters will also form the foundation for prioritising the effort in getting high quality genomes from key members of the microbial communities from metagenomic binning.

Results

Characteristic of the sampled anaerobic digesters

More than 300 samples were collected from 32 full-scale digesters at 20 wastewater treatment plants in Denmark over a period of 6 years (2011–2016). The sampled reactors represent mesophilic (~37 °C) and thermophilic (~55 °C) processes running mainly on primary sludge and surplus activated sludge (approx. 50:50% in relative amount of organic matter). The reactors have reported ammonium levels in the range of 500–3000 mg/L, acetate concentrations of 0.5–20 mmol/L, alkalinity levels of 0.01–0.5 mmol/L, pH of 7.1–8.5, reactor volumes of 1300–6000 m3 and sludge retention times of 10–55 days. The plants in Fredericia and Næstved have mesophilic reactors with a thermal hydrolysis pre-treatment (THP), the type of pre-treatment in both cases are CambiTHPTM installations.

Community structure: Archaea

The archaea were targeted with archaea-specific primers amplifying the V3-5 regions of the 16 S rRNA gene. The resulting quality filtered sequencing data were subsampled to 10 000 reads per sample, giving more than 3 million reads in total. There were 169 OTUs (97% sequence identity), spanning 8 phyla, which constituted at least 0.1% in a single sample. Principal component analysis revealed that the thermophilic and mesophilic reactors formed very distinct archaeal communities (Fig. 1A). Euryarchaeota was by far the most dominant archaeal phylum making up 93–100% of the archaeal reads in each sample (Fig. 2A). The acetoclastic methanogenic genus Methanosaeta dominated the sequencing libraries of the mesophilic reactors (60–80% of the reads), followed by a variety of hydrogenoclastic methanogenic genera such as Methanolinea, Methanospirillum, Methanobrevibacter as well as the WCHA1-57, which was recently renamed Candidatus Methanofastidiosa30 (Fig. 2B). The mesophilic reactors with thermal hydrolysis pre-treatment were also dominated by Methanosaeta (83–87%). The underlying OTUs for the most abundant genera were the same for the different plants (Fig. S1). For the genus Methanosaeta, there was one dominant OTU (25–33% relative read abundance) and six additional OTUs in read abundances of each 2–15% in all mesophilic reactors, including those with THP, indicating a substantial diversity within the genus.

Figure 1.

Figure 1

Principal component analysis of the microbial communities in ADs, highlighting samples by process type information (Inline graphic mesophilic, Inline graphic thermophilic, Inline graphic mesophilic with thermal hydrolysis pretreatment (THP)). (A) the separation of archaeal communities coloured by process type, (B) the separation of bacterial communities coloured by process type, (C) The bacterial communities of mesophilic plants coloured and labelled by plant location, (D) The bacterial communities of thermophilic plants coloured and labelled by plant location.

Figure 2.

Figure 2

(A) Heatmap of the 5 most abundant archaeal phyla. (B) Heatmap of the 20 most abundant archaeal genera in the anaerobic digesters. When no genus level classification is available the OTU number is given. The phylum level classification is shown for all genera. Data based on 32 AD reactors (1–4 per plant) analysed 2–23 times. The mean read abundance is shown for each plant. The taxa are sorted by mean read abundance across the plants at the respective phylogenetic level (phylum, genus).

The thermophilic reactors were dominated by the hydrogenoclastic methanogenic genus Methanothermobacter (64–77% of the reads), followed by the more versatile Methanosarcina (13–33% of the reads). The latter is known to perform both acetoclastic and hydrogenoclastic methanogenesis. Methanobrevibacter was the third most common methanogen and along with Methanosaeta, the only abundant archaeon shared with the mesophilic reactors. However, it was not found in mesophilic reactors with thermal hydrolysis pre-treatment. The underlying OTUs for the two abundant genera were the same for the different plants (Fig. S1). For Methanothermobacter, there was one dominant OTU (37–48% relative read abundance) and two less abundant OTUs (6–20%). For Methanosarcina, there was one dominant OTU (10–25%) and one less abundant OTU (3–6%). The archaeal community of the thermophilic samples clearly had a lower diversity than the mesophilic samples (Fig. 2B & See diversity metrics in Fig. S2).

Community structure: Bacteria

The bacteria were targeted with bacteria-specific primers amplifying the V1-3 regions of the 16 S rRNA gene. The resulting quality filtered sequencing data were subsampled to 10 000 reads per sample giving more than 3 million reads in total. The resulting 5614 OTUs, each making up at least 0.1% of the reads in at least one sample, covered 46 phyla. Principal component analysis revealed that the thermophilic and mesophilic reactors formed very distinct bacterial communities with a separate cluster for reactors with thermal hydrolysis pre-treatment (Fig. 1B). Principal component analysis of the samples within the mesophilic and thermophilic clusters (Fig. 1C,D) shows that the overall structure of the microbial communities overlap between some plants during the period. The dominant phyla were Firmicutes, Proteobacteria, Actinobacteria, Bacteriodetes and Chloroflexi (Fig. 3A). Along with the more “well-known” phyla, a few candidate phyla, such as Fermentibacteria (Hyd-24–12), Aminicenantes (OP8), and Atribacteria (OP9), were also observed. Most mesophilic reactors were dominated by the MiDAS genus T78 belonging to Chloroflexi, followed by the genera Tetrasphaera and Candidatus Microthrix (Fig. 3B). The thermophilic reactors also had a high read abundance of Tetrasphaera and Ca. Microthrix. However, the mesophilic reactors with thermal hydrolysis pre-treatment did not have a notable read abundance of either of these two genera despite them being present in the surplus sludge (Fig. 3B). This suggests that these genera do not grow in mesophilic digesters, but are coming in with the feed. Supporting this idea is that the underlying OTUs for the most abundant genera were the same for the different plants (Fig. S3). The dominant OTUs in the digesters were generally shared among the plants with similar operation (Fig. S4) and as few as 300 OTUs account for 80% of the reads, which is a metric sometimes defined as the “abundant core” (Fig. S5)11.

Figure 3.

Figure 3

(A) Heatmap of the 20 most abundant bacterial phyla. (B) Heatmap of the 20 most abundant bacterial genera in the anaerobic digesters. When no genus level classification is available the OTU number is given. The phylum level classification is shown for all genera. Data based on 32 AD reactors (1–4 per plant) analysed 3–30 times. The mean read abundance is shown for each plant. The taxa are sorted by mean read abundance across the plants at the respective phylogenetic level (phylum, genus).

Community composition of primary and surplus sludge

The feed for all digesters, except Fredericia, was a mixture of primary sludge settled from influent wastewater and surplus sludge harvested from the activated sludge plant, approximately in an organic mass ratio of 50:50. Fredericia had only surplus sludge. The bacterial community composition was analysed in 121 samples of primary sludge from 14 WWTPs and 137 activated sludge samples from all 24 WWTPs. The overall community structure showed clear clustering of the different sample types, separating primary sludge, surplus sludge, mesophilic, thermophilic and THP reactors (Fig. S6), indicating noticeably different communities. The microbial communities in the primary sludge were very similar in all samples and the most abundant genera were Streptococcus, Arcobacter and Trichococcus (Fig. S7). The most abundant genera in the surplus sludge were also very similar in most plants reflecting the presence of abundant core species such as Tetrasphaera, Ca. Microthrix, and Ca. Amarilinum (Fig. S8).

Survival of influent bacteria in the digesters

Some organisms were present in both of the influent streams and the digesters, whereas others were detected almost exclusively in one of the three sample types (Figs 4 and 5). No overlap was found between the communities in the influent streams and in reactors with THP (Fig. 4). Some organisms, such as Tetrasphaera, Ca. Microthrix, and Rhodobacter, were generally present in both the surplus sludge and the digesters, regardless of process temperature. Other organisms, such as Arcobacter, Streptococcus, and Blautia, which were the most abundant bacterial genera in the primary sludge, were hardly detected in the digesters.

Figure 4.

Figure 4

Heatmap of the 20 most abundant bacterial genera (A) Taxa sorted by the mean read abundance in the influent (primary and surplus sludge) (B) Taxa sorted by the mean read abundance in the anaerobic digesters (mesophilic, thermophilic and THP). The numbers represent mean read abundances for groups with more reactors and more samples (30–279).

Figure 5.

Figure 5

Distribution of ratios between mean OTU read abundance in the anaerobic digesters and the influent streams (primary and surplus sludge) weighted by the read abundance in the anaerobic digesters. Higher ratios mean that the relative read abundance for the OTU is enriched in the digester compared to the primary and surplus sludges, ratios close to 1 or below indicates that the OTU is not enriched in the digester. Mesophilic (Inline graphic), thermophilic (Inline graphic), and mesophilic with thermal hydrolysis pre-treatment (Inline graphic THP).

We tried to assess whether the immigrating organisms tended to die off, survive, or grow in the digesters by calculating the ratios of their mean read abundance in the digester compared to the mean read abundance in the influent streams (Fig. 5, Figs S4S9 & Table S2). This calculation does not give an exact measure of the growth rate of the individual species as has previously been performed based on detailed mass-balances11, 17. However, despite some variability in the sludge retention times and in the fraction of primary and surplus sludge, a clear bimodal distribution of the ratios (Fig. 5) was observed for the mesophilic and thermophilic reactors without the thermal hydrolysis pre-treatment. This indicates that there was a peak for the group of organisms with high ratios that were heavily enriched in the digesters compared to the influent streams, and thus likely growing in the system. The peak with lower ratios include the group of organisms with a read abundance that is unchanged or lower compared to the influent streams and these OTUs are thus likely non-growing or dying off in the digesters. This bimodal behaviour was also seen for the individual plants when samples from influent streams and digesters were analysed (Fig. S9). The bimodal distributions had a split around a ratio of 10 (Fig. 5), indicating that there was a clear difference between organisms growing exclusively in anaerobic digesters and organisms that were dying off or only present because they were fed into the anaerobic digester. The ratio distribution for the THP plants, which were used as a control for reactors without living biomass going into the reactors, had a single peak above 10 supporting that OTUs above this threshold are likely growing in the reactors. Some of the seemingly most abundant organisms in the digesters, such as the genera Tetrasphaera and Ca. Microthrix, had ratios close to or below one (Table S2), which indicate that they were likely not growing and only present in the digesters by being supplied with the influent streams. However, 203 of the 300 most abundant OTUs had ratios above 10, which indicate that there is a clear shift for microorganisms growing exclusively in the digesters, e.g. the genera Ca. Fermentibacter, Fastidiosipila and Coprothermobacter.

Discussion

In this study, the microbial communities of 32 full-scale anaerobic digesters at wastewater treatment plants and their influent streams were analysed using 16 S rRNA gene amplicon sequencing to identify the abundant and growing microorganisms of these biotechnologically important systems. Principal component analysis (PCA) revealed that the bacterial communities were distinct for the thermophilic, mesophilic and mesophilic with THP systems (Fig. 1B). The different communities observed in the mesophilic systems, with and without THP, may be partly attributed to the reported higher concentrations of ammonia in the latter (Table S1). These findings are consistent with previous studies which have also shown communities to be clearly influenced by process temperatures and ammonia concentrations9. The six-year survey period of the current study indicates that the digester communities at each wastewater treatment plants were relatively stable over time (Fig. S10). Furthermore, more detailed analyses of the taxa revealed that the most abundant organisms were shared between reactors of the same process type (Figs 2 and 3).

Previous studies have also observed abundant organisms that were shared among many anaerobic digester plants of similar operation6, 7, 9. This common finding indicates that efforts to characterise the process-important organisms is feasible, with less than 300 OTUs accounting for 80% of the amplicon reads across all plants in the current study. Previous attempts to identify the important genera in anaerobic digesters has been hampered by the lack of taxonomic classification of several abundant organisms with commonly applied taxonomies. As a consequence, previous studies have often focused microbial analysis on OTUs with taxonomic classification to high levels, such as phylum, order or class6, 7, 9, where the link between phylogeny and function is more unreliable31. In this study, we have sought to address this problem by updating the MiDAS taxonomy to cover abundant genus level taxa in full-scale anaerobic digesters32, along with abundant organisms previously identified in activated sludge22. Application of the updated taxonomy in this study gave genus level classification for 78% and 97% of all the bacterial and archaeal reads, respectively. Of the bacterial OTUs within top 300 (“abundant core”) the ones with MiDAS specific genus classification accounted for 31% of the bacterial reads.

Importantly, a substantial presence of incoming organisms or their DNA in the community of the assessed digesters was observed in this study (Fig. 5), indicating that some of the seemingly most abundant organisms were related to influent streams rather than growing. To assess the source and activity of abundant organisms, we performed the microbial analysis on the primary and surplus sludge and calculated the ratio of their read abundance in these influent streams and the receiving digesters. The ratios indicate if continuous transfer into the system, and/or active growth, maintains an organism’s read abundance. Fifteen percent of the 300 OTUs, which accounted for 80% of the reads, had ratios of one or below. Four of the 25 most abundant genera (Fig. 3B) had low relative read abundance ratios. These included Tetrasphaera, Ca. Microthrix, Clostridium sensu strictu 1 and Romboutsia; which are all genera that were also seemingly shared among mesophilic and thermophilic reactors but not present in the reactors with THP. The suggestion that some of these do not belong in anaerobic digesters is also supported by what is known about their metabolism e.g. Ca. Microthrix is a known aerobe33. A similar approach also identified abundant inactive influent organisms in a single anaerobic digester treating surplus sludge17. Inactive organisms identified in the previous study, including Trichococcus, Rhodobacter and Thauera, were also determined to be non-growing in the current study - having ratios lower than one.

The impact of the influent on the observed community causes a multitude of problems for microbial analysis in digesters as it interferes with attempts to establish relationships between microorganisms and process performance. It is worth noting that, despite likely being non-growing, the persistence of the filamentous members of the genus Ca. Microthrix, which is well-known to cause bulking and foaming in activated sludge systems, also has been linked to foaming problems in receiving anaerobic digester systems34, 35.

The identification of non-growing populations in anaerobic digesters further shortens the list of microorganisms likely most important to the bulk transformations of these systems. The majority of previously characterised growing genera are known to be fermentative organisms; including Coprothermobacter 36 and Anaerobaculum 37 in thermophilic systems, and Thermovirga 38 , Leptolinea 39 and Ca. Fermentibacter40 in mesophilic systems. Smithella 41 and Gelria 42 represent known acetogens. In general, apart from the influent organisms, abundant genera were generally not shared between thermophilic and mesophilic systems. The exception within the top 25 is the genus Gelria – which was present in both mesophilic and thermophilic reactors with a high read abundance and ratio. The genus was originally isolated from a thermophilic methanogenic enrichment43. However, the underlying species-level OTUs differ between the mesophilic and thermophilic reactors, indicating that organisms even within the same genus can occupy distinct niches in these systems (Fig. S11). It is an important observation that for a substantial proportion of the abundant genus-level taxa nothing is known of their potential role in these systems. These include the MiDAS taxa T78, B55_F and G35_D8, within the phyla Chloroflexi, Firmicutes and Bacteroidetes, respectively (Fig. 3B), which are obvious targets for future research into the ecology of these systems. Influent populations of the archaeal domain were not assessed.

The dominant Archaea in the mesophilic reactors running on primary and surplus sludge was Methanosaeta, with a range of other hydrogenoclastic organisms such as Methanolinea, Methanospirillum, Methanobrevibacter as well as Ca. Methanofastidiosa (WCHA1-57) at lower read abundances. The uncultured Ca. Methanofastidiosa is suggested to be restricted to methylated thiol reduction for methane generation as all known genomes lack genes for acetoclastic and CO2-reducing methanogenesis30. The dominance of Methanosaeta in mesophilic digesters is supported by other studies using amplicon sequencing, qPCR and shotgun sequencing7, 9, 25.

Methanothermobacter and Methanosarcina were the dominant methanogens in the thermophilic systems. The difference between the dominant acetoclastic methanogen could be due to process temperature or shorter residence times as both Methanosaeta and Methanosarcina cover species able to grow across the entire temperature range of operation44. Interestingly, Methanobrevibacter was also seemingly abundant in the thermophilic reactors, although it is usually considered mesophilic. However, it was not found in mesophilic reactors with thermal hydrolysis pre-treatment, and Methanobrevibacter has previously been found in wastewater treatment processes and isolated from faeces4547 – indicating after all that there may be some influence of immigration on archaeal populations in digesters. In addition to a high read abundance of Methanosaeta, the mesophilic reactors with thermal hydrolysis pre-treatment also had a high read abundance of Methanoculleus. The methanogen Methanoculleus has previously been related to elevated ammonium levels, a relationship that was also supported by the high ammonia levels reported for the THP plants in this study (Table S1)9, 48, 49.

In this study, we present a comprehensive list of the growing microorganisms of full-scale anaerobic digesters receiving primary and surplus sludge from wastewater treatment plants (Fig. S4). The relatively low number of genera makes the organisms needed to study feasible, and biological informed decisions less complex and more tractable. Standard application of the curated MiDAS database32 for anaerobic digester systems, located at wastewater treatment plants, will form an important foundation for future studies of the ecology of these biotechnologically important systems. However, it is important to keep in mind that the list will likely be missing some of the important players due to PCR biases50 and that we need primer-free alternatives to get the entire picture of the microbial diversity in anaerobic digesters51.

Materials and Methods

Sampling

Biomass samples from digesters were obtained 2–4 times a year in the period 2011–2016 from 37 ADs at 21 Danish WWTPs (Supplementary Table S1). For primary sludge, 121 samples from 14 WWTPs were sampled during 3 months in October-December, 2014. Each sample was based on flow proportional sampling collected through 24 h. For surplus activated sludge, 137 sludge samples were obtained from the aeration tank from 23 WWTPs. All samples were homogenised and stored as 2 mL aliquots at −80 °C for DNA extraction.

DNA extraction

DNA was extracted from biomass samples using the FastDNA® Spin kit for soil (MP Biomedicals, Santa Ana, CA, USA) following the standard protocol, except for a 4-time increase in the bead beating duration- as recommended by Albertsen et al.,(2015)21. The biomass input volume was 50 µl for AD sludge and 500 µl for primary sludge and activated sludge. Primary sludge samples were first filtered onto 0.2-µm pore size polycarbonate filters and the DNA extracted from these using the same method described for other samples.

DNA amplification and sequencing

Bacterial PCR

The bacterial primers used were 27 F (AGAGTTTGATCCTGGCTCAG52) and 534 R (ATTACCGCGGCTGCTGG53), which amplify a DNA fragment of ~500 bp of the 16 S rRNA gene (variable regions 1–3). 25 µL PCR reactions in duplicate were run for each sample using 1X Platinum® High fidelity buffer, 100 µM of each dNTP, 1.5 mM MgSO4, 1 U Platinum® Taq DNA Polymerase High Fidelity (Thermo Fisher Scientific, USA), 400 nM of each barcoded V1-V3 primer, and 10 ng template DNA. PCR conditions were 95 °C, for 2 min followed by 30 cycles of {95 °C, for 20 s, 56 °C for 30 s, 72 °C for 60 s} and a final step of elongation at 72 °C for 5 min. PCR products were purified using Agencourt AmpureXP (Beckman Coulter, USA) with a ratio of 0.8 bead solution to PCR solution.

Archaeal PCR

The archaeal primers used were 340 F (CCCTAHGGGGYGCASCA54) and 915 R (GWGCYCCCCCGYCAATTC54), which amplify a DNA fragment of ~ 560 bp of the 16 S rRNA gene (variable regions 3–5). 25 µL PCR reactions in duplicate were run for each sample using 1X Platinum® High fidelity buffer, 100 µM of each dNTP, 1.5 mM MgSO4, 1 U Platinum® Taq DNA Polymerase High Fidelity (Thermo Fisher Scientific, USA), 400 nM of each V3-V5 primer mix, and 10 ng template DNA. PCR conditions were 95 °C, for 2 min followed by 35 cycles of {95 °C, for 20 s, 50 °C for 30 s, 72 °C for 60 s} and a final step of elongation at 72 °C for 5 min. PCR products were purified using Agencourt AmpureXP (Beckman Coulter, USA) with a ratio of 0.8 bead solution/PCR solution. Illumina adapters and barcodes were added with a second PCR. 2 µL purified PCR product from above was used as template for a 25 µL PCR reaction containing 1X PCRBIO Reaction buffer, PCRBIO HiFi Polymerase (PCR Biosystems, United Kingdom). PCR conditions were 95 °C, for 2 min, 8 cycles of {95 °C, for 20 s, 55 °C for 30 s, 72 °C for 60 s} and a final step of elongation at 72 °C for 5 min.

Sequencing

Bacteria and archaea amplicon libraries were pooled separately in equimolar concentrations and diluted to 4 nM. The amplicon libraries were paired-end sequenced (2 × 300 bp) on the Illumina MiSeq using v3 chemistry (Illumina, USA). 10–20% PhiX control library was added to mitigate low diversity library effects.

Read processing and classification

The read data were processed separately for the bacterial and archaeal analysis.

Bacteria

The paired end reads for the bacterial libraries were trimmed using trimmomatic55 and then merged using FLASH56. Bacterial reads were screened for potential PhiX contamination using USEARCH (v. v7.0.1090)57. The reads were clustered at 97% similarity using USEARCH and subsequently classified using the RDP classifier58 with the MiDAS database. The most abundant bacterial (top 80) OTUs from the mesophilic and thermophilic digesters were used to guide curation of the Silva database NR99 v. 1.23 taxonomy as described previously22. The resulting updated MiDAS taxonomy (v. 2.1), covering the abundant organisms of both anaerobic digesters and activated sludge, was applied for all analyses presented in this study.

Archaea

The size of the archaeal V3-V5 fragments made it unattainable to merge the reads, so only read 1 files were used for the analysis. The reads were trimmed to a length of 275 bp. Archaeal reads were screened for potential PhiX contamination using USEARCH (v. v7.0.1090)57. The reads were clustered at 97% similarity using USEARCH and subsequently classified using the RDP classifier58 with the MiDAS database. The most abundant archaeal OTUs (top 40) from the mesophilic and thermophilic digesters were used to guide curation of the Silva database NR99 v. 1.23 taxonomy as described previously22. The resulting updated MiDAS taxonomy (v. 2.1), covering the abundant organisms of both anaerobic digesters and activated sludge, was applied for all analyses presented in this study.

Data visualisation

Further processing of the OTU table was carried out in the R environment (v. 3.3.2)59 using the R studio IDE60 using the ampvis package (v. 1.27.021) for visualisation. The ampvis package wraps a number of packages including the phyloseq package (v. 1.19.1)61, ggplot2 (v. 2.2.1), reshape2 (v. 1.4.2)62, dplyr (v. 0.5.0)63, vegan (v. 2.4–1)64, knitr (v. 1.15.1)65, Biostrings (v. 2.42.1)66, data.table (v. 1.10.0)67, DESeq. 2 (v. 1.14.1)68, ggdendro (v. 0.1–20)69, and stringr (v. 1.1.0)70, and cowplot (v. 0.7.0). The samples were subsampled to an even depth of 10 000 reads per sample. Archaeal primers were not specific to the domain, so sequences not classified as Archaea were discarded and the count transformed to a fraction of the archaeal reads. Ratios were calculated between the average read abundance for a given OTU within the sample group (mesophilic digesters, thermophilic digesters, mesophilic digesters with thermal hydrolysis pre-treatment) and the average read abundance in the influent streams (primary and surplus sludge).

Data availability

Amplicon sequencing data is available at the ENA with the project ID PRJEB15624. OTU tables and metadata files are available at figshare (DOI: 10.6084/m9.figshare.4308191). The RMarkdown files to generate the figures are available at github (github.com/Kirk3gaard/Publications/tree/master/Kirkegaard2017). The curated MiDAS taxonomy32 (v. 2.1) is available for download from the MiDAS website (midasfieldguide.org/en/download/).

Electronic supplementary material

Supplementary info (1.4MB, pdf)
Supplementary Table S1 (2.3KB, csv)
Supplementary Table S2 (838.9KB, csv)

Acknowledgements

We would like to express our gratitude for the plant operators for sending samples and supplying metadata. The project was funded by the Villum foundation (grant no. VKR 022796) and the Innovation Fund Denmark (NomiGas, grant no. 1305-00018B) to P.H. Nielsen. S. McIlroy was supported by the Danish Council for Independent Research (grant no. 4093-00127 A).

Author Contributions

R.H.K., P.H.N., M.A. & J.M.K. designed the experiments; R.H.K., J.M.K., M.N. & S.M.K. performed the experiments. R.H.K., S.J.M. & M.A. performed the data analysis; R.H.K., S.J.M., M.A. & P.H.N. wrote the main manuscript text. All authors reviewed the manuscript.

Competing Interests

Rasmus H. Kirkegaard, Mads Albertsen, Søren M. Karst, and Per H. Nielsen own the DNA analysis based company DNASense ApS. Morten S. Dueholm is employed by DNASense ApS. The remaining authors declare no conflict of interest.

Footnotes

Electronic supplementary material

Supplementary information accompanies this paper at doi:10.1038/s41598-017-09303-0

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Weiland P. Biogas production: Current state and perspectives. Appl. Microbiol. Biotechnol. 2010;85:849–860. doi: 10.1007/s00253-009-2246-7. [DOI] [PubMed] [Google Scholar]
  • 2.de los Reyes MF, de los Reyes FL, Hernandez M, Raskin L. Quantification of Gordona amarae strains in foaming activated sludge and anaerobic digester systems with oligonucleotide hybridization probes. Appl. Environ. Microbiol. 1998;64:2503–12. doi: 10.1128/aem.64.7.2503-2512.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Appels L, et al. Anaerobic digestion in global bio-energy production: Potential and research challenges. Renew. Sustain. Energy Rev. 2011;15:4295–4301. doi: 10.1016/j.rser.2011.07.121. [DOI] [Google Scholar]
  • 4.Sekiguchi Y, et al. First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking. PeerJ. 2015;3 doi: 10.7717/peerj.740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ahring BK, Westermann P, Ahringt BK, Westermann P. Thermophilic anaerobic degradation of butyrate by a butyrate-utilizing bacterium in coculture and triculture with methanogenic bacteria. Appl. Environmantal Microbiol. 1987;53:429–33. doi: 10.1128/aem.53.2.429-433.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rivière D, et al. Towards the definition of a core of microorganisms involved in anaerobic digestion of sludge. ISME J. 2009;3:700–714. doi: 10.1038/ismej.2009.2. [DOI] [PubMed] [Google Scholar]
  • 7.Sundberg C, et al. 454 Pyrosequencing analyses of bacterial and archaeal richness in 21 full-scale biogas digesters. FEMS Microbiol. Ecol. 2013;85:612–626. doi: 10.1111/1574-6941.12148. [DOI] [PubMed] [Google Scholar]
  • 8.Nielsen PH, et al. A conceptual ecosystem model of microbial communities in enhanced biological phosphorus removal plants. Water Res. 2010;44:5070–5088. doi: 10.1016/j.watres.2010.07.036. [DOI] [PubMed] [Google Scholar]
  • 9.Vrieze JD, et al. Ammonia and temperature determine potential clustering in the anaerobic digestion microbiome. Water Res. 2015;75:312–23. doi: 10.1016/j.watres.2015.02.025. [DOI] [PubMed] [Google Scholar]
  • 10.Werner JJ, et al. Bacterial community structures are unique and resilient in full-scale bioenergy systems. Proc. Natl. Acad. Sci. U. S. A. 2011;108:4158–4163. doi: 10.1073/pnas.1015676108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Saunders, A. M. A. M. et al. The activated sludge ecosystem contains a core community of abundant organisms. ISME J. 1–10, doi:10.1038/ismej.2015.117 (2015). [DOI] [PMC free article] [PubMed]
  • 12.Fodor, A. A. et al. The ‘most wanted’ taxa from the human microbiome for whole genome sequencing. PLoS One7 (2012). [DOI] [PMC free article] [PubMed]
  • 13.Nocker A, Cheung C-Y, Camper AK. Comparison of propidium monoazide with ethidium monoazide for differentiation of live vs. dead bacteria by selective removal of DNA from dead cells. J. Microbiol. Methods. 2006;67:310–320. doi: 10.1016/j.mimet.2006.04.015. [DOI] [PubMed] [Google Scholar]
  • 14.Wagner AO, Malin C, Knapp BA, Illmer P. Removal of free extracellular DNA from environmental samples by ethidium monoazide and propidium monoazide. Appl. Environ. Microbiol. 2008;74:2537–9. doi: 10.1128/AEM.02288-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lee S, Kang H-J, Park H-D. Influence of influent wastewater communities on temporal variation of activated sludge communities. Water Res. 2015;73:132–44. doi: 10.1016/j.watres.2015.01.014. [DOI] [PubMed] [Google Scholar]
  • 16.Seib MDD, Berg KJJ, Zitomer DHH. Influent wastewater microbiota and temperature influence anaerobic membrane bioreactor microbial community. Bioresour. Technol. 2016;216:446–452. doi: 10.1016/j.biortech.2016.05.098. [DOI] [PubMed] [Google Scholar]
  • 17.Mei R, Narihiro T, Nobu MK, Kuroda K, Liu W-T. Evaluating digestion efficiency in full-scale anaerobic digesters by identifying active microbial populations through the lens of microbial activity. Sci. Rep. 2016;6 doi: 10.1038/srep34090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Quast C, et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 2013;41:590–596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Cole JR, et al. The Ribosomal Database Project: improved alignments and new tools for rRNA analysis. Nucleic Acids Res. 2009;37:D141–D145. doi: 10.1093/nar/gkn879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Mcdonald D, et al. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. ISME J. 2012;6:610–618. doi: 10.1038/ismej.2011.139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Albertsen M, Karst SM, Ziegler AS, Kirkegaard RH, Nielsen PH. Back to basics - The influence of DNA extraction and primer choice on phylogenetic analysis of activated sludge communities. PLoS One. 2015;10 doi: 10.1371/journal.pone.0132783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.McIlroy SJ, et al. MiDAS: the field guide to the microbes of activated sludge. Database. 2015;2015 doi: 10.1093/database/bav062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Debelius J, et al. Tiny microbes, enormous impacts: what matters in gut microbiome studies? Genome Biol. 2016;17 doi: 10.1186/s13059-016-1086-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wirth R, et al. Characterization of a biogas-producing microbial community by short-read next generation DNA sequencing. Biotechnol. Biofuels. 2012;5 doi: 10.1186/1754-6834-5-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Luo G, Fotidis IA, Angelidaki I. Comparative analysis of taxonomic, functional, and metabolic patterns of microbiomes from 14 full-scale biogas reactors by metagenomic sequencing and radioisotopic analysis. Biotechnol. Biofuels. 2016;9 doi: 10.1186/s13068-016-0465-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Solli L, Håvelsrud OE, Horn SJ, Rike AG. A metagenomic study of the microbial communities in four parallel biogas reactors. Biotechnol. Biofuels. 2014;7 doi: 10.1186/s13068-014-0146-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Guo J, et al. Dissecting microbial community structure and methane-producing pathways of a full-scale anaerobic reactor digesting activated sludge from wastewater treatment by metagenomic sequencing. Microb. Cell Fact. 2015;14 doi: 10.1186/s12934-015-0218-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Campanaro S, et al. Metagenomic analysis and functional characterization of the biogas microbiome using high throughput shotgun sequencing and a novel binning strategy. Biotechnol. Biofuels. 2016;9 doi: 10.1186/s13068-016-0441-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Weinstock GM. Genomic approaches to studying the human microbiota. Nature. 2012;489:250–256. doi: 10.1038/nature11553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Nobu MK, Narihiro T, Kuroda K, Mei R, Liu W-T. Chasing the elusive Euryarchaeota class WSA2: genomes reveal a uniquely fastidious methyl-reducing methanogen. ISME J. 2016;10:2478–87. doi: 10.1038/ismej.2016.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Martiny AC, Treseder K, Pusch G. Phylogenetic conservatism of functional traits in microorganisms. ISME J. 2013;7:830–8. doi: 10.1038/ismej.2012.160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.McIlroy SJ, et al. MiDAS 2.0: an ecosystem-specific taxonomy and online database for the organisms of wastewater treatment systems expanded for anaerobic digester groups. Database. 2017;2017 doi: 10.1093/database/bax016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Rossetti S, Tomei MC, Nielsen PH, Tandoi V. ‘Microthrix parvicella’, a filamentous bacterium causing bulking and foaming in activated sludge systems: A review of current knowledge. FEMS Microbiol. Rev. 2005;29:49–64. doi: 10.1016/j.femsre.2004.09.005. [DOI] [PubMed] [Google Scholar]
  • 34.Westlund ÅD, Hagland E, Rothman M. Operational aspects on foaming on digesters caused by Microtrix parvicella. Water Sci. Technol. 1998;38:29–34. [Google Scholar]
  • 35.Ganidi N, Tyrrel S, Cartmell E. Anaerobic digestion foaming causes–a review. Bioresour. Technol. 2009;100:5546–54. doi: 10.1016/j.biortech.2009.06.024. [DOI] [PubMed] [Google Scholar]
  • 36.Etchebehere C, Pavan ME, Zorzópulos J, Soubes M, Muxí L. Coprothermobacter platensis sp. nov., a new anaerobic proteolytic thermophilic bacterium isolated from an anaerobic mesophilic sludge. Int. J. Syst. Bacteriol. 1998;48(Pt 4):1297–1304. doi: 10.1099/00207713-48-4-1297. [DOI] [PubMed] [Google Scholar]
  • 37.Rees GN, Patel BK, Grassia GS, Sheehy AJ. Anaerobaculum thermoterrenum gen. nov., sp. nov., a novel, thermophilic bacterium which ferments citrate. Int. J. Syst. Bacteriol. 1997;47:150–4. doi: 10.1099/00207713-47-1-150. [DOI] [PubMed] [Google Scholar]
  • 38.Dahle H, Birkeland N-KK. Thermovirga lienii gen. nov., sp. nov., a novel moderately thermophilic, anaerobic, amino-acid-degrading bacterium isolated from a North Sea oil well. Int. J. Syst. Evol. Microbiol. 2006;56:1539–45. doi: 10.1099/ijs.0.63894-0. [DOI] [PubMed] [Google Scholar]
  • 39.Yamada T, et al. Anaerolinea thermolimosa sp. nov., Levilinea saccharolytica gen. nov., sp. nov. and Leptolinea tardivitalis gen. nov., sp. nov., novel filamentous anaerobes, and description of the new classes Anaerolineae classis nov. and Caldilineae classis nov. Int. J. Syst. Evol. Microbiol. 2006;56:1331–40. doi: 10.1099/ijs.0.64169-0. [DOI] [PubMed] [Google Scholar]
  • 40.Kirkegaard, R. H. et al. Genomic insights into members of the candidate phylum Hyd24-12 common in mesophilic anaerobic digesters. ISME J. 1–13, doi:10.1038/ismej.2016.43 (2016). [DOI] [PMC free article] [PubMed]
  • 41.Liu Y, Balkwill DL, Aldrich HC, Drake GR, Boone DR. Characterization of the anaerobic propionate-degrading syntrophs Smithella propionica gen. nov., sp. nov. and Syntrophobacter wolinii. Int. J. Syst. Bacteriol. 1999;49(Pt 2):545–56. doi: 10.1099/00207713-49-2-545. [DOI] [PubMed] [Google Scholar]
  • 42.Mosbæk F, et al. Identification of syntrophic acetate-oxidizing bacteria in anaerobic digesters. ISME J. 2016;2:1–14. doi: 10.1038/ismej.2016.39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Plugge CM, Balk M, Zoetendal EG, Stams AJM. Gelria glutamica gen. nov., sp. nov., a thermophilic, obligately syntrophic, glutamate-degrading anaerobe. Int. J. Syst. Evol. Microbiol. 2002;52:401–7. doi: 10.1099/00207713-52-2-401. [DOI] [PubMed] [Google Scholar]
  • 44.De Vrieze J, Hennebel T, Boon N, Verstraete W. Methanosarcina: The rediscovered methanogen for heavy duty biomethanation. Bioresour. Technol. 2012;112:1–9. doi: 10.1016/j.biortech.2012.02.079. [DOI] [PubMed] [Google Scholar]
  • 45.Gray ND, Miskin IP, Kornilova O, Curtis TP, Head IM. Occurrence and activity of archaea in aerated activated sludge wastewater treatment plants. Environ. Microbiol. 2002;4:158–168. doi: 10.1046/j.1462-2920.2002.00280.x. [DOI] [PubMed] [Google Scholar]
  • 46.Miller TL, Wolin MJ, Demacario EC, Macario AJL. Isolation of Methanobrevibacter-Smithii From Human Feces. Appl. Environ. Microbiol. 1982;43:227–232. doi: 10.1128/aem.43.1.227-232.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Johnston C, Ufnar JA, Griffith JF, Gooch JA, Stewart JR. A real-time qPCR assay for the detection of the nifH gene of Methanobrevibacter smithii, a potential indicator of sewage pollution. J. Appl. Microbiol. 2010;109:1946–1956. doi: 10.1111/j.1365-2672.2010.04824.x. [DOI] [PubMed] [Google Scholar]
  • 48.Carballa M, Regueiro L, Lema JM. Microbial management of anaerobic digestion: exploiting the microbiome-functionality nexus. Curr. Opin. Biotechnol. 2015;33:103–11. doi: 10.1016/j.copbio.2015.01.008. [DOI] [PubMed] [Google Scholar]
  • 49.Ziganshin AM, Liebetrau J, Pröter J, Kleinsteuber S. Microbial community structure and dynamics during anaerobic digestion of various agricultural waste materials. Appl. Microbiol. Biotechnol. 2013;97:5161–5174. doi: 10.1007/s00253-013-4867-0. [DOI] [PubMed] [Google Scholar]
  • 50.Eloe-Fadrosh, E. A., Ivanova, N. N., Woyke, T. & Kyrpides, N. C. Metagenomics uncovers gaps in amplicon-based detection of microbial diversity. Nat. Microbiol. 15032, doi:10.1038/nmicrobiol.2015.32 (2016). [DOI] [PubMed]
  • 51.Karst SM, et al. Thousands of primer-free, high-quality, full-length SSU rRNA sequences from all domains of life. bioRxiv. 2016;10 [Google Scholar]
  • 52.Lane, D. J. 16S/23S rRNA sequencing. Nucleic acid Tech. Bact. Syst. 125–175 (1991).
  • 53.Muyzer G, Waal ECDE, Uitierlinden AG, De Waal EC, Uitterlinden AG. Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA. Appl. Environ. Microbiol. 1993;59:695–700. doi: 10.1128/aem.59.3.695-700.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pinto AJ, Raskin L. PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets. PLoS One. 2012;7 doi: 10.1371/journal.pone.0043093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Salzberg SL, Magoč T, Salzberg SL. FLASH: Fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
  • 58.Wang Q, Garrity GM, Tiedje JM, Cole JR, Al WET. Naïve Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 2007;73:5261–5267. doi: 10.1128/AEM.00062-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. www.r-project.org/ (2016).
  • 60.RStudio Team RStudio: Integrated Development Environment for R. RStudio, Inc. Boston, USA. www.rstudio.com/ (2015).
  • 61.McMurdie PJ, Holmes S. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One. 2013;8 doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Wickham H. Reshaping data with the {reshape} package. J. Stat. Softw. 2007;21:1–20. doi: 10.18637/jss.v021.i12. [DOI] [Google Scholar]
  • 63.Wickham, H. & Francois, R. dplyr: A grammar of data manipulation. R Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=dplyr (2015).
  • 64.Oksanen, J. et al. vegan: Community Ecology Package. Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=vegan (2015).
  • 65.Xie, Y knitr: A general-purpose package for dynamic report generation in R. Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=knitr (2013).
  • 66.Pages, H., Gentleman, R., Aboyoun, P. & DebRoy, S. Biostrings: String objects representing biological sequences, and matching algorithms. Bioconductor, Buffalo, USA. http://bioconductor.org/packages/release/bioc/html/Biostrings.html (2008).
  • 67.Dowle, M. et al. data.table: Extension of data.frame. Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=data.table (2014).
  • 68.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq. 2. Genome Biol. 2014;15 doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.de Vries, A. & Ripley, B. D. ggdendro: Create dendrograms and tree diagrams using ‘ggplot2’. Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=ggdendro (2015).
  • 70.Wickham, H. stringr: Simple, consistent wrappers for common string operations. Foundation for Statistical Computing, Vienna, Austria. https://cran.r-project.org/package=stringr (2015).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary info (1.4MB, pdf)
Supplementary Table S1 (2.3KB, csv)
Supplementary Table S2 (838.9KB, csv)

Data Availability Statement

Amplicon sequencing data is available at the ENA with the project ID PRJEB15624. OTU tables and metadata files are available at figshare (DOI: 10.6084/m9.figshare.4308191). The RMarkdown files to generate the figures are available at github (github.com/Kirk3gaard/Publications/tree/master/Kirkegaard2017). The curated MiDAS taxonomy32 (v. 2.1) is available for download from the MiDAS website (midasfieldguide.org/en/download/).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES