Abstract
Dietary DNA metabarcoding enables researchers to identify and characterize trophic interactions with a high degree of taxonomic precision. It is also sensitive to sources of bias and contamination in the field and laboratory. One of the earliest and most common strategies for dealing with such sensitivities has been to remove all low‐abundance sequences and conduct ecological analyses based on the presence or absence of food taxa. Although this step is now often perceived to be necessary, evidence of its sufficiency is lacking and more attention to the risk of introducing other errors is needed. Using computer simulations, we demonstrate that common strategies to remove low‐abundance sequences can erroneously eliminate true dietary sequences in ways that impact downstream inferences. Using real data from well‐studied wildlife populations in Yellowstone National Park, we further show how these strategies can markedly alter the composition of dietary profiles in ways that scale‐up to obscure ecological interpretations about dietary generalism, specialism, and composition. Although the practice of removing low‐abundance sequences may continue to be a useful strategy to address research questions that focus on a subset of relatively abundant foods, its continued widespread use risks generating misleading perceptions about the structure of trophic networks. Researchers working with dietary DNA metabarcoding data—or similar data such as environmental DNA, microbiomes, or pathobiomes—should be aware of drawbacks and consider alternative bioinformatic, experimental, and statistical solutions.
Keywords: bighorn sheep, bison, frequency of occurrence, grazer‐browser continuum, herbivore, Hill numbers, microhistology, relative read abundance
1. INTRODUCTION
Advances in dietary DNA metabarcoding have revolutionized our ability to address some of the most fundamental goals in ecology: to quantify the diversity of species and understand how they interact. Myriad technical developments in molecular ecology have improved our ability to identify foods, assess dietary diversity and overlap, and measure the relative abundance of taxa in the diets of wild animals, livestock, and humans (Deagle et al., 2009; Mata et al., 2019; Pegard et al., 2009; Reese et al., 2019; Valentini et al., 2009). Yet the utility of dietary DNA metabarcoding methods has occasionally come under scrutiny over doubts about the accuracy of a particular step in the bioinformatic pipelines that ultimately determine how we interpret the data.
Among the most controversial decisions about how to analyse dietary DNA has been how to convert sequence count data into dietary profiles. Early research primarily aimed to validate DNA metabarcoding as a method to generate accurate and precise lists of food species (Shehzad et al., 2012; Soininen et al., 2009; Valentini et al., 2009; Zeale et al., 2011). Researchers quickly became aware that contamination, PCR and sequencing errors, tag jumps, and biological and technical biases can complicate this aim (Pompanon et al., 2012). High‐throughput sequencing technologies generate thousands of low‐quality and erroneous sequences with each run. Since most errors occur at low relative abundances, truncating the long tail of the sequence abundance distribution clearly removes many such errors. Because eliminating errors is desirable, and because the goal of generating a list of food species can be accomplished using presence/absence data, an apparently simple solution was to eliminate low‐abundance sequences and conclude that the rest were “present” in a sample. Yet, despite the focus on presence/absence data, diet data sets often retained biologically meaningful signals of sequence relative read abundance (RRA) that could be corroborated by feeding trials (Deagle et al., 2009; Willerslev et al., 2014), stable‐isotopes analysis (Kartzinel et al., 2015), and microhistology (Soininen et al., 2009), albeit often with the need for correction factors (Thomas et al., 2016) or the lumping of sequences into operational taxonomic units (Clarke et al., 2014). Observations like these established awareness of the need to balance the aims of quantifying both the presence and relative read abundance of “true” food species against the risk of including errors.
Researchers continue to emphasize the occurrence of relatively abundant taxa because of the legacy of the early literature on presence/absence data, because it is an apparently simple solution to the risk of including errors, and because of the assumption that the most ecologically and nutritiously important foods are abundant. A common strategy is simply to remove taxa from a sample that do not exceed a minimum threshold. Dietary analyses may rely on minimum overall count thresholds (Valentini et al., 2009) or, more often in recent studies, sample‐wise RRA thresholds (Ait Baamrane et al., 2012; Kartzinel et al., 2015; Pompanon et al., 2012). Since removing sequences with low overall or sample‐wise RRA can eliminate low‐abundance errors, it is often characterized as a “conservative” option that minimizes the risk of including false‐positive sequences (Alberdi et al., 2018; Ando et al., 2020). Crucial drawbacks to this strategy, however, include the often arbitrary and subjective selection of thresholds, the inability to eliminate errors that exceed whatever threshold is selected, and the risk of inadvertently excluding true dietary taxa while inflating the apparent importance of the taxa that remain (Deagle et al., 2019; Kelly et al., 2019). There is thus much confusion about how to employ this strategy, and evidence of its efficacy is lacking, creating a need to focus on appropriate alternative strategies and the treatment of rare taxa (Ando et al., 2020).
What does it mean to be “conservative” with dietary DNA metabarcoding data? The answer to this question must be evaluated in the context of the goals of a particular study. More than a decade ago when the predominant goal in this field was to evaluate whether a particular taxon was present in a sample, it may have been prudent to discard low‐abundance sequences as putative contaminants in order to avoid including spurious taxa. On the other hand, there is a risk in removing low‐abundance sequences that could represent rare food taxa and provide information about animal nutrition, foraging behavior, and the structure of food webs. Understanding the ecology and evolution of dietary specialization, for example, requires differentiation of dietary specialists that concentrate on one or a few resources from dietary generalists that utilize a more even array of resources (Araújo et al., 2011; Bolnick et al., 2003). Understanding how individual feeding interactions scale‐up to establish the links and nodes of complex trophic networks requires determination of interaction strengths, and whether “weak” links are as important to the network as they are often theorized to be (Pringle & Hutchinson, 2020). Real sources of variation in dietary breadth and interaction strength will necessarily translate into variation in the number of rare dietary DNA sequences that appear in data, and this in turn ensures that any bioinformatic decision about how to treat low‐abundance sequences can differentially impact the dietary profiles of animals with broad versus narrow diets. Awareness of how common strategies for analyzing low‐abundance sequences distort dietary profiles and alter ecological interpretations is critical for the appropriate treatment of data.
2. DIFFERENTIAL IMPACTS OF DATA FILTERING ON SIMULATED DIETS
Although abundance‐filtering is common in DNA metabarcoding pipelines, the RRA cutoff used to determine which taxa count as “present” varies widely among studies (Alberdi et al., 2018; Pompanon et al., 2012). Deagle et al. (2019) suggested that a 1% RRA threshold may be suitable for many dietary studies, but values in the range of 0%–5% are not uncommon (Alberdi et al., 2018; Bohmann et al., 2018; Kartzinel et al., 2019; Komura et al., 2018; ter Schure et al., 2021).
2.1. Dietary profile simulations
Consider variation in both the richness and rank‐abundance distribution of animal diets. Diets are generally characterized by skewed distributions that are concentrated on a small number of predominant resources but also include many rare resources (Forister et al., 2015). These heavy‐tailed distributions can be approximated by a power law function known as the Pareto distribution (Appendix S1 provides detailed simulation methods). Dietary specialists, such as the koala (Phascolarctos cinereus), usually consume a very small number of food taxa, and this type of dietary profile can be represented by a highly‐skewed distribution (e.g., Figure 1a). In contrast, dietary generalists, such as the racoon (Procyon lotor), tend to consume a wide variety of resources that can be represented by a more even distribution (e.g., Figure 1c). The skewness of a Pareto distribution is denoted by a shape parameter, α (Figure S1).
To develop theoretical intuition, we simulated diet profiles of generalist and specialist consumers (Appendix S1). In our simulations, we considered a set of three hypothetical consumers with access to an identical food base comprising 100 potentially suitable taxa. We assumed that each consumer differed only in the specificity with which it consumed food taxa. This reflects the aims of many empirical studies to characterize feeding specializations, food preferences, foraging behaviors, competition, or other ecological factors that influence the composition of diets and the structure of trophic networks (Figure 1a–c; Appendix S1). For each consumer, the probability that each of the 100 food taxa would be selected was calculated using the R package Pareto v.2.3.0 (Riegel, 2018). To generate diet profiles that differed in their degree of specialism, we defined probability distributions using skew values to represent specialist (α = 0.20), intermediate (α = 0.35), and generalist (α = 1.00) diets (Appendix S1, Figure S2). From each of these probability distributions, we made 25,000 draws to simulate a common number of DNA metabarcoding sequence‐reads obtained per sample with an Illumina MiSeq. The resulting diets are thus theoretically free of any differences in sampling, contamination, technical bias, and error—they represent an ideal data set in which draws of taxa are made in proportion to their true abundances. The specialist's diet profile comprised a much narrower subset of the 100 available resources compared to the intermediate and generalist feeder (Figure 1a–c).
2.2. Skewed diet profiles are differentially impacted by the same thresholds
To evaluate the theoretical impact of removing low‐abundance taxa from the three simulated diet profiles, we incrementally filtered taxa that did not exceed thresholds of 0%–5% RRA (Appendix S1). We found that abundance‐filtering differentially impacted the relatively skewed diet profiles of specialist and intermediate feeders compared to the relatively even diet profile of the generalist (Figure 1). Incrementally removing taxa resulted in a large decrease in richness for each sample, but an immediate 75%–80% loss of dietary richness occurred after applying only a mild 0.2% filter to the specialist and intermediate diet profiles, respectively (Figure 1d–e). In contrast, the generalist diet profile did not exhibit any reduction in richness until more stringent thresholds were applied (Figure 1d–e). Then, at an intermediate threshold of 2.8%, diet profiles converged to generate the incorrect impression that all samples had similar richness (Figure 1d; grey dashed line). At this level, there was a >95% reduction in the inferred richness of the intermediate and generalist diets (Figure 1e; grey dashed line). All else equal, these simulations reveal how any threshold can have a qualitative impact on the apparent richness and composition of a sample in ways that differ depending on its true composition.
3. DIFFERENTIAL IMPACTS OF DATA FILTERING ON WELL‐STUDIED WILDLIFE DIETS
We sought to illustrate how abundance‐filtering can qualitatively alter interpretations of real dietary DNA metabarcoding data by testing predictions about (i) seasonality and (ii) trait‐based differences in the diets of large herbivores from Yellowstone National Park (Appendix S2 provides detailed methods for wildlife diet analysis). Following a well‐established annual migration, Yellowstone bison (Bison bison) and bighorn sheep (Ovis canadensis) follow springtime plant green‐up from their lower elevation winter range to higher elevation meadows. We predicted both species would exhibit greater dietary richness in summer compared to winter because: both species graze on graminoids year round; both have access to a larger number of plant species in summer; both incorporate species‐rich suites of forbs into summer diets; and both incorporate species‐poor shrubs into winter diets (Bergmann et al., 2015; Craine, 2021; Geist, 1971; Peden et al., 1974; Wagner & Peek, 2006). We further predicted that bison would have greater dietary richness than bighorn sheep because although both species have ruminant digestive systems, bison are much larger (~625 vs. 75 kg body mass), have wider muzzles, and have larger home ranges. Thus, all else equal, bison should exhibit greater dietary richness because each individual will encounter and/or be able to consume a greater variety of available forage (Clauss et al., 2013). Yet despite firmly established allometric differences in digestive physiology that support our hypothesis, recent studies of African ungulates have reported a lack of correlation between dietary richness and body size (Kartzinel et al., 2015; Kartzinel & Pringle, 2020). The taxonomic precision of DNA metabarcoding could help reconcile such cases where established allometric and foraging ecology theories appear to diverge from data, and our results will show how crucial it is to consider the potential effects of abundance‐filtering on these types of downstream ecological interpretations.
Our illustrative analyses of bison and bighorn sheep diets are based on 35 samples from winter and summer (median = 10 per species per season) analyzed by sequencing the P6 loop of the chloroplast trnL(UAA) intron and using publicly available plant reference data (Appendix S2). We required a 100% match between a dietary sequence and a reference sequence that was present in the library to include a plant taxon in our analysis, thereby minimizing the risk of introducing sequencing errors and chimeras. This mapping strategy is reasonable when researchers have access to an extensive reference library of potential food sequences—enabling accurate sequence identification and efficient error elimination—although it may not be as appropriate for studies involving markers and/or taxa with poor reference coverage compared to what is available for trnL‐P6 (Pompanon et al., 2012). Overall, 91.1% of high‐quality sequence reads were mapped to the reference library across all 35 samples (1,071,130 of 1,175,453 sequences overall reads; median = 93.2% per sample). From these mapped reads, we initially characterized 357 plant sequences and retained a subset of 355 sequences after rarefying samples to equal sequencing depth (Dryad https://doi.org/10.5061/dryad.kwh70rz4s; Data S1), with 88% identified to family (312 of 355 sequences), 55% to genus (196 of 355), and 23% to species (82 of 355). Because the resulting diet profiles comprise only those taxa that match a reference plant sequence, they could represent an underestimate of the true plant richness in a sample given that: (i) a subset of available plant species may not yet be included in public data and/or (ii) the trnL‐P6 marker has a limited ability to differentiate among certain closely related species (Taberlet et al., 2007). Importantly, if these diet profiles underestimate the true dietary richness, it will be due to limitations inherent to the marker and available reference data rather than an artefact arising from data‐filtering decisions.
Following the same procedure that we applied to simulated diets, we filtered taxa from samples using 0%–5% RRA thresholds (Appendix S2). We compared differences in the inferred dietary richness within and between species based on individual samples, based on the average richness across samples, and based on the total richness of each population after accounting for differences in sample size. We observed that thresholds can: (i) alter the rank‐order of inferred richness values, (ii) obscure patterns of seasonal variation within and among species, and (iii) obscure differences in the overall dietary breadth of species.
3.1. Thresholds alter rank‐order of dietary richness
Detailed comparisons of four representative samples show both intra‐ and interspecific variation in the shape of diet profiles (Figure 2a–d). Differences in richness and evenness led to changes in the inferred rank‐order of samples as low‐abundance taxa were removed (Figure 2e), just as they did in the simulation study. The dietary richness of each sample decreased by ≥51% with a mild threshold of 0.2%. The sample with the lowest initial richness exhibited the largest apparent decline in richness (80% loss; Figure 2f). A larger threshold of 4% resulted in the conversion of both dietary richness and the percent loss of initial richness across all samples, regardless of the consumer species (grey dashed lines in Figure 2e–f). Thus, applying bioinformatic thresholds to real data reproduced the patterns observed in theoretical simulations and obscured ecologically meaningful differences in the shape of diet profiles: (i) a mild threshold led to a precipitous drop in richness for diet profiles with relatively narrow breadth, (ii) a more stringent threshold led to convergence in apparent richness between broad and narrow profiles, and (iii) different thresholds altered the rank‐order of dietary richness that was inferred for this set of consumers.
3.2. Thresholds obscure seasonal patterns dietary richness
Incrementally removing low‐abundance taxa eroded the apparent increase in mean dietary richness across samples from winter to summer. The mean richness of bighorn sheep and bison diets was greater based on the totality of the sequence data in summer versus winter, but this seasonal difference disappeared when low‐abundance taxa were removed and only the subset of dominant dietary taxa remained (Figure 3a). With each threshold, there was a “dropout” of low‐abundance grasses (family Poaceae), buckwheats (Polygonaceae), evening primroses (Onagraceae), and roses (Rosaceae) in the diets of both species, in both summer and winter (Data [Link], [Link]). In summer, however, there was a disproportionate loss of taxonomically diverse sequences: bighorn sheep samples lost sequences representing mustards (Brassicaceae) and legumes (Fabaceae; Data S2), while bison lost many sedges (Cyperaceae) and willows (Salicaceae; Data S3). Sequences remaining following the most stringent thresholds included a subset of common foods for both bighorn sheep (e.g., grasses, geraniums (Geraniaceae), and roses; Data S2) and bison (e.g., sedges and grasses; Data S3). Thus, although using thresholds in this way may help researchers identify the “core” resources present, it can also generate the false impression that consumers have narrow diets, disproportionately exclude taxa from species‐rich groups (e.g., grasses, forbs), and generate artificially simple trophic networks.
3.3. Thresholds obscure population‐level differences in overall dietary breadth
The removal of low‐abundance taxa differentially influenced each species’ total dietary richness. As above, incremental removal of taxa resulted in a large decrease in total richness inferred for both bighorn sheep and bison (Figure 3b, Data [Link], [Link]). Dietary richness apparently increased from winter to summer for bighorn sheep, regardless of threshold. In contrast, seasonal differences in dietary richness were apparently reversed for bison, depending on threshold: total richness appeared greater in summer when no threshold was applied, but it appeared greater in winter whenever low‐abundance taxa were removed (Figure 3b). This drop in summer dietary richness for bison was driven by the removal of a large number of taxa, each occurring at low relative abundance: asters, evening primroses, broomrapes (Orobanchaceae), and buttercups (Ranunculaceae; Data S5). A purported benefit of population‐level analyses is the ability to “average out” sampling stochasticity, since a resource that is erroneously excluded from one sample may still be registered in another for downstream analyses (Deagle et al., 2019; Kartzinel & Pringle, 2020). Our results illustrate the risk of presuming this outcome when consumers eat many food taxa that each represent a relatively small proportion of the diet.
3.4. Ecological interpretations
Apparent declines in richness with each threshold had the potential to alter support for our hypotheses. We predicted dietary richness would be maximized in summer for both species and found strong support for this prediction in the complete data set (Figure 3). Abundance‐filtering, however, eroded this seasonal pattern and, in the case of bison, reversed it (Figure 3). Similarly, we predicted bison would have greater dietary richness than bighorn sheep; we observed this difference with the complete data set, but abundance‐filtering eroded or eliminated it (Figure 3).
Evaluating bioinformatic strategies in light of natural history can guide interpretation. Ideally, information from comprehensive DNA barcode libraries would support taxonomic inference and facilitate error‐filtering (Pompanon et al., 2012). Although we do not yet have a comprehensive library for the flora of Yellowstone, every sequence in our complete data set was a 100% match to publicly available sequence data and the overall diet composition was realistic for the animals we studied. Tracking taxa that dropped out of the data set with each threshold enabled us to evaluate patterns (Data [Link], [Link], [Link], [Link]). For example, wind‐dispersed pollen deposition is thought to be a common source of low‐abundance sequence contamination (Ando et al., 2018). However, we observed dropout of some wind‐dispersed taxa that are palatable to these herbivores using a mild threshold (e.g., some grasses, sedges) while other wind‐dispersed taxa from less palatable groups exceeded high thresholds (e.g., pine trees, Pinaceae). We observed this in summer when plants reproduce, and pollen contamination is likely, as well as in winter when plants are reproductively dormant. Although we cannot say definitively for any given sample whether a sequence represents a food that was deliberately eaten by the animal, we can use the observed variation in sequence representation to guide ecological interpretations. Indeed, quantifying this variation is a fundamental goal of dietary DNA metabarcoding research.
4. STRATEGIES FOR IMPROVEMENT
There is nothing inherently wrong with excluding low‐abundance taxa under at least two conditions: (i) the objective is to identify a subset of “core” taxa that occur above a threshold or (ii) the rare sequences represent errors that persist in the data despite experimental controls and other bioinformatic strategies. Although DNA metabarcoding is useful for identifying core dietary taxa (scenario i), most studies aim to elucidate more complete diets and trophic networks (Ando et al., 2020). There is thus a need to ensure that any suite of rare sequences to be analyzed are as free of error as possible, although this can be difficult to verify (scenario ii). Because there have been many improvements to sampling strategies and laboratory protocols that reduce the incidence of contamination (Alberdi et al., 2018; Ando et al., 2020; Creer et al., 2016; Mata et al., 2019; McInnes et al., 2017), we will focus on promising and transparent analytical strategies to account for low‐abundance taxa.
4.1. Computational and mathematical alternatives to arbitrary thresholds
Promising strategies provide alternatives to relying on arbitrary abundance thresholds. For example, the flexible simulation strategy we demonstrated above may be generally useful for evaluating the effects of analytical options on downstream ecological interpretations. This could include power analyses to assess the probability of detecting an effect or sensitivity analyses that evaluate any qualitative changes to the conclusion of a study that would result from different filtering strategies (Kartzinel et al., 2015; Kartzinel & Pringle, 2020). Related algorithms developed specifically for dietary DNA metabarcoding have potential to address drawbacks of arbitrary thresholds directly. For example, Bayesian strategies can be used to assess underlying uncertainties in the identification of “true” sequences in a food web rather than assuming the suitability of a fixed cutoff (Cirtwill & Hambäck, 2021).
Even without using RRA filters to purge rare sequences from analysis, it is possible to manage their impact on interpretations. For example, Hill numbers are a mathematically unified family of diversity indices that quantify diversity in units of effective number of species (Hill, 1973), enabling researchers to upweight or downweight rare taxa using the scaling parameter q (Jost, 2006). The larger the q value, the greater the importance attributed to abundant taxa. The lowest q value is 0, which is equivalent to species richness (all taxa are counted equally). Increasing to q = 1 yields a value equivalent to the exponential of Shannon entropy (taxa are weighted in proportion to abundance without disproportionately favoring either rare or abundant ones). Finally, q = 2 equals the inverse of Simpson concentration, which upweights abundant taxa and discounts rare ones (Chao et al., 2014). Hill exponents can, therefore, be considered as the richness of all species (q 0), the diversity of "typical” species (q 1), or the diversity of dominant species (q 2). When rare taxa are considered to be of low importance, regardless of whether they are thought to be real or errors, researchers can use higher q values to downweight them (Alberdi & Gilbert, 2019). In contrast, if rare taxa are considered essential for proper understanding, researchers may need to rely on exact counts of taxa present (Alberdi & Gilbert, 2019).
To illustrate similarities and differences in using Hill numbers to downweight rare taxa compared to using thresholds to exclude them, we applied this method to our simulations and our Yellowstone data (Appendix S2). When downweighting emphasis on rare taxa from q 0 to q 2, the apparent diversity of food taxa converged asymptotically towards a low value (Figure 4). This familiar pattern is superficially similar to applying incremental thresholds to the data. In contrast to thresholds, however, Hill numbers had the desirable quality of retaining the rank‐order of samples. This contrast is especially evident in comparisons of simulated diet profiles with known dietary breadth, since the rank‐order of the theoretical generalists and specialists was retained across all values of q but was inverted by the application of RRA thresholds (Figures 1e and 4a).
4.2. Multiple datastreams can corroborate and contextualize
Comparing independent datastreams can support inferences in dietary studies (Nielsen et al., 2018). Ideally, researchers would qualitatively corroborate conclusions in light of strengths and weaknesses inherent to each method. Prior studies, for example, have combined DNA metabarcoding with direct feeding observations or experiments (Deagle et al., 2010; Thomas et al., 2014; Willerslev et al., 2014), stable isotope analysis (Craine et al., 2015; Kartzinel et al., 2015), and gut content analysis or microhistology (King & Schoenecker, 2019; Newmaster et al., 2013; Soininen et al., 2009).
To help contextualize our results for bison and bighorn sheep, we compared dietary diversity indices obtained from DNA metabarcoding with corresponding microhistology data. Microhistology is a method involving the visual examination of fecal material on slides (Appendix S2). An advantage of microhistology is the opportunity to identify taxa that may not be covered by the trnL‐P6 marker we used in this study, such as lichens, mosses, and fungi. However, microhistology is so labour intensive that it is common to: (i) only identify taxa to higher taxonomic classifications (e.g., our analysis aimed to identify plant fragments to genus); (ii) group species into broad taxonomic or functional groups (Garnick et al., 2018; Mayes & Dove, 2000); (iii) underestimate the proportion of digestible forbs (King & Schoenecker, 2019; Shrestha & Wegge, 2006); and (iv) lump fecal deposits into “representative” samples rather than analyzing each individually.
Dietary DNA metabarcoding was better able to differentiate closely related food taxa than microhistology and thus revealed a greater diversity of foods. Consider the 58 grass genera known to occur in Yellowstone (Whipple, 2001). Microhistology identified only nine grass taxa with genus‐level precision across all samples and seasons (Data S6, Figure S4), but DNA metabarcoding revealed between 22 and 40 grass DNA sequences (representing from 5 to 9 grass genera) across each species, depending on season (Data [Link], [Link]). In principle, this means that up to 49 grass genera could be lumped into the “unknown Poaceae” category of our microhistology data. While microhistology provides a reasonable estimate of overall “grass” contributions to diets, it masks the contribution of each constituent grass species to diversity. Microhistology suggested that bison had greater year‐round dietary richness (q0 ) compared to bighorn sheep, although there was no significant difference in the number of typical (q 1) or dominant (q 2) plant taxa (Figure 5). In contrast, DNA further illuminated a significant effect of species and season on the typical (q 1) and dominant (q 2) number of plant taxa consumed by bison and bighorn sheep (Figure 5). Importantly, results obtained from both methods aligned closely when we used Hill numbers to downweight rare taxa (Figure 5). Taken together, these results support the interpretation that many rare taxa were present in the dietary DNA and accounting for them provided a finer level of detail that would not otherwise be possible.
4.3. Transparency and reproducibility
Awareness of how data manipulations influence downstream analyses is vital to ensuring transparency and reproducibility. Given that different decisions about whether and how to use RRA‐based filters in bioinformatic pipelines has the potential to alter ecological patterns, transparency will be crucial for high‐resolution dietary data to accumulate in ways that make it a useful scientific resource. To ensure transparency, it is helpful to differentiate between the bioinformatic steps that are initially used to filter a data set of errors from any subsequent summaries of the data or statistical analyses. In the literature, RRA‐based filters have been used both for both purposes—to purportedly eliminate low‐abundance contaminants and to summarize the relatively abundant “core” resources—which complicates comparisons. Whereas many journal polices stipulate that DNA metabarcoding studies should publish tables of sequence read data, a review by Ando et al. (2020) revealed that 46% of studies omitted them. Providing a quality‐ and contaminant‐filtered count table of sequence data that has not otherwise been transformed, rarefied, or filtered to remove low‐abundance sequences can help reviewers and other researchers evaluate interpretations that may be sensitive to the presence of rare sequences. Such tables are needed to enable further analyses, including those based on statistical frameworks designed specifically for sequence count data (e.g., DESeq2; Love et al., 2014). However, the literature may often give the impression that such data sets are filtered incompletely and potentially wrong. Complicating matters, researchers are increasingly able to work with commercial laboratories to generate dietary DNA profiles and need to be aware that these services often apply low‐abundance filters by default. Good communication between purveyors and practitioners is critical, especially when detection of minor dietary components is needed (Scasta et al., 2019). Providing access to more complete data sets does not preclude researchers from generating, analyzing, and sharing filtered versions of the data—but it does offer the advantage of transparency.
5. DISCUSSION
The precautionary principle emphasizes caution and review before acting in the absence of conclusive evidence to support a decision (Van Der Sluijs et al., 2005). In the context of DNA metabarcoding, researchers face a dilemma: all else equal, will it tend to be better to include or exclude rare DNA sequences from analysis? Researchers initially suggested that prudence dictates a need to gather more evidence before including low‐abundance sequences in analysis—excluding such sequences has thus often been framed as conservative (Brown et al., 2015; Pompanon et al., 2012). We argue that it may often be prudent to retain rare sequences and weigh them appropriately based on relative abundance rather than risk obscuring real patterns in the data—that the rarity of a sequence alone is insufficient evidence of an error or otherwise unimportant trophic link to justify its removal. Advances in conceptual and methodological approaches for DNA metabarcoding now offer alternatives to simple abundance‐filtering that could help researchers better balance these risks and support more robust ecological interpretations.
Prior empirical evaluations of abundance‐filtering in dietary and environmental DNA studies have shown that many contaminants (but not all) tend to occur at low relative abundance (Alberdi et al., 2018; Ando et al., 2018). By quantifying plant DNA contamination in samples from herbivores fed controlled diets, for example, Ando et al. (2018) noted that removing sequences below 1% RRA reduced the probability of incorporating contaminants. There are at least two ways to reconcile experimental results like this with the issues we illustrated here: (i) researchers may benefit from obtaining empirical estimates of contamination rates and sources that they can use as evidence to support decisions about what sequences to exclude; (ii) since contaminants rarely rise to high relative abundance, they represent a source of error that can be managed in many research applications by using appropriate statistics and by weighing sequences based on RRA. Importantly, although a certain threshold may be shown to reduce the probability of including contaminants in one study, it does not necessarily follow that it is a procedure that eliminates all contaminants or retains all relevant sequences. Similar thresholds could be inappropriate in other studies since contamination rates vary among target taxa, environments, barcode regions, and methods (Ando et al., 2020). Hence, focus on how to appropriately account for rare sequences is warranted.
We began this paper by asking when and whether using thresholds to remove low‐abundance sequences makes sense, noting the importance of addressing this question with respect to a particular research objective. If the aim is to determine whether the DNA of a particular taxon is definitively present in a sample, then it might be appropriate to remove low‐abundance sequences. If, however, the goal is to compare dietary profiles or infer the topology of trophic networks, then it may be risky to abundance‐filter data in light of the evidence that removing low‐abundance taxa can distort comparisons of interest. In particular, removing rare taxa from the diet profiles of species with a relatively high degree of individual diet specialization—including predators and phytophagous insects (Bolnick et al., 2007; Codron et al., 2016)—is liable to artificially accentuate the inferred level of interindividual variation by omitting overlapping food sources from a subset of diet profiles. In contrast, the effect on diet profiles from populations with less inter‐individual variation—such as large mammalian herbivores that tend to have broader and more even diet profiles (Codron et al., 2016)—could be to eliminate sources of variation in ways that artificially homogenize the group.
Accounting for rare food taxa can be integral to understanding animal diets. Wild species are often observed consuming unexpected food items, which can inspire fascination and challenge long‐standing dogma (Burton, 2018). For example, many species are unable to obtain vital trace elements through “normal” diets (Pringle & Hutchinson, 2020). Crocodilians are generally assumed to be obligate carnivores incapable of digesting plant proteins and polysaccharides, but yet they regularly consume fruits to supplement an otherwise carnivorous diet (Platt et al., 2013). Many ungulates, in contrast, are thought to be obligate herbivores but yet surprising examples of protein‐rich food sources are regularly documented, including deer eating songbirds (Pietz & Granfors, 2000) and warthogs hunting antelope (Roberts, 2012). Large mammalian herbivores are able to feed on a broad array of food plants, but nevertheless feed preferentially on a subset of available plant species while avoiding others (Owen‐Smith & Novellie, 1982). Animals may rarely eat desirable foods if they benefit from diversifying their diets to dilute taxon‐specific defence compounds (Freeland & Janzen, 1974), if they feed in ways that suppress the local availability of preferred foods (Bryant et al., 1991; Kartzinel & Pringle, 2020; Spiller & Schoener, 1998), or if they experience competitive displacement (Pringle et al., 2019). For all of these reasons, important dietary taxa may only register in diets at low relative abundances and improving our ability to account for them is a major research priority.
Knowing that many important foods can occur at low relative abundances poses a challenge for dietary DNA metabarcoding research. These rare foods need to be documented in order to understand food webs and foraging behaviors, but common bioinformatic procedures can eliminate them from analysis. Continuing to exclude them may only serve to reinforce preconceived notions about what animals eat and inhibit new understanding of how ecology works. To fulfill the promise of dietary DNA metabarcoding by using cutting‐edge technology to characterize diets with precision requires us to overcome this challenge.
CONFLICT OF INTEREST
Authors declare that there are no conflicts of interest.
AUTHOR CONTRIBUTIONS
B.L.L.C. and T.R.K. conceived and designed the study; L.M.M. and C.G. provided fecal samples and microhistology data; P.T.F., V.I.S., C.V.T., and T.R.K. generated dietary DNA data; B.L.L.C. and T.R.K. led data analyses and wrote the manuscript with contributions from all authors.
Supporting information
ACKNOWLEDGEMENTS
We would like to thank the National Park Service, Ecology Project International, and Yellowstone Forever for their help collecting wildlife fecal samples; Brown University students in BIOL 1515/2015 in 2018 and 2020 for their course‐based participation in this research; The Howard Hughes Medical Institute and The Sheridan Center for Teaching and Learning for financially supporting student participation; the Brown Multidisciplinary Teaching Laboratory staff; Bruce Davitt for performing microhistological analyses; Douglas Frank for sparking this collaboration on Yellowstone food webs. Funding was provided by NSF DEB‐2046797, DEB‐2026294, DEB‐1930820, and OIA‐2033823.
Littleford‐Colquhoun, B. L. , Freeman, P. T. , Sackett, V. I. , Tulloss, C. V. , McGarvey, L. M. , Geremia, C. , & Kartzinel, T. R. (2022). The precautionary principle and dietary DNA metabarcoding: Commonly used abundance thresholds change ecological interpretation. Molecular Ecology, 31, 1615–1626. 10.1111/mec.16352
Contributor Information
Bethan L. Littleford‐Colquhoun, Email: bethan_littleford-colquhoun@brown.edu.
Tyler R. Kartzinel, Email: tyler_kartzinel@brown.edu.
DATA AVAILABILITY STATEMENT
Illumina sequence read data and sample metadata have been made available at NCBI (BioProject accession no.: PRJNA780500; Littleford‐Colquhoun et al., 2021). Unrarefied/rarefied sequence read tables and plant taxonomy information have been made available at Dryad (https://doi.org/10.5061/dryad.kwh70rz4s). All bioinformatic scripts are available at Zenodo (10.5281/zenodo.5703310).
REFERENCES
- Ait Baamrane, M. A. , Shehzad, W. , Ouhammou, A. , Abbad, A. , Naimi, M. , Coissac, E. , Taberlet, P. , & Znari, M. (2012). Assessment of the food habits of the Moroccan dorcas gazelle in M’Sabih Talaa, west central Morocco, using the trnL approach. PLoS One, 7(4), e35643. 10.1371/journal.pone.0035643 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alberdi, A. , Aizpurua, O. , Gilbert, M. T. P. , & Bohmann, K. (2018). Scrutinizing key steps for reliable metabarcoding of environmental samples. Methods in Ecology and Evolution, 9(1), 134–147. 10.1111/2041-210X.12849 [DOI] [Google Scholar]
- Alberdi, A. , & Gilbert, M. T. P. (2019). A guide to the application of Hill numbers to DNA‐based diversity analyses. Molecular Ecology Resources, 19(4), 804–817. 10.1111/1755-0998.13014 [DOI] [PubMed] [Google Scholar]
- Ando, H. , Fujii, C. , Kawanabe, M. , Ao, Y. , Inoue, T. , & Takenaka, A. (2018). Evaluation of plant contamination in metabarcoding diet analysis of a herbivore. Scientific Reports, 8(1), 1–10. 10.1038/s41598-018-32845-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ando, H. , Mukai, H. , Komura, T. , Dewi, T. , Ando, M. , & Isagi, Y. (2020). Methodological trends and perspectives of animal dietary studies by noninvasive fecal DNA metabarcoding. Environmental DNA, 2(4), 391–406. 10.1002/edn3.117 [DOI] [Google Scholar]
- Araújo, M. S. , Bolnick, D. I. , & Layman, C. A. (2011). The ecological causes of individual specialisation. Ecology Letters, 14(9), 948–958. 10.1111/j.1461-0248.2011.01662.x [DOI] [PubMed] [Google Scholar]
- Bergmann, G. T. , Craine, J. M. , Robeson, M. S. , & Fierer, N. (2015). Seasonal shifts in diet and gut microbiota of the American bison (Bison bison). PLoS One, 10(11), e0142409. 10.1371/journal.pone.0142409 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bohmann, K. , Gopalakrishnan, S. , Nielsen, M. , Nielsen, L. D. S. B. , Jones, G. , Streicker, D. G. , & Gilbert, M. T. P. (2018). Using DNA metabarcoding for simultaneous inference of common vampire bat diet and population structure. Molecular Ecology Resources, 18(5), 1050–1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolnick, D. I. , Svanbäck, R. , Araújo, M. S. , & Persson, L. (2007). Comparative support for the niche variation hypothesis that more generalized populations also are more heterogeneous. Proceedings of the National Academy of Sciences, 104(24), 10075–10079. 10.1073/pnas.0703743104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolnick, D. I. , Svanbäck, R. , Fordyce, J. A. , Yang, L. H. , Davis, J. M. , Hulsey, C. D. , & Forister, M. L. (2003). The ecology of individuals: incidence and implications of individual specialization. The American Naturalist, 161(1), 1–28. 10.1086/343878 [DOI] [PubMed] [Google Scholar]
- Brown, E. A. , Chain, F. J. , Crease, T. J. , MacIsaac, H. J. , & Cristescu, M. E. (2015). Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities? Ecology and Evolution, 5(11), 2234–2251. 10.1002/ece3.1485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bryant, J. P. , Provenza, F. D. , Pastor, J. , Reichardt, P. B. , Clausen, T. P. , & Du Toit, J. T. (1991). Interactions between woody plants and browsing mammals mediated by secondary metabolites. Annual Review of Ecology and Systematics, 22(1), 431–446. 10.1146/annurev.es.22.110191.002243 [DOI] [Google Scholar]
- Burton, A. (2018). Degenerate diners. Wiley Online Library. [Google Scholar]
- Chao, A. , Gotelli, N. J. , Hsieh, T. , Sander, E. L. , Ma, K. , Colwell, R. K. , & Ellison, A. M. (2014). Rarefaction and extrapolation with Hill numbers: a framework for sampling and estimation in species diversity studies. Ecological Monographs, 84(1), 45–67. 10.1890/13-0133.1 [DOI] [Google Scholar]
- Cirtwill, A. R. , & Hambäck, P. (2021). Building food networks from molecular data: Bayesian or fixed‐number thresholds for including links. Basic and Applied Ecology, 50, 67–76. 10.1016/j.baae.2020.11.007 [DOI] [Google Scholar]
- Clarke, L. J. , Soubrier, J. , Weyrich, L. S. , & Cooper, A. (2014). Environmental metabarcodes for insects: in silico PCR reveals potential for taxonomic bias. Molecular Ecology Resources, 14(6), 1160–1170. [DOI] [PubMed] [Google Scholar]
- Clauss, M. , Steuer, P. , Müller, D. W. , Codron, D. , & Hummel, J. (2013). Herbivory and body size: allometries of diet quality and gastrointestinal physiology, and implications for herbivore ecology and dinosaur gigantism. PLoS One, 8(10), e68714. 10.1371/journal.pone.0068714 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Codron, D. , Codron, J. , Sponheimer, M. , & Clauss, M. (2016). Within‐population isotopic niche variability in savanna mammals: disparity between carnivores and herbivores. Frontiers in Ecology and Evolution, 4, 15. 10.3389/fevo.2016.00015 [DOI] [Google Scholar]
- Craine, J. M. (2021). Seasonal patterns of bison diet across climate gradients in North America. Scientific Reports, 11(1), 1–11. 10.1038/s41598-021-86260-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Craine, J. M. , Towne, E. G. , Miller, M. , & Fierer, N. (2015). Climatic warming and the future of bison as grazers. Scientific Reports, 5(1), 1–8. 10.1038/srep16738 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Creer, S. , Deiner, K. , Frey, S. , Porazinska, D. , Taberlet, P. , Thomas, W. K. , Potter, C. , & Bik, H. M. (2016). The ecologist's field guide to sequence‐based identification of biodiversity. Methods in Ecology and Evolution, 7(9), 1008–1018. 10.1111/2041-210X.12574 [DOI] [Google Scholar]
- Deagle, B. E. , Chiaradia, A. , McInnes, J. , & Jarman, S. N. (2010). Pyrosequencing faecal DNA to determine diet of little penguins: is what goes in what comes out? Conservation Genetics, 11(5), 2039–2048. 10.1007/s10592-010-0096-6 [DOI] [Google Scholar]
- Deagle, B. E. , Kirkwood, R. , & Jarman, S. N. (2009). Analysis of Australian fur seal diet by pyrosequencing prey DNA in faeces. Molecular Ecology, 18(9), 2022–2038. 10.1111/j.1365-294X.2009.04158.x [DOI] [PubMed] [Google Scholar]
- Deagle, B. E. , Thomas, A. C. , McInnes, J. C. , Clarke, L. J. , Vesterinen, E. J. , Clare, E. L. , & Eveson, J. P. (2019). Counting with DNA in metabarcoding studies: How should we convert sequence reads to dietary data? Molecular Ecology, 28(2), 391–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forister, M. L. , Novotny, V. , Panorska, A. K. , Baje, L. , Basset, Y. , Butterill, P. T. , Cizek, L. , Coley, P. D. , Dem, F. , Diniz, I. R. , Drozd, P. , Fox, M. , Glassmire, A. E. , Hazen, R. , Hrcek, J. , Jahner, J. P. , Kaman, O. , Kozubowski, T. J. , Kursar, T. A. , … Dyer, L. A. (2015). The global distribution of diet breadth in insect herbivores. Proceedings of the National Academy of Sciences, 112(2), 442–447. 10.1073/pnas.1423042112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freeland, W. J. , & Janzen, D. H. (1974). Strategies in herbivory by mammals: the role of plant secondary compounds. The American Naturalist, 108(961), 269–289. 10.1086/282907 [DOI] [Google Scholar]
- Garnick, S. , Barboza, P. S. , & Walker, J. W. (2018). Assessment of animal‐based methods used for estimating and monitoring rangeland herbivore diet composition. Rangeland Ecology & Management, 71(4), 449–457. 10.1016/j.rama.2018.03.003 [DOI] [Google Scholar]
- Geist, V. (1971). Mountain sheep. A study in behavior and evolution. University of Chicago Press. [Google Scholar]
- Hill, M. O. (1973). Diversity and evenness: a unifying notation and its consequences. Ecology, 54(2), 427–432. 10.2307/1934352 [DOI] [Google Scholar]
- Jost, L. (2006). Entropy and diversity. Oikos, 113(2), 363–375. [Google Scholar]
- Kartzinel, T. R. , Chen, P. A. , Coverdale, T. C. , Erickson, D. L. , Kress, W. J. , Kuzmina, M. L. , Rubenstein, D. I. , Wang, W. , & Pringle, R. M. (2015). DNA metabarcoding illuminates dietary niche partitioning by African large herbivores. Proceedings of the National Academy of Sciences, 112(26), 8019–8024. 10.1073/pnas.1503283112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kartzinel, T. R. , Hsing, J. C. , Musili, P. M. , Brown, B. R. , & Pringle, R. M. (2019). Covariation of diet and gut microbiome in African megafauna. Proceedings of the National Academy of Sciences, 116(47), 23588–23593. 10.1073/pnas.1905666116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kartzinel, T. R. , & Pringle, R. M. (2020). Multiple dimensions of dietary diversity in large mammalian herbivores. Journal of Animal Ecology, 89(6), 1482–1496. 10.1111/1365-2656.13206 [DOI] [PubMed] [Google Scholar]
- Kelly, R. P. , Shelton, A. O. , & Gallego, R. (2019). Understanding PCR processes to draw meaningful conclusions from environmental DNA studies. Scientific Reports, 9(1), 1–14. 10.1038/s41598-019-48546-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- King, S. R. , & Schoenecker, K. A. (2019). Comparison of methods to examine diet of feral horses from noninvasively collected fecal samples. Rangeland Ecology & Management, 72(4), 661–666. 10.1016/j.rama.2019.02.005 [DOI] [Google Scholar]
- Komura, T. , Ando, H. , Horikoshi, K. , Suzuki, H. , & Isagi, Y. (2018). DNA barcoding reveals seasonal shifts in diet and consumption of deep‐sea fishes in wedge‐tailed shearwaters. PLoS One, 13(4), e0195385. 10.1371/journal.pone.0195385 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Littleford‐Colquhoun, B. L. , Freeman, P. T. , Sackett, V. I. , Tulloss, C. V. , McGarvey, L. M. , Geremia, C. , & Kartzinel, T. R. (2021). The precautionary principle and dietary DNA metabarcoding: commonly used abundance thresholds change ecological interpretation. NCBI. [BioProject accession number: PRJNA780500] [DOI] [PMC free article] [PubMed]
- Love, M. I. , Huber, W. , & Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA‐seq data with DESeq2. Genome Biology, 15(12), 1–21. 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mata, V. A. , Rebelo, H. , Amorim, F. , McCracken, G. F. , Jarman, S. , & Beja, P. (2019). How much is enough? Effects of technical and biological replication on metabarcoding dietary analysis. Molecular Ecology, 28(2), 165–175. 10.1111/mec.14779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mayes, R. W. , & Dove, H. (2000). Measurement of dietary nutrient intake in free‐ranging mammalian herbivores. Nutrition Research Reviews, 13(1), 107–138. 10.1079/095442200108729025 [DOI] [PubMed] [Google Scholar]
- McInnes, J. C. , Alderman, R. , Deagle, B. E. , Lea, M. A. , Raymond, B. , & Jarman, S. N. (2017). Optimised scat collection protocols for dietary DNA metabarcoding in vertebrates. Methods in Ecology and Evolution, 8(2), 192–202. [Google Scholar]
- Newmaster, S. G. , Thompson, I. D. , Steeves, R. A. D. , Rodgers, A. R. , Fazekas, A. J. , Maloles, J. R. , McMullin, R. T. , & Fryxell, J. M. (2013). Examination of two new technologies to assess the diet of woodland caribou: video recorders attached to collars and DNA barcoding. Canadian Journal of Forest Research, 43(10), 897–900. 10.1139/cjfr-2013-0108 [DOI] [Google Scholar]
- Nielsen, J. M. , Clare, E. L. , Hayden, B. , Brett, M. T. , & Kratina, P. (2018). Diet tracing in ecology: Method comparison and selection. Methods in Ecology and Evolution, 9(2), 278–291. 10.1111/2041-210X.12869 [DOI] [Google Scholar]
- Owen‐Smith, N. , & Novellie, P. (1982). What should a clever ungulate eat? The American Naturalist, 119(2), 151–178. 10.1086/283902 [DOI] [Google Scholar]
- Peden, D. G. , Van Dyne, G. M. , Rice, R. W. , & Hansen, R. M. (1974). The trophic ecology of Bison bison L. on shortgrass plains. Journal of Applied Ecology, 11(2), 489–497. 10.2307/2402203 [DOI] [Google Scholar]
- Pegard, A. , Miquel, C. , Valentini, A. , Coissac, E. , Bouvier, F. , François, D. , Taberlet, P. , Engel, E. , & Pompanon, F. (2009). Universal DNA‐based methods for assessing the diet of grazing livestock and wildlife from feces. Journal of Agricultural and Food Chemistry, 57(13), 5700–5706. 10.1021/jf803680c [DOI] [PubMed] [Google Scholar]
- Pietz, P. J. , & Granfors, D. A. (2000). White‐tailed deer (Odocoileus virginianus) predation on grassland songbird nestlings. The American Midland Naturalist, 144(2), 419–422. [Google Scholar]
- Platt, S. G. , Thorbjarnarson, J. B. , Rainwater, T. R. , & Martin, D. R. (2013). Diet of the American crocodile (Crocodylus acutus) in marine environments of coastal Belize. Journal of Herpetology, 47(1), 1–10. [Google Scholar]
- Pompanon, F. , Deagle, B. E. , Symondson, W. O. , Brown, D. S. , Jarman, S. N. , & Taberlet, P. (2012). Who is eating what: diet assessment using next generation sequencing. Molecular Ecology, 21(8), 1931–1950. 10.1111/j.1365-294X.2011.05403.x [DOI] [PubMed] [Google Scholar]
- Pringle, R. M. , & Hutchinson, M. C. (2020). Resolving food‐web structure. Annual Review of Ecology, Evolution, and Systematics, 51, 55–80. 10.1146/annurev-ecolsys-110218-024908 [DOI] [Google Scholar]
- Pringle, R. M. , Kartzinel, T. R. , Palmer, T. M. , Thurman, T. J. , Fox‐Dobbs, K. , Xu, C. C. , & Evangelista, D. A. (2019). Predator‐induced collapse of niche structure and species coexistence. Nature, 570(7759), 58–64. [DOI] [PubMed] [Google Scholar]
- Reese, A. T. , Kartzinel, T. R. , Petrone, B. L. , Turnbaugh, P. J. , Pringle, R. M. , & David, L. A. (2019). Using DNA metabarcoding to evaluate the plant component of human diets: a proof of concept. Msystems, 4(5), e00458–e1419. 10.1128/mSystems.00458-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riegel, U. (2018). Matching tower information with piecewise Pareto. European Actuarial Journal, 8(2), 437–460. 10.1007/s13385-018-0177-3 [DOI] [Google Scholar]
- Roberts, B. A. (2012). An attack by a warthog Phacochoerus africanus on a newborn Thomson's gazelle Gazella thomsonii . African Journal of Ecology, 50(4), 507–508. 10.1111/j.1365-2028.2012.01344.x [DOI] [Google Scholar]
- Scasta, J. , Jorns, T. , Derner, J. , Lake, S. , Augustine, D. , Windh, J. , & Smith, T. (2019). Validation of DNA metabarcoding of fecal samples using cattle fed known rations. Animal Feed Science and Technology, 255, 114219. 10.1016/j.anifeedsci.2019.114219 [DOI] [Google Scholar]
- Schure, A. T. M. , Pillai, A. A. S. , Thorbek, L. , Bhavani Shankar, M. , Puri, R. , Ravikanth, G. , Boer, H. J. , & Boessenkool, S. (2021). eDNA metabarcoding reveals dietary niche overlap among herbivores in an Indian wildlife sanctuary. Environmental DNA, 3(3), 681–696. 10.1002/edn3.168 [DOI] [Google Scholar]
- Shehzad, W. , Riaz, T. , Nawaz, M. A. , Miquel, C. , Poillot, C. , Shah, S. A. , & Taberlet, P. (2012). Carnivore diet analysis based on next‐generation sequencing: Application to the leopard cat (Prionailurus bengalensis) in Pakistan. Molecular Ecology, 21(8), 1951–1965. [DOI] [PubMed] [Google Scholar]
- Shrestha, R. , & Wegge, P. (2006). Determining the composition of herbivore diets in the Trans‐Himalayan rangelands: a comparison of field methods. Rangeland Ecology & Management, 59(5), 512–518. 10.2111/06-022R2.1 [DOI] [Google Scholar]
- Soininen, E. M. , Valentini, A. , Coissac, E. , Miquel, C. , Gielly, L. , Brochmann, C. , Brysting, A. K. , Sønstebø, J. H. , Ims, R. A. , Yoccoz, N. G. , & Taberlet, P. (2009). Analysing diet of small herbivores: the efficiency of DNA barcoding coupled with high‐throughput pyrosequencing for deciphering the composition of complex plant mixtures. Frontiers in Zoology, 6(1), 1–9. 10.1186/1742-9994-6-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spiller, D. A. , & Schoener, T. W. (1998). Lizards reduce spider species richness by excluding rare species. Ecology, 79(2), 503–516. [Google Scholar]
- Taberlet, P. , Coissac, E. , Pompanon, F. , Gielly, L. , Miquel, C. , Valentini, A. , Vermat, T. , Corthier, G. , Brochmann, C. , & Willerslev, E. (2007). Power and limitations of the chloroplast trn L (UAA) intron for plant DNA barcoding. Nucleic Acids Research, 35(3), e14. 10.1093/nar/gkl938 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas, A. C. , Deagle, B. E. , Eveson, J. P. , Harsch, C. H. , & Trites, A. W. (2016). Quantitative DNA metabarcoding: improved estimates of species proportional biomass using correction factors derived from control material. Molecular Ecology Resources, 16(3), 714–726. 10.1111/1755-0998.12490 [DOI] [PubMed] [Google Scholar]
- Thomas, A. C. , Jarman, S. N. , Haman, K. H. , Trites, A. W. , & Deagle, B. E. (2014). Improving accuracy of DNA diet estimates using food tissue control materials and an evaluation of proxies for digestion bias. Molecular Ecology, 23(15), 3706–3718. 10.1111/mec.12523 [DOI] [PubMed] [Google Scholar]
- Valentini, A. , Miquel, C. , Nawaz, M. A. , Bellemain, E. , Coissac, E. , Pompanon, F. , & Wincker, P. (2009). New perspectives in diet analysis based on DNA barcoding and parallel pyrosequencing: the trnL approach. Molecular Ecology Resources, 9(1), 51–60. [DOI] [PubMed] [Google Scholar]
- Van Der Sluijs, J. , Kaiser, M. , Beder, S. , Hosle, V. , Kemelmajer de Carlucci, A. , & Kinzig, A. (2005). The precautionary principle. Unesco. [Google Scholar]
- Wagner, G. D. , & Peek, J. M. (2006). Bighorn sheep diet selection and forage quality in central Idaho. Northwest Science, 80(4), 246. [Google Scholar]
- Whipple, J. J. (2001). Annotated checklist of exotic vascular plants in Yellowstone National Park. Western North American Naturalist, 61(3), 336–346. [Google Scholar]
- Willerslev, E. , Davison, J. , Moora, M. , Zobel, M. , Coissac, E. , Edwards, M. E. , & Haile, J. (2014). Fifty thousand years of Arctic vegetation and megafaunal diet. Nature, 506(7486), 47–51. [DOI] [PubMed] [Google Scholar]
- Zeale, M. R. , Butlin, R. K. , Barker, G. L. , Lees, D. C. , & Jones, G. (2011). Taxon‐specific PCR for DNA barcoding arthropod prey in bat faeces. Molecular Ecology Resources, 11(2), 236–244. 10.1111/j.1755-0998.2010.02920.x [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Illumina sequence read data and sample metadata have been made available at NCBI (BioProject accession no.: PRJNA780500; Littleford‐Colquhoun et al., 2021). Unrarefied/rarefied sequence read tables and plant taxonomy information have been made available at Dryad (https://doi.org/10.5061/dryad.kwh70rz4s). All bioinformatic scripts are available at Zenodo (10.5281/zenodo.5703310).