Skip to main content
PLOS One logoLink to PLOS One
. 2024 Apr 24;19(4):e0300583. doi: 10.1371/journal.pone.0300583

Fecal DNA metabarcoding helps characterize the Canada jay’s diet and confirms its reliance on stored food for winter survival and breeding

Alex O Sutton 1,2,*, Dan Strickland 3, Jacob Lachapelle 1, Robert G Young 1, Robert Hanner 1, Daniel F Brunton 4, Jeffrey H Skevington 5, Nikole E Freeman 1,2,6, D Ryan Norris 1
Editor: Petr Heneberg7
PMCID: PMC11042713  PMID: 38656932

Abstract

Accurately determining the diet of wild animals can be challenging if food items are small, visible only briefly, or rendered visually unidentifiable in the digestive system. In some food caching species, an additional challenge is determining whether consumed diet items have been previously stored or are fresh. The Canada jay (Perisoreus canadensis) is a generalist resident of North American boreal and subalpine forests with anatomical and behavioural adaptations allowing it to make thousands of arboreal food caches in summer and fall that are presumably responsible for its high winter survival and late winter/early spring breeding. We used DNA fecal metabarcoding to obtain novel information on nestling diets and compiled a dataset of 662 published and unpublished direct observations or stomach contents identifications of natural foods consumed by Canada jays throughout the year. We then used detailed natural history information to make informed decisions on whether each item identified to species in the diets of winter adults and nestlings was best characterized as ‘likely cached’, ‘likely fresh’ (i.e., was available as a non-cached item when it appeared in a jay’s feces or stomach), or ‘either possible’. Of the 87 food items consumed by adults in the winter, 39% were classified as ‘likely cached’ and 6% were deemed to be ‘likely fresh’. For nestlings, 29% of 125 food items identified to species were ‘likely cached’ and 38% were ‘likely fresh’. Our results support both the indispensability of cached food for Canada jay winter survival and previous suggestions that cached food is important for late winter/early spring breeding. Our work highlights the value of combining metabarcoding, stomach contents analysis, and direct observations to determine the cached vs. non-cached origins of consumed food items and the identity of food caches, some of which could be especially vulnerable to degradation through climate change.

Introduction

While diet is closely linked to growth and individual performance [13], several factors frequently limit our ability to identify what free-living animals are consuming [4]. It is often difficult to observe what is eaten or fed to young either because individual food items are too small to identify, have been manipulated in ways that limit the ability of an observer to identify a food item (e.g., multiple food items combined into a single food bolus), or are hidden by the cryptic behaviour of adults (e.g., nesting high in a tree, building a well-concealed nest). Some of these difficulties can be overcome through the identification and quantification of stomach contents but this approach suffers from limitations of its own, including the need to collect samples before they are digested [57], the requirement that animals must either be found dead or sacrificed to obtain samples, and the difficulty of identifying or quantifying soft-tissue foods such as fungi, vertebrate flesh, or even some plant material [8].

Technological advances have overcome at least a few problems associated with estimating diet. In some circumstances, high-speed photography and/or the use of drones have enhanced our ability to identify food items being consumed by animals [9] and stable isotope analysis can be used to quantify diet composition during the period of tissue growth [6, 10]. Significant limitations nevertheless persist. Videography, for example, will likely never provide the ability to identify arthropods being delivered in a compact bolus directly into the mouth of a nestling passerine and, unless the diet of an animal is simple (few potential sources), stable isotope analysis is typically restricted to identifying only major food groups (e.g., invertebrates vs. vertebrates vs. plants). The problems associated with food item identification can be further complicated in species that cache food [11]. In these cases, it may not always be clear whether individuals are consuming food that is currently available (i.e., fresh) or food items that were stored weeks or months earlier, raising the possibility that a completely different suite of food resources had been available compared to when the food was consumed.

All these challenges have a bearing on attempts to elucidate the diet of the Canada jay (Perisoreus canadensis), an iconic, sedentary, food-caching species of North American boreal and subalpine forests. Canada jays regularly store food in trees, well above eventual snow levels, using sticky saliva from unusually large salivary glands [12, 13] to fasten individual food items under bark scales or tufts of lichens. Food storage is conspicuous late-summer and fall behaviour in Canada jays and, from the successful retrieval of stored food, presumably contributes to high (> 90%) adult winter survival, at least at the southern edge of their range [14, 15]. However, despite the likelihood that the recovery of cached food explains high winter survival and territorial fidelity of Canada jays, we have very little data on what sustains them throughout the winter and what proportion of this food originates from cached stores. A further reason why such information is of particular interest is that the main foods Canada jays are known to consume (arthropods; berries; fleshy fungi; and vertebrate flesh; [22]) are all highly perishable. It seems questionable that such foods could reliably retain sufficient nutritional value from summer storage to the onset of sub-freezing (degradation-arresting) temperatures in the fall to account for the high winter survival of Canada jays [14]. Strickland et al. [16] provided some experimental evidence that volatile anti-microbial resins of conifers, particularly spruce (Picea spp.), may account for reduced degradation of perishable Canada jay caches but, ideally, one would like some assurance that high winter jay survival is not attributable instead to some unsuspected non-perishable food, as in nutcrackers (Nucifraga spp.; [17]).

Unequivocally characterizing Canada jay diet and determining the importance of stored food is even more difficult in the breeding season. With no obvious sources of fresh food, Canada jay females form and lay clutches as early as mid-February in Algonquin Park, Ontario [18] and raise offspring to fledging by late April-early May when, at least historically, the ground may still be snow covered, lakes are frozen, green-up has barely begun, and fewer than 10% of migratory passerines have returned let alone begun nesting themselves [19]. Experimental evidence supports the inference that stored food facilitates early breeding in Canada jays [20], but concrete evidence in the form of positively identified natural food items that could only have been stored at the time they were fed to nestlings has been lacking.

We had three primary aims in this study. First, we used dietary DNA (dDNA) metabarcoding [21] to expand our knowledge of the diet of nestling Canada jays. Second, using all published and non-published sources, we characterized, as completely as possible, nestling and adult diets of the Canada jay, identifying any differences between them and, for adults, any seasonal differences (i.e., winter versus “non-winter”). Third, we assessed the extent to which items identified in diets of nestlings and winter adults had likely been cached before being recovered by the adults and then consumed or fed to nestlings. In doing so, we also discuss the practicality and limitations of using fecal dDNA metabarcoding for determining previously difficult-to-obtain information on animal diets.

Methods

Study species and field site

Canada jays are year-round residents of North American boreal and subalpine forests [22], ranging from the tree line in the north to as far as Arizona in the south. We collected diet data as part of an ongoing long-term study begun in 1964 of Canada jay demography and behavior in Algonquin Provincial Park (APP) near the southern edge of its range in Ontario, Canada in the transition zone between the Great Lakes-St. Lawrence deciduous hardwood forest and the boreal forest [16]. Nest building begins in mid- to late-February with clutches initiated as early as February 22. Nestlings typically hatch in mid- to late-March under still wintry conditions and remain in the nest for approximately 23 days before fledging [22].

Collection of nestling fecal samples for metabarcoding

Since the only information on the Canada Jay nestling diet was from the contents of ten nestling stomachs or castings summarized by Strickland and Ouellet [22], our first aim was to improve our understanding of the diet through fecal dDNA metabarcoding. From April and May 2015–2017, we collected 20 fecal sacs from Canada jay nestlings in APP. Fourteen fecal sacs were collected opportunistically while nestlings were removed from the nest to be banded when they were approximately 11 days old [1, 15]. Each fecal sac was collected from a unique individual, although six fecal sacs came from the same nest on the day that young fledged. In the latter case, because the individual that each fecal sac came from could not be determined with certainty, at least one nestling, likely more, was represented more than once. Cumulatively, fecal sacs came from 8 separate nests and at least 11 nestlings. Upon collection, fecal sacs were immediately stored in a 50mL falcon tube filled with 100% ethanol and stored in a -20°C freezer within 12 hrs of being collected and until processing.

DNA extraction, amplification, and sequencing

Three gene regions were targeted in the PCR amplifications. These regions included a 157 bp region of the animal barcode, a fragment of the Cytochrome c Oxidase I five prime region (COI-5P; primer set ZBJ-ArtF1c-ion and ZBJ-Art2-ion; [23]), a 350 bp region of the Internal Transcribed Spacer (ITS2) region for fungi (primer set ITS3-M13ion and ITS4-M13ion; [24]), and a 163 bp region of the Ribulose 1,5-Biphosphate Carboxylase (rbcLa) gene for plants (primer set rbcLaF-M13ion and MrbcL 163-R1-M13ion; [24, 25]).

DNA was extracted from fecal sacs for amplification following methods outlined by Prosser and Hebert [24]. This extraction included 400 μL lysis buffer 700 mM guanidine thiocyanate (Sigma), 30 mM EDTA pH 8.0 (Fisher Scientific), 30 mM Tris-HCl pH 8.0 (Sigma), 0.5% Triton X-100 (Sigma), 5% Tween-20 (Fluka Analytical) mixed with 2 mg/mL of Proteinase K (Promega). This lysis buffer was added to the fecal samples and incubated overnight at 56°C with gentle shaking.

Purification of DNA followed [26], whereby lysate was mixed with two volumes of binding mix 3 M guanidine thiocyanate, 10 mM EDTA pH 8.0, 5 mM Tris-HCl pH 6.4, 2% Triton X-100, 50% ethanol and applied to a silica membrane spin column (Epoch Biolabs), 700 μL at a time, centrifuged at 6000g for 2 min, and repeated until no solution remained. After this step, the column was washed with 750 μL wash buffer (1.56 M guanidine thiocyanate, 5.2 mM EDTA pH 8.0, 2.6 mM Tris-HCl pH 6.4, 1.04% Triton X-100), 70% ethanol and centrifuged at 6000g for 2 min. The column was washed a second time with 750 μL of wash buffer (50 mM NaCl (Fisher Scientific), 0.5 mM EDTA pH 8.0, 10 mM Tris-HCl pH 7.4), 60% ethanol and centrifuged at 6000g for 4 min. The flow-through was discarded and the column was centrifuged at 10,000g for 4 min. The silica membrane was transferred to a clean 1.5 mL microfuge tube and dried at 56°C for 30 min. To release DNA from the silica membrane, 50 μL elution buffer (10 mM Tris-HCl, pH 8.0, prewarmed to 56°C) was added to the membrane and was left to incubate at room temperature for 1 min. The column was centrifuged at 10,000g for 5 min. to elute the DNA. The DNA was quantified using a Qubit 2.0 fluorometer (Life Technologies) and adjusted to approximately 0.5 ng/lL with elution buffer.

Next, a two-step polymerase chain reaction (PCR) amplification was conducted. The initial PCR amplified the DNA without the presence of Ion Torrent PGM (Life Technologies) multiplex identifier (MID) tags which provide identifiers for sequence reads to specific samples. This reaction had a total volume of 12.5 μL and was made up of 6.25 μL of 10% D-(+)-trehalose dihydrate (Fluka Analytical), 2.0 μL of Hyclone ultra-pure water (Thermo Scientific), 1.25 μL of 10X PlatinumTaq buffer (Invitrogen), 0.625 μL of 50 mM MgCl2 (Invitrogen), 0.125 μL of each 10 μM primer, 0.0625 μL of 10 mM dNTP (KAPA Biosystems), 0.060 μL of 5U/μL PlatinumTaq DNA Polymerase (Invitrogen), and 2 μL of template DNA. PCR thermocycler conditions for this reaction consisted of 94°C for 5 min., 40–60 cycles (40 for ITS2, 60 for rbcLA and COI-5P) of 94°C for 30 s, 48–53°C for 30 s (53°C for ITS2, 55°C for rbcLa, 48°C for COI-5P), 72°C for 30–45 s (45 s for ITS2, 30 s for rbcLa and COI-5P), and a final extension of 72°C for 10 min. The second PCR used the products from the first PCR and the same chemistry but included primers fusion primers (see Prosser and Hebert; [24]). The thermocycling conditions for this reaction were 94°C for 4 min., 20–25 cycles (20 cycles for ITS2, 25 cycles for rbcLa and COI-5P) of 94°C for 40 sec., 51–56°C for 40 sec. (56°C for ITS2, 55°C for rbcLa, 51°C for COI-5P), 72°C for 30–45 sec. (45 sec. for ITS2, 30 sec. for rbcLa and COI-5P), with a final extension of 72°C for 5 min. After amplification the products were cleaned using the magnetic bead protocol described in Prosser and Hebert [24] and then quantified using a Qubit 2.0 fluorometer and adjusted to 1 ng/μL. Cleaned and normalized products used for library construction following Prosser and Hebert [24] and were then sequenced unidirectionally on an Ion Torrent PGM using a 318 v.2 chip at the University of Guelph Centre for Biodiversity Genomics, following the manufacturer’s instructions.

The choices of molecular markers for DNA metabarcoding followed Prosser and Hebert [24]. Three gene regions were chosen to provide information on Canada jay fungal (ITS2), botanical (rbcLa), and animal (COI-5P) diet components. The rbcLa region was selected as it is readily amplifiable across a large diversity of plant life and for its utility in placing specimens to family and/or genus taxonomic levels [27]. The ITS2 region was selected as it has been shown to provide specimen identifications to species for a diverse number of fungi [28]. The COI-5P region has been selected as a “DNA barcode” standard region by the Consortium for the Barcode of Life and is well established as an effective barcode across all animal taxa of interest in this study [29, 30]. Selection of these three gene regions is further supported through the large number of publicly available sequence records against which we can compare our results (as opposed to other marker options).

Bioinformatics

Sequencing data was demultiplexed using the gene region and MID tags. Each resulting data set (three gene regions for each sample) were analysed informatically by first removing primer and adapter sequences [31]; see Prosser and Hebert [24] for sequences used), removing reads shorter than 100 bp, removing reads with low quality scores (QV < 20; github.com/ucdavis-bioinformatics/sickle), and then by de-replicated reads with 100% identity (http://hannonlab.cshl.edu/fastx_toolkit/index.html). After these steps, each read was taxonomically assigned using BLAST and three databases: all flowering plant rbcLa sequences from BOLD, all insect COI-5P sequences from GenBank, and all fungi ITS2 sequences from BOLD (Oct 2017). No clustering of the data occurred, and all trimmed and quality filtered reads were used in the taxonomic identification step. Due to the degraded nature of the DNA from the fecal sacs we filtered the BLAST output to exclude hits with less than 95% identity across at least 100 bp of the query sequence. Data were further filtered to exclude sequences with fewer than 100 reads to eliminate degraded sequences.

To support taxonomic identifications, the discriminatory ability of the amplified gene regions was assessed. A barcode gap analysis was used to ascertain if the gene region, using publicly available data, was able to reliably place a sequence to a taxonomic group [32]. To have a barcode gap there must be less variation within a group then there is between the members of that group and all other members outside the group [33]. To provide the most robust analysis of this gap, distances between all elements were calculated and the highest difference between members within the target group was compared to the smallest distance between members of the target group and all other members outside the target group. If no gap was obtained from gap analyses, COI-5P gene region BLAST identifications were pulled back to genus. With the rbcLa gene region, taxa with no barcode gap using species level taxonomic identifications were tested between genera. If there was no apparent gap at the level of genus taxonomic identifications were pulled back to Family. Placing a higher-level taxonomic identification at genus for animals using a segment of the COI-5P and family for plants using a segment of the rbcLa follows established methods [25].

To assess if a barcode gap exists for BLAST identifications, higher taxonomic levels (family for plants and either family or genus for animals depending on the manageable size of the data set and the number of BLAST identified species from the same genus or family) were used to obtain sequence data sets from the BOLD system (manually downloaded July and August 2018 and unique identifiers are included in the S1 File). Sequences were aligned (MAFFT: 31) and trimmed to target region using MEGA (primers included in S2 File: [34]). Sequences were removed from further analysis if they had greater than 2% unknown nucleotides or if they had greater than 12 gap characters (‘-‘) at either the 3’ or 5’ ends of the sequences. A distance matrix (the proportion or the number of sites that differ between each pair of aligned sequences) was constructed with the R package Ape (Ver. 4.1) dist.dna() matrix function [35]. These matrices were then used to obtain the maximum within species genetic variation and the minimum genetic distance between species of the same genus and this was completed through a custom R script (see S3 File). All within and between taxa values used to assess the specificity of the taxonomic assignments are reported (S4 File).

Compilation of a comprehensive adult and nestling diet dataset

To facilitate a more thorough analysis and comparison of adult and nestling Canada jay diets, we combined the nestling food items identified by metabarcoding outlined above, our own and published direct observations of items taken by foraging jays across their continental range, and all items visually identified by us or others in stomach contents of both nestlings and adults, (including those identified in the 1980s by experts of the then Biosystematics Research Institute [BRI] of Canada’s federal Department of Agriculture in Ottawa) into a single dataset (S1 Table). The BRI identifications (summarized in [22] but otherwise never published) included items from stomachs and regurgitated pellets of 10 APP nestlings and from 18 stomachs of adults inadvertently killed in traps set for furbearers in northern Ontario. The entire dataset included 662 items and was partitioned into three “food groups”: (1) arthropods, (2) “plants”—including vascular plants, fungi, and slime moulds, and (3) vertebrate flesh—including from carcasses or small mammals and nestling birds killed by jays themselves. These items were further partitioned according to life stage (nestling vs “adult”), observation method (direct observation vs stomach contents vs barcoding/metabarcoding), and time of year (“winter”–Nov. 1 to Mar. 31 vs “non-winter”–May 1 to Oct. 31). We supplemented these data with findings from two previously published papers: a stable-isotope analysis of adult and nestling diet [10] and an observational study of adults in Alaska [36] before numerically and graphically summarizing the results (Fig 1) and using them to draw inferences about the similarities and differences between adult and nestling diets.

Fig 1. Proportion of diets composed of arthropods and other invertebrates (grey), ‘plants’ (orange, includes plants, fungi, and molds), and vertebrate tissue (blue).

Fig 1

The table is divided into two sections; free flying individuals (adults and juveniles; A) and nestlings (B). We further divide our observations according to the method used to characterize diet composition (direct observation, stable isotope, and stomach contents) and present the estimated proportion of each of the three food groups in the overall diet. Additional information on identified diet items from each food type are available in S1 Table.

Determination of cached vs “fresh” origins of food items from nestling and adult winter diets

Based on natural history considerations, we sought to determine the extent to which food items identified in nestling fecal sacs through dDNA metabarcoding or stomach contents or in the stomach contents of adults sampled in winter (Nov. 1 –Mar. 31) were likely cached or likely “fresh” (i.e., non-cached). To do this, we assigned all food items to one of four categories: “unknown” (when nothing was known about the source organism’s natural history, when we were confident the item had been misidentified, or when it had not been identified to species); “either possible” (when the food items were present and accessible year round); “likely fresh” (if the food items were widely available as non-cached items at the time when they were detected in fecal sacs or stomach contents and especially if they were unavailable in the preceding summer/fall food storage season); or “likely cached”. We deemed a priori that the “likely cached” designation would be justified if the food item satisfied one or more of the following four criteria: (1) The food item was from a migratory species known to be entirely absent from the location where, and on the date when, the food observation was made (e.g., finding Monarch butterfly (Danaus plexippus) DNA in a nestling fecal sac in April), (2) The identified taxon was in a life stage known not to be present on the date of the food observation (e.g., the adult form of an insect that overwinters as an egg or larva), (3) All individuals of the source taxon are known to be underground when and where the food observation was made (e.g., a hibernating jumping mouse, Zapodidae), and (4) Snow depth on the date of the food observation would have precluded access to the item if it were still in its source location (e.g., a cranberry, Vaccinium sp., still on the parent plant, a few cm from the substrate). To inform our application of criterion 4, we obtained historical snow depth records from Environment and Climate Change Canada weather stations located close to the sampling location (https://climate.weather.gc.ca/historical_data/search_historic_data_e.html) and, whenever possible, supplemented them with more detailed records from the then Ontario Ministry of Natural Resources.

The final determinations of “likely cached vs likely fresh vs either possible” status was made by D.F.B. and J.H.S., using their respective botanical and entomological expertise and their specific familiarity with the natural history and phenology of northern Ontario (where we obtained most of our winter adult stomachs) and of APP (where we obtained all of our nestling fecal barcoding and stomach-contents results).

All work was approved by the Animal Utilization Committee at the University of Guelph. Further approval to collect nestling fecal samples was provided by the Canadian Wildlife Service and Ontario Parks.

Results

Summary of nestling diet items identified through DNA fecal metabarcoding

Metabarcoding results from the fledgling faecal samples were represented by 20 biological collections representing three gene regions. Of the three gene regions, raw sequencing results for the ITS2 focusing on fungal taxa had between 0 and 6,349 reads, rbcLa focusing on the plant taxa had between 5 and 852,304 reads, and COI-5P focusing on animal taxa had between 0 and 267,578 reads for a total of 3,761,841 (S5 File). After trimming and filtering there were 2,775,314 reads across 20 samples representing 1469 unique sequences, 1092 from the COI-5P, 370 from the rbcLa, and 7 ITS2 (S6 File). Unique sequence reads were collapsed into groups based on taxonomic assignment and taxonomic placement with enough data in public databases were evaluated using a DNA barcoding gap assessment (S4 File). After collapsing taxonomic assignments and assessing the taxonomic placement, 147 unique taxa were obtained as likely taxa from the nestling fecal sacs, 2% (3/147) were vertebrates (wood frog; Lithobates sylvaticus and a shrew; Sorex cinereus), 74% (109/147) were arthropods, and the remaining 24% (35/147) were plants. Overall, Araneae (30 spiders) and Lepidoptera (44 moths and butterflies) represented the majority total food items detected (Fig 1), but the most common food items were Ericales plants (heather and allies; most commonly blueberries in our study areas), which were detected in 51% of the fecal sacs. The taxonomic identifications of all consumed prey items are provided in S1 Table.

Summary description of consolidated dataset

Fig 1 summarizes and compares results across methodologies based on data in S1 Table, as well as two recently published studies [10, 36]. S1 Table lists 662 observations of Canada jay diet items that were either reported in the literature (excluding observations in 36), observed by us, or reported to us by others. Of these, 121 (18%) were “direct observations” (i.e., cases where one or more jays were seen consuming a single plant or animal species), 147 (22%) were species identified in our fecal barcoding analysis (see below), and 394 (60%) were items identified in stomach contents including 65 items from ten nestling stomachs. Within each of these three “observation-type” categories, we further partitioned observations into three broad food types (arthropods, plants, and vertebrates). Of the 121 direct observations, for example, 20 (17%) involved arthropods, 23 (19%) involved plant material, and 78 (64%) involved vertebrates. The direct observations were unique in that, for two of the direct observation sub-categories (plants and vertebrates), a further useful partitioning was possible. Eight of the 23 (35%) observations of “plants” being consumed by Canada jays involved fungi or slime moulds, hinting at the possibly under-appreciated importance of these taxa in Canada jay diets. Soft-bodied fungi cannot be identified in semi-digested stomach contents, nor could they be detected through dDNA metabarcoding of nestling fecal sacs in this study (because amplification of ITS2 was not successful) or through stable isotope analyses [10]. In a similar manner, our compilation of direct observations of the consumption of vertebrates (n = 75; S1 Table) showed that 8% (n = 6) involved vertebrate eggs, 45% (n = 34) involved carrion, and 47% (n = 35) involved live prey. Most live prey were small mammals or nestling birds but recently fledged birds were also observed to be consumed, including two that were first struck and disabled in flight [37]. Overall, nestling diet, as revealed by the combined results of fecal dDNA metabarcoding and stomach-contents analyses, had a greater proportion of arthropods than winter-adult diet (80% vs 49%), and a correspondingly lower proportion of “plant” items (18% vs 36%) and vertebrate items (2% vs 15%).

Cached vs. fresh food determinations for diet items

Fig 2 and S2 Table summarize the ‘cached-versus-fresh’ results obtained from metabarcoding of nestling fecal sacs and stomach contents from adults and nestlings. Of 194 winter adult food items and 212 nestling diet items, 45% (n = 87) and 59% (n = 125), respectively, were identified to species with sufficiently well-known natural histories that we were able to further categorize the items as “likely cached”, “likely fresh”, or “either possible”. In the subset of winter-adult food items (n = 87), we judged that 39% were “likely cached”, 55% could have been cached or fresh, and only 6% were “likely fresh”. In the corresponding subset of nestling food items (n = 125), we found a different pattern: 28% were “likely cached”, 34% could have been either cached or fresh, and 38% were “likely fresh”. “Likely-cached” food was most likely to be plant items (e.g. berries) for both winter-adult diets (57%; 32/56 total items) and nestling diets (67%; 22/33 total items). In contrast, vertebrate tissue items were the least likely to be “likely-cached” but this may reasonably be attributed to the fact that all the vertebrate items we found in winter stomachs were from species, particularly small mammals, that are active year-round, precluding us from assigning them as either cached or fresh food. Additionally, despite the relatively small proportion of “likely cached” food items in the nestling diet, it is noteworthy that, 66% (10/15) of nestling fecal sacs contained at least one “likely cached” food item, providing further support for the importance of cached food in the nestling diet of Canada jays.

Fig 2.

Fig 2

Characterization of food items as “likely cached” (light red), “either possible” (light purple), or “likely fresh” (light blue) for (A) adult Canada jay winter diet (n = 87 total observations) and (B) nestling diets using direct observations, stomach contents, and metabarcoding (n = 125 total observations). Allocation to the three categories was based on life history characteristics of food items and environmental conditions at time of observation or sample collection. The total number of food items in each of the three “cached” vs “fresh” categories are displayed under “All Food” and designations for the three major food groups (arthropods, plants, and vertebrate tissue) individually are also displayed.

Discussion

The value of dietary DNA metabarcoding to investigate animal diets

We used DNA metabarcoding to significantly improve our knowledge of the nestling diet of Canada jays and to strengthen evidence that stored food is important in that diet. This is a relatively new method for analyzing diets and a careful assessment of its strengths and weaknesses may, therefore, be useful (Table 1; [38]). First, this method does have limitations because it requires DNA reference libraries of potential food items [39, 40]. Incomplete libraries can lead to missed diversity as the sample reads cannot be matched to known references. In addition, samples may be missing read data from biota of interest when PCR primers are not optimized. One possible solution to primers poorly amplifying taxa, or alternatively, primers preferentially amplifying taxa, is to take a PCR-free approach and, instead, obtain sequences for all available DNA in the samples. This is sometimes referred to as a shotgun metagenomics sequencing and while this approach may help to address some primer concerns and could offer potential for prey biomass estimates, it is not without challenges in application, including high costs and the generation of large datasets leading to computationally intensive analyses [41, 42]. While increasing the amount of nucleotide sequence data obtained will provide more information, they are of little use if there are insufficient records in sequence libraries to identify unknown sequences. This lack of nucleotide sequence data was apparent in this work where some taxa were not able to be evaluated for the presence of a DNA barcode gap due to poorly populated libraries. For example, while the ITS2 region was amplified and sequenced for use in fungal identification (although our lack of success was most likely due to the degree of degradation occurring for these soft bodied organisms), the ITS region has been successfully utilized for identification of other taxa including Canadian flora [43]. However, due to the time and cost constraints to amplify the gene region using additional primers, in addition to the poorly populated sequence libraries for this taxonomic group, this approach was not feasible for this study.

Table 1. Pros and cons of different diet analysis techniques compared in our analysis.

Our table outlines potential benefits and drawbacks of using each diet analysis technique we compared in our manuscript. Each diet analysis technique has clear advantages and potential disadvantages that should dictate when a specific approach would be most beneficial to use and when it should be potentially avoided. Study design and the purpose of the study should be considered carefully before a particular approach is used.

Can the method… direct observations stomach contents fecal barcoding stable isotopes
(adults) (nestlings) (adults/nestlings) (adults/nestlings) (adults/nestlings)
…reliably identify all three main food groups (arthropods, "plants", vertebrates)? Yes No No Yes Yes
…estimate the relative contributions of the main food groups to total diet? No No No No Yes
…identify arthropod taxa? Maybe1 No Yes Yes No
…identify vascular plant taxa? Yes No Yes Yes No
…identify vertebrate taxa? Yes No Maybe2 Yes No
…identify fungi, slime moulds (fourth food group)? Yes No No Maybe3 Maybe4
…be minimally invasive? Yes Yes No5 Yes Yes
…be easily used to collect observations/samples? No No No Yes Yes
…use date and food item identity to potentially assess if it was likely cached? No No Yes Yes No
…use date and life history stage of a food item to potentially assess if it was likely cached? No No Yes No No

1in most cases it is difficult to identify arthropods being consumed through direct observations, but certain taxa may be possible to identify

2vertebrate taxa can be identified if stomach contents contain bones or teeth that may facilitate identification

3advances in metabarcoding techniques now mean that identification of fungi and slime moulds should be possible in barcoding studies, although the approaches employed in this study did not allow us to identify fungi in the Canada jay diet

4Freeman et al. 2021 could not distinguish slime moulds or fungi from other groups due to the isotopes used, but their identification may be possible with additional isotopes

5not invasive if stomach contents can be collected from dead animals or in the form of stomach castings (e.g. Fig 3f)

Second, in combination with accurate natural-history and environmental (e.g., snow-depth) information, dDNA metabarcoding largely duplicates the ability of stomach contents analysis to plausibly assess whether or not a given item was “likely cached”. Moreover, at least in principle, stomach contents analysis would outperform barcoding if it found exoskeletal fragments corresponding to an arthropod’s life-history stage that was unavailable in nature on the date when the fragments were found; dDNA metabarcoding cannot aspire to such determinations because it can only provide species identity, not its life-history stage (Table 1). As a practical matter, however, we found no examples where this potential handicap detracted from dDNA metabarcoding’s otherwise superior ability to identify taxa ingested by Canada jays and to determine which items had been “likely cached”. Not only does metabarcoding permit the identification of soft-tissue or small food items that are easily missed by stomach contents analysis [4446], but also, compared to stomach contents analysis, it allows more samples to be collected, far more easily, and importantly, without sacrificing individuals. In just three seasons (2015–2017) of measuring nestlings, we were able to double the number of identified nestling diet items that we had obtained during the previous 40 years through chance acquisitions of stomach contents and castings. A further advantageous feature is that the technique could be extended to more accurately assess the diets of not just nestling Canada jays, but also those of adults, including the winter diets that are of greatest interest. We have no doubt that it would be eminently feasible to capture winter adults and safely hold them in a suitably appointed small cage until they produced a fecal sample. By the same reasoning, it should be possible, in principle, to use fecal dDNA metabarcoding far beyond the limited use we have made of it here and we encourage others to do so.

Characterization of the Canada jay diet by season and life history stage

The four methods we used to elucidate diet (direct observations, stomach contents analysis, stable isotope analysis, and fecal metabarcoding; Fig 1) all indicated that Canada jays consume arthropods, “plants” (almost entirely berries), and vertebrates, but each method provided different estimates for the overall portion of each food type in the diet. For example, it is likely that the high proportions of vertebrate flesh suggested by our compilation of “direct observations” (65%) and stable isotope analysis of food-dependent APP juveniles (72%; 10) are both overestimates. Stable isotope analyses relied on a fractionation factor derived from an unrelated species (Yellow-rumped warbler; Setophaga coronata), which could have resulted in a mischaracterization of diet groups and their contributions to Canada jay diet [10]. Similarly, our compilation of direct observations almost certainly exaggerates the importance of vertebrate flesh in the diet (and minimizes that of arthropods) because observers are more likely to report consumption of familiar vertebrate taxa than of tiny, quickly ingested invertebrates. In contrast, the much lower proportion of vertebrates (24%) reported in a direct-observation study in Denali, Alaska [36] is probably more accurate because it is based on a much larger sample and because the observers made concerted efforts to record all food acquisitions by focal individuals. However, we also note that differences in diet composition between the APP and Denali population could simply be due to differences in prey composition or climate. Despite potential bias associated with direct observations, one major advantage of this approach is that it is able to register the consumption of fleshy fungi (Fig 3a) and slime moulds (Fig 3b), distinguish between incidents of scavenging versus actual predation (Fig 3c and 3d), and record unusual foraging methods (e.g. flycatching or wading into shallow water to capture tadpoles [47] and larval salamanders [48]).

Fig 3. Direct observations made of Canada jays foraging on a wide range of food items.

Fig 3

A) Canada jay (boreal morphotype) in Algonquin Provincial Park, Ontario feeding on Amanita muscaria (Photo by Langis Sirois, November 5, 2018), B) Canada jay (Pacific morphotype) with yellow residue around bill from recent consumption of the slime mould (Fuligo septica) on Vancouver Island, British Columbia (Photo by Dan Strickland, August 1, 2020), C) Canada jay (boreal morphotype) in Algonquin Provincial Park, Ontario regurgitating Choke Cherry (Prunus virginiana) seeds on a date (January 20, 2020) when cherries would only be available as cached items (Photo by Michael Runtz), D) Canada jay in Algonquin Provincial Park, Ontario dismembering a recently caught shrew (Sorex sp.) (Photo by Ann Brokelman, August 27, 2016), E) Canada jay (Rocky Mountain morphotype) in Grand Teton National Park, Wyoming consuming a Saskatoon berry (Amelanchier alnifolia; fide DFB; Photo by Susan Elliot and used with permission from the Macaulay Library at the Cornell Lab of Ornithology (ML478606261), August 26, 2022). F) Canada jay stomach casting apparently consisting of arthropod exoskeletal fragments (Strathcona Provincial Park, Vancouver Island, British Columbia; Photo by Dan Strickland, June 26, 2021).

The only presently available data permitting a comparison of winter and “non-winter” (May 1 –Oct. 31) Canada jay diets are from stomach contents and stomach castings (Fig 3e). Given the frigid realities of boreal forest winters, it is noteworthy that arthropod and plant items (Fig 3f) accounted for 49% and 36%, respectively of items identified in winter-adult stomachs, compared to corresponding figures of 66% and 30% in non-winter stomachs.

In the Canada jay nestling period, arthropods accounted for even greater proportions of the items identified in nestling stomachs and feces (92% and 74%, respectively) compared to the winter diet of adults, possibly reflecting a particular preference in parents for protein-rich arthropods for their rapidly growing nestlings, as has been found in many other bird species [49, 50]. Caution is warranted, however, before concluding that arthropods are overwhelmingly important in the diet of Canada jay nestlings. Notably, dDNA metabarcoding cannot rule out the possibility of secondary predation [51]. Also, in their stable-isotope analysis of nestling tissues, Freeman et al. [10] found that only 14% of the nestling diet was attributable to invertebrates whereas 51% of the food given to nestlings was believed to be of vertebrate or human-food origin. Notwithstanding methodological issues with stable isotope analysis noted above, this may well be the more meaningful representation of the nestling diet because stable isotopes give an estimate of volume of each food class whereas the analysis of nestling stomach contents or feces yields numbers of separate, identifiable food items or taxa contained in each stomach or fecal sac without an indication of the proportion of each food item. Consider that, if a Canada jay consumed three individuals of each of five species of arthropods and an amount of flesh and identifiable small bones from a single mouse species with an equal nutritional value and mass to that of the arthropods, stomach-contents analysis would potentially indicate, correctly, that the jay had consumed 15 arthropod items and one vertebrate item, dDNA metabarcoding would indicate that the diet included 5 arthropod taxa and one vertebrate, and stable-isotope analysis would indicate, also correctly, that arthropods and vertebrates contributed equally to the jay’s nutrition. Despite these persisting uncertainties in how best to characterize Canada jay diets, we believe our results make clear that nestling and adult diets are at least roughly similar and that, even in winter, Canada jays somehow have access to berries and arthropods that would seem to be primarily, if not exclusively, available only in the snow-free part of the year.

Contribution of stored food to adult-winter and nestling diets

By combining the identification to species level of many items found in the stomachs and feces of winter adults and nestlings with natural history information about those food species, we deemed that 39% of winter-adult items had likely been cached and only 6% were likely found as “fresh items”. By contrast, we deemed that only 28% of items detected in nestling stomachs and feces had likely been cached and at least 38% had been consumed as fresh items. We attribute the much greater proportion of apparently fresh items in the nestling diet to the fact that Canada jay nestling period in our study area typically straddles the disappearance of snow cover in mid- to late-April and, in our case, half of our nestling stomachs and fecal sacs were collected considerably later than that, even as late as May 19 (from a replacement nest). Derbyshire et al. [20] previously pointed out that once snow cover has disappeared, Canada jay parents forage for their nestlings on the forest floor where food storage has never been observed and Swift et al. [36] similarly reported that Canada jays in Denali, Alaska “responded to a record-setting warm spring by directing their foraging efforts away from cache recovery and towards the emergence of fresh food”. Together these results lend further support to the suggestion by Strickland and Ouellet [22] that food storage is the key behaviour in Canada jays permitting not only their high winter survival and territorial fidelity, but also their high nesting success (thanks to the use of stored food as critical “emergency food” in late springs or during the not unusual snow and ice storms that occur in the late-winter Canada jay nesting season).

Our work demonstrates the value of combining methods (e.g., direct observation, stomach contents, and DNA metabarcoding) when assessing the composition of the Canada Jay’s nestling and adult diets. We believe a similar approach could be applied to other free-living species to advance our knowledge of the diet of wild birds. Such knowledge is integral to understanding how individuals interact with their environments and how diet, and subsequently individual performance, may change across habitats and with climate change [52].

Supporting information

S1 Table. Summary of all food items known to be consumed by Canada jays.

The date and location the observation was made is included for each observation in addition to the identification of the food item being consumed. We also summarize additional information related to who made the record, what method was used to make the observation (“DO”–direct observation, “SC”–stomach content analysis, “FS”–metabarcoding of fecal samples), the age of the individual which consumed a food item (e.g., adult, juvenile, or nestling), and which of our four diet groups the observation belongs in (“A”–arthropod, “P”–“plants”, “V”–vertebrate tissue). Finally, for observations made during the winter and spring, we describe whether a food item was “likely-cached”, “likely fresh”, or if it could be either cached or fresh. Observations labeled with “ROM” represent stomach samples from the Royal Ontario Museum collection.

(DOCX)

pone.0300583.s001.docx (180.6KB, docx)
S2 Table. Summary table of diet observations made during the winter for both adult and nestling Canada jays.

Diet items were designated as “likely cached”, “likely fresh”, or “either possible” based on natural history information available for each food item. We further classified diet items based on whether they were consumed by adults or nestlings, and based on the method used to collect the diet item.

(XLSX)

pone.0300583.s002.xlsx (12.7KB, xlsx)
S1 File. Barcode gap records.

Custom dataset containing sequence data sets from the BOLD system and used to assess if a barcode. Gap exists for BLAST identifications.

(TSV)

pone.0300583.s003.tsv (8.6MB, tsv)
S2 File. A complete list of primers used during PCR amplification.

All primers used in our analysis are outlined here in addition to being described in the method section. Available at 10.6084/m9.figshare.25422847.

(TXT)

pone.0300583.s004.txt (218B, txt)
S3 File. R script used to calculate genetic distances between species identified using DNA metabarcoding.

Available at 10.6084/m9.figshare.25422847.

(DOCX)

pone.0300583.s005.docx (24.1KB, docx)
S4 File. Within and between taxa values used to assess the specificity of taxonomic assignments from our DNA metabarcoding analysis.

(TSV)

pone.0300583.s006.tsv (19.6KB, tsv)
S5 File. Raw, unfiltered sequencing results from our DNA metabarcoding analysis.

(TSV)

pone.0300583.s007.tsv (4.8KB, tsv)
S6 File. Trimmed and filtered BLAST results.

(TSV)

pone.0300583.s008.tsv (239.7KB, tsv)

Acknowledgments

We would like to thank numerous field assistants who collected diet information and demographic data from Canada jays in Algonquin Provincial Park. We would also like to thank the Ontario Ministry of Northern Development, Mines, Natural Resources and Forestry for providing snow depth data. Finally, thank you to Langis Sirois, Michael Runtz, Ann Brokelman, the Macaulay Library, and Susan Elliot for allowing us to use their photos.

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

This study was financially supported by the Natural Sciences and Engineering Research Council of Canada (https://www.nserc-crsng.gc.ca) in the form of a grant received by DRN and NEF. This study was also financially supported by Ontario Parks (https://www.ontarioparks.ca) in the form of a grant received by AOS, DRN, and NEF. This study was also financially supported by Friends of Algonquin Park (https://www.algonquinpark.on.ca) in the form of a grant. This study was also financially supported by the Wildlife Conservation Society Canada (https://www.wcscanada.org) in the form of a grant received by AOS and NEF. This study was also financially supported by the Ministry of Natural Resources and Forestry (https://www.ontario.ca/page/ministry-natural-resources-and-forestry) in the form of a grant received by AOS, DRN and NEF, in addition to providing logistical support for this project. The funders had no additional role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Freeman NE, Norris DR, Sutton AO, Newman AEM. Raising young with limited resources: supplementation improves body condition and advances fledging of Canada Jays. Ecology. 2020;101(1):e02909. doi: 10.1002/ecy.2909 [DOI] [PubMed] [Google Scholar]
  • 2.Kearney SP, Porensky LM, Augustine DJ, Derner JD, Gao F. Predicting spatial-temporal patterns of diet quality and large herbivore performance using satellite time series. Ecological Applications. 2022;32(2):e2503. doi: 10.1002/eap.2503 [DOI] [PubMed] [Google Scholar]
  • 3.Pascacio-Villafán C, Righini N, Nestel D, Birke A, Guillén L, Aluja M. Diet quality and conspecific larval density predict functional trait variation and performance in a polyphagous frugivorous fly. Functional Ecology. 2022;36(5):1163–76. [Google Scholar]
  • 4.Robinson BG, Franke A, Derocher AE. Estimating nestling diet with cameras: quantifying uncertainty from unidentified food items. Wildlife Biology. 2015;21(5):277–82. [Google Scholar]
  • 5.Ralph CP, Nagata SE, Ralph CJ. Analysis of Droppings to Describe Diets of Small Birds. Journal of Field Ornithology. 1985;56(2):165–74. [Google Scholar]
  • 6.Meckstroth AM, Miles AK, Chandra S. Diets of Introduced Predators Using Stable Isotopes and Stomach Contents. The Journal of Wildlife Management. 2007;71(7):2387–92. [Google Scholar]
  • 7.Connan M, McQuaid CD, Bonnevie BT, Smale MJ, Cherel Y. Combined stomach content, lipid and stable isotope analyses reveal spatial and trophic partitioning among three sympatric albatrosses from the Southern Ocean. Marine Ecology Progress Series. 2014;497:259–72. [Google Scholar]
  • 8.Valentini A, Pompanon F, Taberlet P. DNA barcoding for ecologists. Trends in Ecology & Evolution. 2009;24(2):110–7. doi: 10.1016/j.tree.2008.09.011 [DOI] [PubMed] [Google Scholar]
  • 9.Verkuil YI, Nicolaus M, Ubels R, Dietz MW, Samplonius JM, Galema A, et al. DNA metabarcoding quantifies the relative biomass of arthropod taxa in songbird diets: Validation with camera-recorded diets. Ecology and Evolution. 2022;12(5):e8881. doi: 10.1002/ece3.8881 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Freeman NE, Newman AEM, Sutton AO, Kyser TK, Norris DR. Causes and consequences of variation in diet composition of nestling Canada jays. Journal of Avian Biology. 2021;52(1). [Google Scholar]
  • 11.Vander Wall SB. Food Hoarding in Animals. University of Chicago Press; 1990. 458 p. [Google Scholar]
  • 12.Bock WJ. Salivary Glands in the Gray Jays (Perisoreus). The Auk. 1961;78(3):355–65. [Google Scholar]
  • 13.Dow DD. The Role of Saliva in Food Storage by the Gray Jay. The Auk. 1965;82(2):139–54. [Google Scholar]
  • 14.Norris DR, Flockhart DTT, Strickland D. Contrasting patterns of survival and dispersal in multiple habitats reveal an ecological trap in a food-caching bird. Oecologia. 2013;173(3):827–35. doi: 10.1007/s00442-013-2680-1 [DOI] [PubMed] [Google Scholar]
  • 15.Sutton AO, Strickland D, Freeman NE, Norris DR. Climate-driven carry-over effects negatively influence population growth rate in a food-caching boreal passerine. Global Change Biology. 2021;27(5):983–92. [DOI] [PubMed] [Google Scholar]
  • 16.Strickland D, Kielstra B, R Norris D. Experimental evidence for a novel mechanism driving variation in habitat quality in a food-caching bird. Oecologia. 2011;167(4):943–50. doi: 10.1007/s00442-011-2040-y [DOI] [PubMed] [Google Scholar]
  • 17.Tomback DF. Clark’s Nutcracker (Nucifraga columbiana), version 1.0. Birds of the World. 2020; https://birdsoftheworld-org/bow/species/clanut/cur/introduction
  • 18.Sechley TH, Strickland D, Norris DR. Causes and consequences of pre-laying weight gain in a food-caching bird that breeds in late winter. Journal of Avian Biology. 2014;45(1):85–93. [Google Scholar]
  • 19.Tozer R. Birds of Algonquin Provincial Park. 2012. The Friends of Algonquin Park, Whitney, Canada. [Google Scholar]
  • 20.Derbyshire R, Norris DR, Hobson KA, Strickland D. Isotopic spiking and food dye experiments provide evidence that nestling Canada Jays (Perisoreus canadensis) receive cached food from their parents. Canadian Journal of Zoology. 2019;97(4):368–75. [Google Scholar]
  • 21.de Sousa LL, Silva SM, Xavier R. DNA metabarcoding in diet studies: Unveiling ecological aspects in aquatic and terrestrial ecosystems. Environmental DNA. 2019;1(3):199–214. [Google Scholar]
  • 22.Strickland D, Ouellet HR. Canada Jay (Perisoreus canadensis). Rodewald PG, editor. Birds of the World. 2020; https://birdsoftheworld-org./Species-Account/species/gryjay/introduction [Google Scholar]
  • 23.Zeale MRK, Butlin RK, Barker GLA, Lees DC, Jones G. Taxon-specific PCR for DNA barcoding arthropod prey in bat faeces. Molecular Ecology Resources. 2011;11(2):236–44. doi: 10.1111/j.1755-0998.2010.02920.x [DOI] [PubMed] [Google Scholar]
  • 24.Prosser SWJ, Hebert PDN. Rapid identification of the botanical and entomological sources of honey using DNA metabarcoding. Food Chemistry. 2017;214:183–91. doi: 10.1016/j.foodchem.2016.07.077 [DOI] [PubMed] [Google Scholar]
  • 25.CBOL. A DNA Barcode for Land Plants. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(31):12794–7. doi: 10.1073/pnas.0905845106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Ivanova NV, Dewaard JR, Hebert PDN. An inexpensive, automation-friendly protocol for recovering high-quality DNA. Molecular Ecology Notes. 2006;6(4):998–1002. [Google Scholar]
  • 27.Kress WJ, Erickson DL. A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region. PLoS One. 2007. Jun 6;2(6):e508. doi: 10.1371/journal.pone.0000508 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proceedings of the National Academy of Sciences USA. 2012;109(16):6241–6. doi: 10.1073/pnas.1117018109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ratnasingham S, Hebert PDN. bold: The Barcode of Life Data System (http://www.barcodinglife.org). Molecular Ecology Notes. 2007;7(3):355–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Virgilio M, Backeljau T, Nevado B, De Meyer M. Comparative performances of DNA barcoding across insect orders. BMC Bioinformatics. 2010;11(1):206. doi: 10.1186/1471-2105-11-206 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011. May 2;17(1):10–2. [Google Scholar]
  • 32.Meyer CP, Paulay G. DNA Barcoding: Error Rates Based on Comprehensive Sampling. PLoS Biology. 2005;3(12):e422. doi: 10.1371/journal.pbio.0030422 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Molecular Biology and Evolution. 2013;30(4):772–80. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Molecular Biology and Evolution. 2018;35(6):1547–9. doi: 10.1093/molbev/msy096 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Paradis E, Claude J, Strimmer K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics. 2004;20(2):289–90. doi: 10.1093/bioinformatics/btg412 [DOI] [PubMed] [Google Scholar]
  • 36.Swift KN, Williams EJ, Marzluff JM. An observational analysis of Canada Jay (Perisoreus canadensis) foraging and caching ecology in Denali National Park and Preserve, Alaska, USA. Canadian Journal of Zoology. 2022;100(2):133–46. [Google Scholar]
  • 37.Fuirst M, McLeod J, Norris DR. Habitat preferences of adult Canada Jays (Perisoreus canadensis) during the post-breeding period in Algonquin Provincial Park, Ontario, Canada. Canadian Journal of Zoology. 2022;100(6):355–62. [Google Scholar]
  • 38.Ando H, Mukai H, Komura T, Dewi T, Ando M, Isagi Y. Methodological trends and perspectives of animal dietary studies by non-invasive fecal DNA metabarcoding. Environmental DNA. 2020;2(4):391–406. [Google Scholar]
  • 39.Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Molecular Ecology. 2012;21(8):2045–50. doi: 10.1111/j.1365-294X.2012.05470.x [DOI] [PubMed] [Google Scholar]
  • 40.De Barba M, Adams JR, Goldberg CS, Stansbury CR, Arias D, Cisneros R, et al. Molecular species identification for multiple carnivores. Conservation Genetics Resources. 2014;6(4):821–4. [Google Scholar]
  • 41.Alberdi A, Aizpurua O, Bohmann K, Gopalakrishnan S, Lynggaard C, Nielsen M, et al. Promises and pitfalls of using high-throughput sequencing for diet analysis. Molecular Ecology Resources. 2019. Mar;19(2):327–48. doi: 10.1111/1755-0998.12960 [DOI] [PubMed] [Google Scholar]
  • 42.Thomas V, Clark J, Doré J. Fecal microbiota analysis: an overview of sample collection methods and sequencing strategies. Future Microbiology. 2015;10(9):1485–504. doi: 10.2217/fmb.15.87 [DOI] [PubMed] [Google Scholar]
  • 43.Michel CI, Meyer RS, Taveras Y, Molina J. The nuclear internal transcribed spacer (ITS2) as a practical plant DNA barcode for herbal medicines. Journal of Applied Research on Medicinal and Aromatic Plants. 2016;3(3):94–100. [Google Scholar]
  • 44.Deagle BE, Gales NJ, Evans K, Jarman SN, Robinson S, Trebilco R, et al. Studying Seabird Diet through Genetic Analysis of Faeces: A Case Study on Macaroni Penguins (Eudyptes chrysolophus). PLoS One. 2007;2(9):e831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zarzoso-Lacoste D, Bonnaud E, Corse E, Gilles A, Meglecz E, Costedoat C, et al. Improving morphological diet studies with molecular ecology: An application for invasive mammal predation on island birds. Biological Conservation. 2016;193:134–42. [Google Scholar]
  • 46.Groen K, Beekenkamp S, de Iongh HH, Lesilau F, Chege M, Narisha L, et al. DNA metabarcoding illuminates the contribution of small and very small prey taxa to the diet of lions. Environmental DNA. 2023; 5:1321–1331. [Google Scholar]
  • 47.Beiswenger R. Predation by gray jays on aggregating tadpoles of the boreal toad (Bufo boreas). Copeia 1981;2:459–460. [Google Scholar]
  • 48.Murray MP, Pearl CA, Bury RB. Apparent Predation by Gray Jays, Perisoreus canadensis, on Long-toed Salamanders, Ambystoma macrodactylum, in the Oregon Cascade Range. The Canadian Field-Naturalist. 2005;119(2):291–2. [Google Scholar]
  • 49.Wilman H, Belmaker J, Simpson J, de la Rosa C, Rivadeneira MM, Jetz W. EltonTraits 1.0: Species-level foraging attributes of the world’s birds and mammals. Ecology. 2014;95(7):2027–2027. [Google Scholar]
  • 50.Bowler DE, Heldbjerg H, Fox AD, de Jong M, Böhning-Gaese K. Long-term declines of European insectivorous bird populations and potential causes. Conservation Biology. 2019;33(5):1120–30. doi: 10.1111/cobi.13307 [DOI] [PubMed] [Google Scholar]
  • 51.Sheppard SK, Bell J, Sunderland KD, Fenlon J, Skervin D, Symondson WOC. Detection of secondary predation by PCR analyses of the gut contents of invertebrate generalist predators. Molecular Ecology. 2005;14(14):4461–4468. doi: 10.1111/j.1365-294X.2005.02742.x [DOI] [PubMed] [Google Scholar]
  • 52.Bookwalter J, Niyas AMM, Caballero-López B, Villari C, Marco-Tresserras J, Burgas A, et al. DNA metabarcoding Passerine bird feces at tree-line uncovers little intra- and inter-species dietary overlap. Community Ecology. 2023;24:147–157. [Google Scholar]

Decision Letter 0

Petr Heneberg

17 Sep 2023

PONE-D-23-24828Fecal DNA metabarcoding helps characterize Canada jay diet and confirms the reliance of stored food for winter survival and breedingPLOS ONE

Dear Dr. Sutton,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Nov 01 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Petr Heneberg

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following financial disclosure: 

"We would also like to thank NSERC (DRN, NEF), Ontario Parks (AOS, DRN, NEF), Friends of Algonquin Park, Wildlife Conservation Society Canada (AOS, NEF), and the Ministry of Natural Resources and Forestry (AOS, DRN, NEF) for providing funding and logistical support for our project. "

  

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." 

If this statement is not correct you must amend it as needed. 

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

3. Please expand the acronym “NSERC” (as indicated in your financial disclosure) so that it states the name of your funders in full.

This information should be included in your cover letter; we will change the online submission form on your behalf.

4. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"We would like to thank countless field assistants who collected diet information and demographic data from Canada jays in Algonquin Provincial Park. We would also like to thank NSERC, Ontario Parks, Friends of Algonquin Park, Wildlife Conservation Society Canada, and the Ministry of Natural Resources and Forestry for providing funding and logistical support for our project. Finally, we would like to thank the Ontario Ministry of Northern Development, Mines, Natural Resources and Forestry for providing snow depth data."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

"We would also like to thank NSERC (DRN, NEF), Ontario Parks (AOS, DRN, NEF), Friends of Algonquin Park, Wildlife Conservation Society Canada (AOS, NEF), and the Ministry of Natural Resources and Forestry (AOS, DRN, NEF) for providing funding and logistical support for our project. "

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. We note that you have referenced (unpublished) on page 2, which has currently not yet been accepted for publication. Please remove this from your References and amend this to state in the body of your manuscript: (ie “Bewick et al. [Unpublished]”) as detailed online in our guide for authors

http://journals.plos.org/plosone/s/submission-guidelines#loc-reference-style 

6. Please include your full ethics statement in the ‘Methods’ section of your manuscript file. In your statement, please include the full name of the IRB or ethics committee who approved or waived your study, as well as whether or not you obtained informed written or verbal consent. If consent was waived for your study, please include this information in your statement as well. 

7. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

Reviewer #4: Partly

Reviewer #5: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: No

Reviewer #4: I Don't Know

Reviewer #5: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

Reviewer #4: No

Reviewer #5: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

Reviewer #5: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a well-written paper. It is particularly valuable in that it compares the results of 4 methods that have been used widely. I supports the view that a substantial portion of the jay's diet come from cached food.

Reviewer #2: The presented manuscript describes original research on determining the diet of Canada jay. As the identification of wild animal’s diet is notoriously difficult, the authors venture into new methodology, namely metabarcoding, to help inform their inferences and determine how important food caches are to the Canada jay’s winter survival. They combine this DNA-based method with more traditionally used stomach contentanalyses, stable isotopes and direct observations.

Article is written in a clear and concise way, with good structure - reasoning is fairly transparent and followed throughout the text. Methods and analysis are only partially well described (see comments below). Results are presented with clarity and the overall impression of the manuscript is positive; however, it could definitely benefit from additional analyses/visualizations.

Major issues

There is the need to describe metabarcoding approach more. The authors bravely dive into this methodology, new to the field. Even more, they encourage others to use metabarcoding to widen the scope of their studies. It is very commendable, however, comes with large responsibility. The authors are basically setting the standards for the whole field. They have to present the method comprehensively and understandably. At the moment, the manuscript elaborates on the selection of markers (important matter) but lack entirely any description of DNA extraction, amplification and sequencing (ironically DNA extraction, amplification and sequencing is the deadline of the paragraph; line 130). The authors refer to Posser and Hebert 2017 for all details. In my opinion, which is not alienated, the reader deserves to have a short description of what happened with the samples, as it is necessary to understand the results.

The same is true for the Bioinformatics section of the manuscript (starting at line 153). Information about filtering to a minimum length of x is not informative because we did not learn what how the samples were sequenced. We don’t know what technology was used (Illumina?) and what length of reads was generated?? We’re completely uniformed in this respect. Bioinformatic processing is crucial in metabarcoding projects and has to be reproducible. It is necessary that to publish your scripts/pipelines in some form. Or at least describe it in detail. In line 157 you mention custom reference databases. Custom in what way? Please give readers some details on how you made those databases, so that they can reproduce your analysis.

Two main figures of the text, Fig 1 and 2, are showing a similar type of data but Fig 1 in percentages and Fig 2 in number of food items. It left me a bit perplexed, as I did not understand the reason behind this choice. Relative abundance of anything (like food types Fig. 1) can be generally misleading and I thought it would be more straight-forward to use just absolute numbers in both figures. But perhaps I’m just unfamiliar with the type of analyses you’re doing and I am definitely open to read your reasoning.

Lastly, you make a very good point in the discussion about advantages, and shortcomings of different methods and basically you give great insights on how use those very different sources of data, combine and make sense out it all. I think your paper would profit immensely from a figure summarizing some of the aspects of it. What methods exaggerate importance of some food sources and diminish others? And it could be the place to highlight how metabarcoding improves the overall picture. Obviously, this is just a suggestion but I think it will make your work much more impactful.

Minor issues

The work compiles a large dataset of direct observations of what birds were consuming. It is a very impressive work done with great detail. In fact, I think the dataset is really worth sharing with other researches and should be formatted in a way that is easily accessible. Right now is a Word .doc submitted as a supplementary information. I think it would make much more sense to share it in a .CSV or .TSV format, perhaps GitHub is a good place to upload it? Additionally, the authors could use GitHub to share their bioinformatic pipeline as well.

Line 127: samples are stored in a freezer and then in another freezer. What is the temperature of the first freezer?

Line 184: A simple distance matrix. What distance? Euclidean? Jaccard? B-C? Specify please.

Line 185: Provide version of the package and citation.

Line 277: metabarcoding, not just barcoding.

Line 301: when reference libraries are incomplete you cannot identify some species down to species level but it doesn’t cause misidentification automatically.

Line 302: PCR primers have what is called “primer bias” and some taxa might be missed completely by certain primers, while other taxa will be amplifying strong. It is recommended to use wide-spectrum primers; not optimized for the taxa of interest. I think I understand what you meant in this sentence but the wording suggests something different. Consider clarifying.

Reviewer #3: The authors can find more detailed comments in the manuscript attached. I do feel that the following points needs to be addressed to make this work meaningful and add scientific value to the study:

1. The first concern is with the description of the metabarcoding methods that were used. The description of the methods are incomplete (PCR protocol, sequencing done). The description of the bioinformatic methods used are incomplete (software packages). Which model was chosen to create the distance matrices? There is a reference to supplemental data (distance matrices) that were not available. The methods used to assign taxonomy are not described clearly.

2. It is noted that custom databases were created from data obtained from BOLD and Genbank. However, there are no reference species lists noted that were used to create these custom reference databases. Furthermore the accession numbers of the sequences used to create the reference bases are missing as well as the criteria used to choose the sequences for these custom reference databases.

3.In the results section the metabarcoding results are again incomplete. Of the 147 taxa detected give a list of how many and which taxa, up to family level, genus level and species level? What was the results of the gap analyses for each of the custom databases?

4.In the discussion the authors refer to only the detected food items and not to the relative abundances of the food items that can shed much more light on the diets especially during winter and non-winter and adults vs nestlings. It can even shed more light on cached vs fresh food items. Through the use of FOO and RRA biases could be explored as well as the possibility of using cached items.

5.In the Abstract no reference is made to the type of food items (plants, invertebrates) or how many taxa were detected. There is no mention of dmetabarcoding methods which forms a critical part of the study.

Reviewer #4: I would like to thank the authors and the editor to give me the opportunity to read the manuscript titled “Fecal DNA metabarcoding helps characterize Canada jay diet and confirms the reliance of stored food for winter survival and breeding”.

I found the manuscript very interesting and I think the study has merit, especially in combining molecular and non-molecular techniques.

My expertise is mostly revolving around DNA metabarcoding, so I focused my review on this technique.

Unfortunately, I am afraid the manuscript in its current form requires a major revision since most of the information on the use of metabarcoding are missing or are incomplete.

However, I strongly encourage the authors to address my comments and concerns and provide a revised version of their work, since it will be an important tool for future research.

Main comments:

If the ITS marker could not be amplified (despite following published protocols), the authors should probably remove any reference to it everywhere in this manuscript.

Metabarcoding methods needs to be improved:

Page 7: No information is provided for the sequencing. I think it is very important to provide this kind of information in detail:

- What did the authors do after PCR amplification?

- How did they prepare the sequencing library?

- Were the samples run in duplicate?

- Were there controls to account for plant/fungi present in the nest but not part of the diet?

- Were dual unique indexes used?

- Who performed the sequencing (in-house or outsourced)?

- What platform did the authors used to perform the sequencing?

- Were all markers pooled in the same sequencing run?

Page 8: Metabarcoding reads quality control is not really mentioned, except for size and primer trimming. Were there any chimera removal? Were coding genes checked for stop codons? Especially for fungi, was there any threshold-based OTU clustering?

Page 9: please include software references for R/RStudio and package version for each of the packages used.

Results:

The authors should summarise the fate of the raw reads obtained using metabarcoding, ideally for each genetic marker. For example:

“A total of XX millions raw reads was generated from XX samples. Of these, X thousand reads passed quality controls. A total of XX taxa could be identified based on COI, XXX taxa for ITS, XXX taxa for Rbcl”

Raw data for metabarcoding analysis should be made publicly available.

Minor comments:

Line 132: Change “molecular regions” to “gene regions”.

Line 133: Italicise the name of the genes when reported in full. “Cytochrome c Oxidase” should be “Cytochrome c Oxidase” and so on.

Line 139: the expression “molecular gene region” is used a number of times. I think the use of “molecular” in this case is redundant since any gene region is “molecular” in a way. Can you please remove all instances of “molecular” and leave only “gene region”? At line 138, “molecular marker” is correct and can be used since it specify the type of marker.

Lines 144-145: repetition of “well established” twice over two sentences. If you could remove one instance, it may make the sentence flow better.

Line 147: Change “molecular regions” to “gene regions”.

Line 155: This sentence is missing something “Primers were trimmed from each read and filtered again by a minimum size of 100 bp”. The authors trimmed the primers and then filtered.. the reads? As it stands, the subject of the second sentence is still “primers”.

Line 244 (and line 226): Following the rules of the International Code of Zoological Nomenclature, only names of genera and species should be italicised, but not families and orders. Araneae and Lepidoptera (and Zapodidae) should not be italicised. Also the abbreviation “sp.” should not be italicised (line 227).

Line 246: I am less familiar with plant taxonomy, but I think the above rule also applies there. Ericales should not be italicised.

Line 247: no need of upper case for “blueberries”.

Reviewer #5: This is a very interesting work, and only a few issues need the author's attention.

1. More literature on DNA fecal metabarcoding needs to be added in the introduction section.

2. The methods section needs to be greatly simplified, especially the section on sequencing and bioinformatics. In addition, it is necessary to indicate which sequencing platform was used for sequencing.

3. What does "nestlings" mean?How did this research get samples of nestlings?Do young birds at different stages of development need to be distinguished?

4. Why not use ITS2 to amplify plants, as suggested in this paper “Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species”?

5. Could you consider using "shotgun metabarcoding" for further research? At least the relevant discussion should be added to the discussion section. Relevant literature can be used for reference, for example “The species identification in traditional herbal patent medicine, Wuhu San, based on shotgun metabarcoding”.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Sandra Barnard

Reviewer #4: No

Reviewer #5: No

**********

NOTE: Some reviewer comments were submitted as an attachment file, they are attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-23-24828_reviewer.pdf

pone.0300583.s009.pdf (1.7MB, pdf)
PLoS One. 2024 Apr 24;19(4):e0300583. doi: 10.1371/journal.pone.0300583.r002

Author response to Decision Letter 0


21 Nov 2023

Response to Reviewers

We thank the Associate Editor and five reviewers for their valuable feedback and comments on the manuscript. Below, we have addressed all of the comments from reviewers.

Reviewer #1: This is a well-written paper. It is particularly valuable in that it compares the results of 4 methods that have been used widely. I support the view that a substantial portion of the jay's diet come from cached food.

Author response: We are glad the reviewer enjoyed our paper and thank them for their time reviewing the manuscript.

Reviewer #2: The presented manuscript describes original research on determining the diet of Canada jay. As the identification of wild animal’s diet is notoriously difficult, the authors venture into new methodology, namely metabarcoding, to help inform their inferences and determine how important food caches are to the Canada jay’s winter survival. They combine this DNA-based method with more traditionally used stomach content analyses, stable isotopes and direct observations.

Article is written in a clear and concise way, with good structure - reasoning is fairly transparent and followed throughout the text. Methods and analysis are only partially well described (see comments below). Results are presented with clarity and the overall impression of the manuscript is positive; however, it could definitely benefit from additional analyses/visualizations.

Author response: We thank reviewer #2 for reviewing our manuscript and providing valuable feedback on our analysis.

Major issues

There is the need to describe metabarcoding approach more. The authors bravely dive into this methodology, new to the field. Even more, they encourage others to use metabarcoding to widen the scope of their studies. It is very commendable, however, comes with large responsibility. The authors are basically setting the standards for the whole field. They have to present the method comprehensively and understandably. At the moment, the manuscript elaborates on the selection of markers (important matter) but lack entirely any description of DNA extraction, amplification and sequencing (ironically DNA extraction, amplification and sequencing is the deadline of the paragraph; line 130). The authors refer to Posser and Hebert 2017 for all details. In my opinion, which is not alienated, the reader deserves to have a short description of what happened with the samples, as it is necessary to understand the results.

The same is true for the Bioinformatics section of the manuscript (starting at line 153). Information about filtering to a minimum length of x is not informative because we did not learn what how the samples were sequenced. We don’t know what technology was used (Illumina?) and what length of reads was generated?? We’re completely uniformed in this respect. Bioinformatic processing is crucial in metabarcoding projects and has to be reproducible. It is necessary that to publish your scripts/pipelines in some form. Or at least describe it in detail. In line 157 you mention custom reference databases. Custom in what way? Please give readers some details on how you made those databases, so that they can reproduce your analysis.

Author response: As suggested, we have revised the methods section to provide more of an overview of the methods used to process and sequence samples used in our DNA metabarcoding analysis (lines 128 – 187). We highlight changes made throughout the methods section below in response to specific comments.

Two main figures of the text, Fig 1 and 2, are showing a similar type of data but Fig 1 in percentages and Fig 2 in number of food items. It left me a bit perplexed, as I did not understand the reason behind this choice. Relative abundance of anything (like food types Fig. 1) can be generally misleading and I thought it would be more straight-forward to use just absolute numbers in both figures. But perhaps I’m just unfamiliar with the type of analyses you’re doing and I am definitely open to read your reasoning.

Author response: We chose to include Figure 1 because stable isotope analyses of tissue can only estimate the proportion of each food type, not the absolute number of diet items. For the reasons reviewer #2 points out, in figure 2 we present the absolute numbers without the stable isotope results.

Lastly, you make a very good point in the discussion about advantages, and shortcomings of different methods and basically you give great insights on how use those very different sources of data, combine and make sense out it all. I think your paper would profit immensely from a figure summarizing some of the aspects of it. What methods exaggerate importance of some food sources and diminish others? And it could be the place to highlight how metabarcoding improves the overall picture. Obviously, this is just a suggestion but I think it will make your work much more impactful.

Author response: As suggested, we have created a table to describe pros and cons of each analysis and have included this in the revised version of the manuscript (Table 1).

Minor issues

The work compiles a large dataset of direct observations of what birds were consuming. It is a very impressive work done with great detail. In fact, I think the dataset is really worth sharing with other researches and should be formatted in a way that is easily accessible. Right now is a Word .doc submitted as a supplementary information. I think it would make much more sense to share it in a .CSV or .TSV format, perhaps GitHub is a good place to upload it? Additionally, the authors could use GitHub to share their bioinformatic pipeline as well.

Author response: We plan on making the data from our analysis available to anyone interested in using it. Further, data generated from our DNA metabarcoding and the complete list of diet items identified will also be deposited in an online repository once the paper is accepted. The code used in the analyses are available online (via GitHub and CRAN) and are included in the text of the updated methods section.

Line 127: samples are stored in a freezer and then in another freezer. What is the temperature of the first freezer?

Author response: The movement of the samples has been clarified and the first freezer temperature has been included (line 126).

Line 184: A simple distance matrix. What distance? Euclidean? Jaccard? B-C? Specify please.

Author response: As suggested, we have revised this sentence to include the phrase “…, the proportion or the number of sites that differ between each pair of aligned sequences,…” (lines 224 – 226).

Line 185: Provide version of the package and citation.

Author response: Thank you, we have included the Ape version number that was used in the analysis and note that the citation for the ape package is the citation at the end of the sentence. The updated text reads “matrix was constructed using with the R package Ape (Ver. 4.1) and the dist.dna() matrix function (33)” (lines 225 – 226).

Line 277: metabarcoding, not just barcoding.

Author response: As suggested, we have added metabarcoding in addition to the barcoding already mentioned (line 334).

Line 301: when reference libraries are incomplete you cannot identify some species down to species level but it doesn’t cause misidentification automatically.

Author response: We agree and “misidentifications” was perhaps not the best choice in words. We have revised this section to read: “First, this method does have limitations because it requires DNA reference libraries of potential food items (39,40). Incomplete libraries can lead to missed diversity as the experimental reads cannot be matched to known references. In addition, experimental results may be missing read data from biota of interest when PCR primers are not optimized.” (lines 357 – 361).

Line 302: PCR primers have what is called “primer bias” and some taxa might be missed completely by certain primers, while other taxa will be amplifying strong. It is recommended to use wide-spectrum primers; not optimized for the taxa of interest. I think I understand what you meant in this sentence but the wording suggests something different. Consider clarifying.

Author response: Thank you, please see above comment as we have updated the writing and clarified our point. The updated text now reads: “In addition, experimental results may be missing read data from biota of interest when PCR primers are not optimized. One possible solution to primers poorly amplifying taxa, or alternatively, primers preferentially amplifying taxa, is to take a PCR-free approach and, instead, obtain sequences for all available DNA in the samples. This is sometimes referred to as a shotgun metagenomics sequencing and while this approach may help to address some primer concerns and could offer potential for prey biomass estimates it is not without challenges in application, including high costs and the generation of large datasets leading to computationally intensive analyses (41, 42).” Lines 359 – 366).

Reviewer #3: The authors can find more detailed comments in the manuscript attached. I do feel that the following points needs to be addressed to make this work meaningful and add scientific value to the study:

1. The first concern is with the description of the metabarcoding methods that were used. The description of the methods are incomplete (PCR protocol, sequencing done). The description of the bioinformatic methods used are incomplete (software packages). Which model was chosen to create the distance matrices? There is a reference to supplemental data (distance matrices) that were not available. The methods used to assign taxonomy are not described clearly.

Author response: As suggested, we have substantially revised the methods section based on reviewer comments to clarify the metabarcoding and bioinformatic methods used. We outline the complete PCR protocol and the sequencing methods used to conduct DNA metabarcoding (lines 128 – 187). Additionally, we provide a complete description of our bioinformatic methods, including software packages used in our analysis (Lines 189 – 230). We describe how distance matrices were created (“A simple distance, the proportion or the number of sites that differ between each pair of aligned sequences, matrix was constructed with the R package Ape (Ver. 4.1) dist.dna() matrix function (35). These matrices were then used to obtain the maximum within species genetic variation and the minimum genetic distance between species of the same genus and this was completed through a custom R script” Lines 224-228) and methods used to assign taxonomy (“To support taxonomic identifications, the discriminatory ability of the amplified gene regions was assessed. A barcode gap analysis was used to ascertain if the gene region, using publicly available data, was able to reliably place a sequence to a taxonomic group. To have a barcode gap there must be less variation within a group then there is between the members of that group and all other members outside the group (33). To provide the most robust analysis of this gap, distances between all elements were calculated and the highest difference between members within the target group is compared to the smallest distance between members of the target group and all other members outside the target group. COI gene region BLAST identifications were pulled back to genus if no gap was obtained from our gap analyses. With the rbcLa gene region, taxa with no barcode gap using species level taxonomic identifications were tested between genera. If there was no apparent gap at the level of genus taxonomic identifications were pulled back to Family. Placing a higher-level taxonomic identification at genus for animals using a segment of the COI and family for plants using a segment of the rbcLa follows established methods (25).” Lines 202-215).

2. It is noted that custom databases were created from data obtained from BOLD and Genbank. However, there are no reference species lists noted that were used to create these custom reference databases. Furthermore the accession numbers of the sequences used to create the reference bases are missing as well as the criteria used to choose the sequences for these custom reference databases.

Author response: Thank you, we agree more clarity is necessary here. We have significantly increased the content in the methods section to address this and other methods-related comments. We have noted the content of the reference databases and referenced the paper where our methods were derived and the updated text reads: “After these steps, each read was taxonomically assigned using BLAST and three databases: all flowering plant rbcLa sequences from BOLD, all insect COI sequences from GenBank, and all fungi ITS2 sequences from BOLD (Oct 2017).” lines 194 – 197).

3.In the results section the metabarcoding results are again incomplete. Of the 147 taxa detected give a list of how many and which taxa, up to family level, genus level and species level? What was the results of the gap analyses for each of the custom databases?

Author response: As suggested, we have added this information to the results section ( “Of the three gene regions, raw sequencing results for the ITS2 focusing on fungal taxa had between 0 and 6349 reads, rbcLa focusing on the plant taxa had between 5 and 852,304 reads, and COI-5P focusing on animal taxa had between 0 and 267,578 reads for a total of 3,761,841 (Supplemental file S5). After trimming and filtering there were 2,775,314 reads across 20 samples representing 1469 unique sequences, 1092 from the COI-5P, 370 from the rbcLa, and 7 ITS2 (Supplemental file S6). Unique sequence reads were collapsed into groups based on taxonomic assignment and taxonomic placement with enough data in public databases were evaluated using a DNA barcoding gap assessment (Supplemental file S4). After collapsing taxonomic assignments and assessing the taxonomic placement, 147 unique taxa were obtained as likely taxa from the nestling fecal sacs, 2% (3/147) were vertebrates (wood frog; Lithobates sylvaticus and a shrew; Sorex cinereus), 74% (109/147) were arthropods, and the remaining 24% (35/147) were plants. Overall, Araneae (30 spiders) and Lepidoptera (44 moths and butterflies) represented a majority total food items detected (Figure 1), but the most common food items detected across fecal sacs were Ericales plants (heather and allies; most commonly blueberries in our study), which were detected in 51% of the fecal sacs.”; lines 288 – 305) and referenced the supplemental file that contains all of the specific information on the analyses.

4.In the discussion the authors refer to only the detected food items and not to the relative abundances of the food items that can shed much more light on the diets especially during winter and non-winter and adults vs nestlings. It can even shed more light on cached vs fresh food items. Through the use of FOO and RRA biases could be explored as well as the possibility of using cached items.

Author response: We agree with the reviewer’s comment that abundance information would be valuable for assessing diet. However, the reason that we have not discussed this in detail is that none of the methods, with the exception of stable isotopes, provide information on abundance. And, because the stable isotope analysis did not strongly agree with the other methods, we are reluctant to discuss the results of this method in the context of abundance. Our Discussion also now includes Table 1 that we hope will help readers compare the limitations of, and the difficulties involved with, different methods of assessing diet.

5.In the Abstract no reference is made to the type of food items (plants, invertebrates) or how many taxa were detected. There is no mention of dmetabarcoding methods which forms a critical part of the study.

Author response: As suggested, we have now mentioned the types of food items considered in our study and how many taxa were detected. We briefly mention DNA metabarcoding but cannot elaborate further due to word count restrictions.

Specific comments (from attached file)

Line 53: ‘the’ missing

Author response: As suggested, we have included ‘the’ in this statement (line 46).

Line 59: ‘and’ missing

Author response: As suggested, ‘and’ has been inserted (line 59).

Line 77: parentheses needed

Author response: As suggested, parentheses have been included (line 77).

Line 137: More information is needed on the PCR, the PCR protocol followed, enzyme used

Author response: We have significantly added to the methods section and included all of the details surrounding the PCR amplifications for the molecular work. This includes all reagent mixes/volumes, the number and order of the PCRs, and the reaction conditions of the PCR reactions (lines 152 – 187).

Line 152: Sequencing method? By whom was the sequencing done?

Author response: Thank you, the sequencing method and specifics have been included in an updated methods section, including the location of sequencing at the University of Guelph Centre for Biodiversity Genomics (lines 167 – 187).

Line 155: Which package was used to do the quality filtering or did the authors rely on the MULTIQC reports? Which package was used to remove the primers and adapters?

Author response: Both of these methods are now included in the updated methods section. Cutadapt was used to trim sequences and is referenced in the methods (lines 190 – 191). The quality filtering was completed using the fastq quality scores and the sickle tool and the citation is included in the updated methods (lines 189 – 201).

Line 156: Were both the forward and reverse reads used and merged before collapsing identical sequences? Which packages were used to do the bioinformatics in? Were OTUs or ESVs used?

Author response: The sequence reads were generated using Ion Torrent. The data were not reduced to OTU/ESV’s. We have updated the methods section to provide more clarity about the process of analysing the data and include references to the informatic tools used. We have included the sentence “No clustering of the data occurred and all quality filtered reads were used in the taxonomic identification step.” (lines 197 – 198).

Line 160: The methods used to assign taxonomy is unclear.

Author response: The taxonomic assignment was determined using a BLAST approach against focused libraries and where the top BLAST hit was used for taxonomic placement. We have updated the methods section and believe that this question is reflected in these updates (lines 194 – 197). Revised text reads: After these steps, each read was taxonomically assigned using BLAST and three databases: all flowering plant rbcLa sequences from BOLD, all insect COI sequences from GenBank, and all fungi ITS2 sequences from BOLD (Oct 2017).

Line 163: Is there a table containing the species list of the custom database available? How was it obtained? According to which sequence criteria were the sequences for the custom databases acquired?

Author response: We have provided more information on the contents of the databases used to BLAST the experimental sequences against in the updated methods section (lines 202 – 230). Description of the library datasets included noting the taxonomic focus with the following sentence: “After these steps each read was taxonomically assigned using BLAST and three databases including all flowering plant rbcLa sequences from BOLD, all insect COI sequences from GenBank, and all fungi ITS sequences from BOLD (Oct 2017).” (lines 194 – 197)

Line 184: Using which model?

Author response: Yes, we agree that this needed some clarification. There was no model of molecular evolution used. We applied simple distance and to clarify this we have updated the sentence to include the phrase “, the proportion or the number of sites that differ between each pair of aligned sequences,” (lines 224 – 225)

Line 187: This data was not available to me. Only two tables were available as supplemental data.

Author response: Supplemental data have been included in the revised submission.

Line 242: How many raw reads were obtained? Were the sequences obtained deposited onto a database?

Author response: Thank you, for clarification we have significantly updated the methods section for the metabarcoding and we have added several sentences to the beginning of the Results section to clarify the general results of the metabarcode analyses (lines 288 – 305).

Line 248: The results reported on the sequencing are incomplete, and the reference custom databases are not discussed or shown. No results are reported for the gap analysis. Results on the identification of the DNA barcodes are also limited. There is no information added to the table indicating the acronyms for the methods used.

Author response: Thank you for the comment. We have updated the methods section to include all steps taken during the sequencing and taxonomic identification steps (revised text: “To assess if a barcode gap exists for the BLAST identifications, higher taxonomic levels (family for plants and either family or genus for animals depending on the manageable size of the data set and the number of BLAST identified species from the same genus or family) were used to obtain sequence data sets from the BOLD system (Manually downloaded July and August 2018 and unique identifiers are included in the supplemental file S1). Sequences were aligned (MAFFT: 31) and trimmed to target region using MEGA (Using the primers included in Supplemental file S2: 34). Sequences were removed from further analysis if they had greater than 2% unknown nucleotides or if they had greater than 12 gap characters (‘-‘) at either the 3’ or 5’ ends of the sequences. A simple distance, the proportion or the number of sites that differ between each pair of aligned sequences, matrix was constructed with the R package Ape (Ver. 4.1) dist.dna() matrix function (35). These matrices were then used to obtain the maximum within species genetic variation and the minimum genetic distance between species of the same genus and this was completed through a custom R script (see Supplemental Data S3). All within and between taxa values used to assess the specificity of the taxonomic assignments are reported (Supplemental File S4).” lines 216 – 230).

Lines 261 - 263: Give examples of families or genera detected.

Author response: As suggested, we have included examples of families and genera detected (revised text: “After collapsing taxonomic assignments and assessing the taxonomic placement, 147 unique taxa were obtained as likely taxa from the nestling fecal sacs, 2% (3/147) were vertebrates (wood frog; Lithobates sylvaticus and a shrew; Sorex cinereus), 74% (109/147) were arthropods, and the remaining 24% (35/147) were plants. Overall, Araneae (30 spiders) and Lepidoptera (44 moths and butterflies) represented a majority total food items detected (Figure 1), but the most common food items detected across fecal sacs were Ericales plants (heather and allies; most commonly blueberries in our study), which were detected in 51% of the fecal sacs. The taxonomic identifications of all consumed prey items are provided in Supplementary Table S1.”, lines 297 – 305).

Line 304: I am of the opinion that it will enhance it and not merely duplicate.

Author response: We agree with this comment and have highlighted the benefit of using dDNA metabarcoding below this point in our discussion (“As a practical matter, however, we found no examples in our study where this potential handicap detracted from dDNA metabarcoding’s otherwise superior ability to identify taxa ingested by Canada jays and to determine which items had been “likely cached”. Not only does metabarcoding permit the identification of soft-tissue or small food items that are easily missed by stomach contents analysis (44–46), but also, compared to stomach contents analysis, it allows far more samples to be collected, far more easily, and importantly, without sacrificing individuals. In just three seasons (2015 - 2017) of measuring nestlings, we were able to double the number of identified nestling diet items that we had obtained during the previous 40 years through chance acquisitions of stomach contents and castings. A further advantageous feature is that the technique could be extended to more accurately assess the diets of not just nestling Canada jays, but also those of adults, including the winter diets that are of greatest interest. We have no doubt that it would be eminently feasible to capture winter adults and safely hold them in a suitably appointed small cage until they produced a fecal sample. By the same reasoning, it should be possible, in principle, to use fecal dDNA metabarcoding far beyond the limited use we have made of it here and we encourage others to do so.” Lines 384 – 399).

Reviewer #4: I would like to thank the authors and the editor to give me the opportunity to read the manuscript titled “Fecal DNA metabarcoding helps characterize Canada jay diet and confirms the reliance of stored food for winter survival and breeding”. I found the manuscript very interesting and I think the study has merit, especially in combining molecular and non-molecular techniques.

Author response: We are glad to hear that the reviewer enjoyed reading our manuscript and thank them for their valuable comments that have improved the manuscript.

My expertise is mostly revolving around DNA metabarcoding, so I focused my review on this technique. Unfortunately, I am afraid the manuscript in its current form requires a major revision since most of the information on the use of metabarcoding are missing or are incomplete.

However, I strongly encourage the authors to address my comments and concerns and provide a revised version of their work, since it will be an important tool for future research.

Author response: As suggested, we have revised the methods section of our manuscript to provide additional detail that was missing or incomplete in our initial submission. We provide specific responses, including highlighting text sections which have changed and line numbers, to your comments below.

Main comments:

If the ITS marker could not be amplified (despite following published protocols), the authors should probably remove any reference to it everywhere in this manuscript.

Author response: Despite not being able to amplify the ITS marker, we think it is important to say that this was attempted as fungi could be an important part of the Canada jay diet and future studies could attempt to conduct this analysis.

Metabarcoding methods needs to be improved:

Page 7: No information is provided for the sequencing. I think it is very important to provide this kind of information in detail:

- What did the authors do after PCR amplification?

- How did they prepare the sequencing library?

- Were the samples run in duplicate?

- Were there controls to account for plant/fungi present in the nest but not part of the diet?

- Were dual unique indexes used?

- Who performed the sequencing (in-house or outsourced)?

- What platform did the authors used to perform the sequencing?

- Were all markers pooled in the same sequencing run?

Author response: Thank you. We have included an updated methods section as per suggestions from reviewers. In this section we believe that we have addressed all of the concerns noted above with the lack of methodological specifics (lines 128 – 187).

Page 8: Metabarcoding reads quality control is not really mentioned, except for size and primer trimming. Were there any chimera removal? Were coding genes checked for stop codons? Especially for fungi, was there any threshold-based OTU clustering?

Author response: Thank you for the questions. This work utilized unidirectional Ion Torrent sequencing so there was no need to merge reads. With no merging there was also no need to check for chimeric sequences. We did not employ checks for the coding markers. And there was no clustering that was completed. We have included clarifying language for the instrument and lack of clustering in the updated methods section (lines 184 – 198). Revised text reads: “Cleaned and normalized products used for library construction following Prosser and Hebert (23) and were then sequenced unidirectionally on an Ion Torrent PGM using a 318 v.2 chip at the University of Guelph Centre for Biodiversity Genomics, following the manufacturer’s instructions.

Bioinformatics

Sequencing data was demultiplexed using the gene region and MID tags. Each resulting data set (three gene regions for each sample) were analysed informatically by first removing primer and adapter sequences (31) ; see Prosser and Hebert (23) for sequences used), removing reads shorter than 100 bp, removing reads with low quality scores (QV < 20) (github.com/ucdavis-bioinformatics/sickle), and then by de-replicated reads with 100% identity (http://hannonlab.cshl.edu/fastx_toolkit/index.html). After these steps, each read was taxonomically assigned using BLAST and three databases: all flowering plant rbcLa sequences from BOLD, all insect COI sequences from GenBank, and all fungi ITS2 sequences from BOLD (Oct 2017). No clustering of the data occurred, and all trimmed and quality filtered reads were used in the taxonomic identification step.”

Page 9: please include software references for R/RStudio and package version for each of the packages used.

Author response: As suggested, we have made sure to include citations for all software and R packages used in our analyses (lines 222 – 230).

Results:

The authors should summarise the fate of the raw reads obtained using metabarcoding, ideally for each genetic marker. For example:

“A total of XX millions raw reads was generated from XX samples. Of these, X thousand reads passed quality controls. A total of XX taxa could be identified based on COI, XXX taxa for ITS, XXX taxa for Rbcl”

Author response: As suggested, we have added this information to the results section (lines 228 – 306). We have included a number of sentences detailing the outcomes of each of our analysis steps starting at the beginning of the results section. The revised text now reads: “Metabarcoding results from the fledgling faecal samples were represented by 20 biological collections representing three gene regions. Of the three gene regions, raw sequencing results for the ITS2 focusing on fungal taxa had between 0 and 6349 reads, rbcLa focusing on the plant taxa had between 5 and 852,304 reads, and COI-5P focusing on animal taxa had between 0 and 267,578 reads for a total of 3,761,841 (Supplemental file S5). After trimming and filtering there were 2,775,314 reads across 20 samples representing 1469 unique sequences, 1092 from the COI-5P, 370 from the rbcLa, and 7 ITS2 (Supplemental file S6).” (lines 288 – 294).

Raw data for metabarcoding analysis should be made publicly available.

Author response: We agree and these data will be made publicly available once the manuscript is accepted.

Minor comments:

Line 132: Change “molecular regions” to “gene regions”.

Author response: As suggested, we have changed ‘molecular regions’ to ‘gene regions’ (line 129).

Line 133: Italicise the name of the genes when reported in full. “Cytochrome c Oxidase” should be “Cytochrome c Oxidase” and so on.

Author response: As suggested, all gene names have been italicized when reported in full (lines 130 – 133).

Line 139: the expression “molecular gene region” is used a number of times. I think the use of “molecular” in this case is redundant since any gene region is “molecular” in a way. Can you please remove all instances of “molecular” and leave only “gene region”? At line 138, “molecular marker” is correct and can be used since it specify the type of marker.

Author response: As suggested, we have changed all references to ‘molecular gene region’ to gene region’.

Lines 144-145: repetition of “well established” twice over two sentences. If you could remove one instance, it may make the sentence flow better.

Author response: As suggested, we have removed the first ‘well established’ from this statement (lines 140 – 143).

Line 147: Change “molecular regions” to “gene regions”.

Author response: As suggested, we have changed ‘molecular region’ to ‘gene region’.

Line 155: This sentence is missing something “Primers were trimmed from each read and filtered again by a minimum size of 100 bp”. The authors trimmed the primers and then filtered. the reads? As it stands, the subject of the second sentence is still “primers”.

Author response: As suggested, we have updated the methods section and clarified these points (lines 189 – 194). Revised text reads: “Sequencing data was demultiplexed using the gene region and MID tags. Each resulting data set (three gene regions for each sample) were analysed informatically by first removing primer and adapter sequences (31) ; see Prosser and Hebert (23) for sequences used), removing reads shorter than 100 bp, removing reads with low quality scores (QV < 20) (github.com/ucdavis-bioinformatics/sickle), and then by de-replicated reads with 100% identity (http://hannonlab.cshl.edu/fastx_toolkit/index.html).”

Line 244 (and line 226): Following the rules of the International Code of Zoological Nomenclature, only names of genera and species should be italicised, but not families and orders. Araneae and Lepidoptera (and Zapodidae) should not be italicised. Also the abbreviation “sp.” should not be italicised (line 227).

Author response: As suggested, families are no longer italicized (lines 299 – 303).

Line 246: I am less familiar with plant taxonomy, but I think the above rule also applies there. Ericales should not be italicised.

Author response: As suggested, Ericales is no longer italicized (line 302).

Line 247: no need of upper case for “blueberries”.

Author response: As suggested, blueberry is no longer capitalized (line 303).

Reviewer #5: This is a very interesting work, and only a few issues need the author's attention.

Author response: We thank the reviewer for positive assessment of the manuscript and for their helpful comments.

1. More literature on DNA fecal metabarcoding needs to be added in the introduction section.

Author response: As suggested, we have provided additional literature to the introduction.

2. The methods section needs to be greatly simplified, especially the section on sequencing and bioinformatics. In addition, it is necessary to indicate which sequencing platform was used for sequencing.

Author response: Several other reviewers asked for more information in this section. We have included details requested by other reviewers while trying to ensure the methods are as simplified as possible.

3. What does "nestlings" mean?How did this research get samples of nestlings?Do young birds at different stages of development need to be distinguished?

Author response: Nestling is a widely accepted term in ornithological research that describes a developing individual in a nest. While developmental stages could be differentiated, we do not believe this level of detail is necessary in our analysis.

4. Why not use ITS2 to amplify plants, as suggested in this paper “Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species”?

Author response: Thank you. We have provided more information in the discussion about the choices made for our methods and potential other options for studies in the future. To address your specific point regarding the ITS2 gene region and identifying plant species we have included the following sentence in the discussion. “For example, while the ITS2 region was amplified and sequenced for use in fungal identification (although unsuccessfully most likely due to the degree of degradation occurring for these soft bodied organisms), the ITS region has been successfully utilized for identification of other taxa including Canadian flora (43). However, due to the time and cost constraints to amplify the gene region using additional primers, in addition to the poorly populated sequence libraries for this taxonomic group, this approach was not feasible for this study.” (lines 371 – 377).

5. Could you consider using "shotgun metabarcoding" for further research? At least the relevant discussion should be added to the discussion section. Relevant literature can be used for reference, for example “The species identification in traditional herbal patent medicine, Wuhu San, based on shotgun metabarcoding”.

Author response: Thank you for the suggestion. We have included the following sentence in the discussion section to highlight the possibility. “One possible solution to primers poorly amplifying taxa, or alternatively, primers preferentially amplifying taxa, is to take a PCR-free approach and, instead, obtain sequences for all available DNA in the samples. This is sometimes referred to as a shotgun metagenomics sequencing and while this approach may help to address some primer concerns and could offer potential for prey biomass estimates it is not without challenges in application, including high costs and the generation of large datasets leading to computationally intensive analyses (41, 42).” (lines 361 – 366).

Attachment

Submitted filename: CAJA Diet_Response to Reviewers_final.docx

pone.0300583.s010.docx (42.9KB, docx)

Decision Letter 1

Petr Heneberg

26 Dec 2023

PONE-D-23-24828R1Fecal DNA metabarcoding helps characterize the Canada jay’s diet and confirms its reliance on stored food for winter survival and breedingPLOS ONE

Dear Dr. Sutton,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Feb 09 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Petr Heneberg

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: N/A

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: My main concerns have been addressed and I am satisfied with the answers and corrections.

Methods are now described in detail.

One note is that if you were following standard kits (i.e. for DNA extraction) and followed manufacturers instructions then you can skip giving all the details and just say "we used this-and-that kit following manufacturers instructions".

Small comments:

general: Degrees symbol doesn't appear well in the PDF. To be fixed in formatting.

general: Sometimes you write "COI" and sometimes "COI-5P". I'm not sure what the difference is. Usually "COI" is sufficient.

Line 126-127: Thanks for clarifying. In this case you can just write that it was stored in -20 degrees within 12hrs of collection and until further processing.

Line 130: You should write full name of the region first - Cytochrome c Oxidase I - and abbreviation in parentheses. Also only "c" should be italicised. Check it throughout the text.

Line 131: Same with ITS. First write the full name of the fragment and then add abbreviation in parentheses.

Line 132: same

Line 224: Thanks for clarifying what the distance metric is. Now I think that "simple" is necessary. This is not a precise word. Just delete it and rephrase.

Reviewer #3: I want to commend the authors for the effort they put in to improve the manuscript and the attention they paid to reviewers’ comments. The authors made major changes to the manuscript that improved the value of the manuscript considerably. I like to recommend that the authors also consider the UNITE database for fungal sequence identification in possible future studies using DNA metabarcoding. I have noticed a few minor errors in the methods section:

Line 173: 10 µM (not lM) primer, 0.0625 μL of 10 mM dNTP (KAPA Biosystems), 0.060 µL (not lL) of 5U/μL PlatinumTaq

Line 175: 175 reactions consisted of 94 °C for 5 min, 40–60 cycles (40 for ITS2, 60 for rbcLa (not A) and COI)

Reviewer #4: I am very happy with the answers and changes provided by the authors.

I think the manuscript is now ready to be accepted.

I have only two very minor remarks:

- Lines 129-145: the paragraph on the selection of the markers should go with the PCR section, not before the DNA extraction. DNA extraction and purification happen before PCR, and they should be reported in this order.

- Check the whole manuscript for the use of the degree symbol °C. Not sure what symbol was used instead, but it really looks weird.

Great work, looking forward to see this published! Congratulations!

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: No

Reviewer #4: No

**********

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Apr 24;19(4):e0300583. doi: 10.1371/journal.pone.0300583.r004

Author response to Decision Letter 1


27 Feb 2024

We thank the reviewers for their comments on our revised manuscript. Our responses to their comments are highlighted in the manuscript in red and bolded below for clarity.

Reviewer #2: My main concerns have been addressed and I am satisfied with the answers and corrections.

Methods are now described in detail.

One note is that if you were following standard kits (i.e. for DNA extraction) and followed manufacturers instructions then you can skip giving all the details and just say "we used this-and-that kit following manufacturers instructions".

Author response: Thank you for your suggestion. We have left the description of our manual extraction methods in the manuscript because it describes the methods followed throughout the analysis.

Small comments:

general: Degrees symbol doesn't appear well in the PDF. To be fixed in formatting.

Author response: As suggested, we have made sure all degree symbols are now being displayed appropriately.

general: Sometimes you write "COI" and sometimes "COI-5P". I'm not sure what the difference is. Usually "COI" is sufficient.

Author response: There is a difference between COI and COI-5P. For consistency and to reflect the fact we used the COI-5P, we have made sure all mention of COI throughout the manuscript has been changed to COI-5P.

Line 126-127: Thanks for clarifying. In this case you can just write that it was stored in -20 degrees within 12hrs of collection and until further processing.

Author response: As suggested, we have revised the text to state this.

Line 130: You should write full name of the region first - Cytochrome c Oxidase I - and abbreviation in parentheses. Also only "c" should be italicised. Check it throughout the text.

Author response: As suggested, we have made sure to write the full name first and only italicize the “c”.

Line 131: Same with ITS. First write the full name of the fragment and then add abbreviation in parentheses.

Author response: As suggested, we have made sure to write the full name first.

Line 132: same

Author response: As suggested, we have made sure to write the full name first.

Line 224: Thanks for clarifying what the distance metric is. Now I think that "simple" is necessary. This is not a precise word. Just delete it and rephrase.

Author response: As suggested, we have removed the word “simple” and revised the sentence to describe the matrix that was created. The revised text now reads: “A distance matrix (the proportion or the number of sites that differ between each pair of aligned sequences) was constructed with the R package Ape (Ver. 4.1) dist.dna() matrix function (35).” (lines 224 – 226)

Reviewer #3: I want to commend the authors for the effort they put in to improve the manuscript and the attention they paid to reviewers’ comments. The authors made major changes to the manuscript that improved the value of the manuscript considerably. I like to recommend that the authors also consider the UNITE database for fungal sequence identification in possible future studies using DNA metabarcoding. I have noticed a few minor errors in the methods section:

Line 173: 10 µM (not lM) primer, 0.0625 μL of 10 mM dNTP (KAPA Biosystems), 0.060 µL (not lL) of 5U/μL PlatinumTaq

Author response: Thank you for catching these mistakes. We have revised the text to correct these errors.

Line 175: 175 reactions consisted of 94 °C for 5 min, 40–60 cycles (40 for ITS2, 60 for rbcLa (not A) and COI)

Author response: We now have removed the [] from the sentence.

Reviewer #4: I am very happy with the answers and changes provided by the authors.

I think the manuscript is now ready to be accepted.

I have only two very minor remarks:

- Lines 129-145: the paragraph on the selection of the markers should go with the PCR section, not before the DNA extraction. DNA extraction and purification happen before PCR, and they should be reported in this order.

Author response: As suggested, we have moved this paragraph to be below our description of the PCR method used and now appears on lines 177 – 187.

- Check the whole manuscript for the use of the degree symbol °C. Not sure what symbol was used instead, but it really looks weird.

Author response: We have made sure all uses of °C are now consistently formatted.

Great work, looking forward to see this published! Congratulations!

Author response: Thank you for the kind words and for your helpful comment on previous versions of the MS.

Attachment

Submitted filename: CAJA Diet_Final_Response to Reviewer_final.docx

pone.0300583.s011.docx (22.3KB, docx)

Decision Letter 2

Petr Heneberg

1 Mar 2024

Fecal DNA metabarcoding helps characterize the Canada jay’s diet and confirms its reliance on stored food for winter survival and breeding

PONE-D-23-24828R2

Dear Dr. Sutton,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Petr Heneberg

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Petr Heneberg

2 Apr 2024

PONE-D-23-24828R2

PLOS ONE

Dear Dr. Sutton,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Petr Heneberg

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Summary of all food items known to be consumed by Canada jays.

    The date and location the observation was made is included for each observation in addition to the identification of the food item being consumed. We also summarize additional information related to who made the record, what method was used to make the observation (“DO”–direct observation, “SC”–stomach content analysis, “FS”–metabarcoding of fecal samples), the age of the individual which consumed a food item (e.g., adult, juvenile, or nestling), and which of our four diet groups the observation belongs in (“A”–arthropod, “P”–“plants”, “V”–vertebrate tissue). Finally, for observations made during the winter and spring, we describe whether a food item was “likely-cached”, “likely fresh”, or if it could be either cached or fresh. Observations labeled with “ROM” represent stomach samples from the Royal Ontario Museum collection.

    (DOCX)

    pone.0300583.s001.docx (180.6KB, docx)
    S2 Table. Summary table of diet observations made during the winter for both adult and nestling Canada jays.

    Diet items were designated as “likely cached”, “likely fresh”, or “either possible” based on natural history information available for each food item. We further classified diet items based on whether they were consumed by adults or nestlings, and based on the method used to collect the diet item.

    (XLSX)

    pone.0300583.s002.xlsx (12.7KB, xlsx)
    S1 File. Barcode gap records.

    Custom dataset containing sequence data sets from the BOLD system and used to assess if a barcode. Gap exists for BLAST identifications.

    (TSV)

    pone.0300583.s003.tsv (8.6MB, tsv)
    S2 File. A complete list of primers used during PCR amplification.

    All primers used in our analysis are outlined here in addition to being described in the method section. Available at 10.6084/m9.figshare.25422847.

    (TXT)

    pone.0300583.s004.txt (218B, txt)
    S3 File. R script used to calculate genetic distances between species identified using DNA metabarcoding.

    Available at 10.6084/m9.figshare.25422847.

    (DOCX)

    pone.0300583.s005.docx (24.1KB, docx)
    S4 File. Within and between taxa values used to assess the specificity of taxonomic assignments from our DNA metabarcoding analysis.

    (TSV)

    pone.0300583.s006.tsv (19.6KB, tsv)
    S5 File. Raw, unfiltered sequencing results from our DNA metabarcoding analysis.

    (TSV)

    pone.0300583.s007.tsv (4.8KB, tsv)
    S6 File. Trimmed and filtered BLAST results.

    (TSV)

    pone.0300583.s008.tsv (239.7KB, tsv)
    Attachment

    Submitted filename: PONE-D-23-24828_reviewer.pdf

    pone.0300583.s009.pdf (1.7MB, pdf)
    Attachment

    Submitted filename: CAJA Diet_Response to Reviewers_final.docx

    pone.0300583.s010.docx (42.9KB, docx)
    Attachment

    Submitted filename: CAJA Diet_Final_Response to Reviewer_final.docx

    pone.0300583.s011.docx (22.3KB, docx)

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES