Skip to main content
eLife logoLink to eLife
. 2019 May 14;8:e44628. doi: 10.7554/eLife.44628

Independent amylase gene copy number bursts correlate with dietary preferences in mammals

Petar Pajic 1,2, Pavlos Pavlidis 3, Kirsten Dean 1, Lubov Neznanova 2, Rose-Anne Romano 2, Danielle Garneau 4, Erin Daugherity 5, Anja Globig 6, Stefan Ruhl 2,, Omer Gokcumen 1,
Editors: George H Perry7, Diethard Tautz8
PMCID: PMC6516957  PMID: 31084707

Abstract

The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several gene copy number gains in humans (Perry et al., 2007), dogs (Axelsson et al., 2013), and mice (Schibler et al., 1982), possibly along with increased starch consumption during the evolution of these species. Here, we present comprehensive evidence for AMY copy number expansions that independently occurred in several mammalian species which consume diets rich in starch. We also provide correlative evidence that AMY gene duplications may be an essential first step for amylase to be expressed in saliva. Our findings underscore the overall importance of gene copy number amplification as a flexible and fast evolutionary mechanism that can independently occur in different branches of the phylogeny.

Research organism: Human, Mouse, Rat, Rhesus macaque, Other

eLife digest

Many mammals can digest starch by using an enzyme called amylase, but different species eat different amounts of starchy foods. Amylase is released by the pancreas, and in certain species such as humans, it is also created by the glands that produce saliva, allowing the enzyme to be present in the mouth. There, amylase can start to break down starch, releasing a sweet taste that helps the animal to detect starchy foods.

Curiously, humans have multiple copies of the gene that codes for the enzyme, but the exact number varies between people. Previous research has found that populations with more copies also eat more starch; if this correlation also existed in other species, it could help to understand how diets influence and shape genetic information. In addition, it is unclear how amylase came to be present in saliva, as the ancestors of mammals only produced the protein in the pancreas.

Pajic et al. analyzed the genomes of a range of mammals and found that the more starch a species had in its diet, the more amylase gene copies it harbored in its genome. In fact, unrelated mammals living in different habitats and eating different types of food have similar numbers of amylase gene copies if they have the same level of starch in their diet.

In addition, Pajic et al. discovered that animals such as mice, rats, pigs and dogs, which have lived in close contact with people for thousands of years, quickly adapted to the large amount of starch present in human food. In each of these species, a mechanism called gene duplication independently created new copies of the amylase gene. This could represent the first step towards some of these copies becoming active in the glands that release saliva.

In people, having fewer copies of the amylase gene could mean they have a higher risk for diabetes; this number is also tied to the composition of the collection of bacteria that live in the mouth and the gut. Understanding how the copy number of the amylase gene affects biology will help to grasp how it also affects health and wellbeing, in humans and in our four-legged companions.

Introduction

Diet has been a significant adaptive force in shaping human and nonhuman primate variation (Hardy et al., 2015; Milton, 1981; Zhang et al., 2002). One of the best-described examples of diet-related adaptation is the expansion of the copy number of the amylase gene in concordance with the increase of starch consumption in the human lineage (Perry et al., 2007), likely postdating the human Neanderthal split (Inchley et al., 2016). A gene duplication in the ancestor of Old World monkeys and great apes initially led to the formation of two amylase genes (AMY2A and AMY2B) with pancreas-specific expression (Samuelson et al., 1990). Then, a subsequent gene duplication in the ancestor of great apes led to the formation of AMY1 which gained salivary gland-specific expression (Meisler and Ting, 1993). In the human lineage, further gene copy number gains of AMY1 led to increased expression of the AMY1 enzyme in human saliva (Perry et al., 2007). Gene copy numbers of both AMY1 and AMY2 vary in different human populations (Carpenter et al., 2015; Usher et al., 2015), the former correlating with the extent of traditional starch consumption in these communities (Perry et al., 2007).

While the evolution of the amylase locus in the human lineage is well described, its evolution in other mammals is less well understood. Some studies have produced intriguing findings. For example, it was shown that mice, rats, and pigs express substantial levels of amylase in their saliva (Boehlke et al., 2015; Janiak, 2016) In addition, the amylase locus has been shown to be evolving under positive selection in dogs and in house mice (Reiter et al., 2016; Staubach et al., 2012). However, a comprehensive analysis of the evolutionary dynamics shaping the amylase locus across mammals is missing. Within this context, one interesting question is how amylase has evolved in animals who live in a commensal relationship with humans. Recent studies, for instance, have shown that dogs gained multiple copies of the amylase gene after their split from the wolf 30,000–40,000 years ago (Skoglund et al., 2015). This might likely be a result of their domestication, which exposed them to human food leftovers rich in starch (Axelsson et al., 2013; Botigué et al., 2017; Ollivier et al., 2016; Reiter et al., 2016). Thus, the evolution of amylase in other domesticated or human commensal mammals remains an alluring area of inquiry. Similarly, our understanding of the evolution of the amylase locus within the primate lineage remains limited. For instance, it is not known why some Old World monkeys have substantial amylase enzymatic activity in their saliva, despite missing the amylase duplication found in great apes (Janiak, 2016).

Here, we address three areas of inquiry with regard to the evolution of the amylase locus in mammals: (i) Can the link between diet and amylase evolution, suggested in the human lineage, be generalized to other mammals? (ii) What are the evolutionary forces that shape amylase copy numbers in mammals? (iii) What are the genetic mechanisms in different mammals leading to expression of amylase in salivary glands? To answer these questions, we pursued a comprehensive investigation of amylase gene copy numbers and salivary expression across multiple mammalian lineages.

Results and discussion

Amylase copy number gains occurred in multiple mammalian lineages independently

The human lineage-specific amylase gene duplications were initially thought to represent a unique case of evolutionary adaptation to increased starch consumption in humans (Perry et al., 2007). Up to five more haploid copies of the amylase gene can be found in humans than in chimpanzees. Therefore, the recent revelation that a similar increase in amylase gene copy number occurred in dogs (Axelsson et al., 2013; Ollivier et al., 2016) is remarkable since it shows that the same gene underwent bursts of gene copy number gains in two separate species independently. Copy number variation was also noted in pigs (Paudel et al., 2013). To comprehensively investigate amylase gene copy number gains in other mammalian lineages, we conducted a digital droplet polymerase chain reaction (ddPCR)-based analysis of amylase gene copy numbers from 204 DNA samples across 46 species encompassing all major branches of the mammalian phylogeny. In addition to humans and dogs, we discovered similar bursts (i.e. gains of more than one copy) of amylase gene copy number in mice, rats, pigs, and boars (Figure 1—figure supplement 1, Supplementary file 1).

We hypothesized that the elevated gene copy numbers observed in different branches of the mammalian phylogeny (Figure 1) result from independent duplication events. An alternative explanation would be that the ancestor of placental mammals had multiple copies of the amylase gene, which were subsequently lost in certain mammalian lineages. To distinguish between these two scenarios, we constructed from available reference genomes a maximum likelihood tree of amylase coding sequences (Figure 2A, see Figure 2—figure supplement 1 for a more comprehensive tree with outgroups included). Our results showed that amylase genes within a given species are more similar to each other than they are to those of other species. One explanation for this observation could be that duplications of the amylase gene might have occurred in each lineage independently.

Figure 1. Amylase gene copy number bursts across mammals.

Boxes represent all haploid amylase gene copies found in representative species or among clades across the mammalian phylogeny (see Supplementary file 1 for a comprehensive dataset). Lighter colored boxes represent the variation in copy numbers found in at least two individuals of a given species or in reference genomes of at least two species within a clade. Triangles at the end of branches indicate that copy numbers of more than one species belonging to the same clade were shown together as a single column. The numbers in parentheses following clade names indicate the number of species used for estimating the gene copy numbers.

Figure 1.

Figure 1—figure supplement 1. Primer design and ddPCR accuracy.

Figure 1—figure supplement 1.

(a) Primer/probe sets were designed to match all the AMY gene copies in a given reference genome. For example, in the case of the five haploid copies in humans, the primers and probe were designed to match 100% to all five gene copies of amylase. (b) A schematic phylogeny showing the strategy behind primer design for species where a reference genome was not available. For example, to design a primer/probe set for zebra, we designed oligonucleotide sequences that match the closest relatives of zebra for which reference genomes were available (in this case: donkey and horse). (c) Correlation between AMY gene copy numbers estimated by BLAST analysis of reference genomes and our estimates by ddPCR analysis (mean gene copy number of all individuals measured for any given species). The y-axis represents the haploid copy numbers obtained from available references genomes. The x-axis represents copy numbers derived from our ddPCR analysis (n = 31 species). A linear regression line is plotted to visualize the correlation.

Figure 2. Amylase duplications evolved independently.

(a) Maximum likelihood tree based on phylogenetic similarity of amino acid sequences of amylase gene copies translated from copies identified by BLAST. Bootstrap values are shown next to the major branch splits. A larger tree containing all bootstrap values and additional outgroup species can be found in Figure 2—figure supplement 1. (b) Types and locations of retrotransposons associated with amylase gene copies found in human, mouse, rat, dog and pig reference genomes. Small green boxes symbolize the positions of the retrotransposons. Arrows above individual amylase gene copies indicate the direction of transcription. Distances in kilobases between amylase gene copies and associated retrotransposons are shown above the green boxes. Non-assembled contigs are shown in dotted frames. Two major gaps around the third retrotransposon in the rat amylase locus are indicated as line breaks.

Figure 2.

Figure 2—figure supplement 1. Expanded maximum likelihood amylase gene tree of mammalian reference genome sequences (PhyML).

Figure 2—figure supplement 1.

The number in parentheses distinguishes specific amylase genes in a given species. If AMY gene copies were previously annotated, we used that annotation (e.g. A2a5 in the mouse is amylase A2a5 by USCS Gene Track). If the gene copy was not annotated, numbers were used for different copies (shown in parentheses).
Figure 2—figure supplement 2. RNA-sequencing data for expression of amylase genes in mouse parotid salivary gland.

Figure 2—figure supplement 2.

Green boxes on the x-axis represent the gene order on the mouse reference genome. The y-axis is drawn in log scale and represents the fragments per kilobase of exon per million reads (FPKMS) from RNA sequencing. Vertical bars designate the average FPKMS read coverage for RNA from two adult mice (12 weeks of age) for their parotid salivary glands. The gene schematic diagram displays the RNA sequencing coverage across the exons of amy1. Data were extracted from Gluck et al. (2016).

Yet another explanation could be that lineage-specific gene conversion events occurred among ancestral amylase copies. Indeed, inter- and intra-chromosomal crossover has been shown in the amylase locus of humans (Groot et al., 1990; Gumucio et al., 1988). Such a process, if it occurred frequently enough, could potentially generate high similarity among amylase gene copies in any given mammalian lineage. These two scenarios, independent lineage-specific duplication events and gene conversion among existing gene copies, are difficult to distinguish using phylogenetic analysis alone (Mendes et al., 2018). To solve this conundrum in humans, Samuelson et al. searched for lineage-specific signatures associated with individual gene copies in the amylase locus. They identified a retrotransposon (HERV_a_int) inserted upstream of a new amylase gene duplicate (AMY1) in the ancestor of great apes (Samuelson et al., 1990). This retrotransposon was found to be associated with all the additional AMY1 copies detected in humans (Perry et al., 2007). This finding strongly supported the notion that the duplications of AMY1 occurred after the human-chimpanzee phylogenetic split (Perry et al., 2007; Samuelson et al., 1990).

Based on that, we asked if similar retrotransposons or other genomic signatures could help us determine whether the amylase gene copy number bursts in other mammalian genomes occurred independently. We first interrogated the mouse reference genome, as it is adequately complete for such an analysis. Indeed, we found a mouse lineage-specific retrotransposon (L1Md_T) in the upstream region of five out of the seven mouse amylase gene copies. The presence of this retrotransposon along with the duplicated copies parallels the situation in humans (Figure 2B). By ddPCR analysis, we found 9–13 diploid copies of the amylase gene in brown rats and 6–7 copies in black rats and wood rats (Supplementary file 1). Considering the close phylogenetic relationship of rats and mice, we expected that the high copy number of amylase had evolved in their rodent ancestor. However, the L1Md_T retrotransposon was not found in rats. When we investigated the amylase locus in the rat genome, we found a rat-specific retrotransposon (L1_Rat3) inserted upstream of two out of the three rat amylase copies in the assembled rat reference genome (Figure 2B). Therefore, amylase gene duplications in rats likely occurred independently from the ones in mice. The amylase gene copy numbers in rats found by ddPCR do not match those annotated in the rat reference genome. This inconsistency could be due to major sequence gaps in the assembled amylase locus (indicated by line breaks in Figure 2B). As an additional complication, we discovered that one of the gene copies, amy1 in mouse (discussed in the next section within the context of salivary expression, see Figure 2—figure supplement 2), is shared with other rodent species including rats. Nevertheless, our results support that most amylase gene copy number gains occurred in mouse and rat lineages independently.

We also conducted similar analyses of the amylase locus in dogs and pigs (Figure 2B). Despite the fact that the assemblies of their genomes are not as complete as human and mouse reference genomes, we were able to investigate the genomic signatures in contigs that harbor distinct copies of amylase genes. In dogs, we found a canid-specific L1 element (L1_Canid) in all four amylase gene copies assembled across three different contigs. In pigs, we found an older lineage-specific L1 element (L1M3) downstream of all six amylase copies assembled across three contigs.

Overall, we found lineage-specific retrotransposons located in similar proximity (0.8–4.2 kb upstream or downstream) to multiple amylase gene copies in human, mouse, rat, pig, and dog reference genomes (Figure 2B). In each of these cases, the respective retrotransposons are in identical positions relative to the amylase gene within each species. We surmised that these retrotransposons inserted in proximity to an ancestral amylase copy in each species independently and were subsequently duplicated along with further gained amylase gene copies. These findings do not rule out that gene conversion as well as other mechanisms (incomplete lineage sorting, crossover events, and ancestral gene duplication polymorphisms) might have shaped variation in this locus. However, the fact that lineage-specific retrotransposons accompany amylase gene copy number gains in humans, mice, rats, dogs, and pigs, clearly points to lineage-specific duplications as a major driver of amylase gene copy number bursts in these species. The extent to which retrotransposons affect mutational or functional dynamics in the amylase locus remains an important area for future research.

Expression of amylase in saliva evolved in different mammalian lineages independently and was facilitated by gene copy number duplication

Ancestrally, amylase was a pancreatic enzyme in mammals. However, in certain mammalian species, amylase became expressed also in saliva (Chauncey et al., 1963). In humans, this acquisition of salivary gland-specific expression has been well explained (Ting et al., 1992). It has been shown that the above-described retrotransposon insertion along with the AMY1 duplicate in the ancestor of great apes was responsible for tissue-specific expression of this gene in salivary glands (Samuelson et al., 1990). Previous studies hypothesized that a similar, but independent gene duplication event led to the expression of amylase also in the saliva of mice (Meisler and Ting, 1993). It remained unresolved whether the mechanism that enabled the expression of amylase in mouse saliva is similar to that determined for humans. Moreover, even though some reports noted the expression of amylase in the saliva of various other mammalian species (Janiak, 2016), a comprehensive analysis of its expression across the mammalian phylogeny is still missing. Another unanswered question from an evolutionary perspective is what a potential adaptive benefit of expressing amylase in saliva could be. Even though amylase is a digestive enzyme, it is clear that in most mammals starch digestion primarily occurs through the activity of the pancreatic enzyme in the intestines rather than in the mouth (Fernández and Wiley, 2017).

To address these questions, we performed a screen across the mammalian phylogeny to investigate which lineages express amylase activity in saliva. We used a two-pronged approach, comprising a starch lysis plate assay (Figure 3A) and a high-sensitivity in-solution fluorescence-based amylase assay (Figure 3B). Currently, our study provides the most comprehensive documentation of salivary amylase activity in mammals, encompassing 127 saliva samples across 22 species (Supplementary file 1). This is a significant contribution given that previous studies varied considerably in sample preparation, methods of analysis, and assay sensitivity (Janiak, 2016).

Figure 3. Salivary amylase activity and relationship to gene copy number.

Figure 3.

(a) Representative starch lysis plate assay showing the extent of lysis caused by the enzymatic activity of amylase in the saliva of various mammalian species. The left panel shows side-by-side comparisons of domesticated species and their counterparts in the wild. The agar plate shows the lysis caused by representative primate saliva samples. (b) Density plots showing salivary amylase activity in different species as measured by a high-sensitivity colorimetric assay. The dotted line represents the mean amylase activity level determined in the serum of humans, rats, pigs, boars, and gorilla (mean: 0.034 U/mg, range: 0.01–0.077 U/mg). A full dataset can be found in Supplementary file 1. (c) Scatter plot of amylase activity and gene copy number in multiple species as measured by ddPCR. The dotted line represents the same serum activity shown in (b). Images of starch plate lysis caused by standard dilutions of amylase are shown to the left of the y-axis next to their corresponding activity values.

Our results showed that amylase activity in saliva is more widespread among mammals than previously thought (Figure 3B). In addition to species that were already known to express amylase in their saliva, we observed salivary amylase activity in some New World monkeys, boars, dogs, deer mice, woodrats, and giant African pouched rats (Supplementary file 1). It is important to note here that our findings also suggest that amylase activity in dog saliva varies from breed to breed (Supplementary file 1). It remains to be determined to what degree this variable expression of amylase in the saliva of different dog breeds might have been the result of older adaptive forces, perhaps related to dogs becoming companions of humans thereby exposed to a human diet, rather than merely occurring as a byproduct of recent intentional breeding for other traits.

To explain the expression of amylase in the saliva of some mammalian lineages, but not others, we considered two scenarios. First, it is possible that salivary expression of amylase could be an ancestral trait that was subsequently lost in most species. Second, it is possible that salivary expression may have evolved independently in different lineages. To distinguish between these two scenarios, we asked whether orthologous copies of the amylase gene are expressed in humans and mice. Based on previous work showing that AMY1 copies in humans are expressed in salivary glands, we asked which amylase copy is expressed in mouse salivary glands. By mapping parotid salivary gland RNA-Seq data (Gluck et al., 2016) to the mouse reference genome (mm9), we found that the copy annotated as mouse amy1 is expressed in salivary glands, while the other amylase gene duplicates have a negligible expression in that tissue (Figure 2—figure supplement 2). However, mouse amy1 is not orthologous to human AMY1 (Meisler and Ting, 1993). Furthermore, its amino acid sequence is distinct from those of other amylase copies in the mouse genome. This distinct amy1 sequence is shared with rat and other rodents including deer mouse, vole, Mongolian gerbil, and golden hamster. This suggests that the duplication event that led to the formation of mouse amy1, the copy that is expressed in mouse salivary glands, likely occurred in an ancestor of muroidea. Interestingly, the more recently acquired, mouse and rat-specific amylase gene duplications are not expressed in mouse salivary glands (Figure 2—figure supplement 2).

We could not find a mammalian species that underwent a ‘burst’ of amylase gene copy number that did not show concurrent salivary amylase activity (Figure 3C). Also, we found no species with less than four diploid amylase copies that showed any measurable amylase activity in saliva. It is important to note here that the relationship between amylase gene copy number and salivary amylase activity cannot be explained by linear correlation. For example, rhesus macaques and baboons have relatively low amylase gene copy numbers (5–6 diploid copies) but show high amylase activity in their saliva (514–1,652 units per mg of total protein). In contrast, dogs, which among mammals showed some of the highest gene copy numbers of amylase (9–19 diploid copies), express very low amounts of amylase in their saliva (0–9 units/mg). Overall, these results suggest that a gene duplication may be a necessary condition for amylase to become expressed in saliva. Subsequently, however, different regulatory architectures are likely responsible for the differences in amylase activity observed in saliva across different mammalian species.

A broad-range diet containing starch correlates with increased amylase gene copy number

For humans, it has been postulated that starch consumption has driven the increased gene copy number of the amylase gene through positive selection (Perry et al., 2007). For dogs, the rapid amylase gene copy number increase as compared to wolves (3–8 haploid copy number gain) has been associated with a transition during their domestication from a primarily meat-based diet to a diet enriched in starch (Axelsson et al., 2013). Based on these previous findings, we hypothesized that in other mammalian species, gains in gene copy number and the associated gain of amylase expression in saliva are likely driven by starch being a dietary component. Unfortunately, a systematic survey describing the amounts of starch consumption across mammals is lacking. Moreover, the amount of starch in the diet varies among subspecies, and sometimes even among geographically distinct populations of the same species (Pineda-Munoz and Alroy, 2014). Thus, to test the above hypotheses, we categorized mammals into those that consume specialized diets (strict carnivores or strict herbivores) and those consuming broad-range diets (including different amounts of starch). Based on extensive literature review (see Materials and methods), we subdivided mammals consuming broad-ranged diets further into those that consume diets containing high amounts of starch (humans, mice, brown and black rats, dogs, pigs, and boars) and those that consume low amounts of starch. We then conducted a comparative analysis of amylase copy number and salivary enzymatic activity among these categories (Figures 4A, B and C).

Figure 4. Amylase gene copy numbers and salivary enzyme activity in species with different dietary preferences.

Figure 4.

Box plots representing (a) AMY gene copy numbers and (b) salivary amylase activities (in units per milligram of total salivary protein) in mammalian species assigned by their dietary preferences into either specialized (carnivore or herbivore) or broad-range. The species consuming a broad-range diet were further sub-grouped into two categories based on the relative amounts of starch in their diet. (c) Estimates of statistical significances for the differences found between dietary groups. p-values were calculated for independent contrasts to account for phylogenetic confounding effects.

We found that species consuming a broad-range diet generally harbor significantly higher copy numbers of the amylase gene (Figure 4A). Specifically, the mean diploid amylase copy number among animals consuming specialized diets is 2.4, while it is 7.0 for those consuming broad-range diets. Corrected for phylogenetic dependence, a broad-range diet is significantly associated with increased amylase gene copy number (p =~9×10−6). Among species that consume a broad-range diet, we found that those who over recent evolutionary time gained access to abundant starch-rich foods — either through domestication (as in the case of dogs and pigs) or through dietary commensalism with humans (as in the case of house mice or brown and black rats) — harbor considerably higher copy numbers of the amylase gene (Figure 4A). The mean diploid amylase gene copy number is 11.6 for species consuming a starch-rich diet, while it is 5.0 for those that consume lower amounts of starch in their diet. When corrected for phylogenetic dependence, starch consumption (categorized into high- and low-starch content in diet) is significantly correlated with amylase gene copy number (p =~10−4).

We found that mammals consuming a broad-range diet also express on average a thousand-fold higher amylase activity in their saliva (~243.6 units/mg of total salivary protein) than those consuming specialized diets (~0.1 units/mg) (Figure 4B). When corrected for phylogenetic dependence, it appeared that consumption of a broad-range diet is significantly associated with the enzymatic activity of amylase in saliva (p =~10−4). Salivary expression of amylase, however, turned out to not be associated with the amount of starch consumption per se (Figure 4B). Rather, salivary expression of amylase appeared to be associated with the consumption of starch, regardless of the amount. Previous work in humans and in rats showed that salivary amylase is linked to the perception of starch (Mandel et al., 2010; Sclafani et al., 1987). Based on this, one possible evolutionary explanation could be that the ability to enzymatically liberate sugar from long-chain starch molecules for the perception of sweet taste might have provided a metabolic fitness advantage to mammalian species consuming a broad range diet. We like to point to two outliers in our dataset of primate species consuming broad-range diets. We found that marmosets and lemurs express negligible amylase activity in their saliva (~0.2 and ~0.009 units/mg, respectively). However, for them starch is not a primary component in their otherwise diverse diet. Marmosets were shown to consume tree gum while lemurs consume tamarind fruit, reportedly rich in non-starch polysaccharides (for further information on starch consumption by mammalian species see Materials and methods).

Next, to investigate the influence of human commensalism, we conducted a comparative investigation of amylase gene copy number and enzymatic activity in saliva between mammalian species interacting with humans and their closest evolutionary relatives in the wild. In dogs, that due to their commensalism with humans consume a considerable amount of starch, we found an increase, not only in amylase gene copy number (Axelsson et al., 2013) but also in enzymatic activity of amylase in saliva as compared to the carnivorous wolf from which dogs diverged approximately 30,000–40,000 years ago (Skoglund et al., 2015) (Figure 3A). A less substantial increase was found in species that already consumed starch in their ancestral state (e.g. mice and rats which diverged from granivorous ancestors). Along the same lines, we found no difference of amylase gene copy numbers and salivary enzymatic activity between domesticated pigs and wild boars. This could be explained because boars, the ancestral species, already consumed starch in amounts comparable to those of pigs, their domestic counterparts. In fact, previous observations showed that boars and humans have similar starch-rich ancestral diets due to their consumption of underground starch-containing storage stem tissues known as tubers (Hatley and Kappelman, 1980).

Evolution of amylase in primates

In primates, we could conduct a finer resolution analysis of amylase evolution and its relationship to diet than in non-primate mammals because for primates we have access to more data and samples. Specifically, we could investigate the amylase gene locus in 14 different primate species (53 DNA samples) for gene copy number, and from 8 of these species, we also could obtain saliva (26 saliva samples) for measuring enzymatic activity (Figure 5). We confirmed previous studies which documented a duplication of the amylase gene in the ancestral population of the catarrhini and an additional duplication in the ancestral population of the great apes (Meisler and Ting, 1993). Among Old World monkeys, we found further amylase gene copies in rhesus macaques, baboons, and vervets, all species which consume a broad-range diet. In contrast, we found that leaf-eating Old World monkeys (colobus, snub-nose, and proboscis monkeys) (Hohmann, 2009) possess only one copy of the amylase gene, indicating a potential loss of a copy in this lineage.

Figure 5. Amylase gene copy number duplication events and salivary activities across the primate phylogeny.

Figure 5.

Bars in the upper part of the diagram represent mean amylase activities in saliva of corresponding primate species (indicated by the dotted lines). Orange dots in the branches of the phylogenetic tree show the presumed occurrences of gene duplication events. The red X indicates a likely gene loss event. Haploid amylase gene copy numbers are indicated in parentheses above the species name. The copy numbers determined through genotyping by ddPCR are shown in black round parentheses, while those based on reference genomes are shown in gray square brackets. Phylogenetic branches are colored according to dietary preferences (see boxed legend). The upper boxed section is a zoomed-in version of the tree. Abbreviations: G.M., green monkey; V, vervet; A.S., Allen’s swamp monkey; O.W.M., Old World monkeys.

Most New World monkey genomes that we tested carry two haploid amylase copies. Assuming that the ancestral state of this lineage had one haploid copy, our results suggest yet another occurrence of gene copy number gain in the ancestor of New World monkeys. Moreover, we found an additional amylase gene copy in capuchins, which generally consume more starch than other New World monkeys (Galetti and Pedroni, 1994; Rowe and Myers, 2016). Next, we investigated lemurs, an outgroup primate species to monkeys and great apes, and found that they indeed only harbor one haploid copy of the amylase gene (Figure 5). Our result in the lemur lineage, combined with prior reports that ancestors of simians have a single amylase gene copy (Samuelson et al., 1996; Samuelson et al., 1990), suggests that primate ancestors possessed only one haploid copy of the amylase gene. Still, other scenarios, involving multiple amylase copies in the primate ancestor followed by gene loss, may also explain the observed copy number variation across the primate phylogeny, and cannot be dismissed at the present stage, although they are more complex in their assumptions. Clearly, more studies are needed to fully resolve the evolutionary history of the variation in this locus in primates, which, in addition to gene duplications, may have been shaped by gene conversion and complex rearrangements as well as incomplete lineage sorting.

Next, we investigated whether variation in amylase gene copy numbers among primates translates into salivary expression in a similar way as we had shown for non-primate mammals. We found that several species of Old World monkeys, including rhesus macaques and baboons, express high enzymatic activity of amylase in their saliva (Figure 5) (Mau et al., 2010). These primates, belonging to the subfamily cercopithecinae, are known for their cheek pouches in which they store food for prolonged oral predigestion (Lambert, 2017; Rahaman et al., 1975). Thus, this primate subfamily could be an exception in that salivary amylase may substantially participate in oral digestion of starch.

Most New World monkeys do not consume starch in their regular diets. For example, marmosets primarily consume insects and plant exudate (Rylands and Faria, 1993), while owl monkeys consume flowers, insects, nectar, and leaves (Rowe and Myers, 2016; Wright, 1994). In agreement with their dietary habits, we found little to no salivary activity of amylase in these New World monkeys. Capuchin monkeys are an exception because they consume fruits, bulbs, and seeds (Galetti and Pedroni, 1994; Rowe and Myers, 2016). Accordingly, we discovered enzymatic activity levels of salivary amylase in capuchins that reach levels found in pigs and boars (Figures 3C and 5).

Our results in primates document two additional instances (cercopithecinae and capuchins) where duplications of the amylase gene coincide with salivary expression. Combined, our results suggest that the evolution of the amylase locus in primates followed the same general trends observed for all mammals in that dietary strategies coincide both with amylase gene copy number and salivary expression in a lineage-specific manner.

Conclusion and outlook: a working model to explain how the amylase locus evolved

Our results reveal a staggering diversity of amylase gene copy numbers across extant mammals that consume starch. We report multiple bursts of amylase copy number gains that occurred independently in different branches of the mammalian phylogeny. Our results showed that each of these bursts coincided with expression of amylase in saliva. Our results also showed that phylogenetically distant species living in different habitats and consuming different diets have arrived at astonishingly similar amylase gene copy numbers, which correlate with the level of starch in their diet. Building on earlier models of the locus’ evolution (Axelsson et al., 2013; Perry et al., 2007; Samuelson et al., 1990) and using our own data, we deduce a model of how the amylase gene locus might have evolved across mammals (Figure 6). We posit here that the amylase locus evolved under the influence of the dietary context driven by the functional importance of amylase enzymatic activity in two digestive gland systems, namely the salivary glands, located at the entrance to the gastrointestinal tract, and the pancreas located further distally in the digestive continuum.

Figure 6. A working model to explain the evolution of the amylase locus.

Figure 6.

The schematic diagram illustrates how the amylase gene locus could have independently evolved in different lineages (Lineage A and B) of the mammalian phylogeny. Our results are consistent with a model where the enzymatic expression in saliva is gained through the concerted effect of gene duplications and regulatory architecture. Our findings suggest that at least one gene duplication is required for salivary-gland-specific expression. Our suggested model posits that along with the strength of the regulatory architecture and the dietary context, additional gene duplications drive the evolution of amylase expression in saliva. Examples representative of Lineage A are muroidea and cercopithecinae. Examples representative of Lineage B are dogs, pigs, and humans.

Most evolutionary models agree that the ancestral mammalian amylase gene was expressed in the pancreas. It has been suggested that increase in gene copy number leads to higher amylase expression in the pancreas, which in turn allows rapid and effective intestinal digestion of starch in species consuming a higher amount of that food component in their diet (Axelsson et al., 2013). Based on our findings, we propose here that at least one gene duplication is required for amylase to gain expression in salivary glands. We argue that this is a form of neofunctionalization (Ohno S, 1970) where an otherwise intact gene copy acquires mutations in its regulatory architecture, thus leading to expression in a new tissue, in this case the salivary gland system.

We hypothesize that one or more gene duplication events of this copy are needed to gain a level of enzymatic activity in saliva optimally suited to accommodate starch predigestion in the mouth environment. As illustrated in Figure 6, we further propose that there are different ways to achieve a given level of salivary amylase activity, depending on the strength of the regulatory element associated with the salivary gland specific gene copy. For instance, muroidea and cercopithecinae possess only a single salivary-gland-specific gene copy but arrive at high levels of amylase activity in their saliva. In this case, we surmise that a strong regulatory element is associated with that single copy. In addition to that copy, muroidea possess multiple other amylase gene copies that are not expressed in salivary gland tissue. It can be assumed that these gene copies are expressed in pancreatic tissue and may increase enzymatic activity there in the more distal parts of the digestive tract. Dogs and pigs have numbers of amylase gene copies comparable to humans and some muroidea but show much less amylase activity in their saliva than these species. Given the data, we are not able to say which of these copies are expressed in pancreas and which in salivary glands. Regardless, those copies that might be expressed in salivary glands must be associated with weak regulatory architectures.

In humans, the number of amylase gene copy numbers was shown to correlate with levels of amylase expression in saliva (Bank et al., 1992; Perry et al., 2007). Our data show that across all mammalian species such a simple correlation does not hold true. However, within species possessing multiple and variable copy numbers there might be a correlation of copy number with salivary expression. In this regard, we could show a correlation of amylase gene copy number and salivary enzyme activity for dog (R2 = 0.45) and pig (R2 = 0.69). A comparative study investigating expression of individual amylase gene copies in pancreas versus salivary gland system will be the logical next step to delineate the mechanisms through which gland-specificity of amylase has evolved in mammals. It is of particular interest to simultaneously elucidate the role of lineage-specific retrotransposons or other regulatory elements in modulating tissue-specific expression of amylase.

From a broader ecological perspective, we showed that amylase gene copy numbers generally correlate with a broad-range diet and with high versus low-starch consumption. However, salivary amylase activity is only correlated with broad-range diet but not with the amount of starch. As such, a simple explanation solely based on the digestive function of amylase cannot fully explain why some mammals, including humans, express so much amylase in their saliva. As a matter of fact, most mammalian species do not keep food long enough in their mouth for salivary amylase to substantially participate in starch digestion. We argue here that in most mammalian species the role of amylase in saliva may be to liberate oligomeric sugar molecules from polymeric starch chains that can then be perceived by sweet-taste receptors in the oral cavity. Indeed, studies found links between salivary amylase and taste perception (Mandel et al., 2010; Sclafani et al., 1987). Being able to perceive otherwise tasteless starch in their diet might confer an adaptive advantage to species consuming broad-range diets by enabling them to detect high caloric (i.e. starch-containing), food components. Lastly, putatively adaptive benefits of amylase expression in saliva depend on the ecological and behavioral context for any given species. An exceptional example are the cheek-pouched cercopithecinae where putative fitness advantage of salivary amylase expression goes beyond taste perception. In these species, which conduct almost half of their starch digestion in the oral cavity (Janiak, 2016), salivary amylase may have evolved to substantially participate in the overall digestion of dietary starch, a role executed by pancreatic amylase in most other species.

In addition to gustatory and digestive functions, salivary amylase may also be involved in the regulation of metabolic glucose homeostasis (Peyrot des Gachons and Breslin, 2016) as well as associated with bacterial composition in the oral cavity (Davenport, 2017; Scannapieco et al., 1989) or in the gut (Poole et al., 2019). In that regard it will be interesting to find out whether different amylase gene copies encode proteins of slightly different functional activity due to differences in DNA sequence, differential RNA splicing, and post-translational modifications, including glycosylation. Overall our present study highlights the potential role of amylase enzymatic activity in saliva in shaping food preference and niche partitioning among omnivorous starch-consuming mammals, possibly in coevolution with the oral microbiome.

Materials and methods

Sampling

We chose our panel of mammalian species based on their phylogeny, diet preference (broad vs. specialized), domestication status, and commensal relationship with humans. Overall, we compiled 204 DNA samples from 46 different species and 127 saliva samples from 22 different species. Detailed information about the samples used in this study and their sources can be found in Supplementary file 1. Briefly, DNA from various animals was collected using buccal swabs (PurFlock, Puritan Medical Products), saliva samples, or museum specimens (dried blood and tissues, Museum of Southwestern Biology, Division of Genomics Resources). Saliva samples were collected by suction using commercially available devices containing absorbent sponges in a syringe-like receptacle (Super-SAL and Micro-SAL, Oasis Diagnostics, Vancouver, WA) unless otherwise specified. DNA was extracted from swabs using a commercially available kit (ChargeSwitch gDNA Buccal Cell Kit, Invitrogen). For saliva samples, we used a commercially available extraction kit (BioWorld, Dublin, OH). Detailed information about sampling strategies employed for each species can be found in Collection of saliva samples section below.

Copy number variation analysis

Digital droplet PCR (ddPCR) was used to experimentally determine amylase gene copy numbers. If reference genomes were available for a given species, primers were designed specifically for use in these species. For species where reference genomes were unavailable, amylase coding sequences were chosen for primer design that were confirmed to be conserved in the two most closely related species. Further details about primer design and strategy are described in Primer design for digital PCR section below. The primer sets used for each species are listed in Supplementary file 2.

Phylogenetic analysis

Translated amino acid sequences of the amylase gene copies were downloaded from NCBI reference genomes. Sequences were aligned and a phylogenetic output was generated using a custom Python code as described previously (https://github.com/duoduoo/VCFtoTree) (Xu et al., 2017; Pajic et al., 2016). We constructed a phylogenetic tree from the protein sequences by Randomized Axelerated Maximum Likelihood (RAxML) (Stamatakis, 2014) using the LG substitution model (Le and Gascuel, 2008), bootstrapping it with 1000 replicates for branch support. Visualization was performed using the FigTree software (Rambaut, 2012).

Retrotransposon analysis

Previous work utilized lineage-specific retrotransposons to estimate the timing of amylase gene duplications and to distinguish between salivary and pancreatic amylase genes in humans (Samuelson et al., 1990). Using this approach, the salivary AMY1 gene could be traced back to a great ape ancestor (Samuelson et al., 1990). Later studies used the sequence of the great-ape specific retrotransposon to label AMY1 gene copies in humans by fiber-FISH (Perry et al., 2007). Building upon these findings, we searched 5 kb upstream and downstream of the amylase copies in mouse, rat, pig, and dog reference genomes for the existence of lineage-specific retrotransposition markers. Specifically, we searched for relatively recent L1 elements having sW scores of more than 1000 and being located at nearly identical distances to the 5’ or 3’ ends of amylase gene copies. Using this approach, we detected a relatively small number (less than 10) of distinct L1 retrotransposons in each species (Figure 2B). To ensure that these retrotransposons were duplicated specifically in the amylase locus, we conducted a BLAST analysis to search for the existence of these same retrotransposons outside of the amylase locus. We found that the retrotransposons within the amylase locus are highly similar to each other (>90%) and we found no similarly close matches for these retrotransposons elsewhere in the reference genomes of these species (Supplementary file 3). Next, we verified that these retrotransposons were indeed lineage-specific by showing that there were no close matches in reference genomes of other species. The most parsimonious explanation for our observations is that these L1 elements inserted into the proximity of an amylase gene copy and then duplicated along with additional copies of that gene, thereby suggesting lineage-specificity of duplication events.

Measurement of amylase enzymatic activity

We used two different methods to measure enzyme activity of amylase in saliva. First, we conducted a direct estimate of enzyme activity using a traditional starch lysis agar plate assay following a previously described protocol (Kilian and Nyvad, 1990). In brief, holes were punched in a starch-containing agar and filled with saliva. After 24 hr incubation at 37°C, the undigested starch remaining in the agar was stained with iodine and the diameters of the lysed clear rings were measured. Enzymatic activity was extrapolated from serial standard dilutions of purified α-amylase from human saliva (Sigma) measured in the same assay. In parallel, we measured the samples using a high-sensitivity (detection limit 2 mU/ml) colorimetric in-solution assay (EnzCheck Ultra Amylase Assay Kit, Invitrogen) following the manufacturer’s protocol with the same human α-amylase as the standard. Concentrations of total protein in saliva were determined by the bicinchoninic acid (BCA) assay (micro-BCA, BioRad) using bovine serum albumin as the standard. Optical density measurements were performed using a Nanodrop 2000 spectrophotometer (Thermo Fisher). Amylase activities were calculated as units of enzymatic activity normalized per mg of total salivary protein.

Data analyses

All input data used for creating the main figures are provided in Supplementary file 1. Information about the dietary preferences of individual species was acquired from extensive literature research presented in Categorization of starch consumption section below. All figures were produced using the R statistical package (https://www.r-project.org/). For calculating the independent phylogenetic contrasts shown in Figure 4C, we used the approach outlined by Felsenstein (1985). For this analysis, we used the subset of species available through the Hg19 100way conservation alignment (http://genomewiki.ucsc.edu/index.php/Hg19_100way_conservation_alignment). Using the phylogenetic distance provided in this dataset, we first normalized the differences in amylase gene copy number and salivary enzyme activity between any two species by the square roots of the phylogenetic distance between them. Using these normalized values and applying the non-parametric Kolmogorov–Smirnov test, we tested the null hypothesis that the phylogenetic contrasts between species consuming a specialized diet is not different from the phylogenetic contrasts between species consuming the different types of diet. Among the species consuming a broad-range diet, we further tested that the phylogenetic contrasts between high and moderate starch consuming species are not different from those between species consuming moderate levels of starch.

Collection of saliva samples

Saliva samples and buccal swabs from deer mice (Peromyscus spp.) were provided by Danielle Garneau (SUNY Plattsburgh). Mice were trapped in the wild by Sherman live traps (Garneau et al., 2012). After restraint by scruffing behind the neck, a glass capillary tube was introduced to the animal's mouth and was moved about the lower lip and cheeks to collect saliva. The tube was introduced at an angle such that gravity would help draw down the sample into the tube. The capillary tube was placed in an Eppendorf tube and a pipet pump was used to force air to drive the rest of the sample from the capillary tube into the Eppendorf tube for storage at −20°C and shipment on dry ice.

Saliva from house mice (laboratory strain C57BL10/SNJ) was kindly provided by Jill Kramer (University at Buffalo) using a collection procedure as previously described (Kiripolsky et al., 2017).

Saliva from woodrats was kindly provided by Michelle Skopec (Weber State University). To collect saliva, woodrats were scruffed and Micro-Sal collection (Oasis Diagnostics, Vancouver, WA) devices were placed in their mouths. The woodrats were allowed to chew on the absorbent sponge part of the device and, then, their tongues and cheeks were swabbed to retrieve residual saliva. Collection devices were centrifuged and saliva samples were stored at −20°C before shipment on dry ice.

Saliva from Long Evans hooded rats was kindly provided by Ann-Marie Torregrossa (University at Buffalo). As described previously (Martin et al., 2018; Torregrossa et al., 2014), rats were conditioned to salivate when a pipette was inserted into the mouth and saliva was collected. Approximately 50 µl saliva was retrieved by suction from below and around the tongue where it pools naturally.

Saliva from dogs, cows, sheep, goats, horses, pigs, and giant African pouched rats was provided by Erin Daugherity and Luce E. Guanzini (Cornell University). Animals were not allowed to eat or drink prior to the collection to ensure the oral cavity was free of food and other debris. Saliva from giant African pouched rats was collected opportunistically while animals were anesthetized for an unrelated clinical procedure. The collection was performed using a commercially available device (Micro-Sal, Oasis Diagnostics). Large animals were gently restrained and a larger collection device (Super-Sal, Oasis Diagnostics) was placed under the tongue for up to three minutes, or until fully soaked. Devices were stored at −20°C before shipping on dry ice.

Saliva from female wild boars and castrated domestic pigs was provided by Anja Globig (Friedrich-Loeffler-Institut, Insel Riems - Greifswald, Germany). For the collection of saliva a commercial collection device, consisting of an absorbent cotton swab in a tube, was used (Salivette, Sarstedt, Nümbrecht, Germany). The swab was inserted in the animal’s mouth and fixated with a forceps until it was drenched with saliva. After placing the swab back in the tube, saliva was extracted by centrifugation. Samples were lyophilized before international shipping.

Saliva from wolves was kindly provided by Karen Davis (Wolf Park, Battle Town, IN). The wolves housed in this facility are well socialized, which allowed saliva collection by inserting Super-Sal (Oasis Diagnostics) devices into the mouths of adult wolves willing to participate. Swabs were kept in the animals’ mouths as long as they would tolerate it or until fully soaked. Samples from juvenile wolves could be collected while they were resting by inserting the swabs into their mouths. Samples were stored at −20°C before shipment on dry ice.

Saliva from dogs was kindly provided by Barbara McCabe (Buffalo, NY). Samples were obtained from diverse breeds of dogs including Boxers, Pitbulls, Golden Retrievers, and Labradors, along with several mixed breeds (see Supplementary file 1 for details). Super-Sal devices (Oasis Diagnostics) were placed in the mouth of dogs for 1–5 min, or until swab was damp. The swabs were stored at −20°C until transfer to our laboratory.

Saliva from Ring-tailed Lemur samples was kindly provided by Erin Ehmke (Duke Lemur Center). Samples were collected using commercially available absorbent strips (SalivaBio Children's Swabs, Salimetrics, Carlsbad, CA). Saliva-soaked swabs were immediately centrifuged and the collected saliva was frozen at −80°C and shipped on dry ice.

Saliva from humans was collected by passive drooling following the protocol approved by the University at Buffalo Human Subjects IRB board (study # 030–505616). Informed consent was obtained from all human participants. Saliva from chimpanzees and gorillas was collected in a noninvasive manner following the protocol approved by the University at Buffalo IACUC committee (IACUC ID# AR201800024). Chimpanzees were trained by the caretaker to voluntarily expectorate into a plastic cup. Gorilla (Western lowland gorilla) saliva was collected by the animal caretakers with a soft disposable plastic Pasteur pipette (VWR, Radnor, PA) from individuals who were previously trained to open their mouth upon request. Saliva from Rhesus macaques was provided by the Southwest National Primate Research Center, San Antonio, TX, and by the Yerkes National Primate Research Center, Atlanta, GA. All samples were immediately transferred into a polypropylene tube chilled on ice. Aliquots were stored at −80°C and shipped on dry ice.

Categorization of starch consumption

All input data used for creating the main figures are provided in Supplementary file 1. Information about the dietary preferences of individual species was acquired from the Michigan Animal Diversity Web (https://animaldiversity.org/), unless other studies were cited. With regard to starch consumption, the literature was limited. Thus, we undertook the following steps to construct a categorization of starch consumption among the species that we used in our analysis presented in Figure 4A and B.

Based on the information available on Michigan Animal Diversity Web, we first identified animals with specialized diets (carnivores: cat, polar bear, cougar, and wolf; herbivores: sheep, bison, snow sheep, cow, goat, horse, bighorn sheep, ibex, yak, wild goat, zebra, sheep, and donkey). We assumed that starch makes up a negligible percentage of these animals’ diet.

For the animals with broad-range diets, which presumably have considerable starch content in their diet, we conducted a wider literature research. Most information about starch consumption is available for present-day human populations (Bright-See and Jazmaji, 1991), and it has been suggested that humans consume a higher percentage of starch in their diet than great apes (Perry et al., 2007; Zohary et al., 2012). Indeed, chimpanzees and bonobos primarily consume ripe fruits (poor in starch), and in the scarcity of ripe fruits, they prefer piths (also poor in starch) (Hohmann, 2009; Wrangham et al., 1998). Orangutans and gorillas primarily consume ripe fruit and leaves (Hohmann, 2009). However, it was suggested that they also consume seeds and cambium, an observation that led Janiak (Janiak, 2016) to argue that gorillas and orangutans have a relatively higher starch content in their diets than chimpanzees and bonobos.

Old World monkeys show remarkable diversity in their diets. Specifically, cercopithecines (represented in our dataset by baboons, rhesus macaques, vervet monkeys, green monkeys, and Allen’s swamp monkeys) consume considerable amounts of unripened fruit (higher starch content [Lambert, 1998]), especially when no ripe fruit is available (Mau et al., 2010; Wrangham et al., 1998). In contrast, most colobinae (represented by the colobus monkeys in our dataset) are primarily leaf-eating and, thus, likely have little starch in their diet (Oates, 1994).

New World monkeys (represented by capuchin monkeys, owl monkeys, and marmosets in our dataset) also show diversity in their likely starch consumption. Capuchin and owl monkeys primarily consume fruits, even though they supplement their diets with flower foraging and insects (Lambert, 2017; Kinzey, 1997), which likely indicates starch consumption similar to chimpanzee and bonobos. Marmosets differ in their dietary habits as their primary food intake comprises gum and other exudates (which are not starch sources) from various trees and vines, and scarcely involve fruits (Soini, 1982). Lemurs (represented by ring-tailed lemur in our dataset) have been reported to primarily consume Tamarind fruit, which is rich in non-starch polysaccharides (Gould et al., 2003). Overall, primates depend on a wide variety of food sources, including starch-based foods. However, humans are the only primate species consuming a diet unusually high in starch content.

As for rodents, we first considered species which are primarily human-commensal (represented by house mice as well as by brown and black rats in our dataset). These species have considerable variation in their diets depending on the ecological context they are living in. For example in the wild, house mice have been reported to eat primarily insects and seeds, the latter containing significant amounts of starch (Badan, 1986; Roux et al., 2002). However, house mice normally live in human-influenced habitats and consume agricultural grains and other starch-rich human-produced food and human food leftovers (Clark, 1982; Gardner-Santana et al., 2009; Hulme-Beaman et al., 2016; Pocock et al., 2004; Schein and Orgain, 1953; Singleton et al., 2003). Other rodents with less human-commensal interactions (represented by birch mouse, deer mouse, pouched rat, and woodrat in our study) also consume diverse diets, including insects, seeds, grains, and flowers (Ajayi, 1977; Baker, 1991; Everett et al., 1978; Juskaitis, 2000). However, their access to starch-rich grains and seeds is seasonal, and these foods are not necessarily their primary caloric source.

The other mammals with broad-range diet in our dataset were dogs, pigs, boars, and bears. Brown and black bears eat a wide range of foods including leaves, fruits, grains, as well as meat from hunting or scavenging (Bojarska and Selva, 2012; Graber and White, 1983; Torgersen et al., 2001). However, the same studies documented that starch-containing foods, such as grains, make up only a small portion of the bears’ diet. Boars also have a diverse preference in their diet, including mushrooms, roots, fruits, and insects. However, unlike bears, boar diets include substantial amounts of roots and tubers (Baubet et al., 2004; Massei and Genov, 2004) and, if available, human agricultural crops (Herrero et al., 2006). A close relative of the boar, the domesticated pig thrives primarily on human-produced starch-rich food sources, such as corn or potatoes. Overall, pigs and boars have higher starch content in their diets than bears. In fact, their diet can be comparable to early human diets (Hatley and Kappelman, 1980; Miller and Ullrey, 1987). Another mammal consuming a broad-range diet that was included in our study was the dog, which has been discussed within the context of recent adaptation to human-derived starch-rich diets (Arendt et al., 2016; Axelsson et al., 2013). Based on this literature, we presume that dogs, pigs, and boars have higher starch content in their diet, while starch makes up a smaller portion of the diet of the bears.

Overall, all the mammals consuming a broad-range diet mostly have considerable levels of starch in their diet. However, our literature search indicates that humans, pigs, boars, dogs, mice, and rats (both brown and black) stand out in that their diet predominantly depends on starch-rich foods (grains, roots, and tubers). Thus, we grouped them under the ‘higher starch’ consuming category whereas we grouped the other species under the ‘lower starch’ consuming category (Figure 4A and B).

Primer design for digital PCR

For digital droplet PCR experiments, we used two primer/probe sets. One targeted the amylase copies and the other targeted a conserved ‘reference’ sequence, which was found to be a single haploid copy in mammals with known reference genomes (SRSF7 gene). For the reference sequence, we used a primer/probe set that targets one of the exons of the SRSF7 gene. The sequence is highly conserved across species and unique (i.e. a single haploid copy) in all mammalian reference genomes we investigated. We have checked that the sequence is 100% conserved in species that we considered and for which reference genomes were available from the UCSC genome portal. To capture as many amylase gene copies as possible, we carefully designed primers and probes for each species where a reference genome was available to match (100% as assessed by BLAST alignment) all of the reference amylase copies. Primer and probe sequences are listed in Supplementary file 2. It is possible that we underestimated or missed some of the amylase copies that are not represented in the reference genomes. However, digital PCR is robust to 1–2 mismatches and in most species, ddPCR results were highly concordant with copy number estimations based on BLASTx and BLASTp analysis (Figure 1—figure supplement 1, Supplementary file 2). Therefore, we surmise that the main trends we observed in this study are reliable.

To decide which primer/probe sets to use for species where no reference genome was available, we designed primer/probe sets that work in the phylogenetically most closely related species for which reference genomes were available. For example, for zebra, we used a primer/probe set designed for the horse reference genome. To ensure that this primer/probe set was appropriate, we first made sure that it also worked in the donkey reference genome. As such, we surmised that our approach should work unless there is rapid, species-specific sequence divergence in zebra as compared to horse and donkey. An analogous approach was used for all the other species for which reference genomes were not available (Figure 1—figure supplement 1, Supplementary file 2, for substitute species genomes chosen). Of course, this approach might be prone to undercalling the number of amylase gene copies in species where reference genomes are not available. Although we are confident about these estimates, none of the major conclusions of this study depends on data from such species.

Acknowledgements

We recognize and thank all individuals and institutions who provided us with DNA and saliva samples: Joseph Cook and Mariel Campbell at the Museum of Southwestern Biology, Division of Genomics Resources, University of New Mexico; the Coriell Institute for Medical Research, Camden, New Jersey; Karen Davis at Wolf Park, Battle Ground, Indiana; Barbara McCabe, University at Buffalo, New York; Luce E Guanzini at the Center for Animal Resources and Education, Cornell University; Klaus Depner at the Friedrich-Loeffler-Institut, Greifswald, Germany; Ann-Marie Torregrossa, Department of Psychology, University at Buffalo; Michele Skopec at the Department of Zoology, Weber State University, Ogden, Utah; Jill Kramer at the Department of Oral Biology, University at Buffalo; Kurt Volle, Alicia Dubrava and fellow gorilla caretakers of the Buffalo Zoo, the Southwest National Primate Research Center (funded by NIH - ORIP/OD P51 OD011133), the Yerkes National Primate Research Center (funded by NIH - ORIP/OD P51 OD011132), and the Duke Lemur Center (funded by NSF - LSCBR #1050035). We are grateful to Ozgur Taskent, Jessica Poulin, Trevor Krabbenhoft, and Derek Taylor for proofreading the manuscript and discussions of the data. This study was funded by The National Science Foundation NSF grant No. 1714867 (OG), National Institute of Dental and Craniofacial Research (NIDCR) grants R01 DE019807 and R21 DE025826 (SR), and National Cancer Institute (NCI) grant U01 CA221244 (SR).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Stefan Ruhl, Email: shruhl@buffalo.edu.

Omer Gokcumen, Email: gokcumen@gmail.com.

George H Perry, Pennsylvania State University, United States.

Diethard Tautz, Max-Planck Institute for Evolutionary Biology, Germany.

Funding Information

This paper was supported by the following grants:

  • National Institute of Dental and Craniofacial Research R01 DE019807 to Stefan Ruhl.

  • National Institute of Dental and Craniofacial Research R21 DE025826 to Stefan Ruhl.

  • National Cancer Institute U01 CA221244 to Stefan Ruhl.

  • National Science Foundation 1714867 to Omer Gokcumen.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing.

Formal analysis, Methodology, Writing—review and editing.

Validation, Investigation.

Validation, Investigation, Methodology.

Resources, Data curation, Methodology.

Resources, Methodology, Writing—review and editing.

Resources, Validation, Methodology.

Resources, Validation, Methodology, Writing—review and editing.

Conceptualization, Supervision, Funding acquisition, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing.

Conceptualization, Resources, Formal analysis, Supervision, Funding acquisition, Visualization, Methodology, Writing—original draft, Writing—review and editing.

Ethics

Human subjects: Saliva from humans was collected by passive drooling following the protocol approved by the University at Buffalo Human Subjects IRB board (study # 030-505616). Informed consent was obtained from all human participants.

Animal experimentation: Saliva from chimpanzees and gorillas was collected in a noninvasive manner following the protocol approved by the University at Buffalo IACUC committee (IACUC ID# AR201800024). The samples from other animals were collected in collaboration with colleagues at museums, zoos, and other research institutions. The samples from all the live animals specifically for this study were.

collected using.

minimally invasive methods, and involved buccal swabs and saliva collections with specialized kits or from the drool. Detailed descriptions of sample sources and collection methods can be found in the main text.

Additional files

Supplementary file 1. Primary datasets used in our study including amylase enzymatic activity in saliva and gene copy numbers across mammalian species (tab 1: data used for Figures 3 and 5), as well as dietary preferences across species (tab 2: data used for Figure 4), and amylase copy numbers found in available reference genomes (tab 3: data used for Figure 1—figure supplement 1).
elife-44628-supp1.xlsx (35.3KB, xlsx)
DOI: 10.7554/eLife.44628.012
Supplementary file 2. Details of the primer and probe sets that we used in this study.
elife-44628-supp2.xlsx (10.9KB, xlsx)
DOI: 10.7554/eLife.44628.013
Supplementary file 3. Results of our analysis of retrotransposons associated with the amylase gene copies depicted in Figure 2B.

Figure supplements

elife-44628-supp3.xlsx (13.1KB, xlsx)
DOI: 10.7554/eLife.44628.014
Transparent reporting form
DOI: 10.7554/eLife.44628.015

Data availability

All data generated (copy numbers and salivary activity information) can be found in the supplementary files and Methods.

References

  1. Ajayi SS. Field observations on the african giant rat Cricetomys gambianus waterhouse in southern Nigeria. African Journal of Ecology. 1977;15:191–198. doi: 10.1111/j.1365-2028.1977.tb00397.x. [DOI] [Google Scholar]
  2. Arendt M, Cairns KM, Ballard JW, Savolainen P, Axelsson E. Diet adaptation in dog reflects spread of prehistoric agriculture. Heredity. 2016;117:301–306. doi: 10.1038/hdy.2016.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Axelsson E, Ratnakumar A, Arendt ML, Maqbool K, Webster MT, Perloski M, Liberg O, Arnemo JM, Hedhammar A, Lindblad-Toh K. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature. 2013;495:360–364. doi: 10.1038/nature11837. [DOI] [PubMed] [Google Scholar]
  4. Badan D. Diet of the house mouse (Mus musculus L.) in two pine and a kauri forest. New Zealand Journal of Ecology. 1986;9:137–142. [Google Scholar]
  5. Baker RH. Michigan Mammals. Michigan State University Press; 1991. [Google Scholar]
  6. Bank RA, Hettema EH, Muijs MA, Pals G, Arwert F, Boomsma DI, Pronk JC. Variation in gene copy number and polymorphism of the human salivary amylase isoenzyme system in Caucasians. Human Genetics. 1992;89:213–222. doi: 10.1007/BF00217126. [DOI] [PubMed] [Google Scholar]
  7. Baubet E, Bonenfant C, Brandt S. Diet of the wild boar in the French Alps. Galemys. 2004;16:101–113. [Google Scholar]
  8. Boehlke C, Zierau O, Hannig C. Salivary amylase - The enzyme of unspecialized euryphagous animals. Archives of Oral Biology. 2015;60:1162–1176. doi: 10.1016/j.archoralbio.2015.05.008. [DOI] [PubMed] [Google Scholar]
  9. Bojarska K, Selva N. Spatial patterns in Brown bear ursus arctos diet: the role of geographical and environmental factors. Mammal Review. 2012;42:120–143. doi: 10.1111/j.1365-2907.2011.00192.x. [DOI] [Google Scholar]
  10. Botigué LR, Song S, Scheu A, Gopalan S, Pendleton AL, Oetjens M, Taravella AM, Seregély T, Zeeb-Lanz A, Arbogast RM, Bobo D, Daly K, Unterländer M, Burger J, Kidd JM, Veeramah KR. Ancient European dog genomes reveal continuity since the Early Neolithic. Nature Communications. 2017;8:16082. doi: 10.1038/ncomms16082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bright-See E, Jazmaji V. Estimation of the amount of dietary starch available to different populations. Canadian Journal of Physiology and Pharmacology. 1991;69:56–59. doi: 10.1139/y91-010. [DOI] [PubMed] [Google Scholar]
  12. Carpenter D, Dhar S, Mitchell LM, Fu B, Tyson J, Shwan NA, Yang F, Thomas MG, Armour JA. Obesity, starch digestion and amylase: association between copy number variants at human salivary (AMY1) and pancreatic (AMY2) amylase genes. Human Molecular Genetics. 2015;24:3472–3480. doi: 10.1093/hmg/ddv098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chauncey HH, Henriques BL, Tanzer JM. Comparative enzyme activity of saliva from the sheep, hog, dog, rabbit, rat, and human. Archives of Oral Biology. 1963;8:615–627. doi: 10.1016/0003-9969(63)90076-1. [DOI] [PubMed] [Google Scholar]
  14. Clark DA. Foraging behavior of a vertebrate omnivore (Rattus rattus): Meal structure, sampling, and diet breadth. Ecology. 1982;63:763–772. doi: 10.2307/1936797. [DOI] [Google Scholar]
  15. Davenport ER. Tooth be told, genetics influences oral microbiome. Cell Host & Microbe. 2017;22:251–253. doi: 10.1016/j.chom.2017.08.018. [DOI] [PubMed] [Google Scholar]
  16. Everett RL, Meeuwig RO, Stevens R. Deer mouse preference for seed of commonly planted species, indigenous weed seed, and sacrifice foods. Journal of Range Management. 1978;31:70–73. doi: 10.2307/3897641. [DOI] [Google Scholar]
  17. Felsenstein J. Phylogenies and the comparative method. The American Naturalist. 1985;125:1–15. doi: 10.1086/284325. [DOI] [PubMed] [Google Scholar]
  18. Fernández CI, Wiley AS. Rethinking the starch digestion hypothesis for AMY1 copy number variation in humans. American Journal of Physical Anthropology. 2017;163:645–657. doi: 10.1002/ajpa.23237. [DOI] [PubMed] [Google Scholar]
  19. Galetti M, Pedroni F. Seasonal diet of capuchin monkeys (Cebus apella) in a semideciduous forest in south-east Brazil. Journal of Tropical Ecology. 1994;10:27–39. doi: 10.1017/S0266467400007689. [DOI] [Google Scholar]
  20. Gardner-Santana LC, Norris DE, Fornadel CM, Hinson ER, Klein SL, Glass GE. Commensal ecology, urban landscapes, and their influence on the genetic characteristics of city-dwelling Norway rats (Rattus norvegicus) Molecular Ecology. 2009;18:2766–2778. doi: 10.1111/j.1365-294X.2009.04232.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Garneau DE, Lawler ME, Rumpf AS, Weyburne ES, Cuppernull TM, Boe AG. Potential effects of beech bark disease on small mammals and invertebrates in northeastern US forests. Northeastern Naturalist. 2012;19:391–410. doi: 10.1656/045.019.0303. [DOI] [Google Scholar]
  22. Gluck C, Min S, Oyelakin A, Smalley K, Sinha S, Romano R-A. RNA-seq based transcriptomic map reveals new insights into mouse salivary gland development and maturation. BMC Genomics. 2016;17:923. doi: 10.1186/s12864-016-3228-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gould L, Sussman RW, Sauther ML. Demographic and life-history patterns in a population of ring-tailed lemurs (Lemur catta) at Beza Mahafaly Reserve, Madagascar: a 15-year perspective. American Journal of Physical Anthropology. 2003;120:182–194. doi: 10.1002/ajpa.10151. [DOI] [PubMed] [Google Scholar]
  24. Graber DM, White M. Black bear food habits in yosemite national park. In: Bears: Their Biology and Management. 1983;5:1–10. doi: 10.2307/3872514. [DOI] [Google Scholar]
  25. Groot PC, Mager WH, Henriquez NV, Pronk JC, Arwert F, Planta RJ, Eriksson AW, Frants RR. Evolution of the human alpha-amylase multigene family through unequal, homologous, and inter- and intrachromosomal crossovers. Genomics. 1990;8:97–105. doi: 10.1016/0888-7543(90)90230-R. [DOI] [PubMed] [Google Scholar]
  26. Gumucio DL, Wiebauer K, Caldwell RM, Samuelson LC, Meisler MH. Concerted evolution of human amylase genes. Molecular and Cellular Biology. 1988;8:1197–1205. doi: 10.1128/MCB.8.3.1197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hardy K, Brand-Miller J, Brown KD, Thomas MG, Copeland L. The importance of dietary carbohydrate in human evolution. The Quarterly Review of Biology. 2015;90:251–268. doi: 10.1086/682587. [DOI] [PubMed] [Google Scholar]
  28. Hatley T, Kappelman J. Bears, pigs, and Plio-Pleistocene hominids: a case for the exploitation of belowground food resources. Human Ecology. 1980;8:371–387. doi: 10.1007/BF01561000. [DOI] [Google Scholar]
  29. Herrero J, García-Serrano A, Couto S, Ortuño VM, García-González R. Diet of wild boar Sus scrofa L. and crop damage in an intensive agroecosystem. European Journal of Wildlife Research. 2006;52:245–250. doi: 10.1007/s10344-006-0045-3. [DOI] [Google Scholar]
  30. Hohmann G. The Evolution of Hominin Diets: Integrating Approaches to the Study of Palaeolithic Subsistence. Dordrecht: Springer; 2009. The Diets of non-human primates: Frugivory, food processing, and food sharing; pp. 1–14.978-1-4020-9699-0 [DOI] [Google Scholar]
  31. Hulme-Beaman A, Dobney K, Cucchi T, Searle JB. An ecological and evolutionary framework for commensalism in anthropogenic environments. Trends in Ecology & Evolution. 2016;31:633–645. doi: 10.1016/j.tree.2016.05.001. [DOI] [PubMed] [Google Scholar]
  32. Inchley CE, Larbey CD, Shwan NA, Pagani L, Saag L, Antão T, Jacobs G, Hudjashov G, Metspalu E, Mitt M, Eichstaedt CA, Malyarchuk B, Derenko M, Wee J, Abdullah S, Ricaut FX, Mormina M, Mägi R, Villems R, Metspalu M, Jones MK, Armour JA, Kivisild T. Selective sweep on human amylase genes postdates the split with Neanderthals. Scientific Reports. 2016;6:37198. doi: 10.1038/srep37198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Janiak MC. Digestive enzymes of human and nonhuman primates. Evolutionary Anthropology: Issues, News, and Reviews. 2016;25:253–266. doi: 10.1002/evan.21498. [DOI] [PubMed] [Google Scholar]
  34. Juskaitis R. New data on the birch mouse (Sicista betulina) in Lithuania. Folia Theriologica Estonica. 2000;5:51–56. [Google Scholar]
  35. Kilian M, Nyvad B. Ability to bind salivary alpha-amylase discriminates certain viridans group streptococcal species. Journal of Clinical Microbiology. 1990;28:2576–2577. doi: 10.1128/jcm.28.11.2576-2577.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kinzey WG, editor. New World Primates: Ecology, Evolution and Behavior. Routledge; 1997. 978-0202011868 [Google Scholar]
  37. Kiripolsky J, McCabe LG, Gaile DP, Kramer JM. Myd88 is required for disease development in a primary Sjögren's syndrome mouse model. Journal of Leukocyte Biology. 2017;102:1411–1420. doi: 10.1189/jlb.3A0717-311R. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lambert JE. Primate digestion: interactions among anatomy, physiology, and feeding ecology. Evolutionary Anthropology: Issues, News, and Reviews. 1998;7:8–20. doi: 10.1002/(SICI)1520-6505(1998)7:1<8::AID-EVAN3>3.0.CO;2-C. [DOI] [Google Scholar]
  39. Lambert J. Cheek Pouches. In: Fuentes A, editor. The International Encyclopedia of Primatology. Hoboken: John Wiley & Sons; 2017. [Google Scholar]
  40. Le SQ, Gascuel O. An improved general amino acid replacement matrix. Molecular Biology and Evolution. 2008;25:1307–1320. doi: 10.1093/molbev/msn067. [DOI] [PubMed] [Google Scholar]
  41. Mandel AL, Peyrot des Gachons C, Plank KL, Alarcon S, Breslin PA. Individual differences in AMY1 gene copy number, salivary α-amylase levels, and the perception of oral starch. PLOS ONE. 2010;5:e13352. doi: 10.1371/journal.pone.0013352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Martin LE, Nikonova LV, Kay K, Paedae AB, Contreras RJ, Torregrossa A-M, Torregrossa A-M Salivary proteins alter taste-guided behaviors and taste nerve signaling in rat. Physiology. 2018;184:150–161. doi: 10.1016/j.physbeh.2017.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Massei G, Genov PV. The environmental impact of wild boar. Galemys. 2004;16:135–145. [Google Scholar]
  44. Mau M, Südekum KH, Johann A, Sliwa A, Kaiser TM. Indication of higher salivary alpha-amylase expression in hamadryas baboons and geladas compared to chimpanzees and humans. Journal of Medical Primatology. 2010;39:187–190. doi: 10.1111/j.1600-0684.2010.00407.x. [DOI] [PubMed] [Google Scholar]
  45. Meisler MH, Ting CN. The remarkable evolutionary history of the human amylase genes. Critical Reviews in Oral Biology & Medicine. 1993;4:503–509. doi: 10.1177/10454411930040033501. [DOI] [PubMed] [Google Scholar]
  46. Mendes FK, Livera A, Hahn MW. The perils of intralocus recombination for inferences of molecular convergence. bioRxiv. 2018 doi: 10.1101/393124. [DOI] [PMC free article] [PubMed]
  47. Miller ER, Ullrey DE. The pig as a model for human nutrition. Annual Review of Nutrition. 1987;7:361–382. doi: 10.1146/annurev.nu.07.070187.002045. [DOI] [PubMed] [Google Scholar]
  48. Milton K. Distribution patterns of tropical plant foods as an evolutionary stimulus to primate mental development. American Anthropologist. 1981;83:534–548. doi: 10.1525/aa.1981.83.3.02a00020. [DOI] [Google Scholar]
  49. Oates JF. The natural history of African colobines. In: Davies G, Oates J, editors. Colobine Monkeys: Their Ecology, Behaviour and Evolution. Cambridge: Cambridge University Press; 1994. pp. 75–128. [Google Scholar]
  50. Ohno S O. Evolution by Gene Duplication. London: Springer-Verlag; 1970. [Google Scholar]
  51. Ollivier M, Tresset A, Bastian F, Lagoutte L, Axelsson E, Arendt ML, Bălăşescu A, Marshour M, Sablin MV, Salanova L, Vigne JD, Hitte C, Hänni C. Amy2B copy number variation reveals starch diet adaptations in ancient european dogs. Royal Society Open Science. 2016;3:160449. doi: 10.1098/rsos.160449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Pajic P, Lin YL, Xu D, Gokcumen O. The psoriasis-associated deletion of late cornified envelope genes LCE3B and LCE3C has been maintained under balancing selection since human Denisovan divergence. BMC Evolutionary Biology. 2016;16:265. doi: 10.1186/s12862-016-0842-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Paudel Y, Madsen O, Megens HJ, Frantz LA, Bosse M, Bastiaansen JW, Crooijmans RP, Groenen MA. Evolutionary dynamics of copy number variation in pig genomes in the context of adaptation and domestication. BMC Genomics. 2013;14:449. doi: 10.1186/1471-2164-14-449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Perry GH, Dominy NJ, Claw KG, Lee AS, Fiegler H, Redon R, Werner J, Villanea FA, Mountain JL, Misra R, Carter NP, Lee C, Stone AC. Diet and the evolution of human amylase gene copy number variation. Nature Genetics. 2007;39:1256–1260. doi: 10.1038/ng2123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Peyrot des Gachons C, Breslin PA. Salivary amylase: digestion and metabolic syndrome. Current Diabetes Reports. 2016;16:102. doi: 10.1007/s11892-016-0794-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Pineda-Munoz S, Alroy J. Dietary characterization of terrestrial mammals. Proceedings of the Royal Society B: Biological Sciences. 2014;281:20141173. doi: 10.1098/rspb.2014.1173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pocock MJO, Searle JB, White PCL. Adaptations of animals to commensal habitats: population dynamics of house mice Mus musculus domesticus on farms. Journal of Animal Ecology. 2004;73:878–888. doi: 10.1111/j.0021-8790.2004.00863.x. [DOI] [Google Scholar]
  58. Poole AC, Goodrich JK, Youngblut ND, Luque GG, Ruaud A, Sutter JL, Waters JL, Shi Q, El-Hadidi M, Johnson LM, Bar HY, Huson DH, Booth JG, Ley RE. Human salivary amylase gene copy number impacts oral and gut microbiomes. Cell Host & Microbe. 2019;25:553–564. doi: 10.1016/j.chom.2019.03.001. [DOI] [PubMed] [Google Scholar]
  59. Rahaman H, Srihari K, Krishnamoorthy RV. Polysaccharide digestion in cheek pouches of the bonnet macaque. Primates. 1975;16:175–180. doi: 10.1007/BF02381415. [DOI] [Google Scholar]
  60. Rambaut A. FigTree. Edinburgh: University of Edinburgh, Institute of Evolutionary Biology; 2012. http://tree.bio.ed.ac.uk/software/figtree/ [Google Scholar]
  61. Reiter T, Jagoda E, Capellini TD. Dietary variation and evolution of gene copy number among dog breeds. PLOS ONE. 2016;11:e0148899. doi: 10.1371/journal.pone.0148899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Roux VL, JL C, Y F, P V. Diet of the house mouse ( Mus musculus ) on Guillou island, Kerguelen Archipelago, subantarctic. Polar Biology. 2002;25:49–57. doi: 10.1007/s003000100310. [DOI] [Google Scholar]
  63. Rowe N, Myers M. All the World’s Primates. Pogonias Press; 2016. [Google Scholar]
  64. Rylands AB, Faria DS. Habitats, feeding ecology, and home range size in the genus Callithrix. In: Rylands A. B, editor. Marmosets and Tamarins: Systematics, Behaviour, and Ecology. Oxford: Oxford Scientific Publications; 1993. pp. 262–269. [Google Scholar]
  65. Samuelson LC, Wiebauer K, Snow CM, Meisler MH. Retroviral and pseudogene insertion sites reveal the lineage of human salivary and pancreatic amylase genes from a single gene during primate evolution. Molecular and Cellular Biology. 1990;10:2513–2520. doi: 10.1128/MCB.10.6.2513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Samuelson LC, Phillips RS, Swanberg LJ. Amylase gene structures in primates: retroposon insertions and promoter evolution. Molecular Biology and Evolution. 1996;13:767–779. doi: 10.1093/oxfordjournals.molbev.a025637. [DOI] [PubMed] [Google Scholar]
  67. Scannapieco FA, Bergey EJ, Reddy MS, Levine MJ. Characterization of salivary alpha-amylase binding to Streptococcus sanguis. Infection and Immunity. 1989;57:2853–2863. doi: 10.1128/iai.57.9.2853-2863.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Schein MW, Orgain H. A preliminary analysis of garbage as food for the Norway rat. The American Journal of Tropical Medicine and Hygiene. 1953;2:1117–1130. doi: 10.4269/ajtmh.1953.2.1117. [DOI] [PubMed] [Google Scholar]
  69. Schibler U, Pittet AC, Young RA, Hagenbüchle O, Tosi M, Gellman S, Wellauer PK. The mouse alpha-amylase multigene family. sequence organization of members expressed in the pancreas, salivary gland and liver. Journal of Molecular Biology. 1982;155:247–266. doi: 10.1016/0022-2836(82)90004-3. [DOI] [PubMed] [Google Scholar]
  70. Sclafani A, Nissenbaum JW, Vigorito M. Starch preference in rats. Neuroscience & Biobehavioral Reviews. 1987;11:253–262. doi: 10.1016/S0149-7634(87)80033-7. [DOI] [PubMed] [Google Scholar]
  71. Singleton GR, Kenney AJ, Tann CR, Nguyen QH O. Rodent Outbreaks: Ecology and Impacts. International Rice Research Institute; 2003. Myth, Dogma and Rodent Management: Good Stories Ruined by Data? [Google Scholar]
  72. Skoglund P, Ersmark E, Palkopoulou E, Dalén L. Ancient wolf genome reveals an early divergence of domestic dog ancestors and admixture into high-latitude breeds. Current Biology. 2015;25:1515–1519. doi: 10.1016/j.cub.2015.04.019. [DOI] [PubMed] [Google Scholar]
  73. Soini P. Ecology and population dynamics of the pygmy marmoset, Cebuella pygmaea. Folia Primatologica. 1982;39:1–21. doi: 10.1159/000156066. [DOI] [PubMed] [Google Scholar]
  74. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Staubach F, Lorenc A, Messer PW, Tang K, Petrov DA, Tautz D. Genome patterns of selection and introgression of haplotypes in natural populations of the house mouse (Mus musculus) PLOS Genetics. 2012;8:e1002891. doi: 10.1371/journal.pgen.1002891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Ting CN, Rosenberg MP, Snow CM, Samuelson LC, Meisler MH. Endogenous retroviral sequences are required for tissue-specific expression of a human salivary amylase gene. Genes & Development. 1992;6:1457–1465. doi: 10.1101/gad.6.8.1457. [DOI] [PubMed] [Google Scholar]
  77. Torgersen TR, Bull EL, Wertz TL. The importance of vegetation, insects, and neonate ungulates in black bear diet in Northeastern Oregon. Northwest Science. 2001;75:244–253. [Google Scholar]
  78. Torregrossa AM, Nikonova L, Bales MB, Villalobos Leal M, Smith JC, Contreras RJ, Eckel LA. Induction of salivary proteins modifies measures of both orosensory and postingestive feedback during exposure to a tannic acid diet. PLOS ONE. 2014;9:e105232. doi: 10.1371/journal.pone.0105232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Usher CL, Handsaker RE, Esko T, Tuke MA, Weedon MN, Hastie AR, Cao H, Moon JE, Kashin S, Fuchsberger C, Metspalu A, Pato CN, Pato MT, McCarthy MI, Boehnke M, Altshuler DM, Frayling TM, Hirschhorn JN, McCarroll SA. Structural forms of the human amylase locus and their relationships to SNPs, haplotypes and obesity. Nature Genetics. 2015;47:921–925. doi: 10.1038/ng.3340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Wrangham RW, Conklin-Brittain NL, Hunt KD. Dietary response of chimpanzees and cercopithecines to seasonal variation in fruit abundance. I. antifeedants. International Journal of Primatology. 1998;19:949–970. [Google Scholar]
  81. Wright PC. The behavior and ecology of the owl monkey. In: Baer J. F, Weller R. E, Kakoma I, editors. Aotus: The Owl Monkey. Academic Press; 1994. pp. 97–112. [Google Scholar]
  82. Xu D, Jaber Y, Pavlidis P, Gokcumen O. VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences. BMC Bioinformatics. 2017;18:426. doi: 10.1186/s12859-017-1844-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Zhang J, Zhang YP, Rosenberg HF. Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey. Nature Genetics. 2002;30:411–415. doi: 10.1038/ng852. [DOI] [PubMed] [Google Scholar]
  84. Zohary D, Hopf M, Weiss E. Domestication of Plants in the Old World: The Origin and Spread of Domesticated Plants in Southwest Asia, Europe, and the Mediterranean Basin. Fourth Edition. Oxford: Oxford University Press; 2012. [Google Scholar]

Decision letter

Editor: George H Perry1
Reviewed by: George H Perry2

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

[Editors’ note: a previous version of this study was rejected after peer review, but the authors submitted for reconsideration. The first decision letter after peer review is shown below.]

Thank you for submitting your work entitled "Amylase copy number analysis in several mammalian lineages reveals convergent adaptive bursts shaped by diet" for consideration by eLife. Your article has been reviewed by four peer reviewers, including George H Perry as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by a Senior Editor.

Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife.

There is general agreement among the reviewers (including me as the reviewing editor) that multiple results within your dataset would be of fundamental interest to the eLife genomics and evolutionary biology readership communities. However, we collectively identified (and shared a consensus opinion about, following consultation) multiple substantial issues with the current version of the analyses presented and the manuscript that preclude our ability to consider your submission for publication at this time. In particular, the phylogenetic and evolutionary analyses presented did not account for gene conversion and phylogenetic non-independence and important data that should be available for inclusion are missing, interpretations and conclusions about the relationship between copy number and salivary amylase expression are (perhaps unnecessarily) over-extended, and there are questions about the accuracy of the cross-species ddPCR approach that are not alleviated by the supplementary QC figure. These and other substantial concerns are detailed in the individual reviews below.

In addition, I want to draw your attention to a different, but also important, type of major limitation in the present manuscript, specifically the (much too limited) amount of methodological and analytical detail provided. This issue frustrated peer experts in these methods and in this area of research and precluded our thorough review of many aspects of the paper. This would be an even bigger problem for a general readership.

Given our interest in the potential of this dataset, I do not want to completely slam the door on consideration of your manuscript in the future at eLife, if you are able to substantially re-work the analyses and manuscript to address the major concerns raised through this process and find that your primary conclusions are supported more robustly. However, we feel that this revision would require an extensive amount of work, and moreover as I mentioned above there are multiple components of even the present version of the paper that we could not yet assess fully. Thus any future submission would be considered an entirely new manuscript, with high expectations at the editorial review stage and no guarantee of full review. Regardless of how you choose to proceed, we hope that the detailed comments provided below are helpful for your next round of revision of this interesting dataset, and we look forward to seeing the ultimate outcome!

Reviewer #1:

This manuscript from Pajic et al. presents a broad survey of amylase gene (AMY; the protein products of AMY genes help digest dietary starch) copy number variation among mammals (153 individuals from a total of 44 species), alongside salivary amylase protein expression level data for a subset of those taxa (118 individuals from a total of 20 species). The paper extends previous datasets available for humans and dogs to identify the intriguing result of widely recurrent AMY copy number gains in the genomes of species with relatively higher levels of dietary starch compared to related taxa (e.g. pigs, mice and rats, capuchins, cercopithecine monkeys, humans), although there are limitations associated with the absence of equivalent diet data for all of the species tested. Still, this broader pattern will likely be robust to necessary revisions to the evolutionary analysis and other aspects of the manuscript, although some other technical issues could undermine the underlying data depending on answers to some of the below questions about the experimental design.

Essential revisions:

1) The accuracy of the digital droplet PCR method to estimate amylase copy number across this broad range of species is critical. In the brief Materials and methods, it is stated "For primer design we targeted amylase exonic sequences that are conserved among copies and between species." But 11 different primer sets were used, and of course even within species groups (and among gene family copies within species, e.g. AMY1 vs. AMY2 in apes) sharing the same primer set varying levels of sequence divergence at these primer sites is expected. How does this impact the results? The methods for confirming the accuracy of this approach (other than reference to Supplementary Figure 4) are absent, and even for the analysis of Supplementary Figure 4 I suspect that the included species might be biased towards those from which the primer sequences for each group were designed in the first place, keeping this from being a true assessment.

2) The premise of the phylogenetic analysis of amylase coding region sequences to conclude that amylase duplications occurred independently within each lineage with duplications rather than being an ancestral trait (versus gene loss in some species instead) does not consider either gene conversion or ultra-high rates of NAHR, either of which (or both in combination) could obliterate any long-term phylogenetic signals in these data. I'm not sure that this is resolvable. While I agree that independent duplication events are the most likely scenario, and this is something that could be discussed as such, I don't think the authors' analyses or interpretations should be reliant on this demonstration. That is, the pattern is still evolutionarily interesting even if it is functional constraint to maintain higher copy numbers in lineages with higher levels of dietary starch, with losses in other lineages.

That said, I thought that the mouse vs. human retrotransposon result was convincing, and the rat versus mouse results may be as well, although this result needs more description and explanation in the Results text. The data/logic for the dog and pig/boar results are not provided (and again the text here seems to suggest that dogs and wolves diverged only 5000 years ago, which is incorrect), which needs to be addressed. This all can be presented as part of the 'We believe the most likely explanation for these observations are repeated, independent duplication events in each lineage… however, we cannot exclude… regardless, of evolutionary interest" text in the revised approach to this discussion.

3) In general (as mentioned directly above, but this is actually a wide problem and a big issue) there is insufficient detail presented on the experimental results and analyses in the paper. While this issue can readily be addressed in revision, it means that reviewers may not be able to identify some problems from this version of the manuscript.

4) The finding of salivary amylase expression in some dog breeds is definitely interesting. However, I felt that the conclusions of AMY duplication 'bursts' necessarily being linked to the production of salivary amylase was too strong based on the data presented, and again seemingly not necessarily for this to be an interesting and valuable paper.

5) Why are non-human primates analyzed separately from the non-primate mammal data?

6) In the evolutionary analyses, some sort of approach that takes phylogenetic relationships into account should be used. Also, I understand the limitations with differentially available dietary starch intake data among species… however, could a subset of the dataset be formally analyzed in this framework, i.e. considering the species for which roughly equivalent dietary data are available?

7) Details were insufficient for me to evaluate the AMY copy number simulation results, although it is difficult for me to imagine the parameters of the simulation exercise being able to accurately model mutation rates (and especially given that rates of NAHR once duplication has occurred are expected to be strikingly higher than the rate of initial duplication of a locus).

Reviewer #2:

In this paper, employing digital PCR experiments to assess amylase copy number across a diverse range of mammalian lineages, Pajic et al. identify higher copy number within certain starch-consuming lineages. Additionally, they perform assays to measure salivary amylase activity and coalescent simulations to model neutral copy number variant evolution.

This work is performed in the context of known amylase gene family expansions in human, rodents, and dog, so the novelty here is the comprehensive nature with which sampling and analysis took place. The data presented here is confirmatory of the previously known amylase gene family expansions without addressing some relevant previous work, notably positive selection identified to be acting on amylase loci in mouse and dog (Staubach et al., 2012, Reiter et al., 2016).

The possibility of gene conversion and the effect this could have on the data and inferences presented here is not discussed. There is evidence of gene conversion within this locus in human (Groot et al., 1990) and only the scenario of numerous gene duplication events and one gene loss event is presented in the section 'Evolution of amylase in primates'. If gene conversion is taking place, then duplicates may appear to be younger and more lineage-specific. Alternative scenarios are not discussed as other explanations of the data observed. Fewer and older duplication events could have occurred, followed by more than one gene loss event and this history is masked by lineage-specific similarity as a consequence of gene conversion. Additionally, ancestral duplication polymorphisms, incomplete lineage sorting and gene conversion is a possibility that is not adequately dismissed with the data and arguments presented.

The maximum likelihood phylogenetic tree constructed using amylase protein sequences translated from reference genomes (Figure 2A) shows species-specific clustering of sequences. The phylogenetic tree would be more informative if it also including sequences from species where increases in copy number have not occurred and was rooted with an outgroup, which are both readily available.

The authors state that "amylase gene copy numbers in species correlate well with observable enzymatic activity in saliva (Figure 3C)" and additionally state that "the copy annotated as mouse AMY1 (Figure 2) is expressed in salivary glands, and is likely responsible for salivary expression of amylase in mice, while the other amylase duplicates have a negligible expression in salivary gland tissue (Figure 2—figure supplement 2)." However, they do not adequately reconcile these two statements. If only one copy in mouse is meaningfully expressed in saliva, what does that mean for salivary expression correlation with copy number? All but one copy have no impact on salivary activity so, for mouse at least, a relationship between salivary expression level and number of copies is not direct. Perhaps salivary expression level is correlated with a starch-rich diet and a starch-rich diet is also correlated with pancreatic expression. Increased pancreatic expression could be facilitated through gene duplication and increased salivary expression through enhancers? Discussing these alternative means of achieving increased expression in different tissues given the data observed would be welcome.

Reviewer #3:

In "Amylase copy number analysis in several mammalian lineages reveals convergent adaptive bursts shaped by diet" Pajic et al. examine amylase copy number across the mammalian phylogeny. The authors observe that amylase copy number appears to be higher in species that have high-start diets than those that do not, and that this correlates with levels of salivary amylase. Although these are some compelling observations, the authors claims of "adaptive bursts" and convergence in the title are both unjustified in my opinion.

Regarding "adaptive bursts": this is largely supported by the authors evidence of independent origins of amylase copies. However, the protein sequence tree shown in Figure 2 could be effected by interlocus gene conversion, which will cause copies in the same species to appear more closely related. Thus we cannot be certain of the extent of the "burstiness" of the duplication events. The authors have also performed simulations of gene family evolution under various mutation rates, but I found this analysis rather simplistic (e.g. no difference in gain and loss rates allowed), qualitative (no model fits are formally assessed), and incomplete (no alternative models including natural selection are considered).

On a related note, in Figure 5, it is not clear to me how the authors placed the locations of duplication events across the phylogeny, and how confident we can be in these placements regardless of whether gene conversion is taken into account. Overall I would say that while the authors' observations are interesting and highly suggestive, there is no direct evidence for positive selection in these data, and thus the language about adaptation sprinkled throughout the manuscript should be removed or rephrased.

Second, the evidence for convergent evolution is limited. The strongest example of convergence that the authors give is in the third paragraph of the subsection “Recurrent amylase copy number gains in multiple mammalian lineages”, where they describe that in both humans and mice there are tandem arrays of amylase genes with a flanking transposable element, suggesting the possibility of repeat-induced expansion of this gene family in both species. However, in mouse these copies do not encode a salivary amylase according to Figure 2. Thus this example provides no support for convergence related to salivary digestion.

Third, the authors examine in Figures 3 and 4 evidence of the relationship between amylase copy number and salivary amylase activity. However as far as I can tell the authors have not assessed the strength of this relationship statistically in a manner that accounts for phylogenetic relatedness among data points. Moreover, to me the authors' results imply that the amylase copy numbers are cannot explain a sizable portion of the variation in amylase activity among mammals. This is demonstrated in Figure 4, where the authors show amylase copy number along a linear scale and salivary amylase activity in log scale. For amylase copy number the difference between the "specialized" and "broad range" diet classes spans about a factor of 2- or 3-fold. For salivary amylase, the difference is about 2-3 orders of magnitude. Assuming that copy number has at best a linear effect on protein dosage, this result strongly argues against copy number having a very important role in salivary amylase activity in mammals.

Reviewer #4:

The authors seek to understand whether the evolution of additional AMY genes occurred independently in different mammalian lineages, and whether the driving force behind AMY gene expansion was a starch-rich diet. To do this, they use a multi-pronged approach to investigate their research questions, including enzymatic assays, ddPCR to measure AMY gene copy numbers, and estimating copy numbers from reference genomes. The resulting dataset is extensive, with data on salivary amylase expression and AMY copy number variation in a large sample of mammals, significantly expanding our current knowledge about the evolution of salivary and pancreatic amylase, as well as the link between digestive enzyme evolution and diet more broadly. The manuscript is well-written and the figures effectively communicate the findings.

The AMY gene copy numbers are quite consistent between ddPCR and reference genome estimates, providing confidence in the authors' findings regarding copy number variation.

Interestingly, the authors find salivary amylase expression in mammals that were previously found not to express amylase in their saliva, including New World primates and dogs. Given that previous research did not find evidence for salivary amylase expression in these species (Perry et al., 2007; Axelsson et al., 2013), I think some skepticism is warranted. The authors state that dogs, for example, express "substantial amounts" of amylase in their saliva, however, based on the data in Supplementary file 1, amylase activity was quite low in most dogs compared to humans and rodents. Similarly, amylase activity in capuchins, pigs, and boars seemed quite low compared to catarrhines and rodents, making me wonder if some of the amylase "activity" may have been contamination or a basic level of activity that is always present. Was a negative control used? If so, it would be helpful to include the activity measures obtained for the negative control in Supplementary file 1. If a negative control was not used, the authors should repeat the assays with one.

In the Results, the authors state that salivary amylase activity varied across different dog breeds and refer the reader to Supplementary file 1. I was curious to see which dog breeds have higher salivary amylase activity, however, Supplementary file 1 does not include information on the different dog breeds included in the sample. I think this information should be included and the differences might be worthy of additional discussion. Do breeds associated with high-starch human populations have higher salivary amylase activity, for example?

[Editors’ note: what now follows is the decision letter after the authors submitted for further consideration.]

Thank you for submitting your article "Independent amylase gene copy number bursts correlate with dietary preferences in mammals" for consideration by eLife. Your article has been reviewed by four peer reviewers, including George H Perry as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Diethard Tautz as the Senior Editor.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

The reviewers commend your efforts to revise your manuscript in response to our original reviews. The manuscript has been re-worked substantially, to positive effect. The reviewers felt that your primary conclusion that there have been bursts of amylase gene duplication in multiple lineages independently was now supported more strongly. Overall, the combination of your careful molecular genetics work on this complex locus alongside the protein expression measurements across a diversity of mammals is impressive, providing a substantial new advance into the evolutionary biology and evolutionary ecology of the amylase gene family, with broader molecular biology insights related to gene duplication and tissue expression pattern processes. We do collectively share some concerns about the manuscript that we ask you to address thoroughly.

Essential revisions:

1) Clarify your methods for grouping broad-range diet mammals into lower-starch and higher-starch categories. We sympathize with the challenges of obtaining equivalent food consumption data for all species, but greatly increased detail is necessary here. This analysis should be fully transparent and replicable.

2) The simulations as presented are uncompelling. If the authors choose to maintain this component of the manuscript, then the simulation approach and mechanics, process of parameter selection (using a range of values encompassing extremes of reasonable possibility, with clear justifications), and results all need to be clarified and detailed much more extensively. Is the diet information included in the model (i.e. asking how often neutral processes can explain the observed pattern whereby species with higher-starch diets are much more likely to have duplication bursts)?

3) Explicitly present a more complete/clarified model (or models) for amylase duplication and the gain of salivary amylase that could produce the observations in the paper. Some thoughts along these lines: If the model is simply that duplication must occur prior to the acquisition of salivary expression, then this is essentially a model of neofunctionalization (here, convergent in multiple mammal lineages for which starch is a substantial part of the diet). This model would posit that a single amylase copy cannot do the job of pancreatic and salivary amylase, perhaps due to changes to the amino acid sequence required for proper amylase activity (this is unlikely?), or instead that something about the regulatory structure of amylase prevents expression of a single copy in both the pancreas and in salivary glands. Also, the authors' results and interpretation now implies that just one new gene copy is required for the acquisition of sufficient salivary amylase activity, so the presence of additional copies in some species should be explicitly addressed as part of this discussion. We recommend creating a clearly defined section (e.g. 'Model for amylase gene locus evolution' or similar) in which this discussion can be concisely presented (note that a few relevant existing statements along these lines can be moved and modified from other sections), without needing to be conclusive. Opportunities for future investigation can be noted.

eLife. 2019 May 14;8:e44628. doi: 10.7554/eLife.44628.018

Author response


[Editors’ note: the author responses to the first round of peer review follow.]

There is general agreement among the reviewers (including me as the reviewing editor) that multiple results within your dataset would be of fundamental interest to the eLife genomics and evolutionary biology readership communities. However, we collectively identified (and shared a consensus opinion about, following consultation) multiple substantial issues with the current version of the analyses presented and the manuscript that preclude our ability to consider your submission for publication at this time. In particular, the phylogenetic and evolutionary analyses presented did not account for gene conversion and phylogenetic non-independence and important data that should be available for inclusion are missing, interpretations and conclusions about the relationship between copy number and salivary amylase expression are (perhaps unnecessarily) over-extended, and there are questions about the accuracy of the cross-species ddPCR approach that are not alleviated by the supplementary QC figure. These and other substantial concerns are detailed in the individual reviews below.

In addition, I want to draw your attention to a different, but also important, type of major limitation in the present manuscript, specifically the (much too limited) amount of methodological and analytical detail provided. This issue frustrated peer experts in these methods and in this area of research and precluded our thorough review of many aspects of the paper. This would be an even bigger problem for a general readership.

Given our interest in the potential of this dataset, I do not want to completely slam the door on consideration of your manuscript in the future at eLife, if you are able to substantially re-work the analyses and manuscript to address the major concerns raised through this process and find that your primary conclusions are supported more robustly. However, we feel that this revision would require an extensive amount of work, and moreover as I mentioned above there are multiple components of even the present version of the paper that we could not yet assess fully. Thus any future submission would be considered an entirely new manuscript, with high expectations at the editorial review stage and no guarantee of full review. Regardless of how you choose to proceed, we hope that the detailed comments provided below are helpful for your next round of revision of this interesting dataset, and we look forward to seeing the ultimate outcome!

We are grateful for your careful consideration of our paper and your thoughtful and thorough comments. We took them to heart and conducted several additional analyses to address each of these concerns. We also want to apologize for the lack of details in our originally submitted Materials and methods section. We now thoroughly revised and expanded our Materials and methods section. We also included a new Materials and methods section to include more specific details concerning our approach. Furthermore, to better reflect the revised content of this manuscript, we modified the main title to “Independent amylase gene copy number bursts correlate with dietary preferences in mammals”.

Reviewer #1:

Essential revisions:

1) The accuracy of the digital droplet PCR method to estimate amylase copy number across this broad range of species is critical. In the brief Materials and methods, it is stated "For primer design we targeted amylase exonic sequences that are conserved among copies and between species." But 11 different primer sets were used, and of course even within species groups (and among gene family copies within species, e.g. AMY1 vs. AMY2 in apes) sharing the same primer set varying levels of sequence divergence at these primer sites is expected. How does this impact the results? The methods for confirming the accuracy of this approach (other than reference to Supplementary Figure 4) are absent, and even for the analysis of Supplementary Figure 4 I suspect that the included species might be biased towards those from which the primer sequences for each group were designed in the first place, keeping this from being a true assessment.

The reviewer raises a salient point which we had actually considered when originally designing our primers. We apologize that in our original submission we did not thoroughly explain our strategy in primer and probe design. We now added more information to the Materials and methods section and provide a detailed description of the primer design and choices of probes for the various mammalian species investigated. We also expanded Supplementary file 2to now show the degree to which each given primer pair matches the targeted sequence in the different species. We further provide a new supplementary figure (Figure 1—figure supplement 1) to explain how we designed primers for species where no reference genomes were available. For such cases, we inferred the sequences from reference genomes of phylogenetically most closely related species. We now also completed direct genotyping of AMY gene copy numbers by ddPCR for those samples to show a direct measurement of copy number and salivary activity of amylase in all samples in Figure 3C. We now believe that our current dataset is robust enough to support our main conclusions.

2) The premise of the phylogenetic analysis of amylase coding region sequences to conclude that amylase duplications occurred independently within each lineage with duplications rather than being an ancestral trait (versus gene loss in some species instead) does not consider either gene conversion or ultra-high rates of NAHR, either of which (or both in combination) could obliterate any long-term phylogenetic signals in these data. I'm not sure that this is resolvable. While I agree that independent duplication events are the most likely scenario, and this is something that could be discussed as such, I don't think the authors' analyses or interpretations should be reliant on this demonstration. That is, the pattern is still evolutionarily interesting even if it is functional constraint to maintain higher copy numbers in lineages with higher levels of dietary starch, with losses in other lineages.

That said, I thought that the mouse vs. human retrotransposon result was convincing, and the rat versus mouse results may be as well, although this result needs more description and explanation in the Results text. The data/logic for the dog and pig/boar results are not provided (and again the text here seems to suggest that dogs and wolves diverged only 5000 years ago, which is incorrect), which needs to be addressed. This all can be presented as part of the 'We believe the most likely explanation for these observations are repeated, independent duplication events in each lineage… however, we cannot exclude… regardless, of evolutionary interest" text in the revised approach to this discussion.

We thank the reviewer for this comment that was also pointed out by the other reviewers. To further assess the occurrence of independent gene duplications, we now expanded the retrotransposition analysis that we conducted in mouse and human genomes to pig, dog, and rat genomes. Notably, we found that lineage-specific L1 retrotransposons accompany amylase gene copy number gains in these species (see new Supplementary file 3and revised Figure 2B).This conceptual aspect of our approach is now described in the revised Results section and the technical details are provided in the main Materials and methods section. Overall, our analyses suggest that independent, lineage-specific gene duplications indeed occurred. However, we agree with the reviewer that we cannot fully exclude the role of other mechanisms at play in shaping the genetic variation in this locus. Thus, in our interpretation of these data in the Results and Discussion sections, we mentioned that additional mechanisms (e.g. gene conversion, incomplete lineage sorting, crossover events, and ancestral gene duplication polymorphisms) may also have contributed to the observed genetic variation in the amylase locus.

3) In general (as mentioned directly above, but this is actually a wide problem and a big issue) there is insufficient detail presented on the experimental results and analyses in the paper. While this issue can readily be addressed in revision, it means that reviewers may not be able to identify some problems from this version of the manuscript.

We apologize for the lack of detail in the previous version of our manuscript. We have now substantially revised both the Results and Materials and methods sections to include detailed information about experimental results and analyses. As we mentioned above, we also added a new Materials and methods section.

4) The finding of salivary amylase expression in some dog breeds is definitely interesting. However, I felt that the conclusions of AMY duplication 'bursts' necessarily being linked to the production of salivary amylase was too strong based on the data presented, and again seemingly not necessarily for this to be an interesting and valuable paper.

We agree with the reviewer. In fact, this same point was raised by the other reviewers as well. We found that all the species that express amylase in their saliva harbor at least one additional copy of the amylase gene. This could suggest that a gene duplication event may be a necessary first step for amylase to be expressed in salivary glands. As this reviewer correctly noted, we were not able to demonstrate a direct correlation between AMY gene copy numbers and enzymatic activity in saliva. We now revised our Results section to tone down our initially over-reaching conclusion.

5) Why are non-human primates analyzed separately from the non-primate mammal data?

We decided to dedicate a section to primates simply because we have better reference genomes, broader sampling, and accompanying dietary data for this branch of the mammalian tree. For example, we were able to detect more subtle changes, potentially driven by diet, in primates as compared to non-primate mammals in gene copy number and salivary expression (Figure 5). We felt that this warranted a separate section. We now clarified this reasoning at the beginning of the “Evolution of amylase in primates” section.

6) In the evolutionary analyses, some sort of approach that takes phylogenetic relationships into account should be used. Also, I understand the limitations with differentially available dietary starch intake data among species… however, could a subset of the dataset be formally analyzed in this framework, i.e. considering the species for which roughly equivalent dietary data are available?

The reviewer is right. To address this, we have now conducted an independent phylogenetic contrasts analysis (new Figure 4C). This analysis, which puts into consideration the phylogenetic distances, still showed that the trends we observed based on direct comparison remain robust. In addition, we used information found in the literature to update our analysis with the predicted starch consumption of the animals we tested. Specifically, we now included additional columns in Figures 4A and 4B to contrast high and low starch consuming species.

7) Details were insufficient for me to evaluate the AMY copy number simulation results, although it is difficult for me to imagine the parameters of the simulation exercise being able to accurately model mutation rates (and especially given that rates of NAHR once duplication has occurred are expected to be strikingly higher than the rate of initial duplication of a locus).

We have now added a detailed explanation of our modeling approach in the new Materials and methods. In addition, we have now updated our modeling approach with additional parameters to reflect expected mutational mechanisms. For example, we considered, as the reviewer suggested, that the mutation rate of a single copy state is expected to be significantly lower than at a state with two or more copies where nonallelic homology is established.

Reviewer #2:

In this paper, employing digital PCR experiments to assess amylase copy number across a diverse range of mammalian lineages, Pajic et al. identify higher copy number within certain starch-consuming lineages. Additionally, they perform assays to measure salivary amylase activity and coalescent simulations to model neutral copy number variant evolution.

This work is performed in the context of known amylase gene family expansions in human, rodents, and dog, so the novelty here is the comprehensive nature with which sampling and analysis took place. The data presented here is confirmatory of the previously known amylase gene family expansions without addressing some relevant previous work, notably positive selection identified to be acting on amylase loci in mouse and dog (Staubach et al., 2012, Reiter et al., 2016).

We thank the reviewer for directing us to these relevant publications. In our revised Introduction section, we now explicitly mention their findings describing potential adaptive evolution of the amylase locus in house mice and dogs.

The possibility of gene conversion and the effect this could have on the data and inferences presented here is not discussed. There is evidence of gene conversion within this locus in human (Groot et al., 1990) and only the scenario of numerous gene duplication events and one gene loss event is presented in the section 'Evolution of amylase in primates'. If gene conversion is taking place, then duplicates may appear to be younger and more lineage-specific. Alternative scenarios are not discussed as other explanations of the data observed. Fewer and older duplication events could have occurred, followed by more than one gene loss event and this history is masked by lineage-specific similarity as a consequence of gene conversion. Additionally, ancestral duplication polymorphisms, incomplete lineage sorting and gene conversion is a possibility that is not adequately dismissed with the data and arguments presented.

To provide more evidence for lineage-specific duplications in the AMY locus, we expanded our analyses of the lineage-specific retrotransposons in this locus in pigs, rats, and dogs as described in our response to reviewer #1 (revised Figure 2B). However, the reviewer is right in pointing out the potential roles of other mechanisms. In our revised manuscript, we now mention gene conversion, ancestral duplication polymorphisms, and incomplete lineage sorting, as additional factors in shaping variation in this locus.

The maximum likelihood phylogenetic tree constructed using amylase protein sequences translated from reference genomes (Figure 2A) shows species-specific clustering of sequences. The phylogenetic tree would be more informative if it also including sequences from species where increases in copy number have not occurred and was rooted with an outgroup, which are both readily available.

We thank the reviewer for this helpful advice. We now included outgroup species in our analysis as suggested by the reviewer. Since adding these species to our main Figure 2A would have made this tree too voluminous, we chose to show this larger tree in a new Figure 2—figure supplement 1instead. The results of this analysis did not change our conclusions based on the data presented in Figure 2A.

The authors state that "amylase gene copy numbers in species correlate well with observable enzymatic activity in saliva (Figure 3C)" and additionally state that "the copy annotated as mouse AMY1 (Figure 2) is expressed in salivary glands, and is likely responsible for salivary expression of amylase in mice, while the other amylase duplicates have a negligible expression in salivary gland tissue (Figure S2)." However, they do not adequately reconcile these two statements. If only one copy in mouse is meaningfully expressed in saliva, what does that mean for salivary expression correlation with copy number? All but one copy have no impact on salivary activity so, for mouse at least, a relationship between salivary expression level and number of copies is not direct. Perhaps salivary expression level is correlated with a starch-rich diet and a starch-rich diet is also correlated with pancreatic expression. Increased pancreatic expression could be facilitated through gene duplication and increased salivary expression through enhancers? Discussing these alternative means of achieving increased expression in different tissues given the data observed would be welcome.

We completely agree with the reviewer and apologize for not having discussed this aspect of our results adequately. As already stated in our above response to Reviewer #1 we found no species with salivary enzymatic activity that does not have an AMY gene duplication. This suggests that duplication may be a necessary first step towards salivary gland-specific expression. Overall, however, there is no linear correlation between AMY gene copy number and salivary expression. We now describe these findings more carefully and discuss them in the revised Results and Conclusion sections.

Reviewer #3:

In "Amylase copy number analysis in several mammalian lineages reveals convergent adaptive bursts shaped by diet" Pajic et al. examine amylase copy number across the mammalian phylogeny. The authors observe that amylase copy number appears to be higher in species that have high-start diets than those that do not, and that this correlates with levels of salivary amylase. Although these are some compelling observations, the authors claims of "adaptive bursts" and convergence in the title are both unjustified in my opinion.

Regarding "adaptive bursts": this is largely supported by the authors evidence of independent origins of amylase copies. However, the protein sequence tree shown in Figure 2 could be effected by interlocus gene conversion, which will cause copies in the same species to appear more closely related. Thus we cannot be certain of the extent of the "burstiness" of the duplication events.

The reviewer is right and echoes the other reviewers’ concerns. As described above in our responses to reviewer #1 and reviewer #2, we now expanded our analyses of the lineage-specific retrotransposons associated with amylase gene copies in this locus in pigs, rats, and dogs (revised Figure 2B). This analysis provided more evidence that gene copy number expansions occurred in a lineage-specific manner. Furthermore, we now discuss gene conversion as a potential additional factor shaping variation in this locus and toned down our argument with regards to the “bursts” of amylase copies.

The authors have also performed simulations of gene family evolution under various mutation rates, but I found this analysis rather simplistic (e.g. no difference in gain and loss rates allowed), qualitative (no model fits are formally assessed), and incomplete (no alternative models including natural selection are considered).

We agree with the reviewer. We now expanded the description of our simulation results and added additional mutation rates. We also allowed dynamic copy number gain and loss rates in our revised simulations (now discussed in detail in the new Materials and methods section). However, since we have insufficient knowledge of the adaptive forces that may have acted on the amylase locus, we refrained from simulating alternative adaptive scenarios. We also toned down our formerly perhaps too assertive statements throughout the manuscript, including in the Title and the Abstract.

On a related note, in Figure 5, it is not clear to me how the authors placed the locations of duplication events across the phylogeny, and how confident we can be in these placements regardless of whether gene conversion is taken into account. Overall I would say that while the authors' observations are interesting and highly suggestive, there is no direct evidence for positive selection in these data, and thus the language about adaptation sprinkled throughout the manuscript should be removed or rephrased.

We agree with the reviewer on both accounts. We now discuss the caveats of interpreting the data shown in Figure 5in the Results section. In addition, we toned down our arguments throughout the manuscript with regards to adaptive forces shaping the copy number state and enzymatic activity of amylase in saliva. As a consequence of that, we also removed the phrase “convergent adaptive bursts” from the title of our revised manuscript and replaced it with the word “independent”.

Second, the evidence for convergent evolution is limited. The strongest example of convergence that the authors give is in the third paragraph of the subsection “Recurrent amylase copy number gains in multiple mammalian lineages”, where they describe that in both humans and mice there are tandem arrays of amylase genes with a flanking transposable element, suggesting the possibility of repeat-induced expansion of this gene family in both species. However, in mouse these copies do not encode a salivary amylase according to Figure 2. Thus this example provides no support for convergence related to salivary digestion.

We thank the reviewer for raising these questions which made us rethink and better clarify our interpretation of the data. In addition, we were able to find specific retrotransposons flanking rat, pig, and dog amylase gene copies, thereby providing lineage-specific signatures that further support our hypothesis that AMY gene copy number expansions occurred independently in different mammalian lineages. We also agree that enzymatic activity of amylase in saliva is not directly correlated with the number of AMY gene copies. Nevertheless, it is important to note that in all the species analyzed, salivary expression of amylase always went along with an amylase gene duplication event. We never intended to claim that the retrotransposon insertions by themselves lead to the expression of a given amylase copy in salivary glands. We also acknowledge that we do not have direct evidence for convergence of the salivary expression trends and, thus, removed this argument from our revised manuscript altogether.

Third, the authors examine in Figures 3 and 4 evidence of the relationship between amylase copy number and salivary amylase activity. However as far as I can tell the authors have not assessed the strength of this relationship statistically in a manner that accounts for phylogenetic relatedness among data points. Moreover, to me the authors' results imply that the amylase copy numbers are cannot explain a sizable portion of the variation in amylase activity among mammals. This is demonstrated in Figure 4, where the authors show amylase copy number along a linear scale and salivary amylase activity in log scale. For amylase copy number the difference between the "specialized" and "broad range" diet classes spans about a factor of 2- or 3-fold. For salivary amylase, the difference is about 2-3 orders of magnitude. Assuming that copy number has at best a linear effect on protein dosage, this result strongly argues against copy number having a very important role in salivary amylase activity in mammals.

The reviewer is right. In fact, the same point was raised by the other reviewers as well. In our analyses, we found no species with salivary expression above the basal levels that do not also have an AMY gene duplication. This suggests that duplication may be a necessary step towards the gain of expression in salivary glands. Beyond that, however, as the reviewer pointed out, there is no correlation across species between the number of AMY gene copies and the enzymatic activity of amylase in saliva. We now revised the Results section to reflect these insights.

Reviewer #4:

The authors seek to understand whether the evolution of additional AMY genes occurred independently in different mammalian lineages, and whether the driving force behind AMY gene expansion was a starch-rich diet. To do this, they use a multi-pronged approach to investigate their research questions, including enzymatic assays, ddPCR to measure AMY gene copy numbers, and estimating copy numbers from reference genomes. The resulting dataset is extensive, with data on salivary amylase expression and AMY copy number variation in a large sample of mammals, significantly expanding our current knowledge about the evolution of salivary and pancreatic amylase, as well as the link between digestive enzyme evolution and diet more broadly. The manuscript is well-written and the figures effectively communicate the findings.

The AMY gene copy numbers are quite consistent between ddPCR and reference genome estimates, providing confidence in the authors' findings regarding copy number variation.

Interestingly, the authors find salivary amylase expression in mammals that were previously found not to express amylase in their saliva, including New World primates and dogs. Given that previous research did not find evidence for salivary amylase expression in these species (Perry et al., 2007; Axelsson et al., 2013), I think some skepticism is warranted. The authors state that dogs, for example, express "substantial amounts" of amylase in their saliva, however, based on the data in Supplementary file 1, amylase activity was quite low in most dogs compared to humans and rodents. Similarly, amylase activity in capuchins, pigs, and boars seemed quite low compared to catarrhines and rodents, making me wonder if some of the amylase "activity" may have been contamination or a basic level of activity that is always present. Was a negative control used? If so, it would be helpful to include the activity measures obtained for the negative control in Supplementary file 1. If a negative control was not used, the authors should repeat the assays with one.

We agree with this reviewer in that our description of amylase activities was too simplistic and may have been prone to misunderstanding. As a control, we now measured the amylase enzymatic activity in serum of several species as a background value. We now include this information as a threshold value, depicted as a dotted line in revised Figure 3C. Thus, only those values of salivary amylase activity that range above this threshold are considered as positive. Based on this, there are indeed some dog breeds/samples, that have to be considered “negative” for salivary amylase. However, others range clearly above the cut-off line, implicating that expression of amylase in dog saliva is not universal and might have occurred only in certain breeds. To further aid in the visual interpretation of the data, we added images of the lysis diameters corresponding to salivary amylase activity in the starch lysis plate assay next to the corresponding activity values in the y-axis of revised Figure 3C.

In the Results, the authors state that salivary amylase activity varied across different dog breeds and refer the reader to Supplementary file 1. I was curious to see which dog breeds have higher salivary amylase activity, however, Supplementary file 1 does not include information on the different dog breeds included in the sample. I think this information should be included and the differences might be worthy of additional discussion. Do breeds associated with high-starch human populations have higher salivary amylase activity, for example?

We agree with the reviewer that this is an interesting observation and that the adaptation to starch-rich diets in dogs (similar to the observations in humans) is worthy of further investigation. We now included the dog breeds in our revised Supplementary file 1.Our sample size, however, does not allow to conclusively test the correlation between different breeds and their corresponding salivary amylase levels. Further, we believe that more recent intentional breeding may have confounded the effect of starch intake that might have originally affected salivary amylase expression at a time when dogs became followers of human groups. These questions warrant a separate, much larger study solely focusing on dogs. We now discuss these thoughts in the Results section.

[Editors' note: the author responses to the re-review follow.]

Summary:

The reviewers commend your efforts to revise your manuscript in response to our original reviews. The manuscript has been re-worked substantially, to positive effect. The reviewers felt that your primary conclusion that there have been bursts of amylase gene duplication in multiple lineages independently was now supported more strongly. Overall, the combination of your careful molecular genetics work on this complex locus alongside the protein expression measurements across a diversity of mammals is impressive, providing a substantial new advance into the evolutionary biology and evolutionary ecology of the amylase gene family, with broader molecular biology insights related to gene duplication and tissue expression pattern processes. We do collectively share some concerns about the manuscript that we ask you to address thoroughly.

We thank the editors and the reviewers for their thorough revision of our manuscript and the constructive comments. We now reworked our conclusion section to add a working model of the amylase locus evolution, removed the simulation component from this manuscript, and added a comprehensive literature review on starch consumption of mammals in the Materials and methods section. We also added a new figure (Figure 6) illustrating our current working model for the evolution of the amylase locus to accompany the conclusion section. We have addressed all the comments and believe that our revised manuscript has now improved substantially. Below we provide a point-by-point response to both “Essential Revisions”, as well as to individual reviewer comments. Please note that for ease of assessing the changes, we have highlighted them in the manuscript with yellow color.

Essential revisions:

1) Clarify your methods for grouping broad-range diet mammals into lower-starch and higher-starch categories. We sympathize with the challenges of obtaining equivalent food consumption data for all species, but greatly increased detail is necessary here. This analysis should be fully transparent and replicable.

To address this concern, we have now conducted a comprehensive literature review on the diets of the species that we use in constructing Figures 4A and 4B. We now included this review in the Materials and methods section of the manuscript under the “Categorization of starch consumption” subsection. In this subsection, we clearly laid out our reasoning as to how we placed the mammals into lower- and higher-starch consuming categories.

2) The simulations as presented are uncompelling. If the authors choose to maintain this component of the manuscript, then the simulation approach and mechanics, process of parameter selection (using a range of values encompassing extremes of reasonable possibility, with clear justifications), and results all need to be clarified and detailed much more extensively. Is the diet information included in the model (i.e. asking how often neutral processes can explain the observed pattern whereby species with higher-starch diets are much more likely to have duplication bursts)?

After considering the editors’ and reviewers’ comments, we agree that the simulation data at this point do not add much to our conclusions and need substantial improvement. Thus, we decided to remove the simulation component from the manuscript.

3) Explicitly present a more complete/clarified model (or models) for amylase duplication and the gain of salivary amylase that could produce the observations in the paper. Some thoughts along these lines: If the model is simply that duplication must occur prior to the acquisition of salivary expression, then this is essentially a model of neofunctionalization (here, convergent in multiple mammal lineages for which starch is a substantial part of the diet). This model would posit that a single amylase copy cannot do the job of pancreatic and salivary amylase, perhaps due to changes to the amino acid sequence required for proper amylase activity (this is unlikely?), or instead that something about the regulatory structure of amylase prevents expression of a single copy in both the pancreas and in salivary glands. Also, the authors' results and interpretation now implies that just one new gene copy is required for the acquisition of sufficient salivary amylase activity, so the presence of additional copies in some species should be explicitly addressed as part of this discussion. We recommend creating a clearly defined section (e.g. 'Model for amylase gene locus evolution' or similar) in which this discussion can be concisely presented (note that a few relevant existing statements along these lines can be moved and modified from other sections), without needing to be conclusive. Opportunities for future investigation can be noted.

This was an important suggestion for which we are grateful. We now renamed our conclusion section as “Conclusion and outlook: a working model explaining how the amylase locus evolved” and substantially revised and expanded this section to include a logical working model of evolution of the amylase locus. To support our explanation, we also included a new figure (Figure 6).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Primary datasets used in our study including amylase enzymatic activity in saliva and gene copy numbers across mammalian species (tab 1: data used for Figures 3 and 5), as well as dietary preferences across species (tab 2: data used for Figure 4), and amylase copy numbers found in available reference genomes (tab 3: data used for Figure 1—figure supplement 1).
    elife-44628-supp1.xlsx (35.3KB, xlsx)
    DOI: 10.7554/eLife.44628.012
    Supplementary file 2. Details of the primer and probe sets that we used in this study.
    elife-44628-supp2.xlsx (10.9KB, xlsx)
    DOI: 10.7554/eLife.44628.013
    Supplementary file 3. Results of our analysis of retrotransposons associated with the amylase gene copies depicted in Figure 2B.

    Figure supplements

    elife-44628-supp3.xlsx (13.1KB, xlsx)
    DOI: 10.7554/eLife.44628.014
    Transparent reporting form
    DOI: 10.7554/eLife.44628.015

    Data Availability Statement

    All data generated (copy numbers and salivary activity information) can be found in the supplementary files and Methods.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES