Abstract
In this study, we investigated the role for ancestral functional variation that may be selected upon to generate protein functional shifts using ancestral protein resurrection, statistical tests for positive selection, forward and reverse evolutionary genetics, and enzyme functional assays. Data are presented for three instances of protein functional change in the salicylic acid/benzoic acid/theobromine (SABATH) lineage of plant secondary metabolite-producing enzymes. In each case, we demonstrate that ancestral nonpreferred activities were improved upon in a daughter enzyme after gene duplication, and that these functional shifts were likely coincident with positive selection. Both forward and reverse mutagenesis studies validate the impact of one or a few sites toward increasing activity with ancestrally nonpreferred substrates. In one case, we document the occurrence of an evolutionary reversal of an active site residue that reversed enzyme properties. Furthermore, these studies show that functionally important amino acid replacements result in substrate discrimination as reflected in evolutionary changes in the specificity constant (kcat/KM) for competing substrates, even though adaptive substitutions may affect KM and kcat separately. In total, these results indicate that nonpreferred, or even latent, ancestral protein activities may be coopted at later times to become the primary or preferred protein activities.
Keywords: carboxyl methyltransferase, adaptive protein evolution
The evolution of proteins is responsible in large part for generating the diversity of organismal form and function observed today. As such, understanding the nature of protein functional diversification is a primary goal of evolutionary biologists. In 1970, Ohno's landmark model of protein evolution explained how descendants of single function ancestral proteins may acquire novel functions after gene duplication (1). Subsequent models have explicitly required ancestral proteins to have multiple functions that then are partitioned and/or improved upon after gene duplication. In the cases of duplication–degeneration–complementation (DDC) (2) and escape from adaptive conflict (EAC) (3–6), multiple ancestral functions are selectively maintained in the single progenitor protein but for innovation–amplification–divergence (IAD) (7, 8), one ancestral function is under selection, whereas others are neutral. After gene duplication, loci may evolve neutrally such that ancestral functions are partitioned between daughters (DDC) or selection may improve both ancestral functions (EAC) or only a previously neutral one (IAD). The IAD model of protein functional change is congruent with related ideas arising from protein engineering studies (9–11) in which neutral, promiscuous protein functions can arise under purifying selection for maintenance of primary protein function and then be selected for (10, 12–15). Although a proposed mechanism for functional change has been demonstrated by engineering experiments, the importance of multiple ancestral activities for protein evolution in lineages of naturally occurring enzymes remains unclear.
Discerning the fate of ancestral activities during protein functional shifts is hampered by the fact that ancient proteins are extinct. Ancestral state estimates based on modern-day protein functions (16) can provide insight into ancestral conditions; however, a particularly promising strategy is to resurrect ancestral proteins and directly determine their activities. This paleomolecular approach has uncovered ancestral protein properties and indicated the structural bases of functional evolution in several studies (17–23). To investigate the importance of ancestral protein functions for enzyme functional divergence, we have resurrected and biochemically characterized ancestral enzymes for a group of plant methyltransferases from the salicylic acid/benzoic acid/theobromine (SABATH) gene family that are important for floral fragrance production, pathogen and herbivore defense, and plant development (24–29). The enzymes focused on in this study, salicylic acid carboxyl methyltransferase (SAMT), benzoic/salicylic acid carboxyl methyltransferase (BSMT), and nicotinic acid carboxyl methyltransferase (NAMT), display considerable functional variation even though they are also capable of discrimination among structurally similar substrates (30). To investigate functional change in the SABATH gene family, we used a combination of paleomolecular biology, evolutionary statistics, and forward and reverse evolutionary genetic techniques. Together, these complementary approaches allow insight into ancestral conditions and provide experimental evolutionary tests to understand protein functional diversification.
Results
Modern-Day SABATH Enzymes Have High Activity with Few Substrates and Lesser Activity with Many.
As part of our on-going studies of carboxyl methyltransferases, we determined enzyme activities of SAMT or NAMT from Hoya carnosa (Apocynaceae), Nicotiana suaveolens, Datura wrightii, and Cestrum nocturnum (all Solanaceae) against 18 substrates. All four of these species emit the products of these enzymes, methyl salicylate, methyl benzoate, and/or methyl nicotinate from their flowers (31–33). Fig. 1 shows a comparison of the relative activities of these four enzymes with nine others from Apocynaceae and Solanaceae we have previously characterized (30, 33) (Table S1). It is clear that the substrate profiles of nearly all SAMTs are highly similar to each other and show highest activity with salicylic acid (SA), which they prefer over benzoic acid (BA) by 3- to 10-fold or greater (Fig. 1). Otherwise only 2,3-dihydroxyBA and 2,5-dihydroxyBA are methylated at appreciable levels (>15%). SAMT from Stephanotis floribunda, a close relative of H. carnosa, differs in that it has high relative activity with a large number of different substrates, including SA (33). The SAMT substrate profiles are markedly different from those shown for BSMTs from Solanaceae, which have highest activities with BA, 3- or 4-hydroxyBA, o-anisic acid, or anthranilic acid (Fig. 1). Nearly all BSMTs show much less activity with SA, preferring BA by 2- to 10-fold or greater. Remarkably, the Solanaceae SAMT and BSMT differences have largely been maintained since the lineages diverged from each other at least 50 Mya when the family originated (34). The most enzymatically divergent enzyme shown is NAMT, known only from N. gossei and N. suaveolens, which prefers to methylate nicotinic acid (NA) above all other substrates (Fig. 1 and Table S1) (30).
Fig. 1.
Relative enzyme activities of 13 SAMT, BSMT, and NAMT with 18 substrates arranged by evolutionary relationships. Hoya and Stephanotis are from Apocynaceae, whereas all other species are members of Solanaceae. Activity with the favored substrate was used to normalize all other activities for each enzyme, which range from 0 to 100. Most SAMT show high relative preference for salicylic acid (SA) relative to benzoic acid (BA) and all other substrates. Most BSMT show higher relative preference for BA than SA, although high activities are shown with o-anisic acid and anthranilic acid as well. NAMT exhibits highest preference for nicotinic acid (NA). The gene duplication event that gave rise to the entire lineage of enzymes shown is hypothesized to have occurred at node A (31) (Fig. S1). A gene duplication event at node D occurred early in Solanaceae evolutionary history and resulted in the SAMT and BSMT lineages of that family. A later duplication event is thought to have occurred at node E only within Nicotiana (30). Data for S. floribunda are from ref. 33. Numbered structures are as follow: 1, salicylic acid; 2, benzoic acid (BA); 3, 3-hydroxyBA; 4, 4-hydroxyBA; 5, 2,3-dihydroxyBA; 6, 2,4-dihydroxyBA; 7, 2,5-dihydroxyBA; 8, 2,6-dihydroxyBA; 9, 3,4-dihydroxyBA; 10, 3,5-dihydroxyBA; 11, cinnamic acid; 12, o-coumaric acid; 13, m-coumaric acid; 14, p-coumaric acid; 15, o-anisic acid; 16, anthranilic acid; 17, jasmonic acid; and 18, nicotinic acid.
Assays of Ancestral Enzymes Indicate Three Evolutionary Shifts in Substrate Preference After Gene Duplication.
We investigated the extent of ancestral enzyme divergence after gene duplication with respect to the three substrates used by modern-day enzymes (Fig. 1) for floral scent production and pathogen/herbivore defense: SA, BA, and NA. For each of three lineages, we used three lines of evidence to understand the role of ancestral functional variation for protein divergence. First, in each case, ancestral enzyme activity with a nonpreferred substrate was shown to have become the primary activity in a descendant enzyme after gene duplication. Second, statistical tests indicate historical episodes of positive selection that were concomitant with changes in substrate preference. Third, mutagenesis experiments verified the role of putatively adaptive sites for the functional changes.
Evolution of Functional Change Between Nodes A and B.
First, the lineage of SAMT, BSMT, and NAMT enzymes shown in Fig. 1 is thought to have arisen from the gene duplication event at node A, which separates them from the other functionally diverse members of the family including IAMT, GAMT, FAMT, JMT, and caffeine synthase (Fig. 2 and Fig. S1) (31, 35). Thus, we investigated evolution of substrate preference between node A and B. To determine ancestral activities at node A, we experimentally resurrected six alleles of an ancestral enzyme, ancMT-A, that were estimated from contemporary protein sequences (36), including all functionally characterized SABATH enzymes (Fig. S1). Despite the high levels of sequence divergence within this family of proteins, confidence was high for the reconstructed ancestral amino acid sequence of ancMT-A (mean posterior probability (PP = 0.92; Figs. S2 and S3). Fig. 2 shows that ancMT-A activity with BA was maximal, whereas activity with SA was 4- to 10-fold lower (Fig. 2A). Resurrection of three alleles of ancestral enzyme ancMT-0, for an even more ancient node in the SABATH gene tree (node 0; Figs. S1–S3) also indicates that BA was ancestrally preferred over SA. Assuming this ancestral condition at node A, it is apparent from experimental analysis of six alleles of the resurrected enzyme at node B (ancMT-B) (Figs. S1–S3) that it evolved to have highest activity with the formerly nonpreferred substrate, SA (Fig. 2B).
Fig. 2.
Experimental results used to investigate the fate of ancestral nonpreferred enzyme activities after functional shifts. The tree shown is simplified from that shown in Fig. 1 with the same node labels. (A–G) Nodes for which relative enzyme preference for BA (green), SA (black), and NA (red) for either resurrected enzymes and their forward mutants or reverse mutants of modern-day enzymes (bold) was determined. Mean and SD are shown on the basis of at least two replicate assays. Color shown for lineages indicates the highest ancestral relative activity with a particular substrate. The ancestral enzyme at node A appears to have preferred BA but later evolved to prefer SA as shown for node B. This change in preference was concomitant with positive selection for the replacement of His by Met at position 201 (P < 0.05). The ancestral enzyme at node D had a 20-fold higher relative activity for SA over BA, but after gene duplication, the descendant enzyme at node E evolved a >20-fold higher relative activity for BA over SA. This evolutionary reversal appears to have been concomitant with positive selection for the reverse replacement of Met by His at position 201 (P < 0.05). The ancestral enzyme, ancMT-F, evolved high relative preference for NA from an ancestor that preferred BA (ancMT-E). Subsequently, preference for NA increased even more in NAMT, probably by the replacement of Phe420 by Tyr, although it is not clear whether positive selection was concomitant with this change (P > 0.05).
Second, one of the predictions of most models of protein functional divergence after gene duplication is that positive selection will promote change. As shown between nodes A and B (Fig. 2), a branch-sites statistical analysis (37, 38) indicates that although there likely were >50 changes along this branch, positive selection was associated with substitution at a single active site residue and concomitant with the change in preference toward SA along this lineage [dN/dS = 35; P < 0.05; position 201: His to Met (PP = 0.998)]. This branch has been previously reported to have experienced positive selection (31) and even with the addition of several newly characterized sequences, the statistical signature of adaptive evolution remains.
Third, to experimentally verify the predictions of the statistical analyses of selection between nodes A and B, we forward mutated ancMT-A1 and ancMT-03. In each case, replacement of His201 by Met recapitulated the inferred evolutionary change along branch A–B: both ancestral enzymes changed from preferring BA >4-fold over SA (Fig. 2A) to preferring SA over BA by at least 1.6-fold (Fig. 2B).
Although it could be argued that the change in relative enzyme preference was due to a decrease in activity with BA, rather than an increase with SA, this does not appear to be the case. An investigation of enzyme kinetics of a site-directed reverse mutant M201H Hoya SAMT enzyme revealed increased relative activity with BA compared with wild type (Table S2). Replacement of Met201 by His in H. carnosa SAMT affected the catalytic efficiency of this enzyme for SA because kcat/KM decreased in the mutant by more than fourfold (Table S3). In contrast, kcat/KM for BA is only twofold lower in M201H compared with wild type. Whereas wild type has a kcat/KM for SA that is nearly eight times higher than that for BA, M201H is only approximately three times higher (Table S3). Thus, it appears that one change along lineage A–B promoted increased activity with the ancestrally nonpreferred substrate, SA, and this change was due mostly to an increased kcat.
Evolution of Functional Change Between Nodes D and E.
First, the major enzymatic divergence between Solanaceae SAMT and BSMT appears to have coincided with gene duplication at node D (Fig. 1) (30, 31). Thus, protein functional evolution was investigated between nodes D and E (Fig. 2). At node D, SA appears to have been the preferred substrate, on the basis of assays of six alleles of the resurrected ancestral enzyme, ancMT-D (Figs. S1–S3), whereas in comparison, activity with BA was fourfold less (Fig. 2D). Subsequent to gene duplication at node D, activity with BA evolved to become maximal, whereas activity with SA diminished as indicated by assays of five alleles of the resurrected enzyme ancMT-E (Fig. 2E and Figs. S1–S3). This evolutionary change of relative enzyme preference is the reverse of that observed between nodes A and B described above (Fig. 2 A and B).
Second, a statistical signature of positive selection (dN/dS = 38; P < 0.05) is associated with the increased relative activity with BA that evolved between nodes D and E after gene duplication. Of the >50 likely changes along this branch, the only predicted adaptive site was the active site residue 201, which appears to have reversed from Met back to the ancestral His (PP = 0.99) (compare with amino acid replacement between nodes A and B).
Third, to test for functional relevance of the positively selected site, we recapitulated the evolutionary change of Met201 to His in ancMT-D1. This mutant enzyme changed from showing a 10-fold higher activity with SA relative to BA to showing 3-fold higher activity with previously nonpreferred substrate, BA (Fig. 2E). The importance of this site for activity with BA is strengthened by mutagenesis of His201 back to Met in N. suaveolens BSMT1. This reverse evolutionary mutation resulted in a switch from preference for BA over SA in wild-type BSMT1 to a 1.6-fold preference for SA over BA (Fig. 2D and Table S2).
Finally, it appears that the functional shift in relative enzyme preference between nodes D and E was due to an increase in activity with the previously nonpreferred substrate BA rather than merely decreasing activity with SA, on the basis of kinetic measurements of both forward and reverse mutated enzymes. For ancMT-D, it appears that positive selection promoted increased substrate preference for BA because KM of the mutant M201H for BA decreased nearly 4-fold and kcat increased 5-fold, whereas for SA, KM and kcat decreased 2-fold (Table S3). As a result, kcat/KM for BA increased 20-fold, whereas it did not change markedly for SA. It should be noted that kcat/KM is still higher for SA. For the N. suaveolens BSMT1 reverse mutant enzyme H201M, kcat/KM for BA decreased 14-fold compared with wild type, whereas kcat/KM for SA was essentially the same as wild type (Table S3). The H201M mutant enzyme had a >10-fold higher KM with BA compared with wild type, even though kcat was largely unchanged. Mutant KM with SA was 6-fold higher than wild type but kcat actually increased toward SA (Table S3). These data indicate that changes along lineage D–E resulted in an increased preference toward the ancestrally nonpreferred substrate, BA, apparently due to selection for decreased KM and increased kcat.
Evolution of Functional Change Between Nodes E and F and NAMT.
First, node E is postulated to have undergone gene duplication within Nicotiana (Fig. S2) (30). Thus, enzyme activity evolution was investigated between nodes E and F. Assays of five resurrected alleles of ancMT-F (Figs. S1–S3) indicate that activity with NA was highest for this ancestral enzyme, whereas activity with the ancestrally preferred substrate, BA, was 7- to 30-fold less (Fig. 2F). This high level of activity with NA evolved from an ancestral condition at node E in which BA was preferred and activity with NA was only minimal (Fig. 2E). From the condition at node F, it appears that NAMT evolved to have negligible activity with BA and SA relative to NA (Fig. 2G). Surprisingly, given the ancestral condition at node F, BSMT1 evolved to have higher activity with BA relative to NA, an evolutionary reversal from that observed between nodes E and F (Fig. 2).
Second, although no positive selection was detected during the divergence of NAMT from ancestor F when analyzing all SABATH family members together (Fig. S1) (P > 0.05), an analysis that involved only SAMT lineage members, including a dense sampling from Solanaceae (Fig. S4), did result in an optimal estimate of dN/dS = 64 (Fig. 2), although this was not statistically distinguishable from the null (P > 0.05). Sequence divergence is very low along this branch and thus the test may lack sufficient power to detect statistical significance in this case (39). Nonetheless, in the second statistical analysis assuming the tree in Fig. S4, of approximately eight sites that have changed along this branch, two sites (141 and 402) in the NAMT lineage were predicted to be under positive selection having switched from previously being under purifying selection. No significant signature of positive selection was detected between nodes E and F or from node F to BSMT1.
Third, to test the functional importance of one of the putatively selected sites, forward mutagenesis was performed. Introduction of the F402Y active site mutation into ancMT-F resulted in a further enzyme preference for NA (Fig. 2G) compared with node F (Fig. 2F). However, it is not yet known whether evolution proceeded by increasing enzyme relative preference for NA, decreasing it for BA and SA, or a combination of both. Experiments to ascertain the role of particular amino acid replacements contributing to the enzymatic divergences between nodes E and F and node F and BSMT1 are currently underway.
Discussion
In this family of enzymes, it is clear that ancestral functional variation was refined to generate activity shifts in multiple lineages. These evolutionary patterns of enzyme activity change were uncovered by the resurrection and functional characterization of ancestral proteins in combination with statistical analyses that implicated specific amino acid residues. Subsequent experimental manipulation of these sites demonstrated their respective contributions to functional change. Because SABATH family enzyme functional evolution appears to depend upon the existence of multiple ancestral activities, we attempted to reconcile the theoretical predictions of DDC, EAC, and IAD (2–4, 40, 41) with the results we have reported.
Conflict Appears to Exist Among Some Ancestral Activities.
It appears that in this family of enzymes, neither ancestral nor modern-day enzymes can be equally effective with the physiologically relevant substrates, BA and SA (Figs. 1 and 2). The changes, introduced to the active site residue 201 that improved preference for one substrate, came at the expense of reduced relative preference for the other (Fig. 2, lineages A–B and D–E, and Tables S2 and S3). In fact, relative activities with BA and SA in extant enzymes are negatively related (covariance = −0.331; phylogenetic covariance estimate = −0.168). Assuming the efficient production of MeBA and MeSA is advantageous, adaptive conflict appears to exist between specialization for SA or BA, making the enzyme functional shifts of lineage A–B and D–E potentially best described by the EAC model of protein evolution. After gene duplication, the ancestral enzymes at nodes B and E evolved improved activity with SA and BA, respectively, consistent with EAC. The other daughter enzymes descending from the gene duplication events may also have evolved improvement with an ancestral function because the SAMT enzymes, descended from node D, appear to have evolved increased relative activity for SA to some extent; however, it is currently unknown what activities the other descendants of node A evolved to specialize upon due to uncertainty in the gene tree (Fig. S1). Also consistent with EAC is the finding that a single positively selected codon (amino acid position 201) largely accounts for the change in preference between nodes A and B and D and E; still, it is not clear to what extent positive selection may have acted on the other branches descending from duplication events at nodes A and D because no statistical significance was observed. The finding that a single residue governs enzyme substrate preference is particularly important because it identifies an obvious mechanism by which adaptive conflict could arise and has been implicated for a laboratory study of bacterial protein evolution (42). Because other studies have reported single amino acid switches that interconvert modern-day phenylpropanoid enzyme substrate preferences (43) and product outcome of diterpene- and carotenoid-producing enzymes (44, 45), adaptive conflict may be pervasive, especially for proteins involved in specialized metabolite production. Due to the uncertainty associated with inferring positive selection and determining whether both descendant enzymes improved upon ancestral activities after the gene duplication events at nodes A and D, it remains possible that IAD also describes these divergences, despite the apparent conflicting ancestral activities.
Latent, Low Level Activities Provide Raw Material for Evolutionary Change.
Our results show that low ancestral activity with nonpreferred substrates may be latent for long periods and persist through multiple gene duplication and speciation events, but ultimately these low activities can evolve to become high. Specifically, NA activity was minor in the ancestor of the entire lineage at node A (ca. 100 Mya) and through nodes B–E, yet at node F (<7.5 Mya) (46), a shift occurred such that the ancestral enzyme evolved to prefer NA above all others. So, whereas the ancestral enzymes shown in Fig. 2 vary between having preference for SA or BA, activity with NA does not change appreciably until node F and thus may have been historically neutral with respect to enzyme preference for the other substrates. If so, the functional shifts between node E and F and NAMT in N. gossei likely represent instances of evolution by IAD. In this case, the previously minor activity with NA would serve as an innovation. Amplification appears to have occurred in N. gossei and N. suaveolens after duplications at nodes D and E and because they are allopolyploids providing them with no less than four SAMT/BSMT-like genes that could have potentially provided considerable NA methylation ability in combination (30, 47). Divergence appears to have occurred over multiple time frames after duplication at node E, whereby node F evolved highest activity with NA, and this activity further evolved in the lineage leading to NAMT at node G (Fig. 2). As predicted by IAD, whereas activity with NA was increased in one daughter after duplication at node E, relative activity with BA does not seem to have increased after duplication in descendants of the other daughter of ancMT-E, BSMT2. Although we have identified one site that contributed to increased activity with NA, positive selection was not statistically significant, making DDC potentially explanatory for the results as well.
Limited Constraint on Evolutionary Reversals of Enzyme Activity.
In addition to demonstrating the importance of ancestral promiscuous activities for protein functional evolution, we have documented a clear case of an evolutionary reversal of an active site residue that resulted in a reversal of associated enzyme activity. Initially, after gene duplication resulted in formation of the SAMT lineage at node A, the active site His201 was replaced by Met (Fig. 2 between nodes A and B), resulting in the evolution of enzyme preference for SA (Fig. 2B). Remarkably, after gene duplication at node D, this homologous site in one of the daughter lineages then underwent an evolutionary reversal from Met back to the ancestral His (Fig. 2, between nodes D and E), which resulted in a reversal of enzymatic properties including loss of preference for SA and a return to higher preference for BA (Fig. 2E). Furthermore, whereas the causative amino acid replacements have not yet been identified, it is also clear that a second reversal occurred. First, between nodes E and F, high activity with NA evolved from an ancestor that preferred BA. Subsequently, the ancestral NA-preferring enzyme at node F gave rise to BSMT1, which prefers BA. Previous studies have clearly shown that restrictive epistatic interactions among amino acid sites can constrain evolutionary reversals (19, 48). Because the original change from His201 to Met likely predated the divergence of rosids and asterids (ca. 100–125 Mya) (30) and the reversal occurred after the origin of Solanaceae (ca. 50 Mya), it is surprising that mutations did not accumulate during the ca. 50–75 million-year interval that would constrain this active site reversal and associated functional change. If generalizable, these results indicate that epistasis may not constrain evolutionary changes of enzymes involved in specialized metabolism to the extent it does with other proteins (19, 48).
Our mutagenesis studies of this family of methyltransferases validate the impact of historical changes on shaping enzyme preference. Specifically, selection appears to have promoted directional changes in catalytic efficiency (kcat/KM) for the two competing substrates, salicylic and benzoic acid, even though the adaptive substitutions may affect KM and kcat separately (Table S3). It seems probable that the ability to discriminate between SA and BA is valuable, given that recent studies have shown a central role of MeSA in pathogen (29) and herbivore defense (35); therefore, its efficient production by SAMT is likely advantageous. Likewise, selection for efficient MeBA and MeNA production in Nicotiana is likely related to effective pollinator attraction as both BSMT1 and NAMT are expressed primarily in petal tissue from which the corresponding volatiles are emitted (30). It should be noted that measured substrate preference of these enzymes is not necessarily indicative of the role in planta, because the SAMT orthologous enzymes in Petunia and Stephanotis, which prefer SA in vitro, are used to methylate BA in petals (33, 49). In this case, substrate pool is an important determining factor for phenotype rather than enzyme kinetic properties alone (33, 49). Although it may be adaptive for an enzyme to distinguish among competing substrates as shown for SAMT, BSMT, and NAMT, it seems that variable levels of relative activity with many substrates appear nearly universal in ancestral and extant enzymes and have likely served to facilitate functional diversification throughout the history of this family. The importance of ancestral functional variation for protein evolution may be much broader than previously appreciated. Beyond its obvious role in the evolution of novel activity via protein engineering (9, 11, 50), ligand-binding promiscuity has been reported in ancestral vertebrate steroid receptors, which served as the basis for subsequent natural evolution of novel receptor–ligand interactions (20, 51). Likewise, a chimeric alcohol dehydrogenase from Drosophila, evolved preference for substrates that were nonpreferred in the progenitor enzyme (52, 53).
Materials and Methods
Heterologous Expression and Purification of Enzymes.
The basic protocols used for gene cloning and protein overexpression were performed as previously described (30, 33). Briefly, genes were cloned into expression vectors and overexpression of His6 protein was achieved in BL-21 cells. Purification of the His6-tagged protein was purified by Ni-NTA affinity chromatatography (Qiagen) according to the manufacturer's instructions. To determine protein concentration, a standard Bradford assay was used. For mutagenesis, the QuikChange Site-Directed Mutagenesis kit (Stratagene) was used according to the manufacturer's instructions.
Enzyme Assays.
The purified enzymes were tested for activity with the 18 substrates shown in Fig. 1. Radiochemical assays were performed as previously described (30). The highest enzyme activity reached with a specific substrate was set to 100 and relative activities with remaining substrates were calculated. Each assay was run in duplicate and mean, plus SD, was calculated for Fig. 1. For the ancestral enzyme functional assays, we used the same radiochemical assays described above except that we did not add HCL before extraction of reaction products. All assays shown in Fig. 2 were performed on total protein because relative activity levels were similar between total and purified protein for the two enzymes we compared (ancMT-D and ancMT-D M201H).
Estimation of Michaelis–Menten Parameters.
For kinetic measurements, enzyme assays were performed by varying SA and BA concentrations, whereas SAM and enzyme concentrations were held at saturating levels. All kinetic studies were performed in two independent experiments with incubation times chosen so that reaction velocity was linear. Lineweaver-Burk plots were performed to determine the KM and kcat values as in our previous studies (33).
Statistical and Molecular Evolutionary Analyses.
DNA sequences from all enzymatically characterized SABATH gene family members were obtained from GenBank. Maximum likelihood phylogenetic analyses were performed with PAUP* (54) as previously described (30) assuming the HKY+I+G model of nucleotide substitution as chosen by Modeltest (55) using 10 random addition sequences and Tree-bisection-reconnection swapping. Bootstrapping was performed using 100 replicates. PAML ver. 4.2 (36) was used to test the hypothesis of positive selection in the SABATH gene family using the branch-sites test because it is expected that positive selection should act only on a subset of sites and branches of a gene tree as functional divergence occurs. This test has recently been shown to be robust under a wide range of conditions and is the most powerful test available (38, 56). Analyses were performed multiple times using different starting values of ω and assumed the HKY model of nucleotide substitution. Codeml was used to estimate ancestral enzyme sequences for the SABATH gene family with the HKY model with κ estimated. The γ and invariant parameters cannot be implemented in Codeml under the branch-sites model. Additional details about ancestral sequence estimation and how alternative sites were chosen to assess uncertainty are provided in SI Materials and Methods. The optimal sequences were subsequently synthesized by Genescript with codons chosen for optimal protein expression in Escherichia coli.
Supplementary Material
Acknowledgments
We thank Doug Coulter, Pam Hoppe, Talline Martins, Julie Ryan, John Spitsbergen, and James Kiddle for helpful discussions and assistance; and the Chemistry Department at Western Michigan University for facilitating our GC/MS analyses. This study was supported by National Science Foundation Grants DEB 0344496 and MCB-1120624 and a Faculty Research and Creative Activities Award from Western Michigan University (to T.J.B.) and by Deutsche Forschungsgemeinschaft Pi 153/22 (to B.P.).
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1019605109/-/DCSupplemental.
References
- 1.Ohno S. Evolution by Gene Duplication. New York: Springer; 1970. [Google Scholar]
- 2.Force A, et al. Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999;151:1531–1545. doi: 10.1093/genetics/151.4.1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hughes AL. The evolution of functionally novel proteins after gene duplication. Proc Biol Sci. 1994;256:119–124. doi: 10.1098/rspb.1994.0058. [DOI] [PubMed] [Google Scholar]
- 4.Des Marais DL, Rausher MD. Escape from adaptive conflict after duplication in an anthocyanin pathway gene. Nature. 2008;454:762–765. doi: 10.1038/nature07092. [DOI] [PubMed] [Google Scholar]
- 5.Hittinger CT, Carroll SB. Gene duplication and the adaptive evolution of a classic genetic switch. Nature. 2007;449:677–681. doi: 10.1038/nature06151. [DOI] [PubMed] [Google Scholar]
- 6.Deng C, Cheng C-HC, Ye H, He X, Chen L. Evolution of an antifreeze protein by neofunctionalization under escape from adaptive conflict. Proc Natl Acad Sci USA. 2010;107:21593–21598. doi: 10.1073/pnas.1007883107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bergthorsson U, Andersson DI, Roth JR. Ohno's dilemma: Evolution of new genes under continuous selection. Proc Natl Acad Sci USA. 2007;104:17004–17009. doi: 10.1073/pnas.0707158104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Soo VWC, Hanson-Manful P, Patrick WM. Artificial gene amplification reveals an abundance of promiscuous resistance determinants in Escherichia coli. Proc Natl Acad Sci USA. 2011;108:1484–1489. doi: 10.1073/pnas.1012108108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Aharoni A, et al. The ‘evolvability’ of promiscuous protein functions. Nat Genet. 2005;37:73–76. doi: 10.1038/ng1482. [DOI] [PubMed] [Google Scholar]
- 10.Khersonsky O, Roodveldt C, Tawfik DS. Enzyme promiscuity: Evolutionary and mechanistic aspects. Curr Opin Chem Biol. 2006;10:498–508. doi: 10.1016/j.cbpa.2006.08.011. [DOI] [PubMed] [Google Scholar]
- 11.Bloom JD, Arnold FH. In the light of directed evolution: Pathways of adaptive protein evolution. Proc Natl Acad Sci USA. 2009;106(Suppl 1):9995–10000. doi: 10.1073/pnas.0901522106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Tracewell CA, Arnold FH. Directed enzyme evolution: Climbing fitness peaks one amino acid at a time. Curr Opin Chem Biol. 2009;13:3–9. doi: 10.1016/j.cbpa.2009.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bershtein S, Tawfik DS. Ohno's model revisited: Measuring the frequency of potentially adaptive mutations under various mutational drifts. Mol Biol Evol. 2008;25:2311–2318. doi: 10.1093/molbev/msn174. [DOI] [PubMed] [Google Scholar]
- 14.Bloom JD, Romero PA, Lu Z, Arnold FH. Neutral genetic drift can alter promiscuous protein functions, potentially aiding functional evolution. Biol Direct. 2007;2:17. doi: 10.1186/1745-6150-2-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Peisajovich SG, Tawfik DS. Protein engineers turned evolutionists. Nat Methods. 2007;4:991–994. doi: 10.1038/nmeth1207-991. [DOI] [PubMed] [Google Scholar]
- 16.Pagel M. Inferring the historical patterns of biological evolution. Nature. 1999;401:877–884. doi: 10.1038/44766. [DOI] [PubMed] [Google Scholar]
- 17.Yokoyama S, Tada T, Zhang H, Britt L. Elucidation of phenotypic adaptations: Molecular analyses of dim-light vision proteins in vertebrates. Proc Natl Acad Sci USA. 2008;105:13480–13485. doi: 10.1073/pnas.0802426105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Thomson JM, et al. Resurrecting ancestral alcohol dehydrogenases from yeast. Nat Genet. 2005;37:630–635. doi: 10.1038/ng1553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Harms MJ, Thornton JW. Analyzing protein structure and function using ancestral gene reconstruction. Curr Opin Struct Biol. 2010;20:360–366. doi: 10.1016/j.sbi.2010.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bridgham JT, Carroll SM, Thornton JW. Evolution of hormone-receptor complexity by molecular exploitation. Science. 2006;312:97–101. doi: 10.1126/science.1123348. [DOI] [PubMed] [Google Scholar]
- 21.Zhang J. Parallel adaptive origins of digestive RNases in Asian and African leaf monkeys. Nat Genet. 2006;38:819–823. doi: 10.1038/ng1812. [DOI] [PubMed] [Google Scholar]
- 22.Ortlund EA, Bridgham JT, Redinbo MR, Thornton JW. Crystal structure of an ancient protein: Evolution by conformational epistasis. Science. 2007;317:1544–1548. doi: 10.1126/science.1142819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Field SF, Matz MV. Retracing evolution of red fluorescence in GFP-like proteins from Faviina corals. Mol Biol Evol. 2010;27:225–233. doi: 10.1093/molbev/msp230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.D'Auria JC, Chen F, Pichersky E. In: Recent Advances in Phytochemistry. Romeo JT, editor. Vol 37. Oxford: Elsevier Science; 2003. pp. 253–283. [Google Scholar]
- 25.Murata J, Roepke J, Gordon H, De Luca V. The leaf epidermome of Catharanthus roseus reveals its biochemical specialization. Plant Cell. 2008;20:524–542. doi: 10.1105/tpc.107.056630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ross JR, Nam KH, D'Auria JC, Pichersky E. S-Adenosyl-L-methionine:salicylic acid carboxyl methyltransferase, an enzyme involved in floral scent production and plant defense, represents a new class of plant methyltransferases. Arch Biochem Biophys. 1999;367:9–16. doi: 10.1006/abbi.1999.1255. [DOI] [PubMed] [Google Scholar]
- 27.Seo HS, et al. Jasmonic acid carboxyl methyltransferase: A key enzyme for jasmonate-regulated plant responses. Proc Natl Acad Sci USA. 2001;98:4788–4793. doi: 10.1073/pnas.081557298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Varbanova M, et al. Methylation of gibberellins by Arabidopsis GAMT1 and GAMT2. Plant Cell. 2007;19:32–45. doi: 10.1105/tpc.106.044602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Park S-W, Kaimoyo E, Kumar D, Mosher S, Klessig DF. Methyl salicylate is a critical mobile signal for plant systemic acquired resistance. Science. 2007;318:113–116. doi: 10.1126/science.1147113. [DOI] [PubMed] [Google Scholar]
- 30.Hippauf F, et al. Enzymatic, expression and structural divergences among carboxyl O-methyltransferases after gene duplication and speciation in Nicotiana. Plant Mol Biol. 2010;72:311–330. doi: 10.1007/s11103-009-9572-0. [DOI] [PubMed] [Google Scholar]
- 31.Barkman TJ, Martins TR, Sutton E, Stout JT. Positive selection for single amino acid change promotes substrate discrimination of a plant volatile-producing enzyme. Mol Biol Evol. 2007;24:1320–1329. doi: 10.1093/molbev/msm053. [DOI] [PubMed] [Google Scholar]
- 32.Altenburger R, Matile P. Circadian rhythmicity of fragrance emission in flowers of Hoya carnosa R Br. Planta. 1988;174:248–252. doi: 10.1007/BF00394778. [DOI] [PubMed] [Google Scholar]
- 33.Pott MB, et al. Biochemical and structural characterization of benzenoid carboxyl methyltransferases involved in floral scent production in Stephanotis floribunda and Nicotiana suaveolens. Plant Physiol. 2004;135:1946–1955. doi: 10.1104/pp.104.041806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Paape T, et al. A 15-Myr-old genetic bottleneck. Mol Biol Evol. 2008;25:655–663. doi: 10.1093/molbev/msn016. [DOI] [PubMed] [Google Scholar]
- 35.Ament K, et al. Methyl salicylate production in tomato affects biotic interactions. Plant J. 2010;62:124–134. doi: 10.1111/j.1365-313X.2010.04132.x. [DOI] [PubMed] [Google Scholar]
- 36.Yang ZH. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
- 37.Yang ZH, Nielsen R. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol Biol Evol. 2002;19:908–917. doi: 10.1093/oxfordjournals.molbev.a004148. [DOI] [PubMed] [Google Scholar]
- 38.Yang Z, dos Reis M. Statistical properties of the branch-site test of positive selection. Mol Biol Evol. 2011;28:1217–1228. doi: 10.1093/molbev/msq303. [DOI] [PubMed] [Google Scholar]
- 39.Nozawa M, Suzuki Y, Nei M. Reliabilities of identifying positive selection by the branch-site and the site-prediction methods. Proc Natl Acad Sci USA. 2009;106:6700–6705. doi: 10.1073/pnas.0901855106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Zhang J. Evolution by gene duplication: An update. Trends Ecol Evol. 2003;18:292–298. [Google Scholar]
- 41.Barkman TJ, Zhang J. Evidence for escape from adaptive conflict? Nature. 2009;462:E1–E2. doi: 10.1038/nature08663. discussion E2–E3. [DOI] [PubMed] [Google Scholar]
- 42.McLoughlin SY, Copley SD. A compromise required by gene sharing enables survival: Implications for evolution of new enzyme activities. Proc Natl Acad Sci USA. 2008;105:13497–13502. doi: 10.1073/pnas.0804804105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Gang DR, et al. Characterization of phenylpropene O-methyltransferases from sweet basil: Facile change of substrate specificity and convergent evolution within a plant O-methyltransferase family. Plant Cell. 2002;14:505–519. doi: 10.1105/tpc.010327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Cunningham FX, Jr, Gantt E. One ring or two? Determination of ring number in carotenoids by lycopene epsilon-cyclases. Proc Natl Acad Sci USA. 2001;98:2905–2910. doi: 10.1073/pnas.051618398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Xu M, Wilderman PR, Peters RJ. Following evolution's lead to a single residue switch for diterpene synthase product outcome. Proc Natl Acad Sci USA. 2007;104:7397–7401. doi: 10.1073/pnas.0611454104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wu F, et al. COSII genetic maps of two diploid Nicotiana species provide a detailed picture of synteny with tomato and insights into chromosome evolution in tetraploid N. tabacum. Theor Appl Genet. 2010;120:809–827. doi: 10.1007/s00122-009-1206-z. [DOI] [PubMed] [Google Scholar]
- 47.Chase MW, et al. Molecular systematics, GISH and the origin of hybrid taxa in Nicotiana (Solanaceae) Ann Bot (Lond) 2003;92:107–127. doi: 10.1093/aob/mcg087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Bridgham JT, Ortlund EA, Thornton JW. An epistatic ratchet constrains the direction of glucocorticoid receptor evolution. Nature. 2009;461:515–519. doi: 10.1038/nature08249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Negre F, et al. Regulation of methylbenzoate emission after pollination in snapdragon and petunia flowers. Plant Cell. 2003;15:2992–3006. doi: 10.1105/tpc.016766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Arnold FH. Cold Spring Harbor Symposia on Quantitative Biology. Vol LXXIV. Cold Spring Harbor, NY: Cold Spring Harbor Lab Press; 2009. pp. 41–46. [Google Scholar]
- 51.Carroll SM, Bridgham JT, Thornton JW. Evolution of hormone signaling in elasmobranchs by exploitation of promiscuous receptors. Mol Biol Evol. 2008;25:2643–2652. doi: 10.1093/molbev/msn204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhang J, Dean AM, Brunet F, Long M. Evolving protein functional diversity in new genes of Drosophila. Proc Natl Acad Sci USA. 2004;101:16246–16250. doi: 10.1073/pnas.0407066101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zhang J, Yang H, Long M, Li L, Dean AM. Evolution of enzymatic activities of testis-specific short-chain dehydrogenase/reductase in Drosophila. J Mol Evol. 2010;71:241–249. doi: 10.1007/s00239-010-9384-5. [DOI] [PubMed] [Google Scholar]
- 54.Swofford DL. PAUP*: Phylogenetic Analysis Using Parsimony (*and Other Methods) Version 4. Sunderland, MA: Sinauer; 2003. [Google Scholar]
- 55.Posada D, Crandall KA. MODELTEST: Testing the model of DNA substitution. Bioinformatics. 1998;14:817–818. doi: 10.1093/bioinformatics/14.9.817. [DOI] [PubMed] [Google Scholar]
- 56.Zhang JZ, Nielsen R, Yang ZH. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005;22:2472–2479. doi: 10.1093/molbev/msi237. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.