Abstract
Glycosylated derivatives of natural product polyphenols display a spectrum of biological activities, rendering them critical for both nutritional and pharmacological applications. Their enzymatic synthesis by glycosyltransferases is frequently constrained by the limited repertoire of characterized enzyme-catalyzed transformations. Here, we explore the glycosylation capabilities and substrate preferences of newly identified plant uridine diphosphate (UDP)-dependent glycosyltransferases (UGTs) within the UGT72 and UGT84 families, with particular focus on natural polyphenol glycosylation from UDP-glucose. Four UGTs are classified according to their phylogenetic relationships and reaction products, identifying them as biocatalysts for either glucoside (UGT72 enzymes) or glucose ester (UGT84 members) formation from selected phenylpropanoid compounds. Detailed kinetic evaluations expose the unique attributes of these enzymes, including their specific activities and regio-selectivities towards diverse polyphenolic substrates, with product characterizations validating the capacity of UGT84 family members to perform di-O-glycosylation on flavones. Sequence analysis coupled with structural predictions through AlphaFold reveal an unexpected absence of a conserved threonine residue across all four enzymes, a trait previously linked to pentosyltransferases. This comparative analysis broadens the understood substrate specificity range for UGT72 and UGT84 enzymes, enhancing our understanding of their utility in the production of natural phenolic glycosides. The findings from this in-depth characterization provide valuable insights into the functional versatility of UGT-mediated reactions.
Subject terms: Transferases, Biocatalysis, Glycosides, Biosynthesis, Natural product synthesis
Uridine diphosphate-dependent glycosyltransferases (UGTs) play a vital role in the biocatalytic glycosylation of phenolic compounds, however, UGT-catalyzed transformations remain not well-characterized. Here, the authors investigate new members of UGT72 and UGT84 families, revealing their specific reactivity and regio-selectivity on selected polyphenolic substrates.
Introduction
Plants contain a broad variety of natural phenolics, which serve as important compounds in food and pharmaceutical industry due to their diverse biological activities1–3. Glycosylation presents an effective method for mitigating the challenges posed by low solubility, poor stability, and limited bioavailability of phenolic compounds in their applications4–6. Uridine diphosphate (UDP)-dependent glycosyltransferases (UGTs) play a pivotal role in the natural glycosylation and biosynthesis of various phenolic glycosides by transferring UDP-sugars to acceptor substrates7–10. The lack of well-characterized UGT-catalyzed transformations limits the practical usage of these enzymes, which presents the need for the discovery of new catalytic activities11–13.
Plant UGTs typically possess a 44-amino acid consensus sequence known as plant secondary product glycosyltransferase (PSPG) motif, which serves as a useful tool in the bioinformatic search of putative UGT genes14–16. Based on sequence analysis, numerous UGTs have been classified into different families and larger groups that include several distinct UGT families17. UGTs within the same family display an amino acid sequence identity of greater than 40%, and are assigned with a characteristic numerical designation (e.g., UGT72, UGT84)18.
Among the 18 major phylogenetic groups of plant UGTs (labeled from A to R based on sequence homology), group E UGTs, which consist of UGT71, UGT72, and UGT88 families, have been shown to produce glucosides of phenolic compounds17. As their predicted natural function, UGT72 enzymes play an important role in the biosynthesis of plant lignin and are responsible for the glycosylation of monolignols, encompassing p-coumaryl, coniferyl, and sinapyl alcohols, along with their corresponding aldehydes and acids19–23. Similar to most other UGTs, UGT72 enzymes form a glycosidic linkage (e.g., C−O−C, Fig. 1a) from the phenolic hydroxyl group of the aglycone to the anomeric carbon of the sugar moiety24,25. Conversely, exclusive glucose ester-forming activities toward phenolic acids are exhibited by group L UGTs, which include UGT74, UGT75, and UGT84 family enzymes26. UGT84 enzymes transfer a sugar molecule into the carboxylic acid moiety of sinapic acid, gallic acid, and other phenolic acids, generating ester linkages (C(=O)−O−C, Fig. 1a)27–32.
Members from UGT72 and UGT84 families often manifest O-glycosylation activities toward polyphenolic acceptor substrates (e.g., flavonoids) producing high-value phenolic O-glycosides26,33–37. Despite the widespread individual characterization of UGT72 and UGT84 enzymes concerning various phenolic substrates, a paucity of comparative studies between these two enzyme classes persists. Furthermore, previous investigations have often lacked a comprehensive kinetic analysis of reaction components, presenting challenges in the application of these UGTs for the biosynthesis of natural phenolic glycosides. Thus, a nuanced understanding of the biochemical attributes and potential utilizations of UGT72 and UGT84 enzymes necessitates further exploration and in-depth investigation of their glycosylation activities.
Here, previously unknown plant UGTs were discovered and thoroughly characterized. The enzymes were classified into UGT72 and UGT84 families based on their phylogenetic relationships and product patterns (glucoside vs glucose ester) obtained from representative phenylpropanoid substrates. A comprehensive analysis of the specific activities and regio-selectivities conducted on selected polyphenolic substrate classes (Fig. 1b) revealed distinct family-dependent characteristics of the enzymes. Preparative-scale reactions with product isolations confirmed the di-O-glycosylation potential of UGT84 enzymes on flavones. The AlphaFold-predicted 3D-structures of the discovered UGTs highlight the absence of a conserved threonine for the stabilization of donor glucose C6-OH, a phenomenon previously linked to pentosyltransferases38. This research contributes to the advancement of our comprehension of the natural functions, substrate promiscuity, and interconnections associated with UGT72 and UGT84 enzymes.
Results and discussion
Discovery and phylogenetic analysis of putative UGT72 and UGT84 enzymes
For initiating the search of putative UGT candidates, well-characterized Arabidopsis thaliana enzymes UGT72E319 and UGT84A239 were chosen as representative members from the UGT72 and UGT84 families, respectively. Using these enzymes as benchmarks, a BLAST search was initiated with a stringent criterion set to identify homologous sequences with an amino acid identity exceeding 40%. This search yielded potential UGT72 and UGT84 family members from a diverse array of plant species, six of which were selected for further investigation (Supplementary Table 1). The herein named putative UGTs UGT84A49 (accession: Q2V6K1), UGT84A119 (accession: A0A2N9FYZ7) and J045 (accession: A0A078J045) shared >50% sequence identity with UGT84A2, while UGT72D1 (accession: Q9ZU72), UGT72D7 (accession: A0A067GVI4) and KZ95 (accession: A0A2U1KZ95) were closely related to UGT72E3 (>40% sequence identity, Supplementary Table 2). Among these genes, only the one corresponding to UGT72D1 had previously been linked to glycosyltransferase activity; however, the enzyme had not been subjected to a detailed analysis concerning substrate specificity and regioselectivity25. Despite the high sequence homology between the candidate UGTs and their respective Arabidopsis templates, inter-family sequence identities were notably lower, falling beneath the 30% threshold (Supplementary Table 2). Subsequent multiple sequence alignment revealed the conserved C-terminal PSPG motif for the sugar donor binding (Fig. 2a)14,40,41, confirming the chosen candidates as plant UGTs. Remarkably, distinct conserved family-dependent differences were discovered in the following positions of the PSPG-box: Cys/Ser in position 2 for UGT84 members in comparison to Ala in UGT72, Glu in position 5 for UGT84 enzymes in respect to Val/Ala in UGT72 family, Cys in position 15 of UGT84 enzymes versus Gly in UGT72, and Gln in the position 40 of UGT84 members in comparison to Leu in UGT72 (Fig. 2a). The presence of a conserved Gln in the last position 44 of all the enzymes indicates the preference for UDP-glucose (UDP-Glc) as a donor substrate (Fig. 2a)42.
Phylogenetic analysis of the putative UGTs together with characterized plant UGTs corroborated the taxonomic assignment of UGT72D1, UGT72D7, and KZ95 within the UGT72 family, belonging to the larger phylogenetic group E (Fig. 2b, Supplementary Table 3). Concurrently, UGT84A119, UGT84A49, and J045 were positioned within the UGT84 clade, as anticipated, aligning with group L of the phylogenetic classification (Fig. 2b, Supplementary Table 3).
Characterization of UGT72/84 enzymes with anticipated natural substrates
Heterologous expression of the chosen UGT candidates in E. coli resulted in soluble expression for the proteins UGT84A49, UGT72D1, UGT84A119 and UGT72D7, which were successfully purified utilizing N-terminal 6xHis-tag (Supplementary Fig. 1). While UGTs are classically known as difficult-to-express enzymes accompanied by low yields of isolated protein43, the candidates UGT84A119 and UGT72D1 showed remarkably high soluble expression at 45 mg and 20 mg protein per liter of culture medium, respectively (Supplementary Fig. 1). Despite the optimization of the expression host to E. coli C43(DE3), tailored for the production of challenging (membrane-associated; toxic) proteins44, J045 and KZ95 evaded soluble expression and were thus excluded from the study.
The purified enzymes (0.50 mg mL−1) reacted with sinapic acid (1.0 mM) showed distinct family-dependent product patterns (Fig. 3a) for glycosylation from UDP-glucose (2.0 mM). UGT84-members (UGT84A119, UGT84A49) formed a product in a strictly pH-dependent fashion (Fig. 3b), consistent with the previous studies on glucose-ester forming UGTs operating preferably at acidic pH for a suitable reaction equilibrium26,37. Reactions of UGT84A119 and UGT84A49 at pH 5.0 yielded a glucosylated product, while no conversion was observed at pH 8.0 (Fig. 3b). As anticipated, preparative scale (13.5 mg) reaction with product isolation confirmed the product identity as sinapic acid glucose ester (Fig. 3a, Supplementary Fig. 2). UGT72-enzymes UGT72D1 and UGT72D7 demonstrated the preference for alkaline pH for glycosylation (Fig. 3c), which is consistent with the pH dependence of the equilibrium constant in favor of glycoside formation at basic pH45–48. Product isolation from a preparative (20.5 mg) reaction identified the obtained compound as sinapic acid 4-O-glucoside (Fig. 3a, Supplementary Fig. 3). Time course analysis of the reactions performed at suitable pH (5.0 for UGT84A119, UGT84A49; pH 8.0 for UGT72D1, UGT72D7; Fig. 3d) and enzyme concentration allowed the precise measurement of enzymatic activities. UGT84A119 showed a specific activity of 2.0 U mg−1 (±2%; N = 2) towards sinapic acid, similar to the value reported for the neighboring enzyme UGT84A13 from the same phylogenetic clade (Fig. 2b)49. The activity of UGT84A49 (1.0 U mg−1 ± 1%; N = 2) fell within the same order of magnitude. UGT72D7 catalyzed the glycosylation at the phenolic hydroxyl with specific activity of 0.3 U mg−1 (±6%; N = 2), comparable to a monolignol glycosyltransferase UGT72E3 (~1.2 U mg−1)19 from the same family (Fig. 2b). Since UGT72D1 exhibited significantly lower activity of 55 mU mg−1 (±3%; N = 2) toward sinapic acid, coniferyl aldehyde (1.0 mM) was tested as an alternative sugar acceptor from the phenylpropanoid class (Supplementary Fig. 4a). Both UGT72D1 and UGT72D7 showed strong preference for coniferyl aldehyde over sinapic acid, with notable enzymatic activities of 4.9 U mg−1 (±10%; N = 2) and 1.7 U mg−1 (±6%; N = 2), respectively (Supplementary Fig. 4b). The product was identified after isolation from a preparative-scale (32.3 mg) reaction as coniferyl aldehyde 4-O-glucoside (Supplementary Figs. 5, 4a). While other UGT72 enzymes (e.g., NbUGT72AY150, StUGT72AY250, UGT72E323, Fig. 2b) are also capable of producing coniferyl aldehyde 4-O-glucoside, the activities have typically been lower (<0.2 U mg−1) than those obtained here for UGT72D1 and UGT72D7.
Intriguingly, also the UGT84 enzymes UGT84A119 and UGT84A49 demonstrated modest enzymatic activities of 28 mU mg−1 (±6%; N = 2) and 5.2 mU mg−1 (±4%; N = 2) towards coniferyl aldehyde, respectively (Supplementary Fig. 4b). To the best of our knowledge, this observation marks the first instance of such activity within the UGT84 family. The results suggest that the substrate specificity of UGT84 and UGT72 enzymes may be significantly broader than previously understood. All the reactions performed with sinapic acid and coniferyl aldehyde were additionally analyzed for the formation of UDP, which correlated well with the concentration of glycoside product released (Supplementary Figs. 6, 7). Consequently, we concluded that the enzymes characterized in this study did not exhibit hydrolytic side activity when reacted with phenylpropanoid acceptor substrates.
Exploration of the acceptor scope of UGT72/84 enzymes
To assess the substrate specificity and promiscuity of the discovered UGTs, five characteristic phenolic compounds (Fig. 1b) were chosen to screen for enzymatic activity and resulting product profiles. Specifically, p-hydroxyacetophenone (p-HAP, Fig. 1b) was selected as a representative acetophenone derivative, given the commercial and medicinal significance of both p-HAP and its O-glucoside, picein51,52. From the group of natural product polyphenols, phloretin was chosen as a dihydrochalcone, daidzein as isoflavone, and both apigenin and luteolin as flavones (Fig. 1b).
Through comparative HPLC chromatogram analysis of the reaction mixtures and standard samples, all four UGTs were found capable of catalyzing the formation of 4-O-glucoside of p-HAP (Fig. 4a, Supplementary Figs. 8, 9). Thus, p-HAP emerges as a potential universal acceptor substrate for both UGT72 and UGT84 family members. The specific activities of the enzymes towards p-HAP varied considerably (Table 1, Supplementary Fig. 9), being at least 30-fold lower than those for phenylpropanoid substrates. While UGT84A49, UGT84A119 and UGT72D7 converted p-HAP with activities falling below 32 mU mg−1, UGT72D1 showed a remarkably higher activity of 171 mU mg−1 (Table 1, Supplementary Fig. 9). The results obtained hold considerable interest in the realm of picein biosynthesis. Prior to this study, only one UGT (UGT72E2 from Arabidopsis thaliana, Fig. 2b) has been documented to catalyze the formation of this glycoside, with volumetric activity of 18.4 mU mL−153. Notably, the activity measured for UGT72D1 (85.5 mU mL−1) exceeds the value reported for UGT72E2 by 4.6-fold.
Table 1.
UGT | Activities (mU mg−1, N = 2) | |||||
---|---|---|---|---|---|---|
p-HAP | Phloretin | Apigenin | Luteolin | Daidzein | Activity with other sugar donorsc | |
UGT84A49 |
31.4 ± 0.6 (4-O) |
/b |
34.2 ± 0.8 (7-O) 18.9 ± 0.4 (4′-O) 7.8 ± 0.8 (7,4′-di-O) |
14.7 ± 0.1 (7-O) 27.5 ± 0.1 (4′-O) 10.2 ± 0.6 (7,4′-di-O) |
224 ± 11.7 (7-O) |
38.5 ± 0.1 (Daidzein, UDP-Xyl) |
UGT84A119 |
12.4 ± 0.3 (4-O) |
0.4 ± 1.4 × 10−3 (2′-O) 2.4 ± 1.4 × 10−2 (4′-O) |
64.7 ± 3.0 (7-O) 0.3 ± 2.8 × 10−2 (4′-O) 3.9 ± 0.2 (7,4′-di-O) |
83.1 ± 0.1 (7-O) 5.6 ± 0.3 (3′-O) 4.9 ± 0.4 (4′-O) 1.3 ± 7.0 × 10−2 (7,3′-di-O) 1.9 ± 0.1 (7,4′-di-O) |
171 ± 21.2 (7-O) |
9.6 ± 0.6 (Daidzein, UDP-Gal) 6.5 ± 0.4 (Daidzein, UDP-Xyl) |
UGT72D1 |
171 ± 5.4 (4-O) |
34.7 ± 0.4 (2′-O) 4.3 ± 0.1 (4′-O) |
9.8 ± 0.6 (7-O) 4.5 × 10−2 ± 2.8 × 10−3 (7,4′-di-O) |
14.2 ± 0.6 (7-O) 22.3 ± 0.1 (3′-O) 7.5 × 10−2 ± 3.5 × 10−3 (7,3′-di-O) |
/b |
2.9 ± 0.1 (p-HAP, UDP-Xyl) |
UGT72D7 |
2.4 ± 0.2 (4-O) |
/b | /b |
0.5 ± 4.5 × 10−3 (7-O) 7.2 ± 0.8 (4′-O) 0.1 ± 3.0 × 10−3 (7,4′-di-O) |
/b | /b |
aReactions (100 μL) contained 1.0 mM acceptor substrate, 2.0 mM UDP-sugar and 0.50 mg mL−1 enzyme in potassium phosphate buffer (50 mM, pH 8.0), and were carried out at 30 °C without agitation.
b“/” stands for no products detected in the reaction.
cUDP-Xyl, UDP-Gal and UDP-GlcA were tested. Daidzein was used as acceptor for UGT84A49 and UGT84A119, while p-HAP and luteolin were chosen for UGT72D1 and UGT72D7, respectively.
Although phloretin is commonly recognized as a substrate by UGTs belonging to the phylogenetic groups L and E (Fig. 2b)54,55, our findings indicate that only UGT72D1 and UGT84A119 exhibited reactivity towards this particular acceptor (Fig. 4b). UGT84A49 and UGT72D7 proved inactive independent on the enzyme concentration used, while UGT72D1 showed higher activity (40 mU mg−1) than UGT84A119 (2.7 mU mg−1, Supplementary Fig. 10). Two products were observed in both cases; phlorizin (phloretin 2′-O-glucoside) and herein isolated and characterized phloretin 4′-O-glucoside (Fig. 4b, Supplementary Figs. 11, 12). Notably, the regio-selectivity varied between the enzymes: while the UGT84-member preferred the position 4′-O for glycosylation, the UGT72 enzyme showed clear preference for the position 2′-O (Fig. 4b).
Isoflavones, alongside flavones, constitute another noteworthy group of flavonoids that serve as potential acceptor substrates for UGT72 and UGT84 enzymes. While the UGT72 enzymes (UGT72D1, UGT72D7) were inactive with daidzein, both UGT84 members (UGT84A119, UGT84A49) evidently catalyzed the conversion of daidzein into its 7-O-glucoside daidzin (Fig. 4c). Notably, other members of the UGT84 family, such as PgUGT84A23 from Punica granatum and UGT84F6 from Glycyrrhiza uralensis, have also demonstrated substantial 7-O-glycosylation activity towards isoflavones26,56. The specific activities recorded for UGT84A49 (224 mU mg−1) and UGT84A119 (171 mU mg−1) toward daidzein (Table 1, Supplementary Fig. 13) were in line with the activity of PgUGT84A23 toward an analogous genistein (414 mU mg−1)26. These findings suggest that members of the UGT84 family may be additionally classified as isoflavone 7-O-glycosyltransferases, corroborating both previous and current studies.
Flavones differ structurally from isoflavones in respect to the positioning of the ring B of the flavonoid skeleton (attached to the C2 instead of C3, Fig. 1b)57. Two common flavones, apigenin and luteolin, were chosen for further probing of the acceptor scope of the discovered UGTs. The most interesting reaction characteristics were observed with the UGT84 enzymes, including the potential for di-O-glycosylation. UGT84A49 reacted with apigenin produced two mono-O-glucosides and one di-O-glucoside, confirmed by HPLC-UV (Fig. 5a) and HPLC-UV/MS (Supplementary Fig. 14). Product isolation and subsequent NMR analysis identified the unknown intermediate as apigenin 4′-O-glucoside (Supplementary Figs. 15–17), and the double-glycosylated product as apigenin 7,4′-di-O-glucoside (Supplementary Figs. 18, 19). Similarly, the UGT84A49 reactions with luteolin produced the corresponding 7-O, 4′-O and 7,4′-di-O-glucosides, as identified based on HPLC-UV and NMR analysis (Fig. 5b, Supplementary Figs. 20–22). While the activity of UGT84A49 towards apigenin (61 mU mg−1, Supplementary Fig. 23a, b) and luteolin (51 mU mg−1, Supplementary Fig. 23c, d) was similar, the striking difference arises from the regio-selectivity: while apigenin was preferably glycosylated at position 7-O (34 mU mg−1 towards apigenin 7-O-glucoside, Table 1, Fig. 5c), the reaction with luteolin favored the position 4′-O (28 mU mg−1 towards luteolin 4′-O-glucoside, Table 1, Fig. 5d). The formation of the respective di-O-glucosides was considerably slower (7.8 mU mg−1 towards apigenin 7,4′-di-O-glucoside; 10.2 mU mg−1 towards luteolin 7,4′-di-O-glucoside, Table 1), consistent with previous reports on di-O and di-C-glycosyltransferases capable of catalyzing the serial two-step glycosylation58,59.
While UGT84A119 exhibited comparable enzymatic activity (68 mU mg−1, Supplementary Fig. 24a, b) and similar product pattern (7-O, 4′-O, 7,4′-di-O) with apigenin as observed for UGT84A49, including the preference for the 7-O position (Fig. 5e), its reaction with luteolin was markedly more complex (Fig. 5f). The activity of UGT84A119 (101 mU mg−1, Supplementary Fig. 24c, d) was 2-fold higher relative to UGT84A49, and UGT84A119 demonstrated a pronounced affinity for glycosylation at the 7-O position on ring A (83 mU mg−1 towards luteolin 7-O-glucoside, Table 1). Beyond the glycosides identified in the UGT84A49 reaction, UGT84A119 produced two additional compounds, characterized as luteolin 3′-O-glucoside and luteolin 7,3′-di-O-glucoside, from a combination of HPLC-MS and NMR analysis (Supplementary Figs. 25, 26). Thorough analysis of the apigenin and luteolin time courses (Fig. 5e, f) suggests that the formation of di-O-glucosides predominantly initiates from the corresponding 7-O-glucosides. Several other enzymes within the UGT84 family, e.g., UGT84A68 and UGT84A34, have also demonstrated a propensity for synthesizing 7,4′-di-O-glucosides of flavones58,60.
Contrasting with the UGT84 family members, the UGT72 enzymes exhibited significantly lower activities toward the di-O-glycosylation of flavones (Table 1). UGT72D7 was found to be inactive with apigenin but displayed reactivity with luteolin (8.1 mU mg−1, Supplementary Fig. 27), showing a notable preference for the position 4′-O on ring B (7.2 mU mg−1 toward luteolin 4′-O-glucoside, Table 1, Supplementary Fig. 27). Interestingly, UGT72D1 was capable of glycosylating both apigenin and luteolin, albeit with lower activities (9.8 mU mg−1 and 32 mU mg−1, respectively, Supplementary Fig. 28) relative to the UGT84 enzymes. While the glycosylation of apigenin by UGT72D1 predominantly yielded the 7-O-glucoside product, the reaction with luteolin exhibited a substantial preference for the position 3′-O (22 mU mg−1 toward luteolin 3′-O-glucoside, Table 1, Supplementary Fig. 28). The di-O-glycosylation activities of UGT72D1 and UGT72D7 towards flavones were low (Table 1), with only trace amounts of di-glycosides detected in the product mixtures (Supplementary Figs. 27b, 28b, e).
These findings are significant in demonstrating that the addition of a single hydroxyl group can dramatically alter the preferred glycosylation site from ring A in apigenin to ring B in luteolin, or induce enzymatic activity into a previously non-accepted substrate (UGT72D7 with apigenin/luteolin). The observed phenomenon may be attributed to the effect of luteolin conformers on the pKa values of its phenolic hydroxyl groups, which are conventionally ranked by decreasing acidity as follows: 4′-O > 7-O > 3′-O > 5-O61. The degree of ionization between the hydroxyl groups at positions 4′ and 7 is remarkably similar (pKa ~ 7), whereas the hydroxyl group at position 3′ displays significantly lower acidity (pKa > 10)61. Luteolin has been reported to possess three conformers of the B-ring hydroxyl substituents, with the type of conformer having a profound influence on the order of pKa values61. Notably, the pKa value at position 3′-O is substantially reduced to ~7.5 in one conformer61. This suggests that the UGT72 enzyme UGT72D1 likely engages in specific binding interactions to stabilize this particular luteolin conformation, thereby rendering the hydroxyl group at position 3′ accessible for deprotonation by an active site histidine residue (see Supplementary Fig. 29 for a general UGT reaction mechanism). In contrast, UGT72D7 and the UGT84 family members UGT84A119 and UGT84A49 are hypothesized to utilize relatively nonspecific binding interactions, facilitating glycosylation at positions 4′-O and 7-O, achievable from any of the three luteolin configurations. The case of apigenin diverges due to the presence of a single hydroxyl group in ring B. Recent studies have delineated the pKa values for apigenin’s hydroxyl groups as 7.2 (4′-O), 8.7 (7-O), and 11.8 (5-O), determining the position 4′-O as the most acidic62. This raises curiosity about the preference for position 7-O by all the enzymes, while explaining the complete absence of glycosylation at 5-O. Although pKa values may significantly alter upon enzyme binding63–66, the general order of acidity is anticipated to be preserved in this context. The in-depth time course analysis of the reactions offers valuable insights into the evolving product composition, especially in reactions involving di-O-glycosylation. The UGT84-catalyzed reactions with apigenin culminate in ~70-80% conversion to apigenin 7,4′-di-O-glucoside at the end-point, with apigenin 7-O-glucoside predominating within the first hour of the reaction (Fig. 5c, e). Notably, the UGT84 reactions proceed to complete conversion of apigenin to a mixture of its glycosides (Fig. 5c, e and Supplementary Figs. 23a, 24a), while the UGT72D1 reaction stabilizes at an equilibrium of apigenin (~45%) and apigenin 7-O-glucoside (~55%) at the end-point (Supplementary Fig. 28b). Conversely, for luteolin, all reactions (UGT84A49, UGT84A119, UGT72D7, UGT72D1) advance to complete conversion of the acceptor to its glycosides (Fig. 5d, f and Supplementary Figs. 27a, b and 28d, e), ruling out the potential scenario of enzyme inactivation over prolonged reaction times. Detailed time course analysis further reveals that under the herein applied reaction conditions, the UGT72 and UGT84 reactions with daidzein and p-HAP reach an equilibrium consisting of ~90% and ~75% of the corresponding mono-O-glucoside products, respectively (Supplementary Figs. 9, 13).
The reactions with apigenin, luteolin, daidzein and phloretin demonstrated high consistency between UDP and glycoside formation, further confirming the accuracy of the reaction analysis. The meticulous analysis of UDP-glucose consumption is critical for two reasons: to track the possible donor substrate hydrolysis, an undesired phenomenon commonly linked to glycosyltransferases67, and to verify the accuracy of the HPLC-UV analysis for acceptor to glycoside conversion. Given the low water solubility of the selected acceptor substrates and the potential variance in absorption maxima between the acceptors and their glycosides, the reliability of comparing relative peak areas was consistently validated through the ratio of UDP-glucose to UDP. The reactions with p-HAP showed significant UDP-glucose hydrolysis (Supplementary Fig. 9), likely due to the small acceptor substrate size causing nonspecific binding and the formation of catalytically inactive complexes, which permit donor hydrolysis but prevent glycosyl transfer. Collectively, the herein obtained results highlight the considerable adaptability of the UGT active sites. Particularly, the UGT84 family exhibits substantial flexibility in acceptor binding, adeptly accommodating both flavones and isoflavones in catalytically productive conformations. It is generally understood that UGTs engage acceptor substrates within hydrophobic cavities, which possibly accounts for the observed multi-site specificity (e.g., 4′-O and 7-O glycosylation sites for the same substrate) attributed to positional effects14,68–70.
Investigation of the donor binding site in UGT72/84 enzymes
Unlike acceptor binding, which predominantly involves hydrophobic interactions, donor substrates in UGTs typically bind via specific hydrogen bonding interactions, leading to a notably higher specificity for donor compared to acceptor substrates14,45,69,71. We evaluated UDP-xylose (UDP-Xyl), UDP-galactose (UDP-Gal), and UDP-glucuronic acid (UDP-GlcA) as alternative sugar donors for UGT84A49, UGT84A119, UGT72D1 and UGT72D7. Our findings revealed that all enzymes, except UGT72D7, utilized UDP-xylose, though the activities were largely reduced compared to UDP-glucose (Table 1, Supplementary Figs. 30, 31). UDP-galactose was accepted by UGT84A119, albeit with a significant decrease in activity (Table 1, Supplementary Fig. 32), while UDP-glucuronic acid was not utilized by any enzyme. UGTs active on UDP-glucose generally feature a conserved Gln-Asp/Glu-Trp-Thr amino acid tetrad, interacting with the hydroxyl groups of the donor glucose (Supplementary Fig. 33)24,69,72–75. To elucidate the specific donor binding interactions in the herein characterized enzymes, we employed AlphaFold for quaternary structure prediction. Comparison with the crystal structures of flavonoid O-glycosyltransferase VvGT173 (Supplementary Fig. 33) and di-C-glycosyltransferase GgCGT74 (Fig. 6a) revealed the absence of the Thr residue, which is commonly associated with stabilizing the glucose C6-OH (Fig. 6a, Supplementary Figs. 33, 34). Notably, UGT84A49 possesses an isoleucine (Ile148) at this position, a residue previously linked to xylosyl- and apiosyltransferases76,77. Prior hypotheses suggest that the steric hindrance of isoleucine leads to specificity for UDP-pentose donors, precluding UDP-glucose from fitting into the active site77. Our findings indicate that the donor specificity is more intricate than previously understood, as UGT84A49 displays a strong preference for UDP-glucose over UDP-xylose, despite the presence of isoleucine near the glucose C6-OH. Here, we observed that relying solely on the multiple sequence alignment proved insufficient for precise comparison of residues. In the cases of UGT84A49 and UGT84A119, sequence alignment suggested glutamine in the position close to C6-OH (Gln149 and Gln144, respectively), whereas structural predictions identified the correct candidate as the preceding amino acid in the sequence (Fig. 6b, Supplementary Fig. 34). The universal preference for UDP-glucose among the enzymes indicates that the conserved differences observed in the PSPG motifs of UGT72 and UGT84 members (Fig. 2a, Supplementary Fig. 35) do not play a decisive role in determining donor substrate specificity. This observation is consistent with the findings reported in a recent review78, which suggest that positions other than 2, 5, 15, and 40 of the PSPG box are considered the most critical in defining the donor scope of UGTs. Additional sequence analysis of UGTs from groups E and L suggested that the Gly/Cys residue at position 15 of the PSPG box in UGT72/UGT84 members may be conserved not only within the subfamily but possibly across the entire group, with Gly prevalent in group E and Cys in group L (Supplementary Fig. 36). Similarly, the presence of Ala at position 2 of the PSPG motif in UGT72 enzymes may be a characteristic common to UGTs from group E (Supplementary Fig. 36). Collectively, these findings are of importance for the sequence-based identification and engineering of UGTs tailored for distinct donor substrate preferences.
Conclusions
In conclusion, this study has elucidated the glycosylation capabilities and substrate preferences of novel plant UGTs within the UGT72 and UGT84 families, demonstrating their functional versatility and substrate specificity. The identification and characterization of new UGT enzymes contributes to expanding the repertoire of biocatalysts capable of synthesizing natural polyphenol glycosides from UDP-glucose. Through detailed time course and reaction rate analyses combined with product isolations and characterizations, distinct regio-selectivities and specific activities were revealed towards a variety of polyphenolic substrates. Notably, the herein discovered UGT84 enzymes were characterized by their ability to form glucose esters from phenylpropanoid acceptor substrates, and to catalyze the di-O-glycosylation of flavone acceptors. Conversely, the UGT72-members exclusively facilitated the production of phenylpropanoid O-glucosides and lacked the ability of efficient di-O-glycosylation on flavones. The absence of a conserved threonine residue in the AlphaFold-predicted structures of these enzymes, previously associated with donor glucose stabilization, challenges existing paradigms about structure-function relationships within the UGT family. The intricate specificity and selectivity of UGT-mediated reactions with respect to acceptor substrates represent a challenging area that still resists straightforward prediction by means of bioinformatic and structural analyses. This work broadens our understanding of the natural functions and substrate specificity of UGT72 and UGT84 enzymes, offering valuable insights into their potential for the enzymatic synthesis of natural phenolic glycosides. As the complex mechanisms underlying UGT-mediated glycosylation continue to be unraveled, these findings hold the potential to enhance future biotechnological applications, including the design and engineering of UGTs for specific glycosylation reactions.
Methods
Bioinformatic analysis
A BLAST search was performed with protein sequences of UGT72E319 and UGT84A239 as queries, using the online BLAST tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi)79. The search was conducted against the NCBI nr and Swiss-Prot database, with a cut-off E-value of 1 × 10−5. Multiple sequence alignments were executed using ClustalX software80, and the visual representation of the aligned sequences was generated using ESPript 3.081 or Adobe Acrobat DC software. The phylogenetic tree of characterized plant UGTs (GenBank accession numbers in Supplementary Table 3) was constructed using MEGA 11 software82, employing the neighbor-joining method based on ClustalW multiple sequence alignment. For enhanced visualization, the tree was modified by the online ornament tool (https://itol.embl.de/)83. Structural predictions of the proteins were obtained from AlphaFold DB84,85.
Enzyme production
The codon-optimized synthetic genes of candidate UGTs incorporated in pET-28a expression vector were acquired from GenScript (Germany) and transformed into E. coli BL21(DE3) cells. UGTs were expressed in Terrific Broth (TB) medium supplemented with 2% (v/v) glycerol and purified utilizing an N-terminal 6xHis-tag. The size and purity of UGTs were confirmed by SDS-PAGE (Supplementary Fig. 1). Full details of the expression and purification conditions are given in the Supplementary Methods under “Enzyme production”.
Enzyme activity assays
The specific activity of UGTs towards sinapic acid, coniferyl aldehyde, p-HAP, phloretin, apigenin, luteolin and daidzein was determined in an enzymatic assay containing purified UGT enzyme (0.0050-0.50 mg mL−1), 1.0 mM acceptor substrate and 2.0 mM UDP-sugar. The initial formation rates for each product were calculated from the corresponding time courses. Full details of the activity assays are provided in the Supplementary Methods under “Enzyme activity assays”.
Preparation and isolation of products
All the preparative reactions were performed using either UGT84A119 or UGT72D1 as a biocatalyst. Sinapic acid 4-O-glucoside and sinapic acid glucose ester were purified using silica column chromatography, coniferyl aldehyde 4-O-glucoside was isolated using TLC. Luteolin 3′-O-glucoside, apigenin 4′-O-glucoside, apigenin 7,4′-di-O-glucoside, luteolin 4′-O-glucoside, and luteolin 7,4′-di-O-glucoside were isolated by preparative HPLC. Phloretin 4′-O-glucoside was separated from the other reaction components using C18-reversed phase column chromatography. Full details of the reactions and product isolations are provided in the Supplementary Methods under “Preparation and isolation of products”.
Analytical methods
HPLC-UV
For the separation of UDP-sugars and UDP, a Shimadzu Prominence HPLC-UV system (Shimadzu, Korneuburg, Austria) with Kinetex C18 column (5 µm, 100 Å, 50/150 × 4.6 mm) was employed, using an isocratic method with acetonitrile and tetrabutylammonium bromide (TBAB) buffer (40 mM TBAB, 20 mM K2HPO4/KH2PO4, pH 5.9) as the mobile phase. Detection of UDP-sugars and UDP was achieved by UV at a wavelength of 262 nm. Acceptor substrates and glucoside products were separated using an Agilent 1200 Series HPLC-UV system (Morges, Switzerland) with Kinetex C18 column (3/5 µm, 100 Å, 200/150 × 4.6 mm), using a gradient method of water and acetonitrile (each containing 0.1% (v/v) formic acid) as mobile phase. UV detection for acceptor substrates and glucoside products was performed at wavelength of 288 nm (sinapic acid, coniferyl aldehyde, p-HAP, phloretin, daidzein and corresponding O-glucoside products), 340 nm (sinapic acid and corresponding glucose ester product) or 360 nm (apigenin, luteolin and corresponding O-glucoside products). A summary of the HPLC-UV methods is provided in Supplementary Table 4. The details of the HPLC-UV/MS and preparative HPLC analyses are given in the Supplementary Methods under “Analytical methods–HPLC-UV/MS and Reversed-Phase Preparative HPLC”.
TLC
Thin-layer chromatography (TLC) was used for the separation of coniferyl aldehyde 4-O-glucoside from its reaction mixture. The samples (10 µL) and reference compounds (2 µL; 40 mM UDP, 40 mM UDP-Glc, 50 mM coniferyl aldehyde) were analyzed on silica gel plates (gel 60 F254; Merck, Darmstadt, Germany). The eluent consisting of 1-butanol/acetic acid/deionized water (2:1:1) was used for the separation. Visualization was performed with UV, or DNP-staining consisting of dinitrophenylhydrazine (35% w/v), ethanol (58% v/v), water (24% v/v) and concentrated H2SO4 (18% v/v).
NMR
The identity of the synthesized glycosides was confirmed by NMR. The acquisitions were carried out in deuterium oxide (D2O, 99.96% 2H, Eurisotop, Saint-Aubin Cedex, France), dimethylsulfoxide (DMSO-D6, 99.80% 2H, Eurisotop, Saint-Aubin Cedex, France) or mixtures thereof. The spectra of flavonoid glucosides were acquired on a Jeol JNM-ECZL 500 MHz NMR spectrometer with Royal HFX-probe (automatic tuning and matching). Frequency 1H: 499.7189 MHz; 13C: 125.6544 MHz. The Jeol Delta 6.1 software was used for the measurements. The acquisitions of phenylpropanoid glycosides were carried out on a Bruker AVANCE III 300-MHz spectrometer (Bruker, Rheinstetten, Germany) with an autosampler. The Bruker Topspin 3.5 software was used for the measurements. The spectra were analyzed using MestReNova 16.0 (Mestrelab Research, S.L.) and JASON (Jeol).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
The COMET center acib: Next Generation Bioproduction is funded by BMK, BMAW, SFG, Standortagentur Tirol, Government of Lower Austria und Vienna Business Agency in the framework of COMET—Competence Centers for Excellent Technologies. The COMET-Funding Program is managed by the Austrian Research Promotion Agency FFG.
Author contributions
A.J.E.B., T.L. and B.N., design of study. T.L., enzymatic reactions and phylogenetic analysis. A.J.E.B., sequence- and structural analysis, HPLC-UV/MS analysis. T.L., isolation of phenylpropanoid glucosides. L.K. and A.J.E.B., isolation of flavonoid glucosides. A.J.E.B. interpreted the NMR spectra acquired by H.W., in collaboration with L.K. R.B., supervision and resources. A.J.E.B., B.N., and T.L. wrote the paper. B.N. funding acquisition.
Peer review
Peer review information
Communications Chemistry thanks David Teze and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
All relevant data are reported in the manuscript and in the associated Supplementary Information. The file Supplementary Data 1 contains the source data for Figs. 3, 4 and 5. All data are available from the corresponding author upon request. Sequence information on proteins used within this study can be found on the NCBI database (https://www.ncbi.nlm.nih.gov/) using the accession numbers provided in the text.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Tuo Li, Annika J. E. Borg.
Supplementary information
The online version contains supplementary material available at 10.1038/s42004-024-01231-1.
References
- 1.Gao J, et al. A comprehensive review on natural phenolic compounds as alternatives to in-feed antibiotics. Sci. China Life Sci. 2023;66:1518–1534. doi: 10.1007/s11427-022-2246-4. [DOI] [PubMed] [Google Scholar]
- 2.Sun W, Shahrajabian MH. Therapeutic potential of phenolic compounds in medicinal plants-Natural health products for human health. Molecules. 2023;28:1845. doi: 10.3390/molecules28041845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Singh N, Yadav SS. A review on health benefits of phenolics derived from dietary spices. Curr. Res. Food Sci. 2022;5:1508–1523. doi: 10.1016/j.crfs.2022.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Andreu A, et al. Enzymatic glycosylation strategies in the production of bioactive compounds. Catalysts. 2023;13:1359. doi: 10.3390/catal13101359. [DOI] [Google Scholar]
- 5.Xu L, Qi T, Xu L, Lu L, Xiao M. Recent progress in the enzymatic glycosylation of phenolic compounds. J. Carbohydr. Chem. 2016;35:1–23. doi: 10.1080/07328303.2015.1137580. [DOI] [Google Scholar]
- 6.Lee UJ, Sohng JK, Kim BG, Choi KY. Recent trends in the modification of polyphenolic compounds using hydroxylation and glycosylation. Curr. Opin. Biotechnol. 2023;80:102914. doi: 10.1016/j.copbio.2023.102914. [DOI] [PubMed] [Google Scholar]
- 7.Brazier‐Hicks M, Gershater M, Dixon D, Edwards R. Substrate specificity and safener inducibility of the plant UDP‐glucose‐dependent family 1 glycosyltransferase super‐family. Plant Biotechnol. J. 2018;16:337–348. doi: 10.1111/pbi.12775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yao J, et al. Structure-function relationships in plant UDP-glycosyltransferases. Ind. Crops Prod. 2022;189:115784. doi: 10.1016/j.indcrop.2022.115784. [DOI] [Google Scholar]
- 9.Sirirungruang S, Barnum CR, Tang SN, Shih PM. Plant glycosyltransferases for expanding bioactive glycoside diversity. Nat. Prod. Rep. 2023;40:1170–1180. doi: 10.1039/D2NP00077F. [DOI] [PubMed] [Google Scholar]
- 10.Gharabli H, Della Gala V, Welner DH. The function of UDP-glycosyltransferases in plants and their possible use in crop protection. Biotechnol. Adv. 2023;67:108182. doi: 10.1016/j.biotechadv.2023.108182. [DOI] [PubMed] [Google Scholar]
- 11.Liu Y, et al. pUGTdb: a comprehensive database of plant UDP-dependent glycosyltransferases. Mol. Plant. 2023;16:643–646. doi: 10.1016/j.molp.2023.01.003. [DOI] [PubMed] [Google Scholar]
- 12.Zhang P, Zhang Z, Zhang L, Wang J, Wu C. Glycosyltransferase GT1 family: phylogenetic distribution, substrates coverage, and representative structural features. Comput. Struct. Biotechnol. J. 2020;18:1383–1390. doi: 10.1016/j.csbj.2020.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hoffmann TD, Kurze E, Hoffmann T, Song C, Schwab W. Genome-wide identification of UDP-glycosyltransferases in the tea plant (Camellia sinensis) and their biochemical and physiological functions. Front. Plant Sci. 2023;14:1191625. doi: 10.3389/fpls.2023.1191625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kurze E, et al. Structure-function relationship of terpenoid glycosyltransferases from plants. Nat. Prod. Rep. 2022;39:389–409. doi: 10.1039/D1NP00038A. [DOI] [PubMed] [Google Scholar]
- 15.He B, et al. Glycosyltransferases: mining, engineering and applications in biosynthesis of glycosylated plant natural products. Synth. Syst. Biotechnol. 2022;7:602–620. doi: 10.1016/j.synbio.2022.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rahimi S, et al. Triterpenoid-biosynthetic UDP-glycosyltransferases from plants. Biotechnol. Adv. 2019;37:107394. doi: 10.1016/j.biotechadv.2019.04.016. [DOI] [PubMed] [Google Scholar]
- 17.Wilson AE, Tian L. Phylogenomic analysis of UDP‐dependent glycosyltransferases provides insights into the evolutionary landscape of glycosylation in plant metabolism. Plant J. 2019;100:1273–1288. doi: 10.1111/tpj.14514. [DOI] [PubMed] [Google Scholar]
- 18.Ross J, Li Y, Lim EK, Bowles DJ. Higher plant glycosyltransferases. Genome Biol. 2001;2:1–6. doi: 10.1186/gb-2001-2-2-reviews3004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lim EK, et al. Identification of glucosyltransferase genes involved in sinapate metabolism and lignin synthesis in Arabidopsis. J. Biol. Chem. 2001;276:4344–4349. doi: 10.1074/jbc.M007263200. [DOI] [PubMed] [Google Scholar]
- 20.Speeckaert N, et al. Characterization of the UDP-glycosyltransferase UGT72 family in poplar and identification of genes involved in the glycosylation of monolignols. Int. J. Mol. Sci. 2020;21:5018. doi: 10.3390/ijms21145018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lin JS, et al. UDP‐glycosyltransferase 72B1 catalyzes the glucose conjugation of monolignols and is essential for the normal cell wall lignification in Arabidopsis thaliana. Plant J. 2016;88:26–42. doi: 10.1111/tpj.13229. [DOI] [PubMed] [Google Scholar]
- 22.Lanot A, et al. The glucosyltransferase UGT72E2 is responsible for monolignol 4‐O‐glucoside production in Arabidopsis thaliana. Plant J. 2006;48:286–295. doi: 10.1111/j.1365-313X.2006.02872.x. [DOI] [PubMed] [Google Scholar]
- 23.Lim EK, Jackson RG, Bowles DJ. Identification and characterisation of Arabidopsis glycosyltransferases capable of glucosylating coniferyl aldehyde and sinapyl aldehyde. FEBS Lett. 2005;579:2802–2806. doi: 10.1016/j.febslet.2005.04.016. [DOI] [PubMed] [Google Scholar]
- 24.Aubuchon LN, Schmiederer K, Lee SG. Comparative structural analysis of plant uridine diphosphate (UDP)-dependent glycosyltransferases (UGTs) in plant specialized metabolism: structures of plant UGTs for biosynthesis of steviol glycosides. Phytochem. Rev. 2023;22:385–406. doi: 10.1007/s11101-023-09857-0. [DOI] [Google Scholar]
- 25.Speeckaert N, El Jaziri M, Baucher M, Behr M. UGT72, a major glycosyltransferase family for flavonoid and monolignol homeostasis in plants. Biology. 2022;11:441. doi: 10.3390/biology11030441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wilson AE, et al. Characterization of a UGT84 family glycosyltransferase provides new insights into substrate binding and reactivity of galloylglucose ester-forming UGTs. Biochemistry. 2017;56:6389–6400. doi: 10.1021/acs.biochem.7b00946. [DOI] [PubMed] [Google Scholar]
- 27.Tahara K, Nishiguchi M, Frolov A, Mittasch J, Milkowski C. Identification of UDP glucosyltransferases from the aluminum-resistant tree Eucalyptus camaldulensis forming β-glucogallin, the precursor of hydrolyzable tannins. Phytochemistry. 2018;152:154–161. doi: 10.1016/j.phytochem.2018.05.005. [DOI] [PubMed] [Google Scholar]
- 28.Ye Q, et al. Identification and characterization of glucosyltransferase that forms 1-galloyl-β-d-glucogallin in Canarium album L., a functional fruit rich in hydrolysable tannins. Molecules. 2021;26:4650. doi: 10.3390/molecules26154650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Meißner D, Albert A, Böttcher C, Strack D, Milkowski C. The role of UDP-glucose: hydroxycinnamate glucosyltransferases in phenylpropanoid metabolism and the response to UV-B radiation in Arabidopsis thaliana. Planta. 2008;228:663–674. doi: 10.1007/s00425-008-0768-3. [DOI] [PubMed] [Google Scholar]
- 30.Mittasch J, Mikolajewski S, Breuer F, Strack D, Milkowski C. Genomic microstructure and differential expression of the genes encoding UDP-glucose: sinapate glucosyltransferase (UGT84A9) in oilseed rape (Brassica napus) Theor. Appl. Genet. 2010;120:1485–1500. doi: 10.1007/s00122-010-1270-4. [DOI] [PubMed] [Google Scholar]
- 31.Khater F, et al. Identification and functional characterization of cDNAs coding for hydroxybenzoate/hydroxycinnamate glucosyltransferases co-expressed with genes related to proanthocyanidin biosynthesis. J. Exp. Bot. 2012;63:1201–1214. doi: 10.1093/jxb/err340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lunkenbein S, et al. Cinnamate metabolism in ripening fruit. Characterization of a UDP-glucose: cinnamate glucosyltransferase from strawberry. Plant Physiol. 2006;140:1047–1058. doi: 10.1104/pp.105.074955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Yin Q, et al. Involvement of three putative glucosyltransferases from the UGT72 family in flavonol glucoside/rhamnoside biosynthesis in Lotus japonicus seeds. J. Exp. Bot. 2017;68:597–612. doi: 10.1093/jxb/erw420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Louveau T, et al. Predicting the substrate specificity of a glycosyltransferase implicated in the production of phenolic volatiles in tomato fruit. FEBS J. 2011;278:390–400. doi: 10.1111/j.1742-4658.2010.07962.x. [DOI] [PubMed] [Google Scholar]
- 35.Sun G, et al. Apocarotenoids are allosteric effectors of a dimeric plant glycosyltransferase involved in defense and lignin formation. N. Phytol. 2023;238:2080–2098. doi: 10.1111/nph.18875. [DOI] [PubMed] [Google Scholar]
- 36.Zhao X, et al. Functional analysis of an uridine diphosphate glycosyltransferase involved in the biosynthesis of polyphenolic glucoside in tea plants (Camellia sinensis) J. Agric. Food Chem. 2017;65:10993–11001. doi: 10.1021/acs.jafc.7b04969. [DOI] [PubMed] [Google Scholar]
- 37.Hall D, De Luca V. Mesocarp localization of a bi‐functional resveratrol/hydroxycinnamic acid glucosyltransferase of concord grape (Vitis labrusca) Plant J. 2007;49:579–591. doi: 10.1111/j.1365-313X.2006.02987.x. [DOI] [PubMed] [Google Scholar]
- 38.Osmani SA, Bak S, Møller BL. Substrate specificity of plant UDP-dependent glycosyltransferases predicted from crystal structures and homology modeling. Phytochemistry. 2009;70:325–347. doi: 10.1016/j.phytochem.2008.12.009. [DOI] [PubMed] [Google Scholar]
- 39.Yonekura‐Sakakibara K, et al. Two glycosyltransferases involved in anthocyanin modification delineated by transcriptome independent component analysis in Arabidopsis thaliana. Plant J. 2012;69:154–167. doi: 10.1111/j.1365-313X.2011.04779.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Paquette S, Møller BL, Bak S. On the origin of family 1 plant glycosyltransferases. Phytochemistry. 2003;62:399–413. doi: 10.1016/S0031-9422(02)00558-7. [DOI] [PubMed] [Google Scholar]
- 41.Lu X, Huang L, Scheller HV, Keasling JD. Medicinal terpenoid UDP-glycosyltransferases in plants: recent advances and research strategies. J. Exp. Bot. 2023;74:1343–1357. doi: 10.1093/jxb/erac505. [DOI] [PubMed] [Google Scholar]
- 42.Kubo A, Arai Y, Nagashima S, Yoshikawa T. Alteration of sugar donor specificities of plant glycosyltransferases by a single point mutation. Arch. Biochem. Biophys. 2004;429:198–203. doi: 10.1016/j.abb.2004.06.021. [DOI] [PubMed] [Google Scholar]
- 43.Shu W, et al. Enhanced heterologous production of glycosyltransferase UGT76G1 by co-expression of endogenous prpD and malK in Escherichia coli and its transglycosylation application in production of rebaudioside. Int. J. Mol. Sci. 2020;21:5752. doi: 10.3390/ijms21165752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Gopal GJ, Kumar A. Strategies for the production of recombinant protein in Escherichia coli. Protein J. 2013;32:419–425. doi: 10.1007/s10930-013-9502-5. [DOI] [PubMed] [Google Scholar]
- 45.Lairson LL, Henrissat B, Davies GJ, Withers SG. Glycosyltransferases: structures, functions, and mechanisms. Annu. Rev. Biochem. 2008;77:521–555. doi: 10.1146/annurev.biochem.76.061005.092322. [DOI] [PubMed] [Google Scholar]
- 46.Nidetzky B, Gutmann A, Zhong C. Leloir glycosyltransferases as biocatalysts for chemical production. ACS Catal. 2018;8:6283–6300. doi: 10.1021/acscatal.8b00710. [DOI] [Google Scholar]
- 47.Gutmann A, Krump C, Bungaruang L, Nidetzky B. A two-step O- to C-glycosidic bond rearrangement using complementary glycosyltransferase activities. Chem. Commun. 2014;50:5465–5468. doi: 10.1039/C4CC00536H. [DOI] [PubMed] [Google Scholar]
- 48.Gutmann A, Nidetzky B. Unlocking the potential of leloir glycosyltransferases for applied biocatalysis: efficient synthesis of uridine 5′‐diphosphate‐glucose by sucrose synthase. Adv. Synth. Catal. 2016;358:3600–3609. doi: 10.1002/adsc.201600754. [DOI] [Google Scholar]
- 49.Mittasch J, Böttcher C, Frolova N, Bönn M, Milkowski C. Identification of UGT84A13 as a candidate enzyme for the first committed step of gallotannin biosynthesis in pedunculate oak (Quercus robur) Phytochemistry. 2014;99:44–51. doi: 10.1016/j.phytochem.2013.11.023. [DOI] [PubMed] [Google Scholar]
- 50.Liao J, et al. Subfunctionalization of a monolignol to a phytoalexin glucosyltransferase is accompanied by substrate inhibition. Plant Commun. 2023;4:100506. doi: 10.1016/j.xplc.2022.100506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Huang S, et al. P-hydroxyacetophenone ameliorates alcohol-induced steatosis and oxidative stress via the NF-κB signaling pathway in zebrafish and hepatocytes. Front. Pharmacol. 2020;10:1594. doi: 10.3389/fphar.2019.01594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kesari KK, et al. Plant-derived natural biomolecule picein attenuates menadione induced oxidative stress on neuroblastoma cell mitochondria. Antioxidants. 2020;9:552. doi: 10.3390/antiox9060552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lu L, et al. Establishing biosynthetic pathway for the production of p-hydroxyacetophenone and its glucoside in Escherichia coli. Metab. Eng. 2023;76:110–119. doi: 10.1016/j.ymben.2023.02.001. [DOI] [PubMed] [Google Scholar]
- 54.Wang Y, et al. Biosynthesis of the dihydrochalcone sweetener trilobatin requires phloretin glycosyltransferase2. Plant Physiol. 2020;184:738–752. doi: 10.1104/pp.20.00807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zhang J, et al. Identification and functional analysis of glycosyltransferase catalyzing the synthesis of phlorizin and trilobatin in Lithocarpus polystachyus Rehd. Ind. Crops Prod. 2023;192:116056. doi: 10.1016/j.indcrop.2022.116056. [DOI] [Google Scholar]
- 56.Chen K, et al. Diversity of O-glycosyltransferases contributes to the biosynthesis of flavonoid and triterpenoid glycosides in Glycyrrhiza uralensis. ACS Synth. Biol. 2019;8:1858–1866. doi: 10.1021/acssynbio.9b00171. [DOI] [PubMed] [Google Scholar]
- 57.Verma AK, Pratap R. The biological potential of flavones. Nat. Prod. Rep. 2010;27:1571–1593. doi: 10.1039/c004698c. [DOI] [PubMed] [Google Scholar]
- 58.Wang ZL, et al. Functional characterization, structural basis, and regio-selectivity control of a promiscuous flavonoid 7,4′-di-O-glycosyltransferase from Ziziphus jujuba var. spinosa. Chem. Sci. 2023;14:4418–4425. doi: 10.1039/D2SC06504E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Li T, Borg AJ, Krammer L, Breinbauer R, Nidetzky B. Reaction intensification for biocatalytic production of polyphenolic natural product di‐C‐β‐glucosides. Biotechnol. Bioeng. 2023;120:1506–1520. doi: 10.1002/bit.28354. [DOI] [PubMed] [Google Scholar]
- 60.Okitsu N, Matsui K, Horikawa M, Sugahara K, Tanaka Y. Identification and characterization of novel Nemophila menziesii flavone glucosyltransferases that catalyze biosynthesis of flavone 7,4′-O-diglucoside, a key component of blue metalloanthocyanins. Plant Cell Physiol. 2018;59:2075–2085. doi: 10.1093/pcp/pcy129. [DOI] [PubMed] [Google Scholar]
- 61.Amat A, De Angelis F, Sgamellotti A, Fantacci S. Acid-base chemistry of luteolin and its methyl-ether derivatives: a DFT and ab initio investigation. Chem. Phys. Lett. 2008;462:313–317. doi: 10.1016/j.cplett.2008.07.098. [DOI] [Google Scholar]
- 62.Fuguet E, Ràfols C, Mañé M, Ruiz R, Bosch E. Acidity constants of hydroxyl groups placed in several flavonoids: two flavanones, two flavones and five flavonols. Talanta. 2023;253:124096. doi: 10.1016/j.talanta.2022.124096. [DOI] [Google Scholar]
- 63.Pitsawong W, et al. Tuning of pKa values activates substrates in flavin-dependent aromatic hydroxylases. J. Biol. Chem. 2020;295:3965–3981. doi: 10.1074/jbc.RA119.011884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Toney MD. Carbon acidity in enzyme active sites. Front. Bioeng. Biotechnol. 2019;7:25. doi: 10.3389/fbioe.2019.00025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Eschrich K, van der BOLT FJ, de KOK A, van BERKEL WJ. Role of Tyr201 and Tyr385 in substrate activation by p‐hydroxybenzoate hydroxylase from Pseudomonas fluorescens. Eur. J. Biochem. 1993;216:137–146. doi: 10.1111/j.1432-1033.1993.tb18125.x. [DOI] [PubMed] [Google Scholar]
- 66.Tongsook C, Sucharitakul J, Thotsaporn K, Chaiyen P. Interactions with the substrate phenolic group are essential for hydroxylation by the oxygenase component of p-hydroxyphenylacetate 3-hydroxylase. J. Biol. Chem. 2011;286:44491–44502. doi: 10.1074/jbc.M111.284463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.McArthur JB, Chen X. Glycosyltransferase engineering for carbohydrate synthesis. Biochem. Soc. Trans. 2016;44:129–142. doi: 10.1042/BST20150200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Hofer B. Recent developments in the enzymatic O-glycosylation of flavonoids. Appl. Microbiol. Biotechnol. 2016;100:4269–4281. doi: 10.1007/s00253-016-7465-0. [DOI] [PubMed] [Google Scholar]
- 69.Wang M, Ji Q, Lai B, Liu Y, Mei K. Structure-function and engineering of plant UDP-glycosyltransferase. Comput. Struct. Biotechnol. J. 2023;21:5358–5371. doi: 10.1016/j.csbj.2023.10.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.de Boer RM, et al. Regioselective glycosylation of polyphenols by family 1 glycosyltransferases: experiments and simulations. ACS Omega. 2023;8:46300–46308. doi: 10.1021/acsomega.3c08255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Teze D, et al. O-/N-/S-specificity in glycosyltransferase catalysis: from mechanistic understanding to engineering. ACS Catal. 2021;11:1810–1815. doi: 10.1021/acscatal.0c04171. [DOI] [Google Scholar]
- 72.He JB, et al. Molecular and structural characterization of a promiscuous C‐glycosyltransferase from Trollius chinensis. Angew. Chem. 2019;131:11637–11644. doi: 10.1002/ange.201905505. [DOI] [PubMed] [Google Scholar]
- 73.Offen W, et al. Structure of a flavonoid glucosyltransferase reveals the basis for plant natural product modification. EMBO J. 2006;25:1396–1405. doi: 10.1038/sj.emboj.7600970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Zhang M, et al. Functional characterization and structural basis of an efficient di-C-glycosyltransferase from Glycyrrhiza glabra. J. Am. Chem. Soc. 2020;142:3506–3512. doi: 10.1021/jacs.9b12211. [DOI] [PubMed] [Google Scholar]
- 75.Lethe MCL, Paris V, Wang X, Chan CT. Similarities in structure and function of UDP-glycosyltransferase homologs from human and plants. Int. J. Mol. Sci. 2024;25:2782. doi: 10.3390/ijms25052782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Ohgami S, et al. Volatile glycosylation in tea plants: sequential glycosylations for the biosynthesis of aroma β-primeverosides are catalyzed by two Camellia sinensis glycosyltransferases. Plant Physiol. 2015;168:464–477. doi: 10.1104/pp.15.00403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Yamashita M, et al. The apiosyltransferase celery UGT94AX1 catalyzes the biosynthesis of the flavone glycoside apiin. Plant Physiol. 2023;193:1758–1771. doi: 10.1093/plphys/kiad402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Gharabli H, Welner DH. The sugar donor specificity of plant family 1 glycosyltransferases. Front. Bioeng. Biotechnol. 2024;12:1396268. doi: 10.3389/fbioe.2024.1396268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Stover NA, Cavalcanti AR. Using NCBI BLAST. Curr. Protoc. Essent. Lab. Tech. 2017;14:11. doi: 10.1002/cpet.8. [DOI] [Google Scholar]
- 80.Thompson JD, Gibson TJ, Higgins DG. Multiple sequence alignment using ClustalW and ClustalX. Curr. Protoc. Bioinform. 2003;1:2–3. doi: 10.1002/0471250953.bi0203s00. [DOI] [PubMed] [Google Scholar]
- 81.Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014;42:320–324. doi: 10.1093/nar/gku316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Tamura K, Stecher G, Kumar S. MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 2021;38:3022–3027. doi: 10.1093/molbev/msab120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Letunic I, Bork P. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:293–296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Jumper J, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Varadi M, et al. AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50:439–444. doi: 10.1093/nar/gkab1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are reported in the manuscript and in the associated Supplementary Information. The file Supplementary Data 1 contains the source data for Figs. 3, 4 and 5. All data are available from the corresponding author upon request. Sequence information on proteins used within this study can be found on the NCBI database (https://www.ncbi.nlm.nih.gov/) using the accession numbers provided in the text.