Abstract
The essential oils of ginger (Zingiber officinale) and turmeric (Curcuma longa) contain a large variety of terpenoids, some of which possess anticancer, antiulcer, and antioxidant properties. Despite their importance, only four terpene synthases have been identified from the Zingiberaceae family: (+)-germacrene D synthase and (S)-β-bisabolene synthase from ginger rhizome, and α-humulene synthase and β-eudesmol synthase from shampoo ginger (Zingiber zerumbet) rhizome. We report the identification of 25 mono- and 18 sesquiterpene synthases from ginger and turmeric, with 13 and 11, respectively, being functionally characterized. Novel terpene synthases, (−)-caryolan-1-ol synthase and α-zingiberene/β-sesquiphellandrene synthase, which is responsible for formation of the major sesquiterpenoids in ginger and turmeric rhizomes, were also discovered. These suites of enzymes are responsible for formation of the majority of the terpenoids present in these two plants. Structures of several were modeled, and a comparison of sets of paralogs suggests how the terpene synthases in ginger and turmeric evolved. The most abundant and most important sesquiterpenoids in turmeric rhizomes, (+)-α-turmerone and (+)-β-turmerone, are produced from (−)-α-zingiberene and (−)-β-sesquiphellandrene, respectively, via α-zingiberene/β-sesquiphellandrene oxidase and a still unidentified dehydrogenase.
Introduction
Ginger (Zingiber officinale Rosc.) and turmeric (Curcuma longa L.) have been used for centuries to treat human ailments. Ginger is effective against symptoms of the common cold, fever, rheumatic disorders, gastrointestinal complications, motion sickness, diabetes, and cancer [1]. It has anti-bacterial [2] and anti-fungal [3] activities. Many of these medicinal activities including anti-cancer and anti-inflammatory [4] are believed to be due to the presence of active phenolic compounds such as the gingerols, paradols and shogaols [5], [6], [7]. However, terpenoids from ginger have also been reported to have important human health roles. For example, β-elemene arrests the cell cycle and induces apoptotic cell death in lung cancer cells [8] and elemene aids patients with chylothorax [9]. Zingiberene as well as [6]-gingerol significantly inhibited gastric lesions [10] and further research revealed (−)-β-sesquiphellandrene, β -bisabolene, ar-curcumene and 6-shogaol as anti-ulcer active principles in ginger [11]. Turmeric has anti-inflammatory [12] and anti-cancer [13] properties, which have been reported to be mainly due to the presence of curcumin, a diarylheptanoid. On the other hand, turmeric oil, reported to contain ar-turmerone, turmerone and curlone, showed antioxidant effects and anti-mutagenic action [14]. Turmeric oil, consisting largely of terpenoids, also has anti-bacterial activity [15]. Both curcuminoids and sesquiterpenoids in turmeric exhibit hypoglycemic effects via peroxisome proliferator-activated receptor-γ (PPAR-γ) activation and suppress an increase in blood glucose levels in type 2 diabetic KK-Ay mice. The effect was synergistic when both curcuminoids and sesquiterpenoids in turmeric were applied together [16]. ar-Turmerone from turmeric oil displays anti-tumorigenesis activity, inhibiting cell proliferation and activating ar-turmerone-mediated apoptotic protein in human lymphoma U937 cells [17]. It was also found that apoptosis was selectively induced by ar-turmerone in human leukemia Molt 4B and HL-60 cells, but not in human stomach cancer KATO III cells. ar-Turmerone also has antiplatelet activities that can prevent and treat arteriol thrombosis [18].
Despite the importance of the sesquiterpenoids from these two plants to human health, the enzymes involved in their formation have not been previously identified. Terpene synthases (TPSs) are the entry point and perhaps most important type of enzymes leading to the various subclasses of terpenoids in these plants. However, only four sesquiterpene synthases (STPSs) have been characterized from the Zingiberaceae: (+)-germacrene D synthase [19] and (S)-β-bisabolene synthase [20] from culinary ginger (Zingiber officinale Rosc.) rhizome, and α-humulene synthase [21] and β-eudesmol synthase [22] from shampoo ginger (Z. zerumbet Smith) rhizome. These enzymes do not account for the major compounds produced in these species. For example, both ginger and turmeric produce large amounts of (−)-α-zingiberene and (−)-β-sesquiphellandrene. Turmeric also synthesizes appreciable quantities of α-turmerone and β-turmerone (Figure 1), which are also sometimes called tumerone and curlone, respectively [14], [23]. These compounds are not the direct or downstream products of the four reported TPSs. In this report, we describe the identification and characterization of a large suite of TPS enzymes involved in the formation of the large array of terpenoids found in these plants, and elucidate the means by which the sesquiterpenoids α-turmerone and β-turmerone are formed in turmeric.
Materials and Methods
Plant Material
Ginger (Zingiber officinale Rosc.) and turmeric (Curcuma longa L.) were grown in a greenhouse for 5 to 7 months. Two varieties of culinary ginger (white ginger and yellow ginger) were used, which are different from the white and yellow “gingers” (not Zingiber species at all) that have white and yellow flowers, respectively. The white and yellow gingers used in this study are culinary and medicinal varieties of ginger that have green inflorescences and morphologically are very similar to each other except that they have slightly different rhizome colors. Hawaiian red turmeric (HRT) was used for cloning genes and “fat mild orange” (FMO) turmeric was used to for GC/MS analysis, as in Figure 1. These two “varieties” are really the same clonal line that was obtained from two different organic ginger growers in Hawaii, respectively, Dean Pinner from Pinner Creek Organics and Hugh Johnson, from Puna Organics. The GC/MS total ion chromatograms of FMO and HRT are essentially identical.
Cloning of Full Length cDNAs
Some unitrans identified in the ginger and turmeric EST databases were homologous to TPSs and appeared to be full length, although others were incomplete. For those genes missing either/both the 5′ and/or the 3′ end sequences, the SMART RACE (Rapid Amplification of cDNA End) method (Clontech) was used to find the missing 5′ or/and 3′ end(s) except for the unitrans ST00. 5′ RACE ready cDNAs and 3′ RACE ready cDNAs were synthesized from 8 different total RNAs (GW-Rh, GW-R, GW-L, GY-Rh, GY-R, GY-L, T-Rh and T-L, where GW: white ginger, GY: yellow ginger, T: turmeric, Rh: rhizome, R: root, L: leaf) extracted with the RNeasy Plant Mini kit (Qiagen) using superscript III reverse transcriptase (Invitrogen) for 3′ RACE ready cDNAs and superscript II reverse transcriptase (Invitrogen) for 5′ RACE ready cDNAs respectively. 3′ RACE CDS (AAGCAGTGGTATCAACGCAGAGTAC(T)30VN), 5′ RACE CDS ((T)25VN), and SMART II™ A Oligonucleotide (AAGCAGTGGTATCAACGCAGAGTACGCGGG) were used for RACE ready cDNA synthesis according to the manufacturer’s protocol. Amplifications of either the 3′ or 5′ end were carried out using the Advantage 2 PCR kit (Clontech) with Universal Primer A Mix (UPM, Long: CTAATACGACTCACTATAGGGCAAGCAGTGGTATCAACGCAGAGT, Short: CTAATACGACTCACTATAGGGC) and gene specific primers for RACE (Table S1). Using RACE products, a second round of PCR was done with gene specific nested primers for RACE (Table S1) and either N-UP (5′-AAGCAGTGGTATCAACGCAGAGT-3′), UP-M (5′-CACTATAGGGCAAGCAGTGGT-3′), or UP-S (5′-CTAATACGACTCACTATAGGGC-3′). PCR products of the expected estimated size were eluted using the MinElute PCR purification kit (Qiagen) and inserted into the pCR2.1-TOPO vector (Invitrogen). For ST00, we found the 5′ end by a different method, using the ZO__Ed (T-Rh) cDNA library stock. Using Zd01L13RR, a ST00 specific primer (Table S2), and UPM-T7 (5′-CTAATACGACTCACTATAGGGCGTAATACGACTCACTATAGGGCGAATTG-3′), a cloning vector specific primer, we amplified a ST00 specific fragment, which was sub-cloned as described above.
After sequencing and confirmation of 5′ and/or 3′ sequences, full length cDNAs were amplified from either 5′ RACE ready cDNA or 3′ RACE ready cDNA using Pfu thermostable polymerase with two gene-specific primer sets (Table S2) and were inserted into the pCR2.1-TOPO vector. In this manner, full length mono- and sesquiterpene synthases were cloned into the pCR2.1-TOPO vector. Truncated monoterpene synthases (with plastidial peptides removed) and full length sesquiterpene synthases were sub-cloned into expression vectors such as pCRT7/CT-TOPO vector (Invitrogen), pEXP5/CT-TOPO vector (Invitrogen), pET101/D-TOPO vector (Invitrogen) and/or pH9GW vector for expression in E. coli. For pH9GW-MT00, the PCR product produced using the 10N21GtwyAttF and 10N21G-AttR primers (Table S2) was first cloned using the Gateway BP reaction by Gateway BP clonase II enzyme mix (Invitrogen) with pDONR207 vector (Invitrogen) to yield pENTR207-10N21, and then the pENTR207-10N21-based Gateway LR reaction was performed using Gateway LR clonase II enzyme mix (Invitrogen) with pH9GW. For pH9GW-Zc05I02tt, the PCR product with Zc05I02tFt and Zc05I02tR primers (Table S2) was first inserted into pENTR/D-TOPO vector (Invitrogen) to produce pENTR-Zc05I02tt vector, which was used to produce pH9GW-Zc05I02tt using the Gateway LR reaction with Gateway LR clonase II enzyme mix and pH9GW. There are two versions of pCRT7CT-MT11: with His-tag or without His-tag at the 3′ end due to absence or presence of a stop codon in gene specific reverse primers, Zc07C01CT-R and Zc07C01tR, respectively (Table S2). Several genes were also cloned into the pESC-URA vector (Stratagene). PCR products amplified by appropriate primer pairs (Table S2) were sub-cloned into pCR8/GW-TOPO (Invitrogen) and fragments produced by digestion of the resulting constructs with BamHI (NEB) and XmaI (NEB) were sub-cloned into pESC-URA vector digested with BamHI and XmaI.
Expression of Terpene Synthases in E. coli and Yeast
We used several E. coli strains for expression of terpene synthases; BL21 (DE3) pLysS (Invitrogen), BL21 CodonPlus (DE3) RIL (Stratagene), BL21 CodonPlus (DE3) RP (Stratagene), BL21 CodonPlus (DE3) RILP (Stratagene), Rosetta (Novagen), Rosetta2 (DE3) pLysS (Novagen), ArcticExpress (DE3) RIL (Stratagene), BL21-AI (Invitrogen), BL21-AI RIL, BL21 Star (DE3) (Invitrogen), BL21 Star (DE3) RIL and BL21 Star (DE3) pMevT pMBI RIL. The E. coli strain, BL21-AI RIL is BL21-AI plus the RIL plasmid from BL21 CodonPlus (DE3) RIL cells. The E. coli strain, BL21 Star (DE3) RIL is BL21 Star (DE3) plus the RIL plasmid. The E. coli strain, BL21 Star (DE3) pMevT pMBI RIL is BL21 Star (DE3) (Invitrogen) plus three additional plasmids, pMevT, pMBI and RIL, where pMevT and pMBI are from the Keasling lab [24] and RIL is from BL21 CodonPlus (DE3) RILP. The plasmids, pMevT and pMBI enrich IPP and DMAPP pools inside the E. coli cells.
E. coli was grown at 37°C to OD 0.6 at 600 nm and induced for 18 h at 18°C with IPTG (0.05 mM ∼ 1 mM) and 0.2% arabinose for BL21-AI derived strains.
We used several yeast strains for terpene synthase expression; INVSc1 (Invitrogen), EPY219 and EPY224, which are from the Keasling lab [25]. EPY219 and EPY224 are engineered to produce more GPP and FPP, respectively. Although EPY219 and EPY224 contain the pRS425ADS plasmid, growth in YPD media led to loss of this plasmid and EPY219 and EPY224 in this investigation did not contain the pRS425ADS plasmid. Yeast cells grown for 2 d at 30°C in SD-URA media were transferred to NB-URA media and grown at 18°C for 2–8 d.
Enzyme Assays of Terpene Synthases Expressed in E. coli
Overnight grown E. coli cultures that had been induced to express recombinant proteins were centrifuged to collect cell pellets. The pellets were vortexed with Washing Buffer (20 mM Tris-HCl, pH 7.0, 50 mM KCl) and then centrifuged. Protein Extraction Buffer (50 mM 3-(N-morpholino)-2-hydroxypropanesulfonic acid, pH 7.0, 10% [v/v] glycerol, 5 mM MgCl2, 5 mM DTT, 5 mM sodium ascorbate, 0.5 mM phenylmethylsulfonyl fluoride) was added to washed E. coli pellets, which were then vortexed, sonicated and centrifuged. Supernatant was recovered and the buffer was changed to Enzyme Assay Buffer (10 mM 3-(N-morpholino)-2-hydroxypropanesulfonic acid, pH 7.0, 10% [v/v] glycerol, 1 mM DTT) using PD-10 columns (GE Healthcare Life Sciences). Divalent cations (20 mM MgCl2 and/or 0.5 mM MnCl2 at final concentration), phosphatase inhibitors (0.2 mM NaWO4, 0.1 mM NaF at final concentration) and either geranyl diphosphate (GPP, 10 µg) or farnesyl diphosphate (FPP, 10 µg) were added to total 500 µl of Enzyme Assay Buffer containing soluble proteins and incubated for 3 h at 30°C with 200 µl of top layered pentane. The top pentane phase was removed directly at the end of the assay time (or vortexed with the aqueous phase and then centrifuged prior to removal) and was used for metabolite analysis.
Terpenoid Analysis
A Thermo Finnigan Trace GC 2000 with a Rtx-5MS w/5 m Integra-Guard Column (Restek, 0.25 mm ID, 0.25 µm df, 30 m) coupled to a DSQ mass spectrometer was used for gas chromatography/mass spectrometry (GC/MS) analysis, using methods previously described [26]. A chiral column, Rt-βDEXse (Restek, 0.25 mm ID, 0.25 µm df, 30 m), was used for determining enantiomers of linalool and caryolan-1-ol. Eluted compounds were identified by comparison of resulting mass spectra to the NIST/EPA/NIH Mass Spectral Library (NIST 02) and the essential oil GC/MS mass spectra library from Dr. Robert P. Adams [27]. For peak identification, we used both mass spectra similarity and peak retention time indices unless we specify the use of authentic standards. Adams’ essential oil library has a retention time index. An example of how the retention time index was used with mass spectra similarity for peak identification is shown in Figure S1. Unless an authentic standard was used, all identifications should be viewed as tentative, although we believe that for the important compounds discussed in this manuscript, they are indeed correct.
Western Blot Analysis for Terpene Synthases Expressed in Yeast
Proteins from yeast expressing terpene synthases were extracted using acid-washed glass beads (425–600 µm, 30–40 U.S. sieve) (Sigma). Yeast cell pellets from 10 to 15 ml of media were vortexed 15 times for 30 seconds on ice with 0.5 g of acid-washed glass beads and Protein Extraction Buffer (see above). Protein concentrations were determined by the Bradford Protein Assay (Bio-Rad). After 10 µg of total and soluble proteins were run on SDS-PAGE, the gel was blotted onto PVDF Transfer Membrane (0.45 µm, Thermo Pierce) with transfer buffer (12 mM Tris, 96 mM Glycine, 20% (v/v) MtOH, pH 8.3) in a western blot transfer apparatus according to the protocol of the PVDF Transfer Membrane manual. After treatment with blocking solution (TBS containing 5% [w/v] non-fat milk powder), membranes were incubated with blocking solution containing Mouse Anti-c-Myc-tag Monoclonal Antibody (Genscript) for 2 h, washed twice with TBS-T (TBS containing 0.1% (v/v) Tween-20) for 30 min, incubated with blocking solution containing Goat Anti-Mouse IgG (H&L) [HRP] Polyclonal Antibody (Genscript) for 2 h and washed twice with TBS-T for 30 min. The ECL system (SuperSignal West Pico Chemiluminescent Substrate, Thermo Pierce) was used to detect expressed terpene synthases in the yeast. For positive control we used the Multiple Tag (Purified) (Genscript).
Protein Structural Modeling
SWISS-MODEL [28] was used to model the putative protein structures and UCSF Chimera [28] was used to visualize the models.
Results
Cloning and Expression of Ginger and Turmeric TPSs
In efforts to identify how the large array of mono- and sesquiterpenoids (Figure 1) in ginger and turmeric are produced, we first searched a database of 50,139 expressed sequence tags (ESTs) from ginger and turmeric tissues (http://www.agcol.arizona.edu/cgi-bin/pave/GT/index.cgi), and identified many putative terpene synthases (TPSs) that would be expected to catalyze the formation of monoterpenoids (20 unique transcripts [unitrans]), sesquiterpenoids (10 unitrans), diterpenoids (2 unitrans), triterpenoids (3 unitrans), and tetraterpenoids (10 unitrans). Two monoterpene synthases (MTPSs) and no sesquiterpene synthase (STPS) were represented by full length sequences in the database. The rest of the identified TPS genes either required RACE or Genome Walking followed by RT-PCR to obtain full length cDNAs or did not yield full length clones after these efforts (Table S3). During the cloning process, we sequenced several independent clones for each unitrans and found that some exist as multiple paralogs and/or alleles in these species. In some cases, only the paralog(s) could be cloned, whereas the gene represented by the sequence in the original database could not. Because we often found more than two sequences during the RT-PCR-based cloning, we are confident that most of these are indeed paralogs, and not merely allelic pairs. Turmeric is sterile and likely a nonaploid (x = 7, 2n = 9x = 63) [29], thus providing an explanation for why so many related sequences could be found in one species.
A similarity tree generated from a large set of plant TPSs and ginger and turmeric TPS genes including full length genes has one MTPS cluster and one STPS cluster (Figure 2). One MTPS, called MT00, is a linalool/nerolidol synthase and the outgroup relative to other MTSs. All of the ginger and turmeric TPS proteins possess the conserved DDXXD motif required for interaction with the diphosphate group of the substrate (Figure S2). All MTPSs, except for MT00, have a transit peptide and the conserved RRX8W motif in the 5′ region. Most STPSs have the RX9W motif except for ST01, β-selinene synthase. Although MT00 does not possess the conserved tryptophan of the RRX8W motif, ST01 does.
The major products produced by ginger and turmeric TPSs are summarized in Table 1 and 2. When provided the alternative substrate in vitro, some MTPSs produce sesquiterpenes whereas some STPSs synthesize monoterpenes. However, based on the results outlined below, it is likely that such functions have little or no relevance to production of these compounds in vivo in most cases. Although several of the proteins expressed well in E. coli, many required expression in yeast in order to produce soluble and functional enzymes. More detailed descriptions of expression and analysis of many specific TPS genes is included in Results S1.
Table 1. Monoterpenes produced by ginger and turmeric terpene synthases.
MT00 | MT03 | MT04 | MT06/ MT06A | MT06B | MT07 | MT08 | MT09A2 | MT11 | MT12A-M2 | MT17A2 | ST00A/ ST00B | ST02B | ST02C | ST03 | |
tricyclene | 0.7 | 1.5 | 1.2 | ||||||||||||
α-thujene (3-thujene) | 3.4 | ||||||||||||||
α-pinene | 60.1 | 40.4 | 11.7 | 20.4 | 4.4 | 28.4 | 33.7 | ||||||||
camphene | 45.4 | 60.1 | 55.8 | ||||||||||||
sabinene (4(10)-thujene) | 2.7 | 29.0 | 6.2 | ||||||||||||
β-pinene | 30.7 | 1.0 | 0.6 | 3.7 | 0.7 | 5.9 | |||||||||
myrcene | 25.0 | 30.4 | 27.3 | ||||||||||||
α-phellandrene | 92.2 | 2.7 | 5.0 | ||||||||||||
3-carene | 1.2 | ||||||||||||||
α-terpinene | 1.0 | 8.2 | 4.3 | ||||||||||||
limonene | 5.6 | 6.6 | 1.5 | 7.7 | 7.9 | 15.5 | 17.9 | 2.7 | |||||||
β-phellandrene | 3.8 | 88.3 | 49.2 | ||||||||||||
1,8-cineole (eucalyptol) | 0.9 | 75.4 | |||||||||||||
(Z)-β-ocimene | 14.3 | 13.5 | 17.6 | ||||||||||||
(E)-β-ocimene | 9.8 | 9.3 | 3.5 | ||||||||||||
γ-terpinene | 2.5 | 9.0 | 0.4 | 0.1 | |||||||||||
cis-sabinene hydrate | 6.2 | 0.4 | 0.4 | 0.1 | |||||||||||
p-mentha-1,4(8)-diene (terpinolene) | 0.5 | 5.2 | 1.5 | 89.9 | 1.6 | 1.9 | 7.7 | 9.9 | 5.2 | ||||||
linalool | 100 | 24.8 | 100 | 25.1 | 15.6 | 36.8 | |||||||||
cis-p-menth-2-en-1-ol | 3.8 | ||||||||||||||
trans-pinene hydrate (trans-pinan-2-ol) | 0.3 | 0.3 | 0.3 | ||||||||||||
β-citronellal | 0.9 | ||||||||||||||
borneol (endo-borneol) | 3.3 | 6.1 | 3.3 | ||||||||||||
p-menth-1-en-4-ol (terpinen-4-ol) | 8.0 | ||||||||||||||
p-menth-1-en-8-ol (α-terpineol) | 6.2 | 0.3 | 0.4 | 16.5 | 0.4 | 2.7 | 3.3 | 3.3 |
Numbers represent percentage of total products for each terpene synthase. Note: Bold and underlined numbers indicate the most abundant compound(s) produced by specific TPS proteins. Limonene from MT04 also contains a (R)-(+)-m-mentha-6,8-diene (sylvestrene)-like compound (see text and Figure S9).
Table 2. Sesquiterpenes produced by ginger and turmeric terpene synthases.
MT00 | MT02A | MT06/ MT06A | MT08 | MT17A2 | ST00A/ ST00B | ST01 | ST02A4 | ST02B | ST02C | ST03 | ST05 | ST05A | ST07 | ST07A | |
δ-elemene | 1.2 | 2.9 | |||||||||||||
unknown ((+)-cyclosativene-like) | 7.8 | 2.1 | |||||||||||||
(+)-cyclosativene | 3.2 | ||||||||||||||
α-copaene | 11.2 | 1.7 | |||||||||||||
unknown (β-elemene-like) | 2.1 | ||||||||||||||
β-elemene | 9.7 | 12.6 | 18.3 | 49.3 | 1.5 | 1 | |||||||||
unknown (7-epi-sesquithujene-like) | 8.1 | 0.8 | 0.3 | ||||||||||||
cis-α-bergamotene | 7.6 | ||||||||||||||
(E)-caryophyllene (β-caryophyllene) | 0.5 | 2.4 | 14.2 | 10.5 | 1.3 | 0.2 | |||||||||
trans-α-bergamotene | 0.1 | 20.6 | 0.2 | ||||||||||||
γ-elemene | 1.1 | 6.1 | 2.7 | ||||||||||||
unknown (guaia-1(5),7(11)-diene-like) | 3.4 | ||||||||||||||
(Z)-β-farnesene | 100 | ||||||||||||||
α-humulene (α-caryophyllene) | 83.4 | 88.4 | 0.4 | ||||||||||||
allo-aromadendrene | 11.8 | ||||||||||||||
unknown (ererrophila-1(10),11-diene-like) | 11.3 | ||||||||||||||
γ-muurolene | 4.7 | ||||||||||||||
unknown (β-chamigrene-like) | 7.4 | ||||||||||||||
ar-curcumene | 0.9 | ||||||||||||||
γ-curcumene | 4.4 | 1.6 | 0.3 | ||||||||||||
unknown (β-sesquiphellandrene-like) | 1.5 | ||||||||||||||
germacrene D | 6.1 | 1.6 | 12.4 | 4.4 | |||||||||||
β-selinene (eudesma-4(14),11-diene) | 51.9 | ||||||||||||||
(−)-α-zingiberene | 67 | ||||||||||||||
γ-amorphene | 65.4 | ||||||||||||||
α-muurolene | 1.6 | 3.1 | 9.5 | ||||||||||||
β-bisabolene | 1.1 | 13.6 | 6.2 | ||||||||||||
(Z)-α-bisabolene | 1.7 | 22.7 | |||||||||||||
γ-cadinene | 8.7 | ||||||||||||||
β-curcumene | 14.5 | ||||||||||||||
7-epi-α-selinene | 14.2 | ||||||||||||||
(−)-β-sesquiphellandrene | 1.9 | 3.7 | 22.7 | ||||||||||||
δ-cadinene (cadina-1(10),4-diene) | 1.2 | 2.8 | 4 | ||||||||||||
(E)-γ-bisabolene | 0.6 | 0.4 | |||||||||||||
unknown (cis-α-bisabolene-like) | 2.1 | ||||||||||||||
1,5,9-trimethyl-1,5,9-cyclododecatriene | 0.8 | ||||||||||||||
unknown (cis-sesquisabinene hydrate-like) | 4.5 | ||||||||||||||
α-elemol | 44.3 | ||||||||||||||
germacrene B | 4.9 | 2.9 | |||||||||||||
(E)-nerolidol | 100 | 41 | 5.8 | ||||||||||||
(−)-caryolan-1-ol (β-caryophyllene alcohol) | 98.7 | 99.5 | |||||||||||||
germacrene D-4-ol | 9.6 | ||||||||||||||
unknown (trans-sesquisabinene hydrate-like1) | 1.2 | ||||||||||||||
unknown (trans-sesquisabinene hydrate-like2) | 0.6 | ||||||||||||||
unknown (selina-6-en-4-ol-like) | 2.4 | ||||||||||||||
unknown (cubenol-like) | 6.5 | ||||||||||||||
unknown (spathulenol-like) | 0.4 | ||||||||||||||
unknown (α-eudesmol-like) | 0.3 | ||||||||||||||
γ-eudesmol | 0.4 | 0.8 | |||||||||||||
unknown (trans-sesquisabinene hydrate-like3) | 0.4 | ||||||||||||||
α-acorenol | 0.2 | ||||||||||||||
epi-α-muurolol (τ-muurolol) | 6 | ||||||||||||||
α-muurolol (δ-cadinol) | 2.5 | ||||||||||||||
α-cadinol | 7.5 | ||||||||||||||
(−)-neointermedeol | 2.1 | 48.7 | |||||||||||||
(+)-intermedeol | 0.8 | ||||||||||||||
epi-β-bisabolol | 22.3 | ||||||||||||||
epi-α-bisabolol | 58.3 | 1.6 | 12.6 | ||||||||||||
α-bisabolol | 38.7 | 7.3 |
Numbers represent percentage of total products for each terpene synthase. Note: Bold and underlined numbers indicate the most abundant compound(s) produced by specific TPSs.
Function of TPSs Explain Formation of Major Terpenoids in Ginger and Turmeric
As seen in Figure 1, the most abundant terpenes in ginger and turmeric are (−)-α-zingiberene and (−)-β-sesquiphellandrene, based on GC/MS peak areas in total ion chromatograms of extracts from these plants. Two very similar TPS proteins, ST00A and ST00B with 98.4% similarity to each other and high similarity to known STPSs, and lacking a chloroplastic transit peptide, were cloned from turmeric rhizome. These expressed well in E. coli and synthesized the same products in vitro when supplied FPP as substrate: (−)-α-zingiberene (49.3%), (−)-β-sesquiphellandrene (40.7%) and β-bisabolene (6.3%) as major products (Figure S2). When expressed in the yeast strain EPY219, which supplies endogenous prenyl diphosphate precursors, GPP and FPP for production of terpenoids in vivo, these proteins produced the same major products, although the ratios of the products differed according to expression temperature and induction time, e.g., (−)-α-zingiberene (67%), (−)-β-sesquiphellandrene (22.7%), and β-bisabolene (6.2%) after 4 days of induction at 18°C (Figure 3; Figure S3). When expressed in other yeast strains or under different expression temperatures and induction times, the ratios of these products varied, as did the amounts of other minor products of these enzymes. The other STPS genes identified in our database did not produce (−)-α-zingiberene or (−)-β-sesquiphellandrene when expressed in E. coli or in various yeast strains. Thus, it appears that these two enzymes and their paralogs are responsible for formation of the most abundant terpenes in the rhizomes of these plants. As outlined in a later section, these compounds appear to be the precursors for the major oxygenated terpenoids in turmeric as well, indicating a major role for these enzymes in determining that plant’s terpenoid profile. A recently reported terpene synthase from sorghum, SbTPS1 also produces (−)-α-zingiberene and (−)-β-sesquiphellandrene as major products in very similar ratios to ST00A/B, although SbTPS1 is only 43% identical with ST00A [30], [31].
β-Phellandrene, the major product of the protein designated MT08, is the second most abundant monoterpene in ginger rhizomes (Figure 1). This protein, a MTPS with a chloroplastic targeting peptide, was very difficult to express in soluble form. However, when finally expressed and assayed, it produced both β-phellandrene (88.3%) and α-pinene (11.7%) (Table 1, Figure S5). No other MTPS produced β-phellandrene as a major product. (−)-β-Phellandrene and (−)-β-sesquiphellandrene are structurally identical, except that the latter has a longer tail. β-Phellandrene can also be produced by the two α-zingiberene/β-sesquiphellandrene synthases (ST00A and ST00B) described above, which make β-phellandrene as 49.2% of their monoterpene production, when provided GPP as substrate. However, GPP is a very poor substrate for these enzymes in general. When considering the fact that monoterpenes are mainly produced in plastids, and ST00A and ST00B are not targeted to that organelle, whereas MT08 is, the latter is likely to be the main enzyme responsible for formation of β-phellandrene in ginger rhizome. Microarray data also shows high expression of MT13 ( = MT08) in ginger rhizome over time (Table S4).
Camphene, the most abundant monoterpene in ginger, was produced by three different enzymes, MT06B, MT09A2 and MT12A-M2, which all function as camphene/α-pinene synthases and produce camphene as the major product when provided with GPP as substrate (Table 1, Figures S6, S7 and S8) but were not able to use FPP as a substrate. In contrast, the enzyme MT04 produces no camphene, but produces α-pinene (60.1%) and β-pinene (30.7%) as major products (Table 1, Figure S9). It is likely that most of the α-pinene accumulated by ginger rhizomes is produced by the camphene/α-pinene synthases (MT06B, MT09A2 and MT12A-M2). The β-pinene peak from ginger rhizome samples is very small (Figure 1). Moreover, the amount of α-pinene produced by MT04 is only about twice that of β-pinene, which suggests that MT04 contributes at most ∼21% of the α-pinene produced in ginger rhizomes. In contrast, ginger roots accumulate both α-pinene and β-pinene at significantly higher levels compared to camphene (Figure 1). Differences between extracts from plants of different ages supported this observation, where MT04 products (α-pinene and β-pinene) are more abundant in roots of 2 month old yellow ginger plants when compared to 7 month old plants; 5 times more for β-pinene. These results were supported by microarray results for tissues from these plants (with 7-fold higher expression in the younger plants, for MT04; Table S4). It thus appears that MT04 synthesizes most of the β-pinene found in ginger rhizomes and roots, whereas α-pinene accumulation is the result of the action of several enzymes. β-pinene is also the main monoterpene in turmeric leaves, but MT04 was not expressed at all in turmeric, suggesting that turmeric possesses an as yet unidentified leaf-specific β-pinene synthase.
Both ginger and turmeric rhizomes produce 1,8-cineole in large amount (Figure 1) and recombinant MT11 makes 1,8-cineole (75.4%), p-menth-1-en-8-ol (α-terpineol) (16.5%), α-pinene (4.4%) and β-pinene (3.7%) with GPP as substrate (Table 1, Figure S10). This enzyme could not use FPP. The relative amounts of 1,8-cineole and p-menth-1-en-8-ol (α-terpineol) in yellow ginger rhizome (Figure 1) are 77.57% and 22.42% respectively, which is similar to but not identical with the MT11 product ratio. p-Menth-1-en-8-ol (α-terpineol) is also produced by MT06/MT06A (6.2%, described below), which may contribute to its production in vivo. According to the microarray data, the expression level of MT11 is high in ginger rhizome, but 29 and 32 times less abundant in ginger root and leaf, respectively, and not expressed at all in turmeric (Table S4). These results confirm MT11 as the likely major contributor to 1,8-cineole synthesis in ginger rhizome, but indicate that there must be a separate turmeric-specific 1,8-cineole synthase that we did not identify.
α-Phellandrene is the most abundant monoterpene in turmeric rhizome (Figure 1). Although ginger contains α-phellandrene, it is not a major product. The enzyme designated MT03 synthesizes α-phellandrene (92.2%) as a major product, with β-phellandrene (3.8%) and several other compounds as minor products (Table 1, Figure S11). FPP was not a substrate for MT03. Microarray data suggest high expression of MT10 ( = MT03, see Table S3) at early stages of turmeric rhizome development, with a reduction in expression over time (Table S4). The levels of α-phellandrene also decrease over development in turmeric rhizome, although this decrease was not as great in magnitude as that observed for the transcript level. It is possible that turmeric rhizome stores α-phellandrene in the rhizome, leading to accumulation in older rhizomes. The expression levels of MT03 in turmeric root and leaf are lower than that observed for turmeric rhizome, and α-phellandrene amounts in root and leaf of 7 month old turmeric plants are 28.7% and 14.5% of that observed for the rhizome of the same plants. Thus, it appears that MT03 is likely the major producer of α-phellandrene in turmeric.
Turmeric rhizome produces p-mentha-1,4(8)-diene (terpinolene) as its second most abundant monoterpene (Figure 1). With GPP as a substrate, the enzyme designated MT07 synthesized p-mentha-1,4(8)-diene (terpinolene) (89.9%) as a major (Table 1, Figure S12). MT07 did not utilize FPP as substrate. MT07 is not expressed in ginger. Based on microarray data, it is expressed in turmeric rhizome (Table S4). In ginger, only the rhizomes possessed detectable amounts of p-mentha-1,4(8)-diene (Figure 1), which is probably produced by MT06/MT06A. The production of p-mentha-1,4(8)-diene (terpinolene) in turmeric root and leaf (Figure 1) and the lack of MT07 expression in turmeric root and leaf suggests there could be other p-mentha-1,4(8)-diene (terpinolene) synthases in those tissues.
Ginger rhizome and leaf and turmeric leaf produce small amounts of linalool (Figure 1). Two MTPSs, MT00 and MT17 from turmeric leaf, produce linalool from GPP. The best hit from a BLAST search of MT00 against public sequence databases is to another linalool synthase ((3S)-linalool/(E)-nerolidol synthase [Vitis vinifera] [32]). Enzyme assays with recombinant MT00 produced linalool (100%) with GPP as a substrate and (E)-nerolidol (100%) with FPP as a substrate (Table 1 and 2, Figure S13). MT17, in contrast, is more similar to other monoterpene synthases than to other known linalool synthases. Efforts to clone a full length cDNA for MT17 yielded diverse paralogs: MT17A, MT17A2, MT17B, MT17B2, MT17C and MT17D (Figure 2). Enzyme assays with MT17A2 synthesized linalool (100%) with GPP and a variety of sesquiterpenes with FPP as substrate (Table 1 and 2, Figure S14), with cis-α-bisabolene (22.7%), trans-α-bergamotene (20.6%), β-bisabolene (13.6%), epi-α-bisabolol (12.6%) as the major products. When a chiral column was used to analyze the products of these assays, we found that MT00 produces (S)-(+)-linalool while MT17A2 synthesizes (R)-(−)-linalool (Figure 4). Thus, these are indeed quite different enzymes, yielding different products, which are both unfortunately trivially called “linalool”. The evolution of MT00 is likely to have followed the same course as the Vitis vinifera S-linalool synthase, whereas MT17 is like other known R-linalool synthases, which have evolved very recently in their respective plant species from other MTPSs.
MT00 and MT17 are not expressed in ginger tissues. Instead, MT06/MT06A, which produces 24.8% linalool, is likely to be the enzyme responsible for formation of -linalool in ginger. MT06 and MT06A are 98.0% identical, the only differences are found in the transit peptide sequences (Figure S2), and thus appear to be paralogs. MT06 and MT06A produce a variety of monoterpenoids and sesquiterpenoids (Table 1 and 2, Figures S15 and S1). Several MT06/MT06A products, such as α-thujene (3-thujene), p-menth-1-en-4-ol (terpinen-4-ol), β-curcumene, epi-β-bisabolol and some unknown compounds, were only produced by MT06/MT06A among the recombinant proteins that we characterized. Although it is possible that there are other yet unidentified enzymes that specifically synthesize these products, these are minor products in ginger and turmeric tissues and it is likely that they are produced by the paralogs MT06/MT06A.
There are small amounts of epi-α-bisabolol and α-bisabolol in ginger and turmeric rhizomes. MT02A was difficult to express in soluble form, but produced the sesquiterpenoids epi-α-bisabolol (58.3%) and α-bisabolol (38.7%) as major products when expressed in yeast strain EPY224. MT02A also synthesized trace amounts of (Z)-α-bisabolene (1.7%), β-bisabolene (1.1%), and trans-α -bergamotene (0.1%) (Figure S16). Based on sequence similarity, MT02A is classified as a monoterpene synthase, however it did not produce monoterpenes in yeast. This may be due to the properties of the enzyme or because yeast may have limited ability to produce monoterpenes [33], [34].
The sesquiterpene β-selinene (eudesma-4(14),11-diene) is produced at detectable levels in ginger rhizome, but is found only at trace amounts in ginger root and is not detectable in ginger leaf or turmeric tissues. Recombinant ST01 synthesized β-selinene (eudesma-4(14),11-diene) (51.9%) as the major product when FPP was used as substrate (Table 2, Figure S17). With GPP as a substrate, ST01 did not produce any detectable product. ST01 is expressed at higher levels in ginger rhizome than ginger root and leaf and is not expressed in turmeric, according to microarray data (Table S4), supporting the role of this enzyme in production of β-selinene in vivo.
None of the STPS genes were represented by complete sequences in the EST database, requiring additional efforts such as RACE or genome walking to yield full length cDNA sequences. During the cloning of one such gene, designated ST02, many paralogs were found, ST02A, ST02A2, ST02A3, ST02A4, ST02B, ST02B2-FS, ST02C and ST02C2, which had similar yet distinct product profiles. Although many of the corresponding recombinant proteins for these genes were insoluble, some were soluble or partially soluble in E. coli or yeast. Yeast strain EPY224 expressing ST02A4 produced (−)-neointermedeol (48.7%) as a major product and a long list of minor products (Table 2, Figure S18). Enzyme assays using E. coli crude extract expressing ST02B with GPP as a substrate produced several monoterpenes, with linalool (25.1%), myrcene (25.0%), limonene (15.5%) as major products (Table 1, Figure S19). With FPP as a substrate, ST02B also produced several sesquiterpenes, with α-elemol (44.3%) as the major product (Figure S20). Enzyme assays using E. coli crude extracts expressing ST02C with GPP as a substrate produced several monoterpenes, with myrcene (30.4%), limonene (17.9%), linalool (15.6%) as major products (Table 1, Figure S21). Similar assays for ST02C with FPP as a substrate produced β-elemene (49.3%) and germacrene D (12.4%) as major products and a long list of minor products (Table 2, Figure S22). It is likely that the β-elemene detected in these assays is a thermal degradation product of germacrene A [35]–[36] (Figure 5), because our GC/MS inlet was initially set to a high temperature (220°C). Reducing the GC inlet temperature (to 150°C), as has been suggested [37], led to a decrease in the amount of β-elemene detected with an increase in germacrene A detected in our system. Although the amount of germacrene A was still small and the reduction of β-elemene not so dramatic, it is likely that the β-elemene that we detected may indeed result from thermal degradation of germacrene A. Similarly, α-elemol is likely the thermal breakdown product of (+)-hedycaryol (Figure 5). However the low inlet temperature led to greatly reduced sensitivity, so the relative product amounts shown in Table 2 are based on injection at 220°C.
Based on the microarray data, ST02 was expressed 8-, 120- and 36-fold higher in ginger rhizome compared to ginger root, ginger leaf and turmeric leaf, respectively. The microarray data indicated lack of expression of ST02 in turmeric rhizome and root (Table S4). Based on ginger and turmeric terpene synthase product profiles, several terpenes were exclusively produced by ST02A4, ST02B or ST02C, including δ-elemene, (+)-cyclosativene, α-copaene, γ-elemene, γ-muurolene, α-muurolene, δ-cadinene (cadina-1(10),4-diene), α-elemol, germacrene B although some are produced at very low levels. Among these compounds, γ-muurolene and δ-cadinene (cadina-1(10),4-diene) were not detected from ginger or turmeric samples, α-copaene was detected in both ginger rhizome and root samples, and all others are only detected in ginger rhizome samples (Figure 1). When these results are considered together, it appears that diverse ST02 paralogs play an important role in enriching sesquiterpene diversity in ginger rhizome.
The sesquiterpene synthase designated ST03 is very similar (89%–95% identical) to ST02 derived genes. However, recombinantly expressed ST03 synthesizes different products compared to the ST02 enzymes. Enzyme assays with FPP as substrate and using crude E. coli extract containing expressed ST03 yielded γ-amorphene (65.4%) as the major product, with allo-aromadendrene (11.8%), germacrene D-4-ol (9.6%), γ-cadinene (8.7%) and germacrene D (4.4%) also produced at appreciable levels (Table 2, Figure S23). Comparable assays with GPP as substrate yielded a number of monoterpenoids (Table 1, Figure S24), although this enzyme is not likely transported to the plastid. The major sesquiterpene product of this enzyme, γ-amorphene, is present at low levels in ginger, and is barely detectable in turmeric (Figure 1). It is difficult to detect γ-amorphene in extracts from ginger rhizome (the tissue where the ST03 gene is most highly expressed) because it comes off of the GC column immediately after the very large (−)-α-zingiberene peak. Two products of ST03, allo-aromadendrene and γ-cadinene, are more readily detected, with allo-aromadendrene present in ginger rhizome, root and leaf and γ-cadinene in ginger rhizome. ST03, which is expressed in ginger rhizome, root and leaf based on microarray data (Table S4), is the likely source of these compounds. And as expected based on the chemical profile, ST03 was not expressed in turmeric according to microarray data.
α-Humulene (also called α-caryophyllene) is present in all turmeric tissues and in ginger rhizome at very low levels. However, it is the most abundant sesquiterpene in ginger roots. ST05 (and close paralog ST05A), which produces α-humulene in vitro, was expressed at much higher levels in ginger root than in other tissues (Table S4). ST05 and ST05A are very similar paralogs: 99.1% and 98.1% identity at the DNA and amino acid sequence levels, respectively. Although they are very similar, their solubilities are different. ST05A is barely soluble and ST05 is quite soluble (Table S5). Therefore, we tried to purify ST05A using a HIS-tag. However, the amount of soluble ST05A was small, and some E. coli proteins co-eluted, leading only to partial purification. Enzyme assays with crude E. coli extracts containing expressed recombinant ST05 or with partially purified ST05A produced very similar results. These assays did not produce detectable monoterpenes with GPP as a substrate. With FPP as a substrate, however, ST05 synthesized α-humulene (α-caryophyllene) (83.4%) as the major product and (E)-caryophyllene (β-caryophyllene) (14.2%), β-elemene (1.5%) and 1,5,9-trimethyl-1,5,9-cyclododecatriene (0.8%) as minor products (Table 2, Figure S25). ST05A showed a similar product profile, except for the last compound, which was not detected. An α-humulene synthase from shampoo ginger (Zingiber zerumbet Smith) has 91% similarity to ST05 and ST05A. That enzyme was reported to produce α-humulene (α-caryophyllene) as major product and small amounts of (E)-caryophyllene (β-caryophyllene) [21]. However, shampoo ginger α-humulene synthase did not synthesize β-elemene or 1,5,9-trimethyl-1,5,9-cyclododecatriene. It is possible that these compounds were not detected due to very low abundance, just as we could not detect 1,5,9-trimethyl-1,5,9-cyclododecatriene in our ST05A assays. Interestingly, (E)-caryophyllene is the most abundant sesquiterpene in ginger leaf but there are only small amounts of α-humulene in ginger leaves. This suggests that either α-humulene is converted in ginger leaves into another compound (which we could not detect in our metabolite profiling experiments) or ginger leaves have a different terpene synthase that produces (E)-caryophyllene as a major product.
A different pair of parologous STPSs, ST07 and ST07A (97% identical to each other), is not expressed in turmeric, but is expressed at high levels in ginger root and leaf tissues (Table S4). These expression patterns are similar to (E)-caryophyllene production patterns in ginger and turmeric tissues, suggesting that ST07 and ST07A might be good candidates for involvement in production of (E)-caryophyllene. These proteins were essentially insoluble in E. coli expression systems (several were tested, see Table S5). Despite their high sequence similarity, ST07A expressed in yeast is highly soluble, whereas ST07 expressed in yeast was much less soluble when checked by Western blotting. Based on GC/MS library searches, the best hits for the major product of yeast-expressed ST07 and ST07A (see Figures S26 and S27) are caryophyllenyl alcohol, caryolan-1-ol or caryolan-8-ol, all of which are oxygenated forms of (E)-caryophyllene. It seems unlikely that caryophyllenyl alcohol could be produced directly by a terpene synthase, whereas caryolan-1-ol, caryolan-8-ol or other oxygenated forms of (E)-caryophyllene (Compound A, Compound B and Compound C shown in Figure 6) can be made by a terpene synthase. We could not find oxygenated (E)-caryophyllene derivatives in ginger and turmeric except for caryophyllene oxide. Therefore, it seemed possible that ST07 and ST07A may produce (E)-caryophyllene, which is observed in all tissues of both ginger and turmeric, and that (E)-caryophyllene produced in yeast would then be oxygenated by yeast proteins in the expression system used. However, (E)-caryophyllene was produced in yeast without signs of being oxygenated [36]. Therefore, ST07 and ST07A do indeed appear to produce oxygenated forms of (E)-caryophyllene. In ginger, the product of ST07 and ST07A, an oxygenated (E)-caryophyllene, could be further processed by other enzymes and would therefore not be detected in our GC/MS analysis shown in Figure 1.
Recently, a (+)-caryolan-1-ol synthase was reported from bacteria, Streptomyces griseus [38], and the mass spectrum of that compound was very similar to the major product of ST07 and ST07A. In contrast, plants have been reported to produce the opposite enantiomer of (E)-caryophyllene, (−)-(E)-caryophyllene. Comparison with authentic standards of (−)-caryolan-1-ol and (+)-caryolan-1-ol in a general purpose GC column and a chiral column confirmed that the major product of ST07 and ST07A is (−)-caryolan-1-ol (β-caryophyllene alcohol) (Figures 7, S28 and S29). To our knowledge, this is the first example of a (−)-caryolan-1-ol synthase.
Identification of α-zingiberene/β-sesquiphellandrene Oxidase
As discussed above, (+)-α-turmerone and (+)-β-turmerone appear to be produced by the oxidation of (−)-α-zingiberene and (−)-β-sesquiphellandrene, respectively. By comparing EST data, microarray data and metabolite data, four P450 monooxygenases were selected from 170 ginger and turmeric P450s as the most likely candidates to be involved in formation of these two sesquiterpenoids. These four monooxygenases were named P1, P2, P3 and P4. P2 and P3 were partial clones in the EST database, missing 5′ ends, and genome walking revealed the complete sequences. P1, P2 and P4 are very similar to each other and belong to the clade (CYP71D) that contains limonene hydroxylase, which forms a secondary hydroxyl group. P3 is different from the other three P450s and it belongs to the clade (CYP71AV1) that contains amorphadiene oxidase. Amorphadiene oxidase catalyzes a three step of oxidation (primary alcohol → aldehyde → acid) at the end of a hydrocarbon chain [25] (although the latter two oxidations are much less favorable than the first) and limonene hydroxylase forms a secondary alcohol on a six carbon ring [39]. Oxygenation of (−)-α-zingiberene and (−)-β-sesquiphellandrene at position 9 (Figure 8) is a more similar reaction to that catalyzed by limonene hydroxylase than to that by amorphadiene oxidase. Therefore, P1, P2 and P4 seemed to be the best candidates for the first step in the conversion of (−)-α-zingiberene and (−)-β-sesquiphellandrene into (+)-α-turmerone and (+)-β-turmerone.
Because all three enzymes were very similar, P1 and P4 were selected for cloning from turmeric rhizome cDNA, yielding five paralogs, P1A, P1A2, P4, P4A and P4A2, which were expressed in yeast. Co-expression of these P450s with ST00A in EPY224 that was not expressing a cytochrome P450 reductase gene did not produce hydroxylated forms of (−)-α-zingiberene or (−)-β-sesquiphellandrene. Co-expression of these P450s with ST00A and sweet basil P450 reductase (Ob_CPR) yielded some products that appear to be hydroxylated forms of (−)-α-zingiberene and (−)-β-sesquiphellandrene, according to comparison of mass spectra of these peaks with (−)-α-zingiberene, (−)-β-bisabolene and (−)-β-sesquiphellandrene (Figure 9). However, these hydroxylated products are not registered in the NIST GC/MS library, and unambiguous identification will require significant work in the future. Nevertheless, we can draw conclusions using our current data that suggest that these are indeed the intermediates of interest for the biosynthesis of the turmerones.
In Figure 9H, peak 2 and 4 have m/z 220, which is the mass of hydroxylated forms of (−)-α-zingiberene, (−)-β-bisabolene or (−)-β-sesquiphellandrene, and peak 1 and 3 have m/z 222. Although m/z 222 is not what would be expected for hydroxylated forms of (−)-α-zingiberene, α-bisabolol (MW 220) also contains m/z 222 in its mass spectrum. Other ions such as m/z 93, 119, 68, etc., also support the proposal that these peaks are likely to be hydroxylated forms of (−)-α-zingiberene, (−)-β-bisabolene or (−)-β-sesquiphellandrene. Because these peaks have different mass spectra than those of α-bisabolol, epi- α-bisabolol or β-bisabolol, it is likely that they are derived instead from (−)-α-zingiberene or (−)-β-sesquiphellandrene. Thus, the biosyntheses of (+)-α-turmerone and (+)-β-turmerone likely originates by the action of α-zingiberene/β-sesquiphellandrene synthase, followed by the action of α-zingiberene/β-sesquiphellandrene hydroxylase (P1, P2, or P4) and is completed by the action of a dehydrogenase (still uncharacterized) that converts the secondary alcohols to the ketone forms.
Discussion
Evolution of Terpene Synthases in Ginger and Turmeric Investigated through Protein Structural Modeling
We identified 25 mono- and 16 sesquiterpene synthases from ginger and turmeric and revealed the function of 13 mono- and 11 sesquiterpene synthases. The products of these TPSs and terpenoids in tissues correspond well except for a few instances, where for example high levels of citral (geranial+neral) were observed in ginger rhizomes and leaves, but no enzyme with geraniol synthase activity was observed. Geraniol is the precursor of citral. It is likely that one of the insoluble enzymes is a geraniol synthase.
Many of the identified TPSs are very similar to each other and are considered to be paralogs, of which some have conserved functions whereas others have very divergent functions. For example, MT06 and MT06A were 98.0% identical and only had differences in the transit peptide sequences with one gap. Also, ST00A and ST00B, with 98.4% similarity, synthesized the same products. However, although MT06B is 92.9% identical to MT06 and 90.8% to MT06A, the products of these enzymes are quite different. MT06 and MT06A can produce sesquiterpenes whereas MT06B cannot. When MT06 and MT06B protein structures were modeled using (4S)-limonene synthase from Mentha spicata [40] as a template, their backbone structures were found to be very similar but the side chains in the substrate binding pocket were different (Figure 10). F327 of MT06B appears to prevent FPP binding whereas MT06 has a leucine at that position, which can allow FPP binding, leading to production of sesquiterpenes. Although MT06B has one extra amino acid in the loop (arrow in Figure 10B) when compared to MT06 and (4S)-limonene synthase, the tyrosines after the loop (Y576 in MT06, Y577 in MT06B) are aligned very well in the modeled structures and do not seem to affect protein function. These results suggest that replacement of F327 with a Leu may allow expanded substrate versatility and product production.
ST02B and ST02C are 95.7% identical. However, ST02C produces β-elemene (germacrene A) as a major product and ST02B produces α-elemol as a major product and β-elemene (germacrene A) as a minor product. When their protein structures were modeled based on (+)-δ-cadinene synthase from Gossypium arboreum [41], we could not see side chain differences around the active sites in the ST02B and ST02C modeled structures (Figure 11). According to the modeled structures, the main differences lie in the N-terminal loop. The N-terminal end loop of ST02B is closer to the C-terminal end loop than was predicted for ST02C. The difference in contact of the N- and C-terminal end loops may affect protein breathing and cause easy access for water molecules to quench the reaction. ST02A4 is also very similar to ST02B and ST02C: 95.5% and 96.6% similarity, respectively. ST02A4 produces (−)-neointermedeol as a major product and β-elemene (germacrene A) as a minor product. When the ST02A4 structure was modeled against the (+)-δ-cadinene synthase structure and compared with the modeled ST02B and ST02C structures, there was no difference in side chains near the active site. Again, ST02A4 has a different loop structure at the N-terminal end. The expected structure for ST02A4 is more similar to ST02B than to ST02C because ST02C is expected to have an extended α-helix and shorter loop (Figure 11). The loop around the tryptophan at the N-terminal end of the RRX8W motif is in contact with the C-terminal end of the helix (Figure 11D) in all three modeled structures. This interaction may stabilize the C-terminal end structure near the active site. Although ST02A4 and ST02B synthesize different major products, both compounds are quenched by a water molecule, which can be explained by their similar loop structures at the N-terminal ends. ST02C and the template, (+)-δ-cadinene synthase, have similar loop structures at the N-terminal ends, prolonged α-helices and shorter loops, and both produce the terpenes not quenched by water molecules.
ST02A4, ST02B and ST02C appear to have diverged recently and their modeled structures are very similar. They also produce similar products: ST02A4 produces (−)-neointermedeol, ST02B produces α-elemol and ST02C produces β-elemene (germacrene A) as major products. These compounds are synthesized via the same mechanistic pathway through (+)-germacrene A (Figure 5).
(+)-α-Turmerone and (+)-β-turmerone are Derived from (−)-α-zingiberene and (−)-β-sesquiphellandrene
During efforts to identify ginger and turmeric compounds eluted in our GC/MS analyses, four major sesquiterpenoids stood out as the most abundant in turmeric rhizomes. Two of these compounds were clearly (−)-α-zingiberene and (−)-β-sesquiphellandrene, also the major compounds in ginger rhizomes. The other two compounds were oxygenated, and had best hits to “tumerone” and “curlone” when searched against the NIST database. A search for tumerone (CAS# 180315-67-7) in SciFinder Scholar gave only two references: one from 1934 and the other in 1996 [23], which used analysis of fragmented masses in GC/MS but not NMR for structural characterization. No absolute configuration data were reported for this compound; thus the actual structure of tumerone was clearly in question. Our data suggest that the compound detected in our samples was instead (+)-α-turmerone, with absolute configuration of (6R, 7S). The absolute configuration of (−)-curlone (CAS# 87440-60-6), on the other hand, was proposed to be (6S, 7S) by dehydrogenation to form (+)-ar-turmerone (7S) and NOE correlations that were not shown in the original NMR-based characterization of that compound [42] that were said to indicate possible 6S configuration, based on some assumptions regarding molecule conformation that may not be valid. These results suggested that either (−)-curlone exists as a (6S, 7S) diastereomer of (+)-β-turmerone (6R, 7S), or it was incorrectly assigned the 6S configuration and (−)-curlone and (+)-β-turmerone are one and the same compound. The latter is very likely the case [43].
When we compared the terpenoid profiles of ginger and turmeric, we observed that both ginger and turmeric produce (−)-α-zingiberene and (−)-β-sesquiphellandrene, but only turmeric synthesizes the two oxygenated compounds. Ginger and turmeric produce more (−)-α-zingiberene than (−)-β-sesquiphellandrene and one oxygenated compound (α-turmerone) is also more abundant than the other oxygenated compound (β-turmerone, see Figure 1 and Figure 8). The ratios of these compounds are fairly consistent in all analyses. Indeed, when we view (−)-α-zingiberene (green) and (+)-α-turmerone (orange) as one group and (−)-β-sesquiphellandrene (blue) and (+)-β-turmerone (red) as a second group, the ratios of the first group to the second group are 2.7 and 2.9, respectively, in Figure 1D and 1E (in turmeric rhizomes), which is very similar to the ratio (3.2) of (−)-α-zingiberene to (−)-β-sesquiphellandrene in ginger rhizome (Figure 1A). This suggested that the two oxygenated compounds ((+)-α-turmerone and (+)-β-turmerone) could indeed be derived from (−)-α-zingiberene and (−)-β-sesquiphellandrene, respectively, and, based on absolute configuration of the compounds in question (all four share the 6R, 7S configuration), and the mechanism whereby the ketones are formed from the sesquiterpenes, which would not be expected to lead to inversion of configuration at C6, the oxygenated compounds present in the turmeric lines that we tested must indeed be (+)-α-turmerone and (+)-β-turmerone, even though they were known as tumerone and curlone in the NIST database. These results again point to the care that must be taken in such metabolite profiling experiments, where one should not rely solely on database searches to identify compounds in complex plant samples. It is possible that some turmeric accessions or plants do contain (−)-curlone, the diastereomer of (+)-β-turmerone. However, since it seems very unlikely that (−)-β-sesquiphellandrene is converted into (−)-curlone, either another route to (−)-curlone must exist in other turmeric lines, or the original paper describing (−)-curlone was incorrect in assignment of the configuration at C6. If (−)-curlone is present in other plants, its biosynthesis would involve the alternate intermediate, cyclohexene, 3-[(1S)-1,5-dimethyl-4-hexen-1-yl]-6-methylene-, (3S)- (CAS# 251318-35-1) (Figure 8), which has only been reported as a synthetic molecule [44]. (−)-β-Sesquiphellandrene and (−)-α-zingiberene are both produced by the STPS through an (S)-bisabolyl cation followed by a C7/C1 1,3-hydride shift (Figure 8). The precursor for (−)-curlone would have to be produced from the (R)-bisabolyl cation via a C7/C1 1,3-hydride shift. Thus, (−)-curlone synthesis (should it occur in nature) likely does not involve ST00A/ST00B, which appears to utilize only the (S)-bisabolyl cation as an enzyme-bound intermediate.
As discussed above, the cloned α-zingiberene/β-sesquiphellandrene synthases produce (−)-α-zingiberene and (−)-β-sesquiphellandrene with a similar ratio to that observed for the compounds measured in ginger and turmeric rhizome samples. Identification of these genes supports the conclusion that the blue colored peak in the metabolite profiling experiments (Figure 1) is (−)-β-sesquiphellandrene, leading to the conclusion that the red peak is (+)-β-turmerone instead of (−)-curlone.
Supporting Information
Acknowledgments
We thank to Dr. Jay Keasling for yeast cells (EPY219, EPY224) and plasmids (pMevT, pMBI and pMBIS), Dr. Joseph Noel for pH9GW plasmid and Dr. Yasuo Ohnishi for authentic standards of (−)-caryolan-1-ol and (+)-caryolan-1-ol.
Funding Statement
This research was funded by a grant from the US National Science Foundation to DRG (grant #DBI-0227618; www.nsf.gov). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Kundu JK, Na HK, Surh YJ (2009) Ginger-derived phenolic substances with cancer preventive and therapeutic potential. Forum Nutr 61: 182–192. [DOI] [PubMed] [Google Scholar]
- 2. Lopez P, Sanchez C, Batlle R, Nerin C (2005) Solid- and vapor-phase antimicrobial activities of six essential oils: susceptibility of selected foodborne bacterial and fungal strains. J Agric Food Chem 53: 6939–6946. [DOI] [PubMed] [Google Scholar]
- 3. Ficker CE, Arnason JT, Vindas PS, Alvarez LP, Akpagana K, et al. (2003) Inhibition of human pathogenic fungi by ethnobotanically selected plant extracts. Mycoses 46: 29–37. [DOI] [PubMed] [Google Scholar]
- 4. Habib SH, Makpol S, Abdul Hamid NA, Das S, Ngah WZ, et al. (2008) Ginger extract (Zingiber officinale) has anti-cancer and anti-inflammatory effects on ethionine-induced hepatoma rats. Clinics (Sao Paulo) 63: 807–813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Nigam N, George J, Srivastava S, Roy P, Bhui K, et al. (2009) Induction of apoptosis by [6]-gingerol associated with the modulation of p53 and involvement of mitochondrial signaling pathway in B[a]P-induced mouse skin tumorigenesis. Cancer Chemother Pharmacol 24: 24. [DOI] [PubMed] [Google Scholar]
- 6. Jeong CH, Bode AM, Pugliese A, Cho YY, Kim HG, et al. (2009) [6]-Gingerol suppresses colon cancer growth by targeting leukotriene A4 hydrolase. Cancer Res 69: 5584–5591. [DOI] [PubMed] [Google Scholar]
- 7. Shukla Y, Singh M (2007) Cancer preventive properties of ginger: a brief review. Food Chem Toxicol 45: 683–690. [DOI] [PubMed] [Google Scholar]
- 8. Wang G, Li X, Huang F, Zhao J, Ding H, et al. (2005) Antitumor effect of beta-elemene in non-small-cell lung cancer cells is mediated via induction of cell cycle arrest and apoptotic cell death. Cell Mol Life Sci 62: 881–893. [DOI] [PubMed] [Google Scholar]
- 9. Jianjun Q, Song Z, Yin L, Jia Z, Donglei L (2008) Treatment of chylothorax with elemene. Thorac Cardiovasc Surg 56: 103–105. [DOI] [PubMed] [Google Scholar]
- 10. Yamahara J, Mochizuki M, Rong HQ, Matsuda H, Fujimura H (1988) The anti-ulcer effect in rats of ginger constituents. J Ethnopharmacol 23: 299–304. [DOI] [PubMed] [Google Scholar]
- 11. Yamahara J, Hatakeyama S, Taniguchi K, Kawamura M, Yoshikawa M (1992) Stomachic principles in ginger. II. Pungent and anti-ulcer effects of low polar constituents isolated from ginger, the dried rhizoma of Zingiber officinale Roscoe cultivated in Taiwan. The absolute stereostructure of a new diarylheptanoid. Yakugaku Zasshi 112: 645–655. [DOI] [PubMed] [Google Scholar]
- 12. Jurenka JS (2009) Anti-inflammatory properties of curcumin, a major constituent of Curcuma longa: a review of preclinical and clinical research. Altern Med Rev 14: 141–153. [PubMed] [Google Scholar]
- 13. Ravindran J, Prasad S, Aggarwal BB (2009) Curcumin and cancer cells: how many ways can curry kill tumor cells selectively? Aaps J 11: 495–510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Jayaprakasha GK, Jena BS, Negi PS, Sakariah KK (2002) Evaluation of antioxidant activities and antimutagenicity of turmeric oil: a byproduct from curcumin production. Z Naturforsch [C] 57: 828–835. [DOI] [PubMed] [Google Scholar]
- 15. Negi PS, Jayaprakasha GK, Jagan Mohan Rao L, Sakariah KK (1999) Antibacterial activity of turmeric oil: a byproduct from curcumin manufacture. J Agric Food Chem 47: 4297–4300. [DOI] [PubMed] [Google Scholar]
- 16. Nishiyama T, Mae T, Kishida H, Tsukagawa M, Mimaki Y, et al. (2005) Curcuminoids and sesquiterpenoids in turmeric (Curcuma longa L.) suppress an increase in blood glucose level in type 2 diabetic KK-Ay mice. J Agric Food Chem 53: 959–963. [DOI] [PubMed] [Google Scholar]
- 17. Lee Y (2009) Activation of apoptotic protein in U937 cells by a component of turmeric oil. BMB Rep 42: 96–100. [DOI] [PubMed] [Google Scholar]
- 18. Lee HS (2006) Antiplatelet property of Curcuma longa L. rhizome-derived ar-turmerone. Bioresource Technology 97: 1372–1376. [DOI] [PubMed] [Google Scholar]
- 19. Picaud S, Olsson ME, Brodelius M, Brodelius PE (2006) Cloning, expression, purification and characterization of recombinant (+)-germacrene D synthase from Zingiber officinale . Arch Biochem Biophys 452: 17–28. [DOI] [PubMed] [Google Scholar]
- 20. Fujisawa M, Harada H, Kenmoku H, Mizutani S, Misawa N (2010) Cloning and characterization of a novel gene that encodes (S)-beta-bisabolene synthase from ginger, Zingiber officinale . Planta 232: 121–130. [DOI] [PubMed] [Google Scholar]
- 21. Yu F, Okamto S, Nakasone K, Adachi K, Matsuda S, et al. (2008) Molecular cloning and functional characterization of alpha-humulene synthase, a possible key enzyme of zerumbone biosynthesis in shampoo ginger (Zingiber zerumbet Smith). Planta 227: 1291–1299. [DOI] [PubMed] [Google Scholar]
- 22. Yu F, Harada H, Yamasaki K, Okamoto S, Hirase S, et al. (2008) Isolation and functional characterization of a beta-eudesmol synthase, a new sesquiterpene synthase from Zingiber zerumbet Smith. FEBS Lett 582: 565–572. [DOI] [PubMed] [Google Scholar]
- 23. Hiserodt R, Hartman TG, Ho CT, Rosen RT (1996) Characterization of powdered turmeric by liquid chromatography mass spectrometry and gas chromatography mass spectrometry. Journal of Chromatography A 740: 51–63. [Google Scholar]
- 24. Martin VJ, Pitera DJ, Withers ST, Newman JD, Keasling JD (2003) Engineering a mevalonate pathway in Escherichia coli for production of terpenoids. Nat Biotechnol 21: 796–802. [DOI] [PubMed] [Google Scholar]
- 25. Ro DK, Paradise EM, Ouellet M, Fisher KJ, Newman KL, et al. (2006) Production of the antimalarial drug precursor artemisinic acid in engineered yeast. Nature 440: 940–943. [DOI] [PubMed] [Google Scholar]
- 26. Jiang H, Xie Z, Koo HJ, McLaughlin SP, Timmermann BN, et al. (2006) Metabolic profiling and phylogenetic analysis of medicinal Zingiber species: Tools for authentication of ginger (Zingiber officinale Rosc). Phytochemistry 67: 1673–1685. [DOI] [PubMed] [Google Scholar]
- 27.Adams RP (1995) Identification of Essential Oil Components by Gas Chromatography/Mass Spectroscopy. Illinois, USA: Allured Publishing Co.
- 28. Arnold K, Bordoli L, Kopp J, Schwede T (2006) The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22: 195–201. [DOI] [PubMed] [Google Scholar]
- 29. Leong-Skornickova J, Sida O, Jarolimova V, Sabu M, Fer T, et al. (2007) Chromosome numbers and genome size variation in Indian species of Curcuma (Zingiberaceae). Annals of Botany 100: 505–526. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Garms S, Chen F, Boland W, Gershenzon J, Köllner TG (2012) A single amino acid determines the site of deprotonation in the active center of sesquiterpene synthases SbTPS1 and SbTPS2 from Sorghum bicolor. Phytochemistry 75: 6–13. [DOI] [PubMed] [Google Scholar]
- 31. Zhuang X, Köllner TG, Zhao N, Li G, Jiang Y, et al. (2012) Dynamic evolution of herbivore-induced sesquiterpene biosynthesis in sorghum and related grass crops. Plant J 69: 70–80. [DOI] [PubMed] [Google Scholar]
- 32. Martin DM, Aubourg S, Schouwey MB, Daviet L, Schalk M, et al. (2010) Functional annotation, genome organization and phylogeny of the grapevine (Vitis vinifera) terpene synthase gene family based on genome assembly, FLcDNA cloning, and enzyme assays. BMC Plant Biol 10: 226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Oswald M, Fischer M, Dirninger N, Karst F (2007) Monoterpenoid biosynthesis in Saccharomyces cerevisiae. FEMS Yeast Res 7: 413–421. [DOI] [PubMed] [Google Scholar]
- 34. Fischer MJ, Meyer S, Claudel P, Bergdoll M, Karst F (2011) Metabolic engineering of monoterpene synthesis in yeast. Biotechnol Bioeng 108: 1883–1892. [DOI] [PubMed] [Google Scholar]
- 35. de Kraker JW, Franssen MC, de Groot A, Konig WA, Bouwmeester HJ (1998) (+)-Germacrene A biosynthesis. The committed step in the biosynthesis of bitter sesquiterpene lactones in chicory. Plant Physiol 117: 1381–1392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Gopfert JC, Macnevin G, Ro DK, Spring O (2009) Identification, functional characterization and developmental regulation of sesquiterpene synthases from sunflower capitate glandular trichomes. BMC Plant Biol 9: 86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Wang G, Tian L, Aziz N, Broun P, Dai X, et al. (2008) Terpene biosynthesis in glandular trichomes of hop. Plant Physiol 148: 1254–1266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Nakano C, Horinouchi S, Ohnishi Y (2011) Characterization of a novel sesquiterpene cyclase involved in (+)-caryolan-1-ol biosynthesis in Streptomyces griseus. J Biol Chem 2011: 21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Ponnamperuma K, Croteau R (1996) Purification and characterization of an NADPH-cytochrome P450 (cytochrome c) reductase from spearmint (Mentha spicata) glandular trichomes. Arch Biochem Biophys 329: 9–16. [DOI] [PubMed] [Google Scholar]
- 40. Hyatt DC, Youn B, Zhao Y, Santhamma B, Coates RM, et al. (2007) Structure of limonene synthase, a simple model for terpenoid cyclase catalysis. Proc Natl Acad Sci U S A 104: 5360–5365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Gennadios HA, Gonzalez V, Di Costanzo L, Li A, Yu F, et al. (2009) Crystal structure of (+)-delta-cadinene synthase from Gossypium arboreum and evolutionary divergence of metal binding motifs for catalysis. Biochemistry 48: 6175–6183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Kiso Y, Suzuki Y, Watanabe N, Oshima Y, Hikino H (1983) Stereostructure of curlone, a sesquiterpenoid of Curcuma longa rhizomes. Phytochemistry 22: 596–597. [Google Scholar]
- 43. Golding BT, Pombo-Villar E (1992) Structures of α- and β-turmerone. Journal of the Chemical Society, Perkin Transactions 1: 1519–1524. [Google Scholar]
- 44. Kreiser W, Korner F (1999) Stereospecific synthesis of (+)-beta-sesquiphellandrene. Helvetica Chimica Acta 82: 1427–1433. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.