Abstract
Galanthamine is an Amaryllidaceae alkaloid used to treat the symptoms of Alzheimer’s disease. This compound is primarily isolated from daffodil (Narcissus spp.), snowdrop (Galanthus spp.), and summer snowflake (Leucojum aestivum). Despite its importance as a medicine, no genes involved in the biosynthetic pathway of galanthamine have been identified. This absence of genetic information on biosynthetic pathways is a limiting factor in the development of synthetic biology platforms for many important botanical medicines. The paucity of information is largely due to the limitations of traditional methods for finding biochemical pathway enzymes and genes in non-model organisms. A new bioinformatic approach using several recent technological improvements was applied to search for genes in the proposed galanthamine biosynthetic pathway, first targeting methyltransferases due to strong signature amino acid sequences in the proteins. Using Illumina sequencing, a de novo transcriptome assembly was constructed for daffodil. BLAST was used to identify sequences that contain signatures for plant O-methyltransferases in this transcriptome. The program HAYSTACK was then used to identify methyltransferases that fit a model for galanthamine biosynthesis in leaf, bulb and inflorescence tissues. One candidate gene for the methylation of norbelladine to 4′-O-methylnorbelladine in the proposed galanthamine biosynthetic pathway was identified. This methyltransferase cDNA was expressed in E. coli and the protein purified by affinity chromatography. The resulting protein was found to be a norbelladine 4′-O-methyltransferase (NpN4OMT) of the proposed galanthamine biosynthetic pathway.
Introduction
Amaryllidaceae alkaloids are a group of alkaloids with many documented biological activities. This makes them valuable potential medicines several examples are the anti-cancer compounds hemanthamine and lycorine and the anti-viral compound pancratistatin [1]–[3]. One example of an Amaryllidaceae alkaloid already used medically to treat Alzheimer’s disease is galanthamine. Galanthamine is an alkaloid discovered in 1953 that is produced by members of the Amaryllidaceae family [4]. It reduces the symptoms of Alzheimer’s disease through acetylcholine esterase inhibition and nicotinic receptor binding. These activities are thought to compensate for reduced acetylcholine sensitivity in Alzheimer’s disease by increasing acetylcholine levels and perhaps increasing acetylcholine sensitivity [5], [6]. Until now, no committed biosynthetic genes have been identified [7], [8]. Limited enzyme kinetic characterization has been done on plant protein extracts enriched for the norbelladine 4′-O-methyltransferase (N4OMT) of Nerine bowdenii, but the underlying gene was never identified [9].
The putative galanthamine biosynthetic pathway has been studied in detail and intermediates in the pathway have been determined. This knowledge is based on radiolabeling experiments. Work on other Amaryllidaceae alkaloids including lycorine and hemanthamine studying steps prior to 4′-O-methylnorbelladine can be applied to galanthamine biosynthesis because 4′-O-methylnorbelladine is a universal substrate for these alkaloids [10]. The proposed pathway starts with the amino acids phenylalanine and tyrosine [11]. In Narcissus incomparabilis phenylalanine was established as a precursor that contributes the catechol portion of norbelladine. This was done using radiolabeling experiments to trace incorporation of [3-14C]phenylalanine into lycorine and degradation experiments on the resulting lycorine to determine the location of the 14C label [12]. Similar experiments with phenylalanine were performed in Nerine browdenii monitoring hemanthamine incorporation [13]. As a follow up radiolabeling experiments were used to determine that phenylalanine probably proceeds sequentially through the intermediates trans-cinnamic acid, p-hydroxycinnamic acid and 3,4-dihydroxycinnamic acid or p-hydroxybenzaldehyde before conversion into 3,4-dihydroxybenzaldehyde [14]. Tyrosine has been established as a precursor of galanthamine that in contrast to phenylalanine contributes only to the non-catechol half of the norbelladine intermediate. This was done by observing [2-14C]tyrosine incorporation into galanthamine and degradation experiments of galanthamine [11]. Tyrosine decarboxylase converts tyrosine into tyramine and is well characterized in other plant families [15]. 3,4-Dihydroxybenzaldehyde and tyramine condense into a Schiff-base and are reduced to form the first alkaloid in the proposed pathway, norbelladine. Norbelladine has been documented to incorporate into galanthamine and all major Amaryllidaceae alkaloid types in 14C radiolabeling studies [11], [16]–[18]. 4′-O-methylnorbelladine is then formed by O-methylation of norbelladine [11]. A phenol-coupling reaction, followed by spontaneous oxide bridge formation, creates N-demethylnarwedine, which is then reduced and N-methylated to yield galanthamine (Figure 1) [10]. In one study, Barton et al. fed O-methyl[1-14C]norbelladine to flower stalks of King Alfred daffodils, but it was not incorporated into galanthamine. The authors concluded that the intermediate in the pathway must be 4′-O-methyl-N-methylnorbelladine despite low incorporation of this compound when the equivalent experiment was conducted with 4′-O-methyl-[N-methyl-14C]norbelladine [11]. A recent revision of the proposed pathway by Eichhorn et al. contradicted this conclusion and placed the N-methylation step at the end of the proposed pathway instead of before the phenol-coupling reaction. In that study, [OC3H3]4′-O-methylnorbelladine was applied to ovary walls of Leucojum aestivum. Incorporation into products indicated that the pathway produced N-demethylated intermediates up until the penultimate step to galanthamine. N-methylation was proposed as the final step of biosynthesis [10].
The discovery of genes involved in metabolism is essential to metabolic engineering and synthetic biology. The elucidation of plant biochemical pathways can take decades. In fact, the biosynthesis of morphine, an important opiate analgesic, is still not completely elucidated at the gene level, even though the first enzyme specific to morphine biosynthesis was discovered more than 20 years ago in 1993 [19]. Reports on the enzymatic activities of poppy extracts to describe the morphine biosynthetic pathway go even farther back to 1971 [20]. After more than 40 years of enzymology and reverse genetics, the morphine biosynthetic pathway is still incomplete at the gene level. Traditionally, plant biochemical pathway enzymes have been identified either directly by purification from plant extracts or indirectly by examining enriched cDNA libraries and functionally expressing clones [9], [21]–[27]. To reduce pathway discovery from a 20+ year process to a more reasonable time frame, new methods must be developed and embraced. The previous work on galanthamine biosynthesis makes the prediction of enzyme classes involved in the proposed pathway possible, thereby rendering the galanthamine pathway a suitable system for development of an omic methodology for biochemical pathway discovery.
In this study, using galanthamine biosynthesis as proof-of-concept, a novel workflow is presented to streamline the identification of biosynthetic pathway genes. A de novo transcriptome is created for Narcissus sp. aff. pseudonarcissus using illumina sequencing. HAYSTACK, a program that utilizes the Pearson correlation, is used to find genes that co-express with galanthamine accumulation in this transcriptome. This set of candidates is interrogated for homologs to methyltransferases. An OMT that converts norbelladine to 4′-O-methylnorbelladine (NpN4OMT) in the proposed biosynthesis of galanthamine is identified in this manner and characterized.
Materials and Methods
Plant tissue and chemicals
Daffodil plants were collected from an outdoor plot in St. Louis, MO, with the GPS coordinates (38.659305, −90.410203), during peak flowering and separated into leaf, bulb and inflorescence tissues. Inflorescence is considered all tissues above the spathe. The plants were collected with the permission of the corresponding author who is the owner of the private property. No endangered species were involved in this collection.
Formic acid, potassium phosphate monobasic, potassium phosphate dibasic, tris(hydroxymethyl)aminomethane, glycerol, sodium acetate, sodium chloride, tetramethylethylenediamine, calcium chloride, magnesium chloride and β-mercaptoethanol were obtained from Acros Organics. Glycine, papaverine hydrochloride, S-adenosyl methionine (AdoMet), cobalt chloride, zinc chloride and manganese chloride were obtained from Fisher Scientific. Other chemicals include acetonitrile, JT Baker; InstaPAGE, IBI Scientific; ethanol 200 proof, KOPTEC; Bradford reagent, Bio-Rad; S-adenosyl-L-homocysteine, Sigma-Aldrich; deoxynucleotide triphosphates (dNTPs), New England BioLabs, Inc. (NEB); and isopropyl β-D-1-thiogalactopyranoside (IPTG), Gold Biotechnology. The norbelladine N-methylnorbelladine, 4′-O-methyl-N-methylnorbelladine and 4′-O-methylnorbelladine were synthesized previously [10]. NotI, NdeI, T4 DNA ligase, Taq DNA Polymerase and Phusion High-Fidelity DNA Polymerase enzymes were from NEB. M-MLV reverse transcriptase and RNaseOUT were obtained from Invitrogen.
Alkaloid extraction and quantification
Daffodil leaf, bulb and inflorescence tissues were extracted by grinding tissue with a mortar and pestle cooled with liquid nitrogen. Each ground sample was split into three technical replicates. Two volumes of 70% ethanol were added followed by vortexing 5 min and centrifuging at 14,000×g for 10 min. The supernatant was filtered through a 0.2 µm low protein binding hydrophilic LCR (PTFE, millex-LG) membrane. For galanthamine quantitation, samples were diluted 1000 fold. Liquid chromatography samples were injected (10 µl) onto an LC-20AD (Shimadzu) with a Waters Nova Pak C-18 (300×3.9 mm 4 µm) column coupled to a 4000 QTRAP (AB Sciex Instruments) for MS/MS analysis. The gradient program had a flow rate of 0.8 ml/min; solvent A was 0.1% formic acid in H2O and solvent B was 0.1% formic acid in acetonitrile. At the beginning of the program, solvent B was held at 15% for 2 min, followed by a linear gradient to 43% B at 15 min, 90% B at 15.1 min, 90% B at 20 min, 15% B at 21 min and 15% B at 26 min. A Turbo Ion Spray ionization source temperature of 500°C was used with low resolution for Q1 and Q3. All multiple reaction monitoring (MRM) scans were performed in positive ion mode. The ion fragment used for quantitation of galanthamine was 288.00 [M+H]+/213.00 [M-OH-C3H7N]+• m/z. Galanthamine was identified by comparison of retention time and fragmentation pattern to authentic galanthamine standard. The Analyst 1.5 software was used to quantitate galanthamine using a comparison of peak area of the unknown to authentic galanthamine.
Illumina sequencing and transcriptome assembly
The transcriptome was generated via data cleaning, short read assembly, final assembly, and post processing steps. A modified TRIzol RNA isolation method found as protocol number 13 in Johnson et al. was used to obtain RNA for cDNA library preparation [28]. Illumina RNA-Seq was used to generate 100 base pair paired-end reads from the cDNA library. The resulting data was monitored for overrepresented reads. Having found no such reads, adaptor sequences and sections of the standard phi X genome were identified and removed. Reads were then trimmed for quality using the FASTX toolkit version 0.0.8 with a Q value cutoff of 10 as is default for PHRAP [29].
Reads were assembled in the following manner. ABySS was used to run multiple assemblies of the reads with a range of kmers 24≤k≤54. The resulting assemblies were assembled into scaffolds using ABySS scaffolder [30]. Gaps in the sequences were resolved using GapCloser from the SOAPdenovo suit [31]. A final assembly was conducted on the resulting synthetic ESTs using Mira in EST assembly mode [32]. All sequences with over 98% identity were considered redundant and removed using CD-Hit [33]. The resulting contigs >100 base pairs long were included in the final assembly. Protein products for these contigs were predicted using ESTScan [34], [35]; all peptides over 30 amino acids were reported. Burrows-Wheeler Aligner was used to align the original reads to the assembled transcriptome to generate relative expression data for the contigs in leaf, bulb and inflorescence tissues [36]. The daffodil assembly and the raw read data can be found at the MedPlant RNA Seq Database, http://www.medplantrnaseq.org. Anomalies in the number of reads per contig and abnormally long or short contigs were manually curated. To normalize for read depth, each expression value for each contig was divided by the total reads for the respective tissue and multiplied by 1 million.
Candidate gene identification
Relative expression data was compared to the levels of galanthamine in daffodil tissues using HAYSTACK with a background cutoff of 1, correlation cutoff 0.8, fold cutoff 4 and p-value 0.05 [37]. Using BLASTP, a list of known methyltransferases were queried against the daffodil transcriptome peptide list with an E-value of e−9 to identify methyltransferase homologs [38]. Accession numbers from NCBI for these methyltransferases are presented in Table S1. Overlap between the methyltransferase homologs and contigs that pass the HAYSTACK criteria were considered candidate genes. The candidate daffodil norbelladine 4′-OMT has the designation medp_9narc_20101112|62361 in the contigs.fa file in the Narcissus_spp.tar file on http://www.medplantrnaseq.org.
Phylogenetic tree
Sequences found in Table S2 were aligned using MUSCLE in the MEGA 5.2 software with default parameters [39]. For the phylogeny, this alignment was provided as input into the Maximum-Likelihood algorithm also found in MEGA 5.2. Default parameters were used except the Gaps/Missing Data treatment was set to partial deletion.
PCR and Cloning
The 5′ and 3′ ends of the NpN4OMT sequence were completed using Rapid Amplification of cDNA Ends (RACE) with the Invitrogen RACE kit. For gene specific primers (GSP) see Table S3. The same PCR program was used for both 5′ and 3′RACE. This applies to both cycles of nested PCR as well. The PCR program parameters were 30 s 98°C 1 cycle; 10 s 98°C, 30 s 60°C, 1 min 72°C 30 cycles; 5 min 72°C 1 cycle. The outer-primer PCR was a mixture of 4.6 ng/µl RACE ready bulb cDNA, 0.3 mM dNTPs, 0.3 µM GSP primer, 0.9 µM kit provided RACE primer, 1 U NEB Phusion High-Fidelity DNA Polymerase and Invitrogen recommended quantity of buffer in a 50 µl reaction. The inner-primer PCR used the product of the outer-primer PCR as template with 0.2 µM of the inner RACE GSP and Invitrogen primers and 0.2 mM dNTPs.
Amplification of the NpN4OMT open reading frame was performed with 5.1 ng/µl daffodil bulb oligo(dT) primed cDNA, 0.4 mM dNTPs, 0.4 µM each forward and reverse outer-primer, 1 UNEB Phusion High-Fidelity DNA Polymerase and recommended buffer in a 50 µl reaction. With the following PCR program parameters: 30 s 98°C 1 cycle; 10 s 98°C, 30 s 52°C, 1 min 72°C for 30 cycles; 5 min 72°C 1 cycle. The inner-primer PCR used 1 µl of the outer-primer PCR product and used the inner-primers in Table S3. The same PCR time program was used except the annealing temperature was increased to 53°C.
NpN4OMT was cloned into the pET28a vector with the NotI and NdeI restriction sites that were added to the 5′ and 3′ ends of the open reading frame using the inner PCR primers. PCR product and pET28a were digested with NotI and NdeI enzymes, followed by gel purification and ligation with the T4 DNA ligase. The resulting construct was transformed into E. coli DH5α cells and screened on Luria-Bertani agar plates with 50 µg/ml kanamycin. Resulting colonies were screened by colony PCR with T7 sequencing and T7 terminator primers and Taq DNA Polymerase. The following cycle program was used: 3 min 94°C 1 cycle; 30 s 94°C, 30 s 52°C, 2 min 72°C 30 cycles; 7 min 72°C 1 cycle. Plasmid minipreps were obtained using the QIAGEN QIAprep Spin Miniprep Kit. After Sanger sequencing of constructs (Genewiz), the desired plasmids were transformed into E. coli BL21(DE3) Codon Plus RIL competent cells. The sequences of the resulting five variants have the following accession numbers KJ584561(NpN4OMT1), KJ584562(NpN4OMT2), KJ584563(NpN4OMT3), KJ584564(NpN4OMT4) and KJ584565(NpN4OMT5).
Protein purification
Recombinant protein production in 1 L of E. coli and purification with TALON resin followed the protocol found in [40]. No proteases were added to the protein extract, and desalting was performed with PD-10 columns from GE Healthcare. Protein quantity was determined according to Bradford; purity was monitored by SDS-PAGE. The E. coli cell line containing the hexahistidine-tagged methylthioadenosine/S-adenosylhomocysteine nucleosidase (Pfs) construct from Choi-Rhee and Cronan’s work was used to purify Pfs protein [41].
Screening enzyme assays
Enzyme assays for initial testing of NpN4OMT1 contained 10 µg of pure protein with 200 µM AdoMet, 100 µM norbelladine and 30 mM potassium phosphate buffer pH 8.0 in 100 µl. The assays were incubated for 2 hr at 30°C. The vector control was an E. coli extract purified with TALON in the same way as the methyltransferase protein. For the vector control assay, an equal volume of the pure vector control extract was substituted for the NpN4OMT1 protein in the enzyme assay. These assays were quenched by adjusting the pH to 9.5 with two volumes of sodium bicarbonate and extracted with two volumes ethyl acetate two times. After drying, the extracts were re-suspended in the initial mobile phase of the HPLC program. The HPLC separation of the assays was performed using a phenomenex Luna C8(2) 5 µm 250×4.6 mm column with solvent A (0.1% formic acid in H2O) and solvent B (acetonitrile). The program started with 10% solvent B and a flow rate of 0.8 ml/min, a linear gradient began at 2 min to 30% at 15 min, 90% at 15.1 min, 90% at 20 min, 10% at 21 min and 10% at 28 min. Injection volume was 20 µl using a Waters Autosampler. Waters UV detector was set to 277 nm.
Kinetic characterization
After optimization of the assay, the buffer was changed to 100 µM glycine at pH 8.8, with 5 mM of MgCl2 added and the temperature was increased to 37°C in 100 µl total reaction volume. When performing kinetic assays, the E. coli enzyme Pfs was added to break down SAH and prevent product inhibition. Product formation in kinetic experiments was quantified by comparing product peak area to a standard curve of the expected product or equivalent. Papaverine was used as an internal standard.
With the same solvent system as for screening enzyme assays, the HPLC program started with 20% B and a flow rate of 0.8 ml/min, a linear gradient began at 2 min to 25.4% B at 7 min, 90% at 7.2 min, 90% at 9 min, 20% at 9.1 min and 20% at 14 min. A 4000 QTRAP mass spectrometer coupled to the same LC column and time program as used in HPLC was used to collect all compound mass and fragmentation data. For fragmentation data and program setting details see Table S4. For norbelladine kinetics an MRM program in positive ion mode was used to monitor the following fragments 260.00 [M+H]+/138.00 [M-C8H9O]+• m/z, 260.00 [M+H]+/121.00 [M-C7H8NO2]+• m/z, 274.00 [M+H]+/137.00 [M+H-C8H9O2]+ m/z, 274.00 [M+H]+/122.00 [M+H-C8H10NO2]+ m/z. The fragments with 260.00 [M+H]+ m/z and 274.00 [M+H]+ m/z molecular ions were replaced when looking at N-methylnorbelladine for 274.00 [M+H]+/152.10 [M-C8H9O]+• m/z, 274.00 [M+H]+/121.00 [M-C9H12NO2]+• m/z, 288.00 [M+H]+/150.10 [M-C8H9O2]+• m/z and 288.20 [M+H]+/137.00 [M-C9H12NO]+• m/z. Papaverine internal standard was monitored with the following fragments 340.40[M+H]+/324.20 [M-CH3]+• m/z and 340.40 [M+H]+/202.10 [M-C8H9O2]+• m/z. When conducting dopamine kinetics, galanthamine was used as the internal standard and samples were not ethyl acetate extracted prior to LC/MS/MS analysis. To remove protein, two volumes of acetonitrile were added followed by 1 hr at −20°C and 10 min centrifugation at 16,100×g, 4°C. The supernatant was dried under vacuum and re-suspended in the starting mobile phase before analysis. The HPLC time program was changed to start at 5% solvent B with solution going to waste until 3.9 min, at 5 min start linear gradient to 25% B at 25 min, 90% B at 9.5 min, 90% B at 11 min, 5% B at 11.1 min and 5% B at 16 min. Ions monitored in the MRM were 168.00 [M+H]+/151.00 [M+H-OH]+ m/z and 168.00 [M+H]+/119.00 [M-OH-OCH3]+• m/z. AdoMet steady state kinetic parameters were determined with norbelladine as the saturated substrate. Product was quantitated using HPLC with the 28 min program used for screening enzyme assays. Product for assays on the additional NpN4OMT variants was detected with this same 28 min program on HPLC.
When conducting kinetic experiments the Km was at least five fold higher than the minimum concentration of substrate and fivefold lower than the maximum concentration of substrate tested. Km and kcat were calculated by nonlinear regression to the Michaelis-Menten kinetics equation with the GraphPad PRISM 5.0 software.
NMR
NMR spectra were acquired for 4′-O-methylnorbelladine in CD3OD at 600 MHz on a BrukerAvance 600 MHz spectrometer equipped with a BrukerBioSpin TCI 1.7 mm MicroCryoProbe. Proton, gCOSY, ROESY, gHSQC, and gHMBC spectra were acquired; 13C chemical shifts were obtained from the HSQC and HMBC spectra. Chemical shifts are reported with respect to the residual non-deuterated MeOD signal (δH 3.31) (Figure S1, S2, S3, S4 and S5).
Quantitative Real Time-PCR (qRT-PCR)
cDNA for leaf, bulb and inflorescence tissues of daffodil were created using 1 µg RNA from the respective tissues, random primers and M-MLV reverse transcriptase according to the Invitrogen protocol. qRT-PCR was conducted with a TaqMan designed gene expression assay for the methyltransferase with ribosomal RNA as a reference according to manufacture protocol. Reactions (5 µl) were performed in quadruplicate with outlier exclusion using Applied Biosystems StepOnePlus Real-Time PCR system. Methyltransferase relative expression values were determined by calculating ΔΔCT values relative to standard ribosomal RNA and leaf tissue [42].
Results
The Illumina sequencing of daffodil leaf, bulb and inflorescence tissues resulted in 65 million paired-end reads that were used to make the daffodil transcriptome assembly. The transcriptome assembly consisted of 106,450 sequences (Figure 2A) with a mean length of 551 base pairs and a maximum length of 13,381 base pairs. A similar number of >100 base pair sequences were found in the transcriptome of Chlorophytum borivilianum [43]. This mean length indicates a high number of the sequences are long enough for homology searches and cloning work. Of these sequences, 79,980 were predicted to have open reading frames and were translated into peptides. After determining the reads coming from the three tissues, several homologs of genes with predictable expression patterns were used to evaluate the quality of the expression estimations. The RuBisCO large and small subunits have high amounts of expression in the photosynthetic leaf and inflorescence tissues compared to the non-photosynthetic bulb tissue. A homolog to the MADS62 floral development transcription factor is exclusively expressed in the inflorescence tissue as would be expected [44]. The read counts were thus determined to produce expected expression patterns.
The LC/MS/MS data for leaf, bulb, and inflorescence tissues resulted in a pronounced accumulation pattern of galanthamine. The largest concentration was found in bulb tissue, with a lower level found in leaf and the lowest level in inflorescence (Figure 2B).
Using BLAST to seek homologs to the methyltransferases found in Table S1 yielded 298 methyltransferase candidate genes [21]. Separately, HAYSTACK identified 9,505 contigs that co-express with galanthamine accumulation. A comparison of the two resulting lists revealed one methyltransferase NpN4OMT that fits the HAYSTACK model (Figure 2A). This methyltransferase was chosen for functional analysis. After RACE, NpN4OMT was found to be a 239 amino acid protein with a predicted molecular weight (MW) of 27 kDa. When expressed using the pET28a vector, the added N-terminal Histidine tag increased the MW to 29 kDa (Figure 3A). In the course of cloning, five unique clones were obtained with >96% identity to each other. Due to the two-toned yellow flower color, single flower and size, the daffodil variety used in this study is likely Carlton. Based on genome size estimates, Carlton is thought to be a domesticated form of Narcissus pseudonarcissus with a genome duplication event that resulted in a tetraploid [45]. A high number of paralogs is, therefore, expected. In addition, these bulbs have been propagated vegetatively. For these reasons, the existence of multiple similar sequences is not surprising.
Due to the high similarity of the clones, the first to be cloned was selected for thorough characterization. The clone selected for characterization is 92.5% identical on the amino acid level to the original sequence in the transcriptome assembly (Figure S6). The recombinant protein was purified with a yield of 16.7 mg protein/L E. coli culture. SDS-PAGE analysis revealed the protein to be of apparent homogeneity (Figure 3A). Initial enzyme assays with NpN4OMT1 yielded, upon HPLC analysis, a peak with the retention time of 4′-O-methylnorbelladine. The vector only control lacks NpN4OMT1 but has all other assay components. Therefor the absence of product in the vector control assay excludes the possibility of a background reaction. The absence of product in the assay lacking AdoMet shows that the methyltransferase uses AdoMet as a co-substrate and cannot form product without AdoMet (Figure 3B). The pH optimum was found to be 8.8 and the temperature optimum 45°C (Figure S7B–C). The Pfs protein, shown purified in Figure 3A, was added to prevent SAH inhibition in kinetic enzyme assays, through the Pfs catalyzed hydrolysis of SAH to adenine and S-ribosyl-homocysteine [41].
An alternative methylation product, 3′-O-methylnorbelladine, has the same retention time on HPLC, the same UV profile and MS/MS fragmentation pattern as 4′-O-methylnorbelladine. Thus, NMR analysis was performed to determine the regiospecificity of O-methylation. HMBC correlations from both the methoxyl protons (δH 3.88) and H-6′ (δH 6.90) to the same carbon (δC 149.9) placed the methoxyl group at C-4′. Its location was further supported by a ROESY correlation from the methoxyl protons to H-5′ (δH 6.98). The NMR data thus confirmed that 4′-O-methylnorbelladine is the product of the enzyme reaction (Figure 3C).
To determine the substrate specificity of this methyltransferase, several similar substrates were tested. Activity comparable to that found with norbelladine was observed using N-methylnorbelladine as the substrate. Dopamine also served as a substrate. Under the assay conditions used, product was not detected with caffeic acid, vanillin, 3,4-dihydroxybenzaldehyde, and tyramine as substrates (Table 1). To determine if the other 4 variants show similar activity, they were purified, and enzymatic activity was confirmed for all variants using norbelladine as the substrate. When monitoring NpN4OMT1 norbelladine assays allowed to proceed to completion, no sign of double methylation products were observed as expected.
Table 1. Substrate specificity of NpN4OMT1.
Substrate | K m (µM) | k cat (1/min) | k cat/K m (1/µM*min) |
norbelladine | 1.6±0.3 | 1.3±0.06 | 0.8 |
AdoMet | 28.5±1.6 | 4.5±0.01 | 0.16 |
N-methylnorbelladine | 1.9±0.4 | 2.6±0.15 | 1.3 |
dopamine | 7.3±2.7 | 3.6±0.15 | 0.5 |
caffeic acid | ND | ND | ND |
vanillin | ND | ND | ND |
3,4-dihydroxybenzaldehyde | ND | ND | ND |
tyramine | ND | ND | ND |
ND, Not detected.
±, Standard error.
Phylogenetic analysis of the NpN4OMT1 placed it in the class I OMT group (Figure 4). NpN4OMT1 has a length consistent with the 231–248 amino acid range found in class I OMTs. This is in contrast to other known plant catechol 4-OMTs which all group in the class II OMTs as their length and cofactor requirements reported in previous work would predict. All these methyltransferases are significantly longer than the standard class I OMTs and none are reported to have the characteristic divalent cation dependence of class I OMTs [46]–[50]. When testing NpN4OMT1 for cation dependence, enzymatic activity improved upon the addition of cobalt. Enzymatic activity increased four-fold more with the addition of magnesium instead of cobalt (Figure S7A). This preference for magnesium over other divalent cations is also to be expected from a class I OMT [50]. It is, furthermore, consistent with previous work on enzyme extracts enriched for this OMT [9].
To validate the expression profiles predicted based on read counts; qRT-PCR was conducted with the same RNA preparation used to prepare the cDNA libraries for Illumina sequencing. The resulting expression profile is slightly different from that obtained from Illumina sequencing. The qRT-PCR expression profile has a higher quantity of inflorescence transcript relative to bulb transcript (Figure 2C). This minor difference is potentially due to cross amplification, during qRT-PCR, with other close homologs in the plant.
Discussion
The expression pattern, product formation and low K m for norbelladine all indicate NpN4OMT methylates norbelladine in the proposed galanthamine biosynthetic pathway. Two differing orders of methylation have been proposed for galanthamine biosynthesis [10]. The methylation of N-methylnorbelladine was tested to determine if a preference for the N-methylation state could be observed at O-methylation. Similar Km and kcat values for N-methylnorbelladine and norbelladine indicate that a preference for the N-methylation state does not occur at O-methylation. The results presented here support both proposed galanthamine biosynthetic pathways. Future work on additional enzymes in the pathway will be needed to enzymatically validate one pathway or the other. The lack of enzymatic activity when testing 3,4-dihydroxybenzaldehyde suggests that methylation does not occur prior to formation of norbelladine. The methylation of dopamine is expected considering structural similarity to the methylated moiety of norbelladine. Tyramine was not methylated; this is as expected for a class I OMT (Table 1).
Several aspects of the candidate gene selection approach proved important for this successful identification. One is the selection of a variety of methyltransferases for the homology search. If only the known 4-OMTs had been used in the homology search, the gene would have been missed due to the large difference in sequence between known 4-OMTs and NpN4OMT. It has been shown that catechol 4′-OMT and catechol 3′-OMT can differ by as little as one amino acid [51]. Because of this potential for a conversion form catechol 3′-O-methylation to 4′-O-methylation though evolution, OMTs of both positions were used in the homology search. Also, both class I and class II OMTs were used in the search because both classes are known to methylate catechols. Considering the multiple branches of the N-methyltransferases in the OMT phylogeny, it is worth investigating enzymes that annotate as N-methyltransferases [21]. For these reasons, the sequences used in the initial BLAST search consisted of representatives of known O- and N-methyltransferases of small metabolites. The NpN4OMT turned out to be a member of the class I OMTs. Class I OMTs show closer homology to human catechol OMT than to all known plant catechol 4-OMTs in class II, as demonstrated in Figure 4 [52]. The closest known catechol 4-OMT to NpN4OMT is bacterial, has 34% identity to NpN4OMT and is a class I OMT from Cyanobacterium Synechocystis sp. Strain PCC 6803 (SynOMT) [53]. Many 3-OMTs show even higher homology to NpN4OMT than SynOMT. It is probable that the 4-OMT activity of NpN4OMT was acquired independently of SynOMT (Figure 4).
The second selection criterion, co-expression with galanthamine accumulation, was also of great value. It reduced the number of candidate OMTs from hundreds to one. There are a variety of methods for the prioritization of candidate genes [54], [55]. Many of these methods are oriented towards species and systems for which there are extensive databases or prior knowledge regarding a gene involved in the pathway or process. In one study, a collection of ∼500 microarrays was used to demonstrate the co-expression of genes in the same pathway in Arabidopsis [56]. However, extensive gene expression data are typically not available for non-model systems. There have been several studies that use co-expression analysis to find genes in a pathway and produce promising candidate gene lists. These studies often lack biochemical validation of the in silico candidates [57]. If a novel function is proposed, this type of analysis is incomplete without biochemical validation of enzyme activity. Enzymes that are homologous to functionally equivalent enzymes in a different species can be corroborated by co-expression analysis [57]. There are several studies that use a simple differential expression model and microarrays to find biosynthetic genes by comparing biosynthetically active and inactive accessions in rose and strawberry [58], [59]. Differential expression analysis lacks algorithms to use data with differing levels of metabolism occurring in more than two samples. The Pearson correlation can compare data from multiple samples. Mercke et al. have used a Pearson correlation-based method to identify gene expression with microarrays that correlate with levels of specific terpenes in cucumber [60]. Illumina-based transcriptomes are, however, more sensitive to minor variants in the sequences and to splice variants. Illumina-based gene expression data also have a far greater dynamic range, limited only by sequence depth, than microarrays [61]. Subtleties in the sequences that could be missed with microarrays can now be detected with Illumina sequencing.
The use of HAYSTACK as a platform to use the Pearson correlation is ideal because it is designed to receive a hypothesis for gene expression and look for gene expression that correlates with that hypothesis. This is in contrast to an approach in which gene expression patterns are clustered based on similarity to each other. The search for a defined pattern in the data allows the number of required expression data points to be reduced compared to an approach that needs to define clusters of gene expression patterns based on similarity. In HAYSTACK, the shared expression pattern is already defined. HAYSTACK applies additional screening criteria including a p-value test for significance, a fold cutoff and background cutoff. The approach chosen in our study used knowledge of known chemical intermediates, a transcriptome with expression profiles for three tissues, and metabolite levels to identify a candidate gene product to validate with in vitro enzyme activity. Little prior knowledge of a pathway is required to use this approach, making this workflow ideal for the identification of genes in unknown biochemical pathways.
There are several modifications to this approach that could be used to improve its power. It could be applied to more tissues, environmental conditions or time points to provide even greater statistical power to correlate co-expression of biosynthetic genes with the biosynthesis of their products. The method could also be modified to include analysis of multiple end products. If the pathway in which the enzyme participates branches, several end products could be equally important to co-expression analysis. This combined consideration of multiple end products could lead to more informative models [62]. Another potential source of information on the metabolite level could be the concentrations of intermediates made during synthesis. Correlations between biosynthetic gene expression, and perhaps the accumulation of metabolites as well, tend to decrease as distance in a pathway increases [56]. Experiments that quantitate metabolic intermediates could be useful for finding biosynthetic genes if the flux through the pathway is not so high that intermediates do not accumulate. The latter would be the case in a pathway assembled into a metabolon.
The discovery of this enzyme enables the future elucidation of other enzymes in the proposed galanthamine biosynthetic pathway and other novel pathways. Genes that co-express with NpN4OMT can be identified and used as candidate genes for other steps in the proposed galanthamine biosynthetic pathway. This will potentially be useful for earlier steps in the pathway, considering the tendency of expression correlations to decrease as distance in metabolic pathways increase [56]. This enzyme discovery also validates the utility of this workflow to characterize metabolic pathways and provides a valuable method for pathway discovery in orphan species.
Supporting Information
Acknowledgments
We thank Judy Coyle for figure preparation, Robin S. Kramer (National Center for Genome Resources) for contributing to the transcriptome assembly, Bradley S. Evans (Donald Danforth Plant Science Center) for assistance in mass spectral interpretation and John E. Cronan (Department of Microbiology, University of Illinois) for the E. coli culture with the pfs construct.
Data Availability
The authors confirm that all data underlying the findings are fully available without restriction. All data is currently available in the paper, the paper's Supporting Information files, http://www.medplantrnaseq.org or being held by NCBI until publication (accession numbers KJ584561, KJ584562, KJ584563, KJ584564 and KJ584565).
Funding Statement
The results presented herein were supported by the National Institutes of Health award number 1RC2GM092561 (NIGMS) and by the National Science Foundation Major Research Instrumentation award number DBI-0521250. The web sites for these funding sources are http://www.nih.gov/ and http://www.nsf.gov/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Gabrielsen B, Monath TP, Huggins JW, Kefauver DF, Pettit GR, et al. (1992) Antiviral (RNA) activity of selected Amaryllidaceae isoquinoline constituents and synthesis of related substances. J Nat Prod 55: 1569–1581. [DOI] [PubMed] [Google Scholar]
- 2. Havelek R, Seifrtova M, Kralovec K, Bruckova L, Cahlikova L, et al. (2014) The effect of Amaryllidaceae alkaloids haemanthamine and haemanthidine on cell cycle progression and apoptosis in p53-negative human leukemic Jurkat cells. Phytomedicine 21: 479–490. [DOI] [PubMed] [Google Scholar]
- 3. Liu J, Hu WX, He LF, Ye M, Li Y (2004) Effects of lycorine on HL-60 cells via arresting cell cycle and inducing apoptosis. FEBS Lett 578: 245–250. [DOI] [PubMed] [Google Scholar]
- 4. Uyeo S, Kobayashi S (1953) Lycoris alkaloids. XXIV. Isolation and characterization of lycoremine. Pharm Bull 1: 139–142. [DOI] [PubMed] [Google Scholar]
- 5. Barik J, Dajas-Bailador F, Wonnacott S (2005) Cellular responses to nicotinic receptor activation are decreased after prolonged exposure to galantamine in human neuroblastoma cells. Br J Pharmacol 145: 1084–1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Irwin RL, Smith HJ (1960) Cholinesterase inhibition by galanthamine and lycoramine. Biochem Pharmacol 3: 147–148. [DOI] [PubMed] [Google Scholar]
- 7. Wilcock GK, Lilienfeld S, Gaens E (2000) Efficacy and safety of galantamine in patients with mild to moderate Alzheimer’s disease: multicentre randomised controlled trial. Galantamine International-1 Study Group. BMJ 321: 1445–1449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Wilcock G, Howe I, Coles H, Lilienfeld S, Truyen L, et al. (2003) A long-term comparison of galantamine and donepezil in the treatment of Alzheimer’s disease. Drugs Aging 20: 777–789. [DOI] [PubMed] [Google Scholar]
- 9. Mann JD (1963) Alkaloids and plant metabolism. VI. O-Methylation in vitro of norbelladine, a precursor of Amaryllidaceae alkaloids. J Biol Chem 238: 3820–3823. [PubMed] [Google Scholar]
- 10. Ejchhorn J, Takada T, Kita Y, Zenk MH (1998) Biosynthesis of the amaryllidaceae alkaloid galanthamine. Phytochemistry 49: 1037–1047. [Google Scholar]
- 11.Barton DHR, Kirby GW, Thomas GM (1963) Phenol oxidation and biosynthesis. Part VI. The biogenesis of Amaryllidaceae alkaloids. J Chem Soc: 4545–4558.
- 12. Suhadolnik RJ, FIscher AG, Zulalian J (1962) The biogenic origin of the C6-C1 unit of lycorine. J Am Chem Soc 84: 4348–4349. [Google Scholar]
- 13. Wildman WC, Battersby AR, Breuer SW (1962) Biosynthesis in the Amaryllidaceae. Incorporation of 3-C14-Tyrosine and phenylalanine in Nerine browdenii W. Wats. J Am Chem Soc 84: 4599–4600. [Google Scholar]
- 14. Suhadolnik RJ, Fischer AG, Zulalian J (1963) Biogenesis of the Amaryllidaceae alkaloids. II. Studies with whole plants, floral primordia and cell free extracts. Biochem Biophys Res Commun 11: 208–212. [DOI] [PubMed] [Google Scholar]
- 15. Lehmann T, Pollmann S (2009) Gene expression and characterization of a stress-induced tyrosine decarboxylase from Arabidopsis thaliana . FEBS Lett 583: 1895–1900. [DOI] [PubMed] [Google Scholar]
- 16. Battersby AR, Fales HM, Wildman WC (1961) Biosynthesis in the Amaryllidaceae. Tyrosine and norbelladine as precursors of haemanthamine. J Am Chem Soc 83: 4098–4099. [Google Scholar]
- 17.Battersby AR, Bink R, Breuer SW (1961) Biosynthesis in the Amaryllidaceae: incorporation of norbelladine into lycorine and norpluvine. Proc Chem Soc July: 243.
- 18.Barton DHR, Kirby GW, Taylor JB, Thomas GM (1961) The biosynthesis of Amaryllidaceae alkaloids. Proc Chem Soc July: 254–255.
- 19. Gerardy R, Zenk MH (1993) Purification and characterization of salutaridine: NADPH 7-oxidoreductase from Papaver somniferum . Phytochemistry 34: 125–132. [Google Scholar]
- 20. Roberts MF (1971) Polyphenolases in the 1000 g fraction of Papaver somniferum latex. Phytochemistry 10: 3021–3027. [DOI] [PubMed] [Google Scholar]
- 21. Raman SB, Rathinasabapathi B (2003) beta-alanine N-methyltransferase of Limonium latifolium. cDNA cloning and functional expression of a novel N-methyltransferase implicated in the synthesis of the osmoprotectant beta-alanine betaine. Plant Physiol 132: 1642–1651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Takeshita N, Fujiwara H, Mimura H, Fitchen JH, Yamada Y, et al. (1995) Molecular cloning and characterization of S-adenosyl-L-methionine: scoulerine-9-O-methyltransferase from cultured cells of Coptis japonica . Plant Cell Physiol 36: 29–36. [PubMed] [Google Scholar]
- 23. Ounaroon A, Decker G, Schmidt J, Lottspeich F, Kutchan TM (2003) (R,S)-Reticuline 7-O-methyltransferase and (R,S)-norcoclaurine 6-O-methyltransferase of Papaver somniferum - cDNA cloning and characterization of methyl transfer enzymes of alkaloid biosynthesis in opium poppy. Plant J 36: 808–819. [DOI] [PubMed] [Google Scholar]
- 24. Nomura T, Kutchan TM (2010) Three new O-methyltransferases are sufficient for all O-methylation reactions of ipecac alkaloid biosynthesis in root culture of Psychotria ipecacuanha . J Biol Chem 285: 7722–7738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Morishige T, Dubouzet E, Choi KB, Yazaki K, Sato F (2002) Molecular cloning of columbamine O-methyltransferase from cultured Coptis japonica cells. Eur J Biochem 269: 5659–5667. [DOI] [PubMed] [Google Scholar]
- 26. Frick S, Kutchan TM (1999) Molecular cloning and functional expression of O-methyltransferases common to isoquinoline alkaloid and phenylpropanoid biosynthesis. Plant J 17: 329–339. [DOI] [PubMed] [Google Scholar]
- 27. Widiez T, Hartman TG, Dudai N, Yan Q, Lawton M, et al. (2011) Functional characterization of two new members of the caffeoyl CoA O-methyltransferase-like gene family from Vanilla planifolia reveals a new class of plastid-localized O-methyltransferases. Plant Mol Biol 76: 475–488. [DOI] [PubMed] [Google Scholar]
- 28. Johnson MT, Carpenter EJ, Tian Z, Bruskiewich R, Burris JN, et al. (2012) Evaluating methods for isolating total RNA and predicting the success of sequencing phylogenetically diverse plant transcriptomes. PLoS One 7: e50226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.FASTX-Toolkit website. Available: http://hannonlab.cshl.edu/fastx_toolkit. Accessed 2014 Jul 2.
- 30. Birol I, Jackman SD, Nielsen CB, Qian JQ, Varhol R, et al. (2009) De novo transcriptome assembly with ABySS. Bioinformatics 25: 2872–2877. [DOI] [PubMed] [Google Scholar]
- 31. Luo R, Liu B, Xie Y, Li Z, Huang W, et al. (2012) SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1: 18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Chevreux B, Pfisterer T, Drescher B, Driesel AJ, Müller WE, et al. (2004) Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs. Genome Res 14: 1147–1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22: 1658–1659. [DOI] [PubMed] [Google Scholar]
- 34.Iseli C, Jongeneel CV, Bucher P (1999) ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol: 138–148. [PubMed]
- 35. Lottaz C, Iseli C, Jongeneel CV, Bucher P (2003) Modeling sequencing errors by combining Hidden Markov models. Bioinformatics 19 Suppl 2 ii103–112. [DOI] [PubMed] [Google Scholar]
- 36. Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Mockler TC, Michael TP, Priest HD, Shen R, Sullivan CM, et al. (2007) The DIURNAL project: DIURNAL and circadian expression profiling, model-based pattern matching, and promoter analysis. Cold Spring Harb Symp Quant Biol 72: 353–363. [DOI] [PubMed] [Google Scholar]
- 38. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28: 2731–2739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Higashi Y, Smith TJ, Jez JM, Kutchan TM (2010) Crystallization and preliminary X-ray diffraction analysis of salutaridine reductase from the opium poppy Papaver somniferum . Acta Crystallogr Sect F Struct Biol Cryst Commun 66: 163–166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Choi-Rhee E, Cronan JE (2005) A nucleosidase required for in vivo function of the S-adenosyl-L-methionine radical enzyme, biotin synthase. Chem Biol 12: 589–593. [DOI] [PubMed] [Google Scholar]
- 42. Livak KJ, Schmittgen TD (2001) Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25: 402–408. [DOI] [PubMed] [Google Scholar]
- 43. Kalra S, Puniya BL, Kulshreshtha D, Kumar S, Kaur J, et al. (2013) De novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum . PLoS One 8: e83336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Kwantes M, Liebsch D, Verelst W (2012) How MIKC* MADS-box genes originated and evidence for their conserved function throughout the evolution of vascular plant gametophytes. Mol Biol Evol 29: 293–302. [DOI] [PubMed] [Google Scholar]
- 45. Zonneveld B (2010) The involvement of Narcissus hispanicus Gouan in the origin of Narcissus bujei and of cultivated trumpet daffodils (Amaryllidaceae). Anales del Jardín Botánico de Madrid 67: 29–39. [Google Scholar]
- 46. Schröder G, Wehinger E, Lukacin R, Wellmann F, Seefelder W, et al. (2004) Flavonoid methylation: a novel 4′-O-methyltransferase from Catharanthus roseus, and evidence that partially methylated flavanones are substrates of four different flavonoid dioxygenases. Phytochemistry 65: 1085–1094. [DOI] [PubMed] [Google Scholar]
- 47. Akashi T, Sawada Y, Shimada N, Sakurai N, Aoki T, et al. (2003) cDNA cloning and biochemical characterization of S-adenosyl-L-methionine: 2,7,4′-trihydroxyisoflavanone 4′-O-methyltransferase, a critical enzyme of the legume isoflavonoid phytoalexin pathway. Plant Cell Physiol 44: 103–112. [DOI] [PubMed] [Google Scholar]
- 48. Kim DH, Kim BG, Lee Y, Ryu JY, Lim Y, et al. (2005) Regiospecific methylation of naringenin to ponciretin by soybean O-methyltransferase expressed in Escherichia coli . J Biotechnol 119: 155–162. [DOI] [PubMed] [Google Scholar]
- 49. Gang DR, Lavid N, Zubieta C, Chen F, Beuerle T, et al. (2002) Characterization of phenylpropene O-methyltransferases from sweet basil: facile change of substrate specificity and convergent evolution within a plant O-methyltransferase family. Plant Cell 14: 505–519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Joshi CP, Chiang VL (1998) Conserved sequence motifs in plant S-adenosyl-L-methionine-dependent methyltransferases. Plant Mol Biol 37: 663–674. [DOI] [PubMed] [Google Scholar]
- 51. Wils CR, Brandt W, Manke K, Vogt T (2013) A single amino acid determines position specificity of an Arabidopsis thaliana CCoAOMT-like O-methyltransferase. FEBS Lett 587: 683–689. [DOI] [PubMed] [Google Scholar]
- 52. Ibdah M, Zhang XH, Schmidt J, Vogt T (2003) A novel Mg(2+)-dependent O-methyltransferase in the phenylpropanoid metabolism of Mesembryanthemum crystallinum . J Biol Chem 278: 43961–43972. [DOI] [PubMed] [Google Scholar]
- 53. Kopycki JG, Stubbs MT, Brandt W, Hagemann M, Porzel A, et al. (2008) Functional and structural characterization of a cation-dependent O-methyltransferase from the cyanobacterium Synechocystis sp. strain PCC 6803. J Biol Chem 283: 20888–20896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Moreau Y, Tranchevent LC (2012) Computational tools for prioritizing candidate genes: boosting disease gene discovery. Nat Rev Genet 13: 523–536. [DOI] [PubMed] [Google Scholar]
- 55. Saito K, Hirai MY, Yonekura-Sakakibara K (2008) Decoding genes with coexpression networks and metabolomics – ‘majority report by precogs’. Trends Plant Sci 13: 36–43. [DOI] [PubMed] [Google Scholar]
- 56. Wei H, Persson S, Mehta T, Srinivasasainagendra V, Chen L, et al. (2006) Transcriptional coordination of the metabolic network in Arabidopsis. Plant Physiol 142: 762–774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Sun Y, Luo H, Li Y, Sun C, Song J, et al. (2011) Pyrosequencing of the Camptotheca acuminata transcriptome reveals putative genes involved in camptothecin biosynthesis and transport. BMC Genomics 12: 533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Aharoni A, Keizer LC, Bouwmeester HJ, Sun Z, Alvarez-Huerta M, et al. (2000) Identification of the SAAT gene involved in strawberry flavor biogenesis by use of DNA microarrays. Plant Cell 12: 647–662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Guterman I, Shalit M, Menda N, Piestun D, Dafny-Yelin M, et al. (2002) Rose scent: genomics approach to discovering novel floral fragrance-related genes. Plant Cell 14: 2325–2338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Mercke P, Kappers IF, Verstappen FW, Vorst O, Dicke M, et al. (2004) Combined transcript and metabolite analysis reveals genes involved in spider mite induced volatile formation in cucumber plants. Plant Physiol 135: 2012–2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10: 57–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Yamazaki M, Mochida K, Asano T, Nakabayashi R, Chiba M, et al. (2013) Coupling deep transcriptome analysis with untargeted metabolic profiling in Ophiorrhiza pumila to further the understanding of the biosynthesis of the anti-cancer alkaloid camptothecin and anthraquinones. Plant Cell Physiol 54: 686–696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Day A, Neutelings G, Nolin F, Grec S, Habrant A, et al. (2009) Caffeoyl coenzyme A O-methyltransferase down-regulation is associated with modifications in lignin and cell-wall architecture in flax secondary xylem. Plant Physiol Biochem 47: 9–19. [DOI] [PubMed] [Google Scholar]
- 64. Eswaran N, Parameswaran S, Sathram B, Anantharaman B, Kumar G RK, et al. (2010) Yeast functional screen to identify genetic determinants capable of conferring abiotic stress tolerance in Jatropha curcas . BMC Biotechnol 10: 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Ibrahim RK, Bruneau A, Bantignies B (1998) Plant O-methyltransferases: molecular analysis, common signature and classification. Plant Mol Biol 36: 1–10. [DOI] [PubMed] [Google Scholar]
- 66. Hugueney P, Provenzano S, Verriès C, Ferrandino A, Meudec E, et al. (2009) A novel cation-dependent O-methyltransferase involved in anthocyanin methylation in grapevine. Plant Physiol 150: 2057–2070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Nestor A, Craig BS, Jack O, Ke Z (2008) Modulating lignin in plants. Google Patents.
- 68. Kopycki JG, Rauh D, Chumanevich AA, Neumann P, Vogt T, et al. (2008) Biochemical and structural analysis of substrate promiscuity in plant Mg2+-dependent O-methyltransferases. J Mol Biol 378: 154–164. [DOI] [PubMed] [Google Scholar]
- 69. Choi KB, Morishige T, Shitan N, Yazaki K, Sato F (2002) Molecular cloning and characterization of coclaurine N-methyltransferase from cultured cells of Coptis japonica . J Biol Chem 277: 830–835. [DOI] [PubMed] [Google Scholar]
- 70. Kato M, Mizuno K, Crozier A, Fujimura T, Ashihara H (2000) Caffeine synthase gene from tea leaves. Nature 406: 956–957. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The authors confirm that all data underlying the findings are fully available without restriction. All data is currently available in the paper, the paper's Supporting Information files, http://www.medplantrnaseq.org or being held by NCBI until publication (accession numbers KJ584561, KJ584562, KJ584563, KJ584564 and KJ584565).