Abstract
Carnivorous plants primarily use aspartic proteases during digestion of captured prey. In contrast, the major endopeptidases in the digestive fluid of the Venus flytrap (Dionaea muscipula) are cysteine proteases (dionain-1 to -4). Here, we present the crystal structure of mature dionain-1 in covalent complex with inhibitor E-64 at 1.5 Å resolution. The enzyme exhibits an overall protein fold reminiscent of other plant cysteine proteases. The inactive glycosylated pro-form undergoes autoprocessing and self-activation, optimally at the physiologically relevant pH value of 3.6, at which the protective effect of the pro-domain is lost. The mature enzyme was able to efficiently degrade a Drosophila fly protein extract at pH 4 showing high activity against the abundant Lys- and Arg-rich protein, myosin. The substrate specificity of dionain-1 was largely similar to that of papain with a preference for hydrophobic and aliphatic residues in subsite S2 and for positively charged residues in S1. A tentative structure of the pro-domain was obtained by homology modeling and suggested that a pro-peptide Lys residue intrudes into the S2 pocket, which is more spacious than in papain. This study provides the first analysis of a cysteine protease from the digestive fluid of a carnivorous plant and confirms the close relationship between carnivorous action and plant defense mechanisms.
Keywords: enzyme, enzyme structure, plant biochemistry, plant physiology, proteinase, cysteine proteases, Venus flytrap, digestion, papain, plant carnivory
Introduction
The unusual carnivorous adaptation of plants to nutrient-poor environments has attracted scientific attention since the time of Darwin. Small animals are captured and subsequently digested by the plant's use of passive or active trap structures. A variety of hunting strategies are observed in these plants, including the passive pitfall traps of tropical pitcher plants (Nepenthes), the adhesive flypaper traps of sundews (Drosera), and the active snap traps of Venus flytraps. Because plant carnivory is known to have evolved independently at least six times in the plant kingdom and is represented in >600 identified species, the mechanical components and enzymatic processes display variation from one carnivorous plant to another (1). The Venus flytrap (Dionaea muscipula) belongs to the Droseraceae family, and its natural habitat is the southeastern region of North America. This plant attracts animals using volatile organic compounds (2) and catches prey by rapid movement of specialized leaves in response to a mechanical stimulation of designated trigger hairs (3). The digestion process is initiated through the secretion of acidic digestive fluid (pH 4) from secretory glands and continues for ∼10 days, after which the trap reopens to complete the hunting cycle (4, 5). During the digestion phase, the protein content of the digestive fluid increases along with further acidification and higher proteolytic activity (4, 6).
The protein components of the Venus flytrap digestive fluid were recently determined using a combined proteomics and transcriptomics approach (5). The presence of chitinases, lipases, phosphatases, and peptidases revealed a concerted action targeted at insect prey, where chitinases degrade chitin in the cuticle (chitin-rich outer layer) of insects and spiders, thereby providing access to the internal parts of the prey for further enzymatic breakdown (7). The Venus flytrap takes up alanine, glycine, and peptides from the digestive fluid (8–10) along with the active assimilation of ammonia through designated transporters (11). The degradation of prey proteins is performed by cysteine endopeptidases (dionains) supported by the action of a serine carboxypeptidase (5). This is in contrast to the digestive fluids of Nepenthes, Cephalotus, and Drosera, which all rely heavily on aspartate proteases (12, 13).
Dionains display <70% sequence similarity to other cysteine proteases characterized to date. They are ascribed to the papain-like C1 family of the MEROPS database according to the overall homology and conservation of specific sequence motifs uniquely related to autocatalytic maturation from inactive zymogenic pro-forms to active proteases (14). These include the ERFNIN and GXNXFXD motifs in a predicted pro-region of ∼100–120 residues (15, 16). The pro-domain prevents adventitious proteolysis by blocking the active site Cys-His catalytic dyad (17, 18). Activation occurs at an acidic pH and proceeds through automaturation. The majority of known plant cysteine proteases, such as papain and caricain, are secreted into an acidic environment, where the active enzyme serves as a first line of defense against pathogens and herbivores (19–22). For dionains, pro-domain peptides have been identified in the acidic digestive fluid (5), suggesting that a similar postsecretory maturation process occurs.
In this study, we have performed the first functional and structural characterization of a digestive cysteine protease (dionain-1) from a carnivorous plant. We sequenced the full-length pre-pro-dionain-1 cDNA through targeted cloning based on de novo identified peptide sequences from the digestive fluid. The glycosylated inactive pro-form was produced by heterologous recombinant expression in yeast and found to undergo accelerated autolytic maturation at low pH to yield active dionain-1. The crystal structure of mature dionain-1 in complex with the general cysteine peptidase inhibitor E-64 was solved and refined with data to 1.5 Å resolution. The results revealed an overall conserved protein fold, similar to that of other plant cysteine proteases (papain, caricain, castor bean cysteine peptidase, etc.). In addition, substrate library profiling and structural insight into the specificity pockets of dionain-1 were used to discuss evolutionary similarities and carnivorous lifestyle adaptations.
Experimental Procedures
Materials
Venus flytrap plants were purchased from the Lammehave Nursery (Ringe, Denmark) and grown in a walk-in plant growth chamber under a 12-h/12-h light/dark cycle at 26 °C. All of the experiments were performed on healthy mature plants. EasySelectTM Pichia expression kit, Pichia EasyCompTM kit, and Zeocin were purchased from Invitrogen. The LA PCR kit was purchased from Clontech, and the FastDigest® restriction enzymes and T4 ligase were obtained from Fermentas. HiTrap columns, Superdex 75 10/30, and the Ettan CAFTM MALDI sequencing kit were purchased from GE Healthcare. C18 stage tips were purchased from Proxeon Biosystems A/S. Z-FR-AMC2 was obtained from Bachem, and Z-LR-AMC, Boc-LKR, and Boc-GKR were purchased from the Peptide Institute Inc. (PeptaNova GmbH, Sandhausen, Germany). All other chemicals were obtained from Sigma-Aldrich unless otherwise specified.
De Novo Sequencing of Dionain-1 Peptides
Digestive fluid from mechanostimulated Venus flytrap leaves was analyzed by SDS-PAGE using a 5–15% gradient gel. The suspected major band containing dionain-1 was excised for in-gel trypsin digestion or analyzed using automated Edman degradation (Procise 494-HT protein sequencer) after electrophoretic blotting to an Immobilon membrane. The trypsin digestion was performed overnight at 37 °C and followed by acidification with 0.1% trifluoroacetic acid (TFA), C18 reverse-phase micropurification, and spotting onto a target plate for matrix-assisted laser desorption ionization mass spectrometry (MALDI-MS) using α-cyano-4-hydroxycinnamic acid matrix. Chemical-assisted fragmentation (Ettan CAFTM) was employed to facilitate the fragmentation and interpretation of the generated MS/MS spectra. The MALDI-MS analysis was performed on a Micromass Q-TOF Ultima Global mass spectrometer (Waters).
Cloning of cDNA Encoding Dionain-1
To stimulate the dionain-1 transcription, prey digestion by Venus flytrap was initiated by feeding the plants with yellow mealworm beetles (Tenebrio molitor). After 40–88 h, the trap leaves were collected, rinsed in water to remove residual beetle remains, snap-frozen in liquid nitrogen, and stored at −80 °C. Total RNA was isolated using a cetyltrimethylammonium bromide protocol, followed by mRNA purification using oligo(dT) magnetic beads. Intact transcripts were amplified by RNA ligase-mediated PCR, in which an RNA adapter is linked to dephosphorylated intact 5′ transcript ends (23). First-strand synthesis was performed with 5′-oligo(dT)-linker-primer-3′ and Moloney murine leukemia virus RNase H reverse transcriptase and followed by long and accurate PCR (Sigma-Aldrich) amplification using a 5′-cDNA primer (adapter sequence) and a 3′-primer downstream of the poly(A) tail. For subsequent 5′-RACE PCR, dionain-1 reverse primers were designed from two de novo sequenced peptides by MALDI-MS/MS, DCDTDGNDK (5′-ATAACCATCTTCACCCCAAGAXGTXCCCCA) and WGTSWGEDGY (5′-TTTATCGTTACCATCTGTXTCXCAXTC), where X denotes degenerate positions during primer synthesis to sample all possible codons. The PCR and subsequent TA cloning into pBS II sk+ vector resulted in generation and sequencing of full-length dionain-1 cDNA, which was deposited with GenBankTM (access code KP663370). The dionain-1 cDNA without the predicted 29-residue signal peptide (by SignalP version 4.0 (24)) was subsequently amplified through the incorporation of XhoI and NotI restriction sites and then cloned into the yeast pPICZαC expression vector. An Asp-to-Gln N98pQ mutant, in which “p” refers to the numbering of the pro-protein sequence, was generated by site-directed mutagenesis.
Expression and Purification of WT and N98pQ Pro-dionain-1
For recombinant expression of WT and N98pQ-mutant pro-dionain-1 in Pichia pastoris, competent KM71H cells were transformed with linearized vector DNA according to the manufacturer's protocols for the EasyCompTM and EasySelectTM Picha expression kits (Invitrogen). The transformants were grown at 28 °C for 3 days on YPDS (yeast extract peptone dextrose medium with sorbitol) plates containing Zeocin (100 μg/ml). High-yield expressing colonies were identified by tests in buffered glycerol complex medium, and large scale expression cultures were grown in buffered methanol complex medium, followed by concentration to A600 at ∼85 and expression at 28 °C for 72 h (WT) or 48 h (N98pQ) with the addition of 0.5% methanol every 24 h. For WT pro-dionain-1, the culture supernatant was dialyzed against buffer A (10 mm Tris-HCl, 10 mm NaCl, pH 7.6), filtered, and loaded onto a 5-ml HiTrap Q column equilibrated in buffer A. Bound protein was eluted using a 1%/min gradient of buffer B (10 mm Tris-HCl, 1 m NaCl, pH 7.6), and fractions containing pro-dionain-1 were pooled and dialyzed into 10 mm Tris-HCl, 20 mm NaCl at pH 7 and then concentrated. The final protein concentration was determined using a BCA kit (Thermo Scientific). For pro-dionain-1 N98pQ, ammonium sulfate was added to the culture supernatant to a final concentration of 1 m, filtered, and applied to a 5-ml HiTrap Phenyl HP column equilibrated in buffer A (20 mm Tris-HCl, 1 m ammonium sulfate, pH 7.0). Bound protein was eluted using a 2%/min gradient of buffer C (20 mm Tris-HCl, pH 7.0). Fractions containing pro-dionain-1 N98pQ were collected and dialyzed against 20 mm Tris-HCl, 100 mm NaCl, pH 7.0, and then concentrated. The concentration of the purified zymogens was estimated by absorbance at 280 nm, using the BCA kit (Thermo Scientific) and SDS-PAGE analysis. The yield was ∼150 mg/liter of culture medium. Protein identity was determined by LC-MS/MS analysis (see below). For further purification of the highest glycosylated WT variant, gel filtration was performed using a Superose 6 column (GE Healthcare) equilibrated in 20 mm phosphate buffer (pH 7.0) containing 100 mm NaCl.
SDS-PAGE and Edman Degradation
SDS-PAGE was performed using 5–15% gradient gels and a discontinuous ammediol/glycine buffer system (25). Dionain-1 samples at pH 7.0 were boiled for 5 min in the presence of 50 mm DTT and sample buffer. Dionain-1 samples at pH 3.2–6.0 were treated with 1 mm freshly prepared E-64 solution for 10 min on ice and then boiled in the presence of 50 mm DTT and sample buffer. Deglycosylation of pro-dionain-1 WT was conducted by treatment with endo-β-N-acetylglucosaminidase H for 2 h at 37 °C at a 1:30 (w/w) ratio (endo-β-N-acetylglucosaminidase H/dionain-1) in 10 mm Tris-HCl, 50 mm NaCl, pH 7.0. Gels were stained with Coomassie Brilliant Blue or blotted for 20 min to an Immobilon-P membrane (Millipore) for N-terminal protein sequence analysis by Edman degradation using a Procise 494-HT protein sequencer (Applied Biosystems).
Mass Spectrometry Analyses (LC-MS/MS)
To identify proteins from the gel bands, the relevant material was excised and subjected to overnight in-gel digestion by trypsin (26). The tryptic peptides were purified by C18 stage tips (Proxeon, Thermo Scientific) and analyzed on a TripleTOF 5600+ instrument (AB Sciex) equipped with a Nanospray II source (AB Sciex). The instrument was coupled in-line with an EASY-nLC II system (Thermo Scientific) equipped with a trap column and a 15-cm analytical column pulled in-house and packed with ReproSil-Pur C18-AQ 3-cm resin (Dr. Maisch GmbH, Ammerbuch-Entringen, Germany).
The collected MS files were converted to Mascot generic format (MGF) using the AB SCIEX MS Data Converter beta 1.1 (AB SCIEX) and the “proteinpilot MGF” parameters. The generated peak lists were searched against the Swiss-Prot database using an in-house Mascot search engine (Matrix Science). Search parameters allowed one missed trypsin cleavage site and propionamide (in-gel digest) or carbamidomethyl (in-solution) as a fixed modification with peptide tolerance and MS/MS tolerance set to 10 ppm and 0.2 Da, respectively. The Mascot-provided exponentially modified protein abundance index value was adjusted for molecular mass to calculate the relative protein amount (27). The relative protein amount was based on three replicates.
Proteolytic Susceptibility toward Trypsin
Five μg of each pro-dionain-1 variant (WT, deglycosylated, and N98pQ) was incubated with 0.1 μg of trypsin for either 5 or 15 min at 37 °C, which was followed by immediate transfer to ice and the addition of SDS-PAGE sample buffer. The samples were boiled and analyzed on a 5–15% gradient gel.
Circular Dichroism Spectroscopy
The wavelength spectra and thermal unfolding of WT and N98pQ dionain-1 were assessed by circular dichroism (CD) on a Jasco J-800 spectropolarimeter using a quartz cuvette of 0.1-cm path length and a protein concentration of 0.2 mg/ml in 20 mm NaH2PO4/Na2HPO4 buffer, pH 6.0, with 2 mm E-64. The wavelength spectra were recorded in the far-UV area from 200 to 250 nm at 20 °C to verify similarity in secondary structure content. Thermal unfolding was assessed by monitoring changes in the signal at 222 nm over a temperature range from 20 to 95 °C with a scan rate of 90 °C/h. The thermal melting point was estimated by fitting each thermal denaturation curve to a two-state unfolding model using the Kaleidagraph software (Synergy Software).
Characterization of Pro-dionain-1 Maturation Using SDS-PAGE
WT pro-dionain-1, deglycosylated pro-dionain-1, or N98pQ pro-dionain-1 was incubated at 0.2 mg/ml in standard phosphate-citrate buffer solutions (0.1 m citrate with 0.2 m Na2HPO4 adjusted to the desired pH). To determine the effect of pH, intervals were tested using buffers of pH 3.2, 3.4, 3.6, 3.8, 4.0, and 7.0 at 37 °C for 40 min. To follow the automaturation over time, incubation periods of 0, 15, and 30 min were used in a phosphate-citrate buffer solution at pH 4.0 All incubation periods were followed by inhibition with 1 mm E-64 on ice before SDS-PAGE analysis.
Identification of Pro-domain Peptides
A total of 50 μg of pro-dionain-1 was incubated at 0.2 mg/ml in phosphate/citrate buffer solutions at pH 4.0 for 10 min, followed by the addition of 1 mm E-64 and subsequent acidification by TFA to a final concentration of 0.5%. Pro-domain peptides ranging from 6 to 10 kDa were purified by reverse phase HPLC on a Brownlee SPP C18 column (PerkinElmer Life Sciences) equilibrated in 0.1% TFA (solvent A). The peptides were eluted using a linear gradient from solvent A to 90% acetonitrile. The relevant fractions were analyzed using SDS-PAGE, and the masses were determined by MALDI-TOF-TOF mass spectrometry and compared with relevant pro-domain sequences (GPMAW software).
Kinetic Profiling of Pro-dionain-1 Maturation
Incubation of pro-dionain-1 at various pH values, temperatures, and concentrations was performed in phosphate-citrate buffer solutions (0.1 m citrate with 0.2 m Na2HPO4 adjusted to the desired pH) with a 0.25 mm concentration of the fluorogenic substrate, Z-FR-AMC, in a 96-well plate setup using a FluoStar Omega plate reader at λex = 355 nm and λem = 485 nm for product detection. Assay buffers were preincubated in 96-well half-area plates (Costar 3881) at the indicated temperatures, and 10 nm final pro-dionain-1 was used for each well. Where relevant, the change in enzyme activity was extracted according to the first derivative of the fluorescent signal. Autocatalytic fitting was performed using a two-step model assuming the following reactions and rate expressions, where E* represents the zymogen and E represents the mature enzyme: 1) intermolecular “pro” activation,
![]() |
![]() |
and 2) intermolecular “mature” activation.
![]() |
![]() |
The combined rate equation can be expressed as follows.
![]() |
Assuming [E](t) = E*(0) − E*(t), we can express the following,
![]() |
where E*(0) is the initial amount of pro-dionain-1.
For the automaturation at different temperatures, an apparent rate constant (kapp) was extracted for each curve as the inverse of the time to reach a fixed product signal of 5000 relative fluorescence units (RFU). A classic semilog Aarhenius relationship was observed of kapp as a function of temperature.
Kinetic Profiling of Mature Dionain-1
Mature dionain-1 was produced by incubation of reduced (in 2 mm DTT, 15 min, room temperature) pro-dionain-1 (0.2 mg/ml) for 8–10 min at 45 °C in 50 mm sodium acetate at pH 4.0. The mature enzyme was used immediately. The Z-FR-AMC substrate was used for kinetic profiling either at 0.25 mm or in a dilution series to determine kinetic constants. The catalytic competence of dionain-1 at different temperatures (4, 20, 30, 40, 50, 60, 70, and 80 °C) was probed by end point fluorescence after 15-min incubation of dionain-1 in preheated 50 mm sodium acetate, pH 5.5, assay buffer containing 2 mm DTT and 0.25 mm Z-FR-AMC. The reaction was stopped by the addition of 1 mm E-64 and incubation for 10 min at room temperature before measuring total fluorescence. The effect of pH on the hydrolysis rate was assessed at 35 °C in phosphate-citrate buffer solutions from pH 2.0 to 7.0 adjusted to a constant ionic strength of 0.5 m with KCl. Maximal hydrolysis rates were extracted and normalized to the observed maximal rate. The kinetic constants for activated dionain-1 and papain (Sigma-Aldrich, P4752) were determined in 10 mm phosphate buffer at pH 6.0 and 35 °C. A final theoretical concentration of 5 nm dionain-1 was used, and the actual amount of active sites was quantified using E-64 titration series with preincubation of dionain-1 at 100 nm with relevant E-64 concentrations. For papain, preincubation for 10 min at room temperature with 2 mm DTT preceded kinetic measurements at a final concentration of 2.5 nm. The active site titration by E-64 was performed at the final 2.5 nm papain concentration, and it produced an estimate within 2% of the theoretical concentration. All assays were based on triplicate determinations and repeated at least twice with similar results. Km and Vmax were extracted using Kaleidagraph software (Synergy), and kcat was calculated as Vmax/[ET], where [ET] is the concentration of active sites determined by E-64 titration.
Specificity Profiling Using an Internally Quenched Fluorogenic Probe (IQFP) Library
Substrate preference for mature dionain-1 was probed with a library of 3375 IQFPs (Mimotopes). The peptides in this library consist of AMC-GGXYZGGDPAKK, where each variable position (X, Y, and Z) contains equimolar ratios of A/V, L/I, K/R, S/T, N/Q, E/D, F/Y, or P. Thus, each X, Y, and Z combination results in a peptide pool of 23 unique sequences, except for the proline-containing peptides, where the number of unique sequences is 1, 2, or 4. In total, the assay contains 512 peptide pools divided into microtiter plate wells for the analysis. The peptides remain optically silent in the uncleaved state, whereas upon cleavage, they emit a fluorescent signal with an intensity that is proportional to the extent of the cleavage. The library has been validated as a tool for specificity profiling against the major protease classes (28). The IQFP assay for dionain-1 was performed in pH 6 assay buffer (50 mm sodium acetate) at 40 °C for 2 h with 6 nmol of total substrate and 0.37 pmol of activated dionain-1 in each well (2.5 nm). The extent of cleavage was evaluated as the background-subtracted end point fluorescence (λex = 320 nm, λem = 420 nm). The intensity-weighted score distribution was calculated as follows,
![]() |
where I denotes the product intensity score in RFU, i denotes observed individual substrate pools above 1000 RFU, and n denotes the residue type. An analysis of exact cleavage sites for the top 50 substrates was performed by MALDI-MS. In brief, 2 μl of each cleaved IQFP solution was acidified with 8 μl of 0.1% TFA solution, micropurified using C18 Stagetips (Proxeon), and spotted onto MALDI target plates using an α-cyano-4-hydroxycinnamic acid matrix. The MS analysis was performed on a Micromass Q-TOF Ultima Global mass spectrometer (Waters/Micromass).
Specificity Profiling of Dionain-1 against Bovine β-Casein
A total of 5 μg of bovine β-casein (14 kDa; Sigma) was analyzed on a 5–15% gradient gel, and the intact protein band was cut out and subjected to in-gel digestion with dionain-1 (1:50, w/w) at 37 °C overnight. After digestion, the sample was acidified with 0.1% TFA, micropurified using Poros® 50-μm R2 reverse phase material (Applied Biosystems), and analyzed by LC-MS/MS, as described above. Unique cleavage sites in β-casein were identified based on the peptides that were detected using Mascot version 2.3.02 (Matrix Science). A control sample of bovine β-casein without dionain-1 was included, and the four cleavage sites identified herein were excluded from the dionain-1 specificity analysis. The frequency of each residue identity on positions P3–P1′ was calculated by dividing the number of observations (pobs) by the total number of this residue type in β-casein (ptotal) after subtracting the restricted positions with a P1′ proline identity (ppro).
![]() |
Specificity Profiling against AMC Substrates
All substrates were solubilized according to the manufacturer's instructions. For Z-FR-AMC and Z-RR-AMC, 1.6 mm stocks were prepared in H2O. For all other substrates, 20 mm stocks in DMSO were used. The activities of mature dionain-1 (at 10, 50, and 250 nm) and papain (at 5, 25, and 125 nm) were tested against a 0.25 mm concentration of each peptide substrate. The average hydrolysis rates were extracted as RFU/s and normalized to the total fluorescent signal after complete substrate conversion, through the addition of excess papain. Of note, quantification was performed by normalization of the fluorescent yield to the theoretical substrate concentration because manufacturer variations precluded reliable AMC fluorophore normalization.
Dionain-1 Digest of Drosophila melanogaster
Frozen D. melanogaster were homogenized in 50 mm sodium acetate, pH 4.0, using 10 μl/fly. Thirty μl of the extract (representing three flies) were titrated with 0, 0.625, 1.25, 2.5, 5, and 10 μg of activated dionain-1. The samples were incubated at 37 °C for 90 min with shaking at 1000 rpm. After incubation, 200 mm Tris-HCl, pH 7.6, was added, and one-third of the sample was immediately boiled in the presence of SDS sample buffer and 50 mm DTT and analyzed by SDS-PAGE. The content of selected protein bands was identified by mass spectrometry. The remaining two-thirds of the sample was lyophilized, reduced with 10 mm DTT in the presence of 8 m urea, and alkylated in 30 mm iodoacetamide. The sample was diluted with 20 mm ammonium bicarbonate to reduce the urea concentration to 2 m and treated with 1 μg of trypsin for 16 h at 37 °C. The samples were micropurified and analyzed by mass spectrometry (see above).
Crystallization and Data Collection of Mature Dionain-1
Purified wild-type recombinant pro-dionain-1 was prepared for crystallization by maturation at pH 4.1 (50 mm sodium acetate) at 45 °C for 70 min. For inhibition, E-64 was added in a 6.5-fold molar excess and incubated for 30 min at room temperature before applying the mixture to a Superdex 75 10/30 column (GE Healthcare) at 4 °C, which was equilibrated with 10 mm Tris-HCl containing 20 mm NaCl at pH 7. The purification was performed using an ÄKTA Purifier FPLC system operated at 4 °C (GE Healthcare). The eluate was monitored at 280 nm, and fractions were collected and assayed for the presence of mature dionain-1 + E-64 using SDS-PAGE (10% acrylamide gel, Tricine buffer system) followed by Coomassie Brilliant Blue staining. The pooled fractions were concentrated to 37 mg/ml using a Vivaspin centrifugal concentrator unit with a 5000 molecular weight cut-off polyethersulfone membrane (Sartorius).
Crystallization assays were performed by the sitting drop vapor diffusion method, and reservoir solutions were prepared by a Tecan robot, with 100-nl crystallization drops dispensed on 96 × 2-well MRC plates (Innovadyne) by a Phoenix NanoDrop robot (Art Robbins) or a Cartesian Microsys 4000 XL (Genomic Solutions) robot at the joint Institut de Biologia Molecular de Barcelona/Institute for Research in Biomedicine Automated Crystallography Platform at Barcelona Science Park. In total, 768 different conditions were screened. Plates were stored in Bruker steady temperature crystal farms at 4 and 20 °C. Successful conditions were scaled up to the microliter range in 24-well Cryschem crystallization dishes (Hampton Research).
Mature dionain-1 + E-64 in 10 mm Tris-HCl, 20 mm NaCl, pH 7, was crystallized at 4 °C from equivolumetric drops with 0.8 m K2HPO4, 0.9 m NaH2PO4, 0.2 m Li2SO4, 0.1 m CAPS, pH 10.5, as reservoir solution. Crystals were cryoprotected by immersion in reservoir solution stepwise supplemented with glycerol from 0 to 30% (v/v). The optimal diffraction data set was collected at 100 K from a liquid N2 flash-cryocooled crystal (Oxford Cryosystems 700 series cryostream) on an Area Detector Systems Corp. Quantum Q315r detector at beam line ID23-1 of the European Synchrotron Radiation Facility synchrotron (Grenoble, France) within the block allocation group “Barcelona.” Diffraction data were integrated, scaled, merged, and reduced with the programs XDS (29) and XSCALE (30) and transformed with XDSCONV to formats suitable for the CCP4 suite of programs (31). The crystal was centered monoclinic and contained one protein molecule per asymmetric unit (VM = 2.20 Å3/Da; solvent content = 44%).
Structure Solution and Refinement
The structure was solved by maximum likelihood-scored molecular replacement with the program PHASER (32) and a search model obtained by trimming the side chains of the structure of castor bean (Ricinus communis) cysteine peptidase (Protein Data Bank (PDB) entry 1S4V (17)) with CHAINSAW according to a sequence alignment with dionain-1. These calculations yielded a unique solution with the following final Eulerian angles (α, β, γ in degrees) and fractional cell coordinates (x, y, and z): 155.2, 30.8, and 198.3 and −0.382, −0.499, and 0.230. This solution had initial rotation/translation function Z-scores 19.8/14.1 and yielded an overall log likelihood gain of 728 and an R value of 48.5%. The adequately rotated and placed molecule was subsequently refined and completed by manual model building with the COOT program (34), which alternated with crystallographic refinement with PHENIX (35) and BUSTER/TNT (36) under inclusion of TLS refinement. The final refined model of dionain-1 consisted of residues Val2–Ala223 (mature peptidase numbering; add 129 for the full-length protein in GenBankTM entry KP663370.1), an E-64 molecule attached to the catalytic cysteine (Cys26), one phosphate anion, and 297 solvent molecules. Table 1 presents the final refinement and model quality statistics. The ideal coordinates and parameters for crystallographic refinement of the E-64 ligand were obtained from the PRODRG server (37). Structural similarity searches were performed with DALI (38), and structure figures were prepared with the CHIMERA program (39). The experimental dionain-1 structure was validated with MOLPROBITY (40). The final coordinates of mature D. muscipula dionain-1 were deposited in the Protein Data Bank (entry 5A24).
TABLE 1.
Crystallographic data
Data set | Dionain·E-64 complex |
---|---|
Space group | C2 |
Cell constants (a, b, and c (Å); β (degrees)) | 73.85, 47.79, and 57.79; 92.44 |
Wavelength (Å) | 1.0052 |
No. of measurements/unique reflections | 114,191/30,734 |
Resolution range (Å) | 40.1–1.50 (1.70–1.50)a |
Completeness (%) | 94.9 (94.8) |
Rmergeb | 0.027 (0.089) |
Rr.i.m. (= Rmeas)c/CC(1/2)c | 0.031 (0.103)/0.999 (0.993) |
Average intensity (〈I〉/σ(I))d | 30.7 (16.0) |
B-Factor (Wilson) (Å2)/Average multiplicity | 19.9/3.1 (3.8) |
Resolution range used for refinement (Å) | 13.3–1.50 |
No. of reflections used (test set) | 30,697 (774) |
Crystallographic Rfactor (free Rfactor)b | 0.158 (0.172) |
No. of protein atoms/solvent molecules/covalent ligands/ionic ligands | 1622/297/1 E-64/1 PO43− |
Root mean square deviation from target values | |
Bonds (Å)/angles (degrees) | 0.010/1.01 |
Average B-factors (Å2) | 19.6 |
B-factor protein parts (Å2) | 17.1 |
B-factor ligand E-64 (Å2) | 30.6 |
B-factor solvent molecules (Å2) | 31.7 |
All-atom contacts and geometry analysise | |
Residues in favored regions/outliers/all residues | 212 (96.4%)/0/220 |
Residues with poor rotamers/bad bonds/bad angles | 4 (2.35%)/0/0 |
Residues with Cβ deviations >0.25 Å/Clashscore | 0/4.48 (95th percentile) |
MolProbity score | 1.75 (65th percentile) |
Results and Discussion
De Novo Sequencing of Full-length Pre-pro-dionain-1
The most abundant protein in the digestive fluid of the Venus flytrap, based on SDS-PAGE and Coomassie Brilliant Blue staining, is a ∼45-kDa cysteine protease named dionain-1, reflecting its origin (Dionaea) and homology with papain (41). A more recent combined transcriptomics and proteomics approach revealed the existence of four different dionains in total present at both the transcript and protein levels (5). To verify the high abundance of dionain-1 in the digestive fluid, we analyzed the composition of the Venus flytrap digestive fluid after mechanical stimulation by SDS-PAGE followed by N-terminal sequencing of the major protein band migrating at ∼45 kDa (Fig. 1A). The result revealed a single N terminus with the sequence DVPAAVDXRTAGAVTP (where X denotes ambiguous residue identification), uniquely identifying the protein as dionain-1. The major protein band was also digested with trypsin to facilitate de novo sequencing of peptides using chemically assisted fragmentation MALDI-MS. Two selected peptides were used to generate designated primers for the amplification and cloning of the complete pre-pro-dionain-1 cDNA sequence through targeted 5′-RACE PCR as described under “Experimental Procedures.”
FIGURE 1.
Pre-pro-dionain-1 sequence and maturation. A, Venus flytrap digestive fluid was subjected to SDS-PAGE, and the resolved proteins were blotted to a PVDF membrane and visualized by Coomassie Brilliant Blue staining. A major protein band was observed at ∼45 kDa and subjected to N-terminal sequencing. The obtained sequence (DVPAAVD) corresponded to the N-terminal of dionain-1. B, schematic representation of the full-length dionain-1 cysteine protease (pre-pro-dionain-1) translated from the 5′-race PCR-generated cDNA. Sequence features include the signal peptide (underlined), the pro-domain (residues p1–p100) containing Asn98p as possible N-linked glycosylation site (*), the cleavage site for automaturation (arrow), and the mature protease (residues 1–223) with the active site catalytic dyad, Cys26 and His165, indicated in boldface red type. C, SDS-PAGE analysis of pro-dionain-1 expressed in P. pastoris. Pro-dionain-1 was extensively glycosylated (WT) but migrated as a single band at ∼50 kDa upon deglycosylation with endo-β-N-acetylglucosaminidase H (Endo H). *, Endo H band. SDS-PAGE analysis of the purified N98pQ pro-dionain-1 mutant (N98pQ) confirmed the location of the N-linked glycosylation at this position in the pro-domain.
The full-length dionain-1 cDNA sequence (GenBankTM KP663370) is 1056 base pairs long and encodes a 352-residue pre-pro-dionain-1 with a molecular mass of 37.8 kDa (Fig. 1B). The pre-pro-dionain-1 cDNA sequence obtained by high-throughput next-generation sequencing (5) as well as the previously purified and sequenced dionain-1 peptides (41) fit well with the sequence reported here by the targeted cloning approach. Pro-dionain-1 shares 37% sequence identity with pro-papain from Carica papaya and 44% sequence identity with the cysteine endopeptidase precursor (pro-Cys-EP) from the castor bean (R. communis), representing the closest ortholog with a crystal structure of the mature protease domain (PDB code 1S4V (17)). The pro-domain of dionain-1 forms the first 100 residues (Ser1p-Thr100p; corresponding to Ser30–Thr129 of the full-length pre-pro-protein sequence, followed by the mature protease (Asp1–Ala223; Asp130–Ala352 of full-length pre-pro-protein) with active site residues Cys26 and His165, corresponding to Cys25 and His159 in papain, respectively.
Heterologously Expressed Pro-dionain-1 Is Glycosylated and Automaturates at Acidic pH
The functional expression of pro-dionain-1 (323 residues, 34.8 kDa, pI 4.7) was achieved in P. pastoris using the endogenous yeast signal peptide in place of the 29-residue predicted dionain-1 signal sequence (by SignalP version 4.0 (24)). The pro-form was purified by anion exchange and yielded a protein migrating at sizes ranging from 50 to 66 kDa in SDS-PAGE. We attributed the heterogeneity in mass to variations of N-linked glycans at residue Asn98p. Glycosylation was verified by treating pro-dionain-1 with endoglycosidase H and by producing and purifying a glycosylation-deficient N98pQ mutant (Fig. 1C). Both approaches resulted in a protein that migrated as a single band of ∼50 kDa.
The N terminus of pro-dionain-1 was identified by Edman degradation as S1pSSRLLTS, which is consistent with the predicted processing site. Minor proportions were processed to yield N termini T7pSSEQ (∼20% of total) and S8pSEQV (∼10%). The molecular mass of the deglycosylated form of pro-dionain-1 derived from SDS-PAGE (∼50 kDa) differed from the theoretical value (∼35 kDa) by ∼15 kDa. This has previously been observed for other plant cysteine proteases (42). The aberrant migration during reducing SDS-PAGE was most likely caused by acidic motifs in the active enzyme (residues 1–223) because the activated form similarly migrated abnormally at ∼45 kDa (Fig. 1A), despite a theoretical molecular mass of 23.1 kDa.
The active site in the pro-form of the C1 papain proteases is sterically blocked by a reverse-oriented substrate-like pro-peptide stretch (16, 43, 44). Catalytic competence is gained upon acidification by the loosening of pro-domain contacts and proteolytic maturation (45–47). To investigate the activation of pro-dionain-1, the pro-form was exposed to a series of acidic buffers with pH values between 2.4 and 5.0. This mimicked biological digestive fluid conditions in which slow acidification from approximately pH 4.4 to 3.4 occurs (4). The SDS-PAGE analysis demonstrated accelerated maturation upon pH reduction (Fig. 2A). Automaturation was also followed over time at pH 4.0 and produced a similar maturation pattern, suggesting that pH reduction accelerated the process without altering its features. The autoproteolytic processing of the pro-domain was evident from the appearance of small molecular mass bands representing pro-dionain-1 fragments. The major species observed at ∼9 kDa was found to have an S8pSE sequence as the N terminus. The pro-domain peptide fragments were subjected to further proteolytic breakdown, indicating the presence of initial preferential cleavage sites. Reverse phase purification and subsequent analysis of the intact peptide fragments by mass spectrometry identified S8pSEQ…PLK85p (9.3 kDa), D30pDA…PLK85p (6.6 kDa), and D30pDA…GYK81p (6.1 kDa) as the three major peptide fragments. The occurrence of these peptides preceded the full maturation of dionain-1, indicating that shedding of the glycosylated pro-domain peptide is a sequential process. Furthermore, the results suggested a substrate preference of dionain-1 for hydrophobic/aliphatic residues at P2 and positively charged residues at P1.
FIGURE 2.
Effect of pH and temperature on pro-dionain-1 automaturation. A, pro-dionain-1 was incubated at 0.2 mg/ml at the indicated pH values at 37 °C for 45 min. E-64 (1 mm) was added, and the samples were analyzed by SDS-PAGE. Low pH triggered automaturation to produce mature dionain-1 (∼45 kDa, arrow) and degradation of its own N-terminal pro-domain. B, the enzymatic activity was followed by Z-FR-AMC hydrolysis over time as a function of pH, indicated as RFU. Each curve represents triplicate mean values with experimental deviation of <2.5%. The time to reach the maximal enzyme activity (rate plateau) was extracted through first derivative plots (see inset). The curve profiles clearly demonstrated the pH dependence of the automaturation process. C, the maximum enzyme activity (rate plateau value in RFU/s) was plotted as a function of pH. Error bars, plateau value fit error to the triplicate mean derivative plot. The inset illustrates the derivative plot at pH 3.8 and the fitting curve to the automaturation function (see “Experimental Procedures”).
To better understand the catalytic activation of dionain-1, we used a Z-FR-AMC fluorogenic substrate and analyzed the substrate conversion over a range of pH values in a phosphate/citrate buffer system. The maturation of dionain-1 progressed autocatalytically in the pH range from pH 5.0 to 3.4. The exponential phase was essentially absent below pH 3.4 (Fig. 2B), and in this range, the maximal rate of substrate hydrolysis was reduced with pH, most likely by partial protonation of the active site cysteine (pH-induced change in the AMC fluorophore quantum yield was not significant). Furthermore, the time required to reach full catalytic competence (rate plateau) rapidly decreased upon acidification (Fig. 2B, inset).
Next, we extracted the peak substrate conversion rates as a function of pH. The optimal enzymatic efficiency of dionain-1 was achieved at approximately pH 3.6 and expressed the balance between pro-domain destabilization, catalytic competence, and autodegradation (Fig. 2C). Additional analyses revealed a clear Arrhenius relationship of the automaturation process upon temperature increase, confirming that no apparent thermal threshold is required to unfold or facilitate pro-domain destabilization. The maturation of dionain-1 protease was concentration-dependent, suggesting the dominance of intermolecular trans-activation events. This result correlates well with the pro-domain location of the major observed cut at Lys85p that cannot be performed intramolecularly because of the apparent distance to the active site cleft (see below). Overall, the autolysis of pro-dionain-1 is thermally accelerated and only proceeds to a significant extent below pH 5.0. It reaches a maximum at pH 3.5–3.8 before the protonation of key residues or acid-mediated partial denaturation slows the catalytic activity. This suggests that pro-dionain-1 maturation is tailored for a slightly more acidic environment than that of pro-papain, which presents optimal automaturation at pH 3.5–4.0 (45).
Pro-region Glycosylation Supports Proper Folding of Dionain-1 but Does Not Influence Stability
The presence of an N-linked glycan attached to Asn98p in pro-dionain-1, only two residues away from the mature N terminus (Fig. 1), prompted us to investigate the effect of glycosylation on both domain stability and automaturation. First, the maturation of WT and endo-β-N-acetylglucosaminidase H-treated pro-dionain-1 was tracked by SDS-PAGE and showed no apparent difference (Fig. 3A). Second, the same D1VVPA mature dionain-1 N terminus was observed by Edman degradation. Third, thermal denaturation by CD spectroscopy was conducted to address the structural stability and gave very similar transition midpoints (Tm) of 72.4 ± 0.1 and 73.0 ± 0.1 °C for WT and deglycosylated pro-dionain, respectively (Fig. 3B). Because no apparent effect on enzyme function was found for the Asn98p glycosylation, we addressed the glycosylation-deficient N98pQ mutant. This variant was successfully secreted by P. pastoris but displayed severe cooperative unfolding above 40 °C with a Tm of only 42.7 ± 0.1 °C (Fig. 3B). High proteolytic susceptibility of N98pQ toward trypsin corroborated the lack of proper domain folding, where both WT and deglycosylated pro-dionain-1 were well protected. These data collectively suggest that pro-dionain-1 glycosylation serves to ensure proper folding of the pro-enzyme through the secretory pathway during biosynthesis. Note that pro-domain glycosylation is found in the homologous human pro-cathepsin K (48) and is considered a prerequisite for functional expression of pro-papain in insect cells (49). However, because pro-domain glycosylation motifs are not always used (42) or may be absent in certain cysteine proteases, it is not a per se requirement for proper folding and processing of C1 proteases.
FIGURE 3.
The role of pro-domain glycosylation on automaturation and stability. A, SDS-PAGE analysis of WT and deglycosylated pro-dionain-1 after indicated incubation times at pH 4.0 revealed that automaturation proceeded to a similar extent, independent of Asn98p glycosylation. *, endo-β-N-acetylglucosaminidase H band. B, thermal denaturation profiles of WT, deglycosylated, and N98pQ pro-dionain-1 measured by CD in the presence of 2 mm E-64. Fits to a two-state unfolding model resulted in Tm values of 72.4 ± 0.1, 73.0 ± 0.1, and 42.7 ± 0.1 °C, respectively.
Dionain-1 Activity and Kinetic Parameters
The acidic digestive fluid of Venus flytraps leads to the production of mature dionain-1. The pro-region shedding at low pH to produce a more compact functional form (residues 1–223) is similar to the process in papain and other related extracellular plant cysteine proteases participating in defense mechanisms in the latex of fruits and plants during wounding (19). To determine the kinetic parameters of mature dionain-1, the steady-state hydrolysis rate of Z-FR-AMC was probed as a function of pH and compared with that of papain (Fig. 4A). The optimum activity was found at the pH interval from 5.4 to 6.0, and a rapid decline was observed above pH 7.0. Interestingly, dionain-1 maintained higher relative activity on the acidic limb of the bell-shaped curve compared with that of papain. At pH 3.4, dionain-1 displayed 35% of maximal activity, whereas papain was only 5% active. The catalytic competence of the mature acidic cysteine proteases is primarily linked to the formation of a cysteine-S−/imidazolium-H+ ion pair. The particular reactive Cys displays a low microscopic pKa value (2.5–3.4), but surrounding electrostatic modulators of the active site result in a bell-shaped curve with optimum proteolytic activity at approximately pH 6.0 (50–53). The pH optimum of dionain-1 and the conservation of the most important electrostatic modulators around the active site indicate a similar catalytic construction. However, the biological environment in the digestive fluid of the Venus flytrap requires dionain-1 to function from pH 3.2 to 4.5, which underscores how both the optimum for automaturation (Fig. 2) and the enhanced acidic activity (Fig. 4) may reflect an adaption to functioning in the acidic environment of the “green stomach.”
FIGURE 4.
Effect of pH and temperature on dionain-1 activity. A, normalized dionain-1 and papain product conversion rate of Z-FR-AMC as a function of pH. Dionain-1 displays a skewed bell-shaped curve with pH optimum around pH 6. B, normalized end point product conversion after 15-min incubation at the indicated temperatures. Error bars, triplicate S.D.
The thermal effect on dionain-1 substrate hydrolysis was found to be positive up to ∼60 °C, after which the activity rapidly declined. This was most likely caused by denaturation and enhanced autodegradation (Fig. 4B). The kinetic parameters of dionain-1 at pH 6.0 and 35 °C against Z-FR-AMC were determined and compared with those of papain (Table 2). In both cases, the amount of active enzyme was determined by active site titration with E-64. Notably, poor E-64 inhibition of dionain-1 occurred, possibly due to the presence of interfering pro-domain sequences, and resulted in a slightly overestimated active site concentration. Interestingly, the Km values for the two enzymes were very similar (68.5 ± 4.7 and 71.3 ± 4.7 μm for papain and dionain-1, respectively), but the catalytic efficiency was higher for papain against this common cysteine protease substrate.
TABLE 2.
Kinetic parameters for papain and dionain-1
kcat | Km | kcat/Km | |
---|---|---|---|
s−1 | μm | s−1/m | |
Dionain-1 | 3.33 ± 0.09 | 71.3 ± 4.7 | 46.7 |
Papain | 10.40 ± 0.28 | 68.5 ± 4.7 | 151.8 |
Specificity of Dionain-1 Probed by Substrate Library Profiling
The specificity of dionain-1 was determined using a substrate library screening approach with three variable positions (Trp/His/Met/Cys/Gly not represented). The library contained 3375 different peptide sequences and was based on IQFPs. The analysis suggested that dionain-1 cleaved ∼30% of the substrates in the library to varying extents. Decoding of the resulting main substrate pools (168 sequence motifs) and weighting them by the fluorescence output revealed a high occurrence of Leu/Ile, Arg/Lys, and Phe/Tyr on the variable positions (Fig. 5A). Because the positive identification of substrate cleavage did not provide the P1–P1′ border, we performed mass spectrometry analyses of the top 50 substrate pools, of which the top 10 are shown (Fig. 5B). The identification of Leu/Ile at P2 and Arg/Lys at P1 is consistent with the papain-like preference for hydrophobic residues at P2 and a positively charged P1 (54). In addition, cleavage also occurred with Arg/Lys in the P2 position, which has not been observed for papain (55) (Fig. 5B).
FIGURE 5.
Substrate profiling of dionain-1 using IQFPs. A, intensity-weighted score distribution of residue types in positions Xaa, Yaa, and Zaa found in cleaved substrates above a threshold of 1000 RFU (in total 168 motifs). The analysis demonstrated a preference for residues L/I and R/K. B, top 10 ranked substrate pools listed by intensity score. Boldface positions indicate the P1 site verified by MALDI-MS. Two boldface positions for peptides 8 and 9 indicate identified cleavage after both P1 positions. C, substrate P3–P1′ residue distribution with R/K set to P1 from substrates with R/K in the Zaa or Yaa position. A clear preference for hydrophobic P2 residues (L/I or F/Y) is seen with a minor fraction of N/Q or R/K. Gly at P1′ is part of the linker sequence in the IQFP substrate. D, substrate P3–P1′ residue distribution for peptides with validated cleavage after Gly in the linker sequence (as P1 position). A strong preference for hydrophobic residues at P2 and R/K in P3 is observed. E, substrate P3–P1′ residue distribution (weighted residue frequencies) of the observed β-casein peptide cuts after overnight in-gel digest by dionain-1. Peptide identification was done by LC-MS/MS. Also, cleavage with P2 His (1 of 2 possible), Trp (1 of 1), and Met (1 of 4 possible) was observed but is excluded from the pie diagram shown here. F, relative activities of dionain-1 and papain against different di- and tri-peptide AMC substrates, normalized to the rate of Z-FR-AMC hydrolysis.
To further characterize the P2 preference of dionain-1 observed in the library, peptides with validated cleavage after Arg/Lys were grouped and analyzed for their residue type distribution at P3-P1′ (Fig. 5C). Of the 16 validated unique peptides, 13 contained either Phe/Tyr or Ile/Leu at P2, two contained Asn/Gln, and one contained Arg/Lys. Therefore, hydrophobic residues are favored at P2 by dionain-1, although positively charged residues may also be accommodated. The P3 position did not display any considerable preference, whereas the P1′ position preferred the glycine provided by the linker region. Several cleavages after the first C-terminal linker glycine were observed during the MS validation of P1–P1′ positions in the positive peptide substrates. Because glycine is outside of the variable positions, these cleavage sites would have escaped detection without the MS analysis. The residue preference for the glycine P1 cleavage was followed by a hydrophobic P2 position and an Arg/Lys preference at P3 (Fig. 5D). This identification suggests a need for a stabilizing charge interaction in P3 to support the cleavage after small uncharged residues at P1.
The dionain-1 cleavage of peptides with a charged residue at P2 (R/K) prompted us to investigate differences in substrate preference between papain and dionain-1. A range of di- and tripeptide AMC-coupled substrates were investigated through the extraction of their relative activity benchmarked against Z-FR-AMC (Fig. 5F). The results for the dipeptide substrates Z-FR-AMC, Z-LR-AMC, and Z-RR-AMC showed a preference for Leu in P2 for dionain-1, whereas papain preferred Phe at this position. However, both enzymes only displayed marginal activity with Arg at P2. To test whether the presence of a P3 residue altered the activity toward basic residues at P2, we analyzed the activity toward Boc-LKR-AMC and Boc-GKR-AMC, where the former substrate resembles the composition of one of the top hits in the IQFP library. Dionain-1 was able to cleave Boc-LKR-AMC as efficiently as the Z-LR-AMC substrate but with a clear indication of Leu at P2 and Lys at P1 (identified by the reduced fluorescence yield of R-AMC compared with free AMC). Papain also cleaved this substrate but with reduced efficiency, again preferring Phe over Leu at P2. However, Boc-GKR-AMC was a poor substrate, suggesting that a positively charged residue at P2 was unfavorable, at least in the case of the small peptide substrates tested here.
Dionain-1 Degradation of Intact Protein Targets
The substrate preference of dionain-1 was tested in the context of an intact protein target using bovine β-casein. Here, incubation overnight at an enzyme substrate ratio of 1:50 (w/w) led to extensive β-casein degradation. Analysis of the P3–P1′ identities in the observed proteolytic peptides demonstrated promiscuous P2 acceptance by dionain-1, although hydrophobic/aliphatic residues were still prevalent (∼50% of all cuts) (Fig. 5E). In addition, the substrate preference may change or broaden upon acidification (56) and accelerate autolytic cleavage at the dionain-1 pro-domain border (FT↓DV), not otherwise considered a favored substrate.
Next, we addressed the biological function of dionain-1 as a digestive protease by exposing a Drosophila protein extract to increasing amounts of mature dionain-1 at pH 4.0. A clear degradation of several protein bands was observed by SDS-PAGE, and their content was identified by mass spectrometry (Fig. 6A, lanes 1–6). Interestingly, myosin represented the majority of the proteins in all of the analyzed gel bands, including the bands most susceptible to dionain-1 treatment (Fig. 6A). Although endogenous Drosophila proteases may have contributed to the presence of myosin fragments of various sizes, this protein constitutes an excellent biological substrate for dionain-1. To quantitatively assess the degradation of the most abundant proteins in the Drosophila extract, we analyzed the digests directly by mass spectrometry using the exponentially modified protein abundance index method (Fig. 6B) (27). Myosin heavy chain emerged as a major substrate for dionain-1, supporting our observation by SDS-PAGE. It accounted for ∼27% of the total protein content and was degraded faster than any of the other abundant proteins, with a reduction to 14% of the total protein content at the lowest dionain-1 concentration.
FIGURE 6.
Dionain-1 digest of Drosophila proteins. A, extracts of Drosophila (three animals) were treated with increasing amounts of dionain-1 (0, 0.625, 1.25, 5, and 10 μg). The samples were analyzed by SDS-PAGE (lanes 1–6), including a control incubated without fly extract (lane 7). Dionain-1 treatment resulted in a general reduction in protein content. The bands selected for MS analysis all contained myosin heavy chain or fragments thereof. The most susceptible bands to dionain-1 treatment are marked with asterisks. B, the dionain-1-treated extracts were analyzed by LC-MS/MS, and the relative protein amount was determined by the exponentially modified protein abundance index protocol. The six most abundant proteins are displayed and account for more than 50% of total protein mass. In agreement with the SDS-PAGE analysis, myosin heavy chain is highly prone to degradation by dionain-1. Error bars, triplicate S.D.
The exponentially modified protein abundance index value is essentially based on the number of identified peptides in a protein after trypsin treatment (27). The list of detectable tryptic peptides is reduced by enzymatic action at P1 sites other than Lys and Arg prior to the trypsin treatment. The clear reduction we observed in myosin content underlines the promiscuous nature of dionain-1, which was also evident in the degradation of the β-casein substrate (Fig. 5E). Myosin has a 17% relative content of Lys and Arg residues compared with the average of 10% in the Drosophila proteome. This may explain the effective breakdown by dionain-1, which, although promiscuous, displays a preference for Lys and Arg in the P1 position (Fig. 5). Therefore, in a biological context, dionain-1 mediates the rapid release of peptides high in nitrogen content and facilitates the essential nitrogen uptake for the carnivorous lifestyle.
Structure of Dionain-1
The molecular structure of dionain-1 in complex with the covalent inhibitor E-64 (57) was determined by x-ray diffraction and refined with data to 1.5 Å resolution. As expected, the mature peptidase demonstrated an overall shape reminiscent of other plant cysteine peptidases, such as those of papaya and castor bean (R. communis). Similar to other cysteine proteases from the papain superfamily, dionain-1 has three disulfide bridges and consists of two domains that form the active site cleft at their interface (Fig. 7). The most N-terminal domain is primarily α-helical and harbors the catalytic nucleophile, Cys26 along with Gln20, which forms the oxyanion hole (corresponding to Cys25 and Gln19 in papain (58)). The catalytic His, His165, is located in the β-sheet-rich C-terminal domain along with Asp164, which contributes to catalytic efficiency. The resemblance between dionain-1, papain, and castor bean cysteine protease is reflected by the low root mean square deviation of the corresponding Cα atoms upon structural superposition (0.56 and 0.52, respectively) (Fig. 7C).
FIGURE 7.
The crystal structure of dionain-1. A, dionain-1 is a classical papain-like cysteine protease with an N-terminal α-helical lobe (blue), a C-terminal β-sheet lobe (gray), and three disulfide bridges (red). E-64 is found covalently linked to Cys26 and occupies the active site cleft and the S1 and S2 subsites (purple). His165 of the catalytic dyad is colored in orange. B, B-factor representation of the final model shows only structural ambiguity in a few loop regions. C, superposition of the Cα traces of papain (pink; PDB entry 1PPP), Cys-EP (green; PDB entry 1S4V), and dionain-1 (blue) shows a high degree of similarity and conservation of the overall structural fold.
Catalytic Site and E-64 Contacts
The geometrical arrangement and electrostatic environment around the catalytic site are major determinants of catalytic competence and efficiency through their effect on the pKa values of the catalytic residues. This architecture not only ensures the existence of the cysteine-S−/imidazolium-H+ ion pair but also speeds up the acylation and deacylation steps in the hydrolytic cycle by interaction or stabilization of the transition states (59). Active site His165 is hydrogen-bonded to Asn187 in the dionain-1 crystal structure, and Asp164 forms a contact with the main-chain amide nitrogen of Ala142 and Ser143. This configuration is essentially identical to the arrangement found in both papain (Asp158 contacts Ala136 and Ala137) and castor bean cysteine protease (Asp161 contacts Ala139 and Gly140). The active site Cys26 is covalently linked to the C2 atom of the epoxy ring of the cysteine protease inhibitor, E-64, which neatly fits into the subsites of the substrate-binding cleft upstream of the cleavage site and is held in place by multiple hydrogen bonds (Fig. 8A and Table 3). The arrangement is highly similar to the binding mode found in papain and other cysteine proteases of the same family and causes only a minor expansion of the active site to accommodate the proper coordinate chemistry (60, 61). The oxyanion hole is occupied by the carboxylic acid group of E-64, and the leucyl moiety is located in the S2 pocket of dionain-1, where it is held in place by hydrophobic interactions and by backbone hydrogen bonding of the succeeding E-64 amide nitrogen to the carbonyl oxygen of Gly69 (Fig. 8A). The 4-guanidinobutane moiety of E-64 is not visible in the electron density map (Fig. 8A), most likely due the absence of interactions with the enzyme to restrict its motion. In papain, Tyr61 and Tyr67 hold the guanidinium group in place through bonding via the side-chain hydroxyl groups (61); however, dionain-1 differs by having Arg64 and Thr70 in the corresponding positions. This arrangement indicates clear differences with a possible effect on substrate recognition profiles from S3 subsites and onward between these homologous cysteine proteases.
FIGURE 8.
Structural details of dionain-1. The N-terminal part is shown in blue, and the C-terminal part is shown in gray. A, active site configuration of dionain-1 in complex with E-64 covalently linked to Cys26. Relevant hydrogen bonds are shown in orange. The initial Fourier map for E-64 is superimposed with the final refined coordinates for the part of E-64 that was modeled. B, the defining residues of the S2 pocket of dionain-1 result in a spacious cavity due to Ala139 and Thr70. C, surface presentation of the complete substrate binding cleft in dionain-1 with subsite indication and coloring of important charged groups (S1′: Gln148; S1: Cys66, Asn67, and Asp164; S2: Gln217; S3: Asn62, Asp63, and Asn67). D, comparison of the active site cleft (front view) in papain (purple), Cys-EP (green), and dionain-1 (blue). E, location of loop regions with sequence inserts unique to dionain-1 (compared with papain). Loop 1 is a 2-residue insert (Asp63-Arg64), located in the loop region from Cys57 to Cys66, and results in a protrusion toward the S3-S4 subsites. Loop 2 is 3-residue insert (Ala94-Gly95-Gly96) in the N-terminal domain. Loop 3 is a 6-residue insert (Pro176–Ser181) in the C-terminal domain that expands an antiparallel β-sheet.
TABLE 3.
Hydrogen bonds between E-64 and dionain-1
The figures in parentheses are corresponding distances in the papain-E-64 complex (61).
Subsites of Dionain-1
The nucleophilic attack on the peptide carbonyl group of the P1–P1′ scissile bond by the thiolate anion of Cys26 is guided by the accommodation of the peptide or peptide-like substrate in the subsites of the active site cleft of dionain-1. The unique subsite arrangement is illustrated in Fig. 8C and indicates the electrostatic groups involved in the selectivity and modulation of substrate binding. The S1 subsite of dionain-1 is a relatively shallow V-shaped groove that primarily provides main chain stabilization, as seen in the E-64 contacts formed here. The substrate library profiling (Fig. 5) indicates that long-chain positively charged residues, such as Arg or Lys, are highly preferred at this position, followed by smaller aliphatic residues. Indeed, the exposed main-chain oxygen molecules from Gly24, Cys66, and Asn67 would disfavor negatively charged Asp and Glu residues. A similar arrangement is seen in Cys-EP, where the inhibitor P1 arginine points toward the solvent (17), and in the ervatamin cysteine proteases (from the jasmine plant), where no activity toward substrates with Asp at the P1 position is observed (62).
The S2 subsite of dionain-1 is a deep pocket formed primarily by the side chains of Thr70, Pro71, Ala139, Leu163, and Ala166 and lined by the main chain stretch of residues 163–166 and Trp27 (Fig. 8D). The volume of the pocket (279 Å3) is larger than in papain (255 Å3) because Ala139 is situated at the pocket bottom instead of Val133 (volumes measured by the program CASTp (63)).
In both Cys-EP and ervatamin A, the cavity-bottom residue is also Ala, but the volume of the S2 subsite is restricted by substitution of Pro71 (in dionain-1) with Met (Cys-EP) or Phe (ervatamin A). Additionally, the presence of Thr70 in dionain-1 in place of papain's Tyr67 widens the substrate-binding cleft from subsite S3 outward (Fig. 8D). This places dionain-1 as a promiscuous cysteine protease with a spacious S2 pocket and a large range of possible interactions to accommodate the P3 substrate residue and beyond. These structural features agree with the substrate library profiling in which large hydrophobic residues preferentially occupy the S2 site and low selectivity is seen for P3. In regard to the observed cleavages for substrates with Gly in P1 and concomitant preference for Lys/Arg in P3 (Fig. 5D), dionain adequately carries a negatively charged interface by Asn62 and Asp63 that could comprise a stabilizing S3 subsite. In fact, this interface results from a dionain-1-specific 2-residue insertion (Asp63-Arg64) that gives rise to a main-chain protrusion toward subsites S3-S4 (Fig. 8E). For the dionain-1 activity observed for substrates filling the S2 subsite with a Lys/Arg residue (Fig. 5B), the accommodation of the positive charges in S2 could be explained by the presence of Gln217 at the top of the pocket, which may act as a hydrogen bond donor. Such stabilization is observed in cathepsin B, which carries an Asp in this position for charge compensation (64, 65). Downstream of the scissile bond, the dionain-1 S1′ pocket is very similar to that of papain and is formed by the side chains of Ala142, Gln148, Asp164, and Trp189. From here, the substrate main chain may continue along the domain interface and engage in various interactions because a particular structural requirement for P2′ side chains (or further) is not apparent.
The overall globular structure of dionain-1 is highly similar to the papain-like fold differing primarily by the presence of three sequence inserts (Fig. 8E). The largest sequence insert is found from residue 176 to 181 in the C-terminal domain and leads to expansion of an antiparallel β-sheet, located on the distal top side of the active site cleft. In Cys-EP, a similar 5-residue insertion is found, but the functional role of this segment remains to be established.
Insights into the Pro-domain
The pro-domain of pro-dionain-1 was modeled by the structural prediction program PHYRE2 (66), justified by the conservation of known important sequence motifs for the pro-domain interaction surface with the core enzyme and intradomain salt bridge networks (15, 16) (Fig. 9A). The model suggested an architecture of the dionain-1 pro-domain similar to that of pro-papain and pro-caricain (Fig. 9B) and was combined with the mature dionain-1 enzyme structure. In general, the topology of the enzyme body does not alter significantly upon maturation in cysteine peptidases (root mean square deviation = 0.35 Å for papain and the pro-papain enzyme core Cα atoms) (18). Interesting differences between pro-papain, pro-caricain, and the putative pro-dionain-1 structure are observed in the pro-domain region that covers subsites S2′–S2 (Fig. 9C). In pro-papain, helix III is anchored to the core domain in the S2′ pocket by the hydrophobic pro-domain residues Phe70p, Phe78p, and Tyr82p (pro-domain residue numbers with “p” suffix; Fig. 9C). In pro-dionain-1, the hydrophobic patch is formed by Phe65p, Phe73p, and His77p. The ionizable nature of His77p is likely to play a role in the fine-tuning of automaturation, considering that an F70pH mutation in papain causes a pH shift in activation from 4.0 to 5.0 (16). The extended polypeptide stretch that follows helix III sterically blocks the active site. In papain, the stretch is composed of Thr82p-Gly83p-Ser84p-Leu85p, which bulges away from the catalytic cysteine. In pro-caricain, the similar peptide stretch is Val-Gly-Ser-Leu, but here, the Gly83 carbonyl group is in hydrogen bonding distance to Cys25, and the Ser84p Oγ is positioned toward the solvent. For pro-dionain-1, the model suggests that the active site would be blocked by Asn78p-Gly79p-Tyr80p-Lys81p which, similar to pro-caricain, place Gly79p in hydrogen-bonding distance to Cys26 but also reduce the insertion of the backbone of Tyr80p in the S1 pocket through a stabilized bulge, involving Arg74p and Asn78p (Fig. 9C, middle). Tentatively, these changes could facilitate the filling of the S2 pocket by positively charged Lys81p, which is larger than the corresponding Leu residues in papain and caricain.
FIGURE 9.
Features of the pro-dionain-1 structure from homologous modeling. A, sequence alignment of pro-dionain-1, pro-papain, and pro-caricain reveals highly conserved sequence motifs, including ERFNIN (boxed orange), salt bridges (indicated by squares), and hydrophobic packing motifs (indicated by spheres). Most divergence is seen in the pro-peptide region blocking the active site (subsite filling) and the sequence that follows. Sequence numbers indicate pro-dionain-1 numbering. B, pro-enzyme structures of thermostable pro-papain (left, PDB entry 3TNX) and pro-caricain (right, PDB entry 1PCI). Pro-dionain-1 (middle) was modeled using Phyre2, and the core enzyme was replaced with the crystal structure determined in this study. C, display of the subsite filling by the pro-peptides of the cognate enzymes. Hydrophobic patch residues are colored green, and active site cysteine and histidine are presented by orange and purple surfaces, respectively (pro-caricain carries the H164A mutation). For pro-dionain-1, subsite carbonyl groups in the core enzyme are represented by red surface colors. For pro-papain and pro-caricain, selected core enzyme-stabilizing carbonyl groups are represented by maroon surface colors.
The observed primary proteolytic sites in the pro-domain of dionain-1 during automaturation are VYK29p, GYK81p, and PLK85p, as previously mentioned. Lys27p locates to the loop region between pro-domain helices I and II and would be solvent-exposed according to the modeled structure of pro-dionain. Tyr26p, however, would be buried in the hydrophobic interstrand environment, participating in the hydrogen bond network of Arg24p and Asp65p (conserved motif; see Fig. 9A). Thus, loosening of pro-domain contacts upon acidification is predicted to cause unfolding and solvent exposure of Tyr28p, which would mediate binding to the substrate cleft of dionain-1 and hydrolysis at Lys29p. The cleavages at Lys81p and Lys85p both locate to the extended polypeptide chain spanning the non-primed subsites of dionain-1. In fact, Lys81p fills the S2 site in the proposed pro-domain model. For these peptide stretches to be cleaved intramolecularly, it is necessary to liberate the active site, loosen the stretch, and reverse the peptide chain direction. The reversal is not possible due to structural and spatial constraints, which support an intermolecular cleavage event at these positions. The final maturation to create the dionain-1 N terminus at Asp1 is also assumed to be mediated through trans-processing for the same reason.
Dionains of the Venus Flytrap
The presence of four distinct dionains in Venus flytrap digestive fluid has been established at the transcript and protein level; however, the regulation and function of each isoform remain unclear (5). The full-length sequence of pre-pro-dionain-2 is 85% identical to dionain-1 and does not differ in any aspects regarding the S2 pocket or the pro-peptide chain that blocks the active site cleft. This may represent a functional redundancy and a likely result of an ancestral gene duplication event. Pro-dionain 3 is 58% identical to dionain-1 within the aligned sequence (from Glu18p to Thr222), and dionain-4 is the most diverse, with only 48% identity (from Glu10p to Thr222). The S2 substrate specificity pockets of both dionain-3 and -4 differ slightly from that of dionain-1, with either Asp (dionain-3) or Glu (dionain-4) in place of Gln217 providing a full electrostatic stabilization for a positively charged P2 residue. The substrate-like pro-peptide stretches read GYKQR and GTKLK, maintaining the S2 pocket occupation by a pro-peptide Lys side chain. Although dionain-1 to -3 carry the pro-region ionizable His in the hydrophobic patch of the S2′ site (Fig. 9B), dionain-4 has the more common Tyr in this position and also lacks the pro-domain glycosylation signal. In addition, the active site dicysteine motif, which is present in dionain-1 to -3, is substituted with the more regular serine-cysteine sequence in dionain-4, suggesting a more distant relationship. The presence of multiple dionain isoforms in the digestive fluid of Venus flytrap is comparable with that of the enzyme-rich latex produced upon injury in several other plants. Therefore, the closely related dionains are most likely an attribute of a defense-related system in an ancestral lineage. The carnivorous adaption has been facilitated by exploitation of a pre-existing proteolytic machinery that, in combination with aspartic acid proteases and a serine carboxypeptidase (5), now constitutes an evolutionarily tailored digestive system for efficient breakdown and nutrient absorption.
The origin of the essential proteolytic activity associated with an efficient plant carnivorous lifestyle differs between species and reflects independent evolutionary events (1, 67). For D. muscipula, cysteine proteases account for the majority of the proteolytic activity in prey digestion (5, 41, 68), and recent studies strongly link chemical stimulation by the jasmonate phytohormones to the increased production and secretion of the dionains (69). Such results provide evidence for the fine-tuning of digestive fluid proteolytic activity by hormonal stimulation from prey decomposition, in addition to the mechanical stimulation trigged by trap closure. A similar induction of cysteine proteases in the leaves of R. communis in response to mechanical wounding or jasmonates (70) further supports the link between protective defense mechanisms and evolution of plant carnivory.
Conclusion
The unambiguous identification by N-terminal sequence analysis of dionain-1 in Venus flytrap digestive fluid corroborates the abundance of this protease in the medium. In this study, we used a targeted approach to provide the full-length pre-pro-dionain-1 cDNA sequence. The recombinant heterologous expression of pro-dionain-1 in P. pastoris was established to advance the knowledge of this digestive cysteine protease.
The findings presented here show that mature dionain-1 is structurally homologous to the Cys-EP and papain cysteine proteases from MEROPS family C1. It is activated at low pH by autocatalytic processing and dissociation of the inhibitory pro-domain. Homology modeling of the pro-dionain-1 structure suggests that access to the active site may be blocked by a stabilized Arg-Tyr bulge under occupancy of the S2 subsite by a pro-region Lys residue. The mature protease displays broad activity against a wide range of substrate sequences and prefers hydrophobic residues at the P2 position of the substrates and long positively charged residues at P1. Dionain-1 is inefficient at cleaving substrates carrying positively or negatively charged residues in P2. However, as gauged from the pro-domain organization, positively charged residues seem to fit into the S2 pocket but may lead to suboptimal positioning of the scissile bond for nucleophilic attack. The presence of dionain isoforms and a carboxypeptidase in the enzymatic sap of the Venus flytrap (5) complements dionain-1 in the enzymatic breakdown and highlights the concerted action of proteases for efficient protein and peptide hydrolysis during prey digestion.
This study provides the first functional and structural insight into the essential proteolytic component of the Venus flytrap “green stomach.” Dionain-1 is an efficient, stable, and broad digestive protease resembling defense-related plant proteases. It is activated by the acidic pH in the digestive fluid and is able to efficiently degrade prey proteins. Investigation of the dionain-1 pro-domain binding mode and inhibitory potential is of particular interest for future investigation. Because pro-domain peptides have been shown to confer potent and selective inhibition of homologous C1 proteases across species (71, 72), such knowledge could be applied in the specific targeting of proteolytic processes. This may be relevant in the inhibition of prey proteases released during the Venus flytrap digestion process or in the field of cancer biology and beyond, where homologous C1 proteases play pivotal roles in tissue remodeling, and selective inhibitors are of great interest to pharmaceutical and therapeutic research.
Author Contributions
J. J. E., K. W. S., L. R. T., and M. W. R. designed the study, analyzed the results, and wrote the paper. I. B. T. extracted and analyzed digestive fluid from Venus flytrap. L. R. T. designed and constructed vectors for expression of mutant proteins, and T. A. N. and L. R. purified the protein. M. W. R., L. R. T., and M. V. L., and C. S. characterized the enzyme in vitro. E. M. assisted with substrate library profiling. L. R. T. and T. G. crystallized the protein, and F. X. G. R. and I. G. F. determined and refined the x-ray structure. All authors approved the final version of the manuscript.
Acknowledgments
We thank Steen V. Petersen for the MALDI-TOF-TOF analyses and Johannes Overgaard for providing Drosophila flies. In addition, we thank DuPont for providing the IQFP substrates and Helene Etzerodt for assisting with the assays. Joan Pous and Xandra Kreplin from the Institut de Biologia Molecular de Barcelona/Institute for Research in Biomedicine Protein Crystallography Platform are further thanked for excellent technical assistance during crystallization experiments. We acknowledge the help provided by the European Synchrotron Radiation Facility and ALBA synchrotron local contacts.
This work was supported in part by grants from European, Danish, Spanish, and Catalan agencies (FP7-PEOPLE-2011-ITN-290246 “RAPID”; FP7-HEALTH-2012-306029-2 “TRIGGER”; BFU2012-32862; MDM-2014-0435; BIO2013-49320-EXP; and 2014SGR9). The Department of Structural Biology of the Institut de Biologia Molecular de Barcelona is a “María de Maeztu” Unit of Excellence of the Spanish Ministry of Economy and Competitiveness. Traveling and synchrotron data collection were supported in part by the European Synchrotron Radiation Facility. The authors declare that they have no conflicts of interest with the contents of this article.
The atomic coordinates and structure factors (code 5A24) have been deposited in the Protein Data Bank (http://wwpdb.org/).
- Z
- benzyloxycarbonyl
- AMC
- 7-amido-4-methylcoumarin
- Boc
- butyloxycarbonyl
- RFU
- relative fluorescence units
- IQFP
- internally quenched fluorogenic probe
- Tricine
- N-[2-hydroxy-1,1-bis(hydroxymethyl)ethyl]glycine
- PDB
- Protein Data Bank.
References
- 1. Ellison A. M., and Gotelli N. J. (2009) Energetics and the evolution of carnivorous plants: Darwin's “most wonderful plants in the world”. J. Exp. Bot. 60, 19–42 [DOI] [PubMed] [Google Scholar]
- 2. Kreuzwieser J., Scheerer U., Kruse J., Burzlaff T., Honsel A., Alfarraj S., Georgiev P., Schnitzler J. P., Ghirardo A., Kreuzer I., Hedrich R., and Rennenberg H. (2014) The Venus flytrap attracts insects by the release of volatile organic compounds. J. Exp. Bot. 65, 755–766 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Markin V. S., Volkov A. G., and Jovanov E. (2008) Active movements in plants: mechanism of trap closure by Dionaea muscipula Ellis. Plant Signal. Behav. 3, 778–783 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Escalante-Pérez M., Krol E., Stange A., Geiger D., Al-Rasheid K. A., Hause B., Neher E., and Hedrich R. (2011) A special pair of phytohormones controls excitability, slow closure, and external stomach formation in the Venus flytrap. Proc. Natl. Acad. Sci. U.S.A. 108, 15492–15497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Schulze W. X., Sanggaard K. W., Kreuzer I., Knudsen A. D., Bemm F., Thøgersen I. B., Bräutigam A., Thomsen L. R., Schliesky S., Dyrlund T. F., Escalante-Perez M., Becker D., Schultz J., Karring H., Weber A., Højrup P., Hedrich R., and Enghild J. J. (2012) The protein composition of the digestive fluid from the Venus flytrap sheds light on prey digestion mechanisms. Mol. Cell. Proteomics 11, 1306–1319 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Scala J., Iott K., Schwab D. W., and Semersky F. E. (1969) Digestive secretion of Dionaea muscipula (Venus's flytrap). Plant Physiol. 44, 367–371 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Paszota P., Escalante-Perez M., Thomsen L. R., Risør M. W., Dembski A., Sanglas L., Nielsen T. A., Karring H., Thøgersen I. B., Hedrich R., Enghild J. J., Kreuzer I., and Sanggaard K. W. (2014) Secreted major Venus flytrap chitinase enables digestion of Arthropod prey. Biochim. Biophys. Acta 1844, 374–383 [DOI] [PubMed] [Google Scholar]
- 8. Adlassnig W., Koller-Peroutka M., Bauer S., Koshkin E., Lendl T., and Lichtscheidl I. K. (2012) Endocytotic uptake of nutrients in carnivorous plants. Plant J. 71, 303–313 [DOI] [PubMed] [Google Scholar]
- 9. Greenaway W., May J., English S., and Whatley F. R. (1990) Metabolism of [15N]glycine and [13C]glycine by Dionaea muscipula studied by gas chromatography-mass spectrometry. New Phytol. 114, 581–588 [Google Scholar]
- 10. Rea P. A. (1984) Evidence for the H+-co-transport of d-alanine by the digestive glands of Dionaea muscipula Ellis. Plant Cell Environ. 7, 363–366 [Google Scholar]
- 11. Scherzer S., Krol E., Kreuzer I., Kruse J., Karl F., von Rüden M., Escalante-Perez M., Müller T., Rennenberg H., Al-Rasheid K. A. S., Neher E., and Hedrich R. (2013) The Dionaea muscipula ammonium channel DmAMT1 provides NH4+ uptake associated with Venus flytrap's prey digestion. Curr. Biol. 23, 1649–1657 [DOI] [PubMed] [Google Scholar]
- 12. Athauda S. B., Matsumoto K., Rajapakshe S., Kuribayashi M., Kojima M., Kubomura-Yoshida N., Iwamatsu A., Shibata C., Inoue H., and Takahashi K. (2004) Enzymic and structural characterization of nepenthesin, a unique member of a novel subfamily of aspartic proteinases. Biochem. J. 381, 295–306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Takahashi H., Matsumoto K., Nishii W., Muramatsu M., and Kubota K. (2009) Comparative Studies on the acid proteinase activities in the digestive fluids of Nepenthes, Cephalotus, Dionaea, and Drosera. Carniv. Plant Newsl. 38, 75–82 [Google Scholar]
- 14. Rawlings N. D., Waller M., Barrett A. J., and Bateman A. (2014) MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 42, D503–D509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Karrer K. M., Peiffer S. L., and DiTomas M. E. (1993) Two distinct gene subfamilies within the family of cysteine protease genes. Proc. Natl. Acad. Sci. U.S.A. 90, 3063–3067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Vernet T., Berti P. J., de Montigny C., Musil R., Tessier D. C., Ménard R., Magny M. C., Storer A. C., and Thomas D. Y. (1995) Processing of the papain precursor: the ionization state of a conserved amino acid motif within the Pro region participates in the regulation of intramolecular processing. J. Biol. Chem. 270, 10838–10846 [DOI] [PubMed] [Google Scholar]
- 17. Than M. E., Helm M., Simpson D. J., Lottspeich F., Huber R., and Gietl C. (2004) The 2.0 Å crystal structure and substrate specificity of the KDEL-tailed cysteine endopeptidase functioning in programmed cell death of Ricinus communis endosperm. J. Mol. Biol. 336, 1103–1116 [DOI] [PubMed] [Google Scholar]
- 18. Roy S., Choudhury D., Aich P., Dattagupta J. K., and Biswas S. (2012) The structure of a thermostable mutant of pro-papain reveals its activation mechanism. Acta Crystallogr. D Biol. Crystallogr. 68, 1591–1603 [DOI] [PubMed] [Google Scholar]
- 19. Konno K., Hirayama C., Nakamura M., Tateishi K., Tamura Y., Hattori M., and Kohno K. (2004) Papain protects papaya trees from herbivorous insects: role of cysteine proteases in latex. Plant J. 37, 370–378 [DOI] [PubMed] [Google Scholar]
- 20. Pechan T., Ye L., Chang Y., Mitra A., Lin L., Davis F. M., Williams W. P., and Luthe D. S. (2000) A unique 33-kD cysteine proteinase accumulates in response to larval feeding in maize genotypes resistant to fall armyworm and other Lepidoptera. Plant Cell 12, 1031–1040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Krüger J., Thomas C. M., Golstein C., Dixon M. S., Smoker M., Tang S., Mulder L., and Jones J. D. (2002) A tomato cysteine protease required for Cf-2-dependent disease resistance and suppression of autonecrosis. Science 296, 744–747 [DOI] [PubMed] [Google Scholar]
- 22. Tian M., Win J., Song J., van der Hoorn R., van der Knaap E., and Kamoun S. (2007) A Phytophthora infestans cystatin-like protein targets a novel tomato papain-like apoplastic protease. Plant Physiol. 143, 364–377 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Fromont-Racine M., Bertrand E., Pictet R., and Grange T. (1993) A highly sensitive method for mapping the 5′ termini of mRNAs. Nucleic Acids Res. 21, 1683–1684 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Petersen T. N., Brunak S., von Heijne G., and Nielsen H. (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods 8, 785–786 [DOI] [PubMed] [Google Scholar]
- 25. Bury A. F. (1981) Analysis of protein and peptide mixtures: evaluation of 3 sodium dodecyl sulfate-polyacrylamide gel-electrophoresis buffer systems. J. Chromatogr. 213, 491–500 [Google Scholar]
- 26. Shevchenko A., Tomas H., Havlis J., Olsen J. V., and Mann M. (2006) In-gel digestion for mass spectrometric characterization of proteins and proteomes. Nat. Protoc. 1, 2856–2860 [DOI] [PubMed] [Google Scholar]
- 27. Ishihama Y., Oda Y., Tabata T., Sato T., Nagasu T., Rappsilber J., and Mann M. (2005) Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein. Mol. Cell. Proteomics 4, 1265–1272 [DOI] [PubMed] [Google Scholar]
- 28. Thomas D. A., Francis P., Smith C., Ratcliffe S., Ede N. J., Kay C., Wayne G., Martin S. L., Moore K., Amour A., and Hooper N. M. (2006) A broad-spectrum fluorescence-based peptide library for the rapid identification of protease substrates. Proteomics 6, 2112–2120 [DOI] [PubMed] [Google Scholar]
- 29. Kabsch W. (2010) XDS. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Kabsch W. (2010) Integration, scaling, space-group assignment, and post-refinement. Acta Crystallogr. D Biol. Crystallogr. 66, 133–144 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Winn M. D., Ballard C. C., Cowtan K. D., Dodson E. J., Emsley P., Evans P. R., Keegan R. M., Krissinel E. B., Leslie A. G., McCoy A., McNicholas S. J., Murshudov G. N., Pannu N. S., Potterton E. A., Powell H. R., Read R. J., Vagin A., and Wilson K. S. (2011) Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. McCoy A. J. (2007) Solving structures of protein complexes by molecular replacement with Phaser. Acta Crystallogr. D Biol. Crystallogr. 63, 32–41 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Davis I. W., Leaver-Fay A., Chen V. B., Block J. N., Kapral G. J., Wang X., Murray L. W., Bryan Arendall W. B. 3rd, Snoeyink J., Richardson J. S., and Richardson D. C. (2007) MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 35, W375–W383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Emsley P., Lohkamp B., Scott W. G., and Cowtan K. (2010) Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Afonine P. V., Grosse-Kunstleve R. W., Echols N., Headd J. J., Moriarty N. W., Mustyakimov M., Terwilliger T. C., Urzhumtsev A., Zwart P. H., and Adams P. D. (2012) Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68, 352–367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Smart O. S., Womack T. O., Flensburg C., Keller P., Paciorek W., Sharff A., Vonrhein C., and Bricogne G. (2012) Exploiting structure similarity in refinement: automated NCS and target-structure restraints in BUSTER. Acta Crystallogr. D Biol. Crystallogr. 68, 368–380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Schüttelkopf A. W., and van Aalten D. M. (2004) PRODRG: a tool for high-throughput crystallography of protein-ligand complexes. Acta Crystallogr. D Biol. Crystallogr. 60, 1355–1363 [DOI] [PubMed] [Google Scholar]
- 38. Holm L., and Rosenström P. (2010) Dali server: conservation mapping in 3D. Nucleic Acids Res. 38, W545–W549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Pettersen E. F., Goddard T. D., Huang C. C., Couch G. S., Greenblatt D. M., Meng E. C., and Ferrin T. E. (2004) UCSF Chimera: A Visualization System for Exploratory Research and Analysis. J. Comput. Chem. 25, 1605–1612 [DOI] [PubMed] [Google Scholar]
- 40. Chen V. B., Arendall W. B. 3rd, Headd J. J., Keedy D. A., Immormino R. M., Kapral G. J., Murray L. W., Richardson J. S., and Richardson D. C. (2010) MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 66, 12–21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Takahashi K., Suzuki T., Nishii W., Kubota K., Shibata C., Isobe T., and Dohmae N. (2011) A cysteine endopeptidase (“dionain”) is involved in the digestive fluid of Dionaea muscipula (Venus's fly-trap). Biosci. Biotechnol. Biochem. 75, 346–348 [DOI] [PubMed] [Google Scholar]
- 42. Schmid M., Simpson D. J., Sarioglu H., Lottspeich F., and Gietl C. (2001) The ricinosomes of senescing plant tissue bud from the endoplasmic reticulum. Proc. Natl. Acad. Sci. U.S.A. 98, 5353–5358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Groves M. R., Taylor M. A., Scott M., Cummings N. J., Pickersgill R. W., and Jenkins J. A. (1996) The prosequence of procaricain forms an α-helical domain that prevents access to the substrate-binding cleft. Structure 4, 1193–1203 [DOI] [PubMed] [Google Scholar]
- 44. Taylor M. A., Baker K. C., Briggs G. S., Connerton I. F., Cummings N. J., Pratt K. A., Revell D. F., Freedman R. B., and Goodenough P. W. (1995) Recombinant pro-regions from papain and papaya proteinase IV are selective high affinity inhibitors of the mature papaya enzymes. Protein Eng. 8, 59–62 [DOI] [PubMed] [Google Scholar]
- 45. Vernet T., Khouri H. E., Laflamme P., Tessier D. C., Musil R., Gour-Salin B. J., Storer A. C., and Thomas D. Y. (1991) Processing of the papain precursor: purification of the zymogen and characterization of its mechanism of processing. J. Biol. Chem. 266, 21451–21457 [PubMed] [Google Scholar]
- 46. Revell D. F., Cummings N. J., Baker K. C., Collins M. E., Taylor M. A., Sumner I. G., Pickersgill R. W., Connerton I. F., and Goodenough P. W. (1993) Nucleotide sequence and expression in Escherichia coli of cDNAs encoding papaya proteinase ω from Carica papaya. Gene 127, 221–225 [DOI] [PubMed] [Google Scholar]
- 47. Bethune M. T., Strop P., Tang Y., Sollid L. M., and Khosla C. (2006) Heterologous expression, purification, refolding, and structural-functional characterization of EP-B2, a self-activating barley cysteine endoprotease. Chem. Biol. 13, 637–647 [DOI] [PubMed] [Google Scholar]
- 48. McQueney M. S., Amegadzie B. Y., D'Alessio K., Hanning C. R., McLaughlin M. M., McNulty D., Carr S. A., Ijames C., Kurdyla J., and Jones C. S. (1997) Autocatalytic activation of human cathepsin K. J. Biol. Chem. 272, 13955–13960 [DOI] [PubMed] [Google Scholar]
- 49. Vernet T., Tessier D. C., Richardson C., Laliberté F., Khouri H. E., Bell A. W., Storer A. C., and Thomas D. Y. (1990) Secretion of functional papain precursor from insect cells: requirement for N-glycosylation of the pro-region. J. Biol. Chem. 265, 16661–16666 [PubMed] [Google Scholar]
- 50. Mellor G. W., Patel M., Thomas E. W., and Brocklehurst K. (1993) Clarification of the pH-dependent kinetic behaviour of papain by using reactivity probes and analysis of alkylation and catalysed acylation reactions in terms of multihydronic state models: implications for electrostatics calculations and interpretation of the consequences of site-specific mutations such as Asp-158-Asn and Asp-158-Glu. Biochem. J. 294, 201–210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Mellor G. W., Thomas E. W., Topham C. M., and Brocklehurst K. (1993) Ionization characteristics of the Cys-25/His-159 interactive system and of the modulatory group of papain: resolution of ambiguity by electronic perturbation of the quasi-2-mercaptopyridine leaving group in a new pyrimidyl disulphide reactivity probe. Biochem. J. 290, 289–296 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Pinitglang S., Watts A. B., Patel M., Reid J. D., Noble M. A., Gul S., Bokth A., Naeem A., Patel H., Thomas E. W., Sreedharan S. K., Verma C., and Brocklehurst K. (1997) A classical enzyme active center motif lacks catalytic competence until modulated electrostatically. Biochemistry 36, 9968–9982 [DOI] [PubMed] [Google Scholar]
- 53. Vernet T., Tessier D. C., Chatellier J., Plouffe C., Lee T. S., Thomas D. Y., Storer A. C., and Ménard R. (1995) Structural and functional roles of asparagine 175 in the cysteine protease papain. J. Biol. Chem. 270, 16645–16652 [DOI] [PubMed] [Google Scholar]
- 54. Turk D., Guncar G., Podobnik M., and Turk B. (1998) Revised definition of substrate binding sites of papain-like cysteine proteases. Biol. Chem. 379, 137–147 [DOI] [PubMed] [Google Scholar]
- 55. Choe Y., Leonetti F., Greenbaum D. C., Lecaille F., Bogyo M., Brömme D., Ellman J. A., and Craik C. S. (2006) Substrate profiling of cysteine proteases using a combinatorial peptide library identifies functionally unique specificities. J. Biol. Chem. 281, 12824–12832 [DOI] [PubMed] [Google Scholar]
- 56. Hasnain S., Hirama T., Tam A., and Mort J. S. (1992) Characterization of recombinant rat cathepsin B and nonglycosylated mutants expressed in yeast. New insights into the pH dependence of cathepsin B-catalyzed hydrolyses. J. Biol. Chem. 267, 4713–4721 [PubMed] [Google Scholar]
- 57. Hanada K., Tamai M., Yamagishi M., Ohmura S., Sawada J., and Tanaka I. (1978) Isolation and characterization of E-64, a new thiol protease inhibitor. Agric. Biol. Chem. 42, 523–528 [Google Scholar]
- 58. Drenth J., Jansonius J. N., Koekoek R., Swen H. M., and Wolthers B. G. (1968) Structure of papain. Nature 218, 929–932 [DOI] [PubMed] [Google Scholar]
- 59. Noble M. A., Gul S., Verma C. S., and Brocklehurst K. (2000) Ionization characteristics and chemical influences of aspartic acid residue 158 of papain and caricain determined by structure-related kinetic and computational techniques: multiple electrostatic modulators of active-centre chemistry. Biochem. J. 351, 723–733 [PMC free article] [PubMed] [Google Scholar]
- 60. Katerelos N. A., Taylor M. A., Scott M., Goodenough P. W., and Pickersgill R. W. (1996) Crystal structure of a caricain D158E mutant in complex with E-64. FEBS Lett. 392, 35–39 [DOI] [PubMed] [Google Scholar]
- 61. Varughese K. I., Ahmed F. R., Carey P. R., Hasnain S., Huber C. P., and Storer A. C. (1989) Crystal structure of a papain-E-64 complex. Biochemistry 28, 1330–1332 [DOI] [PubMed] [Google Scholar]
- 62. Ghosh R., Chakraborty S., Chakrabarti C., Dattagupta J. K., and Biswas S. (2008) Structural insights into the substrate specificity and activity of ervatamins, the papain-like cysteine proteases from a tropical plant, Ervatamia coronaria. FEBS J. 275, 421–434 [DOI] [PubMed] [Google Scholar]
- 63. Dundas J., Ouyang Z., Tseng J., Binkowski A., Turpaz Y., and Liang J. (2006) CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues. Nucleic Acids Res. 34, W116–W118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Jia Z., Hasnain S., Hirama T., Lee X., Mort J. S., To R., and Huber C. P. (1995) Crystal structures of recombinant rat cathepsin B and a cathepsin B-inhibitor complex: implications for structure-based inhibitor design. J. Biol. Chem. 270, 5527–5533 [DOI] [PubMed] [Google Scholar]
- 65. Musil D., Zucic D., Turk D., Engh R. A., Mayr I., Huber R., Popovic T., Turk V., Towatari T., and Katunuma N. (1991) The refined 2.15 Å x-ray crystal structure of human liver cathepsin B: the structural basis for its specificity. EMBO J. 10, 2321–2330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Kelley L. A., and Sternberg M. J. (2009) Protein structure prediction on the Web: a case study using the Phyre server. Nat. Protoc. 4, 363–371 [DOI] [PubMed] [Google Scholar]
- 67. Albert V. A., Williams S. E., and Chase M. W. (1992) Carnivorous plants: phylogeny and structural evolution. Science 257, 1491–1495 [DOI] [PubMed] [Google Scholar]
- 68. Robins R. I., and Juniper B. E. (1980) The secretory cycle of Dionaea muscipula Ellis. 4. The enzymology of the secretion. New Phytol. 86, 401–412 [Google Scholar]
- 69. Libiaková M., Floková K., Novák O., Slováková L., and Pavlovič A. (2014) Abundance of cysteine endopeptidase dionain in digestive fluid of Venus flytrap (Dionaea muscipula Ellis) is regulated by different stimuli from prey through jasmonates. PLoS One 9, e104424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Maciel F. M., Salles C. M. C., Retamal C. A., Gomes V. M., and Machado O. L. T. (2011) Identification and partial characterization of two cysteine proteases from castor bean leaves (Ricinus communis L.) activated by wounding and methyl jasmonate stress. Acta Physiol. Plant 33, 1867–1875 [Google Scholar]
- 71. Reis F. C., Costa T. F., Sulea T., Mezzetti A., Scharfstein J., Brömme D., Ménard R., and Lima A. P. (2007) The propeptide of cruzipain: a potent selective inhibitor of the trypanosomal enzymes cruzipain and brucipain and of the human enzyme cathepsin F. FEBS J. 274, 1224–1234 [DOI] [PubMed] [Google Scholar]
- 72. Santamaria M. E., Arnaiz A., Diaz-Mendoza M., Martinez M., and Diaz I. (2015) Inhibitory properties of cysteine protease pro-peptides from barley confer resistance to spider mite feeding. PLoS One 10, e0128323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. García-Castellanos R., Marrero A., Mallorquí-Fernández G., Potempa J., Coll M., and Gomis-Ruth F. X. (2003) Three-dimensional structure of MecI: molecular basis for transcriptional regulation of staphylococcal methicillin resistance. J. Biol. Chem. 278, 39897–39905 [DOI] [PubMed] [Google Scholar]
- 74. Evans P. (2006) Scaling and assessment of data quality. Acta Crystallogr. D Biol. Crystallogr. 62, 72–82 [DOI] [PubMed] [Google Scholar]
- 75. Weiss M. S. (2001) Global indicators of x-ray quality. J. Appl. Crystallogr. 34, 130–135 [Google Scholar]
- 76. Karplus P. A., and Diederichs K. (2012) Linking crystallographic model and data quality. Science 336, 1030–1033 [DOI] [PMC free article] [PubMed] [Google Scholar]