Abstract
Wang et al. recently reported an in silico study of the trichodiene synthase (TDS) conversion of farnesyl diphosphate (FPP) to trichodiene (TD) (Wang et al., ACS Catal. 2017, 7, 5841–5846). Although the methods and level of theory used in that work are nearly identical to our own recent work on this system (Dixit et al., ACS Catal. 2017, 7, 812–818), Wang et al. reach rather different conclusions. The authors claimed to obtain a “very credible” mechanism for the biosynthesis of TD and optimized the optimal folding mode of FPP in the 1,6-ring closure in TDS. However, the folding mode of the FPP substrate that was presented contradicts well-established NMR and mass spectrometry data. Moreover, the authors make numerous incorrect statements regarding our earlier work.
Graphical abstract
The purpose of this commentary is to point out several errors in the recent work of Wang et al.1 on the biocatalysis in trichodiene synthase (TDS). These errors pertain to the claim of an “optimal” binding mode of the substrate farnesyl diphosphate (FPP), which, in fact, is inconsistent with experimental data, and numerous puzzling and plainly incorrect statements on our work that was published earlier in this Journal.2 Importantly, the findings and key reaction steps appear to directly contradict the reaction pathway established by NMR isotope-labeling experiments.
Previously, we reported on an extensive study utilizing a combined quantum mechanical and molecular mechanical (QM/MM)3–4 method on the biosynthetic formation of trichodiene (TD) by TDS (Scheme 1), culminating from laborious and thorough work carried out in a period of over ten years.2 In that study, we addressed numerous issues pertinent to the chemical control exerted by TDS to avoid by-product formation from a sequence of reactive carbocation intermediates, and highlighted several general catalytic principles that are potentially applicable to other terpenoid synthases. TDS has been extensively studied experimentally,5–8 providing an unprecedented amount of data for judicious comparison and validation of theory.9–10 TDS, as other terpenoid synthases, is extremely challenging to theoretical investigation because the precise, folded conformation of the linear substrate FPP in the binding pocket is unknown. There are multiple possible reaction channels leading to unwanted by-products, and it is necessary to use accurate quantum chemistry methods to treat the facile bond breaking and bond forming steps.2, 11–15 To overcome these difficulties, we have developed and reported in reference 2 a systematic strategy to treat these biosynthetic processes computationally.14, 16–17
In a subsequent publication, Wang et al. attempted to address the important role of substrate binding in TDS, with a particular emphasis on stereoselectivity.1 In that work, the authors followed essentially the same procedures, including exactly the same level of theory described in Ref. 2, but started by docking the large, open structure of FPP directly into the active site. Then, the authors tried to delineate how this initial fold determines chemical specificity. Unfortunately, the report of Ref. 1 contains egregious errors that we will address in the following, including (1) erroneous key results that directly contradict the previously established reaction pathway by NMR and mass spectrometric experiments, and (2) incorrect statements that, surprisingly, directly contradict the structures and written texts in Ref. 2.
(1) Erroneous key results that contradict well-established experimental NMR data
Wang et al. stated that the Rup conformation (Fig. 1b), a notation used in Ref. 1, gives the most reasonable C-O bond dissociation profile, and thus is the favorable folded substrate conformation in TDS. Hence, mode Rup is the most likely productive folding mode. This conclusion was based on energetic considerations (see Fig. 2a in Ref. 1), and indeed, was a main result of their work. Wang et al. utilized this conformation for FPP in their mechanistic studies. As a result, the C13-methyl group (C12 in Ref. 1) was transferred to C7 (Fig. S7 in Ref. 1). In contrast, it is, in fact, the C12-methyl group (standard numbering), trans to C9 of FPP, that migrates. This has been laboriously defined and elucidated by Cane and coworkers and others through extensive NMR and mass-spectrometry experiments, in which 13C-labeled mevalonate was used to generate labeled FPP, in turn, to form labeled TD (Scheme 2a).6, 8, 18–24 We further point out that it is well established that there is no free rotation around the C10–C11 bond prior to methyl group migration in the enzyme active site.19 Thus, C12 and C13 are non-equivalent. Note that the numbering for the C12/C13 atoms is inverted in Ref. 1, but to leave no room for misunderstanding, C12 in Ref. 1 is the methyl group “cis” to C9 of FPP, whereas the standard IUPAC style numbering for the cis carbon is C13 used in our and other publications.5 The correctly folded conformation used in our earlier work is shown schematically in Scheme 3a, whereas the incorrectly folded conformation used by Wang et al. is shown in Scheme 3b.
Due to the use of this incorrectly folded substrate conformation (Fig. 1b, Scheme 3b), the proton on the C6 carbon was transferred to the incorrect Re-face of C10 in Ref. 1, instead of Si. However, the use of a 3H-labeled FPP substrate [6-3H-12,13-14C]-farnesyl pyrophosphate unambiguously assigned the 3H-labeled TD in Scheme 2b as the exclusively product.19 Hence, both the proton and methyl transfer steps in the study by Wang et al. directly contradicted well-established stereochemistry from NMR and mass-spectrometry isotope-labeling experiments carried out by several groups: it is the Si-face, not Re, of C10 to which the proton on C6 migrates (Scheme 2b, 3a).6, 8, 18–24
The contradiction of the results from Ref. 1 with experiments specifically designed to answer the precise regio- and stereo-selectivity of hydrogen and methyl group migrations in TDS demonstrate that the conformation used in Ref. 1 does not correspond to the correct initial Michaelis complex. On the other hand, the erroneous reaction pathways followed in Ref. 1 (Scheme 3b) lend further support to our contention that the promiscuous binding in terpene synthases is one of the greatest challenges to theory due to weak interactions between the substrate and the hydrophobic pocket in the active site. In particular, the numerous weak dispersion interactions are difficult to model without using the most accurate quantum chemical methods, or dedicated dispersion correction schemes,25 in conjunction with carefully parametrized QM/MM interaction terms.26–27
(2) Incorrect statements of published work on TDS
For reasons that are puzzling to us, the authors of Ref. 1 incorrectly quoted several direct statements and structures that are plainly contrary to those reported in Ref. 2; unfortunately, the incorrect statements were used as evidence to justify their erroneous findings discussed above. In Ref. 1, Wang et al. stated that “Major et al. theoretically investigated the reaction pathway from cisoid 3R-NPP to TD derived from folding mode Rdown”. These authors created and illustrated their Rdown conformation that is duplicated in Fig. 1c, based on which they proclaimed that the cyclization step in our study “would directly lead to the formation of bisabolyl cation B with S configuration conflicted with the configuration demanded for the product”. There are at least two incorrect statements in this short paragraph of Ref. 1. First, we explicitly wrote, on page 815 of Ref. 2, “electrophilic attack at the C6–C7 double bond to produce (6R)-bisabolyl cation” (3 in Ref. 2). Moreover, (6R)-bisabolyl cation 3 is clearly illustrated in Scheme 1, Figs. 1, 2, and 3 of our original paper.2 We further note that only in this conformation, can the correct methyl migrations and proton transfer take place. Second, the simulations in Ref. 2 were carried out with the so-called folding mode Rdown, with the initial ring closure proceeding with an electrophilic attack on the Si-face of the C6–C7 double bond (Fig. 2). This results in the correct (6R)-bisabolyl cation. Wang et al. called this conformation “the conformation that Cane employed”, which is in fact deduced based on the extensive NMR data discussed above.6, 8, 18–24 The current correspondence is a re-affirmation of the modeling result for the Michaelis complex of TDS obtained in Ref. 2.
Wang et al. further proceeded with the statement that “the modeling (of Ref. 2) was based on the docking pose of intermediate D with exo conformation instead of the endo conformation derived from mode Rup of substrate FPP.” Indeed, our simulations commenced with cation 6 (D in Ref. 1). However, Wang et al. seemingly did not realize that the intermediate structure determined by X-ray diffraction and modeling was only used as the starting point for a series of modeling and optimization calculations that included more than 10,000 configurations. The structure adopted as a model for the Michaelis complex was chosen based on energetic and mechanistic considerations, an issue we have repeatedly addressed and warned in our general studies of enzymatic reactions in the past two decades.16, 28–36 Furthermore, we did not, and in fact, we never restrict the conformations of the substrate or the protein in computational studies of enzymatic processes, including the present conformational flexibility of the C9 methylene group. This group was indeed free to fluctuate between the so-called exo and endo forms. These substrate conformation fluctuations were in fact observed to take place frequently in our simulation, as the free energy barrier for such a flip is about 1–2 kcal/mol. Hence, it is inconsequential which ring conformation we started with, as long as it is docked correctly into the active site. If carried out properly, we suspect similar conformational fluctuations would be observed in other TDS simulations. What is crucial, however, is that the correct farnesyl cation conformation is used, and the one we obtained by retrobiosynthesis from cation 6 is shown in Fig. 1a and 2. Only this general conformation leads to agreement with all available experimental data.
A central question in terpenoid biosynthesis of natural products is the initial conformational state of the substrate of the linear FPP. There are numerous conformers coupled with 8 single bonds in FPP, which could yield a wide range of folded structures. A wonder of terpenoid biosynthesis is that each terpenoid synthase adopts a FPP conformation to generate one dominant natural product with precisely defined regio- and stereochemistry. Wang et al. criticize the protocol employed in our initial modeling of the Michaelis complex starting from the ligand bound X-ray crystal structure derived from cation 6 by contrasting their approach: “theoretical modeling starting from the native FPP substrate is more warrantable than that from a hypothetical intermediate.” Of the countless number of conformers of FPP, it is not clear which one may be considered as the “native FPP substrate” for a given enzyme configuration. Additionally, it is far from trivial to identify the dominating bound conformation of FPP. On the other hand, the intermediate structure (cation 6) that we employed as a starting point resembles the final product configuration, and it is a rather rigid molecule that is simpler to model. This intermediate contains key information regarding the mechanistic flow of the biosynthesis. Our structure-based retro-biosynthesis approach commences with docking, energy minimization, and molecular dynamics simulations of a key semi-rigid intermediate (cation 6). Subsequently, the intermediate is allowed to slowly relax to the FPP substrate in several sequential, mechanistically reverse, free energy molecular dynamics steps. This procedure provides the potentially correct fold of the substrate in the Michaelis complex, whose correctness can only be validated by the energetics of reaction pathway and the correct formation of the major and numerous side products.
Very few crystal structures exist of the native Michaelis complex of enzymes, since it will readily be decomposed into the product.29 Thus, the modeling of the initial substrate conformation and substrate-protein interactions is always necessary, and often, this is the determining factor for the quality of the computational approach, rather than a particular QM model used, be it ab initio wave function theory, density functional theory, or semiempirical methods. Unfortunately, there is no unique way of approaching this question, because vastly different structures and biochemical data are available for different enzymes. We have thoroughly discussed approaches to address this important question in the past,16, 28–36 and the work presented in Ref. 2 provided yet another strategy. The procedures on substrate conformation modeling, and the subsequent conclusions reached by Wang et al. may serve as a counter-example, illustrating the pitfalls of improperly modeling the initial Michaelis complex.1 Indeed, one should include all experimental information as far as modeling is concerned, since enzyme Michaelis complex construction has not yet reached a stage where it can predict function in an entirely ab initio manner.
Enzyme catalysis
We finally comment on the free energy profile obtained by Wu and co-workers, which contains what may be called death-traps in terpenoid biochemistry. The conversion of bisabolyl cation (3) to the subsequent intermediates has a significant barrier according to Wu and co-workers. This is problematic since this bottleneck appears after the key intermediate (3), which branches into pathways leading to many byproducts.2 This issue of the life-time of 3 has been carefully discussed in Ref. 2. We stress that a significant barrier along the carbocation reaction coordinate poses intractable problems for the cyclization cascade, due to the extremely high reactivity of carbocations. In fact, the carbocation chemistry has been shown to proceed with small barriers or to be barrierless processes.2, 11, 13–14, 16, 37 The appearance of a key mechanistic branching intermediate, bisabolyl cation 3, immediately before a bottle-neck in the calculations of Ref. 1 can only increase the likelihood of by-product formation in the trichodiene biosynthetic pathway. The interplay of balancing kinetics and thermodynamics in carbocation cyclization cascades has been discussed for this and other terpenoid synthases.2, 16
The fact that the initial folding of the substrate in terpene synthases is crucial is well known,38 as is discussed in our work on TDS.2 Clearly, without the correct initial fold, the enzyme is unlikely to form the correct product. One may wonder why the same methodology at the same level of theory using M06-2X/6-31G(d)39 in both studies yielded such vastly different qualitative and quantitative pictures on the carbocation cyclization cascade by TDS. In Ref. 2 the barrier from the key intermediate bisabolyl cation 3 is just about 1 kcal/mol leading all the way to cation 6 (D in Ref. 1), whereas, in Ref. 1 there are multiple barriers as large as 11.8 and 10.8 kcal/mol for the same conversion. We have addressed the potential biological consequences of this difference, but from a computational perspective, a possible reason for this difference, if all simulations were performed correctly, is that the protein environment, e.g. the fold of the FPP substrate and its subsequent cation intermediates, is different. Since we identified a protein-substrate conformational state that yields a lower free energy barrier pathway, there is no conceivable reason to think that the real biological system (i.e. TDS) would chose a reaction pathway with multiple barriers that are more than 10 kcal/mol higher in energy to perform multiple reaction steps involving highly reactive intermediates. Hence, we disagree with the assertion by the authors of Ref. 1 that the regio- and stereochemistry of terpenoid synthases is determined only by the FPP conformation and the selective activation of the initial farnesyl-phosphate bond, whereas the enzyme environment plays little role in regulating the subsequent carbocation cyclization processes. The selection of the FPP fold based on the initial farnesyl-phosphate bond cleavage in the work of Ref. 1 certainly did not produce the correct intermediates and final product in TDS.
Acknowledgments
This contribution has been supported by the Israel Science Foundation (Grant # 2146/15), the National Institutes of Health (GM46736), and National Natural Science Foundation of China (Grant #21533003)
Footnotes
Author Contributions
D.T.M and J.G. wrote the paper with input from all authors.
References
- 1.Wang YH, Xie H, Zhou J, Zhang F, Wu R. ACS Catal. 2017;7:5841–5846. [Google Scholar]
- 2.Dixit M, Weitman M, Gao J, Major DT. ACS Catal. 2017;7:812–818. doi: 10.1021/acscatal.6b02584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Warshel A, Levitt M. J Mol Biol. 1976;103:227–49. doi: 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
- 4.Gao J. Methods and Applications of Combined Quantum Mechanical and Molecular Mechanical Potentials. In: Lipkowitz KB, Boyd DB, editors. Reviews in Computational Chemistry. Vol. 7. VCH Publishers; New York: 1995. pp. 119–185. [Google Scholar]
- 5.Cane DE, Swanson S, Murthy PPN. J Am Chem Soc. 1981;103:2136–2138. [Google Scholar]
- 6.Cane DE. Chem Rev. 1990;90:1089–1103. [Google Scholar]
- 7.Cane DE, Chiu HT, Liang PH, Anderson KS. Biochemistry. 1997;36:8332–8339. doi: 10.1021/bi963018o. [DOI] [PubMed] [Google Scholar]
- 8.Cane DE. Sesquiterpene Biosynthesis: Cyclization Mechanisms. In: Cane DE, editor. Comprehensive Natural Products Chemistry: Isoprenoids Including Carotenoids and Stereoids. Vol. 2. Pergamon Press; Oxford: 1999. pp. 155–200. [Google Scholar]
- 9.Hong YJ, Tantillo DJ. Org Lett. 2006;8:4601–4604. doi: 10.1021/ol061884f. [DOI] [PubMed] [Google Scholar]
- 10.Hong YJ, Tantillo DJ. J Am Chem Soc. 2014;136:2450–2463. doi: 10.1021/ja4106489. [DOI] [PubMed] [Google Scholar]
- 11.Weitman M, Major DT. J Am Chem Soc. 2010;132:6349–6360. doi: 10.1021/ja910134x. [DOI] [PubMed] [Google Scholar]
- 12.Tantillo DJ. Nat Prod Rep. 2011;28:1035–1053. doi: 10.1039/c1np00006c. [DOI] [PubMed] [Google Scholar]
- 13.Major DT, Weitman M. J Am Chem Soc. 2012;134:19454–19462. doi: 10.1021/ja308295p. [DOI] [PubMed] [Google Scholar]
- 14.Major DT, Freud Y, Weitman M. Curr Opin in Struct Biol. 2014;21:25–33. doi: 10.1016/j.cbpa.2014.03.010. [DOI] [PubMed] [Google Scholar]
- 15.Major DT. ACS Catal. 2017;7:5461–5465. doi: 10.1021/acscatal.6b02584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rajamani R, Gao J. J Am Chem Soc. 2003;125:12768–12781. doi: 10.1021/ja0371799. [DOI] [PubMed] [Google Scholar]
- 17.Allemann RK, Young NJ, Ma S, Truhlar DG, Gao J. J Am Chem Soc. 2007;129:13008–13013. doi: 10.1021/ja0722067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Jones ERH, Lowe G. J Chem Soc. 1960;0:3959–3962. [Google Scholar]
- 19.Arigoni D, Cane DE, Muller B, Tamm C. Helv Chim Acta. 1973;56:2946–2949. [Google Scholar]
- 20.Hanson JR, Marten T, Siverns M. J Chem Soc, Perkin Trans. 1;1974:1033–1036. [Google Scholar]
- 21.Cane DE, Levin RH. J Am Chem Soc. 1975;98:1183–1188. doi: 10.1021/ja00421a022. [DOI] [PubMed] [Google Scholar]
- 22.Tamm C, Breitenstein W. The Biosynthesis of Trichotecene Mycotoxins. In: Steyn PS, editor. The Biosynthesis of Mycotoxins: A Study in Secondary Metabolism. Acadmic Press; New York: 1980. pp. 69–104. [Google Scholar]
- 23.Nabeta K, Kawakita K, Yada Y, Okuyama H. Biosci Biotech Biochem. 1993;57:792–798. [Google Scholar]
- 24.Christensen DJ, Poulter CD. Bioorg Med Chem. 1994;2:631–637. doi: 10.1016/0968-0896(94)85011-9. [DOI] [PubMed] [Google Scholar]
- 25.Grimme S, Hansen A, Brandenburg JG, Bannwarth C. Chem Rev. 2016;116:5105–5154. doi: 10.1021/acs.chemrev.5b00533. [DOI] [PubMed] [Google Scholar]
- 26.Gao JL, Xia XF. Science. 1992;258:631–635. doi: 10.1126/science.1411573. [DOI] [PubMed] [Google Scholar]
- 27.Gao J. Modeling the Hydrogen Bond. In: Smith DA, editor. ACS Symposium Series. Vol. 569. American Chemical Society; Washington, DC: 1994. pp. 8–20. [Google Scholar]
- 28.Wu N, Mo Y, Gao J, Pai EF. Proc Natl Acad Sci USA. 2000;97:2017–2022. doi: 10.1073/pnas.050417797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Devi-Kesavan LS, Gao J. J Am Chem Soc. 2003;125:1532–1540. doi: 10.1021/ja026955u. [DOI] [PubMed] [Google Scholar]
- 30.Garcia-Viloca M, Truhlar DG, Gao J. Biochemistry. 2003;42:13558–13575. doi: 10.1021/bi034824f. [DOI] [PubMed] [Google Scholar]
- 31.Wong K, Gao J. Biochemistry. 2007;46:13352–13369. doi: 10.1021/bi700460c. [DOI] [PubMed] [Google Scholar]
- 32.Ma S, Devi-Kesavan LS, Gao J. J Am Chem Soc. 2007;129:13633–13645. doi: 10.1021/ja074222+. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Major DT, Heroux A, Orville AM, Valley MP, Fitzpatrick PF, Gao J. Proc Natl Acad Sci USA. 2009;106:20736–20739. doi: 10.1073/pnas.0911416106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lin Y-l, Gao J. J Am Chem Soc. 2011;133:4398–4403. doi: 10.1021/ja108209w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wong KY, Gao J. FASEB J. 2011;278:2579–2595. doi: 10.1111/j.1742-4658.2011.08187.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Dixit M, Das S, Mhashal AR, Eitan R, Major DT. Practical Aspects of Multiscale Classical and Quantum Simulations of Enzyme Reactions. In: Voth GA, editor. Methods in Enzymology. Vol. 577. Academic Press; Burlington: 2016. pp. 251–286. [DOI] [PubMed] [Google Scholar]
- 37.Gao D, Pan YK, Byun K, Gao J. J Am Chem Soc. 1998;120:4045–4046. [Google Scholar]
- 38.Christianson DW. Chem Rev. 2006;106:3412–3442. doi: 10.1021/cr050286w. [DOI] [PubMed] [Google Scholar]
- 39.Zhao Y, Truhlar DG. Theor Chem Acc. 2007;120:215–241. [Google Scholar]