An efficient algorithmic workflow was developed to optimize seven process parameters of a homogeneous catalytic system with minimal experimental effort.
Abstract
In times of a warming climate due to excessive carbon dioxide production, catalytic conversion of carbon dioxide to formaldehyde is not only a process of great industrial interest, but it could also serve as a means for meeting our climate goals. Currently, formaldehyde is produced in an energetically unfavourable and atom-inefficient process. A much needed solution remains academically challenging. Here we present an algorithmic workflow to improve the ruthenium-catalysed transformation of carbon dioxide to the formaldehyde derivative dimethoxymethane. Catalytic processes are typically optimised by comprehensive screening of catalysts, substrates, reaction parameters and additives to enhance activity and selectivity. The common problem of the multidimensionality of the parameter space, leading to only incremental improvement in laborious physical investigations, was overcome by combining elements from machine learning, optimisation and experimental design, tripling the turnover number of 786 to 2761. The optimised conditions were then used in a new reaction setup tailored to the process parameters leading to a turnover number of 3874, exceeding by far those of known processes.
Introduction
In this century, our ecosystem faces severe problems such as global warming, environmental pollution and resource depletion. The negative impact of humankind as well as its responsibility in solving these problems can no longer be ignored.1,2 Besides the necessity for future-oriented global politics,3,4 both the scientific community and the chemical industry must provide answers to crucial questions regarding sustainable process development, alternative energies as well as recycling of waste and pollutants.5–9 In this context, more efficient techniques must be developed and applied in research to reduce time, cost and resources.10–12
In catalytic investigations, system optimisation is typically approached by one-factor-at-a-time (OFAT) methods, successively screening along one parameter axis. Once optimised, a parameter is kept constant for the subsequent experiments. In this univariate analysis, variables are treated as being independent of each other. Beside the vast number of experiments that must be performed, local maxima with higher performances might be missed. Consequently, algorithm-based screening and optimisation techniques have been among the fastest growing research areas in recent years.13–24 Considering the interactions of parameters, optimised results can be achieved with minimal experimental effort.25,26
Recently, we applied a univariate optimisation approach including several hundred catalytic reactions to improve the selective ruthenium-catalysed transformation of carbon dioxide to dimethoxymethane (DMM) reaching a turnover number (TON) of 786 (Scheme 1).27,28 The product DMM itself is a high value feedstock for biofuels, but can also be hydrolysed yielding formaldehyde and methanol or directly employed as a formaldehyde synthon.29,30 Beside the desired product, only methyl formate (MF) was formed with TONs of up to 1290 (Scheme 1).27 Previously, two studies on the selective hydrogenation of CO2 by the group of Klankermayer showed the formation of DMM and MF by using a homogeneous ruthenium catalyst31 with TONs of 214 and 104 or a cobalt catalyst32 with TONs of 157 and 37, respectively. Further selective reductions toward the formaldehyde oxidation state were reported utilising hydroboration,33–37 hydrosilylation,38,39 and frustrated Lewis pairs,40,41 however, being mainly of academic interest due to the stoichiometric use of reducing reagents.
Inspired by previous reports, which underline the importance of a holistic view on the results obtained,13,19 we now utilised multivariate analysis42,43 to further optimise the investigated catalysis. Based on the large amount of experimental data and the dependency of the catalytic system on seven different parameters, we envisioned to increase its performance by modelling and predicting conditions optimally accounting for parameter interaction. For this purpose, we devised a multi-step process combining elements from machine learning, optimisation and experimental design (design of experiments, DoE). Fig. 1 gives a schematic overview of the easy-to-use algorithmic workflow, which was developed in this work and is applicable for any catalytic screening. Experimental data (Fig. 1(I)) was modelled using the random forest (RF) algorithm (Fig. 1(II)) to identify promising subspaces with high catalytic performance (Fig. 1(III)). To better understand the origin of the exceptional activity and to extend the amount of data for further modelling, the subspace was augmented (Fig. 1(IV)) by additional experiments (Fig. 1(V)) that were based on experimental design. Starting from these first experiments, an iterative workflow – consisting of DoE (Fig. 1(V) and (VI)), optimisation (relaxation; Fig. 1(VII) and (VIII)) and evaluation (Fig. 1(IX)) – was applied until the final optimum was reached (Fig. 1(X)).
Results and discussion
Catalytic system
The selective transformation of CO2 to DMM was performed using the ruthenium catalyst [Ru(N-triphosPh)(tmm)] (tmm = trimethylenemethane dianion) for hydride transfer and Al(OTf)3 to facilitate the acetalisation with methanol (Scheme 1). Beneficially, the robust catalyst can be obtained on a large scale in a simple three-step procedure, which underlines its suitability for industrial and large scale applications.27 The catalysis is mainly affected by the reaction temperature (T), partial pressure of H2 (pH2) and CO2 (pCO2), reaction time (t), the amount of catalyst (ncat) and Lewis acid (nadd) as well as the reaction volume (V).
Experimental data analysis
Following the rationale outlined in the previous section, the optimisation started with a corpus of 49 unique settings of the seven process factors (X: X1, X2, …, X7 = T, pH2, pCO2, t, ncat, nadd, V), two responses (Y: Y1, Y2 = TONDMM, TONMF) and 144 observations, which are measured values of the responses (Table 1, Fig. 1(I), ESI Table 1†).27 Each setting was run as a triple replicate, except for three settings that were realised as simple replicates, thus making up the 144 cases of the dataset. Goal of the optimisation was finding reaction conditions , that maximise TONDMM (for a theoretical introduction on this approach, see the Methods section). As process factors, six of the previously investigated parameters, namely temperature, partial pressure of H2 and CO2, reaction time as well as the amount of catalyst and Lewis acid, proved suitable for the optimisation. The influence of varying the nature of the Lewis acid was neglected here, because statistical analysis of the dataset revealed Al(OTf)3 as the only promising candidate. Instead, the volume of the catalysis solution was considered for the multivariate analysis to account for mass transfer phenomena and gas solubility effects (concentration effects) potentially influencing the performance of the catalyst.
Table 1. Factors and responses of the dataset27.
Entry | Variable | Mean | Median | Min | Max |
1 | T (°C) | 89.2 | 90 | 20 | 120 |
2 | p H2 (bar) | 82.1 | 90 | 40 | 100 |
3 | p CO2 (bar) | 17.9 | 20 | 5 | 40 |
4 | t (h) | 26.5 | 18 | 1 | 168 |
5 | n cat (μmol) | 1.3 | 1.5 | 0 | 3 |
6 | n add (μmol) | 5.4 | 6.25 | 0 | 12.5 |
7 | V (mL) | 0.5 | 0.5 | 0.25 | 0.5 |
8 | TONDMM | 275 | 263 | 0 | 906 |
9 | TONMF | 145 | 71 | 0 | 1377 |
Modelling and analysis of datasets based on empirical optimisation (e.g. OFAT) is complex and more difficult than estimating parametric surrogate models from well-designed data. The difficulties arise primarily from non-linearities, which simple parametric models cannot appropriately describe. Furthermore, OFAT variations often and inadvertently lead to co-varying and thereby confounding the effects of process parameters. For instance, in the present dataset the process parameters ncat and nadd were found to be highly correlated with a Pearson correlation coefficient r(ncat – nadd) = 0.895, rendering the data non-informative in terms of the independent effects of ncat and nadd.
Next, the problem for these datasets is finding an appropriate functional representation, f[combining circumflex](), of the target response as a function of the process factors Xi, that is TONDMM = f(T,pH2,pCO2,t,ncat,nadd,V) + ε, without making any prior assumptions about the analytical form of the ‘true’ function f(). Here, we used RF as a powerful non-linear and easy-to-use method for the empirical model building of complex datasets (Fig. 1(II)).44,45
The RF models describe 79% of the responses' variance on average (R2(TONDMM) = 0.83; R2(TONMF) = 0.74), which is a good result given the heterogeneous nature of the dataset. Further, tuning the RF hyperparameters using 10-fold cross validation with consecutive blocks (mtry* = 2 with R2 = 0.83) revealed that the default RF hyperparameters describe the data appropriately. The effect structure of the models can be conveniently explored by plotting the RF model predictions f[combining circumflex](X) against the process factors X1, X2, …, X7. Fig. 2 shows the effects of the process parameters T, pCO2, ncat and nadd, given the median values pH2 = 90 bar, t = 18 h and V = 0.5 mL, as a conditional trellis plot.
As a tree-based ensemble method, RF splits and mean-aggregates the experimental space into hyperrectangles, the consequence being that the RF model surface becomes non-smooth (Fig. 2). This property of RF is a particular strength when it comes to identifying promising domains of the experimental space and was another motivation for choosing RF as modelling technique.
Evidently, ncat and nadd exert strong non-linear, step like negative effects on the process performance (TONDMM), dividing the ncat × nadd space into domains of different performance. The pressure, pCO2, reveals a positive and temperature, T, a small convex effect, both, however, negligible compared to the dominant effects of ncat and nadd. At this point, the erroneous impression may arise that the heuristic interpretation of the experimental data could lead to a further improvement by a trivial reduction of the amount of catalyst, which is often reduced to minimise catalyst deactivation. However, a closer look at the experimental data shows that a reduction in the amount of catalyst without changing other parameters leads to a decrease in catalytic activity with respect to DMM. This case shows exemplarily how important multivariate analytical methods can be, because they can easily identify complex correlations of process parameters that are not detectable by a univariate analysis of the scientist.
With the condition TONDMM > 400 deduced from the landscape of TONDMM over ncat × nadd (Fig. 2), a subset of 26 observations with 9 unique settings was selected from the data (Fig. 1(III)). In this subset, the parameters T, pH2, pCO2 and t were found constant at T = 90 °C, pH2 = 90 bar, pCO2 = 20 bar and t = 18 h, thereby suggesting to conditionally optimise ncat, nadd and V first, while keeping T, pH2, pCO2 and t constant for the time being.
First DoE
These 9 candidate points (identified unique settings mentioned above) were subsequently augmented by an additional number of 6 points to fully support all second order effects of ncat, nadd and V (Fig. 1(IV)). The intention behind this augmentation was to render the formerly confounded effects of ncat and nadd estimable as well as, assuming high complexity, to allow for interactions and non-linearities of the process parameters as a solid basis for further optimisation (Fig. 1(V)).
The 6 augmentation trials were realised as triple replicates in the lab, added to the 9 settings already available and the responses (TONDMM, TONMF) were modelled as a function of the least square parameters with stepwise ordinary least squares (OLS; Fig. 1(VI)). Both models accurately describe the data within the replication error and explain 91% (R2(TONDMM) = 0.91; DF = 37) and 97% (R2(TONMF) = 0.97; DF = 35) of the responses' variance with DF denoting the degrees of freedom (number of data points minus number of estimated model parameters).
The local effects of ncat, nadd and V are depicted in Fig. 3A as response surface trellis plot. Again, there is a strong negative effect for ncat along with positive effects for nadd and V. Together with the positive, synergistic effect between nadd and V, the effect structure suggests to decrease ncat and to increase nadd and V to further maximise the catalytic performance beyond the best result of the first DoE (Fig. 4A, entry 4).
First relaxation
To optimise towards the direction of maximal improvement, following the procedure outlined in the Methods section [eqn (5)], the experimental space was relaxed with 10% step size. The triple obtained from relaxing the design space together with the achieved experimental results are listed in Fig. 4B (Fig. 1(VII) and (VIII); see the Methods section).
The joint condition max(TONDMM), max(TONMF) was best met by the 20% relaxation trial (Fig. 4B, entry 2), and the factor setting , and V* = 0.550 mL thus became the reference point for further optimisation. The 30% relaxation trial was very poor, indicating that a local maximum had been exceeded (Fig. 4B, entry 3). The pronounced drop in activity, resulting most likely from the reduced catalyst loading, indicates a molecular deactivation pathway of the catalytically active species due to potential inhibitors, such as carbon monoxide, moisture and oxygen, which are probably present in low concentrations.
At this point, there were two alternatives to proceed: (1) create an experimental design around , and V* to fully identify the topology around the relaxation point at the desired resolution (complexity). (2) Consider , and V* = 0.550 mL as locally optimal and switch to optimising the candidates T, pH2, pCO2 and t, which had so far been kept constant.
The poor outcome of the third relaxation trial (Fig. 4B, entry 3) showed that not much was to be expected from exploring the three-dimensional environment of the 20% relaxation trial any further. Therefore, option 2 was chosen and the 20% relaxation trial became the reference point for optimising the candidates T, pH2, pCO2 and t.
Second DoE
A small linear design [eqn (2)] with 5 runs in the ranges listed in Fig. 4C was created with the reference point , and V* = 0.550 mL, T* = 90 °C, , bar, t* = 18 h at the design centre as replicate (Fig. 1(V); see the Methods section). The experiments were realised in the lab as triple replicate to provide a measure of accuracy (Fig. 1(VI)).
The measured responses (TONDMM, TONMF) were linearly modelled as a function of the process parameters with stepwise OLS. The models explain 92% (R2(TONDMM) = 0.92; DF = 16) and 78% (R2(TONMF) = 0.78; DF = 18) variance of the responses thus indicating a large signal-to-noise ratio for TONDMM and to a lesser extent for TONMF.
Fig. 3B shows the linear effects of the process parameters on TONDMM as trellis response surface plot. The factors T and pH2 both have strong positive effects on TONDMM, whereas t reveals only a small positive and pCO2 a moderate negative effect on TONDMM. Optimal conditions were found in the upper left panel and these are the conditions of the top candidate found in the design list with T, pH2, t at the upper and pCO2 at the lower bound, yielding respective TONs for DMM and MF of 2610 and 2356 (Fig. 4C, entry 4).
Second relaxation
Following eqn (5), the experimental space was relaxed in 25% and 50% steps and the relaxation trials were experimentally realised in the lab (Fig. 4D and 1(VII); see the Methods section).
Again, we saw a small improvement of the 25% relaxation trial (Fig. 4D, entry 1) compared with the best candidate from the second DoE (Fig. 4C, entry 4), whereas the 50% relaxation candidate performed comparatively poorly (Fig. 4D, entry 2; Fig. 1(VIII)). With these relaxation trials, we reached the technical limits of our setup regarding hydrogen gas pressure, and therefore, the conditions of the 25% relaxation experiment can be considered locally optimal given the constraints of technical feasibility (Fig. 1(IX) and (X)). We would like to point out that the catalytic conditions optimised by the here presented strategy may still represent a local maximum. Modification of the catalyst and the additive might also result in further improvement, but as demonstrated in this work the identification of high-performance catalytic conditions should be a prerequisite for a design strategy of new catalysts.
A complete overview of the results obtained at each step of the optimisation project is given in Fig. 4E, overall tripling the initial TONDMM value of 786 to a final value of 2761. As illustrated, the standard deviation (SD) of TONDMM tends to increase with increasing mean value of TONDMM (Fig. 4E), which might be a joint effect from decreasing ncat and increasing both nadd and V over the course of the optimisation project. Decreasing ncat and increasing V is equivalent to reducing the concentration of the catalyst, which presumably renders the system more susceptible to random disturbances by catalyst deactivation, thereby providing an explanation for the observed increase of the standard deviation. Simple Spearman rank correlation analysis of the relationship between SD(TONDMM), SD(TONMF) and the process parameters Xi supports this hypothesis by revealing a negative and a positive association of SD(TONDMM) with ncat and V, respectively (ESI Table 26†).
Another interesting result in Fig. 4E refers to the independent replication error. The linear design of the second DoE includes two independent settings of the 20% reference as centre points with each point measured as triple replicate (ESI Tables 11 and 17†). This sextet from the second DoE (Fig. 4C, entry RS1) excellently matches the 20% relaxation triple (Fig. 4B, entry 2) indicating good repeatability and reliability of the system (Fig. 4E, ESI Tables 27 and 28†). In a similar way, the best candidate from the second DoE (Fig. 4C, entry 4) has been independently replicated when running the second relaxation sequence (Fig. 4D, entry RS2) and both replicates turned out identical within the experimental error (Fig. 4E, ESI Tables 29 and 30†).
Technical adaption
The results of the non-biased mathematical modelling approach presented in this study revealed that a better catalytic performance is inter alia strongly correlated to a combination of lower catalyst loadings and higher reaction volumes. The heuristic interpretation of this finding indicates that the poor solubility of hydrogen gas in methanol and mass transfer might be limiting factors within our experimental setup. We anticipated that a larger reaction volume, a higher surface to volume ratio and improved mixing would lead to better mass transfer and thus designed an experimental setup accordingly to enhance the catalytic performance. With this upscale autoclave, the TON for DMM increased to 3874, while the TON for MF reached a value of 1445, resulting in a higher selectivity toward DMM (ESI Table 21†). The result of this technical adaption shows that while the reaction is already optimised to a high degree with respect to reaction parameters, further improvements can be expected by focusing on engineering aspects of the reaction setup.
Conclusion
We demonstrated the power of multivariate optimisation for catalytic processes over the usually applied cumbersome one-factor-at-a-time method. In the homogeneously catalysed transformation of CO2 to DMM (dimethoxymethane) and MF (methyl formate) using the ruthenium-triphos complex [Ru(N-triphosPh)(tmm)], the TON (turnover number) for DMM was drastically increased to 2761 (with it: TONMF 1769) by an easy-to-use algorithmic workflow combined with only a small number of catalytic experiments. Given the complexity of the transformation, which depends on seven parameters, conventional OFAT screening techniques would have been very costly and time-consuming, with uncertain outcome.
Starting from catalytic data using RF (random forest) for empirical model building, an experimental subspace was identified and subsequently augmented to render the effects of a first set of three process factors estimable. Modelling and optimisation, followed by relaxation led to a sequence of relaxation trials with one candidate assumed to be locally optimal. With this candidate as reference, a linear design of the remaining four variables yielded another substantial improvement. Relaxation of the second design further enhanced the catalytic performance, thereby reaching the technical limits of the setup.
The optimised conditions were used in a specifically designed experimental setup and the highest TON for DMM of 3874 (with it: TONMF 1445) was obtained, which is, to the best of our knowledge, the by far highest value reported in the investigated catalysis.
Methods
Theoretical introduction
Experimental design (design of experiments, DoE) methods can be used to study the joint effects of several parameters X: X1, X2, …, XI on response Y.26,42,43 This can formally be written as:
Y = f(X) + ε | 1 |
f() denotes the true, however unknown function, linking the responses Y with the process conditions X: X1, X2, …, XI, whereas ε is a random element taken from a normal distribution with variance σ2, ε ∼ N(0,σ2) to account for experimental uncertainties. Conceptually, nature evaluates in an experiment the function f() known to her only at reaction conditions X, then adds some random noise ε and returns the experimental results Y [eqn (1)].
Under the weak assumption that f() is smooth and continuous, f() can be locally approximated as polynomial parametric surrogates, g1,2,3(), of increasing complexity, formally:
2 |
3 |
4 |
These are linear [eqn (2)], bilinear [eqn (3)] or quadratic [eqn (4)] parametric surrogates of the true function f(). With the experimental values Y, X1, X2, …, XI available, the unknown parameters ai, aii, aij can be estimated from the data using ordinary least squares (OLS).46
Given process factors X1, X2, …, XI, their ranges Xi ∈ {LB,UB} with LB, UB denoting the lower and upper bounds of the process factors Xi and, depending on the expected complexity, the parametric form of the surrogate model, the design points X (experimental design) optimally supporting the chosen model can be calculated. However, in an early project phase it is often unclear which factors Xi and ranges should be chosen and what levels of complexity must be assumed for the domain under investigation. Therefore, DoE can benefit from experimental data analysis, with the latter helping to answer the questions arising in the former.
After first optimisation by an experimental design, ascending in the direction of maximal improvement can be easily achieved by increasing (relaxing) the experimental space in discrete steps and by solving a maximisation problem subject to a sequence of hypercubical constraints, that is
max(g(X)) subject to LB – kΔX < X < UB + kΔX | 5 |
with ΔX being the step size of the relaxation, here taken to be 10% of the initial factor ranges, that is ΔX = 0.1[UB – LB], and LB, UB denoting the lower and upper bounds of the process factors Xi. Varying k = 1, 2, …, K leads to a sequence of relaxation trials to be realised in the lab.
Additional information
The detailed description of all experiments, the performed multivariate analysis, the spectroscopic data of compounds as well as the NMR spectra of compounds and catalysis samples can be found in the ESI.† In order to improve comprehensibility, simplified names were used in some cases rather than using exact IUPAC names.
All calculations were done using the statistical software R.47 Random forest modelling was performed with the R-package ‘randomForest’.48 Experimental designs were calculated with the D-optimal criterion of the function optFederov() in the R-package ‘AlgDesign’.49 Optimisation was achieved with the augmented Lagrange method from the R-package ‘Rsolnp’.50 Graphics were produced with the R-package ‘lattice’.51
All catalyses and the corresponding analyses were performed following a procedure previously reported by our group.27 The catalysis was also performed in the absence of a catalyst, a co-catalyst or both to demonstrate the need of the catalytic system for the formation of DMM and MF. In all cases, no significant conversions for both of these compounds were observed (ESI†).
Reproduction of modelling results
The R-code used as well as the catalytic data analysed (Excel sheet) are available online. The multivariate analysis can be reproduced by following the instructions in the ESI.†
Author contributions
M. Siebert and G. K. designed the experiments. M. Siebert performed the experiments. G. K. performed computer modelling and analysis of the data. All authors contributed to the interpretation of the data. The manuscript was co-written by M. Siebert and G. K. and all authors contributed to the manuscript. O. T. supervised the project.
Conflicts of interest
There are no conflicts to declare.
Supplementary Material
Acknowledgments
Generous financial support by the German Federal Ministry of Education and Research (BMBF) within the funding initiative ‘CO2Plus – Stoffliche Nutzung von CO2 zur Verbreiterung der Rohstoffbasis’ is gratefully acknowledged. We thank A. Closs, A. Schreieck and P. Laloire for fruitful discussions.
Footnotes
†Electronic supplementary information (ESI) available: Synthetic procedures, multivariate modelling procedure, additional tables, catalysis data, R-code. See DOI: 10.1039/c9sc04591k
References
- Crutzen P. J. Nature. 2002;415:23. doi: 10.1038/415023a. [DOI] [PubMed] [Google Scholar]
- Martin J.-L., Maris V., Simberloff D. S. Proc. Natl. Acad. Sci. U. S. A. 2016;113:6105–6112. doi: 10.1073/pnas.1525003113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sterner T., Barbier E. B., Bateman I., van den Bijgaart I., Crépin A.-S., Edenhofer O., Fischer C., Habla W., Hassler J., Johansson-Stenman O., Lange A., Polasky S., Rockström J., Smith H. G., Steffen W., Wagner G., Wilen J. E., Alpízar F., Azar C., Carless D., Chávez C., Coria J., Engström G., Jagers S. C., Köhlin G., Löfgren Å., Pleijel H., Robinson A. Nature Sustainability. 2019;2:14–21. [Google Scholar]
- Geng Y., Sarkis J., Bleischwitz R. Nature. 2019;565:153–155. doi: 10.1038/d41586-019-00017-z. [DOI] [PubMed] [Google Scholar]
- Keim W., in Carbon Dioxide as a Source of Carbon: Biochemical and Chemical Uses, ed. M. Aresta and G. Forti, Springer, Dordrecht, 1st edn, 1987, 10.1007/978-94-009-3923-3. [DOI] [Google Scholar]
- Aresta M., in Carbon Dioxide as Chemical Feedstock, ed. M. Aresta, Wiley-VCH, Weinheim, 2010. [Google Scholar]
- Beckman E. J. Nature. 2016;531:180–181. doi: 10.1038/531180a. [DOI] [PubMed] [Google Scholar]
- Sheldon R. A. Green Chem. 2016;18:3180–3183. [Google Scholar]
- Martens J. A., Bogaerts A., De Kimpe N., Jacobs P. A., Marin G. B., Rabaey K., Saeys M., Verhelst S. ChemSusChem. 2017;10:1039–1055. doi: 10.1002/cssc.201601051. [DOI] [PubMed] [Google Scholar]
- Gil Y., Greaves M., Hendler J., Hirsh H. Science. 2014;346:171–172. doi: 10.1126/science.1259439. [DOI] [PubMed] [Google Scholar]
- Jordan M. I., Mitchell T. M. Science. 2015;349:255–260. doi: 10.1126/science.aaa8415. [DOI] [PubMed] [Google Scholar]
- Rothenberg G., Catalysis: Concepts and Green Applications, Wiley, Weinheim, 2017. [Google Scholar]
- Sigman M. S., Harper K. C., Bess E. N., Milo A. Acc. Chem. Res. 2016;49:1292–1301. doi: 10.1021/acs.accounts.6b00194. [DOI] [PubMed] [Google Scholar]
- Segler M. H. S., Preuss M., Waller M. P. Nature. 2018;555:604–610. doi: 10.1038/nature25978. [DOI] [PubMed] [Google Scholar]
- Coley C. W., Green W. H., Jensen K. F. Acc. Chem. Res. 2018;51:1281–1289. doi: 10.1021/acs.accounts.8b00087. [DOI] [PubMed] [Google Scholar]
- Gao H., Struble T. J., Coley C. W., Wang Y., Green W. H., Jensen K. F. ACS Cent. Sci. 2018;4:1465–1476. doi: 10.1021/acscentsci.8b00357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ahneman D. T., Estrada J. G., Lin S., Dreher S. D., Doyle A. G. Science. 2018;360:186–190. doi: 10.1126/science.aar5169. [DOI] [PubMed] [Google Scholar]
- Granda J. M., Donina L., Dragone V., Long D.-L., Cronin L. Nature. 2018;559:377–381. doi: 10.1038/s41586-018-0307-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santiago C. B., Guo J.-Y., Sigman M. S. Chem. Sci. 2018;9:2398–2412. doi: 10.1039/c7sc04679k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao S., Gensch T., Murray B., Niemeyer Z. L., Sigman M. S., Biscoe M. R. Science. 2018;362:670–674. doi: 10.1126/science.aat2299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zahrt A. F., Henle J. J., Rose B. T., Wang Y., Darrow W. T., Denmark S. E. Science. 2019;363:eaau5631. doi: 10.1126/science.aau5631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reid J. P., Sigman M. S. Nature. 2019;571:343–348. doi: 10.1038/s41586-019-1384-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brethomé A. V., Paton R. S., Fletcher S. P. ACS Catal. 2019;9:7179–7187. doi: 10.1021/acscatal.9b01814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gromski P. S., Henson A. B., Granda J. M., Cronin L. Nat. Rev. Chem. 2019;3:119–128. [Google Scholar]
- Box G. E. P., Hunter W. G. and Hunter J. S., Statistics for Experimenters, John Wiley & Sons, New York, 1st edn, 1978. [Google Scholar]
- Box G. E. P., Hunter W. G. and Hunter J. S., Statistics for Experimenters: Design, Innovation, and Discovery, John Wiley & Sons, Hoboken, 2st edn, 2005. [Google Scholar]
- Siebert M., Seibicke M., Siegle A. F., Kräh S., Trapp O. J. Am. Chem. Soc. 2019;141:334–341. doi: 10.1021/jacs.8b10233. [DOI] [PubMed] [Google Scholar]
- Seibicke M., Siebert M., Siegle A. F., Gutenthaler S. M., Trapp O. Organometallics. 2019;38:1809–1814. [Google Scholar]
- Dabral S., Schaub T. Adv. Synth. Catal. 2019;361:223–246. [Google Scholar]
- Sun R., Delidovich I., Palkovits R. ACS Catal. 2019;9:1298–1318. [Google Scholar]
- Thenert K., Beydoun K., Wiesenthal J., Leitner W., Klankermayer J. Angew. Chem., Int. Ed. 2016;55:12266–12269. doi: 10.1002/anie.201606427. [DOI] [PubMed] [Google Scholar]
- Schieweck B. G., Klankermayer J. Angew. Chem., Int. Ed. 2017;56:10854–10857. doi: 10.1002/anie.201702905. [DOI] [PubMed] [Google Scholar]
- Bontemps S., Vendier L., Sabo-Etienne S. Angew. Chem., Int. Ed. 2012;51:1671–1674. doi: 10.1002/anie.201107352. [DOI] [PubMed] [Google Scholar]
- Bontemps S., Vendier L., Sabo-Etienne S. J. Am. Chem. Soc. 2014;136:4419–4425. doi: 10.1021/ja500708w. [DOI] [PubMed] [Google Scholar]
- Jin G., Werncke C. G., Escudié Y., Sabo-Etienne S., Bontemps S. J. Am. Chem. Soc. 2015;137:9563–9566. doi: 10.1021/jacs.5b06077. [DOI] [PubMed] [Google Scholar]
- Aloisi A., Berthet J.-C., Genre C., Thuéry P., Cantat T. Dalton Trans. 2016;45:14774–14788. doi: 10.1039/c6dt02135b. [DOI] [PubMed] [Google Scholar]
- Espinosa M. R., Charboneau D. J., Garcia de Oliveira A., Hazari N. ACS Catal. 2019;9:301–314. [Google Scholar]
- Metsänen T. T., Oestreich M. Organometallics. 2015;34:543–546. [Google Scholar]
- Ríos P., Curado N., López-Serrano J., Rodríguez A. Chem. Commun. 2016;52:2114–2117. doi: 10.1039/c5cc09650b. [DOI] [PubMed] [Google Scholar]
- Jiang Y., Blacque O., Fox T., Berke H. J. Am. Chem. Soc. 2013;135:7751–7760. doi: 10.1021/ja402381d. [DOI] [PubMed] [Google Scholar]
- LeBlanc F. A., Piers W. E., Parvez M. Angew. Chem., Int. Ed. 2014;53:789–792. doi: 10.1002/anie.201309094. [DOI] [PubMed] [Google Scholar]
- Box G. E. P. and Draper N. R., Empirical Model-Building and Response Surfaces, John Wiley & Sons, New York, 1987. [Google Scholar]
- Montgomery D. C., Design and Analysis of Experiments, John Wiley & Sons, New York, 3rd edn, 1991. [Google Scholar]
- Breiman L. Mach. Learn. 2001;45:5–32. [Google Scholar]
- Hastie T., Tibshirani R. and Friedman J., The Elements of Statistical Learning, Springer-Verlag, New York, 2nd edn, 2009. [Google Scholar]
- Sen A. and Srivastava M., Regression Analysis, Springer-Verlag, New York, 1990. [Google Scholar]
- R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2017, https://www.R-project.org/.
- Liaw A., Wiener M. R News. 2002;2:18–22. [Google Scholar]
- Wheeler B., AlgDesign: Algorithmic Experimental Design, R package version 1.1-7.3, 2014, https://CRAN.R-project.org/package=AlgDesign.
- Ghalanos A. and Theussl S., Rsolnp: General Non-Linear Optimization, R package version 1.16, 2015, https://CRAN.R-project.org/package=Rsolnp.
- Sarkar D., Lattice, Springer-Verlag, New York, 2008. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.