Artificial-intelligence-driven discovery of catalyst genes with application to CO2 activation on semiconductor oxides

Aliaksei Mazheika; Yang-Gang Wang; Rosendo Valero; Francesc Viñes; Francesc Illas; Luca M Ghiringhelli; Sergey V Levchenko; Matthias Scheffler

doi:10.1038/s41467-022-28042-z

. 2022 Jan 20;13:419. doi: 10.1038/s41467-022-28042-z

Artificial-intelligence-driven discovery of catalyst genes with application to CO₂ activation on semiconductor oxides

Aliaksei Mazheika ^1,^✉, Yang-Gang Wang ^1,², Rosendo Valero ^3,⁴, Francesc Viñes ³, Francesc Illas ³, Luca M Ghiringhelli ^1,⁵, Sergey V Levchenko ^6,^✉, Matthias Scheffler ^1,⁵

PMCID: PMC8776738 PMID: 35058444

Abstract

Catalytic-materials design requires predictive modeling of the interaction between catalyst and reactants. This is challenging due to the complexity and diversity of structure-property relationships across the chemical space. Here, we report a strategy for a rational design of catalytic materials using the artificial intelligence approach (AI) subgroup discovery. We identify catalyst genes (features) that correlate with mechanisms that trigger, facilitate, or hinder the activation of carbon dioxide (CO₂) towards a chemical conversion. The AI model is trained on first-principles data for a broad family of oxides. We demonstrate that surfaces of experimentally identified good catalysts consistently exhibit combinations of genes resulting in a strong elongation of a C-O bond. The same combinations of genes also minimize the OCO-angle, the previously proposed indicator of activation, albeit under the constraint that the Sabatier principle is satisfied. Based on these findings, we propose a set of new promising catalyst materials for CO₂ conversion.

Subject terms: Materials for energy and catalysis, Theory and computation, Atomistic models, Computational chemistry

Here the authors demonstrate an artificial-intelligence based approach to identify catalytic materials features that correlate with mechanisms that trigger, facilitate, or hinder CO2 catalytic reactions.

Introduction

The need for converting stable molecules such as carbon dioxide (CO₂), methane, or water into useful chemicals and fuels is growing quickly along with the depletion of fossil-fuel reserves and the pollution of the environment^1–3. Such a conversion does not have a satisfactory solution, so far. In particular, CO₂ conversion remains one of the most important societal and technological challenges^1,2,4–8.

The general understanding in heterogeneous catalysis is that a stable molecule such as CO₂ needs to be “prepared” before its catalytic conversion occurs. This leads to the notion of molecular activation⁹. However, on one hand, this notion encompasses a very wide variety of processes (adsorption, photo-excitation, application of electric field, etc.) and materials (including compositional and structural variability), and it remains unclear which properties of the catalytic material and the adsorbed molecule determine the final chemistry, what is the relationship between the two sets of properties, and how general this relationship may be. On the other hand, finding the set of descriptive parameters of a catalytic material that characterize the catalytic performance in a particular process, or even in general for a given reactant, would be very valuable, because it would allow us to quickly search for promising candidate catalysts using rational design^10–17. We call these properties materials genes. The genes do not necessarily correlate with catalytic activity by themselves. Similar to biological genes, their role depends on the combination in which they occur, and can be either beneficial or detrimental to the catalytic activity.

Several strategies exist to find such properties for a given reaction. One way is to explore the free-energy surface for each catalyst candidate, which is a slow and resource-consuming process, and currently computationally unfeasible for many materials on a high-throughput basis. An alternative approach consists in searching for a correlation between experimentally determined material’s properties and its catalytic performance. Such a strategy requires consistent experimental measurements at well-defined conditions for a set of materials. To the best of our knowledge, such consistent data have not been reported so far for CO₂ conversion on semiconductor oxides. Moreover, available publications usually do not report unsuccessful experimental results. These issues and a strategy to address them have been recently discussed in our publication¹⁸.

Yet another strategy is to find an indicator of activation, namely, a property of the system that directly indicates the certain catalytic performance of the material¹⁰. Indicators are distinguished from materials genes based on a qualitatively different level of computational complexity. The indicator can still be unfeasible or hard for a high-throughput study of hundreds of thousands or millions of materials. However, when it can be calculated for a few tens or hundreds of materials in a reasonable time, these data can then be used to find materials genes that control the value of the indicator. Since a direct search for a relationship between the indicator and catalytic performance of material would also require a consistent set of data of turnover frequency (TOF), selectivity, and yield values, one could instead consider several most promising indicators, find out which materials are good catalysts, and then check which indicators correlate with this observation. This approach also addresses the problem of defining activation in terms of the adsorbed-molecule properties as potential indicators of catalytic activity.

Catalytic conversion of CO₂ requires activation of other reactants as well, e.g., molecular hydrogen, water, or methane. In particular, hydrogen can serve as an environmentally friendly reagent that can be produced by water electrolysis or photo-splitting avoiding extra CO₂ emissions^19–21. Also, oxygen vacancies have been proposed as active sites for CO₂ conversion on some materials²². Therefore, predictions of catalytic activity of materials for CO₂ conversion can be refined based on analysis of activation of other reactants and defects. An additional challenge is to ensure that the useful products, as well as the surface catalytic activity, are preserved under the conditions of activation and subsequent conversion. While the strong C–O double bonds in CO₂ can be weakened or even broken by adsorption at a solid surface at an elevated temperature, this may also lead to too strong adsorption or further dissociation of the molecule, so that the catalytic surface is poisoned by carbonate or carbon deposits. Weak adsorption, on the other hand, means no activation.

In this work, we combine first-principles calculations with an artificial-intelligence (AI) method, subgroup discovery (SGD), to identify pristine materials properties that optimize indicators of catalytic CO₂ activation. Moreover, SGD allows identifying one or more distinct combinations of materials features (genes) that promote activation. We focus on oxide materials as candidate catalysts. Oxides are structurally and compositionally stable under realistic temperatures and can be less expensive than the traditional precious metal-containing catalysts^23–25. Activation of other reactants and defects are not considered. As shown below, meaningful predictions can be made based solely on the analysis of the adsorption properties of CO₂ on pristine surfaces. This confirms that these properties are good indicators of activation with a viable optimization pathway at least for the chosen class of materials. The Sabatier principle is taken into account by ensuring that the adsorption energy is not too large or too small. In order to ensure reproducibility of our AI data analysis, we provide all necessary metadata (input parameters) and workflow in the easily accessible form of a Jupyter notebook²⁶. We argue that, with the ever-growing importance and complexity of AI, such detailed and tutorial documentation is a necessity of good scientific practice. Our approach is applicable to a wider class of materials and molecules, not limited to oxides or CO₂. Our study by no means encompasses all possible mechanisms of CO₂ conversion on oxide surfaces, but it offers a clear design path among many possible ones.

Results

CO₂ activation

We find that on semiconductor oxide surfaces CO₂ is chemisorbed exclusively when the carbon atom binds to surface O-atoms. All other minima of the potential-energy surface are found to be either metastable or correspond to physisorption. Therefore, there are as many different potential chemisorption sites as there are unique O-atoms at the surface. The dataset includes all non-equivalent surface O-atoms on the 141 considered surfaces of 71 materials, which sum up to 255 unique adsorption sites. Among these sites on about 4% (10 out of 255) CO₂ prefers to physisorb, i.e., any chemisorbed state is metastable with respect to the physisorbed one. The physisorption can be easily identified by an almost linear geometry of the adsorbed molecule, and a C–O bond distance very close to the C–O bond length in a gas-phase CO₂ molecule, 1.17 Å.

We considered six different candidate indicators of CO₂ activation, including OCO-angle and C–O bond distance. The bending of the OCO-angle in the adsorbed CO₂ molecule relative to the gas-phase value of 180° (linear configuration) has been previously proposed²⁷ and is widely accepted as a good indicator of activation. For gas-phase CO₂, it is understood that the C–O double bond is weakened when an electron is added to the lowest unoccupied orbital, because it is of antibonding (π*) character with a concomitant bending of the molecule. There is a one-to-one mapping between the C–O bond length l(C–O) and the OCO-angle in gas-phase CO₂^δ− for a range of δ > 0 (red curve in Fig. 1). However, this is not the case for the adsorbed CO₂ (dots in Fig. 1). There is a subset of adsorbed CO₂ that is close to the red line, but there are many cases where l(C–O) is substantially larger for a given OCO-angle. This is in contrast to metal alloy nanoparticle catalysts, where there is a better correlation between OCO-angle and l(C–O)²⁸. Also, a longer C–O bond reflects a weakening and readiness for further chemical transformations. Thus, the bond elongation itself may be an alternative indicator of activation. A look at the adsorbed CO₂ structures reveals that, on sites following the gas-phase correlation, the molecule adsorbs in nearly symmetric adsorption structures with nearly equal length of the two C–O bonds. In the other cases one O-atom of CO₂ is close to surface cation(s), leading to a pronounced asymmetry of the adsorbed molecule.

Fig. 1 — The OCO-angle in charged gas-phase CO₂ is shown with the red line, and adsorbed CO₂ structures are shown with the dots. Colored dots: blue—adsorption sites from the unconstrained subgroup with OCO < 132°, green—subgroup of sites with l(C–O) > 1.30 Å, black—the remaining samples (see the text). The subgroups obtained with Sabatier principle constraint are marked with “c”.

Other considered potential indicators of activation include Hirshfeld charge²⁹ of adsorbed CO₂ (a direct indicator of the charge transferred to CO₂), the dipole moment of the surface along the surface normal per adsorbed CO₂ molecule (includes charge transfer to the molecule, as well as adsorption induced surface relaxation), the difference in Hirshfeld charges of C and O-atoms in an adsorbed CO₂ molecule (indicates the ionicity of C–O bonds), and the difference in Hirshfeld charges of the O-atoms in the adsorbed molecule (indicates asymmetry of the adsorbed molecule)^9,29.

Subgroup discovery

To find out which properties (features) of the clean surfaces determine when a given activation indicator is maximized or minimized, we employ the subgroup-discovery (SGD) approach^30–34. Given a dataset and a target property known for all data points, the SGD algorithm identifies subgroups with “outstanding characteristics” (see further for the criteria for being outstanding) and describes them by means of conjunction of basic propositions (selectors) of the kind “(f₁ < a) AND (f₂ ≥ b) AND ...”, where f_i is a feature and a, b are threshold values also found by SGD. In the framework of SGD, we call the selected primary features {f₁, f₂, ...} materials genes. Thus, SGD identifies both the outstanding subgroups and the relevant materials genes for a given target property.

Obviously, the selectors should only contain features that are much easier to evaluate than the target property. In the presented work, the considered features include properties of gas-phase atoms that build the material, and properties of the pristine material (properties of the bulk phase and of the pristine relaxed surface). Overall 46 primary features have been considered. The full list is presented Supplementary Table 3. Our strategy is to provide an almost exhaustive list of features, and use data analytics to select materials genes from this list. Some of these features have been explored previously as descriptors of catalytic activity for semiconducting and metallic oxides^35–38. O 2p-band center features have been shown to correlate with catalytic properties of both semiconducting and metallic oxides^35,37. In particular, most of the features (or closely related ones) mentioned in ref. ³⁶, inspired by the work of Grasselli³⁹, are included in our set, except oxygen vacancy formation energy, which is relevant for the oxidation catalysis, while here we are interested in partial or complete reduction. Additional important features in our work (see below) include features related to the polarizability of surface cations, which describe the long-range surface response to charged adsorbates. A subset of features from our list has been recently used successfully for predicting catalytic properties of metallic oxides³⁸, along with additional features relevant specifically for metallic oxides (such as partial electronic state fillings).

The features selected by the SGD are summarized in Table 1.

Table 1.

Features that appear in the top SGD selectors (see text).

symbol	Meaning
IP_min/max	Ionization potential, minimal and maximal in the pair of atoms A and B; calculated as E_atom − E_cation
EA_min/max	Electron affinity, minimal and maximal in the pair of atoms A and B; calculated as E_anion − E_atom
EN_min/max	Mulliken electronegativity, minimal and maximal in the pair of gas-phase atoms A and B
r₋₁^min, r₋₁^max	Radii of the maximum value of the Kohn-Sham radial wave functions of the spin-unpolarized spherically symmetric atom for HOMO-1, maximum (max) and minimum (min) in the pair of atoms A and B
r₊₁^min, r₊₁^max	Radii of the maximum value of the Kohn-Sham radial wave functions of the spin-unpolarized spherically symmetric atom for LUMO, maximum (max) and minimum (min) in the pair of atoms A and B
M	Energy at which the surface O 2p-band projected density of states (PDOS) is maximal
d₁, d₂, d₃	Distances from surface O-atom to the first-, second-, and third-nearest cations
W	Work function W, as the negative of the valence-band maximum (W = −VBM) with respect to vacuum level
q_min, q_max	Minimal and maximal Hirshfeld charges of cations in the pair A and B, calculated as an average for all surface cations of a given type
Δ	Bandgap
CBM	Conduction band minimum
Q₅, Q₆	Local-order parameter with l = 5 or 6
PC	Weighted surface O 2p-band center
α_O, C₆^O	Polarizability and C₆-coefficient for surface O-atom obtained from many-body dispersion scheme
α_min, α_max, C₆^min, C₆^max	Polarizability and C₆-coefficient for cations, minimal and maximal in the pair A and B, calculated as an average for all surface cations of a given type
q_O	Hirshfeld charge of O-atom at the surface
wid	Square root of the second moment of surface O 2p-band
wid_min, wid_maxS	Square root of the second moment of PDOS of cations within valence-band, minimal and maximal in the pair A and B, calculated as an average for all surface cations of a given type
c_min, c_max	First moment for PDOS of cation within valence-band, minimal and maximal in the pair A and B, calculated as an average for all surface cations of a given type
φ_1.4, φ_2.6, φ_1.4 - φ_2.6	Electrostatic potentials above surface O-atom at 1.4 and 2.6 Å and their difference. 1.4 Å corresponds to the average length of the bond between C and surface O, 2.6 Å is the minimal distance from surface O to C-atom of physisorbed carbon-dioxide molecule as observed from our calculations
L_min, L_max	Energy of lowest unoccupied projected eigenstate of surface cations, minimal and maximal in the pair A and B, calculated as an average for all surface cations of a given type
kurt	Kurtosis of surface O 2p-band PDOS
U	Eigenstate with least negative value in surface O 2p-band
BV	Bond-valence value of surface O-atom

Open in a new tab

The outstanding subgroup should satisfy several criteria. It should be statistically relevant; therefore the subgroups of too small size should be penalized. Target-property values (OCO-angle, C–O bond length, etc.) for subgroup samples should be as different as possible from corresponding gas-phase values since their change upon adsorption indicates CO₂ activation³³. To achieve this, two requirements are imposed simultaneously: (i) The target-property values for subgroup members should be smaller or larger (depending on the target) than a certain value (a cutoff), and (ii) the target-property values are minimized or maximized within the cutoff. The latter condition gives preference to subgroups with smaller or larger target-property values among similarly sized subgroups within the cutoff. The value of the cutoff is a parameter. As it approaches the optimal value of an activation indicator among all data points, additional or alternative materials genes and their combinations leading to stronger activation are identified. We explore the whole range of the parameter for each target property (for OCO-angle—123°, 124°, 126°, 128°, 130°, and 132°; for l(C–O)—1.26 Å, 1.28 Å, and 1.30 Å).

In addition to these criteria, we consider the requirement that adsorption energies are not too strong and not too weak for most of the samples in a subgroup. Strong activation (i.e., strong weakening of the C–O bonds) can be achieved by strong binding to the surface. It is well known that good catalytic performance requires a balanced adsorption strength. This is known as Sabatier principle. In addition to the practical value of identifying subgroups that satisfy this principle, comparison of subgroup selectors obtained with and without this requirement helps to identify combinations of materials features that promote desired changes in target properties and at the same time yield intermediate adsorption energies.

Sabatier principle is reflected by a characteristic volcano-type behavior of catalytic activity as a function of adsorption energy of reactants and intermediates. The position of the top of the volcano depends on particular reactions and conditions. It can be estimated from condition |ΔG| ~ 0, where ΔG is the Gibbs free energy of adsorption. For CO₂ adsorption at room temperature and partial CO₂ pressure of 1 atm this condition corresponds to about −0.5 eV adsorption energy⁴⁰. At temperatures around 450 °C (typical conditions for CO₂ methanation⁴¹) ΔG = 0 corresponds to adsorption energy −1.7 eV⁴¹. Therefore, for catalytic conversion at low or moderate temperatures this implies that CO₂ adsorption energies should be in the range from between −2.0 and −0.5 eV.

These requirements are implemented in the following quality functions that are maximized during the search for subgroups. In particular, for OCO-angle minimization we use:

F (Z) = θ_{cut} [\frac{s (Z)}{s (Y)} \cdot (\frac{\max (Z) - α_{g}}{\min (Y) - α_{g}}) \cdot u (p)]

and for C–O bond maximization the following quality function was applied:

F (Z) = θ_{cut} [\frac{s (Z)}{s (Y)} \cdot (\frac{\min (Z) - l_{g}}{\max (Y) - l_{g}}) \cdot u (p)]

where Y is the whole dataset, Z—a subgroup, s—size (number of data points), min and max – minimal or maximal value of the target property, α_g and l_g are the gas-phase values of OCO-angle and C–O bond distance, 180° and 1.17 Å, respectively, and θ_cut is the Heaviside step function which is equal 1 if all data points in the subgroup satisfy the cutoff condition and 0 otherwise. Thus, larger values of the quality function F(Z) are obtained for those subgroups in which minimal (maximal) value of a target property is close to the maximal (minimal) value of the whole sampling with respect to the gas-phase value of CO₂ molecule. The use of maximum/minimum instead of a median is done to ensure that a target property is optimal for as many members of a subgroup as possible. The gas-phase reference values are usually significantly different from the “chemisorption” subset. Therefore, the term in squared brackets in Eqs. (1) and (2) can noticeably contribute only when the sizes of candidate subgroups are similar.

The term u(p) in Eqs. (1) and (2) is added in order to account for Sabatier principle in SGD framework. We have implemented a multitask quality function, where a factor u(p) increases the quality of subgroups with adsorption energies falling within this range. This is formulated in terms of the information gain³⁴, i.e., reduction of the normalized Shannon entropy. We perform the SGD for each target property both explicitly accounting for the Sabatier principle and without it. The latter case is equal to u(p) = 1 in Eqs. (1) and (2)³⁴.

We note that SGD is qualitatively different from machine-learning classification/regression techniques such as neural networks, kernel regression methods, or decision-tree regression (DTR⁴²) (e.g., random forest). SGD is typically referred to as a supervised descriptive rule-induction technique⁴³, i.e., it uses the labels assigned to the data points (the values of the target property) in order to identify patterns in the data distribution (the statistically exceptional data groups) and the rules defining them (the selectors), by optimizing a quality function which is a function of the distribution of values of the target property⁴³. While there are apparent similarities between SGD and DTR as both methods yield models in terms of physically interpretable selectors (usually, inequalities) on a selected subset of the input features, the analogy stops at this level, as SGD focuses at (and only at) subgroups from the very beginning and says nothing about the data that are not in the subgroup. In contrast, DTR determines a global partitioning of the input space by minimizing a global quality function, i.e., the quality of a single subset is secondary with respect to the resulting quality of all subsets partitioning the whole dataset. In other words, for finding distinct combinations of materials genes driving desirable changes in a particular target property (possibly different combinations leading to the same result), the SGD approach has significantly higher flexibility and reliability. This is demonstrated below for a DTR analysis for our target properties.

The metadata and workflow for the AI analysis are documented in the Jupyter notebook²⁶.

Results of the subgroup discovery

The SGD for OCO-angles was done with Eq. (1) for the quality function, and OCO as a target property, since smaller angles indicate larger charge transferred to the molecular π^* orbital. The subgroup selectors obtained with different OCO-angle cutoffs (126°, 128°, 130°, and 132°) with or without the adsorption energy constraint are listed in Table 2 (for more details see the Supplementary Table 4). Analysis of these subgroups reveals that the angle reduction is determined by an interplay of several factors: an electron transfer from the cations to surface O-atoms, delocalization of electron density between cations and O-atoms, and coordination of the surface O-atoms. Without the Sabatier principle constraint, the OCO-angle reduction below 132° is mainly due to the electron accumulation at the O-atom of the clean surface. This is expressed by the conditions of more negative Hirshfeld charge on O-atoms (q_O < …), not very low IP of at least one cation (IP_max > …), and increased polarizability of the surface O-atom on which CO₂ is adsorbed (C₆^O > ...). Upon adsorption of CO₂, this charge on the surface O-atom is readily available for transfer to CO₂. When the Sabatier principle constraint is introduced, the OCO < 132° subgroup also includes sites with a pronounced electron transfer to CO₂, but with a lower-energy O 2p-band maximum (M < ...) with respect to vacuum level, and a larger kurtosis (kurt > ...). These conditions imply reduced inter-electronic repulsion around the surface O-atom achieved by partial delocalization of the charge density.

Table 2.

Top subgroups and their selectors obtained by minimization of OCO-angle and maximization of l(C–O) with/out Sabatier principle (energies are in eV, distances are in Å, charges are in units of absolute electron charge, polarizabilities are in Bohr³).

cutoff	size	selector	cutoff	size	selector
OCO minimization without Sabatier principle constraint			OCO minimization with Sabatier principle constraint
126	19	L_max > −2.70 (L_min > −2.19, CBM > −3.40, r₊₁^max ≤ 2.83, W < 5.80, U > −5.61) IP_max ≥ −6.05 α_max ≤ 184.5 Δφ > 1.33 q_max ≤ 0.59 wid ≤ 1.59 wid ≥ 0.58	126	15	L_min ≥ −5.1085 φ_2.6 ≤ 0.3033 Δφ ≤ 1.0622 (c_max ≤ −8.5915) d₁ ≥ 1.82 d₂ ≥ 2.005 r₊₁^max > 2.83
128	44	EA_max ≥ −0.43 Q₆ ≥ 0.51 α_max ≥ 50.4 (C₆^max ≥ 389.5, α_O ≤ 2.70) Δφ ≥ 1.00 q_min ≤ 0.49	128	30	C₆^min ≥ 369.5 L_max ≥ −4.73 (r₊₁^min ≤ 2.82, IP_min ≤ −5.83, r_HOMO^min ≤ 1.41) Q₅ ≤ 0.83 Δφ ≥ 0.60 r₊₁^max ≥ 2.80 C₆^O ≤ 12.10
130	77	L_max ≥ −5.23 EA_max ≤ 0.16 (C₆^max ≥ 389.5, IP_max ≥ −7.00) d₁ ≥ 1.82 d₂ > 2.10	130	40	φ_2.6 ≥ −0.15 Δφ ≥ 0.73 d₁ ≤ 2.01 d₂ ≥ 1.96 d₃ ≥ 2.025 (c_min ≤ −9.07, W ≥ 5.10) q_min ≤ 0.49 r₊₁^min ≥ 1.94
132	139	IP_max ≥ −6.99 q_O ≤ -0.32 C₆^O ≥ 10.36	132	58	q_O ≤ −0.3386 M ≤ −6.292 kurt ≥ 2.1035 IP_max ≥ −6.2085 r_HOMO^min ≤ 1.407 (IP_min ≤ −5.91, r₊₁^min ≤ 2.82)
l(C–O) maximization without Sabatier principle constraint			l(C–O) maximization with Sabatier principle constraint
1.26	121	C₆^min ≥ 343.5 φ_2.6 ≤ 0.66 Q₅ ≤ 0.83 M ≥ −8.05 (PC ≥ −9.32)	1.26	56	CBM ≥ −5.17 (L_min ≥ −5.11) Δφ ≤ 1.13 PC ≥ −8.62 d₃ ≤ 2.48 M ≤ −6.06
1.28	38	EA_max ≤ 0.005 d₂ > 2.22 M ≤ −4.12	1.28	30	W ≥ 5.10 (M ≤ −5.19, U ≤ -4.92, PC ≤ −7.21) d₂ > 2.14 q_min < 0.48
1.30	27	U ≤ −5.34 d₂ > 2.14 q_min < 0.48 kurt ≥ 2.10 (q_max ≥ 0.47)	1.30	27	EA_max ≤ 0.005 (W ≥ 5.10, M ≤ −5.19, U ≤ −4.92, PC ≤ −7.21) EN_min ≤ −3.19 (W ≥ 5.10, q_O ≥ −0.45, c_max ≤ −7.18, r_HOMO^min ≤ 1.41, φ_1.4 ≤ 2.40, c_min ≤ −8.135, q_max ≥ 0.47, M ≤ −5.19, IP_min ≤ −5.91, wid ≥ 0.58, U ≤ −4.92, r₋₁^max ≥ 0.97, PC ≤ −7.21, Δφ ≤ 1.81) d₂ > 2.14 q_min < 0.48 kurt ≥ 2.51

Open in a new tab

Proposition replacements that do not change the support are shown in parentheses.

At lower OCO cutoffs, the subgroup selectors include coordination descriptors Q_i, i = 5, 6. Without Sabatier principle, sites with larger Q_i are selected, and vice versa. Larger Q_i indicates lower coordination of the O-atom. This reduces electron repulsion and therefore facilitates electron transfer to the O-atom of the clean surface. However, this also increases the bonding strength of CO₂ to the surface. This explains why selectors of subgroups obtained with Sabatier principle include the opposite conditions (Q₅ < ...).

Other surface features describing electron distribution are related to Madelung potential: electrostatic potential and field (φ_1.4, φ_2.6, and Δφ = φ_1.4 − φ_2.6) and distances between the O-atom and surface cations. More open surface structure with larger distances between cations at the O site facilitates charge transfer to adsorbed CO₂ molecule, since the Madelung potential from the nearby cations is reduced. This is reflected in the appearance of propositions involving features d₁, d₂, and d₃. For example, for the OCO ≤ 130° subgroups, imposing energy constraint changes proposition (d₁ > ...) to (d₁ < ...), which implies an increased energy cost for transferring electrons to CO₂. Larger electric fields Δφ around the adsorption site imply stronger localization of electron density on O-atoms, and thus also improve the efficiency of charge transfer to the adsorbed molecule.

The smaller OCO subgroups with Sabatier principle also include propositions implying increased polarizability of both cations (C₆^min > ...). Another support-defining condition is that the radius of the lowest unoccupied orbital for the metal atoms should not be small (r₊₁ ≥ ...). This requirement is true for most cations with negative electron affinities (Supplementary Fig. 4). Analysis of adsorbed CO₂ structures and Hirshfeld charges reveals that this condition together with the higher polarizability of cations at the pristine surface encompasses two scenarios: (i) additional electron transfer to CO₂ upon adsorption and (ii) stronger binding between O-atoms in CO₂ and surface cations. When scenario (ii) dominates, CO₃^δ− anion lies nearly horizontally at the surface, and is bound with nearby cations by chemical bonds via its oxygen atoms. Such a structure leads to small OCO-angles in CO₃^δ− (around 120°), even if charge transfer is limited. Thus, increased bending of adsorbed CO₂ occurs due to charge transfer over larger distances and/or distortion of the adsorbed molecule and the surface, both leading to weaker adsorption. The cases where both scenarios are active include the same sites as in the subgroups with elongated l(C–O), as described below.

In order to obtain the subgroups of adsorption sites with larger l(C–O), we performed the SGD with the quality function Eq. (2) and l(C–O) as target property. The results for l(C–O) cutoffs 1.26, 1.28, and 1.30 Å are summarized in Table 2 and Supplementary Table 5. In contrast to OCO, the analysis of the obtained top subgroups shows a much less pronounced or no effect of imposing Sabatier principle on the distribution of adsorption energies within the subgroups. This is because sites with too strong adsorption are excluded based on l(C–O) threshold alone, without the need to introduce the energy constraint. For example, the range of l(C–O) for the top l(C–O) > 1.26 Å subgroup without constraining adsorption energies is the same as for the top OCO < 130° subgroup, but it contains significantly more sites with intermediate adsorption energies.

Electron transfer to an adsorbed CO₂ molecule increases both the OCO bending and C–O bond elongation. The main difference between OCO and l(C–O) subgroups is that in the latter an additional mechanism of increasing l(C–O) is in effect, namely a covalent bonding between one O-atom of the CO₂ molecule and the nearest surface cation. This can be concluded from the analysis of adsorption geometries, and correlates with the presence of proposition (EA_max ≤ 0.005 eV), selecting cation species that can accept electron density, e.g., from an O-atom in adsorbed CO₂ molecule. Other proposition that appears in most selectors of top subgroups is (d₂ > 2.14 Å) or (d₂ > 2.22 Å)—larger distances to the second nearest cation from an O-atom. Larger elongation of the C–O bond is achieved by the asymmetry of the cation types at the surface, where one can bind an O-atom of the adsorbed CO₂, while the other (located further away) cannot. An example asymmetric CO₂ adsorption structure is shown in Supplementary Fig. 5.

Other propositions indicate a moderate charge transfer to adsorbed CO₂ molecule as in the case of OCO subgroups with adsorption energy constraint. Propositions (M ≥ −8.05 eV), (PC ≥ −9.32 eV) in l(C–O) < 1.26 Å subgroups imply enhanced charge density on the surface O-atoms, since electron–electron repulsion raises energies of O 2p-band states. However, at larger l(C–O) cutoffs the electron transfer is balanced by such propositions as (M ≤ −5.19 eV), (U ≤ −4.92 eV), and (W ≥ 5.10 eV) indicating limited electron transfer. These propositions point to more covalent bonding between cations and surface O-atom. Rather persistent proposition observed in many selectors of l(C–O) subgroups is the limit of minimal charge on surface cations (q_min < 0.48e). It also shows the limitation of the charge transfer from one type of cations to surface oxygen atoms.

In general, we find that subgroups obtained with smaller cutoffs do not have a strong overlap with subgroups with larger cutoffs for OCO. In particular, for subgroups with close cutoffs the overlap can be smaller than 50% of the smaller subgroup (but is never below 30%). Interestingly, for l(C–O) the situation is opposite: subgroups with tighter cutoffs are mostly contained in the subgroups for more relaxed constraints. This means that, while larger values of l(C–O) are mainly controlled by the same or additional genes, smaller values of OCO are due to alternative genes. The overlap of OCO subgroups becomes even smaller when Sabatier principle is included, confirming the absence of a universal mechanism for OCO-angle reduction that is compatible with moderate adsorption energy.

In summary, we find that, while an increased electron density at the O adsorption site is necessary for chemisorption and leads to both OCO bending and C–O bond elongation in an adsorbed CO₂ molecule, there are additional actuators for these effects that are different for different target properties. The OCO-angle is in general minimized by increasing electron transfer to the O site. However, this also leads to strong adsorption for many materials (Fig. 2). To satisfy Sabatier principle, the electron transfer to CO₂ must be moderate. This is achieved by delocalization of charge density around O sites and/or by distortion of the adsorbed molecule due to the formation of covalent bonds between O-atoms in CO₂ and surface cations. The largest C–O bond elongations are achieved when both charge transfer to adsorbed CO₂ and the covalent interaction are present, and local geometry around surface O-atom provides the asymmetry in adsorption structure. This mechanism automatically fulfills the Sabatier principle.

Fig. 2 — The distribution is shown for the whole dataset (black), for the top subgroups of sites with OCO < 132° angles (blue) and l(C–O) > 1.30 Å (green). The subgroups obtained with adsorption energy constraint are marked with “c.” and shown with dashed lines. The adsorption energy E_ads is defined as the difference between the total energy of the slab with adsorbed CO₂ and the sum of total energies of the clean slab and an isolated CO₂ molecule.

The subgroups found by SGD for the dipole moment induced by CO₂ adsorption, its total Hirshfeld charge, and the difference of charges on C and O-atoms significantly overlap with the subgroup of smaller OCO-angles. The subgroup found by maximizing the difference of Hirshfeld charges on O-atoms of an adsorbed CO₂ largely overlaps with the subgroup of sites delivering larger l(C–O). In general, these indicators are not better than OCO or l(C–O). Therefore, below we focus on OCO-angle and l(C–O) as indicators of CO₂ activation. More details about the other indicators can be found in Supplementary Discussion.

Comparison with experimental results

To address the question which of the discussed properties can serve as an indicator of the catalytic activity, we compare our predictions to reported experimental results (Table 3). It should be stressed that the available experimental data are scarce, and results are difficult to compare quantitatively. We consider thermally and, for completeness, some photo-driven catalysis and thus also include supported metal catalysts with the considered oxides as support. Despite possibly different mechanisms for CO₂ conversion in the different types of catalysis, we believe that the properties of adsorbed CO₂ molecule can still serve as indicators of catalytic activity. Thus, it is possible that under such a daunting situation a reliable indicator of CO₂ activation can still be identified. As described below, our analysis confirms this hope.

Table 3.

The catalytic performance of materials which contain the sites from larger l(C–O)) or/and smaller OCO subgroups.

Material	Catalytic reaction	CO₂ adsorption energies, eV	Belong to subgroups
NaNbO₃	Photocatalytic CO₂ reduction with ~70% of CO selectivity^{46, 48}	−0.77 to −0.81	Materials with sites from l(C–O) > 1.30 Å subgroup and OCO < 132° subgroup with Sabatier principle constraint
LaAlO₃	Dry reforming of methane with Ni-nanoparticles; performance is higher than for Ni-La₂O₃ and Ni-Al₂O₃⁴⁵	−1.17
KNbO₃	Photocatalytic reduction of CO₂ into CH₄ as a composite with Pt/g-C₃N₄; significant improvement of activity when compared to Pt/g-C₃N₄; Pt-KNbO₃ is ~2.5 times more photoactive than Pt-NaNbO₃^{46, 47}	−0.56 to −0.68
CaTiO₃	CO₂ hydrogenation under UV-irradiation, although activity is not very high^{51, 57}; twice higher activity with Ni-nanoparticles⁵⁷	up to −2.70	Materials with sites from l(C–O) > 1.30 Å subgroups and from OCO < 132° subgroup without Sabatier principle constraint
CaZrO₃, SrZrO₃, BaZrO₃, SrTiO₃	Reverse water gas-shift reaction (RWGS) under 700–1100 °C⁴⁹	up to −2.75
SrTiO₃	Photocatalytic CO₂ methanation with Pt, Au-nanoparticles, significant decrease of activity during reaction⁵⁰	up to −2.40
YInO₃^a	No activity observed in photocatalytic CO₂ conversion⁵²	−1.16–−1.47	Materials with sites only from OCO < 132° subgroup without Sabatier principle constraint
CaO, SrO, BaO, Na₂O	Strong carbonation, candidate materials for carbon capture and storage (CCS)⁴⁴	−1.60 to −3.57
La₂O₃	Dry reforming of methane with supported Ni-nanoparticles; lower performance than on Ni-LaAlO₃⁴⁵ and on some other supported catalysts⁵⁴ at 700 and 250 °C correspondingly	−2.14 to −3.11
CaO	Twice smaller reaction rate in CO₂ reforming of methane reaction with supported Ni-nanoparticles than on Ni-La₂O₃⁵⁸ at 750 °C	−1.60 to −3.42
Ga₂O₃	Electrochemical reduction of CO₂ to formic acid⁵⁹; (photo)catalytic hydrogenation of CO₂⁶⁰	−0.74 to −1.34	Materials with sites from OCO < 132° subgroup with Sabatier principle constraint
Al₂O₃	Dry reforming of methane with supported Ni-nanoparticles⁶¹; lower performance than on Ni-LaAlO₃⁴⁵	−0.87

Open in a new tab

^aMaterials with sites also from OCO < 132° subgroup with Sabatier principle constraint.

First, we consider materials with the sites from subgroups obtained by minimization of OCO-angle without Sabatier principle constraint²⁷. For quite many materials from these subgroups, independent of the cutoff value, there are no reports of successful CO₂ conversion, even when they are used as supports for metal nanoparticles (Table 3). This is explained by the fact that absolute adsorption energies for these materials are above 2 eV (Fig. 2 left, Supplementary Table 4), indicating that their surfaces will be permanently poisoned by carbonate species at low or intermediate temperatures. This means that on materials with these sites hardly any reaction of CO₂ conversion can proceed at low, especially room temperature. Moreover, as shown in Table 3, even at increased temperatures, 700–750 °C, the activity of these materials is low. Some of them have been considered as candidates for carbon capture and storage (CaO, SrO, BaO, and Na₂O)⁴⁴, which implies the formation of stable carbonates rather than CO₂ transformation. Thus, we conclude that OCO-angle alone is not a good indicator of enhanced catalytic activity in CO₂ conversion.

On the other hand, several of the materials with sites from l(C–O) > 1.30 Å subgroups (independent on either with or without Sabatier principle constraint) are known as good materials for CO₂ conversion (Table 3) in different reactions proceeding at room or higher temperatures. For these sites, the absolute adsorption energies already satisfy the Sabatier principle (Fig. 2, left), as discussed above. We note that, contrary to what one may expect, there is no correlation between the adsorption energy and the value of l(C–O) (see Supplementary Fig. 5). Although there is a general trend, there are also significant variations in l(C–O) for given adsorption energy.

Interestingly, some of the materials with sites in the l(C–O) > 1.30 Å subgroups were studied as supports for metallic nanoparticles. For instance, Ni/LaAlO₃ is a catalyst for dry reforming of methane⁴⁵ at 700 °C. It was shown that its catalytic performance is higher in terms of CO₂ and CH₄ conversion rates compared to Ni/La₂O₃ and Ni/Al₂O₃⁴⁵. All sites on considered lanthanum (III) oxide surfaces belong to the subgroup of OCO < 132° without Sabatier constraint, whereas the sites on Al₂O₃ do not enter any of the two subgroups. KNbO₃ has been studied only with Pt nanoparticles and as a composite with g-C₃N₄ in photocatalytic reduction of CO₂ into CH₄^46,47. Pt-KNbO₃ is ~2.5 times more photoactive than Pt-NaNbO₃⁴⁶, whereas the NaNbO₃ is known to be photoactive even without nanoparticles⁴⁸. This seems to suggest that l(C–O) is a good indicator of CO₂ activation for both unsupported and supported catalysts even at increased temperatures. Hence, the other materials with the sites from this subgroup are promising new candidates for this task. The most promising materials identified in this work are CsNbO₃, CsVO₃, RbVO₃, LaScO₃, RbNbO₃, and NaSbO₃ as they have the sites from the larger l(C–O) subgroups satisfying the above-mentioned criteria.

There is also a set of materials [ternaries A²⁺B⁴⁺O₃ (A = Ca, Sr, Ba, B = Zr, Ti, Ge, Sn, Si) with a perovskite structure] containing both the surfaces with sites from the smaller OCO subgroups without Sabatier constraint and the surfaces with sites from the larger l(C–O) subgroups (Table 3). These two types of sites are located on different surfaces. Thus, based on the above results, a material for which a surface with sites from the l(C–O) > 1.30 Å subgroups has lower formation energy and is more abundant than the surface with sites from smaller OCO subgroups without Sabatier constraint is expected to be a good catalyst. To explore this possibility, we analyze the surfaces of these materials in more detail. Their most stable surfaces are AO-terminated (001) facets containing sites from the smaller OCO subgroup. The formation energies of ABO₃-terminated (110) surfaces with larger l(C–O) sites are higher: for BaZrO₃, SrZrO₃, CaZrO₃, and SrTiO₃ the differences in formation energies are 0.049, 0.027, 0.013, and 0.037 eV/Å², respectively. The zirconates and SrTiO₃ were found to catalyze the water gas-shift reaction under increased temperatures, 700–1100 °C⁴⁹. At room temperature the photocatalytic activity of SrTiO₃ was found to be significantly decreased⁵⁰. We attribute the latter finding to the strong carbonation of its most stable surface, which is consistent with the calculated high absolute value of CO₂ adsorption energy (−2.4 eV) for this surface. Thus, the activity of SrTiO₃ at 700 °C and higher temperatures is consistent with the estimates of the CO₂ chemical potential given above. The difference in formation energies of the most stable CaO-terminated (001) surface and the stoichiometric (110) surface for CaTiO₃ is less pronounced compared to zirconates and other titanates (CaO-terminated (001) is more stable than the (110) surface by only 0.009 eV/Å²). Thus, the (110) facets, which contain sites from the long l(C–O) subgroup, may be present on catalyst particles at the reaction conditions. This can explain the observed activity of CaTiO₃ in CO₂ conversion not only at high but also at room temperature. We note that the activity of this material was also attributed to the presence of TiO₂ nanoparticles on the surface⁵¹ at reaction conditions.

The OCO subgroup that includes most of the known good catalysts and a minimal number of inactive materials is OCO < 132° with Sabatier principle. It contains the sites on discussed above LaAlO₃, KNbO3, and NaNbO₃ catalysts, but also on non-active YInO₃ according to ref. ⁵² (Table 3). This subgroup contains in addition the sites on a well-known CO₂ conversion catalyst Ga₂O₃. We should mention that the catalytic activity of Ga₂O₃ has been attributed to its reducibility. According to Pan and coworkers⁵³ CO₂ molecules are activated via dissociation on surface O-vacancies. However, in ref. ⁵⁴ only one Ga₂O₃ (100) surface was considered for which no energetically stable CO₂ chemisorption structures were obtained with the PBE functional. We show in Supplementary Table 1 and Supplementary Fig. 1 that this functional underestimates CO₂ adsorption energies. Moreover, in our study we considered also other surfaces and found stable CO₂ chemisorption structures on these surfaces. Thus, activation of CO₂ on Ga₂O₃ can indeed proceed on O-atoms as discussed in our study, even without surface O-vacancies. The subgroups with small OCO cutoffs, 123° and 124°, do not contain any sites on known active or non-active catalysts.

OCO < 132° subgroup with Sabatier principle contains a large number of sites with elongated C–O bonds. The overlap of this subgroup with l(C–O) > 1.30 Å subgroups is 19 samples (70% of the latter).

To demonstrate the advantages of SGD over DTR in finding materials genes and their optimal combinations, we have done a comparison of found SGD subgroups with DTR performance for l(C–O). DTR terminal nodes (leaves) with the largest average l(C–O) (Supplementary Figs. 2 and 3) include surface sites on materials prone to extremely strong carbonation (Table 2), and also sites at which CO₂ prefers to physisorb, with l(C–O) = 1.17 Å. Also, one cannot check the effect of imposing the constraint as there is no standard way to mix regression and classification in DTR. Thus, DTR in contrast to SGD is not able to separate different activation modes and even fails sometimes in distinguishing activation from non-activation.

Best materials for CO₂ reduction among calculated ones

Now those good indicators of activation are identified (OCO with Sabatier principle and l(C–O)), all calculated materials can be ranked according to the value of these indicators (smaller OCO or larger l(C–O) indicate C–O bond weakening and therefore higher catalytic activity, provided adsorption energy is moderate). The resulting list of the most promising catalysts for CO₂ conversion is presented in Table 4. Each surface is characterized by maximum l(C–O) and minimum OCO among all inequivalent sites on that surface. The materials with l(C–O) > 1.30 Å are listed in the order of decreasing l(C–O). Materials with OCO < 132° but l(C–O) < 1.30 Å are appended at the bottom of the list in the order of increasing OCO.

Table 4.

Best materials and surface cuts for CO₂ activation according to the l(C–O) and OCO indicators.

Material	Surface cut	l(C–O), Å	OCO, degree	E_ads, eV	In l(C–O) > 1.30 Å subgroup	In OCO < 132° c. subgroup
According to l(C–O) indicator
NaSbO₃	100	1.370	125.21	−1.32	Yes	Yes
Ga₂O₃	212	1.365	124.57	−1.34		Yes
NaSbO₃	010	1.365	125.95	−1.09	Yes	Yes
LiSbO₃	010	1.359	126.66	−1.04		Yes
NaNbO₃	100	1.353	125.87	−0.78	Yes	Yes
ScAlO₃	010	1.351	127.25	−1.18		Yes
KSbO₃	110	1.345	128.54	−0.72	Yes	Yes
LiNbO₃	100	1.344	126.23	−0.87
NaNbO₃	010	1.344	126.85	−0.77	Yes	Yes
InScO₃	121	1.342	126.26	−1.23
CsNbO₃	100	1.34	126.6	−0.87	Yes
RbNbO₃	111	1.338	126.61	−1.37	Yes	Yes
CsNbO₃	010	1.336	126.23	−1.11	Yes
MgSnO₃	100	1.334	119.84	−1.58
GaAlO₃	100	1.332	129.12	−1.02		Yes
CaGeO₃	001(GeO₂-term.)	1.331	127.65	−0.75
InAlO₃-or.	121	1.33	130.09	−1.02
ScAlO₃	121	1.328	131.61	−0.86
GaInO₃	110	1.327	126.98	−1.34	Yes
LaAlO₃	110	1.327	129.38	−1.17	Yes	Yes
CsVO₃	110	1.327	126.1	−0.72	Yes
KNbO₃	110	1.327	128.49	−0.68	Yes	Yes
RbVO₃	110	1.326	126.04	−1.14
Ga₂O₃	110	1.325	127.76	−1.09		Yes
NaVO₃	110	1.324	127.12	−0.755	Yes
NaNbO₃	110	1.322	128.14	−0.805	Yes	Yes
InAlO₃-rh.	110	1.318	126.83	−0.73	Yes	Yes
LaGaO₃	100	1.317	125.29	−0.97	Yes
ScGaO₃	010	1.314	124.68	−1.06		Yes
GaInO₃	120	1.313	118.41	−1.43	Yes	Yes
MgGeO₃-tetr.	001(GeO₂-term.)	1.312	126.18	−1.35
ScAlO₃	100	1.312	122.28	−1.89		Yes
YAlO₃	011	1.312	127.26	−1.18	Yes	Yes
InScO₃	110	1.31	122.28	−1.54		Yes
In₂O₃	111	1.309	128.44	−0.65
InAlO₃-or.	110	1.309	127.2	−0.66		Yes
YAlO₃	100	1.308	123.82	−1.305	Yes	Yes
InScO₃	110(In₂O₃-term.)	1.305	124.92	−1.57		Yes
YGaO₃	100	1.305	124.76	−1.23
In₂O₃	110	1.301	125.86	−1.00
Sc₂O₃	111	1.301	130.43	−0.885
LaGaO₃	110	1.301	128.88	−0.83	Yes	Yes
LaScO₃	100	1.301	123.6	−1.53	Yes
according to OCO indicator
CaSiO₃	001(CaO-term.)	1.290	118.84	−1.54
SrSiO₃	001(SrO-term.)	1.295	119.10	−1.66
CaGeO₃	001(CaO-term.)	1.288	120.88	−1.94
Ga₂O₃	212	1.297	121.21	−1.53
InScO₃	110	1.292	121.23	−1.88
InScO₃	100	1.277	121.40	−1.74
RbVO₃	100	1.283	121.64	−0.53
In₂O₃	110	1.280	122.52	−1.57
InScO₃	110(In₂O₃-term.)	1.284	122.80	−1.78
SrGeO₃	100(SrO-term.)	1.277	122.90	−1.70
TiO₂-rutile	100	1.276	123.61	−1.05
ZrO₂	111	1.280	123.72	−0.92
BaSnO₃	001(BaO-term.)	1.267	123.80	−1.89
ScGaO₃	110	1.292	123.85	−1.22
ZrO₂	011	1.264	124.06	−0.72
LiVO₃	110	1.295	124.76	−0.70
NaNbO₃	010	1.273	125.00	−1.66
MgTiO₃	012	1.295	125.16	−1.47
InAlO₃-or.	010	1.284	125.30	−0.82		Yes
YInO₃	100	1.293	125.69	−1.47
KNbO₃	010	1.277	125.97	−1.52
InAlO₃-or.	110	1.278	126.04	−0.90
ScAlO₃	110	1.277	126.10	−1.33
Al₂O₃	012	1.265	126.46	−0.87		Yes
Sc₂O₃	110	1.265	126.47	−1.14
CaSiO₃	110(CaO-term.)	1.278	126.49	−1.44
LaInO₃	100	1.287	127.13	−1.27
Sc₂O₃	111	1.265	127.49	−0.95
YInO₃	110	1.298	127.61	−1.22		Yes
ScAlO₃	121	1.268	127.73	−0.755
MgTiO₃	001	1.265	127.85	−1.37
BaGeO₃	001(BaO-term.)	1.270	128.50	−1.80
SrTiO₃	001(TiO₂-term.)	1.266	128.53	−1.92
ZnO	10–10	1.270	128.60	−1.005
YGaO₃	110	1.263	128.68	−1.60
SrSnO₃	001(SnO₂-term.)	1.273	128.90	−1.64
Sc₂O₃	001	1.289	128.90	−1.70
MgGeO₃	001	1.260	128.93	−1.09
CaO	001	1.262	129.20	−1.60
Al₂O₃	001	1.283	129.22	−1.315
BaSnO₃	001(SnO₂-term.)	1.270	129.50	−1.87
CaSnO₃	001(SnO₂-term.)	1.272	130.09	−1.32
KVO₃	010	1.267	130.17	−0.55
CaZrO₃	101(ZrO₂-term.)	1.265	130.36	−1.86
CaSnO₃	110(SnO₂-term.)	1.272	130.50	−1.44
SrGeO₃	100(GeO₂-term.)	1.270	130.90	−1.515
CaTiO₃	101(TiO₂-term.)	1.266	131.42	−1.505
SnO₂	100	1.257	131.50	−0.85
BaSiO₃	100	1.243	131.60	−0.75
MgO	111	1.296	131.70	−1.24

Open in a new tab

Materials and surface cuts higher up in the list in Table 4 that belong to both l(C–O) > 1.30 Å and OCO < 132° subgroups are the most promising catalysts, followed by materials that belong to one of the subgroups, with the performance decreasing further down the list. Taking into account the number of active surface cuts and Sabatier principle, we conclude that NaSbO₃ is the most promising unexplored catalyst for temperatures up to 340 °C (for CO₂ pressures around 1 atm). Other A⁺¹B⁺⁵O₃ type promising materials are KSbO₃ (for temperatures up to 110 °C) and RbNbO₃ (up to 360 °C) that belong to both subgroups, and LiSbO₃ (230 °C), CsNbO₃ (260 °C), CsVO₃ (110 °C), NaVO₃ (130 °C), belonging to one of the subgroups (listed in the order of decreasing performance). There are also several promising A⁺³B⁺³O₃ oxides with surfaces belonging to one or both subgroups, listed in the order they appear first time in the table: ScAlO₃ (up to 550 °C), GaAlO₃ (230 °C), GaInO₃ (340 °C), rhombohedral InAlO₃ (120 °C)—these and other In-containing materials are of course very expensive, but we list them here for completeness, LaGaO₃ (210 °C), ScGaO₃ (240 °C), YAlO₃ (330 °C).

From Table 4 it can be seen that not all promising materials belong to one of the found subgroups. This means that there are other optimal materials gene combinations that are not identified by SGD as statistically significant based on the current dataset. Such combinations may be unique for a given material, or they may be found when more data for different materials are considered. Among these materials the most promising are: InScO₃ (up to 430 °C), MgSnO₃ (430 °C), CaGeO₃ (570 °C), orthorhombic InAlO₃ (230 °C), CaSiO₃ (420 °C), SrSiO₃ (460 °C), SrGeO₃ (480 °C), and BaSnO₃ (up to 550 °C).

Discussion

We have developed the subgroup-discovery strategy for finding improved oxide-based catalysts for the conversion of chemically inert molecules such as CO₂ into useful chemicals or fuels. For this purpose we identified a new indicator of CO₂ activation, namely the large C–O bond distance of the adsorbed molecule. This artificial-intelligence approach identifies the materials genes that correlate most strongly with the activation of the adsorbed molecule. Specifically, these are the following clean surface properties: Hirshfeld charges of O-atom at which CO₂ adsorbs (q_O) and of surface cations (q_min, q_max), surface geometric features [coordination descriptors Q_i, i = 5, 6, distances between the surface O-atom and the nearest surface cations (d_i, i = 1–3)], electrostatic potential and electric field above the adsorption site (Δφ, φ_2.6), polarizability and C₆ coefficients for surface atoms (C₆^min, C₆^O, α_max), radii of HOMO and LUMO of the cation species (r₊₁^max, r₊₁^min, r_HOMO^min), ionization potential, electron affinity, and electronegativity of surface cation species (IP_max, EA_max, EN_min), features of O 2p DOS (kurt, M, PC, U), conduction band minimum (CBM), energies of the lowest unoccupied projected eigenstates of surface cation species (L_max, L_min), and surface work function (W). The found subgroup selectors predict whether a given candidate material belongs to the class of promising catalysts. The peculiarity of the large C–O bond indicator is that it automatically satisfies Sabatier principle for low and middle-temperature CO₂ conversion.

The present study shows also that the previously proposed indicator for CO₂ activation, the decrease of the OCO-angle²⁷, is not appropriate and even correlates with strong adsorption so that poisoning by carbonation is likely which may be useful for carbon capture and storage (CCS) but not for carbon capture and utilization (CCU). When Sabatier principle is purposely included in the SGD search for small OCO, found subgroups substantially overlap with large l(C–O) subgroups (70%), although still contain a few sites on inactive materials for CO₂ conversion.

The subgroup analysis revealed an alternative mechanism of CO₂ activation by adsorption, namely bonding of an O-atom in CO₂ with a surface cation(s), combined with only moderate electron transfer from the surface to the molecule, which results not only in reduction of OCO-angles, but also in pronounced elongation and weakening of the C–O bond. Although the latter can be achieved also by a larger charge transfer, it results in stronger binding of CO₂ molecule to the surface and poisoning of the catalyst, contrary to the new mechanism. The same new mechanism is revealed when Sabatier principle is included when searching for small OCO subgroups.

We also demonstrated that a standard regression technique (DTR), which gives prediction models in a physically interpretable form similar to subgroup discovery (selectors based on identified descriptor), fails to identify the optimal combinations of materials genes and the activation in general. This failure is traced back to the fact that DTR is a global approach, which minimizes error in the prediction of the value of a target property for the whole dataset. As a result, different combinations of genes leading to the optimal value of the same target property are intermixed, and the combination that leads to the most optimal value is not identified. On the contrary, subgroup discovery finds unique local subsets in the data independent of the rest of the data. This makes it more suitable for identifying different combinations of materials genes that result in activation.

The other four considered potential indicators (charge at the adsorbed CO₂, adsorption induced dipole moment, the difference of charges on O-atoms and on C and O-atoms of adsorbed CO₂) were found to reproduce the results of SGD obtained for OCO-angles or C–O bond distances with significant overlap with corresponding subgroups.

Based on our results, we propose several new promising oxide-based catalysts for CO₂ conversion (Table 4). Although the present work has focused on oxides only, the overall strategy is general and can be applied to any other family of materials. This work also emphasizes the importance of documenting metadata and workflows for AI data analysis in materials science in order to ensure the reproducibility of AI models and data analysis results.

Methods

Ab initio calculations

The calculations are performed using density-functional theory (DFT) with the PBEsol exchange-correlation functional⁵⁵ as implemented in FHI-aims code⁵⁶ using ‘tight’ basis sets. The functional is chosen based on a comparison of calculated bulk lattice constants⁵⁵ and CO₂ adsorption energy to the available experimental results and high-level calculations (CCSD(T) and validated hybrid); see Supporting Information (SI) for more details on the computational setup. Nevertheless, it is expected that, because of the large set of systems inspected and the small variations introduced by the functional choice, the main trends will hold even when using another functional.

Studied materials

The dataset includes 71 semiconductor oxide materials, with 141 surfaces. The materials are ternary (ABO₃) and binary oxides with metal cations A and B from groups 1–5 (including La) and groups 12–15 of the periodic table. The full list of materials and surface cuts is given in Supplementary Notes, and the dataset is available in ref. ²⁶. In this study we considered only stoichiometric surface reconstructions obtained by atomic relaxation of stoichiometric bulk-like initial surface geometries. While this seems to be a limitation, our results show that indicators of activation calculated with this assumption correlate with experimental activity for known good oxide catalysts. This does not imply that surfaces of these materials do not reconstruct, but that the properties of unreconstructed surfaces can be used as descriptors for catalysis at reconstructed and defected surfaces under realistic conditions. The inclusion of surface reconstructions in the training data will further improve the predictions and will be a subject of future work.

The details of SGD

The SGD was done with the RealKD code (https://bitbucket.org/realKD/), modified to include quality functions described by Eqs. (1) and (2) in which the information gain was defined as:

u (p) = 1 - (\frac{- 1}{ln2}) (p \cdot \ln (p) + (1 - p) \cdot \ln (1 - p))

here p is the number of samples in a subgroup within the required adsorption energy range divided by the total number of samples in the subgroup. Since Shannon entropy is a symmetric parabola-like function around 0.5, we set here F(Z) = 0 for p ≤ 0.5. Also, x·ln(x) = 0 for x = 0. The search of subgroups is performed using a Monte-Carlo scheme adapted for these tasks³⁴.

The cutoff values x, y, ... used for setting propositions (feature-1 < x, feature-2 ≥ y, etc.) are obtained by k-means clustering, as implemented within RealKD. That is, for a desired number n = k − 1 of cutoff values a set of k representative values of a given feature and k groups (clusters) of the data points are determined that minimize the deviation of all the feature values from the representative values. Thus, each value of the feature in the dataset is assigned to a particular cluster, and the cutoffs are determined as the arithmetic mean between the closest feature values in neighboring clusters. The number k is a parameter, and different k-values can in principle result in different cutoff values. It is worth noting that, due to the stochastic Monte-Carlo sampling, the exact definitions of the subgroups may vary for consecutive runs of the SGD algorithm. We have tested k = 12, 14, and 16 and rerun the algorithm several times for each k. While the results indeed depend on the run and on the k value, the subgroups maximizing the quality function have largely or entirely overlapping populations, and selectors with the same or similar propositions. Here we report selectors that appear most often and have high population and quality function values.

Decision-tree regression

The DTR analysis was performed using Python scikit-learn libraries. DTR is a supervised learning method in which the training set is repeatedly split into patterns (so-called leaves) by means of propositions built from primary features. The fitting of a model is done with respect to the cost function, which encloses the deviation of fitted values of a target property from the actual values. In this study we considered two cost functions—mean squared error (MSE) and mean absolute error (MAE). The search for the most optimal partitioning (the so-called tree) is done with the greedy algorithm. To obtain the most optimal TR model, we used a standard approach for supervised machine learning—leave-one-out cross-validation with respect to the hyperparameters—minimal size of a leaf, maximal depth. The minimal size of a leaf is a bottom threshold of the population of a pattern, since too small size might result in overfitting. Maximal depth is a limit for the maximal number of splits in a tree.

Supplementary information

Supplementary Information^{(2.2MB, doc)}

Peer Review File^{(299.6KB, pdf)}

Acknowledgements

We thank Mario Boley for fruitful discussions on SGD and for providing the RealKD (for SGD) code. We also thank Yoshi Tateyama and Xinyi Lin for helping to generate the bulk oxide models and Helena Muñoz Galan and Oriol Lamiel Garcia for preliminary calculations. This project has received funding from the European Union’s Horizon 2020 research and innovation program (#951786: The NOMAD European Center of Excellence and the ERC grant #740233: TEC1p), the Spanish MICIUN/FEDER RTI2018-095460-B-I00 and María de Maeztu MDM-2017-0767 grants and, in part, by Generalitat de Catalunya 2017SGR13 grant, plus a generous allocation of computational time provided by the Red Española de Supercomputación—RES (QCM-2017-3-0006, QCM-2017-2-0005, QCM-2016-3-0005, QCM-2016-2-0007), and was supported by FAIRmat (FAIR Data Infrastructure for Condensed-Matter Physics and the Chemical Physics of Solids), DFG #460197019. The development of SGD approach was supported by Russian Science Foundation under grant 21-13-00419.

Author contributions

M.S. and F.I. suggested the specific scientific problem and the general idea on methodology, A.M., Y.W., R.V., and F.V. generated the dataset, S.V.L. developed SGD methodology and modified the RealKD code, A.M. applied AI methodology to analyze the data, A.M., S.V.L., L.M.G., and M.S. interpreted the results, A.M., L.M.G., S.V.L., and M.S. established the Jupyter notebook, A.M., S.V.L., and L.M.G. wrote the manuscript.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data availability

The dataset is available in the NOMAD AI Toolkit²⁶.

Code availability

A Jupyter notebook is available in the NOMAD AI Toolkit²⁶.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Aliaksei Mazheika, Email: alex.mazheika@gmail.com.

Sergey V. Levchenko, Email: s.levchenko@skoltech.ru

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-022-28042-z.

References

1.Arakawa H, et al. Catalysis research of relevance to carbon management: progress, challenges, and opportunities. Chem. Rev. 2001;101:953–996. doi: 10.1021/cr000018s. [DOI] [PubMed] [Google Scholar]
2.Olah GA. Beyond oil and gas: the methanol economy. Angew. Chem. Int. Ed. 2005;44:2636–2639. doi: 10.1002/anie.200462121. [DOI] [PubMed] [Google Scholar]
3.Olah GA, Goeppert A, Surya Prakash GK. Chemical recycling of carbon dioxide to methanol and dimethyl ether: from greenhouse gas to renewable, environmentally carbon neutral fuels and synthetic hydrocarbons. J. Org. Chem. 2009;74:487–498. doi: 10.1021/jo801260f. [DOI] [PubMed] [Google Scholar]
4.Martens JA, et al. The chemical route to a carbon dioxide neutral world. ChemSusChem. 2017;10:1039–1055. doi: 10.1002/cssc.201601051. [DOI] [PubMed] [Google Scholar]
5.Klankermayer J, Wesselbaum S, Beydoun K, Leitner W. Selective catalytic synthesis using the combination of carbon dioxide and hydrogen: catalytic chess at the interface of energy and chemistry. Angew. Chem. Int. Ed. 2016;55:7296–7343. doi: 10.1002/anie.201507458. [DOI] [PubMed] [Google Scholar]
6.Artz J, et al. Sustainable conversion of carbon dioxide: an integrated review of catalysis and life cycle assessment. Chem. Rev. 2018;118:434–504. doi: 10.1021/acs.chemrev.7b00435. [DOI] [PubMed] [Google Scholar]
7.Li W, et al. A short review of recent advances in CO2 hydrogenation to hydrocarbons over heterogeneous catalysts. RSC Adv. 2018;8:7651–7669. doi: 10.1039/c7ra13546g. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Singh AK, Montoya JH, Gregoire JM, Persson KA. Robust and synthesizable photocatalysts for CO2 reduction: a data-driven materials discovery. Nat. Commun. 2019;10:443. doi: 10.1038/s41467-019-08356-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Somorjai, G. A. & Li, Y. Introduction to Surface Chemistry and Catalysis, 2nd edn, 1–800. (John Wiley & Sons, 2010).
10.Nørskov, J. K., Studt, F., Abild-Pedersen, F. & Bligaard, T. Fundamental Concepts in Heterogeneous Catalysis. (John Wiley & Sons, Inc., 2014).
11.Thornton AW, Winkler DA, Liu MS, Haranczyk M, Kennedy DF. Towards computational design of zeolite catalysts for CO2 reduction. RSC Adv. 2015;5:44361. [Google Scholar]
12.Duyar MS, et al. Discovery of a highly active molybdenum phosphide catalyst for methanol synthesis from CO and CO2. Ang. Chem. Int. Ed. 2018;57:15045–15050. doi: 10.1002/anie.201806583. [DOI] [PubMed] [Google Scholar]
13.Peterson AA, Nørskov JK. Activity descriptors for CO2 electroreduction to methane on transition-metal catalysts. J. Phys. Chem. Lett. 2012;3:251–258. [Google Scholar]
14.Liu X, et al. Understanding trends in electrochemical carbon dioxide reduction rates. Nat. Commun. 2017;8:15438. doi: 10.1038/ncomms15438. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Schlexer Lamoureux P, et al. Machine learning for computational heterogeneous catalysis. ChemCatChem. 2019;11:3581–3601. [Google Scholar]
16.Kitchin JP. Machine learning in catalysis. Nat. Catal. 2018;4:230–232. [Google Scholar]
17.Medford AJ, Kunz MR, Ewing SM, Borders T, Fushimi R. Extracting knowledge from data through catalysis informatics. ACS Catal. 2018;8:7403–7429. [Google Scholar]
18.Foppa, L. et al. Materials genes of heterogeneous catalysis from clean experiments and artificial intelligence. MRS Bulletin.46, 1–11 (2021). [DOI] [PMC free article] [PubMed]
19.Kondratenko, E. V., Mul, G., Baltrusaitis, J., Larrazábal, G. O. & Pérez-Ramírez, J. Status and perspectives of CO₂ conversion into fuels and chemicals by catalytic, photocatalytic and electrocatalytic processes. Energy Environ. Sci. 6, 3112 (2013).
20.Li, J. et al. Volcano trend in electrocatalytic CO₂ reduction activity over atomically dispersed metal sites on nitrogen-doped carbon. ACS Catal. 9, 10426 (2019).
21.Frei, M. S., Mondelli, C., Short, M. I. M. & Pérez-Ramírez, J. Methanol as a hydrogen carrier: kinetic and thermodynamic drivers for its CO₂-based synthesis and reforming over heterogeneous catalysts. ChemSusChem.13, 6330 (2020). [DOI] [PubMed]
22.Martin, O. et al. Indium oxide as a superior catalyst for methanol synthesis by CO₂ hydrogenation. Angew. Chem. Int. Ed. 55, 6261 (2016). [DOI] [PubMed]
23.Richter NA, Sicolo S, Levchenko SV, Sauer J, Scheffler M. Concentration of vacancies at metal-oxide surfaces: case study of MgO(100) Phys. Rev. Lett. 2013;111:045502. doi: 10.1103/PhysRevLett.111.045502. [DOI] [PubMed] [Google Scholar]
24.Arndt S, et al. A critical assessment of Li/MgO-based catalysts for the oxidative coupling of methane. Cat. Rev. Sci. Eng. 2011;53:424–514. [Google Scholar]
25.Yan Z, Chinta S, Mohamed AA, Fackler JP, Goodman DW. The role of f-centers in catalysis by Au supported on MgO. J. Am. Chem. Soc. 2005;127:1604–1605. doi: 10.1021/ja043652m. [DOI] [PubMed] [Google Scholar]
26.Mazheika, A., Sbailò, L., Ghiringhelli, L., Levchenko, S. & Scheffler, M. Subgroup discovery for carbon-dioxide activation. https://nomad-lab.eu/aitoolkit/tutorial-CO2-SGD (2021).
27.Freund H-J, Roberts MW. Surface chemistry of carbon dioxide. Surf. Sci. Rep. 1996;25:225–273. [Google Scholar]
28.Austin, N., Butina, B. & Mpourmpakis, G. CO₂ activation on bimetallic CuNi nanoparticles. Prog. Natural Sci. Mater. Int. 26, 487–492 (2016).
29.Hirshfeld FL. Bonded-atom fragments for describing molecular charge densities. Theor. Chim. Acta. 1977;44:129–138. [Google Scholar]
30.Wrobel, S. in European Symposium on Principles of Data Mining and Knowledge Discovery, 78–87 (Springer, 1997).
31.Friedman JH, Fisher NI. Bump hunting in high-dimensional data. Stat. Computing. 1999;9:123–143. [Google Scholar]
32.Atzmueller M. Subgroup discovery. Data Min. Knowl. Discov. 2015;5:35–49. [Google Scholar]
33.Boley M, Goldsmith B, Ghiringhelli LM, Vreeken J. Identifying consistent statements about numerical data with dispersion-corrected subgroup discovery. Data Min. Knowl. Discov. 2017;31:1391–1418. [Google Scholar]
34.Goldsmith B, Boley M, Vreeken J, Scheffler M, Ghiringhelli LM. Uncovering structure-property relationships of materials by subgroup discovery. N. J. Phys. 2017;19:013031. [Google Scholar]
35.Xu, Z. & Kitchin, J. R. Relating the electronic structure and reactivity of the 3d transition metal monoxide surfaces. Catal. Commun. 52, 60 (2014).
36.Capdevila-Cortada M, Vilé G, Teschner D, Pérez-Ramírez J, López N. Reactivity descriptors for ceria in catalysis. Appl. Catal. B Environ. 2016;197:299–312. [Google Scholar]
37.Esterhuizen JA, Goldsmith B, Linic S. Uncovering electronic and geometric descriptors of chemical activity for metal alloys and oxides using unsupervised machine learning. Chem Catal. 2021;1:923–940. [Google Scholar]
38.Xu W, Andersen M, Reuter K. Data-driven descriptor engineering and refined scaling relations for predicting transition metal oxide reactivity. ACS Catal. 2021;11:734–742. [Google Scholar]
39.Grasselli RK. Fundamental principles of selective heterogeneous oxidation catalysis. Top. Catal. 2002;21:79–88. [Google Scholar]
40.Stull DR, Prophet H. JANAF thermochemical tables. J. Phys. Chem. 1974;78:2496–2506. [Google Scholar]
41.Wang W, Gong J. Methanation of carbon dioxide: an overview. Front. Chem. Sci. Eng. 2011;5:2–10. [Google Scholar]
42.Breiman L, Friedman J, Olshen R, Stone C. Classification and regression trees. New York: Wadsworth; 1984. [Google Scholar]
43.Novak PK, Lavrač N, Webb GI. Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J. Mach. Learn. Res. 2009;10:377–403. [Google Scholar]
44.Dunstan MT, et al. Large scale computational screening and experimental discovery of novel materials for high temperature CO2 capture. Energy Environ. Sci. 2016;9:1346–1360. [Google Scholar]
45.Kathiraser Y, Thitsartarn W, Sutthiumporn K, Kawi S. Inverse NiAl2O4 on LaAlO3–Al2O3: unique catalytic structure for stable CO2 reforming of methane. J. Phys. Chem. C. 2013;117:8120–8130. [Google Scholar]
46.Shi H, Zou Z. Photophysical and photocatalytic properties of ANbO3 (A=Na, K) photocatalysts. J. Phys. Chem. Sol. 2012;73:788–792. [Google Scholar]
47.Shi H, Zhang C, Zhou C, Chen G. Conversion of CO2 into renewable fuel over Pt–g-C3N4/KNbO3 composite photocatalyst. RSC Adv. 2015;5:93615–93622. [Google Scholar]
48.Fresno F, et al. CO2 reduction over NaNbO3 and NaTaO3 perovskite photocatalysts. Photochem. Photobiol. Sci. 2017;16:17–23. doi: 10.1039/c6pp00235h. [DOI] [PubMed] [Google Scholar]
49.Saito. Y. Catalyst for reverse shift reaction and method for producing synthesis gas using the same. Patent No.: US 8,540,898 B2; (2013).
50.Zeng S, Kar P, Thakur UK, Shankar K. A review on photocatalytic CO2 reduction using perovskite oxide nanomaterials. Nanotechnology. 2018;29:052001. doi: 10.1088/1361-6528/aa9fb1. [DOI] [PubMed] [Google Scholar]
51.Sub Kwak B, Kang M. Photocatalytic reduction of CO2 with H2O using perovskite CaxTiyO3. Appl. Surf. Sci. 2015;337:138–144. [Google Scholar]
52.Khraisheh M, Khazndar A, Al-Ghouti MA. Visible light-driven metal-oxide photocatalytic CO2 conversion. Int. J. Energy Res. 2015;39:1142–1152. [Google Scholar]
53.Pan Y-X, Liu C-J, Mei D, Ge Q. Effects of hydration and oxygen vacancy on CO2 adsorption and activation on β-Ga2O3(100) Langmuir. 2010;26:5551. doi: 10.1021/la903836v. [DOI] [PubMed] [Google Scholar]
54.Muroyama H, et al. Carbon dioxide methanation over Ni catalysts supported on various metal oxides. J. Catal. 2016;343:178–184. [Google Scholar]
55.Perdew JP, et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. Lett. 2008;100:136406. doi: 10.1103/PhysRevLett.100.136406. [DOI] [PubMed] [Google Scholar]
56.Blum V, et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 2009;180:2175–2196. [Google Scholar]
57.Lee JH. Cost-effective and dynamic carbon dioxide conversion into methane using a CaTiO3@Ni-Pt catalyst in a photo-thermal hybrid system. J. Photochem. Photobiol. A Chem. 2018;364:219–232. [Google Scholar]
58.Zhang Z, Verykios XE, MacDonald SM, Affrossman S. Comparative study of carbon dioxide reforming of methane to synthesis gas over Ni/La2O3 and conventional nickel-based catalysts. J. Phys. Chem. 1996;100:744–754. [Google Scholar]
59.Sekimoto T. Electrochemical application of Ga2O3 and related materials: CO2-to-HCOOH conversion. Jpn. J. Appl. Phys. 2016;55:1202. [Google Scholar]
60.Teramura K, Tsuneoka H, Shishido T, Tanaka T. Effect of H2 gas as a reductant on photoreduction of CO2 over a Ga2O3 photocatalyst. Chem. Phys. Lett. 2008;467:191–194. [Google Scholar]
61.Tang S, et al. CO2 reforming of methane to synthesis gas over sol–gel-made Ni/γ-Al2O3 catalysts from organometallic precursors. J. Catal. 2000;194:424–430. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information^{(2.2MB, doc)}

Peer Review File^{(299.6KB, pdf)}

Data Availability Statement

The dataset is available in the NOMAD AI Toolkit²⁶.

A Jupyter notebook is available in the NOMAD AI Toolkit²⁶.

[CR1] 1.Arakawa H, et al. Catalysis research of relevance to carbon management: progress, challenges, and opportunities. Chem. Rev. 2001;101:953–996. doi: 10.1021/cr000018s. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Olah GA. Beyond oil and gas: the methanol economy. Angew. Chem. Int. Ed. 2005;44:2636–2639. doi: 10.1002/anie.200462121. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Olah GA, Goeppert A, Surya Prakash GK. Chemical recycling of carbon dioxide to methanol and dimethyl ether: from greenhouse gas to renewable, environmentally carbon neutral fuels and synthetic hydrocarbons. J. Org. Chem. 2009;74:487–498. doi: 10.1021/jo801260f. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Martens JA, et al. The chemical route to a carbon dioxide neutral world. ChemSusChem. 2017;10:1039–1055. doi: 10.1002/cssc.201601051. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Klankermayer J, Wesselbaum S, Beydoun K, Leitner W. Selective catalytic synthesis using the combination of carbon dioxide and hydrogen: catalytic chess at the interface of energy and chemistry. Angew. Chem. Int. Ed. 2016;55:7296–7343. doi: 10.1002/anie.201507458. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Artz J, et al. Sustainable conversion of carbon dioxide: an integrated review of catalysis and life cycle assessment. Chem. Rev. 2018;118:434–504. doi: 10.1021/acs.chemrev.7b00435. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Li W, et al. A short review of recent advances in CO2 hydrogenation to hydrocarbons over heterogeneous catalysts. RSC Adv. 2018;8:7651–7669. doi: 10.1039/c7ra13546g. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Singh AK, Montoya JH, Gregoire JM, Persson KA. Robust and synthesizable photocatalysts for CO2 reduction: a data-driven materials discovery. Nat. Commun. 2019;10:443. doi: 10.1038/s41467-019-08356-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Somorjai, G. A. & Li, Y. Introduction to Surface Chemistry and Catalysis, 2nd edn, 1–800. (John Wiley & Sons, 2010).

[CR10] 10.Nørskov, J. K., Studt, F., Abild-Pedersen, F. & Bligaard, T. Fundamental Concepts in Heterogeneous Catalysis. (John Wiley & Sons, Inc., 2014).

[CR11] 11.Thornton AW, Winkler DA, Liu MS, Haranczyk M, Kennedy DF. Towards computational design of zeolite catalysts for CO2 reduction. RSC Adv. 2015;5:44361. [Google Scholar]

[CR12] 12.Duyar MS, et al. Discovery of a highly active molybdenum phosphide catalyst for methanol synthesis from CO and CO2. Ang. Chem. Int. Ed. 2018;57:15045–15050. doi: 10.1002/anie.201806583. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Peterson AA, Nørskov JK. Activity descriptors for CO2 electroreduction to methane on transition-metal catalysts. J. Phys. Chem. Lett. 2012;3:251–258. [Google Scholar]

[CR14] 14.Liu X, et al. Understanding trends in electrochemical carbon dioxide reduction rates. Nat. Commun. 2017;8:15438. doi: 10.1038/ncomms15438. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Schlexer Lamoureux P, et al. Machine learning for computational heterogeneous catalysis. ChemCatChem. 2019;11:3581–3601. [Google Scholar]

[CR16] 16.Kitchin JP. Machine learning in catalysis. Nat. Catal. 2018;4:230–232. [Google Scholar]

[CR17] 17.Medford AJ, Kunz MR, Ewing SM, Borders T, Fushimi R. Extracting knowledge from data through catalysis informatics. ACS Catal. 2018;8:7403–7429. [Google Scholar]

[CR18] 18.Foppa, L. et al. Materials genes of heterogeneous catalysis from clean experiments and artificial intelligence. MRS Bulletin.46, 1–11 (2021). [DOI] [PMC free article] [PubMed]

[CR19] 19.Kondratenko, E. V., Mul, G., Baltrusaitis, J., Larrazábal, G. O. & Pérez-Ramírez, J. Status and perspectives of CO₂ conversion into fuels and chemicals by catalytic, photocatalytic and electrocatalytic processes. Energy Environ. Sci. 6, 3112 (2013).

[CR20] 20.Li, J. et al. Volcano trend in electrocatalytic CO₂ reduction activity over atomically dispersed metal sites on nitrogen-doped carbon. ACS Catal. 9, 10426 (2019).

[CR21] 21.Frei, M. S., Mondelli, C., Short, M. I. M. & Pérez-Ramírez, J. Methanol as a hydrogen carrier: kinetic and thermodynamic drivers for its CO₂-based synthesis and reforming over heterogeneous catalysts. ChemSusChem.13, 6330 (2020). [DOI] [PubMed]

[CR22] 22.Martin, O. et al. Indium oxide as a superior catalyst for methanol synthesis by CO₂ hydrogenation. Angew. Chem. Int. Ed. 55, 6261 (2016). [DOI] [PubMed]

[CR23] 23.Richter NA, Sicolo S, Levchenko SV, Sauer J, Scheffler M. Concentration of vacancies at metal-oxide surfaces: case study of MgO(100) Phys. Rev. Lett. 2013;111:045502. doi: 10.1103/PhysRevLett.111.045502. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Arndt S, et al. A critical assessment of Li/MgO-based catalysts for the oxidative coupling of methane. Cat. Rev. Sci. Eng. 2011;53:424–514. [Google Scholar]

[CR25] 25.Yan Z, Chinta S, Mohamed AA, Fackler JP, Goodman DW. The role of f-centers in catalysis by Au supported on MgO. J. Am. Chem. Soc. 2005;127:1604–1605. doi: 10.1021/ja043652m. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Mazheika, A., Sbailò, L., Ghiringhelli, L., Levchenko, S. & Scheffler, M. Subgroup discovery for carbon-dioxide activation. https://nomad-lab.eu/aitoolkit/tutorial-CO2-SGD (2021).

[CR27] 27.Freund H-J, Roberts MW. Surface chemistry of carbon dioxide. Surf. Sci. Rep. 1996;25:225–273. [Google Scholar]

[CR28] 28.Austin, N., Butina, B. & Mpourmpakis, G. CO₂ activation on bimetallic CuNi nanoparticles. Prog. Natural Sci. Mater. Int. 26, 487–492 (2016).

[CR29] 29.Hirshfeld FL. Bonded-atom fragments for describing molecular charge densities. Theor. Chim. Acta. 1977;44:129–138. [Google Scholar]

[CR30] 30.Wrobel, S. in European Symposium on Principles of Data Mining and Knowledge Discovery, 78–87 (Springer, 1997).

[CR31] 31.Friedman JH, Fisher NI. Bump hunting in high-dimensional data. Stat. Computing. 1999;9:123–143. [Google Scholar]

[CR32] 32.Atzmueller M. Subgroup discovery. Data Min. Knowl. Discov. 2015;5:35–49. [Google Scholar]

[CR33] 33.Boley M, Goldsmith B, Ghiringhelli LM, Vreeken J. Identifying consistent statements about numerical data with dispersion-corrected subgroup discovery. Data Min. Knowl. Discov. 2017;31:1391–1418. [Google Scholar]

[CR34] 34.Goldsmith B, Boley M, Vreeken J, Scheffler M, Ghiringhelli LM. Uncovering structure-property relationships of materials by subgroup discovery. N. J. Phys. 2017;19:013031. [Google Scholar]

[CR35] 35.Xu, Z. & Kitchin, J. R. Relating the electronic structure and reactivity of the 3d transition metal monoxide surfaces. Catal. Commun. 52, 60 (2014).

[CR36] 36.Capdevila-Cortada M, Vilé G, Teschner D, Pérez-Ramírez J, López N. Reactivity descriptors for ceria in catalysis. Appl. Catal. B Environ. 2016;197:299–312. [Google Scholar]

[CR37] 37.Esterhuizen JA, Goldsmith B, Linic S. Uncovering electronic and geometric descriptors of chemical activity for metal alloys and oxides using unsupervised machine learning. Chem Catal. 2021;1:923–940. [Google Scholar]

[CR38] 38.Xu W, Andersen M, Reuter K. Data-driven descriptor engineering and refined scaling relations for predicting transition metal oxide reactivity. ACS Catal. 2021;11:734–742. [Google Scholar]

[CR39] 39.Grasselli RK. Fundamental principles of selective heterogeneous oxidation catalysis. Top. Catal. 2002;21:79–88. [Google Scholar]

[CR40] 40.Stull DR, Prophet H. JANAF thermochemical tables. J. Phys. Chem. 1974;78:2496–2506. [Google Scholar]

[CR41] 41.Wang W, Gong J. Methanation of carbon dioxide: an overview. Front. Chem. Sci. Eng. 2011;5:2–10. [Google Scholar]

[CR42] 42.Breiman L, Friedman J, Olshen R, Stone C. Classification and regression trees. New York: Wadsworth; 1984. [Google Scholar]

[CR43] 43.Novak PK, Lavrač N, Webb GI. Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J. Mach. Learn. Res. 2009;10:377–403. [Google Scholar]

[CR44] 44.Dunstan MT, et al. Large scale computational screening and experimental discovery of novel materials for high temperature CO2 capture. Energy Environ. Sci. 2016;9:1346–1360. [Google Scholar]

[CR45] 45.Kathiraser Y, Thitsartarn W, Sutthiumporn K, Kawi S. Inverse NiAl2O4 on LaAlO3–Al2O3: unique catalytic structure for stable CO2 reforming of methane. J. Phys. Chem. C. 2013;117:8120–8130. [Google Scholar]

[CR46] 46.Shi H, Zou Z. Photophysical and photocatalytic properties of ANbO3 (A=Na, K) photocatalysts. J. Phys. Chem. Sol. 2012;73:788–792. [Google Scholar]

[CR47] 47.Shi H, Zhang C, Zhou C, Chen G. Conversion of CO2 into renewable fuel over Pt–g-C3N4/KNbO3 composite photocatalyst. RSC Adv. 2015;5:93615–93622. [Google Scholar]

[CR48] 48.Fresno F, et al. CO2 reduction over NaNbO3 and NaTaO3 perovskite photocatalysts. Photochem. Photobiol. Sci. 2017;16:17–23. doi: 10.1039/c6pp00235h. [DOI] [PubMed] [Google Scholar]

[CR49] 49.Saito. Y. Catalyst for reverse shift reaction and method for producing synthesis gas using the same. Patent No.: US 8,540,898 B2; (2013).

[CR50] 50.Zeng S, Kar P, Thakur UK, Shankar K. A review on photocatalytic CO2 reduction using perovskite oxide nanomaterials. Nanotechnology. 2018;29:052001. doi: 10.1088/1361-6528/aa9fb1. [DOI] [PubMed] [Google Scholar]

[CR51] 51.Sub Kwak B, Kang M. Photocatalytic reduction of CO2 with H2O using perovskite CaxTiyO3. Appl. Surf. Sci. 2015;337:138–144. [Google Scholar]

[CR52] 52.Khraisheh M, Khazndar A, Al-Ghouti MA. Visible light-driven metal-oxide photocatalytic CO2 conversion. Int. J. Energy Res. 2015;39:1142–1152. [Google Scholar]

[CR53] 53.Pan Y-X, Liu C-J, Mei D, Ge Q. Effects of hydration and oxygen vacancy on CO2 adsorption and activation on β-Ga2O3(100) Langmuir. 2010;26:5551. doi: 10.1021/la903836v. [DOI] [PubMed] [Google Scholar]

[CR54] 54.Muroyama H, et al. Carbon dioxide methanation over Ni catalysts supported on various metal oxides. J. Catal. 2016;343:178–184. [Google Scholar]

[CR55] 55.Perdew JP, et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. Lett. 2008;100:136406. doi: 10.1103/PhysRevLett.100.136406. [DOI] [PubMed] [Google Scholar]

[CR56] 56.Blum V, et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 2009;180:2175–2196. [Google Scholar]

[CR57] 57.Lee JH. Cost-effective and dynamic carbon dioxide conversion into methane using a CaTiO3@Ni-Pt catalyst in a photo-thermal hybrid system. J. Photochem. Photobiol. A Chem. 2018;364:219–232. [Google Scholar]

[CR58] 58.Zhang Z, Verykios XE, MacDonald SM, Affrossman S. Comparative study of carbon dioxide reforming of methane to synthesis gas over Ni/La2O3 and conventional nickel-based catalysts. J. Phys. Chem. 1996;100:744–754. [Google Scholar]

[CR59] 59.Sekimoto T. Electrochemical application of Ga2O3 and related materials: CO2-to-HCOOH conversion. Jpn. J. Appl. Phys. 2016;55:1202. [Google Scholar]

[CR60] 60.Teramura K, Tsuneoka H, Shishido T, Tanaka T. Effect of H2 gas as a reductant on photoreduction of CO2 over a Ga2O3 photocatalyst. Chem. Phys. Lett. 2008;467:191–194. [Google Scholar]

[CR61] 61.Tang S, et al. CO2 reforming of methane to synthesis gas over sol–gel-made Ni/γ-Al2O3 catalysts from organometallic precursors. J. Catal. 2000;194:424–430. [Google Scholar]

PERMALINK

Artificial-intelligence-driven discovery of catalyst genes with application to CO2 activation on semiconductor oxides

Aliaksei Mazheika

Yang-Gang Wang

Rosendo Valero

Francesc Viñes

Francesc Illas

Luca M Ghiringhelli

Sergey V Levchenko

Matthias Scheffler

Abstract

Introduction

Results

CO2 activation

Fig. 1. Correlation between the larger of the two C–O bond lengths and the OCO-angle for charged gas-phase and adsorbed CO2.

Subgroup discovery

Table 1.

Results of the subgroup discovery

Table 2.

Fig. 2. Distribution of adsorption energies (left) and OCO-angles (right).

Comparison with experimental results

Table 3.

Best materials for CO2 reduction among calculated ones

Table 4.

Discussion

Methods

Ab initio calculations

Studied materials

The details of SGD

Decision-tree regression

Supplementary information

Acknowledgements

Author contributions

Peer review

Peer review information

Funding

Data availability

Code availability

Competing interests

Footnotes

Contributor Information

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Artificial-intelligence-driven discovery of catalyst genes with application to CO₂ activation on semiconductor oxides

CO₂ activation

Fig. 1. Correlation between the larger of the two C–O bond lengths and the OCO-angle for charged gas-phase and adsorbed CO₂.

Best materials for CO₂ reduction among calculated ones