Abstract
The crystal interaction density is generally assumed to be a suitable measure of the polarization of a low‐molecular weight ligand inside an enzyme, but this approximation has seldomly been tested and has never been quantified before. In this study, we compare the crystal interaction density and the interaction electrostatic potential for a model compound of loxistatin acid (E64c) with those inside cathepsin B, in solution, and in vacuum. We apply QM/MM calculations and experimental quantum crystallography to show that the crystal interaction density is indeed very similar to the enzyme interaction density. Less than 0.1 e are shifted between these two environments in total. However, this difference has non‐negligible consequences for derived properties.
Keywords: electron density, electrostatic potential, intermolecular interactions, polarization, protease inhibitor
The polarization of a drug molecule in an enzyme environment can be approximated by the polarization of the same molecule in its small‐molecule crystal structure. The degree of similarity is quantified in this study for a model compound of the protease inhibitor loxistatin acid in its interaction with cathepsin B by using interaction densities and interaction electrostatic potentials.
Introduction
Molecular recognition between an enzyme and a low‐molecular weight ligand is a key feature in the mode of action of biologically active molecules, and in consequence the most important factor in drug design. [1] The main components of molecular recognition are steric and electrostatic complementarity between the enzyme pocket and the active molecule. If the recognition process is simplified from the induced‐fit theory to the classic key–lock mechanism, [2] small‐molecule crystal structures can be used to approximate the correct three‐dimensional shape of the active molecule in the biological environment. In both a biological as well as a crystalline environment, the small molecule will conformationally adapt its shape to the prevailing intermolecular binding forces, so that the resulting bound state reflects both its inherent flexibility and the environment.[ 3 , 4 ] Pascard states that “in numerous cases, there is a nearly perfect correlation between small‐molecule structural results, and the observed binding in receptor‐substrate complexes”. [5] Such similarities, investigated many times at the geometrical level, [6] have led to the development of the method of composite crystal‐field environments by Klebe [7] and SuperStar [8] as part of the Cambridge Structural Database suite of software.
Electrostatic complementarity between an enzyme binding site and an active molecule is an aspect that goes beyond geometry and molecular conformation since the electrostatic potential is inherently related to the electron density distribution of each partner. [9] Electron densities of biologically active small molecules can computationally be estimated in different environments (isolated state, solution, crystalline state) or modeled from experimental X‐ray diffraction structure factors. [10] Experimental electron–density determinations of crystals of low‐molecular weight enzyme ligands give a detailed insight into intermolecular interactions which are relevant for the biological recognition process such as electrostatic forces, hydrogen bonding or van der Waals interactions. [11] Consequently, the numerous experimental electron‐density investigations of biologically active compounds were justified by assuming that the polarization of the molecule under scrutiny reflects the polarization in the enzyme. [12] However, this underlying assumption has only rarely been investigated,[ 10 , 13 ] and, to the best of our knowledge, never been quantified. In this study, we quantify the extent of similarity between the electron‐density distributions of a model compound of the drug E64c as computationally determined in vacuum, in aqueous solution, in the corresponding cathepsin B enzyme complex and in the crystal structure of the pure molecule. The electron‐density distribution of the compound in its crystal structure was also determined from a low‐temperature high‐resolution synchrotron X‐ray diffraction study.
The epoxysuccinyl peptide loxistatin acid (E64c) is a derivative of the natural product E64 [14] and a potent inhibitor of papain‐like cysteine proteases (CAC1 enzymes). [15] E64c irreversibly inhibits proteases through nucleophilic epoxide ring opening and formation of a new C−S covalent bond with the respective cysteine thiol group. This mode of action has been supported by mechanistic studies [16] based on the E64c‐cathepsin B complex crystal structure. [17] Furthermore, the crystal structure of the pure E64c ligand alone was determined in 2015. [4] It was envisaged that this ligand would be ideal for the anticipated experimental study examining the similarities and differences between the polarization of the electron density of a drug molecule in its crystal structure to that in an enzyme since so much computational information about E64c is available.[ 13 , 14 ] However, the four independent molecules of E64c in its crystal structure are related by pseudo‐symmetry and are heavily disordered, which makes them unsuitable for experimental electron‐density determinations. [4] In turn, the E64c model compounds that were in fact suitable for experimental ED investigations [18] do not fit into any of the known cysteine proteases targeted by E64c (in vivo or in silico) preventing a comparison between the crystal and enzyme environments. To overcome these challenges, we recently devised an alternative strategy which first involved synthesis of the model molecule (2S,3S)‐3‐aminocarbonyl‐oxirane‐2‐carboxylic acid (1H). [19] As in E64c, this new substrate contains both the required electrophilic epoxide moiety for the reaction with cysteine and the carboxyl anchor group for initial binding into the enzyme pocket.
At physiological pH values, the carboxylic acid group in E64c is deprotonated and hence binds to cathepsin B as a carboxylate anion. Therefore, in the present study we also deprotonated 1H yielding the anion 1 (Figure 1 a), which crystallizes as a potassium salt with one molecule of co‐crystallized water (1K⋅H2O, Figure 1 b). Water is abundant in the E64c‐cathepsin B complex, too, and the K+ ion in the crystal structure replaces the histidinium residue of cathepsin B in the complex, so that the type of intermolecular interactions in the crystal studied here can be expected to mirror those inside cathepsin B (section “Comparison of intermolecular interactions” in Results and Discussion). A single crystal of 1K⋅H2O was measured to high resolution using synchrotron radiation at BL02B1, SPring‐8, and the electron‐density distribution was subsequently modelled experimentally. For this purpose, we employed X‐ray Wavefunction Refinement (XWR, [20] ), a novel technique combining Hirshfeld Atom Refinement (HAR, [21] ) and X‐ray constrained wavefunction (XCW) fitting.[ 22 , 23 ] In silico, the small anion 1 fits into the binding pocket of cathepsin B, so that its electron density could be compared between the following environments: in the crystal structure (from XWR, model X; from QM/MM calculations, model C for crystal; embedded in a self‐consistent field of cluster point charges, model CC for cluster charges), inside the enzyme cathepsin B (from QM/MM calculations, model P for protein), in solution (COSMO solvation model S) and in vacuum (model G for gas phase).
Intermolecular interactions between molecules lead to a polarization of each molecular electron density. In crystallography, the interaction density is defined as the difference between the electron density of a molecule in a crystal and the electron density of the same molecule with identical geometry in an assembly of non‐interacting molecules (in vacuo model). In biology, the difference between the molecule bonded to the active site and the molecule solved in water outside the active site is decisive. Nevertheless, the interaction density of biologically active small molecular compounds has often been used as a first approximation to the polarization of the ligand in the enzyme. For example, Dittrich and Matta argue that the “interaction density can be seen as an idealized situation that is analogous to drug‐receptor interactions and the redistribution of electron density of a drug molecule in the active site.” [24]
It is an open question if the magnitude of the interaction density is inside or outside experimental errors for charge‐density experiments, which means it is unclear if the interaction density can be measured. [25] Most of the discussion in the past was focused on the accuracy of the multipole model for extracting interaction‐density characteristics. [26] Alternatively, the interaction density was calculated based on periodic‐boundary theoretical computations. [27] Only very recently, an alternative approach to the derivation of the interaction density was discussed, namely the comparison of the non‐fitted and maximally fitted molecular wavefunction according to the X‐ray constrained wavefunction fitting technique. [28] The non‐fitted wavefunction produces the in vacuo electron density of the molecule under examination, whereas the fitted wavefunction includes the polarization of the molecule by the crystalline environment via the measured X‐ray structure factors. This new method was described mainly for an application to theoretically calculated X‐ray structure factors. In this work, we test this method against experimental structure factors for the compound 1K⋅H2O (section “Interaction density and interaction electrostatic potential” in Results and Discussion).
Although the interaction density directly reflects the polarization of the molecule by its environment, for the molecular recognition process the electrostatic potential is the crucial property. Electron density and electrostatic potential are fundamental properties of a wavefunction and in this study both are calculated directly from the (experimentally constrained) wavefunctions. Beyond this relationship, there are also interesting similarities and differences between the topologies of their scalar fields and how these are interpreted with respect to chemistry and biology. [29] However, to the best of our knowledge, the electrostatic‐potential analogue to the interaction density has never been explored before. We refer to this difference between in vacuo and crystal electrostatic potential as interaction electrostatic potential (or simply interaction ESP) since the term interaction potential has a different meaning in the framework of QM/MM calculations. [30]
As discussed above, interaction density (and consequently also interaction ESP) are crystallographic terms, denoting the difference between the crystalline and the in vacuo states at identical geometry. However, for the simplicity of the argument, here we generalize both terms to denote the difference between the polarized (crystal X or C or CC, enzyme P, solution S) and unpolarized (in vacuo, gas phase, G) states at identical geometry (section “Interaction density and interaction electrostatic potential” in Results and Discussion). In addition, we have developed a technique to compare the electron densities of different optimized molecular geometries, which will be introduced in section “Bond‐wise comparison of difference densities” (Results and Discussion). In section “Comparison of derived properties”, we compare derived properties such as the Laplacian of the electron density and atomic charges in different environments.
Results and Discussion
Comparison of intermolecular interactions
Initially, a comparison between the intermolecular interaction network of 1 inside cathepsin B and that inside the crystal structure of 1K⋅H2O (Figure 2) is discussed. For that purpose, molecular dynamics (MD) input was prepared by taking the E64c‐cathepsin B complex crystal structure [17] and pruning off all atoms of E64c that are not part of the simplified 1. After initial equilibration in a water box, molecular dynamics were simulated over 5 ns. The interaction of the carboxylate group with His+‐199 and Cys‐29 (Figure 2, left) is the electrostatic anchor that keeps 1 closely bonded inside the enzyme pocket during the entire time of the simulation (Figure 3, O3‐HN ). The contact with carboxylate atom O2 importantly keeps Cys‐29 close to the epoxide ring, eventually resulting in an interaction between the sulfur atom of Cys‐29 and the epoxide carbon atom 1, which prepares the ring‐opening reaction of E64c to irreversibly form a covalent C−S bond and inactivate the cysteine protease. [16] Figure 3 shows that the C1−SCys‐29 contact remains stable throughout the MD simulation with 1 in cathepsin B implying that the model compound used here indeed represents the major characteristics of E64c as it would act in the same bonding environment. However, the amine group of 1 points into the artificially produced cavity, produced by shortening of the ligand compared to E64c, which is now filled with water molecules. Figure 2, left, shows that the nearest enzyme residues are Gly‐27 and Gly‐74, but as can be seen in Figure 3, these contacts are rather long, giving rise to conformationally flexible and rotating amide groups.
Geometry snapshots were taken every 250 ps along the MD simulation between 4 and 5 ns. Starting from these, a total of four different QM/MM geometry optimizations were carried out. The resulting geometries are similar but not identical due to the multi‐minima nature of protein conformations. Table 1, left column, shows that the highest deviations for the various intermolecular contacts of the four final geometries vary between 0.16 and 0.01 Å, so that it is meaningful to report and interpret their average for a direct comparison between the protein and crystal environments (yielding model P for the following sections).
Table 1.
1 in cathepsin B |
1 in 1K⋅H2O |
||
---|---|---|---|
Contact |
Distance [Å][a] |
Contact |
Distance [Å][b] |
O1⋅⋅⋅H−O |
2.89 (4) |
O1⋅⋅⋅K+ |
2.9092 (4) |
N1−H3⋅⋅⋅O |
2.76 (7) |
N1−H3⋅⋅⋅O2 |
3.0157 (6) |
N1−H4⋅⋅⋅O |
2.98 (16) |
N1−H4⋅⋅⋅O2 |
3.0443 (6) |
O2⋅⋅⋅H−O |
2.69 (4) |
O2⋅⋅⋅H−O |
2.7468 (6) |
O3⋅⋅⋅H−N |
2.69 (3) |
O3⋅⋅⋅K+ |
2.6210 (5) |
O4⋅⋅⋅H−O |
2.65 (1) |
O4⋅⋅⋅K+ |
2.6925 (4) |
[a] The distances are the average of 4 QM/MM optimized geometries (see Experimental Section). The numbers in brackets are the sample standard deviations in the last shown digit(s). [b] The distances are the crystallographically refined geometries from the XWR procedure. The numbers in brackets are the standard uncertainties in the last shown digit from the refinement.
For every hydrogen bonding donor or acceptor, there is either a hydrogen bond or an electrostatic contact in both these environments. The carboxylate group (atoms O2 and O3) forms one hydrogen bond with water as acceptor and one contact dominated by electrostatics (either positive histidinium ring or positive K+ ion) each. The amine group forms two hydrogen bonds to water inside the enzyme pocket, and two hydrogen bonds to a symmetry‐generated carboxylate atom O2 in 1K⋅H2O. The epoxide oxygen atom O1 forms a hydrogen bond or an electrostatic contact to K+, respectively, with the same distance of 2.9 Å. Remarkably, the longest of the contacts coincide in both environments, and the shortest one in 1K⋅H2O coincides with the second‐shortest one in the enzyme, which is, however, the most important one, namely the electrostatic anchor identified in Figure 3.
In summary, a qualitative comparison of the intermolecular interaction networks in both crystal and enzyme environments shows some systematic similarities that lead us to believe that a comparison of the polarization of the electron density in both environments will be meaningful. However, Figure 2 also shows that the conformations of the carboxylate and the amide groups in both environments are significantly different, so that a direct comparison of the electron densities belonging to individual bonds will be difficult. This problem will be tackled in section “Bond‐wise comparison of difference densities”, whereas the next section deals with the interaction densities and interaction electrostatic potentials of the entire anion 1.
Interaction density and interaction electrostatic potential
The interaction densities (Figure 4) and interaction electrostatic potentials (ESPs, Figure 5) were calculated according to the ideas discussed in the Introduction. They represent the differences between anion 1 in its polarized (crystal X or C or CC, enzyme/protein P, solution S) and unpolarized (in vacuo, gas phase, G) states at identical geometry. In detail, Figures 4 and 5 depict the following model differences:
4 a/5 b,c: model P minus model G at the QM/MM‐optimized geometry of P;
4 b/5 e,f: model C minus model G at the QM/MM‐optimized geometry of C;
4 c/5 h,i: polarization of 1 with a cluster of Hirshfeld‐atom point charges and dipoles (model CC, cluster charges) minus model G at the experimental geometry from X‐ray diffraction;
4 d/5 k,l: effect of X‐ray constrained wavefunction fitting:[ 22 , 28 ] X‐ray constrained model X minus model G at the experimental geometry from X‐ray diffraction.
In the QM/MM calculations, only 1 was treated quantum‐mechanically, but everything else including water and potassium ions was treated with molecular mechanics, so that the polarization features seen in Figures 4 a,b and 5 a–f are purely caused by electrostatics, with limited dispersion and no covalent component. The calculations of Figures 4 c,d and 5 g–l, however, required the definition of at least the asymmetric crystal unit, which explicitly includes water and potassium ions. In the Experimental Section, we describe the methodology developed here to isolate only the electrostatic component of the interactions.
The interaction densities in Figures 4 a,b look visually similar. The similarity is even more obvious in the interaction ESP plots, comparing Figure 5 b with 5 e and 5 c with 5 f. It must be noted that the conformations of the amide and carboxylate groups are different in crystal and enzyme environments, so that the lobes around the carboxylate group in Figure 5 f need to be turned by 90° to match the shape of the lobes in Figure 5 c. This means that the same functional groups are qualitatively polarized in the same way in both environments P and C. For example, all oxygen lone pairs show red regions in the interaction densities and ESPs, which means that they accumulate electron density and become more negatively polarized in the P and C environments than in the in vacuo state G. All hydrogen atoms are covered in blue surfaces, which means that they lose electron density relative to the isolated state. Both effects are due to interaction with neighboring non‐covalent bonding partners, for example, via hydrogen bonding, which is characterized by charge transfer.
In absolute numbers, the polarization effect on the ED and ESP is small; the difference values are low. For the interaction density, the maximum value in the grid file is ±0.1 e Å−3 compared to maximum values of up to 265 e Å−3 in the grid file of the total ED. For the interaction ESP, the maximum values in the grid file are ±0.08 e Å−1 compared to maximum values in a range of [−0.31, 782] e Å−1 for the total ESP. A meaningful isovalue for the representation of the interaction densities is 0.0067 e Å−3, whereas it is around 0.03 e Å−1 for the interaction ESP. The real‐space R‐values (RSR) and Jaccard distances (Jd) are still close to zero. (For a definition of these quantities, please see the Experimental and Computational Section.) An integration of the absolute values of the interaction density inside the grid files (N e=number of electrons) is a measure of the charge transfer caused by the polarization. A total of 0.56 electrons are shifted in the protein environment and 0.43 e in the crystal environment (Figures 4 a and b), indicating that, overall, 1 is slightly more polarized by the enzyme than the crystal.
Dittrich et al. suggested to use different point‐charge and ‐dipole models to simulate the crystal field effect and produce the interaction density. [27c] Following this idea, we used a self‐consistent Hirshfeld cluster‐charge model as it is normally employed in Hirshfeld Atom Refinements.[ 21b , 33 ] Figure 4 c) shows the effect of this field on the ED: qualitatively, the regions of accumulation and depletion are the same around oxygen and hydrogen atoms in comparison to the QM/MM results, and quantitatively, RSR, Jd and Ne values are in‐between the respective values for the QM/MM results. However, the regions of ED polarization are wider and more diffuse. This has strong consequences on the ESP, see Figures 5 h and i. The polarization of all the oxygen atoms by the surrounding positive point charges is much stronger than in the QM/MM models: the negative isovalue in Figure 5 i is 2.5 times higher and the RSR value is 3 times higher. Hence, it is clear that the Hirshfeld cluster‐charge model overestimates the polarization effect by the crystal field.
Finally, following an idea by Ernst et al., [28] we have employed XCW fitting against the experimental synchrotron X‐ray structure factors of 1K⋅H2O to map the crystal field effect on 1. The experiment contains more information, for the better or worse, than the quantum‐mechanical ansatz used for the XCW. Beyond polarization via the crystal field, additional effects are electron correlation, [34] and, unfortunately, systematic errors, with radiation damage being the most important one here. These effects are also included into the X‐ray constrained wavefunction, so that the result is not a pure interaction density or interaction ESP. [23] However, since a hybrid‐DFT functional was used in the fitting, we assume that the majority of the electron correlation effect is included already in the unperturbed wavefunction before fitting, so that the major effect presented here is indeed due to the crystal field and the experimental error.
Figure 4 d shows that the effect of the XCW fitting on the ED is much more pronounced than that of the theoretical models where the polarization effect could be treated in an isolated way. The spread of the isosurfaces as well as the RSR and N e values are much larger, showing that 1.88 e are shifted between the unperturbed and the fitted wavefunction, which is nearly four times more than in the QM/MM and point‐charge models. However, qualitatively the experimental information is still meaningful as the polarization pattern in the interaction density is still the same (red at oxygen, blue at hydrogen, see also representations at different isovalues in the Supporting Information, Figures S8). In the interaction ESP (Figures 5 k,l), the polarization of the carboxylate group is unsymmetrical, that is, one oxygen atom becomes more and one less negatively polarized in the crystal environment, which is the only qualitative difference to the theoretical results in Figures 5 a–i. The RSR value of 0.045 is close to those of the QM/MM calculations.
The high‐resolution low‐temperature synchrotron X‐ray diffraction data of 1K⋅H2O used in this study were prone to radiation damage, which was corrected with a scaling procedure (see Supporting Information). It is conceptually difficult within XCW fitting to determine to what extent the data quality is affected, which will in turn affect the results presented here. Therefore, we decided to perform a multipole modeling according to Hansen and Coppens [35] on the experimental and on additional theoretically calculated structure factors with the sole purpose of separating model from data effects, which is discussed in detail in the Supporting Information. We find that significant discrepancies are caused by the model used to treat the experimental data and not by data inaccuracy alone.
In summary, qualitatively and quantitatively the polarization of the electron density and the ESP of 1 in crystal and enzyme environments is similar, with a slightly larger effect in the enzyme. Hirshfeld cluster charges overestimate the crystal field effect, whereas XCW fitting against the experimental structure factors captures the effect with a strong bias in the ED and a smaller bias in the ESP. Since every model for experimental X‐ray data treatment, such as the XCW or the multipole model, includes theoretical assumptions, and since the data always consist of a convolution of many physical effects, we believe that the refinement of experimental data might be useful, but data can never simply be trusted, as also demonstrated in ref. [37] In this context, multipole‐model derived electron densities were compared to the crystal electron densities calculated from QM/MM approaches at various different levels of theory in previous studies.[ 38 , 39 , 40 ] However, it remains unclear, here and in general, whether the level of data accuracy and the convolution of different physical effects in the data yields physically useful and meaningful derivations of the interaction density and ESP from the experiment.[ 25 , 26 ] Therefore, in the next section we concentrate on details of the difference densities from theory alone.
Bond‐wise comparison of difference densities
One disadvantage of the interaction density is the reference to isolated in vacuo systems. Therefore, we now use difference densities that use the protein environment as reference. However, a conceptual difficulty to overcome is that such comparisons are necessarily based on different molecular geometries. The molecular conformations of 1 obtained from geometry optimization in the crystal structure (QM/MM, model C), inside the enzyme cathepsin B (QM/MM, model P), in solution (COSMO, model S) and in vacuum (model G) are significantly different, as visualized in Figure 6 for the C and P geometries.
To account for changes of bond distances and angles, only bond‐centered grids were analyzed that were chosen to be as small as possible and scaled to the respective bond lengths, that is, the sample point separation was adjusted relative to the bond lengths (Figure 7, more details in the Experimental and Computational Section). Following this approach, only minor contaminations of the difference densities in the corners of the boxes used to define the grids are observed.
It is expected that the effect of polarization is most pronounced for bonds that interact directly with the environment via hydrogen bonding. Whereas the carboxylate bonds C3−O2 and C3−O3 that carry the negative charge are less affected, Figure 8 shows the density differences for the most affected bonds C4−O4 and C4−N1. The discrepancy between models G and P is the largest, significantly reduced in the solvation model S, and nearly vanished in the crystal environment C. This implies that the crystal environment is indeed a good model of the larger enzyme environment. For the N−H bonds in Figure 8, the G‐vs.‐P difference is always the most pronounced as for the C−N and C−O bonds. However, the trend between S‐vs.‐P and C‐vs.‐P is more ambiguous. In addition, for the N−H bonds the impurities in the corners of the boxes are higher.
Nevertheless, RSR values were calculated for every bond‐centered difference grid file and graphically summed up in Figure 9 for the same differences as in Figure 8 (G/S/C vs. P), but in addition the K+ countercation was explicitly included in the G and S calculations. K+ is located in proximity to the carboxylate group replacing the His+‐199 group in the protein environment (Figure S9 in the Supporting Information). This first polarization by K+ already reduces the differences between the G and the P environments, as it introduces a first polarizing field acting upon the molecule. This shows that the COSMO solvation model S produces already a good approximation to the polarization present in the enzyme, [13a] so the influence of K+ becomes negligible. However, the crystal environment is the best approximation to the protein environment, and if the bond N1−H4 was not an outlier, the effect would be visually even more compelling. In total, the similarity improves from an RSR sum of 0.171 for model G (w/o K) to 0.072 for model C, which is a drop of 0.1 RSR points or 58 %.
Comparison of derived properties
As the second derivative of the electron density, the Laplacian is very susceptible to any method or model change. Therefore, we decided to plot it along the C4−N1 bond, which is the one most affected by the environmental influences as shown in Figures 8 and 9. Figure 10 shows the progression of the Laplacian along this bond for all different models including the Hirshfeld cluster‐charge model (CC) and the XCW‐fitted model (X). The minima at ca. 0.42 and 0.85 Å are the valence‐shell charge concentrations (VSCCs) inside this polar covalent bond. With the P‐results serving as the reference as in the previous section, the deeper VSCC at the nitrogen atom is well described by all theoretical models except the CC model, which overestimates the charge concentration. The experimental data in the XCW fitting model significantly deviate from all other results, producing a shift of the nitrogen VSCC by about 10 e Å−5 and 0.02 Å. This effect was previously observed and reported for the multipole model.[ 38 , 39 ]
The deviations of all models relative to the model P are best visible in the middle of the bond towards the carbon atom VSCC. Model G differs the most from the protein model P, which is slightly improved when the counter‐cation K+ is accounted for. The results are significantly closer to the reference model P for both solvation models (S), but the closest match exists between the crystal (C) and protein model. These findings agree with those in Figure 9 for the difference densities. The model CC has the most negative Laplacian value (highest electron concentration) of the theoretical models everywhere along the bond, but it is still closer to the models P and C than the model G. In contrast to the model CC, whereas the XCW fitting model shows the least negative Laplacian value for the nitrogen VSCC, it shows the most negative Laplacian value in the middle of the bond, deviating significantly from all other models. As discussed in section “Interaction density and interaction electrostatic potential”, it is difficult to know whether these deviations are caused by data inaccuracy or the convolution of many different physical effects—not only the crystal field effect whose treatment is the only difference between all the theoretical models. A comparison with Figure S7 in the Supporting Information shows, however, that the multipole model using both the experimental and theoretically generated structure factors leads to hugely different results of the Laplacian in this polar covalent C−N bond, and relative to that result the XCW fitting model agrees well with the theoretical models (compare a related discussion in ref. [42] ).
The atomic charges (Table 2, here calculated according to Bader's QTAIM [43] ) are directly related to the electrostatic potentials. All the oxygen atoms are most negatively charged in the protein environment, losing negative charge from models C toward model S, and being least negatively charged in the in vacuo state (models G). The same trend holds for the positive charges of the hydrogen atoms. For both oxygen and hydrogen atoms, the crystal environment C produces the atomic charges that are most similar to the protein environment. All these trends are reflected in the interaction ESP plots in the same way (Figure 5). For the carbon atoms, which are not involved in the hydrogen bonding network, the same trend is true, but less pronounced: the positive charges are highest in the P model and lowest in the G models.
Table 2.
Atom |
G (w/o K) |
G (w K) |
S (w/o K) |
S (w K) |
C |
P |
CC |
XCW |
---|---|---|---|---|---|---|---|---|
O1 |
−0.875 |
−0.852 |
−0.903 |
−0.898 |
−0.899 |
−0.907 |
−0.928 |
−1.140 |
O2 |
−1.212 |
−1.227 |
−1.257 |
−1.247 |
−1.267 |
−1.303 |
−1.346 |
−1.266 |
O3 |
−1.216 |
−1.230 |
−1.256 |
−1.245 |
−1.258 |
−1.304 |
−1.276 |
−1.389 |
O4 |
−1.151 |
−1.133 |
−1.193 |
−1.192 |
−1.222 |
−1.277 |
−1.225 |
−1.276 |
N1 |
−1.105 |
−1.126 |
−1.153 |
−1.152 |
−1.201 |
−1.165 |
−1.185 |
−1.563 |
C1 |
0.356 |
0.359 |
0.356 |
0.357 |
0.360 |
0.382 |
0.398 |
0.325 |
C2 |
0.335 |
0.364 |
0.341 |
0.346 |
0.349 |
0.390 |
0.377 |
0.637 |
C3 |
1.668 |
1.581 |
1.627 |
1.611 |
1.614 |
1.650 |
1.724 |
1.820 |
C4 |
1.350 |
1.404 |
1.409 |
1.412 |
1.393 |
1.407 |
1.500 |
1.583 |
H1 |
0.052 |
0.043 |
0.071 |
0.074 |
0.068 |
0.094 |
0.053 |
−0.010 |
H2 |
0.029 |
0.072 |
0.066 |
0.072 |
0.063 |
0.084 |
0.053 |
0.150 |
H3 |
0.387 |
0.396 |
0.445 |
0.446 |
0.506 |
0.495 |
0.457 |
0.647 |
H4 |
0.384 |
0.409 |
0.448 |
0.449 |
0.495 |
0.456 |
0.435 |
0.569 |
K |
– |
0.940 |
– |
0.968 |
– |
– |
0. 960/ 0. 971 |
0.925/ 0.957 |
On average, the charges produced in the Hirshfeld cluster‐charge model (CC) are slightly higher than those in the QM/MM crystal model C, sometimes even higher than in the enzyme model P — an overestimation of the polarization already discussed with respect to Figure 5 h,i. However, the atomic charges produced after XCW fitting are significantly higher than the charges of any of the other models. Despite these discrepancies, chemically meaningful charge differences of the same element in different functional groups are always preserved. This refers to the lower negative charge of the epoxide oxygen atom O1 or the higher positive charges of the carboxyl and carbonyl carbon atoms C3 and C4 compared to the epoxide carbon atoms C1 and C2.
Conclusions
In this study, we used model compound 1 which consists of the pure deprotonated epoxysuccinyl peptide group that is the biologically active center of the drug loxistatin acid (E64c). In molecular dynamics simulations, 1 showed to exhibit the same stable intermolecular contacts inside the enzyme pocket of cathepsin‐B that lead to the irreversible inhibition of this protease by E64c. Moreover, in the crystal structure 1K⋅H2O intermolecular interactions (hydrogen bonds and electrostatic contacts) are formed that are very similar to those in the enzyme pocket, involving biologically important water molecules and potassium cations. The K+ ion is located in the crystal where the His+ group is located in cathepsin B.
Detailed analyses of the interaction densities and electrostatic potentials as well as of the difference electron densities show unambiguously that the crystal environment, as described by QM/MM, is significantly more similar to the protein environment than any of the other environments (a cluster of Hirshfeld point charges and dipoles, a solvation model, or the isolated molecule in vacuum, the latter two with or without explicit perturbation by the K+ countercation). The difference between the integrated interaction densities in the protein and crystal environments was measured to be smaller than 0.1 e over the entire anion 1, or 0.075 units of the RSR value when integrated over all bonds of 1. We conclude that the sentiment often expressed in the literature that the interaction density in crystals of the pure substance is a measure of the polarization of the same molecule in the enzyme active site is true for the analyzed compound. We also showed that the interaction density can be used to predict the features of the electrostatic recognition process.
Finally, we investigated whether the above conclusion means that experimental electron‐density studies are an ideal tool to get deeper insight into the enzyme‐ligand interaction. Certainly, with an error‐free experiment and an ideal model to refine the experimental structure factors, such studies would be a close simulation of the biological situation. However, the crystal field effect on the electron density is relatively small, that is, less than one out of 67 electrons of 1 are shifted due to polarization. Therefore, the interpretation of the experimental results and maps is difficult relative to the idealized theoretical models. The accuracy of the data and the (in‐)flexibility of the model have a huge influence on the result. The convolution of many physical effects in the experimental data that might be lacking in the theoretical ansatz/model does not allow to unambiguously pin down the crystal field effect in the difference densities. Certainly, for an effect such as the polarization in different environments we cannot trust the experimental data blindly but have to critically assess them, when, simultaneously, a computationally relatively cheap and simple method such as a COSMO solvation model already approximates the results quite closely.
Experimental and Computational Section
The synthesis and spectroscopic characterization of 1K⋅H2O has been reported previously. [19] Details of the synchrotron X‐ray diffraction experiment are given in Table 3. A high‐resolution measurement was carried out at beamline BL02B1 of SPring‐8 using a helium gas‐stream for cooling down to 25 K and a large curved image plate for detection. The data resolution was limited to d=0.45 Å after radiation‐damage problems were observed, which was corrected by using a scaling procedure in the software RapidAuto [44] (see more details in the Supporting Information). The crystal structure of 1K⋅H2O was solved using ShelxT [45] and refined using olex2.refine and Tonto [46] within the NoSpherA2 framework. [47] HAR was performed on a level of theory of B3LYP/def2‐TZVP using a radius of cluster charges and dipoles of 20 Å. Anisotropic displacement parameters of hydrogen atoms of 1 were determined using the SHADE2.1 webservice, [31] and then iteratively updated in Shade in‐between HAR cycles, whereas the hydrogen atoms of the water molecule were refined with isotropic displacement parameters. Deposition number 2024395 contains the supplementary crystallographic data for this paper (crystal structure of 1K⋅H2O, the conditions and the parameters used in the HAR as well as the structure factor magnitudes). These data are provided free of charge by the joint Cambridge Crystallographic Data Centre and Fachinformationszentrum Karlsruhe Access Structures service www.ccdc.cam.ac.uk/structures.
Table 3.
|
HAR |
XCW |
---|---|---|
empirical formula |
C4H6KNO5 |
|
formula weight [g mol−1] |
187.20 |
|
cell setting, space group |
trigonal, P312 (No. 152) |
|
Z |
6 |
|
T [K] |
25 |
|
unit cell dimensions |
|
|
a [Å] |
8.14360(10) |
|
c [Å] |
19.1623(3) |
|
V [Å3] |
1100.55(3) |
|
F(000) |
576 |
|
crystal size [mm3] |
0.15×0.11×0.10 |
|
crystal form, color |
block, colorless |
|
wavelength λ [Å] |
0.3532 |
|
absorption correction |
multi‐scan |
|
absorption coefficient μ [mm−1] |
0.115 |
|
T min/T max |
0.884/1.206 |
|
max. θ (°) |
23.103 |
|
(sin θ/λ)max [Å−1] |
1.11094 |
|
measured, independent and observed reflns |
62 635, 8415, 8362 |
|
completeness |
100 % |
|
redundancy |
7.474 |
|
weighting scheme[a] w= |
(σ 2+(0.048 P)2+0.162 P)−1 |
σ −2 |
R int |
0.0327 |
–[b] |
number of parameters |
121 |
– |
R 1(F) |
0.0266 |
0.0200 |
R w(F) |
0.0757 |
0.0208 |
R all(F) |
0.0267 |
–[c] |
N ref/N v |
69.1 |
– |
Δρ max, Δρmin [e Å−3] |
1.371/−0.378 |
0.756/−0.216 |
Flack parameter |
0.13(6) |
–[b] |
CCDC deposition no. |
2024395 |
– |
[a] With . [b] Merged data. [c] Not reported by Tonto.
The final wavefunction calculation after the last structural least‐squares refinement in HAR serves as the model CC since it is a single‐point calculation under the influence of a surrounding cluster of self‐consistent Hirshfeld point charges and dipoles. Subsequent XCW fitting was performed using a level of theory of B3LYP/def2‐TZVP without cluster charges to include an estimate of the electron correlation effect into the wavefunction ansatz whereas the crystal environmental effect is fitted based on the experimental data. The resulting wavefunction serves as the model X (or XWR/XCW).
The final crystal structure was used to build a supercluster according to the symmetry of the crystal to perform QM/MM calculations in the crystal system (C), treating one anion 1 in the center of the cluster quantum‐mechanically and the proximity using molecular mechanics based on a CHARMM force field. Parameters describing 1 were obtained from the SwissParam webservice. [48] QM/MM structure optimizations were performed using NAMD2 and Turbomole in all cases during this study, interfaced using ChemShell.[ 49 , 50 , 51 , 52 , 53 ] The level of theory used for the QM part was B3LYP/def‐TZVP. Previous studies were able to show that the density is sufficiently well described using triple zeta basis sets, and the extent of differences using different levels of theory in QM/MM are documented.[ 38 , 39 , 40 ]
For a model of the protein environment, anion 1 was placed into the active site of cathepsin B, based on a protein crystal structure of the E64c—cathepsin B complex [17] already modified as MD input including all hydrogen atoms in ref. [13a]. The resulting geometry was then equilibrated and simulated with molecular dynamics (MD) at 310 K using a CHARMM force field for protein, water and salt ions.[ 54 , 55 , 56 ] The geometries at four points during the simulation between 4 and 5 ns (250 ps apart) were taken to perform QM/MM geometry optimizations (NAMD, Turbomole, ChemShell), optimizing the geometry of 1 quantum‐mechanically on a level of theory of B3LYP/def‐TZVP, and optimizing the residues in proximity of 1 with at least one atom in a range of 3 Å with molecular mechanics. Since four slightly different geometries were the result of these calculations, the calculation of grids was performed on all wavefunctions and the arithmetic average of the geometries and the grid values were calculated for the comparisons between the different environments. All plots labelled as model P refer to these averaged grids.
Optimized geometries in solvation (S) and in vacuo (commonly called gas‐phase model, G) were obtained by ab‐initio optimizations using the level of theory B3LYP/def‐TZVP in Turbomole. [51] For model S, a COSMO solvation model of water was used. These calculations were carried out with anionic molecule 1, as well as with a potassium counter‐cation, to give an interaction partner as a first model of interaction with the environment.
The calculation of grid files of the interaction density and interaction ESP is simple, however, the setup of the boxes of the grid files for bond‐wise comparison between different geometries is difficult. We programmed an in‐house software (cuQCT, author: Florian Kleemiss) for the calculation and manipulation of grid files in the Gaussian cube format. Bond‐centered grid files are generated by first finding the midpoint between the two positions of selected atoms. The unit vector connecting them will provide the vector of the box. A third atom is used to form a plane and to find the vector in this plane that is perpendicular to . This way, the second vector for the grid is found, which will be called . Both vectors are normalized to have a length of 1 bohr. The last vector for the calculation of the grid is found by the calculation of the vector product of and The midpoint of the first two selected atoms will then be used as the central point of the grid. By choosing a multiplicator for the distance between atoms 1 and 2 in all three dimensions of the grid, the atoms in different grid files will always be located at the same position of the grids. The origin of the grid is then found by moving into the negative directions of , and for half the corresponding length of the complete grid size from the central point. Then the calculation with a fixed number of grid points scales the grid vectors and allows comparability with grids in different settings or with different molecular geometry without interpolations.
To have a quantitative measure of similarity between two ED or ESP grids, a real space R value (RSR or RRS) was used. The definition of the RSR was taken from the literature. [57] For the ED, it is defined as Equation 1:
(1) |
This RSR gives insight in the total difference in ED relative to the total ED of the compound. It is likewise defined for the ESP. A value of 1 corresponds to a total shift of the complete property, while 0 corresponds to perfect agreement. As an alternative, the weighted Jaccard or Soergel distance, [58] as a variation of the Jaccard distance for binary groups, [59] can be defined for the ED differences as Equation 2:
(2) |
where the sum runs over all grid points of the density.
The number of electrons shifted between two ED distributions is given by the sum over the whole grid with grid point indices i, j, k. The local density difference is multiplied with the voxel size (which is the triple product of the grid vectors) and divided by 2 since local depletion of electrons will accumulate in a different place [Eq. 3]:
(3) |
The wavefunctions for the models CC and X of the crystal system required the explicit inclusion of two potassium ions and a water molecule in the asymmetric units of the crystal structure dictated by the crystallographic symmetry. This is necessary to describe the cluster of charges correctly in the case of Hirshfeld point charges and to calculate correct structure factors in the case of XCW. However, for an optimum comparability with respect to the QM/MM calculations, the calculation of ESP and ED grid files for the models CC and X required a routine to calculate these grids without explicit, only implicit, contributions of the potassium ions and the atoms in water. In a new feature of cuQCT, these atoms were ignored for the calculation of ESP and ED by skipping all basis functions that are associated to these atoms while calculating the value of a molecular orbital at a point in space for the ED or during the integration of the density for the ESP.
Conflict of interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
S.G. acknowledges funding of the German Research Foundation (DFG) via the Emmy–Noether project GR 4451/1‐1 and of the Australian Research Council (ARC) via Discovery Project DP110105 347. F.K. thanks the German Academic Scholarship Foundation (Studienstiftung des Deutschen Volkes) for a scholarship. The synchrotron measurement at SPring‐8 was carried out under proposal number 2013B1056. Open access funding enabled and organized by Projekt DEAL.
F. Kleemiss, E. K. Wieduwilt, E. Hupf, M. W. Shi, S. G. Stewart, D. Jayatilaka, M. J. Turner, K. Sugimoto, E. Nishibori, T. Schirmeister, T. C. Schmidt, B. Engels, S. Grabowsky, Chem. Eur. J. 2021, 27, 3407.
References
- 1.
- 1a. Böhm H.-J., Klebe G., Angew. Chem. Int. Ed. Engl. 1996, 35, 2588–2614; [Google Scholar]; Angew. Chem. 1996, 108, 2750–2778; [Google Scholar]
- 1b. Babine R. E., Bender S. L., Chem. Rev. 1997, 97, 1359–1472. [DOI] [PubMed] [Google Scholar]
- 2. Koshland D. E., Angew. Chem. Int. Ed. Engl. 1995, 33, 2375–2378. [Google Scholar]
- 3.
- 3a. Sarkhel S., Desiraju G. R., Proteins Struct. Funct. Bioinf. 2004, 54, 247–259; [DOI] [PubMed] [Google Scholar]
- 3b. Fersht A. R., Trends Biochem. Sci. 1987, 12, 301–304. [Google Scholar]
- 4. Shi M. W., Sobolev A. N., Schirmeister T., Engels B., Schmidt T. C., Luger P., Mebs S., Dittrich B., Chen Y.-S., Bąk J. M., Jayatilaka D., Bond C. S., Turner M. J., Stewart S. G., Spackman M. A., Grabowsky S., New J. Chem. 2015, 39, 1628–1633. [Google Scholar]
- 5. Pascard C., Acta Crystallogr. Sect. D 1995, 51, 407–417. [DOI] [PubMed] [Google Scholar]
- 6.
- 6a. Velec H. F., Gohlke H., Klebe G., J. Med. Chem. 2005, 48, 6296–6303; [DOI] [PubMed] [Google Scholar]
- 6b. Bruno I. J., Cole J. C., Jos P. M., Lommerse R. S., Taylor R., Verdonk M. L. J., Computer-Aided Mol. Des. 1997, 11, 525–537. [DOI] [PubMed] [Google Scholar]
- 7.
- 7a. Klebe G., J. Mol. Biol. 1994, 237, 212–235; [DOI] [PubMed] [Google Scholar]
- 7b. Böhm M., Klebe G., J. Med. Chem. 2002, 45, 1585–1597. [DOI] [PubMed] [Google Scholar]
- 8.
- 8a. Verdonk M. L., Cole J. C., Watson P., Gillet V., Willett P., J. Mol. Biol. 2001, 307, 841–859; [DOI] [PubMed] [Google Scholar]
- 8b. Boer D. R., Kroon J., Cole J. C., Smith B., Verdonk M. L., J. Mol. Biol. 2001, 312, 275–287. [DOI] [PubMed] [Google Scholar]
- 9.
- 9a. Náray-Szabó G., J. Mol. Graphics 1989, 7, 76–81; [DOI] [PubMed] [Google Scholar]
- 9b. Kangas E., Tidor B., J. Phys. Chem. B 2001, 105, 880–888; [Google Scholar]
- 9c. Muzet N., Guillot B., Jelsch C., Howard E., Lecomte C., Proc. Natl. Acad. Sci. USA 2003, 100, 8742–8747; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9d. Mebs S., Lüth A., Luger P., Bioorg. Med. Chem. 2010, 18, 5965–5974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.
- 10a. Grabowsky S., Jayatilaka D., Fink R. F., Schirmeister T., Engels B., Z. Anorg. Allg. Chem. 2013, 639, 1905–1921; [Google Scholar]
- 10b. Leduc T., Aubert E., Espinosa E., Jelsch C., Iordache C., Guíllot B., J. Phys. Chem. A 2019, 123, 7156–7170. [DOI] [PubMed] [Google Scholar]
- 11. Luger P., Org. Biomol. Chem. 2007, 5, 2529–2540. [DOI] [PubMed] [Google Scholar]
- 12.
- 12a. Hibbs D. E., Austin-Woods C. J., Platts J. A., Overgaard J., Turner P., Chem. Eur. J. 2003, 9, 1075–1084; [DOI] [PubMed] [Google Scholar]
- 12b. Grabowsky S., Pfeuffer T., Chęcińska L., Weber M., Morgenroth W., Luger P., Schirmeister T., Eur. J. Org. Chem. 2007, 2759–2768; [Google Scholar]
- 12c. Ghermani N. E., Spasojević-de Biré A., Bouhmaida N., Ouharzoune S., Bouligand J., Layre A., Gref R., Couvreur P., Pharm. Res. 2004, 21, 598–607; [DOI] [PubMed] [Google Scholar]
- 12d. Wagner A., Flaig R., Dittrich B., Schmidt H., Koritsánszky T. S., Luger P., Chem. Eur. J. 2004, 10, 2977–2982; [DOI] [PubMed] [Google Scholar]
- 12e. Dittrich B., Koritsánszky T. S., Volkov A., Mebs S., Luger P., Angew. Chem. Int. Ed. 2007, 46, 2935–2938; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2007, 119, 2993–2996; [Google Scholar]
- 12f. Parrish D., Zhurova E. A., Kirschbaum K., Pinkerton A. A., J. Phys. Chem. B 2006, 110, 26442–26447; [DOI] [PubMed] [Google Scholar]
- 12g. Zhurova E. A., Matta C. F., Wu N., Zhurov V. V., Pinkerton A. A., J. Am. Chem. Soc. 2006, 128, 8849–8861; [DOI] [PubMed] [Google Scholar]
- 12h. Yearley E. J., Zhurova E. A., Zhurov V. V., Pinkerton A. A., J. Am. Chem. Soc. 2007, 129, 15013–15021; [DOI] [PubMed] [Google Scholar]
- 12i. Destro R., Soave R., Barzaghi M., Lo Presti L., Chem. Eur. J. 2005, 11, 4621—4634; [DOI] [PubMed] [Google Scholar]
- 12j. Overgaard J., Turel I., Hibbs D. E., Dalton Trans. 2007, 2171–2178; [DOI] [PubMed] [Google Scholar]
- 12k. Hibbs D. E., Overgaard J., Howard S. T., Nguyen T. H., Org. Biomol. Chem. 2005, 3, 441–447. [DOI] [PubMed] [Google Scholar]
- 13.
- 13a. Mladenovic M., Arnone M., Fink R. F., Engels B., J. Phys. Chem. B 2009, 113, 5072–5082; [DOI] [PubMed] [Google Scholar]
- 13b. Engels B., Schmidt T. C., Gatti C., Schirmeister T., Fink R. F. in Electron Density and Chemical Bonding II, Springer, Berlin, 2011, pp. 47–97. [Google Scholar]
- 14. Hanada K., Tamai M., Yamagishi M., Ohmura S., Sawada J., Tanaka I., Agric. Biol. Chem. 1978, 42, 523–528. [Google Scholar]
- 15.
- 15a. Otto H.-H., Schirmeister T., Chem. Rev. 1997, 97, 133–171; [DOI] [PubMed] [Google Scholar]
- 15b. Powers J. C., Asgian J. L., Ekici Ö. D., James K. E., Chem. Rev. 2002, 102, 4639–4750. [DOI] [PubMed] [Google Scholar]
- 16.
- 16a. Mladenovic M., Junold K., Fink R. F., Thiel W., Schirmeister T., Engels B., J. Phys. Chem. B. 2008, 112, 5458–5469; [DOI] [PubMed] [Google Scholar]
- 16b. Mladenovic M., Ansorg K., Fink R. F., Thiel W., Schirmeister T., Engels B., J. Phys. Chem. B 2008, 112, 11798–11808. [DOI] [PubMed] [Google Scholar]
- 17. Yamamoto A., Tomoo K., Matsugi K., Hara T., In Y., Murata M., Kitamura K., Ishida T., Biochim. Biophys. Acta 2002, 1597, 244–251. [DOI] [PubMed] [Google Scholar]
- 18.
- 18a. Grabowsky S., Pfeuffer T., Morgenroth W., Paulmann C., Schirmeister T., Luger P., Org. Biomol. Chem. 2008, 6, 2295–2307; [DOI] [PubMed] [Google Scholar]
- 18b. Grabowsky S., Schirmeister T., Paulmann C., Pfeuffer T., Luger P., J. Org. Chem. 2011, 76, 1305–1318. [DOI] [PubMed] [Google Scholar]
- 19. Shi M. W., Stewart S. G., Sobolev A. N., Dittrich B., Schirmeister T., Luger P., Hesse M., Chen Y.-S., Spackman P. R., Spackman M. A., Grabowsky S., J. Phys. Org. Chem. 2017, 30, e3683. [Google Scholar]
- 20.
- 20a. Grabowsky S., Luger P., Buschmann J., Schneider T., Schirmeister T., Sobolev A. N., Jayatilaka D., Angew. Chem. Int. Ed. 2012, 51, 6776–6779; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2012, 124, 6880–6884; [Google Scholar]
- 20b. Chęcińska L., Morgenroth W., Paulmann C., Jayatilaka D., Dittrich B., Cryst. Eng. Comm. 2013, 15, 2084–2090. [Google Scholar]
- 21.
- 21a. Jayatilaka D., Dittrich B., Acta Crystallogr. Sect. A 2008, 64, 383–393; [DOI] [PubMed] [Google Scholar]
- 21b. Capelli S. C., Bürgi H.-B., Dittrich B., Grabowsky S., Jayatilaka D., IUCrJ 2014, 1, 361–379; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21c. Woińska M., Grabowsky S., Dominiak P. M., Woźniak K., Jayatilaka D., Sci. Adv. 2016, 2, e1600192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.
- 22a. Jayatilaka D., Phys. Rev. Lett. 1998, 80, 798–801; [Google Scholar]
- 22b. Jayatilaka D., Grimwood D. J., Acta Crystallogr. Sect. A 2001, 57, 76–86; [DOI] [PubMed] [Google Scholar]
- 22c. Grimwood D. J., Jayatilaka D., Acta Crystallogr. Sect. A 2001, 57, 87–100. [DOI] [PubMed] [Google Scholar]
- 23. Grabowsky S., Genoni A., Bürgi H.-B., Chem. Sci. 2017, 8, 4159–4176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Dittrich B., Matta C. F., IUCrJ 2014, 1, 457–469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.
- 25a. Krijn M., Graafsma H., Feil D., Acta Crystallogr. Sect. B 1988, 44, 609–616; [Google Scholar]
- 25b. Spackman M. A., Byrom P. G., Alfredsson M., Hermansson K., Acta Crystallogr. Sect. A 1999, 55, 30–47; [DOI] [PubMed] [Google Scholar]
- 25c. de Vries R. Y., Feil D., Tsirelson V. G., Acta Crystallogr. Sect. B 2000, 56, 118–123; [DOI] [PubMed] [Google Scholar]
- 25d. Dittrich B., Spackman M. A., Acta Crystallogr. Sect. A 2007, 63, 426–436. [DOI] [PubMed] [Google Scholar]
- 26. Coppens P., Volkov A., Acta Crystallogr. Sect. A 2004, 60, 357–364. [DOI] [PubMed] [Google Scholar]
- 27.
- 27a. Dovesi R., Causa’ M., Orlando R., Roetti C., Saunders V. R., J. Chem. Phys. 1990, 92, 7402–7411; [Google Scholar]
- 27b. Feil D., J. Mol. Struct. 1990, 237, 33–46; [Google Scholar]
- 27c. Dittrich B., Sze E., Holstein J. J., Hübschle C. B., Jayatilaka D., Acta Crystallogr. Sect. A 2012, 68, 435–442. [Google Scholar]
- 28. Ernst M., Genoni A., Macchi P., J. Mol. Struct. 2020, 1209, 127975. [Google Scholar]
- 29.
- 29a. Espinosa E., Lecomte C., Ghermani N. E., Devémy J., Rohmer M. M., Bénard M., Molins E., J. Am. Chem. Soc. 1996, 118, 2501–2502; [Google Scholar]
- 29b. Mata I., Molins E., Alkorta I., Espinosa E., J. Phys. Chem. A 2007, 111, 6425–6433; [DOI] [PubMed] [Google Scholar]
- 29c. Matta C. F., J. Comput. Chem. 2014, 35, 1165–1198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Javier Luque F., Orozco M., J. Comput. Chem. 1998, 19, 866–881. [DOI] [PubMed] [Google Scholar]
- 31. Madsen A., J. Appl. Crystallogr. 2006, 39, 757–758. [Google Scholar]
- 32. Dolomanov O. V., Bourhis L. J., Gildea R. J., Howard J. A. K., Puschmann H., J. Appl. Crystallogr. 2009, 42, 339–341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Bučinský L., Jayatilaka D., Grabowsky S., Acta Crystallogr. Sect. A 2019, 75, 705–717. [DOI] [PubMed] [Google Scholar]
- 34. Genoni A., Dos Santos L. H. R., Meyer B., Macchi P., IUCrJ 2017, 4, 136–146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Hansen N. K., Coppens P., Acta Crystallogr. Sect. A 1978, 34, 909–921. [Google Scholar]
- 36. Humphrey W., Dalke A., Schulten K., J. Mol. Graphics 1996, 14, 33–38. [DOI] [PubMed] [Google Scholar]
- 37. Wolf H., Jørgensen M. R. V., Chen Y.-S., Herbst-Irmer R., Stalke D., Acta Crystallogr. Sect. B 2015, 71, 10–19. [DOI] [PubMed] [Google Scholar]
- 38. Kocher N., Henn J., Gostevskii B., Kost D., Kalikhman I., Engels B., Stalke D., J. Am. Chem. Soc. 2004, 126, 5563–5568. [DOI] [PubMed] [Google Scholar]
- 39. Leusser D., Henn J., Kocher N., Engels B., Stalke D., J. Am. Chem. Soc. 2004, 126, 1781–1793. [DOI] [PubMed] [Google Scholar]
- 40. Henn J., Ilge D., Leusser D., Stalke D., Engels B., J. Phys. Chem. 2004, 108, 9442–9452. [Google Scholar]
- 41. Lu T., Chen F., J. Comput. Chem. 2012, 33, 580–592. [DOI] [PubMed] [Google Scholar]
- 42. Woińska M., Jayatilaka D., Dittrich B., Flaig R., Luger P., Woźniak K., Dominiak P. M., Grabowsky S., ChemPhysChem 2017, 18, 3334–3351. [DOI] [PubMed] [Google Scholar]
- 43. Bader R. F. W., Atoms in Molecules—A Quantum Theory, Oxford University Press, Oxford, 1990. [Google Scholar]
- 44.Rigaku (2004). RAPID-AUTO. Rigaku Corporation, Tokyo, Japan.
- 45. Sheldrick G. M., Acta Crystallogr. Sect. A 2015, 71, 3–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.D. Jayatilaka, D. J. Grimwood, Computational Sciece—ICCS 2003, 142–151. (https://github.com/dylan-jayatilaka/tonto.git), version d26b49d.
- 47. Kleemiss F., Dolomanov O. V., Bodensteiner M., Peyerimhoff N., Midgley L., Bourhis L. J., Genoni A., Malaspina L. A., Jayatilaka D., Spencer J. L., White F., Grundkötter-Stock B., Steinhauer S., Lentz D., Puschmann H., Grabowsky S., Chem. Sci. 2021. 10.1039/D0SC05526C. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zoete V., Cuendet M. A., Grosdidier A., Michielin O., J. Comput. Chem. 2011, 32, 2359–2368. [DOI] [PubMed] [Google Scholar]
- 49. Kalé L., Skeel R., Bhandarkar M., Brunner R., Gursoy A., Krawetz N., Phillips J., Shinozaki A., Varadarajan K., Schulten K., J. Comput. Phys. 1999, 151, 283–312. [Google Scholar]
- 50. Phillips J. C., Braun R., Wang W., Gumbart J., Tajkhorshid E., Villa E., Chipot C., Skeel R. D., Kale L., Schulten K., J. Comput. Chem. 2005, 26, 1781–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Ahlrichs R., Baer M., Haeser M., Horn H., Koelmel C., Chem. Phys. Lett. 1989, 162, 165–169. (Used version of Turbomole: 7.0.1). [Google Scholar]
- 52. Sherwood P., de Vries A. H., Guest M. F., Schreckenbach G., Catlow C. R. A., French S. A., Sokol A. A., Bromley S. T., Thiel W., Turner A. J., Billeter S., Terstegen F., Thiel S., Kendrick J., Rogers S. C., Casci J., Watson M., King F., Karlsen E., Sjøvoll M., Fahmi A., Schäfer A., Lennartz C., J. Mol. Struct. 2003, 632, 1–28. [Google Scholar]
- 53. Kästner J., Carr J. M., Keal T. W., Thiel W., Wander A., Sherwood P., J. Phys. Chem. A 2009, 113, 11856. [DOI] [PubMed] [Google Scholar]
- 54. MacKerell A. D., Feig M., Brooks C. L., J. Comput. Chem. 2004, 25, 1400–1415. [DOI] [PubMed] [Google Scholar]
- 55. A. D. MacKerell Jr , Bashford D., Bellott M., R. L. Dunbrack Jr , Evanseck J. D., Field M. J., Fischer S., Gao J., Guo H., Ha S., J. Phys. Chem. B 1998, 102, 3586–3616. [DOI] [PubMed] [Google Scholar]
- 56. Best R. B., Zhu X., Shim J., Lopes P. E. M., Mittal J., Feig M., MacKerell A. D., J. Chem. Theory Comput. 2012, 8, 3257–3273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Jones T. A., Zou J.-Y., Cowan S. T., Kjeldgaard M., Acta Crystallogr. Sect. A 1991, 47, 110–119. [DOI] [PubMed] [Google Scholar]
- 58.“Similarity, Dissimilarity, and Distance Measure”: Gower J. C., in Encyclopedia of Biostatistics, Wiley, Hoboken, 2005. [Google Scholar]
- 59. Jaccard P., New Phytol. 1912, 11, 37–50. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.