Abstract
Carbonyl-carbonyl n→π* interactions where a lone pair (n) of the oxygen atom of a carbonyl group is delocalized over the π* orbital of a nearby carbonyl group have attracted a lot of attention in recent years due to their ability to affect the 3D structure of small molecules, polyesters, peptides, and proteins. In this paper, we report the discovery of a “reciprocal” carbonyl-carbonyl interaction with substantial back and forth n→π* and π→π* electron delocalization between neighboring carbonyl groups. We have carried out experimental studies, analyses of crystallographic databases and theoretical calculations to show the presence of this interaction in both small molecules and proteins. In proteins, these interactions are primarily found in polyproline II (PPII) helices. As PPII are the most abundant secondary structures in unfolded proteins, we propose that these local interactions may have implications in protein folding.
Carbonyl-carbonyl π* non covalent interactions affect the structure and stability of small molecules and proteins. Here, the authors carry out experimental studies, analyses of crystallographic databases and theoretical calculations to describe an additional type of carbonyl-carbonyl interaction.
Introduction
Nature effectively uses combinations of weak noncovalent interactions in the functional forms of various biologically important molecules such as nucleic acids and proteins1–3. Intermolecular noncovalent interactions of varying magnitude are also responsible for the existence of different states of matter4. Carbonyl-carbonyl (C═O···C═O) n→π* interactions where one of the lone pairs (n) on the oxygen atom of a carbonyl group is delocalized over the antibonding π* orbital of a nearby carbonyl C═O bond (π*C═O) along the Bürgi-Dunitz trajectory5 (∠O···C ═ O ~ 109°) have attracted a great deal of attention in recent years6–11. Previous studies have shown that C═O···C═O n→π* interactions not only influence geometries of important small molecules12–15 but also play crucial roles in determining the three dimensional structures of polyesters16, peptides17, peptoids18–21 and proteins22–25. C═O···C═O interactions between the side-chain and backbone carbonyl groups of Asp, Asn, Glu, and Gln were also observed in the high-resolution crystal structures of proteins26, 27. C═O···C═O n→π* interaction is characterized by a short O···C═O distance (d) of less than 3.22 Å [the sum of van der Waals radii of carbon and oxygen atom28], bond angle ∠O···C═O (θ) of ~109° and the pyramidality (Δ, Θ) of the acceptor carbon atom towards the donor oxygen atom9, 14, 17, 25, 29. Direct spectroscopic evidence for n→π* interaction was recently reported by using gas-phase infrared spectroscopy30.
We anticipated that due to n→π* interaction both donor and acceptor C═O bonds will be polarized, which will make the acceptor carbonyl oxygen atom a better electron donor and the donor carbonyl carbon atom a better electron acceptor. The acceptor carbonyl oxygen, therefore, can donate electrons to another nearby carbonyl carbon either to form a sequential chain of O···C contacts (Fig. 1a) or it can donate electrons back to the original donor carbonyl carbon atom forming “reciprocal” n→π* interactions (Fig. 1b). Although, the sequential n→π* interactions were previously observed in poly(lactic acid)16 and proteins22, reciprocal n→π* interactions remained unexplored. Allen and coworkers reported anti-parallel arrangements of carbonyl groups in ketone dimers that were bound together by two intermolecular C═O···C═O short contacts of dipolar nature31. Maccallum et al reported a similar geometrical arrangement of carbonyl groups in right-twisted β-strands and observed two chemically distinct dipolar C═O···C═O short contacts32. However, these C═O···C═O short contacts were considerably longer than the sum of van der Waals radii of C and O atoms.
In this paper, we hypothesized that the polarization of the carbonyl groups by n→π* interactions should lead to back and forth donations between the carbonyl pairs. Based on our hypothesis, we discovered the presence of “reciprocal C═O···C═O interactions” both in small molecules and proteins. To establish the existence of reciprocal C═O···C═O interactions, we designed and synthesized model compounds and carried out X-ray crystallographic and theoretical studies. Further, we carried out Cambridge Structural Database (CSD)33 and Protein Data Bank (PDB)34 analyses to show that these interactions are widely present in small molecules and proteins. In proteins, these interactions are primarily found in random coils and turn regions. Based on our observations we propose that reciprocal C═O···C═O interactions may be a key local interaction that restricts the number of conformers of unfolded proteins and may have a role in protein folding.
Results
Reciprocal carbonyl-carbonyl interactions in N,N′-diacylhydrazines
To test our hypothesis of reciprocal n→π* interactions, we have synthesized N,Nʹ-diacylhydrazines 1-8 having various substituents on either side of the carbonyl groups (Fig. 2a). In N,Nʹ-diacylhydrazines 1-8, the two amide carbonyl groups [CO-I and CO-II; Fig. 2b] are separated by three covalent bonds and 1,5-type n→π* interactions are feasible from both sides. We propose that due to the repulsion between the nitrogen lone pairs, the N,Nʹ-diacylhydrazines should be nonplanar with the carbonyl groups orientated favorably for reciprocal n→π* interactions. Incorporation of electron donating and withdrawing substituents near the carbonyl groups in 1–8 should help us to tune these interactions.
As anticipated, the N,Nʹ-diacylhydrazines (1-8) crystallized in nonplanar form with the carbonyl groups oriented almost orthogonal to each other (C═O···C═O dihedral angle = +70° to +85° or −70° to −85°) (Supplementary Table 1 and Supplementary Fig. 1). We observed that in compounds 1–3, that lack any strong electron donating or withdrawing substituent near the carbonyl groups (i.e., X = CH3, CH3CH2; Y = H, CH3), the two carbonyl groups stay far apart. The crystallographic distances d1 and d2 (Fig. 2b) in 1–3 are longer than 3.22 Å (Table 1) and natural bond orbital (NBO)35 calculations carried out on the high-resolution crystal geometries of 1–3 (Supplementary Table 2) show no evidence of n→π* interaction (Table 1 and Supplementary Table 3). In compound 4, when X is an electron withdrawing group (e.g., X = CH2Cl), an increase in the acceptor ability of the carbonyl CO-II is expected, which should increase n→π* interaction from the oxygen atom of CO-I to the π*C═O orbital of CO-II. However, the inductive electron withdrawal of Cl can be negated by the electron donation from the Cl lone pairs into the antibonding orbitals (σ* and π*) of the adjacent carbonyl group (CO-II). In compound 4, we observed such electron delocalizations from the Cl lone pairs to both σ* and π* orbitals of the C=O bonds of CO-II group, which contributed 0.67 kcal mol−1 to the stabilization (Supplementary Table 4). Such electron donation should enhance the donor ability of CO-II in 4. A short crystallographic O2···C1 distance (d2 = 3.037 Å is shorter than the sum of van der Waals radii of C and O) and presence of NBO second order perturbation energy [E 2 (n→π*) = 0.34 kcal mol−1] for n→π* interaction from CO-II to CO-I in 4 supports this assumption (Table 1). We anticipate that due to this electron donation from CO-II to CO-I, the CO-I group in 4 will be polarized and the carbonyl oxygen of CO-I will become a better donor. We clearly observed back donation of electrons from CO-I to CO-II in 4 as evidenced by a short crystallographic O1···C2 distance d1 of 3.103 Å and n→π* interaction energy [E 1 (n→π*)] of 0.10 kcal mol−1 obtained by NBO analysis. This is in accordance with our hypothesis that donation from CO-II to CO-I increases the donor ability of CO-I and acceptor ability of CO-II thereby inducing a back donation of electrons from CO-I to CO-II. Similarly, when Y is an electron donating group (e.g., Y = OCH3), CO-I is expected to be a better donor and, accordingly, we observed short distance d1 in compound 5 (Table 1). We also observed back donation from CO-II to CO-I in 5 (Table 1).
Table 1.
Compounds | Y | X | d1 | d2 | θ 1 | θ 2 | ∆1 | ∆2 | Θ1 | Θ2 | n→π* (kcal mol−1) | Total n→π* (kcal mol−1) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
(Å) | (Å) | (°) | (°) | (Å) | (Å) | (°) | (°) | E 1 (n→π*) | E 2 (n→π*) | E t (n→π*) | |||
1 | H | CH3 | 3.487 | 3.542 | 98.7 | 95.9 | 0.018 | 0.007 | 2.39 | 0.99 | NP | NP | NP |
2 | H | CH2CH3 | 3.314 | 3.331 | 93.7 | 93.2 | 0.013 | 0.026 | 1.70 | 3.43 | 0.03 | 0.03 | 0.06 |
3 | CH3 | CH3 | 3.367 | 3.365 | 92.8 | 92.7 | 0.008 | 0.015 | 0.99 | 1.89 | 0.01 | 0.02 | 0.03 |
4 | CH3 | CH2Cl | 3.103 | 3.037 | 82.8 | 85.6 | −0.015 | 0.000 | −1.96 | 0.012 | 0.10 | 0.34 | 0.44 |
5 | OCH3 | CH3 | 3.233 | 3.152 | 81.8 | 85.4 | 0.005 | −0.005 | 0.63 | −0.69 | 0.03 | 0.16 | 0.19 |
6 | OCH3 | CH2Cl | 3.072 | 3.009 | 82.1 | 84.9 | −0.005 | −0.019 | −0.70 | −2.59 | 0.14 | 0.42 | 0.56 |
7 | Br | CH2Cl | 3.123 | 3.014 | 81.3 | 86.2 | −0.019 | −0.009 | −2.53 | −1.17 | 0.08 | 0.39 | 0.47 |
8 | OCH3 | CH2Br | 3.082 | 3.017 | 81.9 | 84.8 | −0.014 | −0.017 | −1.92 | −2.34 | 0.14 | 0.40 | 0.54 |
The calculations were carried out at B3LYP/6-311 + G(2d,p) level of theory. d1 = O1···C2; d2 = O2···C1; θ 1 = ∠O1···C2 = O2; θ 2 = ∠O2···C1 = O1 (see Fig. 2b). E 1 (n→π*) is the NBO second order perturbation energy for electron donation from oxygen lone pair (n O) of the first carbonyl group (CO-I) to the π*C=O orbital of the second carbonyl group (CO-II). E 2 (n→π*) is the NBO second order perturbation energy for electron donation from oxygen lone pair (n O) of the second carbonyl group (CO-II) to the π*C=O orbital of the first carbonyl group (CO-I). E t (n→π*) = E 1 (n→π*) + E 2 (n→π*)
NP not present
Among the synthetic compounds 1–8, significantly shorter d1 and d2 are observed in 6–8 where electron donating or withdrawing groups are present on both sides of the carbonyl groups. The second order perturbation energies obtained by NBO calculations are also relatively higher for these compounds (Table 1). The NBO orbital overlaps between the oxygen lone pairs (n O) and π*C═O orbitals in compound 6 are shown in Figs. 2c, d. Note that in compounds 4 and 6–8 where X = CH2Cl or CH2Br, d2 is shorter than d1 and stronger n→π* interactions from CO-II to CO-I are observed by NBO analysis. This is due to the electron donation from the Cl or Br atom to the σ*C=O and π*C═O orbitals of CO-II, which increases the donor ability of the CO-II oxygen atom (Supplementary Table 4). Such electron donations from α-halogens to carbonyl groups and their effect on n→π* interactions were previously reported in the literature36, 37.
Interestingly, the values of ∠O···C = O angles θ 1 and θ 2 are much smaller (~82°) in compounds 4–8 where reciprocal n→π* interactions are observed than in 1–3 that lack n→π* interactions (Table 1). In fact, the values of θ 1 and θ 2 are much smaller than what is expected for one-sided n→π* interactions (∠O···C = O ~ 109°) reported previously6. This may be due to the geometrical arrangement required for reciprocal n→π* interactions, which forces θ 1 and θ 2 away from the Bürgi-Dunitz trajectory.
Another important signature of n→π* interactions is the pyramidality of the acceptor carbonyl carbon atom measured by parameters Δ and Θ9, 14, 17, 25, 29. Positive values of Δ and Θ indicate pyramidalization of the acceptor carbonyl carbon towards the donor oxygen atom whereas negative values of Δ and Θ indicate pyramidalization of the acceptor carbon away from the donor oxygen atom. In compounds 1–8, however, we have not observed a correlation of pyramidality (Θ) with O···C distance and the strength of n→π* interactions. One reason for this could be the stronger donation from the α-halogen atoms to the nearby carbonyl, which would force the acceptor carbonyl carbons towards the halogen atoms away from the donor oxygen atoms. Also, the crystal packing forces may have some influence in the observed geometries and the pyramidalization of the two nitrogen atoms between the carbonyl groups may influence the pyramidalization of the acceptor carbonyl carbons. Moreover, the individual n→π* interactions in compounds 1–8 may not be strong enough to exert a significant effect on pyramidalization of the carbonyl carbons.
Overall, these data suggest that, in compounds 1–8, the geometrical constraints imposed by the repulsion between the nitrogen lone pairs orient the two carbonyl groups favorably for reciprocal n→π* interactions. We could tune these interactions by introducing electron donating or withdrawing substituents near the carbonyl groups. Interestingly, we observed that an increase in n→π* interaction from one side also leads to an increase in the n→π* interaction from the other side in compounds 1–8. This correlation suggests that n→π* interactions in these compounds could be synergistic (Figs. 2e, f). For example, shorter d1 and higher E 1 (n→π*) values are observed in 4 compared to 3 although 3 and 4 have same the substituent (4–CH3–Ph) attached to CO-I. Similarly, higher donation from CO-I to CO-II is observed in 6 compared to 5 although 5 and 6 have same the substituent (4–OCH3–Ph) attached to CO-I.
To find out if geometry optimization has any effect on the computed n→π* interactions in comparison to the unrelaxed X-ray geometries, we also carried out geometry optimizations in compounds 1–8 by freezing the dihedral angles of the side chains involved in reciprocal interactions to their X-ray values and freely optimizing the remaining degrees of freedom (bond lengths, angles, and dihedrals) (Supplementary Fig. 2). We observed that reciprocal n→π* interactions were retained after geometry optimizations but they became slightly weaker than what were observed from the NBO calculations on the crystal geometries (Supplementary Table 5). The coordinates of the optimized geometries of 1–8 are provided in Supplementary Data 1. We also observed that, during gas phase geometry optimization, in absence of any packing and intermolecular forces that are present in the X-ray geometries, the Cl or Br atoms attached to the methylene carbons in 4, 6–8 moved to an anti-periplanar geometry (trans) with respect to the oxygen atom of the nearby carbonyl group (CO-II). This is probably due to higher hyperconjugative delocalization between the halogen lone pairs and carbonyl π* orbital in the anti-periplanar geometry that would provide more stability to the isolated gas phase molecule. Note that such elongation of carbonyl-carbonyl (O···C) short contacts (weakening of n→π* interactions) in gas phase optimized geometry relative to the X-ray geometries are well known9, 13, 14.
Reciprocal carbonyl-carbonyl interactions in small organic molecules
To probe whether intramolecular reciprocal C═O···C═O interactions are also present in other small molecules we carried out a CSD search. In our search, we looked for organic molecules having at least two carbonyl groups with intramolecular O2···C5 (d1) and O6···C1 (d2) distances ≤ 3.2 Å (Supplementary Fig. 3). The search was carried out for cases where the two carbonyl groups are separated by at least three covalent bonds (1,5-type interaction). No restriction was imposed on the ∠O···C = O angles (θ 1 and θ 2) during the search. The CSD search provided 1432 molecules which fulfilled our search criteria (Supplementary Table 6).
The plots showing the distribution of O···C distances (d1 and d2) and ∠O···C = O angles (θ1 and θ2) of all the molecules obtained from the CSD search are shown in Figs. 3a, b, respectively. As can be seen from Fig. 3a, in most cases d1 and d2 fall in 2.90–3.20 Å range indicating that reciprocal interactions are in general weak. The values of θ 1 and θ 2 are mainly concentrated in the 70–100° range with majority of the molecules having θ 1 and θ 2 in the range 80–90°. Interestingly, we also observed similar values for O···C distances (d1 and d2) and ∠O···C = O angles (θ 1 and θ 2) in compounds 4–8 that showed reciprocal n→π* interactions. Therefore, it is quite clear that the ∠O···C = O (θ) angle deviates significantly from the Bürgi-Dunitz trajectory in reciprocal C═O···C═O short contacts. The d1 vs. θ 1 and d2 vs. θ 2 plots (Figs. 3c, d, respectively) show that when the angle of approach of donor oxygen atoms to the acceptor carbonyl C=O bonds deviates from Bürgi-Dunitz trajectory, the O···C distances (d1 and d2) increase, suggesting weakening of interactions. NBO analyses of crystal geometries of 30 randomly chosen molecules (Supplementary Fig. 4) having d1 and d2 ≤ 3.20 Å and covering the range of observed ∠O···C = O angles (θ) values (70–100°) showed the presence of reciprocal interactions in them (Table 2). The NBO orbital overlaps between the oxygen lone pairs (n O) and π*C═O orbitals in one such molecule (Fig. 3e) (CCDC ref. code: JUHQEK) are shown in Figs. 3f,g.
Table 2.
CCDC Ref. code | d1 | d2 | θ 1 | θ 2 | ∆1 | ∆2 | Θ1 | Θ2 | n→π* (kcal mol−1) | Total n→π* (kcal mol−1) | |
---|---|---|---|---|---|---|---|---|---|---|---|
(Å) | (Å) | (°) | (°) | (Å) | (Å) | (°) | (°) | E 1 (n→π*) | E 2 (n→π*) | E t (n →π*) | |
PHTHAC05 | 2.997 | 2.996 | 72.0 | 72.1 | 0.031 | 0.032 | 3.76 | 3.86 | 0.12 | 0.13 | 0.25 |
PODHUM | 3.068 | 3.062 | 72.3 | 72.6 | −0.007 | −0.025 | −0.88 | −2.96 | 0.07 | 0.04 | 0.11 |
GECYEU | 2.992 | 2.972 | 74.7 | 75.5 | 0.032 | 0.009 | 3.92 | 1.10 | 0.31 | 0.16 | 0.47 |
LEBRER | 2.861 | 2.856 | 77.3 | 77.6 | 0.006 | 0.021 | 0.67 | 2.45 | 0.77 | 0.60 | 1.37 |
KOXBIK | 2.959 | 3.073 | 79.0 | 73.8 | −0.012 | 0.015 | −1.43 | 1.80 | 0.32 | 0.12 | 0.44 |
CAJVIU | 3.006 | 2.987 | 77.6 | 78.5 | −0.004 | 0.023 | −0.43 | 2.68 | 0.41 | 0.12 | 0.53 |
AZULUD | 2.979 | 3.009 | 80.0 | 78.6 | −0.013 | 0.011 | −1.52 | 1.33 | 0.36 | 0.20 | 0.56 |
ZUKVUY | 2.887 | 2.829 | 81.5 | 84.2 | 0.006 | 0.027 | 0.74 | 3.24 | 0.86 | 0.34 | 1.20 |
GAPDIK | 3.164 | 3.107 | 82.6 | 85.7 | 0.011 | 0.014 | 1.26 | 1.73 | 0.06 | 0.23 | 0.29 |
LAGTIX | 3.137 | 3.119 | 82.7 | 83.0 | 0.008 | 0.001 | 0.94 | 0.06 | 0.18 | 0.06 | 0.24 |
SUDAXAS01 | 3.171 | 3.101 | 82.7 | 85.9 | −0.013 | 0.008 | −1.42 | 0.88 | 0.21 | 0.08 | 0.29 |
JUHQEK | 2.836 | 2.839 | 83.1 | 82.6 | 0.022 | 0.006 | 2.73 | 0.70 | 0.92 | 0.91 | 1.83 |
ACBZO01 | 3.042 | 3.016 | 83.4 | 84.6 | −0.008 | 0.001 | −0.90 | 0.11 | 0.21 | 0.27 | 0.48 |
WOCHIF | 2.885 | 2.832 | 83.1 | 85.3 | −0.009 | −0.011 | −1.13 | −1.28 | 0.55 | 0.85 | 1.40 |
MODYIO | 3.136 | 3.102 | 83.8 | 85.5 | 0.003 | 0.005 | 0.35 | 0.58 | 0.15 | 0.17 | 0.32 |
YEXQOH | 3.191 | 3.118 | 83.8 | 87.8 | −0.003 | 0.004 | −0.44 | 0.49 | 0.06 | 0.15 | 0.21 |
CIQNEW | 3.044 | 3.086 | 84.1 | 82.2 | 0.001 | 0.001 | 0.15 | 0.14 | 0.23 | 0.20 | 0.43 |
LUCHEY | 2.959 | 2.987 | 84.4 | 82.9 | −0.012 | 0.025 | −1.49 | 2.92 | 0.57 | 0.24 | 0.81 |
BECLAW | 2.816 | 2.894 | 85.3 | 81.3 | −0.054 | 0.005 | −6.53 | 0.62 | 1.14 | 0.56 | 1.70 |
DESPAT | 3.084 | 2.883 | 86.6 | 96.3 | 0.008 | 0.010 | 0.89 | 1.11 | 0.19 | 0.98 | 1.17 |
GIRQAA | 3.032 | 2.998 | 90.7 | 92.1 | −0.005 | 0.013 | −0.64 | 1.55 | 0.32 | 0.36 | 0.68 |
PUFBEZ | 3.132 | 3.107 | 93.9 | 94.9 | −0.036 | 0.002 | −4.46 | 0.26 | 0.08 | 0.20 | 0.28 |
JOSGIH | 2.994 | 2.993 | 95.0 | 95.1 | 0.027 | 0.027 | 3.39 | 3.31 | 0.26 | 0.30 | 0.56 |
IKAXII | 3.005 | 3.026 | 95.2 | 94.5 | 0.005 | −0.004 | 0.63 | −0.47 | 0.30 | 0.18 | 0.48 |
OPAKIA | 2.985 | 3.008 | 95.3 | 93.4 | 0.005 | −0.020 | 0.58 | −2.39 | 0.23 | 0.34 | 0.57 |
OMINII | 2.987 | 3.088 | 95.4 | 90.7 | 0.001 | 0.015 | 0.06 | 1.79 | 0.33 | 0.20 | 0.53 |
XACLUK | 2.956 | 2.909 | 96.7 | 99.2 | −0.009 | 0.00 | −1.17 | 0.03 | 0.30 | 0.38 | 0.68 |
EZELOK01 | 3.058 | 3.158 | 100.1 | 95.2 | −0.006 | 0.000 | −0.72 | −0.02 | 0.18 | 0.10 | 0.28 |
WIHKAB | 2.866 | 3.054 | 103.2 | 93.4 | 0.012 | 0.004 | 1.31 | 0.58 | 0.57 | 0.04 | 0.61 |
LOVNIT | 3.003 | 3.068 | 105.7 | 101.8 | 0.011 | −0.001 | 1.27 | −0.11 | 0.16 | 0.11 | 0.27 |
The calculations were carried out at B3LYP/6-311 + G(2d,p) level of theory. d1 = O2···C5; d2 = O6···C1; θ 1 = ∠O2···C5 = O6; θ 2 = ∠O6···C1 = O2 (Supplementary Fig. 3), E 1 (n→π*), E 2 (n→π*) and E t (n→π*) have same meaning as described before in Table 1. [CO-I and CO-II are randomly chosen in these molecules]
In most of the molecules obtained from the CSD search, reciprocal C═O···C═O interactions were stabilized by both n→π* and π→π* interactions between the carbonyl groups (Table 2 and Supplementary Table 7). We observed substantial C═O···C═O π→π* interactions in molecules having θ 1 and θ 2 values > 90° (Supplementary Table 7). In some cases, π→π* interactions are even stronger than n→π* interactions. When θ 1 and θ 2 values were <90°, π→π* interactions were observed for molecules having relatively shorter O···C distances (both d1 and d2 <2.90 Å) and stronger n→π* interactions. We propose that although the contribution of individual orbital interaction is small, the overall contribution of two n→π* and two π→π* interactions to the stabilization of molecules having reciprocal C═O···C═O interactions could be significant. Based on the NBO calculations at B3LYP/6-311 + G(2d,p) level, we observed that reciprocal C═O···C═O interactions contribute 0.11–3.37 kcal mol−1 (with an average value of 0.98 kcal mol−1) to the stabilization of small molecules (see the last column in Supplementary Table 7).
We observed positive values of Δ and Θ for the carbonyl carbons in most of the molecules from the CSD listed in Table 2, which indicate their pyramidalization towards the donor oxygen atoms. The plots of Θ with O···C distances and the strength of the reciprocal interactions in compounds obtained from the CSD search are shown in Supplementary Fig. 5. Although the correlation between pyramidality of second carbonyl (CO-II) carbon (Θ2) and d1 looks better than the correlation between pyramidality of first carbonyl (CO-I) carbon (Θ1) and d2, the CO-I and CO-II are chosen completely randomly in these molecules. As the pyramidalization also depends on other factors like θ and the elasticity of the carbonyl group, a strong correlation between pyramidalization and the O···C distance and strength of n→π* interactions may not be observed in these molecules having different types of carbonyl groups as well as different θ values.
To get some insights into the structures of the small molecules having reciprocal C═O···C═O interactions, we manually analyzed small molecules from the CSD having 1,5-type reciprocal interactions with both d1 and d2 ≤ 3.00 Å. A total of 249 molecules fulfill the above criteria [1, 5-interaction; both d1 and d2 ≤ 3.00 Å]. As can be anticipated, the nature of the two atoms/groups between the interacting carbonyl groups plays a key role in keeping the two carbonyl groups non coplanar and provides them the conformation required for reciprocal interactions (Supplementary Table 8). Interestingly, majority of these molecules (117, ~47%) have one heteroatom and one chiral carbon between the two interacting carbonyl pairs, a feature that resembles peptides and proteins.
Reciprocal carbonyl-carbonyl interactions in proteins
The presence of reciprocal C═O···C═O interactions in the X-ray crystal geometries of small organic molecules inspired us to look for their presence in protein crystal structures. To probe the presence of reciprocal C═O···C═O interactions in proteins, we analyzed a total of 2269 protein crystal structures with resolution ≤ 1.6 Å from the PDB with redundancy (pairwise sequence identity) less than 10%, out of which 2184 showed the presence of reciprocal interactions in them. The PDB protein structures ranked by the number of reciprocal C═O···C═O interactions present in them are included in Supplementary Data 2. For the PDB search, the distance between the carbonyl oxygen of ith amino acid residue and the carbonyl carbon of (i + 1)th amino acid residue is defined as d1. The distance between the carbonyl oxygen of (i + 1)th residue and carbonyl carbon of ith residue is defined as d2. The corresponding ∠O···C═O angles are defined as θ 1 and θ 2, respectively (Supplementary Fig. 6). During the search, both d1 and d2 were kept ≤ 3.20 Å but no restriction was imposed on θ 1 and θ 2. The plot of d1 and d2 values obtained from the search show that most of them fall in 2.90−3.20 Å range (Fig. 4a). The angles θ 1 and θ 2 (~85 ± 15°) deviates significantly from the Bürgi-Dunitz trajectory (Fig. 4b). These observations are consistent with the trend that was observed for small molecules discussed above. Analyses of d1 and d2 for all amino acid residues in all proteins (2184) studied here show that shorter distances d1 and d2 ≤ 3.2 Å fall within the tail of the full distribution (Supplementary Fig. 7).
In a previous study22, Bartlett et al reported one-sided n→π* interactions with d ≤ 3.20 Å and 99o ≤ θ ≤ 119o. As we have applied the same distance (d ≤ 3.20 Å) and resolution (<1.6 Å) criteria, the reciprocal interactions observed here for angles 99o ≤ θ 1, θ 2 ≤ 119o would be observed as one-sided n→π* interactions by using the criteria of Bartlett et al. As can be seen from Fig. 4b, the distribution of θ 1 and θ 2 in the range of 99°–119° (regions II, III, and IV) is a very small percentage (6.5%) of the total number of reciprocal C═O···C═O interactions that are being reported here. This indicates that reciprocal C═O···C═O interactions are novel and distinct from one-sided n→π* interactions reported previously.
NBO analysis of 30 amino acid pairs (Supplementary Fig. 8) with short O···C distances (both d1 and d2 ≤ 3.20 Å) that covers the complete range of observed ∠O···C = O angle (θ) (70–110°) clearly showed the presence of reciprocal n→π* interactions (Table 3, Figs. 4c, d). Similar to CSD molecules, in proteins also we observed substantial C═O···C═O π→π* interactions between the amino acid pairs having θ 1 and θ 2 values > 90° (Supplementary Table 9). π→π* NBO orbital overlap between the two carbonyl groups in an amino acid pair is shown in Figs. 4e, f [Leu-Pro (141–142); [PDB: 2x5o]. For molecules having relatively stronger n→π* interactions (both d1 and d2 < 2.90 Å), π→π* interactions were observed for θ 1 and θ 2 values <90° also (Table 3). This indicates that the overall contribution of reciprocal interactions (two n→π* and two π→π* interactions) could be substantial to protein stabilization. Based on the NBO calculations at B3LYP/6-311 + G(2d,p) level, we observed that reciprocal C═O···C═O interactions contribute 0.27–4.41 kcal mol−1 (with an average value of 1.34 kcal mol−1) to the stabilization of proteins per amino acid pair (see the last column in Supplementary Table 9).
Table 3.
Amino acid pair | PDB code | Residues | d1 | d1 | θ 1 | θ 2 | n→π* (kcal mol−1) | Total n→π* (kcal mol−1) | |
---|---|---|---|---|---|---|---|---|---|
(Å) | (Å) | (°) | (°) | E 1 (n→π*) | E 2 (n→π*) | E t (n→π*) | |||
Ile–Pro | 2opc | 135–136 | 2.675 | 2.768 | 75.6 | 71.8 | 1.75 | 0.74 | 2.49 |
Lys–Pro | 1k3i | 50–51 | 2.975 | 2.938 | 78.9 | 80.5 | 0.44 | 0.38 | 0.82 |
Cys–Pro | 1gcy | 251–252 | 2.815 | 2.834 | 80.6 | 79.8 | 1.24 | 0.60 | 1.84 |
Leu–Pro | 1g5a | 379–380 | 2.956 | 2.978 | 80.7 | 79.6 | 0.61 | 0.27 | 0.88 |
Ile–Pro | 1o7i | 107–108 | 2.986 | 3.082 | 81.6 | 77.8 | 0.44 | 0.19 | 0.63 |
Val–Pro | 1jnd | 294–295 | 3.119 | 3.135 | 81.6 | 81.1 | 0.17 | 0.10 | 0.27 |
Ala–Pro | 2xu9 | 264–265 | 2.568 | 2.852 | 83.8 | 72.0 | 3.63 | 0.56 | 4.19 |
Thr–Pro | 1fj2 | 3–4 | 3.035 | 3.015 | 84.2 | 85.4 | 0.26 | 0.34 | 0.60 |
Ile–Pro | 1e2w | 208–209 | 3.114 | 3.143 | 84.4 | 83.2 | 0.24 | 0.07 | 0.31 |
Leu–Pro | 1a2p | 20–21 | 2.904 | 3.007 | 85.3 | 80.5 | 0.72 | 0.33 | 1.05 |
Pro–Pro | 3cx2 | 186–187 | 2.890 | 2.838 | 86.3 | 88.7 | 0.98 | 0.39 | 1.37 |
Glu–Pro | 1eu1 | 623–624 | 3.144 | 3.194 | 88.0 | 85.8 | 0.22 | 0.03 | 0.25 |
Leu–Pro | 2x5o | 141–142 | 2.712 | 2.771 | 88.9 | 86.5 | 1.58 | 1.30 | 2.88 |
Ala–Pro | 1g12 | 103–104 | 3.108 | 3.152 | 89.3 | 87.3 | 0.32 | 0.10 | 0.42 |
Pro–Ser | 4psc | 32–33 | 2.998 | 2.946 | 90.4 | 92.4 | 0.46 | 0.33 | 0.79 |
Thr–Glu | 4pdy | 108–109 | 2.918 | 2.936 | 91.5 | 90.4 | 0.72 | 0.25 | 0.97 |
His–Ser | 1b6a | 331–332 | 3.128 | 3.108 | 92.3 | 93.2 | 0.13 | 0.14 | 0.27 |
Ala–Asp | 2bi8 | 95–96 | 2.985 | 2.973 | 92.8 | 93.3 | 0.52 | 0.13 | 0.65 |
Phe–Pro | 1n08 | 76–77 | 2.943 | 2.941 | 93.2 | 93.3 | 0.52 | 0.10 | 0.62 |
Leu–Pro | 1eb6 | 110–111 | 3.176 | 3.175 | 93.4 | 93.2 | 0.14 | 0.10 | 0.24 |
Leu–Tyr | 3u26 | 101–102 | 2.826 | 2.893 | 93.6 | 90.2 | 0.96 | 0.34 | 1.30 |
Ser–Asp | 3ry4 | 79–80 | 2.952 | 2.981 | 94.2 | 92.9 | 0.56 | 0.12 | 0.68 |
Ala–Phe | 4y1w | 139–140 | 2.947 | 2.973 | 95.3 | 93.9 | 0.54 | 0.26 | 0.80 |
Gln–Lys | 3wcq | 9–10 | 2.815 | 2.816 | 95.9 | 96.9 | 1.03 | 0.16 | 1.19 |
Thr–Thr | 3uxf | 349–350 | 2.987 | 2.975 | 96.4 | 97.3 | 0.28 | 0.13 | 0.42 |
Ala–Arg | 1ejd | 119–120 | 3.175 | 3.171 | 99.3 | 99.0 | 0.13 | NP | 0.13 |
Phe–Gly | 1odv | 28–29 | 3.129 | 3.169 | 102.4 | 100.3 | 0.13 | 0.02 | 0.15 |
Asp–Pro | 2vzp | 2–3 | 3.026 | 3.026 | 103.6 | 103.3 | 0.07 | 0.19 | 0.26 |
Ala–Leu | 1ikp | 388–389 | 3.141 | 3.183 | 104.5 | 102.4 | 0.12 | NP | 0.12 |
Ala–Ala | 3s5m | 402–403 | 2.485 | 2.999 | 113.6 | 84.2 | 0.89 | 0.01 | 0.90 |
The calculations were carried out at B3LYP/6-311 + G(2d,p) level of theory. For the definitions of d1, d2, θ 1 and θ 2 see Supplementary Fig. 6. E 1 (n→π*), E 2 (n→π*) and E t (n→π*) have same meaning as described before in Table 1. [CO-I and CO-II are randomly chosen in these molecules]
NP not present
The plot of torsion angles (φ, ψ) (Supplementary Fig. 6) of the residue between the two interacting carbonyl groups along with other residues in the proteins show that the reciprocal interactions are mainly concentrated in the polyproline II (PPII), β-turn and right-twisted β-strand regions (Fig. 5a). Unlike the one-sided n→π* interactions reported previously22, 23 that are abundant in proteins, the abundance of these newly discovered reciprocal C═O···C═O interactions is low (~7.2%). Secondary structure analyses using Stride38 show that reciprocal C═O···C═O interactions have considerable abundance in random coils (~20%) and turn regions (10%) of proteins but negligible presence in α-helices (0.35%) (Table 4). This is in contrast to the one-sided n→π* interactions that are most abundant in α-helices22, 23. As PPII helix is not included as an independent secondary structure in most secondary structure predication programs, many PPII helices remain unassigned even though they are present in the experimentally solved structures. We observed that the coil regions having reciprocal C═O···C═O interactions are dominated by PPII structures [(φ, ψ) : (−75°, 145°)]. We have confirmed this by plotting the φ, ψ angles of residues in the random coil regions having reciprocal interactions (Fig. 5b). This is not surprising given that PPII conformations are known to dominate coil regions of folded proteins39.
Table 4.
Secondary structure type | Total number of amino acids | Amino acids involved in reciprocal C═O···C═O interactions | % |
---|---|---|---|
Coil | 93743 | 18422 | 19.65 |
Turn | 124938 | 12577 | 10.07 |
β-sheet | 131564 | 6195 | 4.71 |
310-helix | 22712 | 408 | 1.80 |
α-helix | 156248 | 541 | 0.35 |
Overall | 529205 | 38143 | 7.21 |
We also manually analyzed 789 reciprocal C═O···C═O interactions in 10 proteins having the highest numbers of reciprocal C═O···C═O interactions (Supplementary Table 10). In agreement with Stride prediction, manual inspection revealed that reciprocal C═O···C═O interactions are mostly present in coil/PPII and turn regions of these proteins. α-helices that have reciprocal C═O···C═O interactions are distorted, while the β-sheets having reciprocal n→π* interactions are twisted (Fig. 6). We also observed reciprocal C═O···C═O interactions between amino acid pairs at the interfaces of different secondary structure types (Fig. 6e and Supplementary Table 11).
The other secondary structure that has significant abundance of reciprocal C═O···C═O interactions is β-turn. In a β-turn, the peptide groups (NH and C = O) of the central two amino acids do not participate in any inter-residue hydrogen bonding. Therefore, we assume that these residues may participate in local reciprocal C═O···C═O interactions either between themselves or with their other neighbors, which should compensate for the lack hydrogen bonding interactions in them. A careful examination of the orientations of the carbonyl groups in various common β-turns indicated that reciprocal C═O···C═O interaction may be feasible between the first and the second residues of type Iʹ and II β-turns due to the favorable orientations of the two carbonyl groups but likely to be unfavoured in type I and IIʹ β-turns. In fact, analysis of the 10 protein crystal structures discussed above show that most of the reciprocal C═O···C═O interaction pairs found in β-turns were type II, followed by type IV (Supplementary Table 12). In 41 cases, the reciprocal n→π* interactions were present between first and the second amino acid residues while in other 43 cases they were between the third and the fourth amino acid residues of β-turns in these 10 proteins. However, in no case the second and the third residues of the β-turn were involved in reciprocal C═O···C═O interactions between them (Supplementary Fig. 9).
Analysis of distribution of reciprocal C═O···C═O interactions among various amino acids suggests that proline is involved in the largest number of reciprocal C═O···C═O interactions in various proteins followed by glutamic acid and serine (Fig. 5c). This trend is different from what was previously observed for one-sided n→π* interactions in α-helices and β-sheets22 (Pro > Gly > Ala). Analysis of distribution of reciprocal C═O···C═O interactions among the amino acid pairs in various proteins reveals that Pro–Pro is the most abundant pair (Fig. 5d). The 10 most prominent amino acid pairs that participate in reciprocal C═O···C═O interactions, all contain a proline residue (Fig. 5d). These results may be expected given the abundance of reciprocal interactions in PPII regions.
Possible role of reciprocal carbonyl-carbonyl interactions in protein folding
PPII helices and turns are the major secondary structures where reciprocal C═O···C═O interactions are observed. PPII is the major well-defined backbone structure present in denatured, unfolded, and natively unfolded proteins40 and random coil regions of folded proteins39. As PPII lacks stable non-local amino acid interactions such as hydrogen bonding, we propose that local reciprocal interactions could possibly contribute to their stability. Levinthal proposed that protein folding is speeded and guided by the rapid formation of local interactions in the unfolded state, which then determine the further folding of the peptide41. The fact that local reciprocal interactions contribute to the stabilization of the PPII conformation that are abundant in unfolded proteins, reciprocal interactions could play a role in protein folding. Also, turn regions that are stabilized by reciprocal interactions are known to act as nucleation sites for protein folding. Therefore, an open question is how important such reciprocal interactions might be for protein folding.
Nature of reciprocal carbonyl-carbonyl interactions
The nature of C═O···C═O interactions has been debated in the literature. While some consider them n→π* orbital interactions9, 11, others believe them to be dipolar in nature7, 8, 10. We have so far discussed reciprocal C═O···C═O interactions as n→π* and π→π* orbital interactions because of the following reasons. Firstly, the plots of the n→π* and sum of n→π* and π→π* orbital interaction energies against the O···C distances (d) show a strong correlation (Figs. 7a, b). In Fig. 7a, we have plotted the distances (d1 and d2 values) against the stabilization energies due to n→π* interactions [NBO second order perturbation energies E 1 (n→π*) and E 2 (n→π*)] reported in Tables 1–3. The plot suggests that the stabilization energies E (n→π*) for n→π* interactions decreases with an increase in the O···C (d) in synthetic molecules 1–8, molecules taken from CSD and interacting amino acid pairs obtained from PDB (Tables 1–3). The overall orbital interaction energies (sum of n→π* [E (n→π*)] and π→π* [E (π→π*)] interaction energies reported in Tables 1–3) plotted in Fig. 7b also show a similar correlation with O···C (d) distances. These correlations indicate that orbital interaction is the major mechanism for the stabilization of these reciprocal C═O···C═O short contacts. Secondly, we carried out NBO deletion analysis on all the molecules reported in Tables 1–3 (Supplementary Table 13) and observed that deletion of n→π* interactions increases charge on donor oxygen lone pair (n O) and depletes it on acceptor carbonyl π* C═O orbital, which correlate well with the strength of O···C distances (Supplementary Fig. 10a, b). Similarly, deletion of π→π* interactions increases charge on π C═O orbital of donor carbonyl and depletes it on π* C═O orbital of the acceptor carbonyl (Supplementary Table 14), which also can be correlated to the strength of C═O···C═O short contacts (Supplementary Fig. 11a, b). The overall accumulation of charges on the acceptor carbonyl π* C═O orbitals due to donation from the oxygen lone pairs and π C═O orbital of donor carbonyl is shown in Figs. 7c–d, which correlate well with the strength of C═O···C═O short contacts. This also suggests that electron delocalization is a major contributor in reciprocal C═O···C═O interactions. Finally, C = O···C = O torsion angles of the carbonyl groups involved in reciprocal interactions indicate a net zero dipole-dipole interaction eliminating the possibility of these interactions being dipolar in nature. To emphasize this point, in Figs. 7e–f, we have plotted the values of C = O···C = O torsion angles of the 1432 molecules obtained from the CSD search. The torsion angle (T) between two dipoles could be used to understand the dipolar nature of interaction between them. As we know, antiparallel (T ~ 180°) dipoles attract and parallel dipoles (T ~ 0°) repel each other whereas two orthogonal dipoles (T ~ 90°) have net zero dipolar interaction. In case of reciprocal interaction, the C = O···C = O torsion angles show an orientational preference [C═O···C═O torsion angle falls in 60° to 90° (or −60° to −90°) range] as a consequence of the simultaneous restrictions on d1 and d2 (≤3.2 Å). However, the values of the C═O···C═O torsion angles (~90°) suggest that there would be almost net zero interaction between the dipoles, eliminating the possibility of strong dipolar interactions. Therefore, we conclude that orbital delocalization is the major driving force for the stabilization of reciprocal C═O···C═O interactions. An elaborate energy decomposition analysis may be required for the accurate deconvolution of various factors contributing to the stabilization of reciprocal C═O···C═O short contacts.
We conclude that reciprocal carbonyl-carbonyl interactions exist both in small organic molecules and proteins. However, due to geometrical constraints associated with such interactions, the approach of the donor oxygen atoms to the acceptor carbon atoms deviates significantly from the Bürgi-Dunitz trajectory, and therefore, electron delocalization between the oxygen lone pair (n O) and π*C═O orbital is weak. This weak donation from the first carbonyl group to the second is compensated by a back donation from the second carbonyl group to the first. In many cases, reciprocal π→π* interactions were also observed along with reciprocal n→π* interactions and their overall contributions to the stabilization of molecules having reciprocal C═O···C═O short contacts could be significant. In proteins, C═O···C═O n→π* interactions are present in all types of secondary structures. While one-sided n→π* interactions are prevalent in α-helices22, 23, reciprocal interactions are abundant in PPII helices and turn regions. Prevalence of reciprocal C═O···C═O interactions in PPII helices and turn regions of proteins suggests a possible role for these interactions in protein folding. Further, the presence of reciprocal C═O···C═O interactions in distorted α-helices and twisted β-sheets suggests that these interactions could stabilize secondary structures that deviate from their regular geometries. The reciprocal C═O···C═O interactions present at the interface of two different types of secondary structures could also help in stabilizing the strained amino acid residues that are present at these interfaces. In future, it would be interesting to investigate the ability of amino acid pairs having high propensity to get involved in reciprocal C═O···C═O interactions to stabilize PPII helices and β-turns. It would also be interesting to investigate if some non-peptidic fragments obtained from the CSD search having strong reciprocal C═O···C═O interactions could be used to stabilize PPII conformation or design peptide-turns. Finally, an energy decomposition analysis would provide better understanding of the forces that contributes to the stabilization of reciprocal C═O···C═O interactions.
Methods
Crystallization method
Single crystals of compounds 1–8 were grown by slow evaporation. Various solvent combinations were used to crystallize the compounds either at room temperature or low temperature (4 °C). Details of the crystallization conditions are given in Supplementary Table 1.
X-ray crystal structure determination method
Single crystal structures of compound 1–8 were determined by measuring X-ray intensity data. Bruker D8Venture APEX 342 single crystal home source X-ray diffractometer equipped with CMOS PHOTON 100 detector and Monochromated microfocus sources Mo Kα radiation (λ = 0.71073 Å) were used for data collection in phi (ϕ) and omega (ω) scan strategy at room temperature (298 K). The data was processed using SAINT 43 and absorption correction was done using SADABS 44 implemented in APEX 3. For structure solution XSHELL program based on SHELX45 was used. The non-hydrogen atoms were refined anisotropically and located in successive difference Fourier syntheses. The hydrogen atoms were fixed to neutron bond length using appropriate HFIX commands. ORTEP diagrams of compounds 1–8 (CCDC 1486577- 1486584) is provided in Supplementary Fig. 1. Compound 5 crystallized with a water molecule in the asymmetric unit. However, for clarity we have not shown the water molecule in its ORTEP diagram. Compound 7 has disorder at chlorine atom; the occupancy of disordered chlorine atom namely Cl1A and Cl1B was refined using the PART command. Similar ADP restraint SIMU46 and rigid bond restraint DELU46 was applied to stabilize the anisotropic refinement. SADI46 instruction was used to restrain the distance to equal. The anisotropic displacement parameter for disordered chlorine atom was fixed using EADP46 constraint.
CSD analysis
Intramolecular C═O···C═O noncovalent interactions were searched and structural data were retrieved from Cambridge Structural Database33 (CSD version 5.21 Nov. 2015) using Conquest47 (version 1.18) program. The fragment chosen for the search is shown in Supplementary Fig. 3, where X is indicative for any atom. Only unique matching fragments were taken and the fragment was chosen in such a way that there are at least two carbonyl groups irrespective of their nature. Distances d1 (O2–C5) and d2 (O6–C1) are restricted to ≤3.2 Å. Angles [O2–C5–O6 (θ 1) and O6–C1–O2 (θ 2)] and dihedral angles (C1–O2–C5–O6 and C5–O6–C1–O2) were printed without any restriction. Only crystalline, non-ionic and non-polymeric organic molecules having no disorder and error with R factor ≤ 5% having at least three covalent bond separations between the carbonyl groups were considered in this search.
PDB analysis
A subset of 2269 protein was culled out from RCSB PDB34 using a search criterion of resolution <1.6 Å with redundancy (pairwise sequence identity) less than 10%, downloaded on 19 January 2016. Out of 2269 proteins, 2184 showed the reciprocal n→π* interaction. For proteins existing in polymeric form or for proteins containing amino acids in more than one conformation, Chain A and conformation A were chosen, except for 57 proteins where chain A is absent. Distance d1 is defined as distance between the i th amide oxygen to the subsequent (i + 1)th amide carbon, while d2 is defined as distance between the (i + 1)th amide oxygen to the i th amide carbon (Supplementary Fig. 6). We used d1 ≤ 3.2 Å and d2 ≤ 3.2 Å criteria for selecting amino acid pairs participating in reciprocal n→π* interactions. Secondary structure assignment was done using the Stride code39. Ramachandran plots were generated for the proteins using Gnuplot (http://www.gnuplot.info/).
Computational methods
All the calculations were performed by using Gaussian09 suite of quantum chemistry programs48. The Hartree-Fock (HF)49 and the hybrid Becke 3-Lee-Yang-Parr (B3LYP)50, 51 exchange correlation functional with 6-311 + G (2d,p) basis set were used for the calculations. Natural bond orbital (NBO)35 analyses were performed on the crystal geometries of the synthetic molecules and small organic molecules obtained from CSD search. For proteins, the coordinates of the interacting amino acid residue pair were extracted using PyMOL52. The α-carbons of the amino acid residues adjacent to N and C termini of the amino acid pair were also included, so as to mimic a dipeptide with N and C termini capped with N(CO)Me and (CO)NMe, respectively. Finally, hydrogen atoms were added to the structure using PyMOL (Supplementary Fig. 8). NBO analyses were carried out on crystal geometries at B3LYP/6-311 + G(2d,p) and HF/6-311 + G(2d,p) level of theory. The NBO second order perturbative energies E (n→π*) and E (π→π*) obtained from NBO calculations were taken as the stabilization energy due to n→π* and π→π* interactions. NBO deletion analysis was carried out on crystal geometries at HF/6-311 + G(2d,p) level of theory.
Data availability
The authors declare that the data supporting the findings of this study are available within the paper and its Supplementary Information files, and also are available from the corresponding author upon reasonable request. X-ray crystallographic data for structures reported in this study have been deposited at the Cambridge Crystallographic Data Centre (CCDC), under deposition number CCDC 1486577-1486584. These data can be obtained free of charge from the CCDC via www.ccdc.cam.ac.uk/.
Electronic supplementary material
Acknowledgements
We thank E. Arunan, M. Goswami, B.B. Dhar, R.P. Vivekananth and R. Ayana for helpful discussions. This project was funded by Shiv Nadar University and Early Career Research Grant (ECR/2015/000337) from Science and Engineering Research Board (SERB), Department of Science and Technology (DST), Government of India.
Author contributions
B.K.S. conceived the project. A.R. carried out the synthesis and characterization of the compounds. A.R. crystallized the compounds, K.K.J. collected the X-ray data and K.K.J. and A.R. solved the structures. B.K.S. and A.R. designed the CSD analyses. A.R. performed the CSD analyses. B.K.S. and P.S. designed the PDB analyses. P.S. performed the PDB analyses. B.K.S. designed the computational studies and A.R. performed them. B.K.S. wrote the manuscript. B.K.S., A.R., N.S., P.S., and K.K.J. discussed the results and edited the manuscript.
Competing interests
The authors declare no competing financial interests.
Footnotes
Abdur Rahim and Pinaki Saha contributed equally to this work.
Electronic supplementary material
Supplementary Information accompanies this paper at doi:10.1038/s41467-017-00081-x.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Strekowski L, Wilson B. Noncovalent interactions with DNA: An overview. Mutat. Res. Fund. Mol. Mech. Mut. 2007;623:3–13. doi: 10.1016/j.mrfmmm.2007.03.008. [DOI] [PubMed] [Google Scholar]
- 2.Baldwin, R. L. Protein Folding Handbook Ch. 6 (Wiley-VCH Verlag GmbH, 2005).
- 3.Karshikoff, A. Non-covalent Interactions in Proteins (Imperial College Press, 2006).
- 4.Hill, J. W., Petrucci, R. H., McCreary, T. W. & Perry, S. S. General Chemistry Ch. 11 (Pearson Prentice Hall, 2005).
- 5.Burgi HB, Dunitz JD, Shefter E. Chemical reaction paths. IV. Aspects of O···C=O interactions in crystals. Acta Cryst. 1974;B30:1517–1527. doi: 10.1107/S0567740874005188. [DOI] [Google Scholar]
- 6.Singh SK, Das A. The n→π* interaction: a rapidly emerging non-covalent interaction. Phys. Chem. Chem. Phys. 2015;17:9596–9612. doi: 10.1039/C4CP05536E. [DOI] [PubMed] [Google Scholar]
- 7.Paulini R, Müller K, Diederich F. Orthogonal multipolar interactions in structural chemistry and biology. Angew. Chem. Int. Ed. 2005;44:1788–1805. doi: 10.1002/anie.200462213. [DOI] [PubMed] [Google Scholar]
- 8.Fischer FR, Wood PA, Allen FH, Diederich F. Orthogonal dipolar interactions between amide carbonyl groups. Proc. Natl. Acad. Sci. U.S.A. 2008;105:17290–17294. doi: 10.1073/pnas.0806129105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Choudhary A, Gandla D, Krow GR, Raines RT. Nature of amide carbonyl−carbonyl interactions in proteins. J. Am. Chem. Soc. 2009;131:7244–7246. doi: 10.1021/ja901188y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Worley B, Richard G, Harbison GS, Powers R. 13C NMR reveals no evidence of n−π* interactions in proteins. PLoS ONE. 2012;7:e42075. doi: 10.1371/journal.pone.0042075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kamer KJ, Choudhary A, Raines RT. Intimate interactions with carbonyl groups: dipole–dipole or n→π*? J. Org. Chem. 2013;78:2099–2103. doi: 10.1021/jo302265k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Choudhary A, Kamer KJ, Raines RT. An n→π* Interaction in aspirin: Implications for structure and reactivity. J. Org. Chem. 2011;76:7933–7937. doi: 10.1021/jo201389d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Newberry RW, Raines RT. A Key n→π* interaction in N-acyl homoserine lactones. ACS Chem. Biol. 2014;9:880–883. doi: 10.1021/cb500022u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Newberry RW, VanVeller B, Guzei IA, Raines RT. n→π* Interactions of amides and thioamides: implications for protein stability. J. Am. Chem. Soc. 2013;135:7843–7846. doi: 10.1021/ja4033583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.DeRider ML, Wilkens SJ, Waddell MJ, Bretscher LE, Weinhold F, Raines RT, Markley JL. Collagen stability: Insights from NMR spectroscopic and hybrid density functional computational investigations of the effect of electronegative substituents on prolyl ring conformations. J. Am. Chem. Soc. 2002;124:2497–2505. doi: 10.1021/ja0166904. [DOI] [PubMed] [Google Scholar]
- 16.Newberry RW, Raines RT. n→π* interactions in poly(lactic acid) suggest a role in protein folding. Chem. Commun. 2013;49:7699–7701. doi: 10.1039/c3cc44317e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Choudhary A, Raines RT. Signature of n→π* Interaction in α-helices. Protein Sci. 2011;20:1077–1081. doi: 10.1002/pro.627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gorske BC, Bastian BL, Geske GD, Blackwell HE. Local and tunable n→π* Interactions regulate amide isomerism in the peptoid backbone. J. Am. Chem. Soc. 2007;129:8928–8929. doi: 10.1021/ja071310l. [DOI] [PubMed] [Google Scholar]
- 19.Gorske BC, Stringer JR, Bastian BL, Fowler SA, Blackwell HE. New strategies for the design of folded peptoids revealed by a survey of noncovalent interactions in model systems. J. Am. Chem. Soc. 2009;131:16555–16567. doi: 10.1021/ja907184g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Stringer JR, Crapster JA, Guzei IA, Blackwell HE. Extraordinary Robust polyproline type I peptoid helices generated via the incorporation of α-chiral aromatic N-1 naphthylethyl side chains. J. Am. Chem. Soc. 2011;133:15559–15567. doi: 10.1021/ja204755p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Laursen JS, Engel-Andreasen J, Fristrup P, Harris P, Olsen CA. Cis–Trans Amide Bond Rotamers in β-Peptoids and Peptoids: Evaluation of Stereoelectronic Effects in Backbone and Side Chains. J. Am. Chem. Soc. 2013;135:2835–2844. doi: 10.1021/ja312532x. [DOI] [PubMed] [Google Scholar]
- 22.Bartlett GJ, Choudhary A, Raines RT, Woolfson DN. n→π* interactions in proteins. Nat. Chem. Biol. 2010;6:615–620. doi: 10.1038/nchembio.406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fufezan C. The role of Buergi-Dunitz interactions in the structural stability of proteins. Proteins. 2010;78:2831–2838. doi: 10.1002/prot.22800. [DOI] [PubMed] [Google Scholar]
- 24.Bartlett GJ, Newberry RW, VanVeller B, Raines RT, Woolfson DN. Interplay of Hydrogen Bonds and n→π* Interactions in Proteins. J. Am. Chem. Soc. 2013;135:18682–18688. doi: 10.1021/ja4106122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Newberry RW, Bartlett GJ, VanVeller B, Woolfson DN, Raines RT. Signatures of n→π* interactions in proteins. Protein Sci. 2014;23:284–288. doi: 10.1002/pro.2413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Pal TK, Sankararamakrishnan R. Quantum chemical investigations on intraresidue carbonyl−carbonyl contacts in aspartates of high-resolution protein structures. J. Phys. Chem. B. 2010;114:1038–1049. doi: 10.1021/jp909339r. [DOI] [PubMed] [Google Scholar]
- 27.Pal TK, Sankararamakrishnan R. Self-contacts in Asx and Glx residues of high-resolution protein structures: Role of local environment and tertiary interactions. J. Mol. Graph. Model. 2008;27:20–33. doi: 10.1016/j.jmgm.2008.02.001. [DOI] [PubMed] [Google Scholar]
- 28.Bondi A. van der Waals volumes and radii. J. Phys. Chem. 1964;68:441–451. doi: 10.1021/j100785a001. [DOI] [Google Scholar]
- 29.Esposito L, Vitagliano L, Zagari A, Mazzarella L. Pyramidalization of backbone carbonyl carbon atoms in proteins. Protein Sci.. 2000;9:2038–2042. doi: 10.1110/ps.9.5.831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Singh SK, Mishra KK, Sharma N, Das A. Direct spectroscopic evidence for an n→π* interaction. Angew. Chem. Int. Ed. 2016;55:1–6. doi: 10.1002/anie.201510990. [DOI] [PubMed] [Google Scholar]
- 31.Allen FH, Baalham CA, Lommerse JPM, Raithby PR. Carbonyl–carbonyl interactions can be competitive with hydrogen bonds. Acta Cryst. 1998;B54:320–329. doi: 10.1107/S0108768198001463. [DOI] [Google Scholar]
- 32.Maccallum PH, Poet R, Milner-White EJ. Coulombic attractions between partially charged main-chain atoms stabilise the right-handed twist found in most β–Strands. J. Mol. Biol. 1995;248:374–384. doi: 10.1016/s0022-2836(95)80057-3. [DOI] [PubMed] [Google Scholar]
- 33.Groom CR, Allen FH. The cambridge structural database in retrospect and prospect. Angew. Chem. Int. Ed. 2014;53:662–671. doi: 10.1002/anie.201306438. [DOI] [PubMed] [Google Scholar]
- 34.Berman HM, et al. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Glendening, E. D., Reed, A. E., Carpenter, J. E. & Weinhold, F. NBO Version 3.1.
- 36.Choudhary A, Fry CG, Raines RT. Modulation of an n→π* interaction with α-fluoro groups. Arkivoc. 2010;viii:251–262. [PMC free article] [PubMed] [Google Scholar]
- 37.Yoshinaga F, Tormena CF, Freitas MP, Rittner R, Abraham RJ. Conformational analysis of 2-halocyclohexanones: an NMR, theoretical and solvation study. J. Chem. Soc. Perkin Trans. 2002;2:1494–1498. doi: 10.1039/b204635k. [DOI] [Google Scholar]
- 38.Frishman D, Argos P. Knowledge‐based protein secondary structure assignment. Proteins. 1995;23:566–579. doi: 10.1002/prot.340230412. [DOI] [PubMed] [Google Scholar]
- 39.Adzhubeil AA, Sternberg MJE, Makarov AA. Polyproline-II helix in proteins: structure and function. J. Mol. Biol. 2013;425:2100–2132. doi: 10.1016/j.jmb.2013.03.018. [DOI] [PubMed] [Google Scholar]
- 40.Shi Z, Chen K, Liu Z, Kallenbach NR. Conformation of the backbone in unfolded proteins. Chem. Rev. 2006;106:1877–1897. doi: 10.1021/cr040433a. [DOI] [PubMed] [Google Scholar]
- 41.Levinthal, C. How to fold graciously. in Mossbauer Spectroscopy in Biological Systems: Proceeding of a meeting held at Allerton House (eds De Brunner, J.T.P. & Munck, E.) 22–24, (Monticello, Illinois, University of Illinois Press, 1969).
- 42.Bruker APEX 3, Version 2, User Manual S86 –EXS053 (Bruker AXS Inc., 2015).
- 43.Siemens, SMART system (Siemens Analytical X-ray Instruments Inc., 1995).
- 44.Sheldrick, G. M. SADABS (Bruker AXS Inc., 2007).
- 45.Sheldrick GM. A short history of SHELX. Acta Cryst. Sect. A. 2008;64:112–122. doi: 10.1107/S0108767307043930. [DOI] [PubMed] [Google Scholar]
- 46.Muller, P., Herbst-Irmer, R., Spek, A. L., Schneider, T. R. & Sawaya, M. R. Crystal Structure Refinement - A Crystallographer’s Guide to SHELXL (Oxford University Press, 2006).
- 47.Bruno IJ, et al. New software for searching the Cambridge Structural Database and visualizing crystal structures. Acta Crystallogr. Sect. B. 2002;58:389–397. doi: 10.1107/S0108768102003324. [DOI] [PubMed] [Google Scholar]
- 48.Frisch, M. J. et al. Gaussian 09, Revision D.01 (Gaussian, Inc., 2013).
- 49.Slater JC. A simplification of the Hartree-Fock method. Phys. Rev. 1951;81:385–390. doi: 10.1103/PhysRev.81.385. [DOI] [Google Scholar]
- 50.Lee C, Yang W, Parr RG. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B. 1988;37:785–789. doi: 10.1103/PhysRevB.37.785. [DOI] [PubMed] [Google Scholar]
- 51.Becke AD. Density‐functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 1993;98:5648–5652. doi: 10.1063/1.464913. [DOI] [Google Scholar]
- 52.The PyMOL Molecular Graphics System, Version 1.8, Schrödinger, LLC.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The authors declare that the data supporting the findings of this study are available within the paper and its Supplementary Information files, and also are available from the corresponding author upon reasonable request. X-ray crystallographic data for structures reported in this study have been deposited at the Cambridge Crystallographic Data Centre (CCDC), under deposition number CCDC 1486577-1486584. These data can be obtained free of charge from the CCDC via www.ccdc.cam.ac.uk/.