Abstract
For the first time, at the MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of theory, a comprehensive quantum-mechanical investigation of the physico-chemical mechanism of the tautomeric wobblization of the four biologically-important G·C nucleobase pairs by the participation of the monomers in rare, in particular mutagenic, tautomeric forms (marked with an asterisk) was provided. These novel tautomeric transformations (wobblization or shifting of the bases within the pair) are intrinsically inherent properties of the G·C nucleobase pairs. In this study, we have obtained intriguing results, lying far beyond the existing representations. Thus, it was shown that Löwdin's G*·C*(WC) base pair does not tautomerize according to the wobblization mechanism. Tautomeric wobblization of the G*·C*(rWC) (relative Gibbs free energy ΔG = 0.00/relative electronic energy ΔE = 0.00 kcal·mol−1) (“r”—means the configuration of the base pair in reverse position; “WC”—the classic Watson-Crick configuration) and G*t·C*(H) (ΔG = −0.19/ΔE = 0.29 kcal·mol−1) (“H”—Hoogsteen configuration;”t” denotes the O6H hydroxyl group in the trans position) base pairs are preceded by the stages of the base pairs tautomerization by the single proton transfer (SPT). It was established that the G*t·C*(rH) (ΔG = 2.21/ΔE = 2.81 kcal·mol−1) base pair can be wobbled through two different pathways via the traditional one-stage mechanism through the TSs, which are tight G+·C− ion pairs, stabilized by the participation of only two intermolecular H-bonds. It was found out that the G·C base pair is most likely incorporated into the DNA/RNA double helix with parallel strands in the G*·C*(rWC), G·C*(rwwc), and G*·C(rwwc) (“w”—wobble configuration of the pair) tautomeric forms, which are in rapid tautomeric equilibrium with each other. It was proven that the G*·C*(rWC) nucleobase pair is also in rapid tautomeric equilibrium with the eight tautomeric forms of the so-called Levitt base pair. It was revealed that a few cases of tautomerization via the DPT of the nucleobase pairs by the participation of the C8H group of the guanine had occurred. The biological role of the obtained results was also made apparent.
Keywords: DNA, RNA, G•C base pair, tautomerization mechanism, wobblization, wobble base pair, Levitt base pair, Löwdin's base pair
Introduction
Shortly after the establishment of the spatial organization of the DNA molecule by James Watson and Francis Crick (Watson and Crick, 1953a,b), the tautomeric hypothesis was formulated (Watson and Crick, 1953b; Crick and Watson, 1954), which considers the transformation or transition of the nucleotide bases from the main (canonical) into the rare (mutagenic) tautomeric form as the main source of the origin of spontaneous point mutations. Since that time, the topic of tautomerism has remained active over the decades to the present day (Löwdin, 1963, 1966; Topal and Fresco, 1976; Florian et al., 1994; Gorb et al., 2004; Brovarets' et al., 2014; Godbeer et al., 2015; Turaeva and Brown-Kennerly, 2015).
However, up until recently it was considered that only a few unusual tautomers existed for the G·C Watson-Crick nucleobase pair (Pous et al., 2008; Alvey et al., 2014; Brovarets' and Hovorun, 2014a; Nikolova et al., 2014; Poltev et al., 2016; Szabat and Kierzek, 2017; Brovarets' et al., 2019a,b; Srivastava, 2019). In particular, tautomerization via the double proton transfer (DPT) has been carefully investigated in the reverse Löwdin G*·C*(rWC), Hoogsteen (H) G*′·C*(H), and reverse Hoogsteen G*′·C*(rH) base pairs (Brovarets' et al., 2019b), leading to the novel structures: G·(rWC), C(rWC), G*′N2·C(rWC), C(H), and G*′N7·C(rH).
Eventually a great contribution into the further development of the tautomeric hypothesis was made by Per-Orlov Löwdin (Löwdin, 1963, 1966) and Topal and Fresco (Topal and Fresco, 1976; Brovarets' et al., 2014). Thus, Per-Orlov Löwdin expressed the revolutionary, non-trivial opinion that the ability of the nucleotide bases to transform into the rare tautomeric form is provided by the electronic structure of the canonical DNA base pairs and qualitatively substantiated this assumption from the position of quantum mechanics. Subsequently, Topal and Fresco elaborated this approach in more detail, by using simple and visual models, and extended it for the explanation of the limited accuracy of codon-anticodon recognition (Topal and Fresco, 1976; Brovarets' et al., 2014).
In recent years, an alternative view in this area of the research was suggested, which could be characterized as the “Renaissance” of the tautomeric hypothesis [see Chapter Brovarets' and Hovorun (2018) and bibliography provided there]. According to this investigation the new, unusual pathways of the tautomeric interconversions between wobble (w) and Watson-Crick (WC) base pairs have been provided (Brovarets' and Hovorun, 2009, 2015a,b,c,d,e,f):
- for usual A·T and G·C DNA base pairs: A·T(WC)↔A*·T(w)/A(w)/A·T*(w) and G·C(WC)↔G·(w)/G*·C↓(w)/G·(w)/G*·C↑(w) (Brovarets' and Hovorun, 2015a);
- for unusual purine-pyrimidine wobble A·C and G·T DNA base pairs: A·C(w)↔A·C*(WC) and G·T(w)↔G*·T(WC) (Brovarets' and Hovorun, 2009, 2015b,c);
- for incorrect purine-purine A·A, G·G, and A·G DNA base pairs: A·A(w)↔A*·A(WC), G·G(w)↔G*·G(WC), A·G(WC)↔A·(w), A·G(WC)↔A*·G↑(w) (Brovarets' and Hovorun, 2015d,e);
- for pyrimidine-pyrimidine base pairs: C·T(WC)↔C*·T↑(w), C·T(WC)↔C·(w), T·T(w)↔T·T*(WC), C·C(w)↔C·C*(WC) (Brovarets' and Hovorun, 2015f).
Thus, by utilizing modern quantum-mechanical (QM) methods, the mechanisms of the mutagenic tautomerization of the pairs of nucleotide bases were investigated in detail, which were revealed to be active players in the field of spontaneous point mutagenesis (Brovarets' and Hovorun, 2018). It was established, in which cases Löwdin's approach was adequate and in which cases another approach should be reconsidered and supplemented.
Thus, it was suggested that the mechanism of the mutagenic tautomerization of the DNA base pairs, in particular classic Watson-Crick pairs, are accompanied by the mutual shifting (wobblization) of the bases one relative to the other into the minor or major DNA grooves at the intrapair sequential proton transfer (Brovarets' and Hovorun, 2015a; Brovarets' et al., 2019a). This valuable finding enables researchers to figure out, how the incorrect DNA base pairs, which architecture is different from the Watson-Crick configuration, can acquire the enzymatically-competent conformation, that guarantees their successful chemical incorporation into the composition of the main carrier of the genetic information—DNA—by the high-fidelity DNA-polymerase. Notably, even though these theoretical approaches have been realized in quite basic model objects, they correctly reflect the real state-of-affairs at the macromolecular level, since they have been experimentally confirmed for macromolecular objects.
In this research, the objects of the investigation have been extended—except the Watson-Crick (WC) nucleobase pair, to the other biologically-important G·C nucleobase pairs—reverse Watson-Crick G·C(rWC), Hoogsteen G·C(H), and reverse Hoogsteen G·C(rH). Also, it was exactly established why the classic A·T(WC) DNA base pair was selected for the construction of the genetic material (Brovarets' and Hovorun, 2009, 2015a,b,c,d,e,f; Brovarets' et al., 2018a). The novel mechanism of the mutagenic tautomerization of the biologically-important A·T DNA base pairs through the quasi-orthogonal transition state and also through the protonated amino-group (Brovarets' et al., 2018b,c,d,e,f) was revealed for the first time. Based on these data an assumption was expressed about their possible biological role.
At the same time, investigations into the mechanisms of the mutagenic tautomerization of the pairs of nucleotide bases seemed to be quite a complicated issue, which may not be evident at a first glance. Thus, recent investigations into the tautomerization mechanisms of the biologically-important G·C nucleobase pairs, in which monomers are in the rare, in particular mutagenic, tautomeric form, continue to challenge researchers by its mystery (Brovarets' et al., 2019a,b).
It is still not possible to formulate simple physico-chemical rules, that would predict the course of these biologically important processes. Obviously, this is due to the fact that despite the enormous theoretical and experimental efforts of researchers, the present material remains insufficient for its final generalization.
This work aims to deepen the existing ideas about the microstructural mechanisms of the tautomerization of the biologically important pairs of nucleobases using the example of the G·C base pair (Brovarets' and Hovorun, 2014a), for which both monomers are in the rare tautomeric form.
Such a task is completely substantiated—we have investigated a few surprising tautomerizations, which significantly expand the existing ideas on tautomerization mechanisms and their biological applications. They will be outlined and discussed in more detail below.
Computational Methods
Density Functional Theory Calculations of the Geometry and Vibrational Frequencies
Equilibrium geometries of the investigated nucleobase pairs and the transition states (TSs) of their mutual tautomeric transformations, as well as their harmonic vibrational frequencies have been calculated at the B3LYP/6-311++G(d,p) level of QM theory (Hariharan and Pople, 1973; Krishnan et al., 1980; Lee et al., 1988; Parr and Yang, 1989; Tirado-Rives and Jorgensen, 2008), using the Gaussian'09 program package (Frisch et al., 2010). An applied level of theory has proved itself to be successful for the calculations of similar systems (Brovarets' and Hovorun, 2010a,b, 2015g; Matta, 2010; Brovarets' et al., 2015). A scaling factor that is equal to 0.9668 has been applied in the present work for the correction of the harmonic frequencies of all complexes and TSs of their tautomeric transitions (Palafox, 2014; Brovarets' and Hovorun, 2015g; Brovarets' et al., 2015; El-Sayed et al., 2015). We have confirmed the local minima and TSs, localized by a synchronous transit-guided quasi-Newton method (Peng et al., 1996), on the potential energy landscape by the absence or presence, respectively, of the imaginary frequency in the vibrational spectra of the complexes. We applied standard TS theory for the estimation of the activation barriers of the tautomerization reaction (Atkins, 1998).
All calculations have been carried in the continuum with ε = 1, that adequately reflects the processes occurring in real biological systems without deprivation of the structurally functional properties of the bases in the composition of DNA/RNA and satisfactorily models the substantially hydrophobic recognition pocket of the DNA-polymerase machinery as a part of the replisome (Bayley, 1951; Dewar and Storch, 1985; Petrushka et al., 1986; García-Moreno et al., 1997; Mertz and Krishtalik, 2000; Brovarets' and Hovorun, 2014a,b).
Single Point Energy Calculations
We continued geometry optimizations with electronic energy calculations as single point calculations at the MP2/6-311++G(2df,pd) level of theory (Frisch et al., 1990; Kendall et al., 1992).
The Gibbs free energy G for all structures was obtained in the following way:
| (1) |
where Eel–electronic energy, while Ecorr–thermal correction.
Evaluation of the Interaction Energies
Electronic interaction energies ΔEint have been calculated at the MP2/6-311++G(2df,pd) level of theory as the difference between the total energy of the base pair and energies of the monomers, which have been corrected for the basis set superposition error (BSSE) (Boys and Bernardi, 1970; Gutowski et al., 1986) through the counterpoise procedure (Sordo et al., 1988; Sordo, 2001).
QTAIM Analysis
Bader's quantum theory of atoms in molecules (QTAIM) (Bader, 1990; Matta and Hernández-Trujillo, 2003; Matta et al., 2006; Cukrowski and Matta, 2010; Keith, 2010; Matta, 2014; Lecomte et al., 2015), using the program package AIMAll (Keith, 2010), was applied to analyze the electron density distribution. The presence of the bond critical point (BCP), namely the so-called (3,-1) BCP, and a bond path between the hydrogen donor and acceptor, as well as the positive value of the Laplacian at this BCP (Δρ > 0), were considered as criteria for the H-bond formation (Bader, 1990; Matta and Hernández-Trujillo, 2003; Matta et al., 2006; Cukrowski and Matta, 2010; Matta, 2014; Lecomte et al., 2015). Wave functions were obtained at the level of QM theory used for geometry optimization.
The atomic numbering scheme used for the nucleobases is conventional (Saenger, 1984). In this study mutagenic or rare tautomeric forms of the bases (Brovarets' and Hovorun, 2009, 2014a, 2015a,b,c,d,e,f; Brovarets' and Hovorun, 2018; Brovarets' et al., 2018a,b,c,d,e,f; Brovarets' et al., 2019a,b) are denoted by the asterisk.
Obtained Results and Their Discussion
So, based on the obtained data, let us firstly formulate the basic results, which have been obtained for the first time and which have the closest connection to the structural biology and molecular biophysics (Figures 1, 2, Table 1).
Figure 1.
Investigated pathways of the tautomeric wobblization of the biologically-important G·C nucleobase pairs – G*·C*(rWC), G*·C*(H) and G*·C*(rH) pairs obtained at the MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of QM theory. ΔG-relative Gibbs free energy and ΔE-electronic energy (in kcal·mol−1; MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of QM theory); ΔEint-electronic and ΔGint-Gibbs free energies of the interaction (MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of QM theory, in kcal·mol−1). νi–imaginary frequency. Intermolecular AH…B H-bonds are designated by dotted lines, their lengths H…B are presented in angstroms.
Figure 2.
Total geometries of the investigated G·C nucleobase pairs, corresponding to the local minima, presented altogether with their relative Gibbs free energies ΔG and electronic energies (ΔE in kcal·mol−1 under normal conditions) obtained at the MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of QM theory.
Table 1.
Energetic characteristics of the tautomers of the G·C nucleobase pairs obtained at the MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of QM theory in vacuum (ε = 1) (see Figure 2).
| N | G·C complex | ΔGa | ΔEb |
|---|---|---|---|
| rWC/rwWC | |||
| 1 | G*·C*(rWC) | 0.00 | 0.00 |
| 2 | G·C*(rwWC) | 1.39 | 2.29 |
| 3 | G*·C(rwWC) | 1.40 | 2.33 |
| 4 | G·C*O2(rWC) | 3.51 | 3.80 |
| 5 | G+·C−(rWC) | 4.44 | 5.68 |
| 6 | G*t·C*O2(rwWC) | 17.16 | 18.65 |
| 7 | G*t·(rwWC) | 17.26 | 18.61 |
| 8 | G*·C*O2(rwWC) | 17.31 | 20.03 |
| 9 | G*·(rwWC) | 18.09 | 19.76 |
| 10 | ·C*(rwWC) | 24.36 | 25.54 |
| 11 | G*N2·C*t(rwWC) | 24.54 | 25.73 |
| 12 | ·C*t(rwWC) | 23.56 | 24.76 |
| 13 | ·C(rWC) | 24.74 | 24.58 |
| 14 | G*N2·C*(rwWC) | 25.70 | 26.96 |
| H/wH | |||
| 15 | G*t·C*(H) | 0.00 | 0.00 |
| 16 | G*N7·C(H) | 3.20 | 3.07 |
| 17 | G*·C(wH) | 3.25 | 5.07 |
| 18 | G*t·C(wH) | 5.16 | 7.41 |
| 19 | G*N7·C*t(wH) | 12.04 | 12.25 |
| 20 | G*t·(wH) | 13.70 | 14.37 |
| 21 | G*N7·C*(wH) | 16.43 | 17.33 |
| 22 | G*t·C*O2(wH) | 17.07 | 17.80 |
| 23 | G*N7·C*(wH) | 30.33 | 30.90 |
| 24 | C*(wH) | 34.30 | 35.29 |
| rHrwH | |||
| 25 | G*t·C*(rH) | 0.00 | 0.00 |
| 26 | G*t·C(rwH) | 0.26 | 0.88 |
| 27 | G*N7·C*(rwH) | 10.81 | 11.72 |
| 28 | G*t·C*O2(rwH) | 18.29 | 19.49 |
| 29 | C*(rwH) | 34.06 | 34.95 |
Relative Gibbs free energy (T=298.15 K), kcal·mol−1.
Relative electronic energy, kcal·mol−1.
Before providing the discussion of the investigated material, let us firstly give attention to the novel mechanisms of the G*·C*(rWC) tautomerization, which complement the results of the previous work (Brovarets' et al., 2019a).
So, in the G*·C*(rWC) base pair, the non-usual DPT-tautomerization was fixed by the participation of the protons at the N3(C) and N2(G) atoms (Figure 1, part I): G*·C*(rWC)↔C(rWC). This process is unusual, since the transfer of the proton from the C* to the G* base along the intermolecular (C)N3H.N1(G) H-bond provokes the rotation of the amino group of the G base into the trans-position relative to the C2=N3 double bond. As a result, a significantly non-planar of the tautomerization reaction is formed, which proceeds through the asynchronous mechanism and the significantly non-planar product of the tautomerization—the C(rWC) base pair, which is stabilized by the three intermolecular H-bonds (G)O6H…O2(C), (G)N1H…N3(C), and (C)N4H…N2(G). Its characteristic structural specificity has significant non-planarity and out-of-plane deformation of the purine ring of the O6H, N1H, and N2H atomic groups with trans-orientation relatively to the neighboring C2N3 bond.
Further, it was found out that Löwdin's G*·C*(WC) DNA base pair, which is formed from the classic G·C(WC) DNA base pair through the DPT and is stabilized by the participation of the three intermolecular (G)O6H…N4(C), (C)N3H…N1(G), and (G)N2H…O2(C) H-bonds (Brovarets' and Hovorun, 2014a), does not tautomerize in the wobble manner.
In this case all localized transition states of tautomerization in this manner and its pathways are the same as in the case of the wobble mutagenic tautomerization of the G·C(WC) DNA base pair, which has been investigated and described earlier (Brovarets' and Hovorun, 2015a). In other words, in order to tautomerize in the wobble-manner, the Löwdin's G*·C*(WC) DNA base pair should revert back to the classic G·C(WC) configuration (form) (Brovarets' and Hovorun, 2015a, 2018).
This bright fact allows us to claim that the functional role of the tautomeric G·C(WC) → G*·C*(WC) transition consists in the removal of the steric obstacles for the conformational G·C(WC) → G*·C*(rWC) transition (Brovarets' et al., 2019a) and is not directly related to the origin of the spontaneous point mutations—transitions and transversions, as it was suggested earlier (please, refer to work (Brovarets' and Hovorun, 2014a) and references provided therein for more details).
This aforementioned conformational transition, in its turn, guarantees the integration of the G·C(WC) nucleobase pair into the DNA/RNA with parallel strands.
3. Opposite to the previously considered methods both the so-called correct and incorrect DNA base pairs (Brovarets' and Hovorun, 2009, 2015a,b,c,d,e,f; Brovarets' and Hovorun, 2018), the process of the tautomeric wobblization in the investigated G*·C*(rWC) (Figure 1, parts II and III), and G*t·C*(H) (Figure 1, part X) base pairs is preceded by the process of the tautomerization via the single proton transfer (SPT). At this point, both processes of the wobblization of the G*·C*(rWC) base pair occur through the joint intermediate—tight G+·C− ion pair, which is stabilized by the participation of the three (G)O6+H…O2−(C), (G)N1+H…N3−(C), and (G)N2+H…N4−H(C) H-bonds (Brovarets' and Hovorun, 2014a). This dynamically non-stable intermediate is associated with the local minimum on potential (electronic) energy surface (PES). This situation is observed for the first time. Up until now the commonly accepted idea, that mutagenic tautomerization of the classic DNA base pairs is assisted by the intermediate corresponding to the local minimum on the PES, has not been confirmed.
The first process of the tautomeric wobblization of the G*·C*(rWC) base pair (Figure 1, part II)—G*·C*(rWC)↔G·(rWC)↔G·C(rwWC)—is most likely tightly connected with the incorporation of the G·C(WC) base pair into the DNA/RNA with parallel strands (Watson and Crick, 1953b).
Another tautomerization process (Figure 1, part III)—G*·C*(rWC)↔, which proceeds through the unique path with the (G)N1-H-O2(C) covalent bridge, is most probably concerned with the mechanisms of maintaining the RNA spatial architecture due to the incorporation of the non-stable (in the main tautomeric state) Levitt base pair (Crick and Watson, 1954; Levitt, 1969). This suggestion is based on the established structural mechanism of the tautomeric interconversion of the G*·C*(rWC) pair into the eight stable planar tautomeric forms of the Levitt base pair (Watson and Crick, 1953a) (Figure 1, parts IV-VI)—, , , , , , , and (Figure 2, Table 1)—and in principle, allows us to understand the dynamic of the formation of the Levitt base pair, which has not been considered before in the literature. It would be interesting to investigate how the tautomers of the Levitt base pair is stabilized in RNA by the H-bonds and surrounding environment further in the future (Oliva et al., 2007).
4. A quite interesting situation is observed for the tautomeric wobblization of the G*t·C*(H) base pair (Figure 1, part VII): G*t·C*(H)↔C(C)↔G*t·(wH)↔C*(wH).
The transition of the tautomer of the cytosine (C) within the (wH) base pairs with cis-orientation of the N4H C-imino group into the trans-orientation through its inversion leads to the decreasing of the energy in the tautomerization (Figure 1, part VIII): G*t· (wH)↔G*t·. This decreasing of energy occurs when the affinity of the tautomer according to the “complementary” G*t tautomer is higher than the tautomer. This decreasing of the energy with excess overrides the increasing of the internal energy of the tautomer at its tautomerization .
In the another pathway of the tautomeric wobblization of the G*t·C*(H) base pair (Figure 1, part VII, VIII) the decreasing of energy in the course of the process is achieved by the conformational transition of the G*t tautomer within the complex into the low-energy mutagenic tautomeric form , which is zwitterion.
5. At this, the G*t·C(wH)↔C*(wH) DPT tautomerization process does not really occur, since its barrier is negative under normal conditions (Figure 1, part IX): G*t·C(wH)↔G*·C(wH)↔C*(wH)↔C*(wH). The same situation is also observed for the G*t·C*(H)↔C(H)↔G*t·C(wH)↔C*(wH) DPT tautomerization (Figure 1, part X).
6. Tautomeric wobblization of the G*t·C*(rH) base pair (Figure 1, part XI) occurs through the two traditional pathways without any preparatory SPT stages through the TSs, which represent themselves as the covalently bonded tight G+·C− ion pairs in reverse Hoogsteen conformation, which are only supported by two H-bonds: G*t·C*(rH)↔G*t·(rwH)↔C*(rwH) and G*t·C*(rH)↔G*t·C*(rwH)↔C*(rwH). The transition of the G*t tautomer within the G*t·(rwH) complex into the G* mutagenic tautomer through the orthogonal TS decreases the energy of the further process of tautomerization.
7. Also, in addition to the previously revealed processes, DPT tautomerization was also fixed by the participation of the proton at the C8 carbon atom of G, which lead to the dynamically-stable, but short-lived, complexes by the participation of the yilidic forms of the G base (Figure 1, parts IX-XI).
8. Finally, there are three more fixed mysteries, which deserve more attention. Several G·C base pairs, in which both bases were in the rare tautomeric form and their energy of stabilization significantly exceeded the analogical values for the classic G·C(WC) DNA base pair were fixed.
Despite the structural softness of the heterocycles of the G and C bases for the out-of-plane deformational bending (Hovorun et al., 1999), it was not revealed that there was any deviation from the plane in the investigated processes of the tautomerization of the base pairs.
Obtained data convincingly show that among all possible tautomeric wobblizations of the G*·C*(rWC), G*t·C*(H), and G*t·C*(rH) DNA base pairs, which possess Watson-Crick, Hoogsteen, and reverse Hoogsteen configurations and both monomers of which are in the rare tautomeric form, at least one non-dissociative transition was absent, which would recover the tautomeric status of both the G*/G*t and C* bases to the canonical G and C bases, correspondingly. This fact altogether with the results, obtained in our previous work (Brovarets' et al., 2019a), soundly exhibits why the Watson-Crick DNA base pairs were chosen for the building of genetic material (Brovarets' et al., 2018a).
Conclusion
Concluding the obtained results, we arrived to a summation after providing an investigation of the tautomeric wobblization of the biologically-important G·C(WC), G*·C*(WC), G*·C*(rWC), G*t·C*(H), and G*t·C*(rH) nucleobase pairs and extended the existing thoughts about the microstructural mechanisms of these processes, as well as about their functional roles. Thus, it was established that the G·C base pair is the most likely to be incorporated into the DNA/RNA double helix with parallel strands in the form of the G*·(rWC), G·C*(rwWC), and G*·C(rwWC) tautomers, which are in rapid tautomeric equilibrium with each other.
For the first time we have formulated rules, defining these biologically-important processes.
Data Availability Statement
All datasets generated for this study are included in the article/supplementary material.
Author Contributions
OB: idea formulation, setting of the task, calculation of the data, building of the graphs, data extrapolation, preparing, and proofreading of the draft of the manuscript. AM: idea formulation, calculation of the data, building of the graphs, preparing, and proofreading of the draft of the manuscript. DH: idea formulation, preparing, and proofreading of the draft of the manuscript. All authors contributed to the article and approved the submitted version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Authors sincerely grateful for computational facilities of joint computer cluster of SSI Institute for Single Crystals of the National Academy of Sciences of Ukraine (NASU) and Institute for Scintillation Materials of the NASU incorporated into Ukrainian National Grid.
References
- Alvey H. S., Gottardo F. L., Nikolova E. N., Al-Hashimi H. M. (2014). Widespread transient Hoogsteen base-pairs in canonical duplex DNA with variable energetics. Nat. Comm. 5, 4786–4794. 10.1038/ncomms5786 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Atkins P. W. (1998). Physical Chemistry. Oxford: Oxford University Press. [Google Scholar]
- Bader R. F. W. (1990). Atoms in Molecules: A Quantum Theory. Oxford: Oxford University Press. [Google Scholar]
- Bayley S. T. (1951). The dielectric properties of various solid crystalline proteins, amino acids and peptides. Trans. Faraday Soc. 47, 509–517. 10.1039/tf9514700509 [DOI] [Google Scholar]
- Boys S. F., Bernardi F. (1970). The calculation of small molecular interactions by the differences of separate total energies. Some procedures with reduced errors. Mol. Phys. 19, 553–566. 10.1080/00268977000101561 [DOI] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2009). Physicochemical mechanism of the wobble DNA base pairs Gua·Thy and Ade·Cyt transition into the mismatched base pairs Gua*·Thy and Ade·Cyt* formed by the mutagenic tautomers. Ukr. Bioorg. Acta 8, 12–18. [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2010a). Quantum-chemical investigation of tautomerization ways of Watson-Crick DNA base pair guanine-cytosine. Ukr. Biochem. J. 82, 55–60. [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2010b). Quantum-chemical investigation of the elementary molecular mechanisms of pyrimidine·purine transversions. Ukr. Biochem. J. 82, 57–67. [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2014a). Why the tautomerization of the G·C Watson–Crick base pair via the DPT does not cause point mutations during DNA replication? QM and QTAIM comprehensive analysis. J. Biomol. Struct. Dynam. 32, 1474–1499. 10.1080/07391102.2013.822829 [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2014b). Can tautomerisation of the A·T Watson-Crick base pair via double proton transfer provoke point mutations during DNA replication? A comprehensive QM and QTAIM analysis. J. Biomol. Struct. Dynam. 32, 127–154. 10.1080/07391102.2012.755795 [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015a). New structural hypostases of the A·T and G·C Watson-Crick DNA base pairs caused by their mutagenic tautomerisation in a wobble manner: a QM/QTAIM prediction. RSC Adv. 5, 99594–99605. 10.1039/C5RA19971A [DOI] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015b). Tautomeric transition between wobble A·C DNA base mispair and Watson-Crick-like A·C* mismatch: microstructural mechanism and biological significance. Phys. Chem. Chem. Phys. 17, 15103–15110. 10.1039/C5CP01568E [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015c). How many tautomerisation pathways connect Watson-Crick-like G*·T DNA base mispair and wobble mismatches? J. Biomol. Struct. Dynam. 33, 2297–2315. 10.1080/07391102.2015.1046936 [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015d). Wobble↔Watson-Crick tautomeric transitions in the homo-purine DNA mismatches: a key to the intimate mechanisms of the spontaneous transversions. J. Biomol. Struct. Dynam. 33, 2710–2715. 10.1080/07391102.2015.1077737 [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015e). Novel physico-chemical mechanism of the mutagenic tautomerisation of the Watson–Crick-like A·G and C·T DNA base mispairs: a quantum-chemical picture. RSC Adv. 5, 66318–66333. 10.1039/C5RA11773A [DOI] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015f). A novel conception for spontaneous transversions caused by homo-pyrimidine DNA mismatches: a QM/QTAIM highlight. Phys. Chem. Chem. Phys. 17, 21381–21388. 10.1039/C5CP03211C [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2015g). The nature of the transition mismatches with Watson-Crick architecture: the G*·T or G·T* DNA base mispair or both? A QM/QTAIM perspective for the biological problem. J. Biomol. Struct. Dynam. 33, 925–945. 10.1080/07391102.2014.924879 [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Hovorun D. M. (2018). “Renaissance of the tautomeric hypothesis of the spontaneous point mutations in DNA: new ideas and computational approaches,” in Mitochondrial DNA—New Insights, ed Herve Seligmann (London: IntechOpen; ), 31–55. 10.5772/intechopen.77366 [DOI] [Google Scholar]
- Brovarets' O. O., Oliynyk T. A., Hovorun D. M. (2019a). Novel tautomerisation mechanisms of the biologically important conformers of the reverse Löwdin, Hoogsteen, and reverse Hoogsteen G*·C* DNA base pairs via proton transfer: a quantum-mechanical survey. Front. Chem. 7:597. 10.3389/fchem.2019.00597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Oliynyk T. A., Hovorun D. M. (2019b). “Novel conformers of the G·C DNA base pair and their mutual interconversions via the proton transfer: a quantum-mechanical study” in Joint 12th EBSA Congress and 10th ICBP-IUPAP Congress, Vol. 48 (Madrid: ), S90. 10.1007/s00249-019-01373-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Dinets A., Hovorun D. M. (2018c). Unexpected routes of the mutagenic tautomerization of the T nucleobase in the classical A·T DNA base pairs: A QM/QTAIM comprehensive view. Front. Chem. 6:532. 10.3389/fchem.2018.00532 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Hovorun D. M. (2018a). The A·T(rWC)/A·T(H)/A·T(rH) ↔ A·T*(rwWC)/A·T*(wH)/A·T*(rwH) mutagenic tautomerization via sequential proton transfer: a QM/QTAIM study. RSC Adv. 8, 13433–13445. 10.1039/C8RA01446A [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Hovorun D. M. (2018b). Unexpected A·T(WC)↔A·T(rWC)/A·T(rH) and A·T(H)↔A·T(rH)/A·T(rWC) conformational transitions between the classical A·T DNA base pairs: A QM/QTAIM comprehensive study. Int. J. Quantum. Chem. 118:e25674 10.1002/qua.25692 [DOI] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Hovorun D. M. (2018d). Non-dissociative structural transitions of the Watson-Crick and reverse Watson-Crick A·T DNA base pairs into the Hoogsteen and reverse Hoogsteen forms. Sci. Repts. 8:10371. 10.1038/s41598-018-28636-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Hovorun D. M. (2018e). Novel pathway for mutagenic tautomerization of classical A·T DNA base pairs via sequential proton transfer through quasi-orthogonal transition states: a QM/QTAIM investigation. PLoS ONE 13:e0199044. 10.1371/journal.pone.0199044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Tsiupa K. S., Hovorun D. M. (2018f). Surprising conformers of the biologically important A·T DNA base pairs: QM/QTAIM proofs. Front. Chem. 6:8. 10.3389/fchem.2018.00008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brovarets' O. O., Zhurakivsky R. O., Hovorun D. M. (2014). Does the tautomeric status of the adenine bases change upon the dissociation of the A*·Asyn Topal-Fresco DNA mismatch? A combined QM and QTAIM atomistic insight. Phys. Chem. Chem. Phys. 16, 3715–3725. 10.1039/c3cp54708f [DOI] [PubMed] [Google Scholar]
- Brovarets' O. O., Zhurakivsky R. O., Hovorun D. M. (2015). DPT tautomerisation of the wobble guanine·thymine DNA base mispair is not mutagenic: QM and QTAIM arguments. J. Biomol. Struct. Dynam. 33, 674–689. 10.1080/07391102.2014.897259 [DOI] [PubMed] [Google Scholar]
- Crick F. H. C., Watson J. D. (1954). The complementary structure of deoxyribonucleic acid. Proc. Roy. Soc. A 223, 80–96. 10.1098/rspa.1954.0101 [DOI] [Google Scholar]
- Cukrowski I., Matta C. F. (2010). Hydrogen–hydrogen bonding: a stabilizing interaction in strained chelating rings of metal complexes in aqueous phase. Chem. Phys. Lett. 499, 66–69. 10.1016/j.cplett.2010.09.013 [DOI] [Google Scholar]
- Dewar M. J. S., Storch D. M. (1985). Alternative view of enzyme reactions. Proc. Natl. Acad. Sci. U. S. A. 82, 2225–2229. 10.1073/pnas.82.8.2225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- El-Sayed A. A., Tamara Molina A., Alvarez-Ros M. C., Alcolea Palafox M. (2015). Conformational analysis of the anti-HIV Nikavir prodrug: comparisons with AZT and thymidine, and establishment of structure-activity relationships/tendencies in other 6'-derivatives. J. Biomol. Struct. Dynam. 33, 723–748. 10.1080/07391102.2014.909743 [DOI] [PubMed] [Google Scholar]
- Florian J., Hrouda V., Hobza P. (1994). Proton transfer in the adenine-thymine base pair. J. Am. Chem. Soc. 116,1457–1460. 10.1021/ja00083a03422414119 [DOI] [Google Scholar]
- Frisch M. J., Head-Gordon M., Pople J. A. (1990). Semi-direct algorithms for the MP2 energy and gradient. Chem. Phys. Lett. 166, 281–289. 10.1016/0009-2614(90)80030-H [DOI] [Google Scholar]
- Frisch M. J., Trucks G. W., Schlegel H. B., Scuseria G. E., Robb M. A., Cheeseman J. R., et al. (2010). GAUSSIAN 09 (Revision B.01). Wallingford CT: Gaussian Inc. [Google Scholar]
- García-Moreno B. E., Dwyer J. J., Gittis A. G., Lattman E. E., Spencer D. S., Stites W. E. (1997). Experimental measurement of the effective dielectric in the hydrophobic core of a protein. Biophys. Chem. 64, 211–224. 10.1016/S0301-4622(96)02238-7 [DOI] [PubMed] [Google Scholar]
- Godbeer A. D., Al-Khalili J. S., Stevenson P. D. (2015). Modelling proton tunnelling in the adenine-thymine base pair. Phys. Chem. Chem. Phys. 7, 13034–13044. 10.1039/C5CP00472A [DOI] [PubMed] [Google Scholar]
- Gorb L., Podolyan Y., Dziekonski P., Sokalski W. A., Leszczynski J. (2004). Double-proton transfer in adenine–thymine and guanine–cytosine base pairs. A post Hartree–Fock ab initio study. J. Am. Chem. Soc. 126, 10119–10129. 10.1021/ja049155n [DOI] [PubMed] [Google Scholar]
- Gutowski M., Van Lenthe J. H., Verbeek J., Van Duijneveldt F. B., Chalasinski G. (1986). The basis set superposition error in correlated electronic structure calculations. Chem. Phys. Lett. 124, 370–375. 10.1016/0009-2614(86)85036-9 [DOI] [Google Scholar]
- Hariharan P. C., Pople J. A. (1973). The influence of polarization functions on molecular orbital hydrogenation energies. Theor. Chim. Acta 28, 213–222. 10.1007/BF00533485 [DOI] [Google Scholar]
- Hovorun D. M., Gorb L., Leszczynski J. (1999). From the nonplanarity of the amino group to the structural nonrigidity of the molecule: a post-Hartree-Fock ab initio study of 2-aminoimidazole. Int. J. Quantum. Chem. 75, 245–253. [DOI] [Google Scholar]
- Keith T. A. (2010). AIMAll (Version 10.07.01). Available online at: aim.tkgristmill.com (accessed October 23, 2020).
- Kendall R. A., Dunning Jr T. H., Harrison R.J. (1992). Electron affinities of the first-row atoms revisited. Systematic basis sets and wave functions. J. Chem. Phys. 96, 6796–6806. 10.1063/1.462569 [DOI] [Google Scholar]
- Krishnan R., Binkley J. S., Seeger R., Pople J. A. (1980). Self-consistent molecular orbital methods. XX. A basis set for correlated wave functions. J. Chem. Phys. 72, 650–654. 10.1063/1.438955 [DOI] [Google Scholar]
- Lecomte C., Espinosa E., Matta C. F. (2015). On atom–atom “short contact” bonding interactions in crystals. IUCrJ 2, 161–163. 10.1107/S2052252515002067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee C., Yang W., Parr R. G. (1988). Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B. 37, 785–789. 10.1103/PhysRevB.37.785 [DOI] [PubMed] [Google Scholar]
- Levitt M. (1969). Detailed molecular model for transfer ribonucleic acid. Nature 224, 759–763. 10.1038/224759a0 [DOI] [PubMed] [Google Scholar]
- Löwdin P.-O. (1963). Proton tunneling in DNA and its biological implications. Rev. Mod. Phys. 35, 724–732. 10.1103/RevModPhys.35.724 [DOI] [Google Scholar]
- Löwdin P.-O. (1966). “Quantum genetics and the aperiodic solid: some aspects on the biological problems of heredity, mutations, aging, and tumors in view of the quantum theory of the DNA molecule,” in Advances in Quantum Chemistry, ed Löwdin P.-O. (New York, NY; London: Academic Press; ), 213–360. 10.1016/S0065-3276(08)60076-3 [DOI] [Google Scholar]
- Matta C. F. (2010). How dependent are molecular and atomic properties on the electronic structure method? Comparison of Hartree-Fock, DFT, and MP2 on a biologically relevant set of molecules. J. Comput. Chem. 31, 1297–1311. 10.1002/jcc.21417 [DOI] [PubMed] [Google Scholar]
- Matta C. F. (2014). Modeling biophysical and biological properties from the characteristics of the molecular electron density, electron localization and delocalization matrices, and the electrostatic potential. J. Comput. Chem. 35, 1165–1198. 10.1002/jcc.23608 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matta C. F., Castillo N., Boyd R. J. (2006). Atomic contributions to bond dissociation energies in aliphatic hydrocarbons. J. Chem. Phys. 125:204103. 10.1063/1.2378720 [DOI] [PubMed] [Google Scholar]
- Matta C. F., Hernández-Trujillo J. (2003). Bonding in polycyclic aromatic hydrocarbons in terms of the electron density and of electron delocalization. J. Phys. Chem. A 107, 7496–7504. 10.1021/jp034952d [DOI] [Google Scholar]
- Mertz E. L., Krishtalik L. I. (2000). Low dielectric response in enzyme active site. Proc. Natl. Acad. Sci. U. S. A. 97, 2081–2086. 10.1073/pnas.050316997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nikolova E. N., Zhou H., Gottardo F. L., Alvey H. S., Kimsey I. J., Al-Hashimi H. M. (2014). A historical account of Hoogsteen base pairs in duplex DNA. Biopolymers 99, 955–968. 10.1002/bip.22334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oliva R., Tramontano A., Cavallo L. (2007). Mg2+ binding and archaeosine modification stabilize the G15–C48 Levitt base pair in tRNAs. RNA 13, 1427–1436. 10.1261/rna.574407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Palafox M. A. (2014). Molecular structure differences between the antiviral nucleoside analogue 5-iodo-2‘-deoxyuridine and the natural nucleoside 2‘-deoxythymidine using MP2 and DFT methods: conformational analysis, crystal simulations, DNA pairs and possible behavior. J. Biomol. Struct. Dynam. 32, 831–851. 10.1080/07391102.2013.789402 [DOI] [PubMed] [Google Scholar]
- Parr R. G., Yang W. (1989). Density-Functional Theory of Atoms and Molecules. Oxford: Oxford University Press. [Google Scholar]
- Peng C., Ayala P. Y., Schlegel H. B., Frisch M. J. (1996). Using redundant internal coordinates to optimize equilibrium geometries and transition states. J. Comput. Chem. 17, 49–56. [DOI] [Google Scholar]
- Petrushka J., Sowers L. C., Goodman M. (1986). Comparison of nucleotide interactions in water, proteins, and vacuum: model for DNA polymerase fidelity. Proc. Natl. Acad. Sci. U. S. A. 83, 1559–1562. 10.1073/pnas.83.6.1559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poltev V. I., Anisimov V. M., Sanchez C., Deriabina A., Gonzalez E., Garcia D., et al. (2016). Analysis of the conformational features of Watson–Crick duplex fragments by molecular mechanics and quantum mechanics methods. Biophysics 61, 217–226. 10.1134/S0006350916020160 [DOI] [PubMed] [Google Scholar]
- Pous J., Urpi L., Subirana J. A., Gouyette C., Navaza J., Campos J. L. (2008). Stabilization by extra-helical thymines of a DNA duplex with Hoogsteen base pairs. J. Am. Chem. Soc. 130, 6755–6760. 10.1021/ja078022+ [DOI] [PubMed] [Google Scholar]
- Saenger W. (1984). Principles of Nucleic Acid Structure. New York, NY: Springer; 10.1007/978-1-4612-5190-3 [DOI] [Google Scholar]
- Sordo J. A. (2001). On the use of the Boys–Bernardi function counterpoise procedure to correct barrier heights for basis set superposition error. J. Mol. Struct. 537, 245–251. 10.1016/S0166-1280(00)00681-3 [DOI] [Google Scholar]
- Sordo J. A., Chin S., Sordo T. L. (1988). On the counterpoise correction for the basis set superposition error in large systems. Theor. Chim. Acta 74, 101–110. 10.1007/BF00528320 [DOI] [Google Scholar]
- Srivastava R. (2019). The role of proton transfer on mutations. Front Chem. 7:536. 10.3389/fchem.2019.00536 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szabat M., Kierzek R. (2017). Parallel-stranded DNA and RNA duplexes: structural features and potential applications. FEBS J. 284, 3986–3998. 10.1111/febs.14187 [DOI] [PubMed] [Google Scholar]
- Tirado-Rives J., Jorgensen W. L. (2008). Performance of B3LYP density functional methods for a large set of organic molecules. J. Chem. Theory Comput. 4, 297–306. 10.1021/ct700248k [DOI] [PubMed] [Google Scholar]
- Topal M. D., Fresco J. R. (1976). Complementary base pairing and the origin of substitution mutations. Nature 263, 285–289. 10.1038/263285a0 [DOI] [PubMed] [Google Scholar]
- Turaeva N., Brown-Kennerly V. (2015). Marcus model of spontaneous point mutation in DNA. Chem. Phys. 461, 106–110. 10.1016/j.chemphys.2015.09.005 [DOI] [Google Scholar]
- Watson J. D., Crick F. H. C. (1953a). Molecular structure of nucleic acids: a structure for deoxyribose nucleic acid. Nature 171, 737–738. 10.1038/171737a0 [DOI] [PubMed] [Google Scholar]
- Watson J. D., Crick F. H. C. (1953b). The structure of DNA. Cold. Spring Harb. Symp. Quant. Biol. 18, 123–131. 10.1101/SQB.1953.018.01.020 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
All datasets generated for this study are included in the article/supplementary material.


