Abstract
Phosphodiester groups occur ubiquitously in nature, e.g. in nucleic acids or in cyclic (di‐)nucleotides important for signal transduction. Proteins often use polar or positively charged amino acids to interact with the negatively charged phosphodiester groups via hydrogen bonds and salt bridges. In contrast, the acidic amino acids aspartate and glutamate are generally not considered as important determinants for phosphodiester group recognition. Instead, they are regarded as detrimental to such interactions due to the assumed charge repulsion between their deprotonated, negatively charged side chain carboxylate groups and the phosphodiester. Accordingly, acidic amino acids are often purposefully introduced into proteins to abrogate nucleic acid interactions in functional studies. Here, we show that in appropriate structural contexts, glutamate side chains are readily protonated even at neutral pH and act as hydrogen bond donors to phosphodiester groups using a c‐di‐GMP binding protein – the GSPII‐B domain of PilF from Thermus thermophilus – as an example. Surveying available RNA‐protein and DNA‐protein complex structures in the PDB, we found that hydrogen bonds between apparently protonated carboxylate groups of glutamate and aspartate and phosphodiester groups occur frequently in many different functional protein classes. Thus, the functional role of acidic amino acids in phosphodiester group recognition needs to be re‐evaluated.
Keywords: Aspartate, Glutamate, Hydrogen bonding, Phosphodiester group, Protein nucleic acids interactions
The highly negatively charged phosphodiester groups of nucleic acids and cyclic (di‐)nucleotides often interact with amino acids containing polar or positively charged sidechains. Due to charge repulsion with their negatively charged carboxylate groups in their side chains, glutamate and aspartate are considered to be detrimental to such interactions. Here, we show that glutamate residues with pKa values shifted towards neutral pH due to their chemical environment are easily protonated and thus able to form strong hydrogen bonds with phosphodiester groups.

Introduction
The individual nucleotide building blocks in all DNA and RNA molecules are covalently linked by phosphodiester groups. Phosphodiester groups also occur in cyclic nucleotides and dinucleotides, which serve as important second messengers in cellular signal transduction pathways across all kingdoms of life.
The pKa values for the phosphodiester groups in cyclic mono‐ and dinucleotides as well as in DNA and RNA are reported to be ∼ 1.[ 1 ] Under physiological pH conditions, these groups are generally deprotonated and negatively charged with the charge delocalized across the two non‐bonding oxygen atoms of the phosphodiester group. Therefore, phosphodiester groups are in principle strong hydrogen bond acceptor groups and simultaneously able to form salt bridges with positively charged functional groups. Particularly strong, charge‐assisted hydrogen bonds can be formed with functional groups that are both positively charged and hydrogen bond donors.
Proteins that bind to phosphodiester groups in DNA, RNA, or cyclic (di‐)nucleotides to regulate gene expression or transmit cellular signals often rely on the polar side chains of amino acids such as serine, threonine, glutamine, asparagine, or tyrosine to act as hydrogen bond donors, or on the positively charged side chains of the basic amino acids arginine and lysine to simultaneously form salt bridges and charge‐assisted hydrogen bonds with their targets.[ 2 , 3 , 4 , 5 , 6 , 7 , 8 ]
In DNA‐protein complexes, such interactions contribute mainly to the binding affinity due to the regular spacing of the phosphodiester groups along the backbone of the DNA double helix,[ 9 ] while in RNA‐protein complexes, such interactions are often important for the shape‐specific and charge‐distribution‐specific recognition of highly structured RNAs.[ 10 , 11 ] Notably, a common approach for testing the functional importance of phosphodiester interactions is to mutate the involved amino acids into aspartate or glutamate. This is based on the reported pKa values for their side chain carboxylate groups of ∼ 4.4 and 3.9 for glutamate and aspartate, respectively,[ 12 , 13 , 14 ] and the resulting almost universally accepted assumption that the introduction of these formally negatively charged amino acid side chains will result in charge repulsion and thereby lower the binding affinity or prevent binding altogether.
Here, we use the example of a cyclic dinucleotide binding protein to show that, contrary to common expectations, the carboxylate groups in the side chains of glutamate and aspartate residues can engage in productive hydrogen bonding interactions with phosphodiester groups. The analysis of previously reported structures of nucleic acid/protein complexes shows that such interactions are widespread, thereby presenting a hitherto underappreciated phosphodiester group recognition mode in biomacromolecular complexes.
The common bacterial second messenger c‐di‐GMP regulates a wide range of processes, including biofilm formation,[ 15 , 16 ] motility,[ 17 ] and virulence.[ 18 , 19 ] One target protein for c‐di‐GMP is PilF, a hexameric ATPase from the thermophilic bacterium Thermus thermophilus important for pilus formation, cell motility, and DNA uptake from the environment.[ 20 ] The N‐terminus of PilF contains three homologous so‐called general secretory pathway II (GSPII) domains, GSPII‐A, ‐B, and ‐C.[ 21 ] Our recent structures of GSPII‐B (PilF159‐302)[ 22 ] showed that the two nucleotide building blocks of c‐di‐GMP are recognized by the protein in a very similar manner through a duplicated amino acid sequence motif. Both copies of the motif fold into two tightly packed helices (Figure 1a). Together, the two motifs form a four‐helix bundle (Figure 1a) that binds c‐di‐GMP with very high affinity (KD ∼ 6 nM). Other GSPII domain containing proteins bind c‐di‐GMP in the same structurally conserved manner.[ 23 ]
Figure 1.

Phosphodiester group recognition in the GSPII‐B mutants Q190E and Q218E is pH‐dependent and relies on a protonated glutamate side chain. a) X‐ray structure of WT GSPII‐B (PilF159‐302) in complex with c‐di‐GMP (PDB entry 8pdk).[ 22 ] The protein is shown in ribbon representation and c‐di‐GMP is shown as sticks with carbon atoms in gray, nitrogen atoms in blue, phosphorous in orange and oxygen in red. The brackets highlight the location of the N‐terminal (NTD) and the C‐terminal (CTD) subdomain of the protein. The first copy of the c‐di‐GMP binding sequence motif is colored yellow, the second orange. b) Schematic representation of the intermolecular hydrogen bonds (dashed red lines) between the GSPII‐B domain and the phosphodiester groups of c‐di‐GMP in the WT protein. Important amino acid residues are labeled.[ 22 ] c) c‐di‐GMP binding affinities of the Q190E (left) and Q218E (right) mutants at pH values of 8.5 and 4.0 according to ITC measurements. d, e) Intermolecular interactions between the phosphodiester group of c‐di‐GMP and E190 in the mutated binding pocket of the Q190E single‐site mutant (d) and E218 in the mutated binding pocket of the Q218E protein variant (e) according to the X‐ray structures of the two complexes. The electron density at the 1.0 σ level is indicated as a gray mesh. Hydrogen bonds are indicated as black and red dashed lines. f) 2D‐Hγ(Cγ)Cδ NMR spectra of the PilF159‐221 WT/c‐di‐GMP complex (left) and the PilF159‐221 Q190E/c‐di‐GMP complex (right). The corresponding slices from a 3D‐H(CCO)NH spectrum for the Q190 and E190 side chains, respectively, for both variants in complex with c‐di‐GMP are shown on top. Black dashed lines indicate resonance positions belonging to the Hγ protons from the same methylene group. Gray dashed lines indicate the assignment of the Hγ/Cδ correlation signals via the Hγ/Cγ signals in the slices from the 3D‐H(CCO)NH spectra.
In the GSPII‐B complex with c‐di‐GMP, both c‐di‐GMP phosphodiester groups accept a hydrogen bond from a backbone amide group of a conserved leucine (L168 from the first motif and L197 from the second motif) and from a side chain amino group of a conserved glutamine (Q190 from the first motif and Q218 from the second motif) (Figure 1b). The side chain oxygen atoms of these two glutamines simultaneously serve as hydrogen bond acceptors in hydrogen bonds with the 2′‐OH groups of the adjacent ribose moieties (Figure 1b). Importantly, the intermolecular hydrogen bonds between the negatively charged non‐bridging oxygen atoms of the phosphodiester groups from c‐di‐GMP and polar backbone and side chain groups of GSPII‐B are similar to those commonly observed in protein‐nucleic acid complexes. The high affinity for the ligand, the nature of the bivalent binding motif, and the role of the two conserved glutamine residues make GSPII‐B a well suited model system to investigate the role of sidechain interactions in phosphodiester bond recognition.
Results and Discussion
We previously individually or simultaneously mutated Q190 and Q218 to glutamate residues (Q190E and Q218E, Figure 1b) in order to purposefully disrupt c‐di‐GMP binding to the GSPII‐B domain by creating electrostatic repulsion between the ligands phosphodiester groups and the carboxylate groups of the glutamate side chains. In vivo, these mutations resulted in reduced bacterial twitching motility and cell adhesion defects.[ 22 ] In vitro at pH 7.5, which corresponds to the intracellular pH of T. thermophilus, c‐di‐GMP did not bind to the double mutant, and showed significantly reduced affinity for the two Q190E and Q218E single‐site mutants as expected.[ 22 ]
However, we were intrigued by the relative orientation of the Q190 and Q218 side chains and the phosphodiester groups to each other in the WT complex (Figure 1b). It suggested that either the protonation of the newly introduced carboxylate group in the Q/E single‐site mutants or the protonation of the phosphodiester group could lead to an energetically favorable hydrogen bonding interaction and a pH‐sensitive protein‐ligand interaction instead of the commonly expected electrostatic repulsive interaction between these two functional groups.
To explore this possibility, we investigated the binding affinity of the two single‐site Q/E mutants (Q190E and Q218E, respectively) over a wider pH range (Figure 1c). While the c‐di‐GMP binding affinity of the WT protein is similar at pH 4.0 and 8.5 according to ITC titration experiments (Figure S1), both mutants display a drastic increase in ligand affinity at lower pH (Figure 1c). The affinity of the Q190E mutant for c‐di‐GMP increased ∼ 20fold with the KD changing from ∼ 800 nM at pH 8.5 to ∼ 40 nM at pH 4.5. Likewise, the KD of the Q218E mutant changed from ∼ 450 nM at pH 8.5 to ∼ 25 nM at pH 4.5. This strongly supports the notion that a protonation event, either within c‐di‐GMP or in the GSPII‐B Q/E single‐site variants, can modulate the ligand‐protein interaction, thereby stabilizing the complex at acidic pH values.
To gain insights into the structural basis for the pH‐dependent ligand recognition, we crystallized both the Q190E and the Q218E single‐site mutant of the PilF GSPII‐B domain in complex with c‐di‐GMP using very similar crystallization conditions at pH 7.0 (Figure S2). Importantly, both protein‐ligand complexes yielded well diffracting crystals and structures were solved at a resolution of 1.9 Å (Q190E, PDB entry 9GL5) and 1.6 Å (Q218E, PDB entry 9GLG), respectively (Table S1). There are no differences in the global structure when comparing the Q190E and Q218E mutants complexed with c‐di‐GMP with each other (global backbone heavy atom RMSD of 0.14 Å) or with the WT (0.15 Å and 0.16 Å, respectively) (Figure S2a, Table S2).
Due to the high‐quality electron density maps for both mutants, their ligand recognition mode could be discerned in detail (Figures 1d,e, S2b,c). In both the Q190E and the Q218E single point mutants, one phosphodiester‐binding pocket can be found in the mutant configuration (Figure 1d,e) and the other one in the WT configuration due to the duplication of the c‐di‐GMP phosphodiester binding motifs (Figure S2b‐e). The comparison of the “WT” binding pockets of the single mutants which is composed of Q218/L168 in the Q190E mutant and Q190/L197 in the Q218E mutant showed that they are structurally highly similar to the equivalent binding pockets in the WT protein, including the orientation of the respective sidechains (Figure S3).
Figure 2.

pKa values of the glutamate side chain carboxylate groups in the Q190E and Q218E single mutants in complex with c‐di‐GMP. a) Thermodynamic cycle describing the coupling of c‐di‐GMP binding and protonation of side chain glutamates in the two mutants. The equation describing the experimentally observed KD values as a function of the pKa of the free protein (pKaf) and the pKa of the c‐di‐GMP bound protein (pKab) is shown in the center. b) pH dependence of the KD for c‐di‐GMP for the Q190E mutant (blue) and the Q218E mutant (green). c) 2D‐Hγ(Cγ)Cδ NMR spectra of free PilF159‐221 WT (left) and free PilF159‐221 Q190E (right) with the corresponding slices from the respective 3D‐H(CCO)NH spectra shown on top. Black dashed lines indicate resonance positions belonging to the Hγ protons from the same methylene group. Gray dashed lines indicate the assignment of the Hγ/Cδ correlation signals via the Hγ/Cγ signals in the slices from the 3D‐H(CCO)NH spectra.
Importantly, the carboxylate groups of the newly introduced glutamate sidechains in either of the two Q/E mutants were found to still point directly towards a non‐bridging oxygen atom of the phosphodiester groups (Figure 1d, e), as seen previously for the carboxamide groups of the native glutamine side chains (Figures 1, S2d, e). The shortest distances between the oxygen atoms of the glutamate carboxylate groups and the phosphodiester groups were found to be 2.6 Å (Q190E) and 2.5 Å (Q218E), respectively, which is well below the sum of the van der Waals radii of the two oxygen atoms. For negatively charged carboxylate and phosphodiester groups, this would represent an energetically prohibitive situation due to a strong repulsion of like charges. However, protonation of one of the two groups can alleviate this energetically unfavorable situation by neutralizing one of the charges. In addition, the protonated group could also serve as a hydrogen bond donor while the unprotonated group would be the hydrogen bond acceptor.
To observe such a potential protonated state directly by X‐ray crystallography, a resolution of <1.0 Å would be required.[ 24 ] Unfortunately, we were not able to optimize our crystallization conditions to this extent. However, NMR‐spectroscopy is a powerful tool to analyze the protonation state of functional groups.[ 25 , 26 ] In the case at hand, the 31P shifts for the phosphodiester groups of c‐di‐GMP and the 13C shifts of the Cδ carbon atoms of the glutamate carboxylate groups are sensitive reporters of their respective protonation states. We first tested the effect of pH on the chemical shift of the 31P signals of c‐di‐GMP. Free c‐di‐GMP gives rise to a single 31P NMR signal due to its inherent symmetry. Lowering the pH leads to an upfield shift of this signal as an indicator of the (at least partial) protonation of the phosphodiester groups (Figure S4a). In contrast, comparing the 31P chemical shifts for c‐di‐GMP bound to the WT protein and the two E/Q mutants reveals a downfield shift for the 31P signal of the phosphodiester group close to the glutamate side chain in the mutants compared to those of the phosphodiester groups hydrogen‐bonded to the glutamine side chains of the WT (Figure S4b). This suggests that the respective c‐di‐GMP phosphodiester group is not protonated when bound to the mutant proteins and might instead serve as a hydrogen bond acceptor group.
Next, we aimed to investigate the protonation state of the newly introduced glutamate side chains in the phosphodiester‐binding pocket of GSPII‐B using the Q190E mutant as the model for both Q‐to‐E mutants. To enhance spectral quality and resolution, we used a C‐terminally truncated protein construct containing only the α‐helical N‐terminal subdomain (64 residues, PilF159‐221, Figure. 1a). Importantly, the fingerprint 1H, 15N‐HSQC‐spectra of the isolated subdomain and the full‐length domain are very similar in both the apo‐ and the ligand‐bound states (Figure S5a,b), the c‐di‐GMP‐induced chemical shift changes are similar for the isolated subdomain and the full‐length domain (Figure S5c,d) and the subdomain binds c‐di‐GMP with an affinity comparable to the KD of the full‐length domain,[ 22 ] showing that the minimal construct faithfully replicates the c‐di‐GMP binding mode of the full‐length GSPII‐B domain.
To elucidate the protonation state of E190 in the Q190E variant bound to c‐di‐GMP, we established the complete backbone (Figure S6a,b) and the relevant side chain NMR signal assignments for both the PilF159‐221 WT and the PilF159‐221 q190e construct. 3D‐H(CCO)NH and 2D‐Hγ(Cγ)Cδ NMR spectra[ 27 ] for the WT and the Q190E protein in complex with c‐di‐GMP recorded at pH 5.8 were analyzed and compared to unambiguously identify the resonances belonging to the E190 side chain in the mutant (Figure 1f). By correlating the chemical shifts of the Hγ protons with those of the Cδ carbon atom from the carboxylate group of the same glutamate side chain, the 2D‐Hγ(Cγ)Cδ NMR spectrum allows to interrogate the protonation state of the glutamate side chains (Figures 1f and S7). While unprotonated, negatively charged glutamate carboxylate groups have Cδ chemical shifts of 182–186 ppm, the upfield Cδ chemical shift of ∼ 179 ppm for the E190 side chain (Figure 1f) is indicative for the presence of a neutral protonated carboxylate group.[ 26 ] It should be noted that the HγCδ correlation signals for E190 are significantly broader and of lower intensity compared to the corresponding correlations for all other glutamate residues in the protein. This is very likely due to an exchange with the residual population of unprotonated E190 which apparently occurs in the intermediate exchange regime on the NMR time scale.
In conjunction with the short distance and the relative orientation between the carboxylate and phosphodiester oxygen atoms observed in the X‐ray structure the upfield chemical shift observed for Cδ of E190 suggests that in the Q190E mutant complex with c‐di‐GMP, the E190 side chain carboxylate group is protonated and forms a strong hydrogen bonding interaction with the phosphodiester group where the protonated carboxylate group of E190 is the hydrogen bond donor and the non‐bonding oxygen of the phosphodiester group is the hydrogen bond acceptor.
The relevance of the unexpected occurrence of a protonated glutamate side chain as an interaction partner for a phosphodiester group at physiological pH is underscored by the fact that our NMR experiments were conducted at pH 5.8, while the crystal structures of the two Q/E mutants bound to c‐di‐GMP were obtained at pH 7.0. To quantitatively characterize this protonation event, we determined the corresponding pKa values for the unbound, and the c‐di‐GMP‐bound Q190E and the Q218E mutants using the thermodynamic cycle shown in Figure 2a and the formula:
It describes the experimentally measured KD as a function of ligand binding to both the protonated and the unprotonated form and the pH.[ 28 ] Fitting the KD’s measured over a pH range from 4.0 to 8.5 (Figure 2b, Table S3) to this formula yielded pKa values of 7.5 and 6.3 for the Q190E/c‐di‐GMP complex and the free Q190E mutant protein, respectively. For the Q218E/c‐di‐GMP‐complex and the free Q218E protein, pKa values of 7.1 and 5.9 were determined (Figure 2b). This shows that at physiologically relevant pH values of ∼ 7.0, the protonation of the glutamate side chain contributes significantly to ligand binding. pKa values typically measured for glutamate carboxylate groups in unstructured peptides and on protein surfaces are on the order of ∼4.5–5.[ 13 , 29 ] In contrast, the pKa values of E190 and E218 in the ligand free proteins are already significantly higher. This suggests an important role for the specific protein environment in the protonation‐mediated ligand binding process. The increased pKa for the carboxylate group of residue E190 in the unliganded Q190E mutant is also in agreement with the Cδ chemical shift for the E190 side chain at pH 5.8 which is 179.9 ppm, a significant upfield shift compared to the Cδ chemical shifts of the other, unprotonated glutamate side chains[ 26 ] (Figures 2c and S8).
In the WT protein, the two phosphodiester groups of c‐di‐GMP are also hydrogen bonded to the amino groups of the Q190 and Q218 side chains as well as to the backbone amide groups of L197 and L168, respectively[ 22 ] (Figure 1b). Furthermore, the Q190 and Q218 side chain oxygen atoms form hydrogen bonds to the 2′‐OH groups of the ribose moieties (Figure 1b). Using NMR spectroscopy, we analyzed how this hydrogen bonding network is impacted by the replacement of the carboxamide group in glutamine with a protonated carboxylate group in the Q/E mutants.
The 15N‐HSQC spectra for the WT and the Q190E subdomain bound to c‐di‐GMP are overall very similar but reveal a large chemical shift difference for the backbone amide group of L197 (Figure S6c). This backbone amide group forms a hydrogen bond to the same phosphodiester group as the E190 carboxylate group (Figure 1b). Since the L197 backbone amide group signal is shifted upfield by ∼ 1.0 ppm in the Q190E mutant compared to the WT (Figure S6c), this suggests a significant weakening of the respective intermolecular hydrogen bond to the ribose. A quantitative measurement of the 2hJHP cross‐hydrogen bond scalar coupling between the backbone amide proton and the 31P nucleus[ 30 , 31 ] of the phosphodiester group also showed a reduction of the coupling constant from 1.9 Hz in the WT to 0.8 Hz in the mutant in agreement with a significant lengthening of this hydrogen bond (Figure 3a). In contrast, the coupling constant between the backbone amide proton of the second leucine residue, L168, and the phosphorous nucleus in the unaltered ligand binding site is similar between WT and mutant (2.4 vs. 2.1 Hz).
Figure 3.

The intermolecular hydrogen bond network in the GSPII‐B domain phosphodiester binding pockets is modulated by the Q190E and Q218E substitutions. a) 2hJHP cross‐hydrogen bond scalar couplings of the L168 amide and the L197 amide hydrogen bonds to the non‐bridging oxygen atoms of the phosphodiester groups of c‐di‐GMP. The scalar couplings of the ligand‐bound PilF159‐221 WT are shown in red (L168) and orange (L197) and for the ligand‐bound PilF159‐221 Q190E in blue (L168) and cyan (L197). b) Overlay of the 2D‐13C‐TOCSY‐HSQC spectra of 13C‐labelled c‐di‐GMP bound to PilF159‐221 WT (orange) and PilF159‐221 Q190E (blue). Correlations of the 2′‐OH signals to the C2′ resonances are indicated as grey dashed lines. c) Mutant and native phosphodiester binding pockets of Q190E (left) and Q218E (right) in complex with c‐di‐GMP. Hydrogen bonds are indicated as black dashed lines. Hydrogen bond lengths are shown in blue for L197, in cyan for L168, red for E190 (left), and Q190 and in orange for Q218 and E218 (right).
Hydrogen bonding protects the 2′‐OH protons from exchange with the solvent. Accordingly, the corresponding NMR signals for these protons can be detected in 13C‐edited TOCSY‐HSQC experiments[ 32 ] with 13C‐labeled c‐di‐GMP bound to unlabeled protein (Figure 3b). This allows to qualitatively characterize the hydrogen bonds between the 2′‐OH groups of the ribose moieties from c‐di‐GMP and the side chains of protein residues 190 and 218. In the 13C‐TOCSY‐HSQC of the WT protein, two ribose 2′‐OH signals are observable for the two ribose units of the bound c‐di‐GMP ligand (Figure 3b). In contrast, the signal corresponding to the 2′‐OH proton of the ribose hydrogen bonded to the E190 side chain is no longer visible showing that this hydrogen bond is weakened in the mutant. These results agree with the hydrogen bond length measurements in our crystal structures of the two mutants (Figure 3c), where we observed that the hydrogen bond between the L197 amide group and the non‐bridging oxygen of the phosphodiester group is 0.2 Å longer in the Q190E construct than the same hydrogen bond in the Q218E mutant, where this phosphodiester binding pocket is in its WT configuration. Similarly, the hydrogen bond length between the L168 amide group and the phosphodiester oxygen in the Q218E mutant is increased by 0.2 Å compared to the native binding site in the Q190E construct (Figure 3c, d, orange numbers). Finally, the lengths of the hydrogen bonds from E190 to the 2′‐OH group of c‐di‐GMP in the Q190E mutant and from E218 to the 2′‐OH group in the Q218E mutant are increased by 0.4 Å if compared to the native sites in both constructs (Figure 3c, d, red numbers). Together, the NMR data and X‐ray structures suggest that upon substitution of the native glutamine with glutamate, the hydrogen bonding network in the c‐di‐GMP binding site of the GSPII‐B domain of PilF is adjusted towards favoring a short hydrogen bond between the protonated oxygen in the carboxylate group of the glutamate and the phosphodiester at the expense of the other two intermolecular hydrogen bonds in the hydrogen bond network.
Our detailed characterization of the interaction of the GSPII‐B domain Q‐to‐E mutants with the phosphodiester groups of c‐di‐GMP indicated that the carboxylate groups of the relevant acidic amino acid side chains are readily protonated at physiologically relevant pH values and then able to act as hydrogen bond donor groups thereby contributing favorably to ligand affinity. A decisive factor for the importance of such interactions is the specific structural environment of the respective carboxylate group bearing side chains since it likely contributes significantly to shifts of their pKa’s toward neutrality. We therefore wondered whether the potential of acidic sidechains as a general binding motif for phosphodiester groups may have been overlooked so far in previous structural studies of protein‐nucleic acid interactions.
To elucidate whether the interaction of phosphodiester groups with protonated glutamate and aspartate side chains could be a more general feature of biomacromolecular interactions, we looked at all entries of the protein data bank (PDB, www.rcsb.org) containing X‐ray structures of RNA‐protein or DNA‐protein complexes with a resolution better than 2.8 Å. The atomic coordinates were used to identify entries with distances smaller than 3.0 Å between the Oγ and Oδ oxygen atoms of aspartates and glutamates, respectively, with the OP1 and OP2 non‐bridging oxygen atoms of phosphodiester groups. Since this initial list contained many entries of RNA/DNA‐polymerases, nucleases, gyrases, ligases, and other enzymes acting on nucleic acids which often contain aspartate and glutamate residues complexed to divalent metal ions in their active centers which are naturally in close proximity to the phosphodiester backbone due to their catalytic activities, we excluded such examples from further analysis upon visual inspection of the respective entries. In addition, entries of complexes crystallized at pH values below the pKa value of glutamate carboxylate groups in unstructured peptides (∼ 4.4, the pKa value for aspartate carboxylate groups in unstructured peptides is ∼ 3.9[ 12 , 13 , 14 ]) were excluded. Nonetheless, numerous protein nucleic acid complexes featuring glutamate and aspartate side chain carboxylate groups oriented relative to phosphodiester groups in a fashion strongly suggestive of the formation of hydrogen bond interactions between protonated carboxylate groups and the non‐bridging oxygen atoms of the phosphodiester groups were identified. Examples come from different functional protein categories including, e.g., DNA ligases, restriction enzymes and RNA modification enzymes and cover proteins interacting both with DNA and RNA (Figures 4 and S9).
Figure 4.

Examples of glutamate/aspartate carboxylate groups in phosphodiester group recognition by DNA/RNA binding proteins. Shown are select examples of crystal structures for a DNA ligase (PDB: 4GLX),[ 33 ] a restriction endonuclease (PDB: 3IMB),[ 34 ] an endoribonuclease (PDB: 2Y8Y),[ 35 ] and a pseudouridine synthase (PDB: 4LGT)[ 36 ] in complex with a DNA or RNA ligand. For each example, the protein name, the respective pdb code, and the obtained resolution are given. Hydrogen bonds are indicated as black dashed lines and electron density is shown as a grey mesh at a σ level of 1.0.
Conclusion
According to our results presented here, hydrogen bonding interactions with protonated carboxylate groups of acidic amino acid side chains serving as hydrogen bond donors should be more widely considered when analyzing nucleic acid and cyclic (di‐) nucleotide binding motifs in proteins. Particular care should be taken when interpreting the results of mutation experiments where polar and positively charged amino acid side chains are replaced by glutamate and aspartate residues. In summary, our work shows that the interaction of phosphodiesters with the carboxylate groups of the acidic amino acid side chains in their protonated states seem to be a hitherto underappreciated feature in protein nucleic acid interactions.
Supporting Information
The experimental methods and supplementary data (Tables S1‐S3 and Figures S1–S9) are available in the Supporting Information. The authors have cited additional references in the Supporting Information.[ 26 , 27 , 37 , 38 , 39 , 40 , 41 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 53 , 54 , 55 , 56 , 57 ]
Conflict of Interests
The authors declare no conflict of interest.
Supporting information
Supporting Information
Acknowledgements
The Deutsche Forschungsgemeinschaft (DFG) supported this work through grant Wo 901/12–1 to J. W. and through the Cluster of Excellence “Balance of the Microverse” (EXC 2051—Project‐ID 390713860 to U. A. H.)
The Center for Biomolecular Magnetic Resonance (BMRZ) at the Goethe‐University Frankfurt is generously supported by the state of Hesse. The acquisition of the quadruple resonance QCI‐P cryoprobe was supported by the DFG through Grant INST161/816–1 FUGG. U. A. H. acknowledges an instrumentation grant for a high‐field NMR spectrometer by the REACT‐EU EFRE Thuringia (Recovery assistance for cohesion and the territories of Europe, ERDF, Thuringia) initiative of the European Union. Synchrotron X‐ray diffraction data were collected at beamline P13 operated by EMBL Hamburg at the PETRA III storage ring (DESY, Hamburg, Germany).
The authors are grateful to S. Panneerselvam and G. Bourenkov for beam line assistance. Beam time at DESY was allocated through the proposals MX‐939 and MX‐1001. They are grateful to Stefanie Düsterhus for help with sample preparation and Peter Kötter for help with the creation of plasmids for mutants and truncation variants and to Harald Schwalbe for critical reading of the manuscript.
Open access funding enabled and organized by Projekt DEAL.
Neißner K., Duchardt‐Ferner E., Wiedemann C., Kraus J., Hellmich U. A., Wöhnert J., Angew. Chem. Int. Ed. 2025, 64, e202501589. 10.1002/anie.202501589
Data Availability Statement
The data have been submitted to the PDB under accession codes 9GLG and 9GL5 and are on “hold until publication” as is common practice in structural biology.
References
- 1. Thaplyal P., Bevilacqua P. C., Methods Enzymol 2014, 549, 189–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Nadassy K., Wodak S. J., Janin J., Biochemistry 1999, 38, 1999–2017. [DOI] [PubMed] [Google Scholar]
- 3. Coulocheri S. A., Pigis D. G., Papavassiliou K. A., Papavassiliou A. G., Biochimie 2007, 89, 1291–1303. [DOI] [PubMed] [Google Scholar]
- 4. Jones S., van Heyningen P., Berman H. M., Thornton J. M., J. Mol. Biol. 1999, 287, 877–896. [DOI] [PubMed] [Google Scholar]
- 5. Bahadur R. P., Zacharias M., Janin J., Nucleic Acids Res. 2008, 36, 2705–2716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Treger M., Westhof E., J. Mol. Recog. 2001, 14, 199–214. [DOI] [PubMed] [Google Scholar]
- 7. Jones S., Daley D. T. A., Luscombe N. M., Berman H. M., Thornton J. M., Nucleic Acids Res. 2001, 29, 943–954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Zeng X., Huang M., Sun Q.‐X., Peng Y.‐J., Xu X., Tang Y.‐B., Zhang J.‐Y., Yang Y., Zhang C.‐C., Proc. Natl. Acad. Sci. USA 2023, 120, e2221874120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Luscombe N. M., Austin S. E., Berman H. M., Thornton J. M., Genome Biol. 2000, 1, reviews001.1–reviews001.37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Draper D. E., Annu. Rev. Biochem. 1995, 64, 593–620. [DOI] [PubMed] [Google Scholar]
- 11. Draper D. E., J. Mol. Biol. 1999, 293, 255–270. [DOI] [PubMed] [Google Scholar]
- 12. Nozaki Y., Tanford C., J. Biol. Chem. 1967, 242, 4731–4735. [PubMed] [Google Scholar]
- 13. Tollinger M., Forman‐Kay J. D., Kay L. E., J. Am. Chem. Soc. 2002, 124, 5714–5717. [DOI] [PubMed] [Google Scholar]
- 14. Keim P., Vigna R. A., Morrow J. S., Marshall R. C., Gurd F. R. N., J. Biol. Chem. 1973, 248, 7811–7818. [PubMed] [Google Scholar]
- 15. Simm R., Morr M., Kader A., Nimtz M., Römling U., Mol. Microbiol. 2004, 53, 1123–1134. [DOI] [PubMed] [Google Scholar]
- 16. Wolfe A. J., Visick K. L., J. Bacteriol. 2008, 190, 463–475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Guzzo C. R., Salinas R. K., Andrade M. O., Farah C. S., J. Mol. Biol. 2009, 393, 848–866. [DOI] [PubMed] [Google Scholar]
- 18. Tamayo R., Pratt J. T., Camilli A., Annu. Rev. Microbiol. 2007, 61, 131–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Hengge R., Nat. Rev. Microbiol. 2009, 7, 263–273. [DOI] [PubMed] [Google Scholar]
- 20. Salzer R., Joos F., Averhoff B., Appl. Environ. Microbiol. 2014, 80, 644–652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Kruse K., Salzer R., Joos F., Averhoff B., Extremophiles 2018, 22, 461–471. [DOI] [PubMed] [Google Scholar]
- 22. Neißner K., Keller H., Kirchner L., Düsterhus S., Duchardt‐Ferner E., Averhoff B., Wöhnert J., J. Biol. Chem. 2025, 301, 108041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Wang Y.‐C., Chin K.‐H., Tu Z.‐L., He J., Jones C. J., Sanchez D. Z., Yildiz F. H., Galperin M. Y., Chou S.‐H., Nat. Commun. 2016, 7, 12481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Chen J. C.‐H., Hanson B. L., Fisher S. Z., Langan P., Kovalevsky A. Y., Proc. Natl. Acad. Sci. USA 2012, 109, 15301–15306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Hass M. A. S., Mulder F. A. A., Ann Rev Biophys 2015, 44, 53–75. [DOI] [PubMed] [Google Scholar]
- 26. Betz M., Löhr F., Wienk H., Rüterjans H., Biochemistry 2004, 43, 5820–5831. [DOI] [PubMed] [Google Scholar]
- 27. Pellecchia M., Iwai H., Szyperski T., Wüthrich K., J. Magn. Reson. 1997, 124, 274–278. [DOI] [PubMed] [Google Scholar]
- 28. Bradshaw J. M., Waksman G., Biochemistry 1998, 37, 15400–15407. [DOI] [PubMed] [Google Scholar]
- 29. Forsyth W. R., Antosiewicz J. M., Robertson A. D., Proteins 2002, 48, 388–403. [DOI] [PubMed] [Google Scholar]
- 30. Duchardt‐Ferner E., Ferner J., Wöhnert J., Angew Chem Int Ed Engl 2011, 50, 7927–7930. [DOI] [PubMed] [Google Scholar]
- 31. Duchardt‐Ferner E., Wöhnert J., J. Biomol. NMR 2017, 69, 101–110. [DOI] [PubMed] [Google Scholar]
- 32. Hennig M., Fohrer J., Carlomagno T., J. Am. Chem. Soc. 2005, 127, 2028–2029. [DOI] [PubMed] [Google Scholar]
- 33. Surivet J.‐P., Lange R., Hubschwerlen C., Keck W., Specklin J.‐L., Ritz D., Bur D., Locher H., Seiler P., Strasser D. S., Prade L., Kohl C., Schmitt C., Chapoux G., Ilhan E., Ekambaram N., Athanasiou A., Knezevic A., Sabato D., Chambovey A., Gaertner M., Enderlin M., Boehme M., Sippel V., Wyss P., Bioorg. Med. Chem. Lett. 2012, 22, 6705–6711. [DOI] [PubMed] [Google Scholar]
- 34. Sokolowska M., Kaus‐Drobek M., Czapinska H., Tamulaitis G., Szczepanowski R. H., Urbanke C., Siksnys V., Bochtler M., J. Mol. Biol. 2007, 369, 722–734. [DOI] [PubMed] [Google Scholar]
- 35. Sashital D. G., Jinek M., Doudna J. A., Nat. Struct. Mol. Biol. 2011, 18, 680–687. [DOI] [PubMed] [Google Scholar]
- 36. Czudnochowski N., Ashley G. W., Santi D. V., Alian A., Finer‐Moore J., Stroud R. M., Nucleic Acids Res. 2014, 42, 2037–2048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Mejia E., Burak M., Alonso A., Larraga V., Kunkel T. A., Bebenek K., Garcia‐Diaz M., DNA Repair 2014, 18, 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Choi S.‐J., Ban C., Sci. Rep. 2016, 11, 34998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Cheng L., Li F., Jiang Y., Yu H., Xie C., Shi Y., Gong Q., Nucleic Acids Res. 2018, 47, 495–508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Hashimoto H., Olanrewaju Y. O., Zheng Y., Wilson G. G., Zhang X., Cheng X., Genes Dev. 2014, 28, 2304–2313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Edelheit O., Hanukoglu A., Hanukoglu I., BMC Biotechnol. 2009, 9, 61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Gibson D. G., Young L., Chuang R.‐Y., Venter J. C., Hutchison C. A., Smith H. O., Nat. Methods 2009, 6, 343–345. [DOI] [PubMed] [Google Scholar]
- 43. Neißner K., Keller H., Duchardt‐Ferner E., Hacker C., Kruse K., Averhoff B., Wöhnert J., Biomol NMR Assign 2019, 13, 383–390. [DOI] [PubMed] [Google Scholar]
- 44. Rao F., Pasunooti S., Ng Y., Zhuo W., Lim L., Liu A. W., Liang Z.‐X., Anal. Biochem. 2009, 389, 138–142. [DOI] [PubMed] [Google Scholar]
- 45. Keller H., Weickhmann A. K., Bock T., Wöhnert J., RNA 2018, 24, 1390–1402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Kabsch W., Acta Cryst D 2010, 66, 133–144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Evans P., Acta Cryst D 2006, 62, 72–82. [DOI] [PubMed] [Google Scholar]
- 48. Evans P. R., Acta Cryst D 2011, 67, 282–292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Liebschner D., Afonine P. V., Baker M. L., Bunkóczi G., Chen V. B., Croll T. I., Hintze B., Hung L.‐W., Jain S., McCoy A. J., Moriarty N. W., Oeffner R. D., Poon B. K., Prisant M. G., Read R. J., Richardson J. S., Richardson D. C., Sammito M. D., Sobolev O. V., Stockwell D. H., Terwilliger T. C., Urzhumtsev A. G., Videau L. L., Williams C. J., Adams P. D., Acta Cryst D 2019, 75, 861–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Afonine P. V., Grosse‐Kunstleve R. W., Echols N., Headd J. J., Moriarty N. W., Mustyakimov M., Terwilliger T. C., Urzhumtsev A., Zwart P. H., Adams P. D., Acta Cryst D 2012, 68, 352–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Williams C. J., Headd J. J., Moriarty N. W., Prisant M. G., Videau L. L., Deis L. N., Verma V., Keedy D. A., Hintze B. J., Chen V. B., Jain S., Lewis S. M., Arendall W. B., Snoeyink J., Adams P. D., Lovell S. C., Richardson J. S., Richardson D. C., Protein Sci. 2018, 27, 293–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Markley J. L., Bax A., Arata Y., Hilbers C. W., Kaptein R., Sykes B. D., Wright P. E., Wüthrich K., J. Mol. Biol. 1998, 70, 117–142. [DOI] [PubMed] [Google Scholar]
- 53. Powers R., Gronenborn A. M., Marius Clore G., Bax A., J. Magn. Reson. 1991, 94, 209–213. [Google Scholar]
- 54. Grzesiek S., Bax A., J Magn Reson Series B 1993, 102, 103–106. [Google Scholar]
- 55. Raschka S., JOSS 2017, 2, 279. [Google Scholar]
- 56. Maciejewski M. W., Schuyler A. D., Gryk M. R., Moraru I. I., Romero P. R., Ulrich E. L., Eghbalnia H. R., Livny M., Delaglio F., Hoch J. C., Biophys. J. 2017, 112, 1529–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Baskaran K., Craft D. L., Eghbalnia H. R., Gryk M. R., Hoch J. C., Maciejewski M. W., Schuyler A. D., Wedell J. R., Wilburn C. W., Front. Mol. Biosci. 2022, 8, 817175. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting Information
Data Availability Statement
The data have been submitted to the PDB under accession codes 9GLG and 9GL5 and are on “hold until publication” as is common practice in structural biology.
