Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2021 Jun 24;297(2):100909. doi: 10.1016/j.jbc.2021.100909

Structural basis of substrate specificity in human cytidine deaminase family APOBEC3s

Shurong Hou 1, Jeong Min Lee 1, Wazo Myint 2, Hiroshi Matsuo 2, Nese Kurt Yilmaz 1,, Celia A Schiffer 1,
PMCID: PMC8313598  PMID: 34171358

Abstract

The human cytidine deaminase family of APOBEC3s (A3s) plays critical roles in both innate immunity and the development of cancers. A3s comprise seven functionally overlapping but distinct members that can be exploited as nucleotide base editors for treating genetic diseases. Although overall structurally similar, A3s have vastly varying deamination activity and substrate preferences. Recent crystal structures of ssDNA-bound A3s together with experimental studies have provided some insights into distinct substrate specificities among the family members. However, the molecular interactions responsible for their distinct biological functions and how structure regulates substrate specificity are not clear. In this study, we identified the structural basis of substrate specificities in three catalytically active A3 domains whose crystal structures have been previously characterized: A3A, A3B- CTD, and A3G-CTD. Through molecular modeling and dynamic simulations, we found an interdependency between ssDNA substrate binding conformation and nucleotide sequence specificity. In addition to the U-shaped conformation seen in the crystal structure with the CTC0 motif, A3A can accommodate the CCC0 motif when ssDNA is in a more linear (L) conformation. A3B can also bind both U- and L-shaped ssDNA, unlike A3G, which can stably recognize only linear ssDNA. These varied conformations are stabilized by sequence-specific interactions with active site loops 1 and 7, which are highly variable among A3s. Our results explain the molecular basis of previously observed substrate specificities in A3s and have implications for designing A3-specific inhibitors for cancer therapy as well as engineering base-editing systems for gene therapy.

Keywords: APOBEC3, specificity, structural analysis, molecular modeling, molecular dynamics simulation

Abbreviations: CBE, cytosine base editor; CTD, C-terminal domain; NTD, N-terminal domain; pMD, parallel molecular dynamics; ssDNA, single-strand DNA


APOBEC3s (A3s) are a family of cytidine deaminases that have seven members in human with functions in innate immunity and roles in cancer (1, 2, 3, 4, 5). All A3 domains share a conserved structural fold with an active site zinc tetrahedrally coordinated with catalytic His and Cys residues and an additional water. The human A3s have either one (A3A, A3C, and A3H) or two zinc-binding domains (A3B, A3D, A3F, and A3G). The two-domain A3s consist of a catalytically active C-terminal domain (CTD) and an N-terminal domain (NTD) that binds to substrate but has no deamination activity. A3s deaminate cytosine to uracil on single-strand DNA (ssDNA) and certain RNAs (6, 7), thus creating mutations.

Through deamination, A3s play crucial roles in innate immunity by mutating foreign pathogenic genomes and thus protecting host cells against retroviruses and retrotransposons (8, 9, 10, 11, 12, 13, 14). Specifically, A3s deaminate cytosines to uracils on ssDNA during reverse transcription to create G to A hypermutations on the complementary strand. However, misregulated A3 deamination activity may promote cancer and the development of therapeutic resistance. Overexpressed A3s, especially A3A, A3B, and A3H, have been shown to cause heterogeneities in multiple cancers, including breast, bladder, head and neck, cervical, and lung cancer (15, 16, 17, 18). The A3 mutational signature, which is C to T transition in TC context, has been observed in multiple cancer genomes (15, 16, 19). Moreover, study of human cancer cell lines has suggested that A3s may be involved in the origination of cancer (20). Recently, coupled with CRIPSR/Cas9, A3s are explored as novel base editors to treat genetic diseases (21, 22).

The structures of A3s provide the basis for understanding the underlying molecular mechanisms in A3 biology. Several crystal and NMR structures of human or primate A3 single domains (A3A, A3C, A3H; CTDs of A3B, A3F, A3G; NTDs of A3B, A3G) in the apo state have been determined by our group (23, 24, 25, 26, 27, 28, 29) and others (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46). The A3 domain fold consists of six alpha-helices and five beta-strands. The catalytic active site, which also coordinates zinc binding, is in the middle of the domain (Fig. 1A). Recently, our laboratory (47, 48), along with other groups (48, 49), has solved the crystal structures of several A3–ssDNA complexes (A3A-DNA, chimeric A3B-CTD–DNA, A3G-CTD–DNA). These structures identified the binding conformation of DNA, revealed the critical residues for binding, and provided insights into substrate specificity, especially at the −1′ position upstream of the target C. Of these structures, three (A3A, A3B, and A3G) have substrate DNA bound at the active site, with the target cytidine to be deaminated in essentially the same conformation. However, the rest of ssDNA can bind to A3 in different conformations; either in a conformation wrapped around the gatekeeper residue, U-shape, as seen around His29 in A3A (PDB: 5KEG; 5SWW) (47, 49) and chimeric A3B-CTD (PDB: 5TD5) (49), or a more extended linear shape, L-shape, as seen in A3G-CTD (PDB: 6BUX) (48), (Fig. 1). As these complex structures were determined with varying ssDNA sequences, the conformation of the bound DNA may determine the differential specificity of these enzymes.

Figure 1.

Figure 1

Structure and active site loops of A3s.A, crystal structure of ssDNA-bound A3A where the active site loops 1, 3, 5, and 7 are highlighted (PDB ID: 5KEG). ssDNA (orange) and the residues coordinating the zinc (H70, E72A, C101, and C106) are shown as sticks. The catalytic zinc is shown as gray sphere. Amino acid sequence alignments of active site loops in A3A, A3B-CTD, and A3G-CTD are given on the right. B, crystal structure of chimeric A3B-CTD (PDB ID: 5TD5) and A3G-CTD (PDB ID: 6BUX) in cartoon representation. C, schematic representation of the ssDNA binding conformations: U-shape as seen in A3A or chimeric A3B-CTD, or a more extended linear (L) shape as seen in A3G-CTD. The target C is in the zero (0) position with the active site zinc indicated by the dark orange circle, and the other nucleotide positions are represented by orange diamonds along the ssDNA from 5′ to 3′ direction (blue arrow).

Although A3s share highly similar overall structural folds, they have varying levels of deamination activity and substrate specificity. For instance, the activity of A3A, which is the highest in A3 family, can be up to 5000-fold higher compared with the least active A3D (50). All A3 proteins deaminate deoxy-cytidines in ssDNA, but vary in their preferred hotspot sequences or deamination motifs: 5’- (T/C)TC(A/G) for A3A, 5′-ATC(A/G) for A3B and 5′-CCC(A/C/T) for A3G (50, 51, 52, 53, 54). However, based on the amino acid sequence or even the available structures, the molecular mechanisms that are responsible for varied ssDNA binding affinity and deamination activity as well as substrate specificities among A3 domains are not apparent. The loops (loop 1, 3, 5, and 7) surrounding the active site pocket are the least conserved region of A3s (Fig. 1A; Fig. S1). In addition, these active site loops undergo substantial conformational changes relative to apo structures upon ssDNA binding. Therefore, changes surrounding the active site including variations in the arrangements of these loops are likely largely responsible for distinctions in substrate specificity, binding affinity, and deamination activity for ssDNA, as well as the physiological functions of A3s. Structural analysis using molecular modeling and dynamics simulations have been used for investigating the molecular mechanisms underlying biological functions of A3s (55, 56, 57, 58, 59, 60). These studies have revealed the importance of active site loops in substrate binding/activity and specificity. However, the structural mechanism for substrate specificities in A3s is largely unknown.

In this study, we investigate the structural mechanism of substrate specificity and ssDNA-binding conformation in A3s using a combination of molecular modeling, structural analysis, and parallel molecular dynamics (pMD) simulations (761, 62, 63, 64, 65). Three domains of the human A3 family that have high enzyme sequence similarity yet varied substrate specificity are the focus of this study: A3A, A3B-CTD, and A3G-CTD. These enzymes are the best experimentally characterized A3s in terms of available structures and substrate specificity (Figs. 1 and 2) (47, 48, 49, 61). In our analyses, A3 domains were characterized bound to ssDNA of varying nucleotide sequences in either U or L-shape conformation, and the results show an interdependence between substrate specificity and ssDNA conformation. Although the ssDNA was crystallized in a U-shape with A3A and (chimeric) A3B-CTD (47, 49), we find that the A3A-ssDNA enzyme complex adopts varied conformations in a substrate dependent manner, binding ssDNA with a CTC0 motif in a U-shape and CCC0 in an L-shape, A3B can accommodate either conformation in the recognition of its preferred ATC0 substrate motif, while A3G only stably recognizes CCC0 with the ssDNA in an L-shape. These sequence-specific conformations are stabilized by specific interactions with the highly variable loops, loop-1 and loop-7, of the A3 enzymes (Fig. 1). Thus, A3 family of enzymes accommodate varied substrate specificity through conformational rearrangements of both the active site loops and the ssDNA substrate.

Figure 2.

Figure 2

Differential dynamics of ssDNA during the pMD simulations elucidate the stable substrate-bound A3 complexes. The root-mean-squared fluctuations (RMSF) of each nucleotide in four complexes with different sequences (TC or CC) and binding conformations (U or L) are shown as histograms. Two examples of stable (middle column) and unstable (right column) complexes for each of (A) A3A, (B) A3B-CTD, (C) A3G-CTD are depicted. In each panel, seven snapshots of the ssDNA conformation evenly spaced throughout a 100 ns MD trajectory are shown. The A3 proteins are in gray cartoon representation. The ssDNA is shown as stick; colored based on the simulation time from red to blue. The target (0 C) and −1′ position nucleotide in the active site are labeled. The catalytic zinc is shown as red sphere.

Results

The three A3s investigated, A3A, A3B-CTD, and A3G-CTD, were modeled and energy minimized with the DNA conformation either in a wrapped (U) or linear (L) shape based on that in the cocrystal structures with A3A and A3G-CTD (47, 48) respectively (Table S1). The preferred trinucleotide deamination motif is (C/T)TC0 for A3A, ATC0 for A3B-CTD, and CCC0 for A3G-CTD (50, 51, 52, 53, 54). For each A3 enzyme the preferred motif of substrate DNA in both the U and L conformation were characterized. The complexes investigated together with full ssDNA nucleotide sequences are tabulated in Table S1. To analyze the relative stability of these A3–ssDNA complexes, fully solvated pMD simulations were performed in triplicate at 25 °C (300 K) to critically compare and evaluate the molecular interactions with DNA between the A3s.

Substrate specificity and conformation correlate with overall dynamics in the simulations

The wild-type A3A–CTC0 (U) complex, which has DNA conformation in the cocrystal structure of A3A–DNA (47) with the preferred substrate sequence, was stable during the MD simulations as expected. The bound DNA had relatively small root-mean-square fluctuations (RMSF), less than 1.8 Å for the central five nucleotides (Fig. 2A; Fig. S2A). In addition, the key molecular interactions between the target cytidine (C0) and A3A observed in crystal structures were all well maintained throughout the course of the simulation. These included hydrogen bond interactions with His29 (gatekeeper), Thr31, Asn57, Ala71, Glu72, Ser99, Tyr130 and stacking interactions with Tyr130 and His70 (Fig. 3A; Fig. S3). To probe whether A3A can also stably bind ssDNA in a conformation similar to that observed in the A3G-CTD DNA-bound crystal structure, the identical substrate sequence was simulated in a linear conformation in the A3A-CTC0 (L) complex. The ssDNA in this complex had significantly larger fluctuations compared with A3A-CTC0 (U) complex (Fig. 2A; Fig. S2A), with the notable exception of the C0 position (RMSF A3A-CTC0 (U)/A3A-CTC0 (L): 2.9/1.6 Å at −3′, 1.6/1.1 Å at −2′, 1.1/0.7 Å at −1′, 0.5/0.6 Å at 0′, 1.8/0.9 Å at +1′ and 2.8/1.8 Å at +2′ position). In addition, the gatekeeper residue, His29, which is critical for stabilizing ssDNA binding in the U-shape seen in the crystal structure (47), flipped to the apo conformation (Fig. 3A; Fig. S3). As a result, the stacking interactions with downstream nucleotides (+1, +2, +3) and hydrogen bonds with the DNA backbone were lost. Together these results suggested that A3A prefers binding ssDNA bearing a CTC0 motif in U-shape rather than a linear conformation.

Figure 3.

Figure 3

The molecular interactions between the target cytidine (C0) and A3 active sites elucidate stable ssDNA bound complexes. The final snapshot of pMD simulations where the gatekeeper residue (H29, R211, H216 for A3A, A3B and A3G, respectively) and Asn side chain coordinating the DNA are shown in stick representation and labeled. A, A3A (green; unstable: CTC0 (L), stable: CTC0 (U)) (B) A3B-CTD (pink; stable: ATC0 (L), unstable ACC0 (U)) (C) A3G-CTD (yellow; stable: CCC0 (L), unstable: CCC0 (U)). The A3 proteins are shown in cartoon representation. The active site residues that have molecular interactions with target cytidine are shown as sticks and labeled. ssDNA is shown as orange sticks. The catalytic zinc is shown as gray sphere.

Next, A3A bound to DNA with a CCC0 motif, which is still deaminated but reportedly with an ∼3-fold lower efficiency compared with CTC0 (6), was investigated. In contrast to A3A-CTC0 (L), the A3A-CCC0 (L) complex was highly stable, comparable to the A3A-CTC0 (U) complex as indicated by both the RMSF of the ssDNA and stability of critical interactions with A3A (Figs. 2A and 3A; Figs. S2A and S4). The A3A-CCC0 (L) complex was also significantly more stable than A3A-CCC0 (U) as shown by the relative RMSFs (Fig. 2A). In fact, in the destabilized A3A-CCC0 (U) simulation, His70, which coordinates the catalytic zinc, lost stacking interactions with the target cytidine (70% occupancy in CCC0 (L); 50% occupancy in CTC0 (U)), appearing to destabilize the active site. Thus, A3A was able to stably bind ssDNA with both CTC0 and CCC0 motifs (6), albeit using varied interactions and with the DNA in distinct conformations: CTC0 motif was stable in a U-shape while the CCC0 motif was stable in an L-shape.

Similar to the case with A3A, the A3B-ATC0 (U) complex (61), which corresponds to our previously presented wild-type complex (derived from the crystal structure of chimeric A3B/A3A–DNA complex (49)), was stable during the MD simulations. However, unlike A3A, which preferred binding ssDNA depending on the target dinucleotide motif in either U- or L-shaped, A3B showed strong preference for ATC0 over ACC0 motif in MD simulations regardless of the DNA conformation. The RMSF of bound DNA in both A3B-ATC0 complexes, especially for −1′ and 0′ nucleotides (less than 1 Å and 0.6 Å, respectively), was relatively small compared with those in A3B-ACC0 complexes (Fig. 2B; Fig. S2). The molecular interactions between the deamination target nucleotide C0 and A3B were maintained only in ATC0 but not ACC0 complexes (Fig. 3B; Fig. S4). In the A3B-ACC0 (U) complex, the side chain of active site residue Asn240 changed conformation, and hydrogen bond with the ssDNA backbone was lost. This conformer change is similar to what was previously observed for A3G nonsubstrate (rC) simulations (7), which suggests that cytidine was unstable and not poised for deamination. In conclusion, A3B can accommodate both DNA conformations but strongly prefers ATC0 substrate.

The A3G–DNA complexes were considerably more dynamic compared with A3A and A3B, correlating with lower binding affinity (Table S2). ssDNA-binding affinity has been shown to be a good indicator of A3 activity and substrate specificity (6). The wild-type A3G–CCC0 (L) complex, which represents the crystal structure, was the most stable A3G complex during the MD simulations as indicated by relatively low fluctuations of bound DNA (especially the central five nucleotides) and the most stable interactions between 0′ and −1′ cytidines and protein (Fig. 2C; Fig. S5). In contrast, the A3G-CCC0 (U) complex had the highest RMSF of substrate DNA with 4.3 Å. The hydrogen bond interactions as well as the stacking interactions between His216 and ssDNA were lost in both the A3G–CTC0 (U) and A3G–CCC0 (U) complexes (Fig. 3C; Fig. S5). Thus, for A3G only DNA with the CCC0 motif in a linear conformation formed a stable complex in the simulations.

Overall, out of the 12 complexes simulated and analyzed (each of the three A3s bound to two substrate motifs in two conformations), only five A3–DNA complexes were found to be stable and further analyzed; these include A3A-CTC0 (U), A3A-CCC0 (L), A3B-ATC0 (U), A3B-ATC0 (L), and A3G-CCC0 (L).

APOBEC3 preferred ssDNA-binding conformation is defined through interactions with “loop 1”

Among all the active site loops, loop 1, whose sequence is highly variable, had the most extensive molecular contacts with ssDNA (Fig. 1). ssDNA either wrapped around (as observed in the U-shaped complexes) or extended along (as observed in the L-shaped complexes) loop 1 (Fig. 4A). As a result, loop 1 contributed considerable van der Waals (vdW) interactions with ssDNA to stabilize binding (Fig. 4B; Fig. S6). In A3A, the shorter loop 1, with a three-residue deletion relative to A3B and A3G, may allow ssDNA to bind in both conformations. In the A3A–CTC0 (U) complex, ssDNA wrapped around the gatekeeper residue His29 in loop 1. The neighboring Arg28 stacked with upstream bases (−2; −3) and thus stabilized the U conformation. These two residues in loop 1 had the most vdW contacts with ssDNA. In A3A–CCC0 (L) complex, the three-residue deletion in loop 1 allowed ssDNA to extend and make interactions with His182 in alpha-helix 6. In this binding conformation, Ile26 of loop 1 packed with upstream bases and thus had the second highest vdW interactions with ssDNA. Thus loop 1 of A3A both enabled accommodating two different DNA conformations, which permits differential specificity, and established critical interactions with the substrate in both cases.

Figure 4.

Figure 4

The surface representations of the five stable ssDNA-A3 complexes reveal the importance of active site loops in substrate binding. Active site loops and electrostatics of stable ssDNA–A3 complexes displayed for representative frames from pMD simulations. Proteins are shown in surface representation, and ssDNA is shown as orange sticks. A, active site loops (1, 3, 5, 7) are colored with different shades of blue and shown on protein surface. The color caption of each loop is shown on the right. B, vdW contacts between protein and ssDNA over MD simulations. The residues are colored on a rainbow scale from blue to red for increasing contacts; hence warmer colors indicate residues with the most contribution to the intermolecular contacts. The cutoff for the scale is −0.5 kcal/mol. The range of the contacts is shown on the right. C, the electrostatics of protein surfaces, where red is negative and blue is positive charge. The contour levels from positive (+3) to negative (−3).

In A3B and A3G, loop 1 is longer compared with A3A (Fig. 1B). In the A3G–DNA cocrystal structure, Trp211 in the 210-PWV-212 insertion stacks with the −3′ base and thus stretches the bound ssDNA into the more extended L-shape. This interaction was maintained in the complex during simulations and Trp211 was the residue with the strongest vdW interactions with DNA, indicating that Trp211 is critical for the L binding conformation of ssDNA. Similarly, in the A3B–ATC0 (L) complex, Pro206 of the 206-PLV-208 insertion in loop 1 packed with the −3′ nucleotide, thus stabilizing the overall extended L-shape. In A3B, binding DNA in a U-shape may be enabled by Arg210 in loop 1. Arg210 in A3B has a unique conformation compared with other A3s (40, 61): Arg210 replaces the position of the conserved Arg313 position in loop 7 and stabilizes a conserved network of hydrogen bonds initially revealed in apo A3 crystal structures as providing structural stability. As a result, the side chain of Arg210 is oriented toward the core of the protein and thus results in a cavity next to the gatekeeper residue Arg211 (61). This cavity appears to allow ssDNA to wrap around Arg211 and bind in a U conformation in the A3B–ATC0 (U) complex (Fig. 4B). Loops surrounding the active site, including loop 1, also contribute to the electrostatics of the enzyme surface to accommodate the negatively charged DNA backbone. Although still positively charged, the active site of A3G-CTD is weaker and not as positively charged compared with A3A and A3B-CTD (Fig. 4C), correlating with the lower binding affinity (Table S2).

Substrate conformation and nucleotide specificity at −1′ position of the substrate DNA

We previously reported that A3A can bind either thymidine or cytidine at −1′ position with slight preference of T over C (experimental Kd ∼85 nM versus ∼250 nM) (6). Our MD simulations indicate that the TC0 and CC0 motifs are accommodated in distinct DNA-binding conformations and provide insights into preference for the TC0 motif: thymidine in A3A–CTC0 (U) complex forms stable hydrogen bonds with the side chain of Asp131 and the backbone of Tyr132 (Fig. 5A left and Table S3). The same hydrogen bonds are also observed in A3A–DNA cocrystal structure (47, 49). However, the C−1 in the unstable A3A–CCC0 (U) complex (see above) lost the hydrogen bond interaction with the backbone of Tyr132 and was more dynamic with RMSF about twofold higher than that of −1′ thymidine in A3A–CTC0 (U) (Fig. 2A; Fig. S2). In contrast, C−1 in A3A–CCC0 (L) complex revealed stable interactions with A3A protein (Fig. 2A; Fig. S2). The C−1 formed stable hydrogen bonds with the backbone of Tyr132 and side chain of Asp131 through a different side chain conformation (Fig. 5A right). The RMSF of C−1 in CCC0 (L) was about the same as T−1 in CTC0 (U) complex (Fig. 2A). Together these molecular interactions reveal how A3A may accommodate and deaminate TC and CC substrate sequence motifs with varied DNA-binding conformations.

Figure 5.

Figure 5

The structural mechanism for substrate specificity at −1′ position in A3s. A representative frame from pMD simulations for (A and B) A3A (green), (C and D) A3B-CTD (pink), (E and F) A3G-CTD (yellow) are used to show the molecular interactions between −1′ nucleotide and the active site. The A3 proteins are shown in cartoon representation. The active site residues that have molecular interactions with −1′ nucleotide base are shown as sticks. ssDNA is shown as orange sticks. The interacting residues as well as −1′ and 0 nucleotides are labeled.

To further examine the interdependency between substrate-binding conformation and sequence specificity at the −1′ position, additional MD simulations with the four ssDNA-bound A3A complexes were carried out at a higher temperature of 37 °C (310.2 K). These simulations accentuated the instability of the unfavored DNA conformations and facilitated the potential for a conformational switch to a more favorable conformation. ssDNA in both A3A–CTC0 (U) and A3A–CCC0 (L) complexes was again stable during the higher temperature simulations (Fig. 6, A and C). In contrast, ssDNA initiated in A3A–CTC0 (L) and A3A–CCC0 (U) complexes underwent conformational changes to the preferred mode of binding (Fig. 6, B and D). ssDNA in the A3A–CTC0 (L) complex transitioned from an L-shape to U-shape, while ssDNA initiated in a A3A–CCC0 (U) conformation transitioned from U-shape to L-shape during the simulations. These additional simulations not only validated our results at 25 °C but also supported the interdependency between ssDNA-binding conformation and substrate sequence specificity for A3A.

Figure 6.

Figure 6

The interdependency between substrate-binding conformation and sequence specificity at the −1′ position. The conformational changes of ssDNA over time in A3A 37 °C MDs: (A) A3A-CTC0 (U) (B) A3A-CCC0 (U) (C) A3A-CTC0 (L) (D) A3A-CCC0 (L). In each panel, seven snapshots of the ssDNA conformation evenly spaced throughout the 100 ns MD trajectory are shown. The ssDNA is shown as sticks and colored by simulation time (red represents the beginning of the simulations while blue represents the end). The red arrow indicates the conformational transition of ssDNA to a stable position.

A3B exhibits stronger preference for T rather than C at the −1′ position, unlike A3A (53). However, for A3A and A3G, the complexes analyzed above had a C at the upstream −2′ position. To investigate any influence of −2′ nucleotide on substrate preference, additional A3B complexes with CTC0 motif were characterized and compared with ATC0 motif. When the DNA was in a U-shape, i.e., in both A3B–ATC0 (U) and A3B-CTC0 (U) complexes, −1′ T formed stable hydrogen bonds with the backbone of Tyr315 and the side chain of Asp314, which was similar to what was observed in A3A and consistent with our previous results (61) (Fig. 5B left; Table S3). In the A3B–ATC0 (L) complex, unlike what was observed in A3A–CTC0 (L), Asp314 maintained the same side hydrogen bond as in U-shape conformation and thus promoted T over C (Fig. 5B right; Table S2). Thus, these results indicate that within an ATC0 motif A3B can accommodate T at −1′ position in either DNA conformation.

A3G binds ssDNA in an L shape and thus prefers CC0 rather than TC0 at −1′ position. A3G is the only A3 that prefers cytidine over thymidine at −1′ position. In our MD simulations of the A3G-CCC0 (L) complex, the side chain of Asp316 and the backbone of Asp317 formed stable hydrogen bonds with −1′ cytidine (Fig. 5C left; Table S3). These direct hydrogen bonds were lost in the unstable A3G–CCC0 (U) complex simulation (Fig. 5C right; Table S3), further confirming the preference of A3G-CCC0 for the more stable L-shaped conformation.

Nucleotide specificity at −2′ position of substrate DNA

Intra-DNA interactions in A3A, as we previously revealed, may underlie the substrate specificity of T/C at −2′ position (6). However, atomic interactions defining the −2′ specificity for A3B or A3G have remained elusive. To address the molecular mechanism of specificity at −2′ position in A3B, additional models of A3B with a C at position −2′ (A3B–CTC0) and (A3B–CCC0) were compared with A3B’s preferred ATC sequence (53) which was used in the complexes described above. In A3B–ATC0 (L) complex simulation, A−2 formed stable hydrogen bonds with the side chain of Arg311 (occupancy 38% during the simulations), backbone of Ile312 (occupancy 90%), Asp314 (occupancy 98%), and stacking interactions with Trp281 (occupancy 32%) (Fig. 7A). However, all these interactions were significantly weaker in the A3B–CTC0 (L) complex simulations (Fig. 7B) especially for Trp281. Moreover, the side chain of the gatekeeper residue for DNA binding, Arg211, lost interactions with DNA backbone, which likely impairs the binding of target cytidine C0 in the active site (Fig. S4). The destabilization of these interactions is consistent with preference of A3B for the ATC over CTC sequence.

Figure 7.

Figure 7

The structural mechanism for substrate specificity at −2′ position in A3s. A representative frame from MD simulations for (A) A3B-CTD (pink), (B) A3G-CTD (yellow) are used to show the molecular interactions between −2′ nucleotide and A3s. The residues that have molecular interactions with −2′ nucleotide base are shown as sticks. ssDNA is shown as orange sticks except −2′ nucleotide, which is colored in blue. −2′, −1′ and 0′ positions as well as the residues contacting −2′ nucleotide are labeled.

To investigate the specificity at −2′ position for A3G, a model with ACC trinucleotide motif was compared with CCC, which is the preferred motif for A3G (30, 66, 67, 68). In the A3G–CCC0 (L) complex simulation, C−2 was locked in an extensive hydrogen bonding network with residues Pro210 (water-mediated; occupancy 37%), Arg374 (water-mediated; occupancy 31%, 41%), backbones of Ile314 (water-mediated; occupancy 41%), and Val212 (water-mediated; occupancy 34%) and Asp316 (occupancy 43%) (Fig. 7C). Even though there is enough space to accommodate an A, the larger base of A in the ACC0 (L) complex simulation did not occupy the space where water-mediated hydrogen bonds had been; instead, the A−2 was significantly destabilized and formed only one hydrogen bond with Asp316 (occupancy 42%) (Fig. 7D). Thus, A3G preferred to accommodate the smaller C at the −2′ position through an extensive water-mediated hydrogen bonding network.

Discussion

In this study, we investigated the structural molecular mechanism for substrate specificity in A3 enzymes and found an interdependence between substrate conformation and specificity. Specifically, our results indicate that A3A and A3B can accommodate DNA in a linear conformation, in addition to the wrapped U-shaped binding conformation of substrate DNA observed in crystal structures. For A3A, the linear conformation permits recognition of CC0 dinucleotide motif while the U-shape prefers TC0 as observed in the crystal structure. The active site loops are key in defining the overall binding surface and conformation adopted by the DNA to bind A3s. For A3A, A3B, and A3G, loop 1 is critical with extensive interactions with DNA including the gatekeeper residues (His29 in A3A, Arg211 in A3B, and His216 in A3G), which locks DNA in the active site. We described that the three-residue insertion in loop 1 of A3B and A3G compared with A3A underlies the preference for the more extended L-shaped conformation of the DNA through specific stacking interactions. These results indicate that DNA conformation should be considered together with nucleotide preference in defining substrate specificity of A3 enzymes.

Another key loop in substrate recognition is loop 7; previous studies have shown that swapping loop 7 from A3G into A3B altered the substrate specificity in A3B from TC0 to CC0 (40). Additionally, changing Asp317 of A3G into the corresponding residue of A3A (Tyr132) caused A3G to adopt a more A3A-like 5’-TC0 preference (69). While these results suggested that Tyr132 (Tyr315 in A3B) in loop 7 might be important for substrate specificity at −1’ position, our analysis demonstrates there is no specific hydrogen bond between this residue and −1′ base during the MD simulations. Instead we propose this tyrosine is key for stabilizing the U-shaped binding conformation of DNA, and thus the TC0 preference in A3A/A3B and not in A3G. This is supported by the significantly higher vdW contacts of Tyr132/315 with DNA compared with those of Asp317 in A3G (Fig. S6). Thus tyrosine at this critical position in loop 7 stabilizes the U-shape of bound DNA, allowing accommodation of a T at this position.

Unlike human CDAs that bind single nucleosides (70), A3s require at least five consecutive nucleotides for stable substrate recognition (6, 30, 71). The interdependence between binding interactions around the active site suggests that the sequence of nucleotides flanking the target cytidine is also critical for stable substrate binding. We observed interdependence between preference and interactions of various nucleotides positions in our pMD analysis. The nucleotide at −1′ position affects the binding stability of the target nucleotide in the catalytic site (0 position). Having a disfavored nucleotide at the −1′ position destabilized the target nucleotide. Additionally, the nucleotide at −2′ position can influence the specificity at −1′ position. In A3B, the interactions between −2′ A and Asp314 locked the side chain in a conformation that promotes thymidine over cytidine at −1′ position. In both A3A and A3B, preference for T at the −1 position was possible only when the substrate DNA was in a U-shape, which cannot be accommodated by A3G.

Considering the roles of A3s in viral infections and cancer, a better understanding of the molecular mechanism by which A3s recognize different oligonucleotides will be critical for developing therapeutics. Currently, combined with catalytically inactive Cas9 (dCas9), A3s are investigated as novel base editors for direct modification of genomic DNA at single-base resolution (21, 22); as cytosine base editors (CBE), A3s can create mutations to potentially correct genetic diseases. Nevertheless, these editors still require optimization, considering that CBEs can have significant off-target effects, low purity, and wide editing windows (72). To overcome these problems, several versions of CBEs have been engineered; UGI added to increase product purity (73) and generate high fidelity Cas9 with reduced off-target effects (74) or different Cas nucleases (75, 76, 77, 78, 79) were used to narrow activity window. However, modifications to improve the efficiency and fidelity of base editing have not focused on the deaminase. Another major problem in applying CBEs to treat genetic diseases is that the target site must naturally exist in the preferred DNA sequence context for the cytidine deaminase, which may or may not be the case for the desired modification. Therefore, having a library of A3s with different substrate specificities as context-dependent base editors would expand the toolkit available for targeted base editing. Our results revealing the molecular mechanisms underlying A3 specificities could help guide engineering of A3s, especially with modifications to the active site loops, to rationally design A3s with the desired sequence specificity.

Experimental procedures

Protein sequence alignment

Protein sequence alignment was generated by program Geneious 9.0.5 using default multiple alignment.

Molecular modeling

All structure models in this study were first generated with MODELLER9.23 and then optimized using Protein Preparation Wizard in Maestro from Schrodinger suite. The optimization was performed at pH 7.0 and minimized using restrained minimization panel. The ssDNA-bound crystal structures of A3A (PDB: 5KEG for protein; 5SWW for ssDNA) and A3G-CTD2 (PDB: 6BUX) were used as templates for molecular modeling of wild-type A3-ssDNA complexes. ssDNA sequences were mutated through program Coot. ssDNA-bound A3B-CTD structures were modeled using the crystal structures of both apo A3B-CTD (PDB: 5CQH) and A3A-ssDNA complex.

Molecular dynamics simulations

All molecular dynamics simulations were performed for 100 ns using program Desmond from the Schrodinger suite. The simulation systems were built using SPC solvation model and cubic boundary conditions of 12 Å buffer box size with OPLS3 force field through system builder in Maestro. The final systems were neutral and had 0.15 M sodium chloride. A multistage MD simulation protocol was used, which was previously described (61), at 298 °K (25 °C). The four A3A-ssDNA models were also simulated at 310.2 °K (37 °C) using the same simulation protocol.

Analysis of molecular dynamics simulations

The RMSD and molecular interactions (hydrogen bond, stacking interaction occupancies) over the trajectories were calculated using Simulation Interaction Diagram in Maestro from Schrodinger. The per base RMSFs of ssDNA were calculated using Schrodinger python API. The residue vdW potential between A3s and ssDNA during the MD simulations was extracted from the simulation energies using Desmond.

The frame closest to the average RMSD was used as a representative structure for each MD simulation. The electrostatic distributions were calculated using PDB2PQR server and Pymol with the APBS plugin and visualized with contour levels positive (+3) and negative (−3). The time series representations of ssDNA were generated with program VMD using 2000 frames as time step (total 20,000 frames for each MD trajectory). All other structural graphics were made using the program PyMol.

Supporting information

This article contains supporting information.

Conflict of interest

The authors declare no conflict of interest.

Acknowledgments

The authors would like to thank Florian Leidner for technical support.

This work was supported by the US National Institutes of Health Institute of General Medical Sciences (R01GM118474) and National Institute of Allergy and Infectious Diseases (R01AI150478).

Author contributions

S. H., N. K. Y., and C. A. S. conceptualization; S. H., N. K. Y., C. A. S., J. M. L., W. M., and H. M. formal analysis; H. M. and C. A. S. funding acquisition; S. H., J. M. L., W. M., and H. M. investigation; N. K. Y. project administration; H. M., N. K. Y., and C. A. S. supervision; S. H., J. M. L., N. K. Y. and C. A. S. writing-original draft; S. H., N. K. Y., C. A. S., J. M. L., W. M., and H. M. writing-review and editing.

Funding and additional information

W. M. is supported in part by the NIH office of intramural training and education’s Intramural AIDS Research Fellowship. For W. M. and H.M., this project has been funded in whole or in part with federal funds from the National Cancer Institute, National Institutes of Health, under contract HHSN26120080001E. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does it mention of trade names, commercial products, or organizations imply endorsement by the US Government. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Edited by Wolfgang Peti

Contributor Information

Nese Kurt Yilmaz, Email: Nese.KurtYilmaz@umassmed.edu.

Celia A. Schiffer, Email: Celia.Schiffer@umassmed.edu.

Supporting information

Supplemental Figures S1–S6 and Tables S1–S3
mmc1.docx (55.3MB, docx)

References

  • 1.Betts L., Xiang S., Short S.A., Wolfenden R., Carter C.W., Jr. Cytidine deaminase. The 2.3 A crystal structure of an enzyme: Transition-state analog complex. J. Mol. Biol. 1994;235:635–656. doi: 10.1006/jmbi.1994.1018. [DOI] [PubMed] [Google Scholar]
  • 2.Jarmuz A., Chester A., Bayliss J., Gisbourne J., Dunham I., Scott J., Navaratnam N. An anthropoid-specific locus of orphan C to U RNA-editing enzymes on chromosome 22. Genomics. 2002;79:285–296. doi: 10.1006/geno.2002.6718. [DOI] [PubMed] [Google Scholar]
  • 3.Wedekind J.E., Dance G.S., Sowden M.P., Smith H.C. Messenger RNA editing in mammals: New members of the APOBEC family seeking roles in the family business. Trends Genet. 2003;19:207–216. doi: 10.1016/S0168-9525(03)00054-4. [DOI] [PubMed] [Google Scholar]
  • 4.Conticello S.G., Thomas C.J., Petersen-Mahrt S.K., Neuberger M.S. Evolution of the AID/APOBEC family of polynucleotide (deoxy)cytidine deaminases. Mol. Biol. Evol. 2005;22:367–377. doi: 10.1093/molbev/msi026. [DOI] [PubMed] [Google Scholar]
  • 5.LaRue R.S., Andresdottir V., Blanchard Y., Conticello S.G., Derse D., Emerman M., Greene W.C., Jonsson S.R., Landau N.R., Lochelt M., Malik H.S., Malim M.H., Munk C., O'Brien S.J., Pathak V.K. Guidelines for naming nonprimate APOBEC3 genes and proteins. J. Virol. 2009;83:494–497. doi: 10.1128/JVI.01976-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Silvas T.V., Hou S., Myint W., Nalivaika E., Somasundaran M., Kelch B.A., Matsuo H., Kurt Yilmaz N., Schiffer C.A. Substrate sequence selectivity of APOBEC3A implicates intra-DNA interactions. Sci. Rep. 2018;8:7511. doi: 10.1038/s41598-018-25881-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Solomon W.C., Myint W., Hou S., Kanai T., Tripathi R., Kurt Yilmaz N., Schiffer C.A., Matsuo H. Mechanism for APOBEC3G catalytic exclusion of RNA and non-substrate DNA. Nucleic Acids Res. 2019;47:7676–7689. doi: 10.1093/nar/gkz550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sheehy A.M., Gaddis N.C., Choi J.D., Malim M.H. Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein. Nature. 2002;418:646–650. doi: 10.1038/nature00939. [DOI] [PubMed] [Google Scholar]
  • 9.Zheng Y.H., Irwin D., Kurosu T., Tokunaga K., Sata T., Peterlin B.M. Human APOBEC3F is another host factor that blocks human immunodeficiency virus type 1 replication. J. Virol. 2004;78:6073–6076. doi: 10.1128/JVI.78.11.6073-6076.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dang Y., Siew L.M., Wang X., Han Y., Lampen R., Zheng Y.H. Human cytidine deaminase APOBEC3H restricts HIV-1 replication. J. Biol. Chem. 2008;283:11606–11614. doi: 10.1074/jbc.M707586200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dang Y., Wang X., Esselman W.J., Zheng Y.H. Identification of APOBEC3DE as another antiretroviral factor from the human APOBEC family. J. Virol. 2006;80:10522–10533. doi: 10.1128/JVI.01123-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bogerd H.P., Wiegand H.L., Doehle B.P., Lueders K.K., Cullen B.R. APOBEC3A and APOBEC3B are potent inhibitors of LTR-retrotransposon function in human cells. Nucleic Acids Res. 2006;34:89–95. doi: 10.1093/nar/gkj416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Muckenfuss H., Hamdorf M., Held U., Perkovic M., Lower J., Cichutek K., Flory E., Schumann G.G., Munk C. APOBEC3 proteins inhibit human LINE-1 retrotransposition. J. Biol. Chem. 2006;281:22161–22172. doi: 10.1074/jbc.M601716200. [DOI] [PubMed] [Google Scholar]
  • 14.Mangeat B., Turelli P., Caron G., Friedli M., Perrin L., Trono D. Broad antiretroviral defence by human APOBEC3G through lethal editing of nascent reverse transcripts. Nature. 2003;424:99–103. doi: 10.1038/nature01709. [DOI] [PubMed] [Google Scholar]
  • 15.Burns M.B., Temiz N.A., Harris R.S. Evidence for APOBEC3B mutagenesis in multiple human cancers. Nat. Genet. 2013;45:977–983. doi: 10.1038/ng.2701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Roberts S.A., Lawrence M.S., Klimczak L.J., Grimm S.A., Fargo D., Stojanov P., Kiezun A., Kryukov G.V., Carter S.L., Saksena G., Harris S., Shah R.R., Resnick M.A., Getz G., Gordenin D.A. An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers. Nat. Genet. 2013;45:970–976. doi: 10.1038/ng.2702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Starrett G.J., Luengas E.M., McCann J.L., Ebrahimi D., Temiz N.A., Love R.P., Feng Y., Adolph M.B., Chelico L., Law E.K., Carpenter M.A., Harris R.S. The DNA cytosine deaminase APOBEC3H haplotype I likely contributes to breast and lung cancer mutagenesis. Nat. Commun. 2016;7:12918. doi: 10.1038/ncomms12918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cortez L.M., Brown A.L., Dennis M.A., Collins C.D., Brown A.J., Mitchell D., Mertz T.M., Roberts S.A. APOBEC3A is a prominent cytidine deaminase in breast cancer. PLoS Genet. 2019;15 doi: 10.1371/journal.pgen.1008545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Alexandrov L.B., Nik-Zainal S., Wedge D.C., Aparicio S.A., Behjati S., Biankin A.V., Bignell G.R., Bolli N., Borg A., Borresen-Dale A.L., Boyault S., Burkhardt B., Butler A.P., Caldas C., Davies H.R. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Petljak M., Alexandrov L.B., Brammeld J.S., Price S., Wedge D.C., Grossmann S., Dawson K.J., Ju Y.S., Iorio F., Tubio J.M.C., Koh C.C., Georgakopoulos-Soares I., Rodriguez-Martin B., Otlu B., O'Meara S. Characterizing mutational signatures in human cancer cell lines reveals episodic APOBEC mutagenesis. Cell. 2019;176:1282–1294.e1220. doi: 10.1016/j.cell.2019.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Komor A.C., Kim Y.B., Packer M.S., Zuris J.A., Liu D.R. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature. 2016;533:420–424. doi: 10.1038/nature17946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Nishida K., Arazoe T., Yachie N., Banno S., Kakimoto M., Tabata M., Mochizuki M., Miyabe A., Araki M., Hara K.Y., Shimatani Z., Kondo A. Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems. Science. 2016;353 doi: 10.1126/science.aaf8729. [DOI] [PubMed] [Google Scholar]
  • 23.Chen K.M., Harjes E., Gross P.J., Fahmy A., Lu Y., Shindo K., Harris R.S., Matsuo H. Structure of the DNA deaminase domain of the HIV-1 restriction factor APOBEC3G. Nature. 2008;452:116–119. doi: 10.1038/nature06638. [DOI] [PubMed] [Google Scholar]
  • 24.Harjes E., Gross P.J., Chen K.M., Lu Y., Shindo K., Nowarski R., Gross J.D., Kotler M., Harris R.S., Matsuo H. An extended structure of the APOBEC3G catalytic domain suggests a unique holoenzyme model. J. Mol. Biol. 2009;389:819–832. doi: 10.1016/j.jmb.2009.04.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Shandilya S.M., Nalam M.N., Nalivaika E.A., Gross P.J., Valesano J.C., Shindo K., Li M., Munson M., Royer W.E., Harjes E., Kono T., Matsuo H., Harris R.S., Somasundaran M., Schiffer C.A. Crystal structure of the APOBEC3G catalytic domain reveals potential oligomerization interfaces. Structure. 2010;18:28–38. doi: 10.1016/j.str.2009.10.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Li M., Shandilya S.M., Carpenter M.A., Rathore A., Brown W.L., Perkins A.L., Harki D.A., Solberg J., Hook D.J., Pandey K.K., Parniak M.A., Johnson J.R., Krogan N.J., Somasundaran M., Ali A. First-in-class small molecule inhibitors of the single-strand DNA cytosine deaminase APOBEC3G. ACS Chem. Biol. 2012;7:506–517. doi: 10.1021/cb200440y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bohn M.F., Shandilya S.M., Albin J.S., Kouno T., Anderson B.D., McDougle R.M., Carpenter M.A., Rathore A., Evans L., Davis A.N., Zhang J., Lu Y., Somasundaran M., Matsuo H., Harris R.S. Crystal structure of the DNA cytosine deaminase APOBEC3F: The catalytically active and HIV-1 Vif-binding domain. Structure. 2013;21:1042–1050. doi: 10.1016/j.str.2013.04.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bohn M.F., Shandilya S.M., Silvas T.V., Nalivaika E.A., Kouno T., Kelch B.A., Ryder S.P., Kurt-Yilmaz N., Somasundaran M., Schiffer C.A. The ssDNA mutator APOBEC3A is regulated by cooperative dimerization. Structure. 2015;23:903–911. doi: 10.1016/j.str.2015.03.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kouno T., Luengas E.M., Shigematsu M., Shandilya S.M., Zhang J., Chen L., Hara M., Schiffer C.A., Harris R.S., Matsuo H. Structure of the Vif-binding domain of the antiviral enzyme APOBEC3G. Nat. Struct. Mol. Biol. 2015;22:485–491. doi: 10.1038/nsmb.3033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Chelico L., Pham P., Calabrese P., Goodman M.F. APOBEC3G DNA deaminase acts processively 3′--> 5′ on single-stranded DNA. Nat. Struct. Mol. Biol. 2006;13:392–399. doi: 10.1038/nsmb1086. [DOI] [PubMed] [Google Scholar]
  • 31.Holden L.G., Prochnow C., Chang Y.P., Bransteitter R., Chelico L., Sen U., Stevens R.C., Goodman M.F., Chen X.S. Crystal structure of the anti-viral APOBEC3G catalytic domain and functional implications. Nature. 2008;456:121–124. doi: 10.1038/nature07357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Chelico L., Sacho E.J., Erie D.A., Goodman M.F. A model for oligomeric regulation of APOBEC3G cytosine deaminase-dependent restriction of HIV. J. Biol. Chem. 2008;283:13780–13791. doi: 10.1074/jbc.M801004200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Furukawa A., Nagata T., Matsugami A., Habu Y., Sugiyama R., Hayashi F., Kobayashi N., Yokoyama S., Takaku H., Katahira M. Structure, interaction and real-time monitoring of the enzymatic reaction of wild-type APOBEC3G. EMBO J. 2009;28:440–451. doi: 10.1038/emboj.2008.290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Chelico L., Prochnow C., Erie D.A., Chen X.S., Goodman M.F. Structural model for deoxycytidine deamination mechanisms of the HIV-1 inactivation enzyme APOBEC3G. J. Biol. Chem. 2010;285:16195–16205. doi: 10.1074/jbc.M110.107987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kitamura S., Ode H., Nakashima M., Imahashi M., Naganawa Y., Kurosawa T., Yokomaku Y., Yamane T., Watanabe N., Suzuki A., Sugiura W., Iwatani Y. The APOBEC3C crystal structure and the interface for HIV-1 Vif binding. Nat. Struct. Mol. Biol. 2012;19:1005–1010. doi: 10.1038/nsmb.2378. [DOI] [PubMed] [Google Scholar]
  • 36.Siu K.K., Sultana A., Azimi F.C., Lee J.E. Structural determinants of HIV-1 Vif susceptibility and DNA binding in APOBEC3F. Nat. Commun. 2013;4:2593. doi: 10.1038/ncomms3593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Byeon I.J., Ahn J., Mitra M., Byeon C.H., Hercik K., Hritz J., Charlton L.M., Levin J.G., Gronenborn A.M. NMR structure of human restriction factor APOBEC3A reveals substrate binding and enzyme specificity. Nat. Commun. 2013;4:1890. doi: 10.1038/ncomms2883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Mitra M., Hercik K., Byeon I.J., Ahn J., Hill S., Hinchee-Rodriguez K., Singer D., Byeon C.H., Charlton L.M., Nam G., Heidecker G., Gronenborn A.M., Levin J.G. Structural determinants of human APOBEC3A enzymatic and nucleic acid binding properties. Nucleic Acids Res. 2014;42:1095–1110. doi: 10.1093/nar/gkt945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lu X., Zhang T., Xu Z., Liu S., Zhao B., Lan W., Wang C., Ding J., Cao C. Crystal structure of DNA cytidine deaminase ABOBEC3G catalytic deamination domain suggests a binding mode of full-length enzyme to single-stranded DNA. J. Biol. Chem. 2015;290:4010–4021. doi: 10.1074/jbc.M114.624262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Shi K., Carpenter M.A., Kurahashi K., Harris R.S., Aihara H. Crystal structure of the DNA deaminase APOBEC3B catalytic domain. J. Biol. Chem. 2015;290:28120–28130. doi: 10.1074/jbc.M115.679951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Shaban N.M., Shi K., Li M., Aihara H., Harris R.S. 1.92 Angstrom zinc-free APOBEC3F catalytic domain crystal structure. J. Mol. Biol. 2016;428:2307–2316. doi: 10.1016/j.jmb.2016.04.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Byeon I.J., Byeon C.H., Wu T., Mitra M., Singer D., Levin J.G., Gronenborn A.M. Nuclear magnetic resonance structure of the APOBEC3B catalytic domain: Structural basis for substrate binding and DNA deaminase activity. Biochemistry. 2016;55:2944–2959. doi: 10.1021/acs.biochem.6b00382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Xiao X., Li S.X., Yang H., Chen X.S. Crystal structures of APOBEC3G N-domain alone and its complex with DNA. Nat. Commun. 2016;7:12193. doi: 10.1038/ncomms12193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Shi K., Demir O., Carpenter M.A., Wagner J., Kurahashi K., Harris R.S., Amaro R.E., Aihara H. Conformational switch regulates the DNA cytosine deaminase activity of human APOBEC3B. Sci. Rep. 2017;7:17415. doi: 10.1038/s41598-017-17694-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Xiao X., Yang H., Arutiunian V., Fang Y., Besse G., Morimoto C., Zirkle B., Chen X.S. Structural determinants of APOBEC3B non-catalytic domain for molecular assembly and catalytic regulation. Nucleic Acids Res. 2017;45:7494–7506. doi: 10.1093/nar/gkx362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ito F., Yang H., Xiao X., Li S.X., Wolfe A., Zirkle B., Arutiunian V., Chen X.S. Understanding the structure, multimerization, subcellular localization and mC selectivity of a genomic mutator and anti-HIV factor APOBEC3H. Sci. Rep. 2018;8:3763. doi: 10.1038/s41598-018-21955-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kouno T., Silvas T.V., Hilbert B.J., Shandilya S.M.D., Bohn M.F., Kelch B.A., Royer W.E., Somasundaran M., Kurt Yilmaz N., Matsuo H., Schiffer C.A. Crystal structure of APOBEC3A bound to single-stranded DNA reveals structural basis for cytidine deamination and specificity. Nat. Commun. 2017;8:15024. doi: 10.1038/ncomms15024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Maiti A., Myint W., Kanai T., Delviks-Frankenberry K., Sierra Rodriguez C., Pathak V.K., Schiffer C.A., Matsuo H. Crystal structure of the catalytic domain of HIV-1 restriction factor APOBEC3G in complex with ssDNA. Nat. Commun. 2018;9:2460. doi: 10.1038/s41467-018-04872-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shi K., Carpenter M.A., Banerjee S., Shaban N.M., Kurahashi K., Salamango D.J., McCann J.L., Starrett G.J., Duffy J.V., Demir O., Amaro R.E., Harki D.A., Harris R.S., Aihara H. Structural basis for targeted DNA cytosine deamination and mutagenesis by APOBEC3A and APOBEC3B. Nat. Struct. Mol. Biol. 2017;24:131–139. doi: 10.1038/nsmb.3344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Ito F., Fu Y., Kao S.A., Yang H., Chen X.S. Family-wide comparative analysis of cytidine and methylcytidine deamination by eleven human APOBEC proteins. J. Mol. Biol. 2017;429:1787–1799. doi: 10.1016/j.jmb.2017.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Beale R.C., Petersen-Mahrt S.K., Watt I.N., Harris R.S., Rada C., Neuberger M.S. Comparison of the differential context-dependence of DNA deamination by APOBEC enzymes: Correlation with mutation spectra in vivo. J. Mol. Biol. 2004;337:585–596. doi: 10.1016/j.jmb.2004.01.046. [DOI] [PubMed] [Google Scholar]
  • 52.Pham P., Bransteitter R., Petruska J., Goodman M.F. Processive AID-catalysed cytosine deamination on single-stranded DNA simulates somatic hypermutation. Nature. 2003;424:103–107. doi: 10.1038/nature01760. [DOI] [PubMed] [Google Scholar]
  • 53.Chan K., Roberts S.A., Klimczak L.J., Sterling J.F., Saini N., Malc E.P., Kim J., Kwiatkowski D.J., Fargo D.C., Mieczkowski P.A., Getz G., Gordenin D.A. An APOBEC3A hypermutation signature is distinguishable from the signature of background mutagenesis by APOBEC3B in human cancers. Nat. Genet. 2015;47:1067–1072. doi: 10.1038/ng.3378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Langlois M.A., Beale R.C., Conticello S.G., Neuberger M.S. Mutational comparison of the single-domained APOBEC3C and double-domained APOBEC3F/G anti-retroviral cytidine deaminases provides insight into their DNA target site specificities. Nucleic Acids Res. 2005;33:1913–1923. doi: 10.1093/nar/gki343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Matsumoto T., Shirakawa K., Yokoyama M., Fukuda H., Sarca A.D., Koyabu S., Yamazaki H., Kazuma Y., Matsui H., Maruyama W., Nagata K., Tanabe F., Kobayashi M., Shindo K., Morishita R. Protein kinase A inhibits tumor mutator APOBEC3B through phosphorylation. Sci. Rep. 2019;9:8307. doi: 10.1038/s41598-019-44407-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Autore F., Bergeron J.R., Malim M.H., Fraternali F., Huthoff H. Rationalisation of the differences between APOBEC3G structures from crystallography and NMR studies by molecular dynamics simulations. PLoS One. 2010;5 doi: 10.1371/journal.pone.0011515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Wagner J.R., Demir O., Carpenter M.A., Aihara H., Harki D.A., Harris R.S., Amaro R.E. Determinants of oligonucleotide selectivity of APOBEC3B. J. Chem. Inf. Model. 2019;59:2264–2273. doi: 10.1021/acs.jcim.8b00427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Ng J.C.-F., Fraternali F. Understanding the structural details of APOBEC3-DNA interactions using graph-based representations. Curr. Res. Struct. Biol. 2020;2:130–143. doi: 10.1016/j.crstbi.2020.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Shi K., Demir O., Carpenter M.A., Banerjee S., Harki D.A., Amaro R.E., Harris R.S., Aihara H. Active site plasticity and possible modes of chemical inhibition of the human DNA deaminase APOBEC3B. FASEB Bioadv. 2020;2:49–58. doi: 10.1096/fba.2019-00068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Shandilya S.M., Bohn M.F., Schiffer C.A. A computational analysis of the structural determinants of APOBEC3's catalytic activity and vulnerability to HIV-1 Vif. Virology. 2014;471-473:105–116. doi: 10.1016/j.virol.2014.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Hou S., Silvas T.V., Leidner F., Nalivaika E.A., Matsuo H., Kurt Yilmaz N., Schiffer C.A. Structural analysis of the active site and DNA binding of human cytidine deaminase APOBEC3B. J. Chem. Theory Comput. 2019;15:637–647. doi: 10.1021/acs.jctc.8b00545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Ozen A., Lin K.H., Kurt Yilmaz N., Schiffer C.A. Structural basis and distal effects of Gag substrate coevolution in drug resistance to HIV-1 protease. Proc. Natl. Acad. Sci. U. S. A. 2014;111:15993–15998. doi: 10.1073/pnas.1414063111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Soumana D.I., Kurt Yilmaz N., Ali A., Prachanronarong K.L., Schiffer C.A. Molecular and dynamic mechanism underlying drug resistance in genotype 3 hepatitis C NS3/4A protease. J. Am. Chem. Soc. 2016;138:11850–11859. doi: 10.1021/jacs.6b06454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Paulsen J.L., Leidner F., Ragland D.A., Kurt Yilmaz N., Schiffer C.A. Interdependence of inhibitor recognition in HIV-1 protease. J. Chem. Theory Comput. 2017;13:2300–2309. doi: 10.1021/acs.jctc.6b01262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Henes M., Lockbaum G.J., Kosovrasti K., Leidner F., Nachum G.S., Nalivaika E.A., Lee S.K., Spielvogel E., Zhou S., Swanstrom R., Bolon D.N.A., Kurt Yilmaz N., Schiffer C.A. Picomolar to micromolar: Elucidating the role of distal mutations in HIV-1 protease in conferring drug resistance. ACS Chem. Biol. 2019;14:2441–2452. doi: 10.1021/acschembio.9b00370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Zhang H., Yang B., Pomerantz R.J., Zhang C., Arunachalam S.C., Gao L. The cytidine deaminase CEM15 induces hypermutation in newly synthesized HIV-1 DNA. Nature. 2003;424:94–98. doi: 10.1038/nature01707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Yu Q., Konig R., Pillai S., Chiles K., Kearney M., Palmer S., Richman D., Coffin J.M., Landau N.R. Single-strand specificity of APOBEC3G accounts for minus-strand deamination of the HIV genome. Nat. Struct. Mol. Biol. 2004;11:435–442. doi: 10.1038/nsmb758. [DOI] [PubMed] [Google Scholar]
  • 68.Suspene R., Sommer P., Henry M., Ferris S., Guetard D., Pochet S., Chester A., Navaratnam N., Wain-Hobson S., Vartanian J.P. APOBEC3G is a single-stranded DNA cytidine deaminase and functions independently of HIV reverse transcriptase. Nucleic Acids Res. 2004;32:2421–2429. doi: 10.1093/nar/gkh554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Rathore A., Carpenter M.A., Demir O., Ikeda T., Li M., Shaban N.M., Law E.K., Anokhin D., Brown W.L., Amaro R.E., Harris R.S. The local dinucleotide preference of APOBEC3G can be altered from 5'-CC to 5'-TC by a single amino acid substitution. J. Mol. Biol. 2013;425:4442–4454. doi: 10.1016/j.jmb.2013.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Cacciamani T., Vita A., Cristalli G., Vincenzetti S., Natalini P., Ruggieri S., Amici A., Magni G. Purification of human cytidine deaminase: Molecular and enzymatic characterization and inhibition by synthetic pyrimidine analogs. Arch. Biochem. Biophys. 1991;290:285–292. doi: 10.1016/0003-9861(91)90543-r. [DOI] [PubMed] [Google Scholar]
  • 71.Harjes S., Solomon W.C., Li M., Chen K.M., Harjes E., Harris R.S., Matsuo H. Impact of H216 on the DNA binding and catalytic activities of the HIV restriction factor APOBEC3G. J. Virol. 2013;87:7008–7014. doi: 10.1128/JVI.03173-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Rees H.A., Liu D.R. Base editing: Precision chemistry on the genome and transcriptome of living cells. Nat. Rev. Genet. 2018;19:770–788. doi: 10.1038/s41576-018-0059-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Komor A.C., Zhao K.T., Packer M.S., Gaudelli N.M., Waterbury A.L., Koblan L.W., Kim Y.B., Badran A.H., Liu D.R. Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity. Sci. Adv. 2017;3 doi: 10.1126/sciadv.aao4774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Kleinstiver B.P., Pattanayak V., Prew M.S., Tsai S.Q., Nguyen N.T., Zheng Z., Joung J.K. High-fidelity CRISPR-Cas9 nucleases with no detectable genome-wide off-target effects. Nature. 2016;529:490–495. doi: 10.1038/nature16526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ran F.A., Cong L., Yan W.X., Scott D.A., Gootenberg J.S., Kriz A.J., Zetsche B., Shalem O., Wu X., Makarova K.S., Koonin E.V., Sharp P.A., Zhang F. In vivo genome editing using Staphylococcus aureus Cas9. Nature. 2015;520:186–191. doi: 10.1038/nature14299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Zetsche B., Gootenberg J.S., Abudayyeh O.O., Slaymaker I.M., Makarova K.S., Essletzbichler P., Volz S.E., Joung J., van der Oost J., Regev A., Koonin E.V., Zhang F. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell. 2015;163:759–771. doi: 10.1016/j.cell.2015.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Kim E., Koo T., Park S.W., Kim D., Kim K., Cho H.Y., Song D.W., Lee K.J., Jung M.H., Kim S., Kim J.H., Kim J.H., Kim J.S. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni. Nat. Commun. 2017;8:14500. doi: 10.1038/ncomms14500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Kleinstiver B.P., Prew M.S., Tsai S.Q., Topkar V.V., Nguyen N.T., Zheng Z., Gonzales A.P., Li Z., Peterson R.T., Yeh J.R., Aryee M.J., Joung J.K. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015;523:481–485. doi: 10.1038/nature14592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Lee C.M., Cradick T.J., Bao G. The Neisseria meningitidis CRISPR-Cas9 system enables specific genome editing in mammalian cells. Mol. Ther. 2016;24:645–654. doi: 10.1038/mt.2016.8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Figures S1–S6 and Tables S1–S3
mmc1.docx (55.3MB, docx)

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES