Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Mar 6.
Published in final edited form as: J Mol Biol. 2009 Mar 6;386(4):1054–1065. doi: 10.1016/j.jmb.2009.01.007

Signatures of protein-DNA recognition in free DNA binding sites

Jason W Locasale 1,2,**, Andrew A Napoli 1,**, Shengfeng Chen 1,**, Helen M Berman 1, Catherine L Lawson 1,*
PMCID: PMC2753591  NIHMSID: NIHMS100309  PMID: 19244617

Summary

One obstacle to achieving complete understanding of the principles underlying sequence-dependent recognition of DNA is the paucity of structural data for DNA recognition sequences in their free (unbound) state. Here we carried out crystallization screening of 50 DNA duplexes containing cognate protein binding sites and obtained new crystal structures of free DNA binding sites for three distinct modes of DNA recognition: anti-parallel beta strands (MetR), helix-turn-helix motif + hinge helices (PurR), and zinc fingers (Zif268). Structural changes between free and protein-bound DNA are manifested differently in each case. The new DNA structures reveal that distinctive sequence-dependent DNA geometry dominates recognition by MetR, protein-induced bending of DNA dictates recognition by PurR, and deformability of DNA along the A-B continuum is important in recognition by Zif268. Together, our findings show that crystal structures of free DNA binding sites provide new information about the nature of protein-DNA interactions and thus lend insights towards a structural code for DNA recognition.

Keywords: DNA structure, transcription factors, indirect readout, protein-DNA interactions, gene regulation

Introduction

Sequence-dependent recognition of DNA by transcription factors and other DNA binding proteins is central to the regulation of gene regulatory processes. Direct physical interactions between DNA base pairs and protein residues play an important role in DNA sequence recognition, but the identities of non-contacted DNA bases can also markedly affect binding affinities.

The collective contributions of DNA sequence to the recognition process, referred to as indirect readout, are complicated to understand because they arise as a global consequence of the whole protein-DNA interaction.1,2 Indirect readout is known to include sequence-dependent effects on DNA deformability as well as sequence-dependent effects on DNA conformation, local solvent and ion structure. However, the general guiding principles that underlie sequence dependent protein-DNA recognition are far from entirely understood.

Analyses of X-ray crystal structures of protein-DNA complexes have provided many insights into the nature of protein-DNA recognition,3,4 but these studies do not parse the extent to which the DNA structure adopted in the complex is induced by protein binding, or is instead intrinsic to the DNA sequence. To fully grasp how these effects separately contribute to protein-DNA recognition, an analysis of the DNA structure in both the free and bound state is necessary.

There are only a few examples of cognate DNA sequences with crystal structures reported in both the free state and in the protein-DNA complex. These include the EcoRI endonuclease binding site,5,6 the trp repressor operator region,1,7,8 the papillomavirus E2 binding site,9,10 runt-domain containing transcription regulator binding sites,11,12,13,14 and a Nuclear Factor Kappa B binding site.15,16 In each of these cases, unique insights into the specificity of the protein-DNA interaction were obtained by comparing bound and free DNA structures.

In this work, we carried out systematic crystallization screening of fifty DNA oligonucleotide duplexes with sequences derived from crystal structures of protein-DNA complexes (see supplement). By following up on positive hits from the screen, we obtained new crystal structures of cognate recognition sites for three distinct gene regulatory proteins: the methionine repressor (MetR), the purine repressor (PurR), and a zinc finger binding protein, Zif268. Each of these DNA binding proteins exhibits a different mode of DNA recognition. In each case, comparison of bound and free DNA permits identification of signatures (i.e., sequence-dependent conformational features) that must play a role in protein-DNA recognition. The new structures also permit comparison of the central ACGT tetrad that is shared by MetR, PurR, and E2 binding sites but is very differently employed in each of the three protein-DNA interactions. The DNA sequence immediately adjacent to ACGT appears to play a role in defining how ACGT is utilized in protein-DNA recognition. Our findings underscore the importance of sequence context in determining the contributions of DNA conformation and DNA deformability in protein-DNA interactions.

Results and Discussion

Recognition of the methionine repressor “met box” binding site

The MetR “met box” DNA dodecamer (free DNA site), d-(5′-GCAGACGTCTGC-3′)2, was extracted from methionine repressor-DNA complex crystal structure DNA sequences d-(5′-TTAGACGTCT-3′)•d-(5′-AGACGTCTA-3′) (PDB id 1cma17) and d-(5′-GAGACGTCTC-3′)2 (PDB id 1mjm18, bound DNA site). The MetR dodecamer (free DNA site) crystallized in the space group P1 with 3 duplexes in the asymmetric unit. The crystal structure was determined to 2.5 Å (Table 1). The three independent dodecamers are essentially identical in structure and adopt a conformation close to ideal B-form DNA.

Table 1. Crystallographic data and refinement statistics.

Zif268 DNA
decamer (FIN-A)
Zif268 DNA
decamer (FIN-B)
MetR DNA dodecamer purR DNA dodecamer
DNA Duplex 5′-GCGTGGGCAC-3′
3′-CGCACCCGTG-5′
5′-GCGTGGGACC-3′
3′-CGCACCCTGG-5′
5′-GCAGACGTCTG-3′
3′-GTCTGCAGACG-5′
5′-GCAAACGTTTGC-3′
3′-CGTTTGCAAACG-5′

Crystal Parameters
space group P6122 P6122 P1 P3121
a, b, c (Å) 38.26, 38.26, 78.17 38.48, 38.48, 78.23 39.99, 40.37, 40.27 39.64, 39.64, 98.59
α, β, γ (°) 90, 90, 120 90, 90, 120 82.04, 69.13, 81.49 90, 90, 120
X-ray Diffraction
resolution limit (Å) 1.9 2.2 2.5 2.1
temperature (°K) 108 108 108 108
radiation source NSLS X12C NSLS X12C NSLS X26C NSLS X8C; Cu Kα
wavelength (Å) 1.007 1.007 1.1004 1.0722; 1.5418
reflections measured 41269 10824 33594 54956
unique reflections 2905 1752 7790 5576
Rmergea .051 .076 .088 .064
completeness .973 .868 .893 .946
Final Model
asymmetric unit 1 duplex 1 duplex 3 duplexes 1 duplex
content (50% occupancy) (50% occupancy)
number of atoms 434 425 1479 596
solvent atoms 29 18 50 100
bond lengths (Å) 0.017 0.010 0.014 0.002
bond angles (°) 1.79 1.40 1.97 2.07
torsion angles (°) 9.95 10.9 18.5 17.1
Average B factor (Å2) 21.1 23.1 62.7 41.6
R factorb .215 .230 .235 .237
R freec .251 .280 .283 .266
a

Rmerge = ΣhΣk | < Ik − Iki > | / ΣhΣiIk

b

R = Σ ‖Fo| - |Fc‖ / Σ |Fo|

c

R free = ΣT ‖Fo| - |Fc‖ / ΣT |Fo|, where T is the test set comprised of 10% of observations excluded from refinement.

The MetR DNA dodecamer crystal lattice has a somewhat unusual packing arrangement. Two other DNA dodecamers (representing free binding sites for papilloma virus E2 protein) also crystallize in space group P1 with similar lattice dimensions and 3 duplexes in the asymmetric unit;10,19 the MetR DNA dodecamer crystallization condition, however, is unusual in its requirement for cobalt hexamine. Cobalt hexamine ions support the lattice structure by forming bridges between neighboring dodecamers. A similar arrangement has been reported in only one other B-form DNA crystal structure.20

The free MetR DNA dodecamer visually resembles the DNA in the protein-bound complexes, where two symmetry-related anti-parallel β strands of the MetR dimer are inserted into the major groove (Fig. 1). Relative to the free DNA, the bound DNA is smoothly bent towards the protein by ∼25°, resulting in compression of the major groove and expansion of the minor groove. The root-mean-square (r.m.s.) deviation between the free and protein-bound DNA structures is ∼2.0 Å (supplement 2, figure 1).

Figure 1. Methionine repressor DNA binding site in free and protein-bound crystal structures.

Figure 1

(a) The MetR dodecamer (5′-GCAGACGTCTGC-3′)2 adopts a straight B-DNA structure (shown is one of three independent copies). (b) The protein-bound binding site is smoothly bent ∼25° towards the major groove (PDBid 1cma17). The DNA base color scheme is red: adenosine, blue: thymidine, green: guanidine, yellow: cytidine.

To compare the geometry of the free and bound structures, we considered four helical parameters: base pair step roll and twist angles, and major and minor groove widths. Roll and twist angles are commonly used to characterize deformation from idealized DNA structures.3 Major and minor groove widths provide additional measurements of helical geometry.

Plots of base pair step parameters across the common AGACGTCT sequence (Fig. 2) reveal structural patterns that appear to be shared by the free and bound DNA. Particularly striking are alternating regions of high and low values of major groove width and twist angle (Figs. 2a,d). The pattern is especially strong for the free DNA duplexes, where twist angles between consecutive base pair steps can differ by up to 20 degrees. Roll angle also displays some alternating behavior, though to a lesser extent (Fig. 2c). Minor groove width, on the other hand, appears uncorrelated between bound and free structures (Fig. 2b).

Figure 2. Base pair step parameters of the MetR DNA binding site (5′-AGACGTCT-3′)2 in free and bound crystal forms.

Figure 2

(a) Major groove width, (b) Minor groove width, (c) roll angle, (d) twist angle. Free 1, Free 2, Free 3: the three MetR DNA dodecamers in the crystal asymmetric unit; Bound 1, Bound 2: Met repressor/DNA complexes, respectively PDBids 1cma17 and 1mjm.18

To further investigate the structural similarity of the free and bound MetR DNA binding sites, we calculated correlations for base pair step parameters across the common sequence between structure pairs. We first averaged the parameter values of the three free DNA duplexes (the parameter values do not vary significantly and considering each free structure individually does not alter the qualitative results). We then computed the correlation between the average free DNA and each protein-bound DNA structure (bound 1 = 1CMA, bound 2 = 1MJM), as well as between the two bound structures, for each parameter (Fig. 3). The correlation values largely confirm trends apparent in Fig. 2. For every structure pair compared (average free vs. bound 1, average free vs. bound 2, bound 1 vs. bound 2), major groove width patterns (Fig. 3a) and twist angle patterns (Fig. 2d) are highly correlated. Roll angle patterns are also correlated, though to a lesser extent (Fig. 2c). In contrast, there is no consistent trend for minor groove width pattern (Fig. 3b).

Figure 3. Helical parameter correlations in MetR DNA structures.

Figure 3

Correlation coefficients are shown for the helical parameter profiles for free DNA (free = averaged parameters for 3 independent dodecamers) versus bound DNA in 1cma (bound 1) and 1mjm (bound 2). (a) Major groove width, (b) Minor groove width, (c) roll angle, (d) twist angle. Significant correlations are apparent in major groove width and twist angle.

Together, these observations indicate that there is a consistent structural pattern shared by free and MetR bound DNA structures for the major groove width and twist angle base pair steps across the sequence AGACGTCT. The twist pattern follows the predicted trend based on statistical averages for base pair steps (average twist for AG=CT and AC=GT steps is ∼32 degrees, for GA=TC and CG steps, ∼36 degrees 3), but the amplitude of the twist pattern is appreciably stronger, particularly in the free DNA structure. Our findings suggest that the MetR DNA binding site bears an intrinsic sequence-dependent conformational signature that is incorporated into the architecture of the protein-DNA interaction.

The methionine repressor protein-DNA interaction involves an extensive network of direct and water-mediated hydrogen bonds to DNA bases and backbone phosphates, with two side-chains from each β strand contributing direct hydrogen-bonded contacts to the six central base pairs of the met box, GACGTC.17,18 Mutation of the central CG step of the met box to a TA results in a ∼50 fold decrease in binding affinity,21 a strong indication of the sequence specificity of the interaction.

Contacts are also made by the methionine repressor to the sugar-phosphate backbone of the flanking AG=CT sequence of the met box, with phosphates being displaced by ∼2 Å from expected positions for ideal B-DNA. The positions of these phosphates appear to be essential in establishing the specificity of DNA recognition: any mutation of the central CTAG sequence within a double met box tandem operator (AGACGTCTAGACGTCT) decreases binding affinity by ∼80 fold. Based on protein-DNA complex structures alone, it was noted that these phosphate positions correspond to an alternating pattern of low and high twist angles that accompany an alternating contraction and expansion of the major groove.18

In the crystal structure of the free MetR DNA site, we also observe alternating low and high twist angles that coincide with an alternating contraction and expansion of the major groove, and the helical geometries of free and bound AGACGTCT met box sequences are strongly correlated. Our findings suggest that in addition to the backbone-mediated contacts to AG=CT, additional specificity is conferred through a partial “lock and key” fit between the GACGTC of the consensus sequence and the inserted β strands. The alternating twist pattern of the met box provides a surface poised to fit the β strand pair, requiring only the introduction of a small overall bend towards the major groove to achieve full protein-DNA complementarity.

Recognition of the purine repressor binding site

The free PurR DNA dodecamer, d-(5′-GCAAACGTTTGC-3′)2, was extracted from the purine repressor-DNA complex crystal structure DNA sequence, (d-(5′-TACGCAAACGTTTGCGT-3′)2, PDB id 1qpz.22,23 The free PurR dodecamer crystallized in space group P3121 with one complete dodecamer in the crystal asymmetric unit. The crystal structure was determined to 2.1 Å (Table 1). This is a novel crystal form for dodecamer DNA with relatively high solvent content (volume per base pair =1862 Å3); the majority of DNA dodecamers crystallize in space group P212121 (volume per base pair ∼1390 Å3).

The structure adopted by the free PurR dodecamer is also very close to that of ideal B-form DNA (Fig 4a, and Fig. 5, blue squares). The average helical twist between successive pairs is 35.6° and the average rise per base pair is 3.3° resulting in a helix with 10.1 base pairs per turn. The CA=TG steps and the central CG step have slightly larger than average roll angles (∼6°), in agreement with the observation that pyrimidine-purine base pair steps are more deformable than other base pair steps 3. Both short A-tracts (AAA=TTT) feature a narrow minor groove (Fig. 5b) with a strong spine of hydration; the central A:T base pairs of the A-tracts have modestly high propeller twist (∼ -16°).

Figure 4. Purine repressor DNA binding site in free and protein-bound crystal structures.

Figure 4

(a) The PurR dodecamer (5′-GCAAACGTTTGC-3′)2 adopts a straight B-DNA structure. (b) The protein-bound binding site23 is bent ∼50° towards the major groove at the central CG base pair (top right, PDBid 1qpz). The DNA binding region of purine repressor is shown as a semi-transparent surface. The color scheme is as in Fig. 1.

Figure 5. Base pair step parameters of the PurR DNA binding site (5′-AAACGTTT-3′)2 in free and bound crystal forms.

Figure 5

Parameters are plotted vs. base pair steps. (a) Major groove width, (b) Minor groove width, (c) roll angle, (d) twist angle. Free: PurR DNA dodecamer; Bound: Purine repressor/DNA complex, PDBid 1qpz.

In the purine repressor dimer-DNA complex,22,23 the bound DNA displays a sharp kink (Fig. 4b). This severe local deformation is accomplished by intercalation of a pair of leucine residues into the minor groove side of the central CG step and is characterized by a high roll angle for the central CG step (∼50°), reduced twist angles for the central ACGT base pair steps, a narrowed major groove and an enlarged minor groove (Fig. 5).

The central kink in the protein-DNA complex is the main conformational difference between the free and bound forms of the PurR binding site. Superimposition of the atoms of all twelve base pairs of the free and bound DNA structures yields a large r.m.s. deviation (3.7 Å). However, when the bound structures is split at the central CG base pair step and the half-sites are superimposed onto the free structure, the r.m.s. deviation is greatly reduced (1.3-1.4 Å). There is essentially no conformational change between free and bound DNA in the flanking GCAA=TTGC recognized by the purine repressor helix-turn-helix motifs (supplement 2, figure 2).

Three different crystal structures of a decamer duplex with the related sequence d-(CCAACGTTGG)2 have been reported.24,25 The central six base pairs of the free PurR dodecamer and the decamer structures (AACGTT) have very similar conformations (r.m.s. deviations ∼1.2 A, typical values for free B-DNA structures fitted to a fiber B-DNA model4). We conclude that in the free state, the central base pairs of the PurR DNA binding site prefer a conformation close to canonical B-DNA. The free PurR binding site structure is essentially equivalent to the bound PurR binding site structure with the localized kink removed.

ACGT motif within protein binding sites

The PurR and MetR binding sites share the common central tetrad ACGT. PurR bends ACGT dramatically towards the major groove via contacts in the minor groove, while MetR causes a milder deformation in the same direction via contacts in the major groove. ACGT also plays a crucial role in the papillomavirus E2 protein interaction with its binding site (CGACCGACGTCGGTCG).9,10 E2 bends the central ACGT towards the minor grove, with negative roll values at all three central base pair steps. E2 makes no direct contact with ACGT, yet the sequence is essential for recognition. ACGT also forms the center of recognition sites for leucine zipper proteins GCN4 (ATGACGTCAT) and max (TCACGTGA) where the protein-bound DNA has essentially canonical B-DNA form.26,27 These findings suggest that although ACGT tends to adopt B-DNA conformation in the absence protein contact, it is a highly deformable sequence that can be utilized in dramatically different ways.

The extended sequence context of ACGT appears to play an important role in how it is utilized in protein-DNA recognition. ACGT is embedded in three different octamer sequences with free and bound structure representatives, PurR: AAACGTTT, MetR: AGACGTCT, and E2: CGACGTCG. The relatively rigid short flanking A-tracts in the PurR binding site likely aid localization of the protein-induced kink to the central four base pairs, serving as a structural buffer between the central hinge-helices-DNA interaction and flanking helix-turn-helix-DNA interactions. In contrast, flanking pyrimidine-purine CG base pair steps of the E2 binding site extend flexibility, allowing protein-induced deformation to be distributed over the entire octamer. The flanking AG base pair steps of the free MetR binding site extend an alternating twist pattern inherent in the central ACGT (compare Figs. 2d and 5d), forming a distinctive recognition surface that is retained with overall deformation in the protein-DNA complex.

Recognition of Zif268 zinc finger binding sites

The free decamers FIN-A, d-(5′-GCGTGGGCAC-3′)•d-(5′-GTGCCCACGC-3′) and FIN-B, d-(5′-GCGTGGGACC-3′)•d-(5′-GGTCCCACGC-3′) contain binding sites for the zinc-finger containing protein Zif268. The sequences were extracted from crystal structures of Zif268 bound to DNA duplexes d-(5′-AGCGTGGGCAC-3′)•d-(5′-TGTGCCCACGC-3′), PDB id 1a1h, and d-(5′-AGCGTGGGACC-3′)•d-(5′-TGGTCCCACGC-3′), PDB id 1a1f, respectively.28

Both free decamers crystallized in space group P6122 with ½ decamer in the crystal asymmetric unit. By difference Fourier analysis, the crystals were shown to have discrete disorder about the crystallographic 2-fold axis (see Methods). Full decamer duplex structures with ½ occupancy were obtained for FIN-A to 1.9 Å and for FIN-B to 2.2 Å (Table 1). Both decamer structures are similar to canonical A-form DNA29 and to each other (r.m.s. deviations 1.1-1.3 Å). The crystal packing of the free decamers is typical for A-DNA and potentially stabilizes the A-form.30

Free and bound Zif268 DNA binding sites are shown in Fig. 6 (see also supplement 2, figures 3 and 4). The FIN-A and FIN-B DNA decamers adopt A-DNA conformation, though with a narrower minor groove than is typical for A-DNA. In the Zif268-DNA interaction, three consecutive zinc fingers wrap around the 10 base pair consensus site, each inserting one α-helix into the major groove.28 The bound DNA structures are largely B-form, though as reported previously, they have some A-like character, including a wider minor groove than is typical for B-DNA.

Figure 6. Structures of Zif268 zinc-finger DNA binding sites in free and protein-bound crystal forms.

Figure 6

The free (5′-GCGTGGGCAC-3′)•(5′-GTGCCCACGC-3′) Zif268 zinc-finger binding site decamer (FIN-A, top left) and the free (5′-GCGTGGGACC-3′)•(5′-GGTCCCACGC-3′) Zif268 zinc-finger binding site decamer (FIN-B, bottom left) adopt A-DNA structures while the corresponding protein-bound binding sites (top right, PDBid 1a1h; bottom right, PDBid 1a1f)28 adopt B-DNA conformations with A-like character. Color scheme as in Figure 1.

The A-B transition has been investigated in a number of DNA structures, and may be biologically significant, at least as an indication of intrinsic deformability.8,13,31,32,33,34 Two parameters used to describe where a structure lies in the A-B continuum are x-displacement (displacement of a base pair away from the helix axis in the direction of the base pair short axis),13,34,35 and Zp (mean z-coordinate of the backbone phosphorous atoms with respect to individual base pair dimer reference frames).31,36 By both of these criteria the free and bound Zif268 DNA binding sites appear as intermediates, with the free sites closer to canonical A-DNA and the bound sites closer to canonical B-DNA, although the spread is more convincing for base-pair geometry (x-displacement, Fig. 7a) than for backbone conformation (Zp, Fig. 7b).

Figure 7. Base pair step parameters of Zif268 zinc finger free and bound binding sites.

Figure 7

(a) X-displacement; (b) Zp; (c) Roll vs. slide. Values are also plotted for the internal TGGG of the d-(CATGGGCCCATG)2 A/B intermediate DNA structure.38 Values for canonical A-DNA and B-DNA53 are also indicated.

The structural transition from B to A DNA involves an increase in roll angle from 1.5° to 12.0° and a decrease in slide from +0.5 Å to −1.4 Å.36 A plot of roll vs. slide for individual base pairs (Fig. 7c) shows that both free and bound Zif268 DNA binding sites fall between canonical A-DNA and canonical B-DNA conformations. The free Zif268 DNA binding sites have A-DNA-like slide values but intermediate roll values, while the bound Zif268 DNA binding sites have slide and roll values intermediate between canonical B-DNA and canonical A-DNA.

The dodecamer d-(CATGGGCCCATG)2 has been characterized as an A/B intermediate.37,38 The free dodecamer shares with the Zif268 binding sites the internal sequence TGGG; parameters for dodecamer TGGG are included in Figs. 7 and 8 for comparison. As determined by the backbone conformation, the dodecamer TGGG is very close to canonical A-DNA and the free Zif268 binding sites (Fig. 7b). But by base pair geometry criteria, the dodecamer TGGG is clearly an intermediate between the bound and free Zif268 binding sites, both in terms of x-displacement (Fig. 7a) and roll vs. slide (Fig. 7c).

Figure 8. Comparison of free and bound A/B intermediate DNA structures.

Figure 8

Average roll (degrees) vs. average slide (Ångstroms) are plotted per duplex for free and bound Zif268 binding sites (red triangles, red diamonds) and free and bound binding sites for Runt (cyan triangles, cyan diamond). Two clusters formed by the Zif268 and runt binding site structures are indicated with ovals. For reference, average roll vs. average slide are also plotted for the TGGG-containing A/B intermediate (green triangle, PDB id 1dc0),38 and A/B intermediate structures of d-(GGCGCC) derivatives34 (grey dots; PDB ids 1ih2, 1ih3, 1ih4, 1ih5, 1ih6).

A/B hybrid binding sites

An investigation of bound vs. free binding sites for another GC-rich DNA sequence that tends to adopt A-like conformation, the runt-domain binding site (TGCGGTC), has been carried out by the Shakked group.11,13 Even though details of the protein-DNA interaction differ (IgG-like runt domains bind to the major groove of their 7 base pair recognition site via a beta-hairpin), the results from the runt-domain system strongly parallel ours for the Zif268 system. Free runt-domain DNA binding sites were observed in two distinct A/B intermediate conformations: in nonamer duplexes, TGCGGTC adopts A-like structures,13 while in a dodecamer duplex, the same sequence is closer to B-DNA conformation, albeit with some A-like character.11 TGCGGTC in runt-domain bound DNA duplexes is also close to B-DNA conformation, but with somewhat more A-like character than the free dodecamer.

Using average roll vs. slide as an indicator, runt-domain binding sites and Zif268 binding sites cluster into two distinct A/B intermediate groups within the A-B continuum (Fig. 8). One free site (the runt-domain dodecamer) plus all four bound sites from the two systems form a cluster with conformations closer to B-DNA; all other free sites form a second cluster with conformations closer to A-DNA. The A/B intermediate d-(CATGGGCCCATG)2 dodecamer38 and A/B intermediate d-(GGCGCC)2 derivative structures34 form a third (uncircled) cluster at the lower-left corner of the roll-slide plot, with positions intermediate between the two runt/Zif268 binding site clusters.

The Zif268 and runt-domain binding sites both feature central G-tracts, which are commonly found in crystals of A-DNA, but are rarely found in crystals of B-DNA.30 The triplet GGG, in particular, has not yet been crystallized in canonical B-DNA conformation except within protein-DNA complexes. Both of these DNA-binding sequences have base pair step trimers that have been defined as A-philic (GGG for Zif268 and GGT for runt-domain) and both also lack trimers defined as B-philic.39 We conclude that the particular DNA sequences in these two binding sites strongly influence local DNA conformation and deformability, enabling hybrid A/B conformations closer to B-DNA or closer to A-DNA, depending on the specific environmental condition. We predict that either binding site can transition between A/B intermediate forms via a small energy barrier, and that either conformation can exist in solution.

Conclusions

Using a structural genomics approach, we targeted cognate DNA binding sites of known protein-DNA complexes and obtained four new crystal structures of free DNA binding sites, corresponding to three distinct protein-binding sites. By this method we have significantly increased the number of DNA recognition motifs of proteins for which both bound and free structures are available.

For each of the three recognition motifs examined, the DNA is recognized in a fundamentally different manner, yet in each case we observed significant structural correlations between free and bound structures. The results strongly underscore the importance of DNA sequence context in protein-DNA recognition.

In the case of the MetR binding site, a distinctive geometric pattern with alternating twist angles, roll angles, and major groove width is shared by the free and bound DNA structures. Strong correlations in the helical geometry of free and bound DNA structures are reported here for the first time and provide evidence for a structural code of DNA recognition in the form of a constrained geometry in the major groove.

In the case of the PurR binding site, the free DNA duplex crystallized in a nearly ideal B form, nearly identical to the bound DNA duplex but without the prominent central kink present in the protein-DNA complex. The observation that the free DNA structure exhibits no kink is consistent with the current model that indirect readout occurs primarily through protein induced bending. Short flanking A-tracts provide conformational rigidity to help isolate the central deformation.

In the case of the Zif268 binding sites, the free structures reveal an intrinsic tendency to adopt hybrid A/B DNA structure, facilitating recognition and binding of the target transcription factor, which also recognizes an A/B intermediate conformation. The Zif268 binding sites, like those for runt,13 demonstrate that the intrinsic tendency of certain DNA sequences to adopt A/B hybrid conformations can play an important role in protein-DNA recognition. Deformation in the A-B continuum represents an alternative strategy to deformation by bending for optimization of complementarity between protein and DNA binding surfaces.

Materials and Methods

Selection of target DNA sequences

Fifty DNA decamer and dodecamer duplexes were designed for crystallization trials based on sequence recognition motifs represented in one or more known protein-DNA crystal structures. The DNA duplexes were screened for crystallization under a standard set of conditions. Sequences are listed in Supplement Table I, and further descriptions of the design strategy, screening efforts, and results are provided in accompanying text.

DNA synthesis and annealing

DNA oligonucleotides purified by C-18 dual reverse-phase HPLC were purchased from Integrated DNA Technologies, Inc (IDT). DNA duplexes were prepared in 5 mM sodium cacodylate buffer (pH 6.5), 10 mM MgCl2, and 5 mM EDTA to yield final concentrations of 0.5-1.0 mM duplex. The duplexes were annealed by heating at 90 °C for 5 minutes, and then slowly cooled to 25 °C over a period of 12 to 15 hours.

Data Collection, Structure Determination and Analysis

All crystals were prepared by hanging drop vapor diffusion and were mounted on nylon loops and flash-cooled to 100 °K. Except where noted below, X-ray diffraction data were collected at the Brookhaven National Laboratory National Synchrotron Light Source (NSLS). Intensity data were processed using DENZO and SCALEPACK 40, and model refinement was performed with CNS41 using maximum likelihood refinement, bulk solvent correction, anisotropic temperature factor scaling, and the DNA parameter restraints of Parkinson et al.42 Manual model building was performed using O43 or XtalView.44 Solvent atoms consistently observed in omit maps within 3.4 Å of nitrogen or oxygen atoms were retained in final models. Crystal parameters, diffraction and model refinement statistics are summarized in Table 1. DNA helical parameters were calculated using 3DNA.45 Uncertainties in helical parameters resulting from coordinate error are expected to be small relative to the differences discussed in the main text.45

MetR DNA dodecamer

Crystals of dodecamer (5′-GCAGACGTCTGC-3′)2 were grown in 10% w/v MPD, 40 mM sodium cacodylate buffer (pH 5.5), 20 mM cobalt hexamine, 20 mM MgCl2. The reservoir contained 500 μl of 35-55% MPD. Prior to data collection, a single crystal was equilibrated against a reservoir of 45% MPD. Native X-ray diffraction data were collected to 2.5 Å resolution at NSLS beamline X26C.

The structure was solved by molecular replacement using the program EPMR 46. The coordinates of NDB entry BD000110 (5′-ACCGACGTCGGT-3′)2 were used as the search model. Placement of three duplexes with corrected sequence in the P1 asymmetric unit yielded a starting R-factor of 35 %. After rigid body and simulated annealing refinement, inspection of 2Fo-Fc and Fo-Fc electron density maps revealed the presence of three ordered cobalt hexamine molecules, one in the major groove of each duplex. After completion of refinement with CNS, TLS parameters47 were refined for each independent duplex as a group using Refmac.48

For comparison of base pair step parameters between free and bound MetR binding sites, the elements of the correlation matrix C were calculated as:

Cij=(Ntotxixixjxj)1{base pair steps}(xixi)(xjxj).

Cij gives a quantitative measure of the degree of cross-correlation between the different structures (xi and xj are parameter values for structures i and j, brackets denote an average across the different base pairs, and Ntot is the total number of base pair steps that are considered).

PurR DNA dodecamer

Crystals of dodecamer (5′-GCAAACGTTTGC-3′)2 were grown in 1 mM spermine tetrahydrochloride, 30% MPD, 40 mM magnesium acetate, 50 mM sodium cacodylate buffer (pH 6.0). Before data collection, a single crystal was transferred to 1 mM spermine tetrahydrochloride, 45% MPD, 40 mM magnesium acetate, 50 mM sodium cacodylate buffer (pH 6.0). Native X-ray diffraction data were collected to 2.1 Å at NSLS beamline X8C. To improve completeness at low-resolution, the synchroton data were merged with data measured from a second native crystal using a Quantum 1 CCD detector with mirror-focused CuKα radiation from a rotating anode source.

Multiwavelength anomalous dispersion (MAD) data were collected to 2.1 Å from a single crystal of the bromine derivative (5′-GCAAACGT(5-Br-U)TGC-3′)2 at NSLS X8C. Data were measured at three incident radiation wavelengths corresponding to peak, inflection and remote wavelengths of the bromine K-edge. The two Br atoms were located by inspection of an anomalous difference Patterson map, and the relative positions of the atoms and absolute hand of lattice symmetry were found by difference Fourier methods. Phase refinement was carried out using MLPHARE,49 with the inflection dataset designated as native.

Solvent-flattening and histogram-matching density modification procedures were applied to native amplitudes and MAD-derived phases using the program DM.50 The resulting electron-density map revealed clear density for a duplex of 12 separate base pairs in the P3121 asymmetric unit. A fiber B-DNA dodecamer model generated using 3DNA45 was manually fitted into the density. Initial structure refinement employed a combined amplitude and density-modified phase maximum likelihood target.

Zif268 DNA decamers (FIN-A, FIN-B)

Crystals of FIN-A decamer (5′-GCGTGGGCAC-3′)•(5′-GTGCCCACGC-3′) were grown in 10% w/v MPD, 40 mM sodium cacodylate buffer (pH 5.5), 20 mM cobalt hexamine, 12 mM sodium chloride, and 80 mM potassium chloride. Crystals of FIN-B decamer (5′-GCGTGGGACC-3′)•(5′-GGTCCCACGC-3′) were grown in 10% w/v MPD, 40 mM sodium cacodylate buffer (pH 5.5), 20 mM cobalt hexamine, 80 mM sodium chloride, and 20 mM magnesium chloride. Crystals of FIN-A decamer bromine derivative (5′-GCG(5-Br-U)GGGCAC-3′) •(5′-G(5-Br-U)GCCCACGC-3′) were grown in either 10% w/v MPD, 40 mM sodium cacodylate buffer (pH 5.5), 20 mM cobalt hexamine, 20 mM magnesium chloride or 10% w/v MPD, 40 mM MES (pH 6.0), 20 mM cobalt hexamine, 20 mM magnesium chloride. In all cases, the reservoir was composed of 35% MPD. Prior to data collection, crystals were equilibrated against a reservoir of 45% MPD. Native datasets were collected at NSLS X12C from a single crystal of decamer A to 1.9 Å and from a single crystal of decamer B to 2.2 Å. FIN-A decamer bromine derivative data were collected on an R-axis IV detector with Yale mirror-focused CuKα Rigaku RU100 rotating anode source (statistics not shown). Merging statistics for the native and derivative datasets were consistent with the space group assignment P6122 (or P6522).

Plausible molecular replacement solutions were found for both FIN-A and FIN-B decamer crystals with the program AMORE,51 using the coordinates of the symmetric decamer (5′-GACCGCGGTC-3′)2 (NDB ID AD000152) as the search model. In each case, the molecular two-fold symmetry axis of the search model was found to coincide with a crystallographic two-fold symmetry axis of a P6122 lattice. Difference Fourier maps revealed discrete disorder for both bromine atoms of the FIN-A derivative about the putative crystallographic two-fold, confirming the space group assignment. For refinement, initial coordinate sets were constructed as full DNA duplexes with half occupancy. The starting models yielded R-values of 32% and 35% for FIN-A and FIN-B, respectively. After an initial round of rigid body refinement, alternating rounds of simulated annealing with torsion angle dynamics, energy minimization, B factor refinement of domains, and manual model adjustments were performed. At later stages, B factors were refined for base pair groups, and finally for individual atoms. The final structures have pseudo two-fold symmetry with overlapping base pairs in the crystal lattice sharing the same planes, and overlapping base-pair steps sharing similar roll and slide values. Any error in estimation of helical parameters resulting from ambiguity in atomic positions would be expected to be small.

Supplementary Material

1
2

Acknowledgments

We are grateful for the assistance of Jin Wu, Irina Persikova, Satish Bodige, and Suzanne Richman in carrying out crystallization trials. We thank Leon Flaks, Annie Heroux, and Anand Saxena for their assistance with data collection at National Synchrotron Light Source beamlines X8C, X12C, and X26C, which are supported by the Department of Energy and the National Institutes of Health (NIH). This study was supported by NIH award GM21589 to C.L.L. and H.M.B. A.A.N. was an NIH trainee (GM08319).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Accession codes

The atomic coordinates and structure factors file has been deposited in the PDB with accession codes 1HQ7 for PurR dodecamer (NDB id BD0047), 2B1D for MetR dodecamer (NDB id BD0090), 2B1B for FIN-A decamer (NDB id AD0058), and 2B1C for FIN-B decamer (NDB id AD0059).

References

  • 1.Otwinowski Z, Schevitz RW, Zhang RG, Lawson CL, Joachimiak A, Marmorstein RQ, Luisi BF, Sigler PB. Crystal structure of trp repressor/operator complex at atomic resolution. Nature. 1988;335:321–329. doi: 10.1038/335321a0. [DOI] [PubMed] [Google Scholar]
  • 2.Lawson CL, Berman HM. Indirect Readout of DNA Sequence by Proteins. In: Rice PA, Correll CC, editors. Protein-Nucleic Acid Interactions. Royal Society of Chemistry; Cambridge: 2008. pp. 66–86. [Google Scholar]
  • 3.Olson WK, Gorin AA, Lu XJ, Hock LM, Zhurkin VB. DNA sequence-dependent deformability deduced from protein-DNA crystal complexes. Proc Natl Acad Sci USA. 1998;95:11163–11168. doi: 10.1073/pnas.95.19.11163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Jones S, van Heyningen P, Berman HM, Thornton JM. Protein-DNA interactions: A structural analysis. J Mol Biol. 1999;287:877–896. doi: 10.1006/jmbi.1999.2659. [DOI] [PubMed] [Google Scholar]
  • 5.Drew HR, Wing RM, Takano T, Broka C, Tanaka S, Itakura K, Dickerson RE. Structure of a B-DNA dodecamer: conformation and dynamics. Proc Natl Acad Sci USA. 1981;78:2179–2183. doi: 10.1073/pnas.78.4.2179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kim Y, Grable JC, Love R, Greene PJ, Rosenberg JM. Refinement of Eco RI endonuclease crystal structure: a revised protein chain tracing. Science. 1990;249:1307–1309. doi: 10.1126/science.2399465. [DOI] [PubMed] [Google Scholar]
  • 7.Lawson CL, Carey J. Tandem binding in crystals of a trp repressor/operator half-site complex. Nature. 1993;366:178–182. doi: 10.1038/366178a0. [DOI] [PubMed] [Google Scholar]
  • 8.Shakked Z, Guzikevich-Guerstein G, Frolow F, Rabinovich D, Joachimiak A, Sigler PB. Determinants of repressor operator recognition from the structure of the trp operator binding site. Nature. 1994;368:469–473. doi: 10.1038/368469a0. [DOI] [PubMed] [Google Scholar]
  • 9.Hegde RS, Grossman SR, Laimins LA, Sigler PB. Crystal structure at 1.7 Å of the bovine papillomavirus-1 E2 DNA-binding domain bound to its DNA target. Nature. 1992;359:505–512. doi: 10.1038/359505a0. [DOI] [PubMed] [Google Scholar]
  • 10.Rozenberg H, Rabinovich D, Frolow F, Hegde RS, Shakked Z. Structural code for DNA recognition revealed in crystal structures of papillomavirus E2-DNA targets. Proc Natl Acad Sci USA. 1998;95:15194–15199. doi: 10.1073/pnas.95.26.15194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bartfeld D, Shimon L, Couture GC, Rabinovich D, Frolow F, Levanon D, Groner Y, Shakked Z. DNA recognition by the RUNX1 transcription factor is mediated by an allosteric transition in the RUNT domain and by DNA bending. Structure. 2002;10:1395–407. doi: 10.1016/s0969-2126(02)00853-5. [DOI] [PubMed] [Google Scholar]
  • 12.Bravo J, Li Z, Speck NA, Warren AJ. The leukemia-associated AML1 (Runx1)--CBF beta complex functions as a DNA-induced molecular clamp. Nat Struct Biol. 2001;8:371–8. doi: 10.1038/86264. [DOI] [PubMed] [Google Scholar]
  • 13.Kitayner M, Rozenberg H, Rabinovich D, Shakked Z. Structures of the DNA-binding site of Runt-domain transcription regulators. Acta Crystallographica Section D-Biological Crystallography. 2005;61:236–246. doi: 10.1107/S0907444904032378. [DOI] [PubMed] [Google Scholar]
  • 14.Tahirov TH, Inoue-Bungo T, Morii H, Fujikawa A, Sasaki M, Kimura K, Shiina M, Sato K, Kumasaka T, Yamamoto M, Ishii S, Ogata K. Structural analyses of DNA recognition by the AML1/Runx-1 Runt domain and its allosteric control by CBFbeta. Cell. 2001;104:755–67. doi: 10.1016/s0092-8674(01)00271-9. [DOI] [PubMed] [Google Scholar]
  • 15.Chen YQ, Ghosh S, Ghosh G. A novel DNA recognition mode by the NF-kB p 65 homodimer. Nat Struct Biol. 1998;5:67–73. doi: 10.1038/nsb0198-67. [DOI] [PubMed] [Google Scholar]
  • 16.Huang DB, Phelps CB, Fusco AJ, Ghosh G. Crystal structure of a free kappa B DNA: Insights into DNA recognition by transcription factor NF-kappa B. Journal of Molecular Biology. 2005;346:147–160. doi: 10.1016/j.jmb.2004.11.042. [DOI] [PubMed] [Google Scholar]
  • 17.Somers WS, Phillips SEV. Crystal structure of the met repressor-operator complex at 2.8 Å resolution reveals DNA recognition by β-strands. Nature. 1992;359:387–393. doi: 10.1038/359387a0. [DOI] [PubMed] [Google Scholar]
  • 18.Garvie CW, Phillips SEV. Direct and indirect readout in mutant Met repressor-operator complexes. Structure Fold Des. 2000;8:905–914. doi: 10.1016/s0969-2126(00)00182-9. [DOI] [PubMed] [Google Scholar]
  • 19.Hizver J, Rozenberg H, Frolow F, Rabinovich D, Shakked Z. DNA bending by an adenine--thymine tract and its role in gene regulation. Proc Natl Acad Sci U S A. 2001;98:8490–5. doi: 10.1073/pnas.151247298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gao YG, Robinson H, Sanishvili R, Joachimiak A, Wang AHJ. Structure and recognition of sheared tandem G center dot A base pairs associated with human centromere DNA sequence at atomic resolution. Biochemistry. 1999;38:16452–16460. doi: 10.1021/bi9914614. [DOI] [PubMed] [Google Scholar]
  • 21.Wild CM, McNally T, Phillips SE, Stockley PG. Effects of systematic variation of the minimal Escherichia coli met consensus operator site: in vivo and in vitro met repressor binding. Mol Microbiol. 1996;21:1125–35. doi: 10.1046/j.1365-2958.1996.00055.x. [DOI] [PubMed] [Google Scholar]
  • 22.Glasfeld A, Koehler AN, Schumacher MA, Brennan RG. The role of lysine 55 in determining the specificity of the purine repressor for its operators through minor groove interactions. J Mol Biol. 1999;291:347–361. doi: 10.1006/jmbi.1999.2946. [DOI] [PubMed] [Google Scholar]
  • 23.Schumacher MA, Choi KY, Zalkin H, Brennan RG. Crystal structure of LacI family member, PurR, bound to DNA: minor groove binding by alpha helices. Science. 1994;266:763–770. doi: 10.1126/science.7973627. [DOI] [PubMed] [Google Scholar]
  • 24.Chiu TK, Dickerson RE. 1 Å crystal structures of B-DNA reveal sequence-specific binding and groove-specific bending of DNA by magnesium and calcium. J Mol Biol. 2000;301:915–945. doi: 10.1006/jmbi.2000.4012. [DOI] [PubMed] [Google Scholar]
  • 25.Privé GG, Yanagi K, Dickerson RE. Structure of the B-DNA decamer C-C-A-A-C-G-T-T-G-G and comparison with isomorphous decamers C-C-A-A-G-A-T-T-G-G and C-C-A-G-G-C-C-T-G-G. J Mol Biol. 1991;217:177–199. doi: 10.1016/0022-2836(91)90619-h. [DOI] [PubMed] [Google Scholar]
  • 26.Ferré-D'Amaré AR, Prendergast GC, Ziff EB, Burley SK. Recognition by max of its cognate DNA through a dimeric b/HLH/Z domain. Nature. 1993;363:38–45. doi: 10.1038/363038a0. [DOI] [PubMed] [Google Scholar]
  • 27.Keller W, Konig P, Richmond TJ. Crystal structure of a bZIP/DNA complex at 2.2 Å: Determinants of DNA specific recognition. J Mol Biol. 1995;254:657–667. doi: 10.1006/jmbi.1995.0645. [DOI] [PubMed] [Google Scholar]
  • 28.Elrod-Erickson M, Benson TE, Pabo CO. High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition. Structure. 1998;6:451–464. doi: 10.1016/s0969-2126(98)00047-1. [DOI] [PubMed] [Google Scholar]
  • 29.Arnott S. Polynucleotide secondary structures: an historical perspective. In: Neidle S, editor. Oxford Handbook of Nucleic Acid Structure. Oxford University Press; Oxford: 1999. pp. 1–38. [Google Scholar]
  • 30.Wahl MC, Sundaralingam M. Crystal structures of A-DNA duplexes. Biopolymers. 1997;44:45–63. doi: 10.1002/(SICI)1097-0282(1997)44:1<45::AID-BIP4>3.0.CO;2-#. [DOI] [PubMed] [Google Scholar]
  • 31.Lu XJ, Shakked Z, Olson WK. A-form conformational motifs in ligand-bound DNA structures. J Mol Biol. 2000;300:819–840. doi: 10.1006/jmbi.2000.3690. [DOI] [PubMed] [Google Scholar]
  • 32.Malinina L, Fernandez LG, Huynh-Dinh T, Subirana JA. Structure of the d(CGCCCGCGGGCG) dodecamer: a kinked A-DNA molecule showing some B-DNA features. J Mol Biol. 1999;285:1679–90. doi: 10.1006/jmbi.1998.2424. [DOI] [PubMed] [Google Scholar]
  • 33.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Vargason JM, Henderson K, Ho PS. A crystallographic map of the transition from B-DNA to A-DNA. Proc Nat Acad Sci USA. 2001;98:7265–7270. doi: 10.1073/pnas.121176898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dickerson RE, Ng HL. DNA structure from A to B. Proc Natl Acad Sci U S A. 2001;98:6986–8. doi: 10.1073/pnas.141238898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Calladine CR, Drew HR. Understanding DNA : the molecule and how it works. 2nd. Academic Press; San Diego: 1997. [Google Scholar]
  • 37.Ng HL, Dickerson RE. Mediation of the A/B-DNA helix transition by G-tracts in the crystal structure of duplex CATGGGCCCATG. Nucleic Acids Res. 2002;30:4061–7. doi: 10.1093/nar/gkf515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ng HL, Kopka ML, Dickerson RE. The structure of a stable intermediate in the A <--> B DNA helix transition. Proc Natl Acad Sci U S A. 2000;97:2035–9. doi: 10.1073/pnas.040571197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Tolstorukov MY, Ivanov VI, Malenkov GG, Jernigan RL, Zhurkin VB. Sequence-dependent B×A transition in DNA evaluated with dimeric and trimeric scales. Biophys J. 2001;81:3409–3421. doi: 10.1016/S0006-3495(01)75973-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Otwinowski Z, Minor W. Processing of X-ray diffraction data collected in oscillation mode. Meth Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
  • 41.Brünger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallographic and NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. 1998;D54:905–921. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
  • 42.Parkinson G, Vojtechovsky J, Clowney L, Brünger AT, Berman HM. New parameters for the refinement of nucleic acid containing structures. Acta Crystallogr. 1996;D52:57–64. doi: 10.1107/S0907444995011115. [DOI] [PubMed] [Google Scholar]
  • 43.Jones TA, Zou JY, Cowan SW, Kjeldgaard Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr A. 1991;47(Pt 2):110–9. doi: 10.1107/s0108767390010224. [DOI] [PubMed] [Google Scholar]
  • 44.McRee DE. XtalView/Xfit--A versatile program for manipulating atomic coordinates and electron density. J Struct Biol. 1999;125:156–65. doi: 10.1006/jsbi.1999.4094. [DOI] [PubMed] [Google Scholar]
  • 45.Lu XJ, Olson WK. 3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. Nucleic Acids Res. 2003;31:5108–21. doi: 10.1093/nar/gkg680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kissinger CR, Gehlhaar DK, Fogel DB. Rapid automated molecular replacement by evolutionary search. Acta Crystallogr D Biol Crystallogr. 1999;55(Pt 2):484–91. doi: 10.1107/s0907444998012517. [DOI] [PubMed] [Google Scholar]
  • 47.Winn MD, Isupov MN, Murshudov GN. Use of TLS parameters to model anisotropic displacements in macromolecular refinement. Acta Crystallogr D Biol Crystallogr. 2001;57:122–33. doi: 10.1107/s0907444900014736. [DOI] [PubMed] [Google Scholar]
  • 48.Murshudov GN, Vagin AA, Dodson EJ. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. 1997;D53:240–255. doi: 10.1107/S0907444996012255. [DOI] [PubMed] [Google Scholar]
  • 49.Otwinowski Z. Maximum likelihood refinement of heavy atom parameters. In: Wolf W, Evans PR, Leslie AGW, editors. Proceedings of the CCP4 study weekend. SERC Daresbury Laboratory; Warrington, UK: 1991. pp. 80–86. [Google Scholar]
  • 50.Cowtan K. dm: An automated procedure for phase improvement by density modification. Joint CCP4 and ESF-EACBM Newsletter on Protein Crystallography. 1994;31:34–38. [Google Scholar]
  • 51.Navaza J. AMoRe: an automated package for molecular replacement. Acta Crystallogr. 1994;A50:157–163. [Google Scholar]
  • 52.Finley JB, Luo M. X-ray crystal structures of half the human papilloma virus E2 binding site: d(GACCGCGCTC) Nucleic Acids Res. 1998;26:5719–5727. doi: 10.1093/nar/26.24.5719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chandrasekaran R, Arnott S. In: Landolt-Börnstein numerical data and functional relationships in science and technology, group VII/1b, nucleic acids. Saenger W, editor. Springer-Verlag; Berlin: 1989. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2

RESOURCES