Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Feb 17.
Published in final edited form as: Biochemistry. 2009 Feb 17;48(6):1315–1321. doi: 10.1021/bi8015235

An FTIR Investigation of Flanking Sequence Effects on the Structure and Flexibility of DNA Binding Sites

Talia R Kahn 1,2, Kimberly K Fong 1, Brian Jordan 1, Janista C Lek 1, Rachel Levitan 1,3, Patrick S Mitchell 1,4, Corrina Wood 1,5, Mary E Hatcher 1,*
PMCID: PMC2701481  NIHMSID: NIHMS91562  PMID: 19166330

Abstract

Fourier transform infrared (FTIR) spectroscopy and a library of FTIR marker bands have been used to examine the structure and relative flexibilities conferred by different flanking sequences on the EcoRI binding site. This approach allowed us to examine unique peaks and subtle changes in the spectra of d(AAAGAATTCTTT)2, d(TTCGAATTCGAA)2, and d(CGCGAATTCGCG)2, and thereby identify local changes in base-pairing, base-stacking, backbone conformation, glycosidic bond rotation and sugar puckering in the studied sequences. The changes in flanking sequences induce differences in the sugar puckers, glycosidic bond rotation and backbone conformations. Varying levels of local flexibility are observed within the sequences in agreement with previous biological activity assays. The results also provide supporting evidence for the presence of a splay in the G4-C9 base pair of the EcoRI binding site and a potential pocket of flexibility at the G4 cleavage site that have been proposed in the literature. In sum, we have demonstrated that FTIR is a powerful methodology for studying the effect of flanking sequences on DNA structure and flexibility, for it can provide information about the local structure of the nucleic acid and the overall relative flexibilities conferred by different flanking sequences.

Keywords: FTIR, DNA, backbone conformations, flexibility, indirect readout


During both replication and transcription, amino acids of the protein factor involved recognize and interact electrostatically with the base pairs of the specific DNA sequence containing the recognition site (direct readout) with remarkable specificity. Upon binding of a protein in this region, a bending of the DNA occurs which enhances the stability of the complex by increasing the number of favorable contacts made between the DNA and the protein 1; 2. The four classifications of these interactions are: (i) van der Waals interactions between aliphatic amino acids and the C5-methyl groups of thymines or the C5-hydrogen groups of cytosines, (ii) hydrogen bonds between the nucleic acids and the amide and carbonyl groups of the peptide backbone, (iii) hydrogen bonding bridges constructed with water molecules between amino acids and base-pairs 3; 4; 5, and (iv) interlacing of amino acid side-chains between two bases 6. Although these are the major interactions, direct contacts at the recognition site alone do not account for the conformation of the DNA-protein complex; base pairs on either side of the recognition site, referred to as flanking sequences, are also thought to play a role in the conformation of the DNA-protein complex (indirect readout) 1; 7; 8.

In most cases, it is necessary for the protein to distort the DNA in order to achieve the transition state. To compensate for the energetic costs of deformation, the DNA-protein complex may make any modification that improves the binding free energy such as releasing water from non-polar surfaces and/or increasing the number of direct contacts made between the DNA and protein 9. DNA that requires less deformation to reach the conformation of the transition state will have a lower free energy of association 10. Similarly, sequences that have an inherently greater distortability will also have a lower binding free energy. Studies examining the effect of flanking sequences on DNA-protein binding have indicated that flanking sequences are involved in dynamics and confer flexibility or rigidity to a sequence 1; 7; 8; 11; 12; 13; 14. Since flexible DNA has a greater distortability, it is able to compensate more effectively and thereby more efficiently recognize and bind proteins and drugs than a similar, but more rigid DNA sequence 15; 16.

Various methods have been commonly used to examine the effect of flanking sequences on DNA including conformational free energy calculations 8, 1H-NMR and 31P-NMR 11; 17; 18, crystal structures 3,Shui, 1998 #36, restriction enzyme digests 12; 13; 19, molecular dynamics simulations 20, UV melting 14; 19, sequence dependent anisotropic bend-ability (SDAB) 21, and CD 14; 19; 22. These techniques have revealed valuable information about the features in DNA that change when flanking sequences are modified including changes in the unwinding and overwinding of the double helix, formation of hydrogen bonds, twist and glycosidic angles, helical parameters, flexibility, and restriction endonuclease cutting rates 3; 11; 12; 20. Although these methods are highly advanced, there is some disparity with regards to which sequences are most flexible and most rigid 11; 12; 21.

In principle, Fourier transform infrared (FTIR) spectroscopy can detect local conformational changes with high sensitivity 16; 23; 24. For example, FTIR has been used to study DNA structural changes caused by CpG methylation 16 and the structural effects of G-tract length 22. Since this technique is non-destructive and provides a “snapshot” of the nucleic acid's structure, we have chosen to apply FTIR along with a library of IR bands compiled by Banyay et al. 23 to interpret major features and changes in nucleic acid structure, conformations and conformational sub-states as a result of different flanking sequences.

MATERIALS AND METHODS

Sample Preparation

The DNA dodecamers studied were obtained from Alpha-DNA (Quebec, Canada). Samples were desalted by centrifugation in Y300 Centricons (Millipore). After desalting, samples were lyophilized overnight and stored at 4°C until analysis.

Fourier Transform Infrared (FTIR) Experiments

Samples for FTIR analysis were prepared by diluting to a final concentration of 3.0 mM in 100 mM sodium phosphate buffer (pH 7.4). Freshly made samples were used for all FTIR measurements. FTIR spectra were recorded at room temperature on a ZnSe crystal using a Thermo Nicolet Avatar 370 ATR-FTIR spectrometer (Thermo Scientific, Massachusetts). A background (atmospheric plus pure buffer) was collected after which DNA samples were run. For each DNA sample, FTIR spectra were taken in triplicate to ensure reproducibility. Experiments were run with a resolution of 4 cm-1 and 512 scans. Using EZ-OMNIC software (Thermo Scientific), all DNA spectra underwent baseline correction and 15 point smoothing before peaks were assigned. Derivative spectra were 25 point smoothed. Final spectra were normalized with respect to the symmetric PO2- stretching mode appearing at 1090-1080 cm-1 25; 26. Spectral assignments were made using the Banyay et al library23. This library provides wavenumber ranges of ~5-10 cm-1 for many assignments so differences less than 10 cm-1 may not be significant.

31P NMR experiments

DNA samples were obtained from Alpha DNA (Montreal, Canada). These were salted with 100 mM phosphate buffer at pH = 7.4 then lyophilized. The samples were re-hydrated in 98% D2O, re-lyophilized and finally dissolved in 99.9% D2O, immediately before use.

2D 1H nuclear Overhauser effect (NOESY) experiments were performed with mixing times of 500 ms and a gradient pulse applied during the mixing time. 4096 data points were collected in the t2 dimension and 512 increments in t1. The matrix was zero-filled to 4096 × 1024 points before double Fourier transformation. Phase sensitive 31P-1H HSQC were recorded using Echo/Antiecho-TPPI gradient selection27; 28; 29 and decoupling during acquisition. All HSQC were performed with 2048 points in the t2 dimension and 256 t1 increments using a relaxation delay of 4 s. The chemical shift of H2O and the center of the DNA phosphorus resonances were set as the center of HSQC and NOESY spectra. A spectral width of 9 ppm was used for all 1D experiments and in both dimensions of the 2D experiments.

Since H2' and H2" are strongly coupled with H1', H3', H5' (sugar protons) and to the base protons, enabling the assignment of protons on the NOESY using a sequential walk30. The strong correlations between 31P and H3' and H5'/H4' then enabled us to assign the HSQC using the proton assignments determined from the NOESY.

%BII was determined as a function of the 31P chemical shift using a linear relationship between the observed fast-exchange chemical shift and the %BII conformation 31:

%BII=12.3680.005110TδP(ppm)+1.3450.002031T2.3680.005110T

RESULTS

We studied DNA sequences containing the EcoRI binding site as this binding site has been extensively studied by a number of techniques. In particular, we used the well-characterized Dickerson Sequence32 as a control so we can compare our results with the wealth of structural and dynamical data available in the literature on this sequence. Flanking sequences were chosen based on previous research on the effect of flanking sequences on DNA structure, dynamics and protein-DNA interactions.11; 12; 18; 26 Table 1 lists the EcoRI sequences that were examined using FTIR. The spectral peaks were assigned using the Banyay Library of DNA infrared bands 23.

Table 1.

Sequences of DNA samples studied. The bold lettering indicates the EcoRI binding site.

DNA Sample Sequence
EcoRI AAA/TTT d(AAAGAATTCTTT)2
EcoRI TTC/GAA d(TTCGAATTCGAA)2
EcoRI CGC/GCG d(CGCGAATTCGCG)2

Effect of Flanking Sequences on the Base Region (1800-1500 cm-1)

Figure 1 shows the 1800-1500 cm-1 region for EcoRI binding site containing sequences. This region reports on changes in the bases of nucleic acids.23 The vibrations of base-paired pyrimidines occur between 1670-1630 cm-1. The EcoRI TTC/GAA sequence has a broad range of peaks within this region including distinct peaks at 1666 and 1639 cm-1. The peak at 1666 cm-1 falls into the range for single stranded guanines (1673-1660 cm-1). EcoRI AAA/TTT has similar breadth in this region but differing intensities. EcoRI CGC/GCG has a broad peak with a clear resonance at 1645 cm-1. The strong resonance at 1645 cm-1 is indicative of single stranded cytosines. The breadth and lack of resolution in this region for the EcoRI CGC/GCG sequence may be the result of the large number of CG base pairs in this sequence. A peak at between 1604-1600 cm-1 was present in all samples (although only weakly in EcoRI CGC/GCG) and is also a marker for single stranded guanines. The vibrations of base-paired cytosines occur at 1525 cm-1 in EcoRI TTC/GAA and shifts upfield to 1535 cm-1 in EcoRI AAA/TTT. There is a low intensity, broad resonance in this region for EcoRI CGC, again likely due to the increased number of CG pairs. An intense peak at 1710 cm-1 is evident EcoRI AAA/TTT and EcoRI TTC/GAA, but only weakly present in EcoRI CGC/GCG. The Banyay Library attributes this peak to guanines involved in Hoogsteen binding to a third strand, but this is unlikely in the short DNA sequences at low concentrations studied here. Therefore, these peaks are more likely the carbonyl stretches of bases in A-form 33.

Figure 1.

Figure 1

Spectra for the 1800-1500 cm-1 in-plane base vibrations; EcoRI AAA/TTT (top), EcoRI TTC/GAA (middle), and EcoRI CGC/GCG (bottom).

Effect of Flanking Sequences on the Base-sugar Region (1500-1250 cm-1)

The spectrum for the 1500-1250 cm-1 region demonstrating the effect of flanking sequence changes on the glycosidic bond rotation, the backbone conformation and the sugar puckering modes of nucleic acids is illustrated in Figure 2. In EcoRI CGC/GCG, sugar ring vibrations of purines occur as a weak, broad peak centered at 1490 cm-1. This becomes two peaks at 1493 and 1486 cm-1 in EcoRI TTC/GAA and weakly in EcoRI AAA/TTT. Changes in the position of this band reflect interactions at N7 sites of purines 23. Similarly, the broad peak due to the vibrations of adenines in A-/B-form conformation at 1458 cm-1 in EcoRI CGC/GCG, splits into two distinct peaks at 1465 and 1458 cm-1 in EcoRI AAA/TTT. This peak shifts to 1442 cm-1 in EcoRI TTC/GAA. In EcoRI AAA/TTT there is a peak at 1420 cm-1, which corresponds to S-type sugars in B-form helices; this marker is weakly present in EcoRI TTC/GAA and EcoRI CGC/GCG. A broad peak encompassing the Z-form marker at 1434-1440 cm-1 is present in EcoRI TTC/GAA. Peaks at 1373 and 1365 cm-1 for purines in anti form are present in EcoRI AAA/TTT and a broad resonance encompassing this region is present in EcoRI TTC/GAA. A weak, broad resonance at 1371 cm-1 is present in EcoRI CGC/GCG. This shift suggests modified glycosidic torsion angles 16. In EcoRI AAA/TTT and EcoRI TTC/GAA, a marker for thymines with S-type sugars and a glycosyl torsional angle χ in anti conformation is evident at 1328 and 1323 cm-1, respectively, but absent in EcoRI CGC/GCG. A marker corresponding to C4NH2 vibrations of a cytosine occurs in all samples between 1297-1287 cm-1. The CN3H bends of thymines with S-type sugars and anti conformations occurs in EcoRI AAA/TTT at 1282 cm-1; however, this marker appears only very weakly in EcoRI TTC/GAA and not at all in EcoRI CGC/GCG.

Figure 2.

Figure 2

Spectra for the 1500-1250 cm-1 base-sugar vibrations; EcoRI AAA/TTT (top), EcoRI TTC/GAA (middle) and EcoRI CGC/GCG (bottom).

Effect of Flanking Sequences on the sugar-phosphate Region (1250-1000 cm-1)

Figure 3 illustrates marker bands between 1250-1000 cm-1 that identify changes in the backbone conformation. All EcoRI sequences exhibited a marker at 1216 cm-1, which corresponds to B-form DNA. All three samples also demonstrate symmetric PO2- vibrations that are insensitive to the B-to-A transition between 1082-1076 cm-1. The shoulder to this symmetric PO2- peak at 1051 cm-1 is evident in EcoRI AAA/TTT and EcoRI TTC/GAA but is subsumed into a broad resonance in EcoRI CGC/GCG. This vibration is attributed to the CO stretch of the backbone and is strongly enhanced in Z-form DNA.25; 26 In EcoRI TTC/GAA, a marker for furanose vibrations in Z-DNA is evident at 1025 cm-1. Finally, all three sequences have a weak marker at 1172 cm-1, an A-form marker due to C3' endo/anti puckering.

Figure 3.

Figure 3

Spectra for the 1250-1000 cm-1 sugar-phosphate vibrations; EcoRI AAA/TTT (top), EcoRI TTC/GAA (middle) and EcoRI CGC/GCG (bottom).

Effect of Flanking Sequences on the Sugar Region (1000-800 cm-1)

The effects of sugar conformation on EcoRI spectra provided in the 1000-800 cm-1 region are depicted in Figure 4. Careful examination of this region, particularly between 900-800 cm-1, was required because important structural differences often manifested as minor spectral changes.

Figure 4.

Figure 4

Spectra for the 1000-800 cm-1 sugar vibrations; EcoRI AAA/TTT (top), EcoRI TTC/GAA (middle), and EcoRI CGC/GCG (bottom).

To aid in characterizing peaks contributing to the broad resonances observed, second derivative spectra for this region are provided in Figure 5 16. A significant peak at 970 cm-1 correlating to the CC stretch of the B-form backbones is present in EcoRI AAA/TTT and EcoRI TTC/GAA. EcoRI CGC/GCG and to a much less intensity, EcoRI AAA/TTT demonstrate a resonance at 993 cm-1. The derivative spectra show that EcoRI AAA/TTT has breadth but all of its intensity is localized about the 970 cm-1 peak. EcoRI TTC/GAA has a peak at 995 cm-1 and a broader one at 969 cm-1 with additional resolution.

Figure 5.

Figure 5

The second derivative spectra of the FTIR region reporting on the backbone and sugar conformations. EcoRI AAA/TTT (top), EcoRI TTC/GAA (middle) and EcoRI CGC/GCG (bottom).

EcoRI CGC/GCG has a narrow peak at 995 cm-1 and a broad peak at 985 cm-1. This suggests a large variety of backbone conformations in these samples and some dynamic averaging between them. The backbone dynamics of the EcoRI CGC/GCG sequence have been previously studied and demonstrate facile interconversion of the BI and BII states in the C3pG4 and C9pG10 steps.31; 34 Figure 6 shows the %BII as a function of backbone step for each sequence as determined by 31P NMR. This plot demonstrates that the EcoRI AAA/TTT samples mainly BI backbone conformations across the entire sequence whereas EcoRI TTC/GAA shows steps with dramatic %BII character. This is in agreement with the FTIR data showing a broad resonance at 970 cm-1 for EcoRI AAA/TTT and resonances at varying backbone conformations for EcoRI TTC/GAA. The EcoRI CGC/GCG sequence also shows variation in backbone conformation, again agreeing with the variety and breadth of backbone resonances apparent in its FTIR spectrum.

Figure 6.

Figure 6

The variation in backbone conformation (reported as %BII) as a function of nucleotide step for EcoRI AAA/TTT (black), EcoRI TTC/GAA (red) and EcoRI CGC/GCG (black).

The marker for adenine-thymine base pairs in B-form helices occurs in all samples at 928-931 cm-1 (Figure 4). In addition, the deoxyribose peak at 898-893 cm-1 is evident in all samples but broadest in the EcoRI CGC/GCG sequence. Broad peaks centered between 839-835 cm-1, arising from S-type sugar puckers, are evident in three samples. The derivative spectra (Figure 5) show an almost equal population into two S-types markers at 829 and 835 cm-1 as well as peaks at 815 and 804 cm-1, indicative of N-type sugars in EcoRI CGC/GCG. Again, these results demonstrating varying furanaose structure and the presence of furnanose dynamics are consistent with previous studies of the furanose structure and dynamics in the EcoRI CGC/GCG sequence. 34; 35; 36 The EcoAAA/TTT and EcoRI TTC/GAA sequences demonstrate peaks at 846, 836 and 804 cm-1 and an O4' endo sugar puckering marker at 825 cm-1.16

DISCUSSION

Our findings demonstrate that one can use FTIR and the library of infrared marker bands compiled by Banyay et al.23 to determine the structural and dynamical changes conferred by different flanking sequences. This approach allowed us to examine unique peaks and subtle changes in the spectra of EcoRI binding sites that have been flanked by various tri- nucleotides, and thereby identify local changes in base-pairing, base-stacking, backbone conformation, glycosidic bond rotation and sugar puckering in the studied sequences. We included the analysis of the well-studied Dickerson dodecamer in this investigation so that structural features observed via FTIR could be directly compared with previously reported NMR, X-Ray and Molecular Dynamics investigations of this sequence.

Variations in the nucleotide sequence produce different vibrations within both the direct and indirect readout regions. Based on the peak assignments in the 1800-1500 cm-1 region, most of the nucleotides in the EcoRI sequences are double stranded. However, in all dodecamers studied peaks between 1673-1660 and 1608-1600 cm-1, corresponding to a single stranded guanine nucleotide, are observed. These peaks are even evident in the EcoRI AAA/TTT sample that contains only one guanine nucleotide adjacent to the EcoRI restriction endonuclease cutting site (AAAG↑AATTCTTT). This finding supports the presence of a splay in the G4-C9 base pair in the EcoRI cutting site, which was described by Drew et al. and seen in NMR and Molecular Dynamics studies 32; 35; 37 in the crystal structure of the B-form helix. We also observed a number of changes in base-stacking that are attributed to the identities and positions of the bases in the flanking sequences of these dodecamers. Moreover, they can be credited to vibrations from the flanking sequences affecting the nucleotides within the GAATTC binding site. For example, in EcoRI TTC/GAA, the vibrations of base-paired cytosines occur at 1525 cm-1 and shift downfield to 1535 cm-1 in EcoRI AAA/TTT, a change that may be attributed to a decrease in the percentage of cytosines in the dodecamer. This region does show significant variations in the base stacking patterns of samples with different flanking sequences. This variation in base stacking as a function of flanking sequence is in agreement with the FTIR results of Lindqvist and Graslund.22. It has been previously demonstrated that changes in base stacking effects are directly related to backbone conformations.38; 39; 40 The current FTIR results of the backbone resonance breadth and 31P NMR chemical shift data are thus in keeping with the variety of base stacking resonances we observe.

Changes in the endocyclic torsion angles of the furnaose ring are evident by studying the 1000-800 cm-1 region of the FTIR spectra. Our results show that all sequences contain both N-type and S-type sugars and that the S-type sugars are well resolved into C2' endo and O4' endo peaks. The EcoRI CGC/GCG sequence has broad sugar markers that demonstrate a variety of conformations in the derivative spectrum. Such furanose dynamics have been directly observed in the EcoRI CGC/GCG sample using solid state 2H NMR.34 All sequences have the B-form backbone marker at 970 cm-1 with the marker in the EcoRI TTC/GAA and EcoRI CGC/GCG sequences being broad and possessing a backbone resonance at 995 cm-1. The derivative spectra shown in Figure 5 demonstrate various backbone states in these samples. We attribute this to the presence of the BI and BII conformational substates of the backbone 41; 42, The breadth of the peak in the EcoRI CGC/GCG sequence is in agreement with recent 31P dynamic NMR results in the EcoRI CGC/GCG sequence that demonstrate facile backbone dynamics at the C3G4 and C9G10 backbone steps.31

The region between 1250-1000 cm-1 is sensitive to vibrations along the sugar-phosphate backbone, and these marker bands provide information about the overall conformation of the double helix 23. With the vibrations we observed in this region, we have been able to conclude that all EcoRI dodecamers exist predominantly in the B-form conformation as they exhibit peaks between 1224-1221 cm-1. Notably, these peaks are broad, again suggesting the sampling of different backbone conformations. Despite the fact that the sequences all favor the B-form as their primary backbone conformation, the EcoRI AAA/TTT and TTC/AAG sequences exhibit B-, A-, and Z-form conformation markers in the 1500-1250 cm-1 region as well as N-type sugar puckering, in the case of the EcoRI AAA/TTT sequence. A mixture of these markers can be attributed to fraying of the helix near the ends of the sequences, since the two other regions of interest of the spectra (1250-1000 and 1000-800 cm-1) also exhibit a dominant B-form. However, it is more likely that the flanking sequences of these dodecamers confer rigidity or flexibility to these sequences. For example, the Z-type marker at 1434-1440 cm-1 is present in EcoRI TTC. The N-type sugar marker at 1406 cm-1 in EcoRI AAA/TTT that became much weaker in EcoRI TTC/GAA and completely disappeared in EcoRI CGC/GCG may indicate that EcoRI AAA/TTT and EcoRI TTC/GAA tend to exhibit more of the rigid A-form and Z-form conformations whereas EcoRI CGC/GCG primarily favors the flexible B-form. The marker for purines in the anti conformation appears at 1373 and 1365 cm-1. The strong presence of this peak in EcoRI AAA/TTT is likely due to the A-tract flanking sequence. The EcoRI TTC/GAA sequence has a broad range of peaks in this region encompassing the syn marker at 1323 cm-1. The EcoRI CGC/GCG sequence shows far less resolution in this region, again suggesting a more dynamic sugar ring.

Our findings suggest that the EcoRI AAA/TTT and EcoRI TTC/GAA sequences are more similar to each other than to the EcoRI CGC/GCG dodecamer in the 1800-1500 cm-1 and 1500-1250 cm-1 base-reporting regions. The EcoRI CGC/GCG spectrum is much smoother with broader peaks and more poorly defined marker bands than the other sequences. This evidence, along with the observation that local nucleotides in the rigid A-form and Z-forms occur more frequently within the EcoRI AAA/TTT and EcoRI TTC/GAA sequences, suggests that EcoRI CGC/GCG is the most flexible sequence.

In fact, the FTIR regions reporting on the backbone and sugar moieties all show increased breadth for the EcoRI CGC/GCG sequence. This is in keeping with the previous 31P31 and 2HNMR34 work on EcoRI CGC/GCG and the work of Windolph et al who demonstrated increased cleavage rates of the K130E mutant of EcoRI on sequences with `favorable' flanking sequences 12. The co-crystal structure of EcoRI bound the EcoRI CGC/GCG sequence shows that there are no direct base contacts to the flanking sequences 43 so these sequences must instead modulate the DNA structure or flexibility. The Windolph study used known algorithms 44; 45 to study this modulation and found that there was no real difference in the stacking energies of their starting pool and their `best cut' or `worst cut' pools, indicating that stacking energies are not responsible for the changes in cutting rates. However, they did demonstrate increased flexibility of their best cut pool using the flexibility algorithms of Satchwell and Travers 46. We have shown this flexibility in the EcoRI CGC sequence using 31P31 and 2H NMR.34 Here, we directly demonstrate this increased flexibility as a function of flanking sequence using FTIR.

We have been able to conclude this using FTIR as this methodology provides a “snapshot” of the nucleic acid sequences where sharp resonances are the result of defined conformations while broad ones suggest dynamical averaging on the FTIR timescale. That flanking sequences are able to affect a sequence's flexibility is in keeping with those of Lefebvre et al.18 who used NMR to study the effect of modified flanking sequences on a CpG step and found that overall d(CATCGATG)2 sequence was more flexible than the d(CTTCGAAG)2 octamer. Similarly, Cordier et al.,11 who used 1H-NMR and 31P-NMR, found that A and T tracts of three base pairs on either side of the CRE sequence, d(GAAAACGTTTTC)2, induced rigidity in this sequence as compared to flexibility observed in d(ATGACGTCAT)2. This confirmation of differing flexibility due to flanking sequence has important biological ramifications. It is known from the cocrystal structure of the EcoRI restriction endonucelase in complex with its recognition sequence that this protein-DNA interaction utilizes an induced fit mechanism 47. The neokinks, distortions in the DNA structure upon complexation, incur an energetic cost and this cost is presumed to vary based on surrounding sequences 9. Thus, the changes in cleavage rates observed by Windolph were suggested to be caused by altered DNA flexibility caused by flanking sequence effects. Here, we have provided direct experimental evidence supporting that hypothesis.

In conclusion, we have shown variations in FTIR spectra that efficiently provide preliminary information about the specific local structural response of the EcoRI binding sites to changes in the tri- nucleotides flanking these sites. In particular we found that different flanking sequences can (1) promote A- or Z- type structures in primarily B-form DNA sequences, (2) alter backbone and sugar conformations and (3) promote dynamical averaging of backbone and furanose conformations. While these results are consistent with previous research 11; 18, we recognize that this is an underdetermined problem and we cannot confidently identify the source of IR band shifts. We plan to continue this investigation using our recent 31P dynamic NMR method 31 to provide the quantitative dynamic data FTIR cannot yield. However, the application of this method to the well-studied EcoRI CGC/GCG sequence provides confirmation for the usefulness of FTIR in demonstrating the flexibility of the DNA backbone. In addition, by using the EcoRI CGC/GCG sequence as a control, we have demonstrated that our FTIR results correspond to the structural features determined using other techniques.31; 32; 34; 35; 48 Thus, we have shown that FTIR provides a rapid and efficient method for examining a large number of DNA sequences in order to uncover trends and to determine which samples are worthy of further investigation by more quantitative techniques.

The obvious changes in FTIR spectra caused by altering flanking sequences confirm that flanking sequences have an important role in local structure and flexibility in DNA sequences. This is of tremendous importance because it suggests that the local environment of the DNA binding site determines its binding affinity. Understanding this phenomenon underscores the important role of indirect readout in protein-DNA recognition events. 15; 16; 42

ACKNOWLEDGEMENTS

This work was supported by a grant from the NIH (1R15CA117891). The authors gratefully acknowledge programming assistance from Aaron Skeers, NMR support from Professor Len Mueller and Yi Tian at UC Riverside and insightful discussions and careful editing by Diana Buckett, Stephanie McCarty and Sarah Primrose.

REFERENCES

  • 1.Grohima M. Influence of DNA stiffness in protein-DNA recognition. J Biotechnol. 2005;117:137–145. doi: 10.1016/j.jbiotec.2004.12.016. [DOI] [PubMed] [Google Scholar]
  • 2.Harrington R. DNA curving and bending in protein-DNA recognition. Mol. Microbiol. 1992;6:2549–2555. doi: 10.1111/j.1365-2958.1992.tb01431.x. [DOI] [PubMed] [Google Scholar]
  • 3.Narayana N, Ginell SL, Russu IM, Berman HM. Crystal and molecular structure of a DNA fragment: d(CGTGAATTCACG) Biochemistry. 1991;30 doi: 10.1021/bi00232a011. [DOI] [PubMed] [Google Scholar]
  • 4.Otwinowski Z, Schevitz RW, Zhang RG, Lawson CL, Joachimiak A, Marmorstein RQ, Luisi BF, Sigler PB. Crystal structure of trp repressor/operator complex at atomic resolution. Nature. 1988;335:321–329. doi: 10.1038/335321a0. [DOI] [PubMed] [Google Scholar]
  • 5.Shakked Z, Guzikevich-Guerstein G, F. Frolow F, D. Rabinovich D, Joachimiak A, Sigler PB. Determinants of repressor/operator recognition from the structure of the trp operator binding site. Nature. 1994;368:469–473. doi: 10.1038/368469a0. [DOI] [PubMed] [Google Scholar]
  • 6.Carey M, Smale ST. Transcriptional Regulation in Eukaryotes: Concepts, Strategies, and Techniques. Cold Spring Harbor Laboratory Press; New York: 2000. [Google Scholar]
  • 7.Hogan ME, Austin RH. Importance of DNA stiffness in protein-DNA binding specificity. Nature. 1987;329:263–266. doi: 10.1038/329263a0. [DOI] [PubMed] [Google Scholar]
  • 8.Sarai A, Mazur J, Nussinov R, Jernighan RL. Sequence dependence of DNA conformation flexibility. Biochemistry. 1989;28 doi: 10.1021/bi00445a046. [DOI] [PubMed] [Google Scholar]
  • 9.Lesser DR, Kurpiewski MR, Waters T, Connolly BA, Jen-Jacobson L. Facilitated distortion of the DNA site enhances EcoRI endonuclease-DNA recognition. Biochemistry. 1993;90:7548–7552. doi: 10.1073/pnas.90.16.7548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Nathan D, Crothers DM. Bending and flexibility of methylated and unmethylated EcoRI DNA. J. Mol. Biol. 2002;316:7–17. doi: 10.1006/jmbi.2001.5247. [DOI] [PubMed] [Google Scholar]
  • 11.Cordier C, Marcourt L, Petitjean M, Dodin G. Conformation variation of the central CG site in d(ATGACGTCAT)2 and d(GAAAACGTTTTC)2. An NMR, molecular modeling and 3D-homology investigation. Eur. J. Biochem. 1999;261:722–733. doi: 10.1046/j.1432-1327.1999.00314.x. [DOI] [PubMed] [Google Scholar]
  • 12.Windolph S, Fritz A, Oelgeschlager T, Wolfes H, Alves J. Sequence context influencing cleavage activity of the K130E mutant of restriction endonuclease EcoRI identified by a site selection assay. Biochemistry. 1997;36:9478–9485. doi: 10.1021/bi970076g. [DOI] [PubMed] [Google Scholar]
  • 13.Riccelli PV, Vallone PM, Kashin I, Faldasz BD, Lane MJ, Benight AS. Thermodynamic, spectroscopic, and equilibrium binding studies of DNA sequence context effects in six 22-base pair deoxyoligonucleotides. Biochem. 1999;38:11197–11208. doi: 10.1021/bi9904407. [DOI] [PubMed] [Google Scholar]
  • 14.Qu X, Ren J, Riccellie PV, Benight AS, Chaires JB. Enthalpy/Entropy Compensation: Influence of DNA Flanking Sequences on the binding of 7-Amino Actinomycin D to Its Primary Binding Site in Short DNA Duplexes. Biochem. 2003;42:11960–11967. doi: 10.1021/bi0347813. [DOI] [PubMed] [Google Scholar]
  • 15.Wellenzohn B, Flader W, Winger RH, Hallbrucker A, Mayer E, Liedl KR. Complex of B-DNA with Polyamides Freezes DNA Backbone Flexibility. J. Am. Chem. Soc. 2001;123:5044–5049. doi: 10.1021/ja003639b. [DOI] [PubMed] [Google Scholar]
  • 16.Banyay M, Graslund A. Structural effects of cytosine methylation on DNA sugar pucker studied by FTIR. J. Mol. Biol. 2002;324:667–676. doi: 10.1016/s0022-2836(02)01104-x. [DOI] [PubMed] [Google Scholar]
  • 17.Lankas F, Sponer J, Langowski J, Cheatham TE. DNA basepair step deformability inferred from molecular dynamics simulations. Biophys. J. 2003;85:2872–2883. doi: 10.1016/S0006-3495(03)74710-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lefebvre AOM, el Antri S, Monnot M, Lescot E, Fermandjian S. Sequence dependent effects of CpG cytosine methylation. A joint 1H-NMR and 31PNMR study. Eur. J. Biochem. 1995;229:445–454. doi: 10.1111/j.1432-1033.1995.0445k.x. [DOI] [PubMed] [Google Scholar]
  • 19.Vallone PM, Benight AS. Thermodynamic, Spectroscopic, and Equilibrium Binding Studies of DNA Sequence Context Effects in Four 40 Base Pair Deoxynucleotides. Biochem. 2000;39:7835–7846. doi: 10.1021/bi000326k. [DOI] [PubMed] [Google Scholar]
  • 20.Siebert E, Ross JB, Osman R. Contribution of opening and bending dynamics to specific recognition of DNA damage. J. Mol. Biol. 2003;330:687–703. doi: 10.1016/s0022-2836(03)00598-9. [DOI] [PubMed] [Google Scholar]
  • 21.Grohima MM. Structure based sequence dependent stiffness scale for trinucleotides: A direct method. J. Biol. Phys. 2000;26:43–50. doi: 10.1023/A:1005250718139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lindqvist M, Graslund A. An FTIR and CD Study of the Structural Effects of G-tract Length and Sequence Context on DNA Conformation in Solution. Journal of Molecular Biology. 2001;314:423–432. doi: 10.1006/jmbi.2001.5164. [DOI] [PubMed] [Google Scholar]
  • 23.Banyay M, Sakar M, Graslund A. A library of IR bands of nucleic acids in solution. Biophys. Chem. 2003;104:477–488. doi: 10.1016/s0301-4622(03)00035-8. [DOI] [PubMed] [Google Scholar]
  • 24.Pichler A, Rudisser S, Winger RH, Liedl KR, Hallbrucker A, Mayer E. The Role of Water in B-DNAs BI to BII Conformer Substates Interconversion: A Combined Study by Calorimetry, FT-IR Spectroscopy and Computer Simulation. Chem. Phys. 2000;258:391–404. [Google Scholar]
  • 25.Liquier J, Akhebat A, Taillandier E, Ceolin F, Huynh Dinh T, Igolen J. Characterization by FTIR spectroscopy of the oligoribonucleotide duplexes r(A-U)6 and r(A-U)8. Spectrochim. Acta. 1991;47:177–186. [Google Scholar]
  • 26.Tsuobi M. Application of infrared spectroscopy to structure studies of nucleic acids. In: Brame EGJ, editor. Applied Spectroscopy Reviews. Dekker; New York: 1969. pp. 45–90. [Google Scholar]
  • 27.Kay LE, Keifer P, Saarinen T. Pure Absorption Gradient Enhanced Heteronuclear Single Quantum Correlation Spectroscopy with Improved Sensitivity. J. Am. Chem. Soc. 1992;114:10663–10665. [Google Scholar]
  • 28.Palmer AG, Cavanaugh J, Wright PE, Rance M. Sensitivity Improvement in Proton-Detected 2-Dimensional Heteronuclear Correlation Nmr-Spectroscopy. J. Magn. Reson. 1991;93:151–170. [Google Scholar]
  • 29.Schleucher J, Schwendinger M, Sattler M, Schmidt P, Schedletsky O, Glaser SJ, Sorensen OW, Griesinger C. A General Enhancement Scheme in Heteronuclear Multidimensional Nmr Employing Pulsed-Field Gradients. Journal of Biomolecular NNR. 1994;4:301–306. doi: 10.1007/BF00175254. [DOI] [PubMed] [Google Scholar]
  • 30.Wuthrich K. NMR of Proteins and Nucleic Acids. Wiley; New York: 1986. [Google Scholar]
  • 31.Tian Y, Kayatta M, Shultis K, Gonzalez A, Mueller LJ, Hatcher ME. 31P NMR Investigation of Backbone Dynamics in DNA Binding Sites. Journal of Physical Chemistry, B. 2008 doi: 10.1021/jp711203m. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Drew HR, Wing RM, Takano T, Broka C, Tanaka S, Itakura K, Dickerson RE. Structure of a B-DNA Dodecamer - Conformation and Dynamics .1. Proceedings of the National Academy of Sciences of the United States of America-Biological Sciences. 1981;78:2179–2183. doi: 10.1073/pnas.78.4.2179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lee C, Cho M. Vibrational Dynamics of DNA: IV. Vibrational Spectroscopic Characteristics of A-, B-, and Z-form DNAs. J. Chem. Phys. 2007;126:145102–145110. doi: 10.1063/1.2715602. [DOI] [PubMed] [Google Scholar]
  • 34.Hatcher ME, Mattiello DL, Meints GA, Orban J, Drobny GP. A solid-state deuterium NMR study of the localized dynamics at the C9pG10 step in the DNA dodecamer [d(CGCGAATTCGCG)]2. J. Am. Chem. Soc. 1998;120:9850–9862. [Google Scholar]
  • 35.Nerdal W, Hare DR, Reid BR. Solution structure of the EcoRI DNA sequence: refinement of NMR-derived distance geometry structures by NOESY spectrum back-calculations. Biochemistry. 1989;28:10008–10021. doi: 10.1021/bi00452a020. [DOI] [PubMed] [Google Scholar]
  • 36.Bax A, Lerner L. Measurement of 1H-1H Coupling Constants in DNA Fragments by 2D NMR. J. Magn. Reson. 1988;79:429–438. [Google Scholar]
  • 37.Withka JM, Swaminathan S, Srinivasan J, Beveridge DL, Bolton PH. Toward a Dynamical Structure of DNA: Comparison of Theoretical and Experimental NOE Intensities. Science. 255:597–599. doi: 10.1126/science.1736362. [DOI] [PubMed] [Google Scholar]
  • 38.Hartmann B, Piazzola D, Lavery R. BI-BII Transitions in B-DNA. Nucleic Acids Research. 1993;21:561–568. doi: 10.1093/nar/21.3.561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Heddi B, Foloppe N, Hantz E, Hartmann B. The DNA structure responds differently to physiological concentrations of K+ or Na+ Journal of Molecular Biology. 2007;368:1403–1411. doi: 10.1016/j.jmb.2007.03.010. [DOI] [PubMed] [Google Scholar]
  • 40.Djuranovic D, Hartmann B. Conformational characteristics and correlations in crystal structures of nucleic acid oligonucleotides: Evidence for sub-states. Journal of Biomolecular Structure & Dynamics. 2003;20:771–788. doi: 10.1080/07391102.2003.10506894. [DOI] [PubMed] [Google Scholar]
  • 41.Rudisser S, Hallbrucker A, Mayer E. B-DNA's Conformational Substates Revealed by Fourier Transform Infrared Difference Spectroscopy. J. Am. Chem. Soc. 1997;119:12251–12256. [Google Scholar]
  • 42.Flader W, Wellensohn B, Winger RH, Hallbrucker A, Mayer E, Liedl KR. B1->BII Substate Transitions Induce Changes in the Hydration of B-DNA, Potentially Mediating Signal Transduction from the Minor to Major Groove. J. Phys. Chem. 2001;105:10379–10387. [Google Scholar]
  • 43.Kim Y, Grable JC, Love R, Greene PJ, Rosenberg JM. Science. 1990;249:1307–1309. doi: 10.1126/science.2399465. [DOI] [PubMed] [Google Scholar]
  • 44.Santa Lucia J, Allawi HT, Senevirante PA. Biochemistry. 1996;35:3555–2562. doi: 10.1021/bi951907q. [DOI] [PubMed] [Google Scholar]
  • 45.Delcourt SG, Blake RD. J. Biol. Chem. 1991;266:15160–15169. [PubMed] [Google Scholar]
  • 46.Satchwell SC, Travers AA. EMBO J. 1989;8:229–238. doi: 10.1002/j.1460-2075.1989.tb03368.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.McClarin JA, Frederick CA, Wang BC, Boyer HW, Grable JC, Rosenberg JM. Science. 1986;234:1526–1541. doi: 10.1126/science.3024321. [DOI] [PubMed] [Google Scholar]
  • 48.Shui XQ, McFail-Isom L, Hu GG, Williams LD. The B-DNA dodecamer at high resolution reveals a spine of water on sodium. Biochemistry. 1998;37:8341–8355. doi: 10.1021/bi973073c. [DOI] [PubMed] [Google Scholar]

RESOURCES