Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Mar 1.
Published in final edited form as: Chem Sci. 2013 Jan 25;4(3):1319–1329. doi: 10.1039/C2SC21775A

Site-specific inter-strand cross-links of DNA duplexes

Miao Ye 1, Johan Guillaume 1, Yu Liu 1, Ruojie Sha 1, Risheng Wang 1, Nadrian C Seeman 1,, James W Canary 1,
PMCID: PMC3719409  NIHMSID: NIHMS436782  PMID: 23894693

Abstract

We report the development of technology that allows inter-strand coupling across various positions within one turn of DNA. Four 2′-modified nucleotides were synthesized as protected phosphoramidites and incorporated into DNA oligonucleotides. The modified nucleotides contain either 5-atom or 16-atom linker components, with either amine or carboxylic acid functional groups at their termini, forming 10 or 32 atom (11 or 33 bond) linkages. Chemical coupling of the amine and carboxylate groups in designed strands resulted in the formation of an amide bond. Coupling efficiency as a function of trajectory distance between the individual linker components was examined. For those nucleotides capable of forming inter-strand cross-links (ICLs), coupling yields were found to depend on temperature, distance, and linker length, enabling several approaches that can control regioselective linkage. In the most favorable cases, the coupling yields are quantitative. Spectroscopic measurements of strands that were chemically cross-linked indicate that the global structure of the DNA duplex does not appear to be distorted from the B form after coupling. Thermal denaturing profiles of those strands were shifted to somewhat higher temperatures than those of their respective control duplexes. Thus, the robust amide ICLs formed by this approach are site-specific, do not destabilize the rest of the duplex, and only minimally perturb the secondary structure.

Introduction

The two strands of the DNA double helix can be separated readily because they are connected by hydrogen-bonded interactions, rather than by covalent bonds.1 Nevertheless, there are a number of reasons to cross-link the two strands covalently. Examples include (1) to maintain 3D structures associated with functionalities,2 (2) to enhance the stability of DNA nanostructures,3, 4 and (3) to examine features of DNA metabolism, such as repair.5 As events closely related to cell death, cross-linking phenomena involved in cancer therapy have been studied intensely.5, 6 Consequently, the goal of this work is to connect specific nucleotides on opposite strands of nucleic acid double helices via organic moieties, which produce covalent bonds in a systematic regioselective fashion.

Cross-linking can be effected by an array of different chemistries. Inter-strand cross-links (ICLs) have been introduced into unmodified DNA by abasic sites,7 by oxidants,8 by chemical additives including drugs,9 and by low pH.10 ICLs can also form in modified DNA by reactions such as enzymatic restriction and ligation,11 phosphate backbone linking,12 disulfide bond formation,13 in situ oxidation of furanyl modifications,14 photocleavage induced abasic sites15 and “click chemistry” such as the Cu(I) catalysed [3+2] azide-alkyne cycloaddition (CuAAC) reaction.16,17 Most cross-linking strategies have involved linkages over short distances, such as between neighboring base pairs. ICLs over greater distances have also been reported.16,18

Although several groups have used different methods to introduce ICLs to fixed distances such as adjacent base pairs,19,20 one base pair apart,21 and two base pairs apart,22 most current methods do not readily allow for the coupling of specific nucleotides in a nucleic acid structure, especially at varying distances. For example, psoralen is one of the most frequently used agents for cross-linking.3, 4, 23 However, with psoralen, reactivity is dependent on the proper placement of pyrimidine nucleotides and it can be difficult to control the amount of cross-linking. Chemotherapeutic alkylating agents such as cisplatin5, 24 or nitrogen mustards5 react regioselectively with the nucleophiles present in nucleic acids, but such complexes may be limited to studies that do not require homogenous or regio-specific cross-links.

In addition to biological processes, such modifications are certainly important in DNA nanotechnology,3, 4, 25 where it becomes key to develop nucleic acid analogs with modifications that bring precise spatial control of chemical reactivity, or that allow them to assume functions that are not associated with a biological DNA context. Locking nucleic acid molecules (Fig. 1) into specific conformations would enable the formation of novel DNA motifs, structures and topologies, particularly through hierarchical approaches.

Fig. 1.

Fig. 1

Inter-strand cross-linking of DNA by amide bond formation, mediated by DMTMM. The major groove is not differentiated from the minor groove in this diagram, which represents coupling across either of the grooves schematically.

Previously, we reported the addition of 2′-alkylamine or 2′-alkylcarboxylate groups to nucleotides where the 2′-oxygen of a ribonucleotide had been replaced by a sulfur atom.26, 27, 28, 29 In these reports, we incorporated special nucleotides into DNA molecules and joined the derivatized nucleotides together by forming amide bonds in a reaction using the chemical coupling reagent DMTMM (4-(4,6-dimethoxy-1,3,5-triazin-2-yl)-4-methylmorpholinium chloride) in aqueous solution. We have also used this methodology to ligate DNA strands by 2′,2′ coupling.27 Additionally, we reported the formation of novel DNA catenanes by forming an intra-strand cross-link across a full turn of double helix.30 In this report, we have inserted special nucleotides containing these or similar modifications on opposite strands of the double helix. From Fig. 1, it is evident that this approach can join the two complementary strands of a double helical DNA molecule to each other by a covalent amide linkage. In principle, the linkage is capable of cross-linking strands in a simple double helix or in a more complex DNA motif. Here, we report a systematic study of amide-based inter-strand cross-linking in double helices at various distances with different linker lengths.

Results and Discussion

We prepared 2′-deoxy-2′-alkylmercaptophosphoramidite analogs of uridine-containing linker component groups of five atoms or sixteen atoms. The reagents were introduced into DNA strands using standard phosphoramidite chemistry31 to yield DNA molecules containing these modified nucleotides (Fig. 2). The nomenclature of the compounds is such that the superscript refers to the length of the linker component from the 2′-S atom to the nitrogen atom or carboxylate carbon atom at the termini; the subscript refers to the nature of the reactive chemical groups at the end of the linker component. N represents an amine group and C represents a carboxylate group. The modified nucleotides with 5-atom linker components have aliphatic side chains that are terminated with an amine group, UN5, or a carboxylate group, UC5. Compounds with 16-atom polyethylene glycol (PEG) linker components are designated UN16 (amine terminus) or UC16 (carboxylate). The syntheses of UN5,UC5, and UC16 were reported previously.26, 30 The synthesis of UN16 phosphoramidite followed a route similar to earlier compounds and is described in the supporting information. After incorporation into nucleic acid strands, the monomers were coupled via amide bond formation to produce ICLs. Two 5-atom linker components form a 10-atom linker (from 2′-S of one nucleotide to 2′-S of the other) and two 16-atom linker components form a 32-atom linker. From structural point of view, 10-atom linkers consist of 11 bonds from 2′-S to 2′-S and 32-atom linkers consist of 33 bonds.

Fig. 2.

Fig. 2

The chemical structures of modified nucleotides used to cross-link DNA strands. b) A duplex crosslinked using UN5 and UC5 modified nucleotides separated by one base pair.

As noted above, our goal is to develop and study methods to cross-link double helical DNA strands with high specificity. An ICL is one where a connection is made between one strand and its complement, as opposed to a connection within the same strand. The distance that is spanned by a specific cross-link is determined not only by the number of nucleotides that lie between the two modified nucleotides but also by the position of the second modified nucleotide relative to the first and the path that the connector must follow to reach it.

We choose the N′ position on the reference strand to place a modified nucleotide, then use its corresponding nucleotide N on the complementary strand as the starting position. (Fig. 3) The stereochemistry of a double stranded nucleic acid molecule is such that ICLs can be produced between N′ and nucleotides in either a positive (N+i) direction or a negative (N−i) direction from the starting nucleotide N. Our notation uses the standard convention that strands progress from the 5′ end to the 3′ end. From N, any nucleotide positioned towards its 5′ end is assigned a negative value; these sites are labelled as N-1, N-2, N-3, and so on. The nucleotides that are positioned towards the 3′ end of N are assigned a positive value, such as N+1, N+2, etc.

Fig. 3.

Fig. 3

Nomenclature for two-nucleotide inter-strand cross-links (ICLs). Cross-links are formed between N′ and N±i. To determine the location of a cross-link in relation to a nucleotide N, both strands in the duplex are read from the 5′ end toward the 3′ end. Cross-links formed between N′ and a nucleotide that is before N are assigned negative values and those after N are assigned positive values.

In this study, UN5 nucleotides are at a fixed position on the reference strand (N′), UC5 nucleotides are located on complementary strands with systematic variation in separation in both the 5′ and 3′ directions (N±i). Similarly, UC16 nucleotides are fixed in one position on the reference strand, and UN16 nucleotides are located in the 5′ direction of the complementary strands. To simplify our nomenclature and maintain consistency, we also use “N+i” and “N−i” to denote reactions and products of certain linkages. For example, N-1 can mean both the reaction between N and N-1 or the cross-linked product strand after the reaction. The designation “i” is a number varying from 1 to 7, denoting the separation in terms of number of nucleotides. Similar nomenclature such as N-6(L), N-7(L) and N-8(L) are given to reactions and products involving the longer linkers.

After DNA strands containing modified nucleotides were synthesized, they were purified by reverse phase HPLC and analyzed by MALDI-TOF mass spectrometry. The sequences and mass data for the oligonucleotides used in this study are listed in Table 1. The sequences were chosen to vary nucleotides evenly so as to avoid local distortions caused by the accumulation of purines or pyrimidines. Modified nucleotides were positioned near the centers of the duplexes to minimize potential impact from thermal motion at the ends. The two complementary strands were of differing length (18 and 16 nucleotides each) so that they could be monitored separately by denaturing PAGE. To form ICLs, complementary strands containing amine- and carboxylic acid-modified nucleotides were annealed to form double helices first, and were then coupled to form amide bonds, as shown in Fig. 1. The cross-linked products were purified using gel electrophoresis and then analyzed by MALDI-TOF (Tables 2 and 3).

Table 1.

Modified oligonucleotides used and their analysis by MALDI-TOF MS

graphic file with name nihms436782f13.jpg

Table 2.

Five-atom linker components coupling reaction products and their analysis by MALDI-TOF MS.

Coupled Products Reactant ODNs Separation (in nucleotides) Structures Mass
Calculated Found
N-1 N3 + C7 1 graphic file with name nihms436782t1.jpg 10560 10560
N-2 N2 + C4 2 graphic file with name nihms436782t2.jpg 10560 10560
N-3 N1 + C1 3 graphic file with name nihms436782t3.jpg 10560 10559
N-4 N2 + C5 4 graphic file with name nihms436782t4.jpg 10560 10560
N-5 N1 + C2 5 graphic file with name nihms436782t5.jpg 10560 10559
N-6 N2 + C6 6 graphic file with name nihms436782t6.jpg 10560 10559
N+1 N6 + C14 1 graphic file with name nihms436782t7.jpg 10560 10560
N+2 N5 + C11 2 graphic file with name nihms436782t8.jpg 10560 10559
N+3 N4 + C8 3 graphic file with name nihms436782t9.jpg 10560 10558
N+4 N5 + C12 4 graphic file with name nihms436782t10.jpg 10560 10558

Table 3.

16-atom linker components coupling reaction products and their analysis by MALDI-TOF MS.

Coupled Products Reactant ODNs Separation (in nucleotides) Structures Mass
Calculated Found
N-6(L) N9 + C15 6 graphic file with name nihms436782t11.jpg 10879 10884
N-7(L) N8 + C15 7 graphic file with name nihms436782t12.jpg 10879 10882
N-8(L) N7 + C15 8 graphic file with name nihms436782t13.jpg 10879 10885

We conducted N−i cross-linking reactions at both 4 °C and 20 °C to explore the temperature dependence of reactions in this range. Fig 4 shows the denaturing PAGE gel of coupled DNA nucleotides from N-1 to N-7 after reaction for 48 hours.

Fig. 4.

Fig. 4

Denaturing PAGE gels of coupled DNA nucleotides from N-1 to N-7. UN and UC ODNs contain 18 and 16 nucleotides, respectively. Upon coupling, a new upper band appeared. In both A and B, lane 1 contains 5 control groups with non-complementary UN and UC strands treated under the same conditions as other reaction groups. Lanes 2 to 9 represent, respectively, N-1 to N-7 coupling reactions; lane 2, N-1; lane 3, N-2; lane 4, N-3; lane 5, N-4; lane 6, N-5; lane 7, N-6; lane 8, N-7; lane 9, 10 bp marker. (A) Denaturing PAGE gel of coupling reactions occurring at 20 °C. (B) Denaturing PAGE gel of coupling reactions occurring at 4 °C.

Fig. 4A shows the denaturing PAGE gel of the coupling reactions performed at 20 °C. Fig. 4B shows the denaturing PAGE gel of the coupling reactions at 4 °C. In both panels A and B, lane 1 contains negative controls with non-complementary UN and UC strands treated under conditions identical to those for other reaction groups.

The UN and UC strands migrate as expected for 18 and 16 nucleotides respectively. Upon coupling, a new upper band appeared. The new band migrates slightly more slowly than the combined mass of 34 nucleotides, which might be attributable to the presence of intra-strand linkers producing a branched structure. Lower bands could still be observed around 18 and 16 nucleotides, representing uncoupled strands. The 20 °C reactions and 4 °C reactions display similar trends. For 20 °C reactions, N-1, N-2, N-3 and N-5 are apparently completely coupled, with one starting material band being undetectable. Cross-linking with N-4 and N-6 gave lower but still high yields. However, the N-7 reaction showed a sharp decrease in yield compared to the others. The coupling reaction yield of N-7 at 20 °C was barely observable, while it was not detected at 4°C. Reaction yield differences were expected at different reaction temperatures.27 The consistency of trends suggested that the linking reactivity is caused by the structure of the duplex itself rather than thermal fluctuations related to the temperature change. Once fixed on the duplex, UN and UC coupling reactions have high effective concentrations of the two reactants if they are within reach (N-1 to N-6). Once the two arms of linker components are out of reach (N-7), reactions can hardly occur, owing to the rigidity of the DNA duplex, even though the effective concentration is still high. Trace amounts of product can still be observed for the N-7 reaction at 20 °C, which may be attributable to thermal fluctuations. However, the amount of product is minimal, relative to the other reactions.

Fig. 5 contains a denaturing PAGE gel of coupled DNA nucleotides from N+1 to N+7 after reaction for 48 hours. Fig. 5A contains the denaturing PAGE gel of coupling reactions occurring at 20 °C. Fig. 5B contains the denaturing PAGE gel of coupling reactions occurring at 4 °C. In both A and B, lane 1 contains control groups with non-complementary UN and UC strands treated under conditions identical to the other reaction groups. Similar to the N−i reactions, UN and UC N+i strands migrate around 18 and 16 nucleotides on the gel respectively, while the product band migrates between 40 to 50 nucleotides. Reactions at 20 °C and 4 °C displayed similar trends. For 20 °C reactions, N+1, N+2 and N+3 were close to complete, with one starting material band being undetectable. The N+4 reaction had a lower yield but was still clearly visible. Traces of the upper band could be seen from N+5 to N+7 with a sharp reaction yield drop from N+4 to N+5. For 4 °C reactions, the same trends were retained, with a lower yield for each reaction. It is noticeable that the N+4 reaction yield dropped sharply compared to the 20 °C reactions. The N+5 to N+7 products were undetectable on denaturing gels. These observations suggest that N+4 is the limit for cross-linking reactions along the N+i direction, and that thermal fluctuations contribute to the reaction yield difference.

Fig. 5.

Fig. 5

Denaturing PAGE gels demonstrating the coupling reaction of DNA nucleotides from N+1 to N+7. UN and UC ODNs contain 18 and 16 nucleotides respectively. When coupled, a new upper band appeared. In both A and B, lane 1 contains control groups with non-complementary UN and UC strands treated under the same conditions as other reaction groups. Lanes 2 to 9 represent N+1 to N+7 coupling reactions; lane 2, N+1; lane 3, N+2; lane 4, N+3; lane 5, N+4; lane 6, N+5; lane 7, N+6; lane 8, N+7; lane 9, 10 bp marker. (A) Denaturing PAGE gel of coupling reactions occurring at 20 °C. (B) Denaturing PAGE gel of coupling reactions occurring at 4 °C.

From cross-linking reactions in both the 3′- and 5′-directions, mismatched sequences resulted in undetectable product formation under identical concentration and coupling conditions. This finding illustrates the effect of templation, which is a consequence of complementary strands forming a well-defined and predictable structure.27, 32 However, not all templated reactants resulted in successful linkage formation. Certain limits of the cross-linking reactions were also observed. The limit of cross-linking for N−i (5′ direction of complementary strands) is a separation of 7 nucleotides, and the limit for N+i (3′ direction of complementary strands) is 4 nucleotides. Surprisingly, the different distance requirement displayed in the 3′ vs. 5′ directions is not consistent with the structural requirement for N−i linkages to cross the major groove of DNA duplex and for N+i linkages to cross the minor groove. Since the width of major groove is wider (~22 Å) than the minor groove (~12 Å), we would naïvely expect that the linking of N−i should be more difficult than that of N+i, which contradicts our observations (7 nucleotides versus 4). We suspect that not only the distance but also the linker trajectory plays a key role in successful cross-linking. To challenge this hypothesis, we examined computerized DNA models, which will be discussed below in detail.

Since we postulated that the inability of short linkers to span the stiff DNA duplex led to the N-7 limit of N−i reactions, we decided to see whether a longer linker could enable us to cross this boundary and cross-link at greater separations. We synthesized nucleotides with 16 atom linker components to go from N-6 to N-8 positions, covering the range of boundaries set by short linkers. The result of this coupling is shown in Fig. 6. Cross-coupled products can be seen clearly from the denaturing gel, which demonstrates that the boundary restraining the cross-linking of short linkers can be overcome by extending linker length.

Fig. 6.

Fig. 6

Denaturing PAGE gel showing inter-strand coupling for 16-atom linker components at 20 °C. Lane 1: 10bp DNA marker, Lane 2: uncoupled strands, Lane 3: N-6(L), Lane 4: N-7(L), Lane 5: N-8(L).

To understand the rate of inter-strand linkage formation, we studied N+i and N−i coupling reactions to form 10-atom linkers at both 20 °C and 4 °C. (Fig. S1 and S2). A crude calibration was conducted using a time course study to give a semi-quantitative estimate of reaction yields. Generally, these ICLs formed efficiently at 20 °C: around 50 % of the final products were observed after 4 hours of reaction, while reactions at 4 °C generally have lower yields and reaction rates. The reaction yields correlated with distances in terms of base pairs separating two modified bases. Further separated modified base pairs gave lower reaction yields and slower reaction rates.

The effects of all inter-strand linkages on DNA duplex structure were examined by comparing the melting temperatures of the coupling reaction products with their respective unmodified control duplexes. Melting temperatures (Tm) are summarized in Table 4. For both N−i and N+i products, linked products generally have wider melting transitions. Products linked between proximal nucleotides tend to have wider melting transitions and higher Tm values than more distantly linked products. (Fig. 7)

Table 4.

Tm values of coupling reaction products and respective control duplexes

(a) N−i products and controls (b) N+i products and controls
Products Tm (°C) control duplexes
Tm (°C)
Δ Tma Products Tm (°C) control duplexes
Tm (°C)
Δ Tma
N-1 >93.0 63.9b >29.1 N+1 83.6 64.9b 18.7
N-2 81.6 63.8b 17.8 N+2 84.6 66.9b 17.7
N-3 79.6 64.6b 15.0 N+3 75.6 63.7b 11.9
N-4 69.9 63.8b 6.1
N-5 69.3 64.6b 4.7
N-6 68.4 63.8b 4.6
a

ΔTm is the Tm difference between coupled reaction products and corresponding control duplexes.

b

Control duplexes for each product: N-1-control for N-1; N-3,5-control for N-3 and N-5; N-2,4,6-control for N-2, N-4 and N-6; N+1-control for N+1; N+2-control for N+2; N+3-control for N+3.

Fig. 7.

Fig. 7

Melting curves for coupling reaction products and their respective control duplexes. Control duplexes share the sequence of respective products with modified nucleotides replaced by T. (Table 2) Three sets of control duplexes are listed because sequences of products vary. Control duplexes for each product: N-1-control for N-1; N-3,5-control for N-3 and N-5; N-2,4,6-control for N-2, N-4 and N-6; N+1-control for N+1; N+2-control for N+2; N+3-control for N+3. (A) Reaction N-1 to N-6 products and their respective control duplexes. N-1-control, N-3,5-control, N-2,4,6-control largely overlap on the left. N-2,4,6-control may not be seen clearly. (B) Reaction N+1 to N+3 products and their respective control duplexes. A = (A(T)−A(T0))/A(T0), where A(T0) is the absorbance at 20 °C.

The data listed in Table 4 show that inter-strand coupling of the amino and carboxyl linker groups led to higher stability of the duplex form of the products. The degree of Tm rise corresponds to the direction and location of cross-links. The Tm of the N-1 product was not reached under experimental conditions (up to 93 °C). The N-2 product gave a 17.8 °C increase. The difference (ΔTm) in Tm of the coupled product vs. the control decreased for linkers that are stretched further apart in the same direction. When linkers are separated by 3 and 4 base pairs, the ΔTm increases dropped to 15.0 °C (N-3) and 6.1 °C (N-4). ΔTm was further reduced for 5 and 6 base separations, (4.7 °C for N-5 and 4.6 °C for N-6). Similar trends were also observed for N+i products, with less pronounced differences in ΔTm. Separation by one base pair gave an 18.7 °C (N+1) of ΔTm while two and three base pair separations give increases of 17.7 °C (N+2) and 11.9 °C (N+3) respectively.

Overall, the melting profiles of the strands that were chemically cross-linked were shifted to somewhat higher temperatures than that of the control, indicating stabilization effects.11, 33 The extent of increase is related to the separations between linkers.

We also compared the thermal denaturation profiles of the coupled strands with the profiles of the uncoupled strands and control duplexes for N-1 product. The Tm of the uncoupled duplex containing two modified nucleotides is 11 °C lower than control duplex, which is more than 29 °C lower than linked products (Fig. S5). This result agrees with our earlier studies of intra-strand linkages in which uncoupled, modified sequences were less stable, while coupled linkages showed increased thermal stability.28

To understand the global structure of the cross-linked products compared to unmodified control duplexes, circular dichroism (CD) spectra were obtained. In Fig. 8, the overall CD curves of product strands are similar to their respective control duplexes, consistent with an overall B-form structure. All coupled strands show a less intense negative band around 250 nm and a similar or more intense positive band around 270–280 nm. The latter band is generally shifted toward shorter wavelength compared to their respective control duplexes. Similar effects were observed for N+i products, for which CD data are shown in Fig. 9. However, the changes in spectra of coupled products compared to control duplexes are less pronounced. The CD spectra of N+1 and its corresponding control duplex are nearly super-imposable. Therefore, the overall B-form conformation of DNA duplex is retained with only minor perturbation.

Fig. 8.

Fig. 8

CD spectra of N-1 to N-6 reaction products and unmodified control duplexes. (a) Overlay of N-1 product and control duplex N-1-control. (b) Overlay of N-3, N-5 products and control duplex N-2,4,6-control. (c) Overlay of N-2, N-4, and N-6 products and control duplex N-3,5-control.

Fig. 9.

Fig. 9

CD spectra of the N+1 to N+3 reaction products and unmodified control duplexes. (a) Overlay of the N+1 product and control duplex. (c) Overlay of the N+2 product and control duplex. (c) Overlay of the N+3 product and control duplex.

The CD spectrum of the N-1 product (Fig. 8a) looks different from others when compared to its control. In the range between 240 nm to 260 nm, there is an apparent inversion of the curve, which is likely owing to a perturbation caused by the linker.

As mentioned above, despite the fact that the major groove is around 10 Å wider than the minor groove in B-form DNA, our experiments clearly show that 10-atom linkers reach across the major groove more easily. To explain the difference in linking abilities, we examined the geometries with molecular models. Since CD experiments had shown that the DNA duplex adopted an overall B-form conformation, it is reasonable to build models based on a B-form structure, although the local structure of the modified nucleosides is likely to be 3′ endo (corresponding to A-form), because of the derivitization of their 2′ carbons.

In considering the required reach between linker components in real space, the trajectory of the linker between two components must be taken into account. Fig. 10 shows that in the B-form structure, N−i linkers are positioned by the orientations of the sugars: The vectors from the 2′-ribosyl carbon attachments are directed towards each other. Meanwhile, pairs of N+i linkers must cross the minor groove such that all linker components face away from one another. (Fig. 11) Thus, N−i linker components easily reach across the major groove of B-form DNA duplex, while N+i linker components need to follow a contour across the minor groove by wrapping around the phosphodiester backbone. We suggest that it is the cross-linker molecular trajectory that limits linking reactions in this direction, thus accounting for the difference.

Fig. 10.

Fig. 10

Molecular models of inter-strand linkages in N−i products across the major groove. (a) N-2 product. (b) N-3 product. (c) N-4 product. (d) N-5 product. (e) N-6 duplex, the distance between the nitrogen atom of UN linker component and carboxyl carbon of UC linker component is 2.8 Å. (f) N-7 duplex, the distance between the nitrogen atom of UN linker component and carboxyl carbon of UC linker component is 7.5 Å.

Fig. 11.

Fig. 11

Molecular models of inter-strand linkages in N+i products across the minor groove. (a) N+1 product. (b) N+2 product. (c) N+3 product. (d) N+4 duplex, the distance between the nitrogen atom of UN linker component and carboxyl carbon of UC linker component is 3.8 Å. (e) N+5 duplex, the distance between the nitrogen atom of UN linker component and carboxyl carbon of UC linker component is 6.8 Å.

To be more specific, as we can see from models in Fig. 10 and 11, N-1 to N-5 and N+1 to N+3 linkages can form readily; their conformations have been estimated by the methods described. However, distances between two functional groups increase as the number of base pairs separating them increases. For duplexes in which linkers cannot form straightforwardly, distances between the nitrogen atom of the amine groups and the carbon atom of the carbonyl groups of their respective linker components were measured.

The distances represented by duplexes N-7 and N+5 represent the extent to which functional groups cannot reach each other in space. At these locations, distances are 7.5 Å for N-7 (Fig. 10f) and 6.8 Å for N+5 (Fig. 11e), which explains the trace quantities of products observable. In the case of N-6 (Fig. 10e), the distance is around 2.8 Å. Given that the C-N bond length of an amide is around 1.33 Å, the 1.5 Å gap between those two groups must be compensated by thermal fluctuations of the DNA duplex, so as to form the product.

Similarly, in the case of N+4 (Fig. 11d), the closest distance between these two atoms is around 3.8 Å. To make an amide bond, a 2.5 Å distance gap needs to be overcome by thermal fluctuations. Compared to the case of N-6 (Fig. 10e), fewer base pairs (4 vs. 6) separating two modified nucleotides may contribute to a reduced flexibility, thereby resulting in a much lower reaction yield.

In the case of N-1, if the duplex conformation is maintained undistorted, the span between linked nucleotides is prevented by van der Waals violations with neighboring base pairs. (Fig. 12a) In an attempt to make a more sensible model to avoid this severe steric effect, a bent conformation was examined. (Fig. 12b) The bent model has a much lower energy, and accommodates linkers without violating van der Waals radii. Since product N-1 shows several unusual properties in comparison with the other products, including very high Tm, faster mobility, and a more pronounced CD deviation from the control, we postulate that all of these properties are somehow related to a distorted conformation. However, further experimental data will be needed to obtain a more precise structure of the product.

Fig. 12.

Fig. 12

Proposed structures of N-1 product (a) N-1 product unbent conformations, the linker goes through the space between two adjacent base pairs. (b) N-1 product bent conformation, adjacent base pairs neighboring linking sites are tilted to accommodate the linker.

Conclusions

In summary, we have developed convenient tools to cross-link selected positions on the opposite strands of a DNA double helix. We have thoroughly examined the linking ability of ten-atom linkers. We established the linking constraints of the ten-atom linkers to be N-1 to N-6, and N+1 to N+4. Reactions at 20 °C and 4 °C both display the same trends, showing that the linking selectivity of these reactions is largely temperature independent in this range. Reaction yields and rates are generally related to the number of nucleotides separating two linking positions. The shortest separations provide quantitative yields. Longer linkers generally have lower reaction yields and lower reaction rates. To explain the difference between linking yields in the two directions, we have examined structural models. Structures based on B-form DNA demonstrate that the orientations of linkers and the resulting contour lengths of the linker molecular trajectories between them largely determine reaction yields. However, if linker lengths are extended, the set of constraints originating in the DNA duplex structure applied to 10-atom linkers can easily be overcome by using 32-atom linkers, suggesting the possibility of expanding our system to realize cross-linking selectively over larger distances if desired.

We employed several approaches to characterize cross-linked products. MALDI-TOF mass spectrometry and denaturing gel electrophoresis verified successful links between strands. Thermal denaturation showed that the existence of linkers increases the melting temperatures of duplexes. While a general stabilizing effect was discovered, the extent depends heavily on the separation of the linkers. Short linkers can greatly increase Tm, whereas longer linkers contribute much smaller changes, as would be expected from the extent of restraint the linkers impose on the double helix. CD spectra demonstrated that the overall character of B-form DNA is maintained despite the fact that the uridine nucleotides containing the linkers are ribonucleotides and likely to prefer the 3′-endo, A-form conformation.

The N-1 product revealed unusual properties, such as a very high ΔTm, disturbance of the CD spectrum and faster mobility on denaturing gels. We suggest that there may be a stronger structural perturbation at play.

The cross-linking procedure that we have introduced here should find wide applications in nucleic acid chemistry, biotechnology and nanotechnology. Readily accessible modified bases can be introduced using phosphoramidite-based oligonucleotide synthesis31 to produce strands ready for cross-linking without further functionalization. It should be straightforward to insert a fragment containing cross-linked or cross-linkable synthetic fragments into a larger piece of DNA or other nucleic acids, such as RNA or LNA. Such fragments will prove resistant to strand separation and to strand displacement through the processes of strand invasion and branch migration, thereby facilitating DNA-based hierarchical assembly. Likewise, the type of strand exchange needed for PX-DNA formation34 is likely to be impaired. Both DNA nanotechnology and DNA-based computation will be able to benefit from molecules that are resistant to strand invasion, branch migration and separation. Furthermore, it is likely that analysis of those biological processes that entail these phenomena will be facilitated by the developments reported here. In addition, the possible roles of strand separation and empty grooves, perhaps currently unsuspected in a variety of biological processes, can be explored and challenged using this system.

Materials and Methods

General procedure for MALDI sample preparation

Amicon Ultra-0.5 centrifugal filter devices with nominal molecular weight limit (NMWL) 3,000 Da were used to desalt samples. The matrix solution consisted of 2:1 v:v ratio of 2′,4′,6′-Trihydroxyacetophenone (THAP) (35 mg, 0.2 mmol in 1 mL of 50% MeOH/H2O) to ammonium citrate solution (0.1 M). Samples and standards (e.g., unmodified T18) were prepared in vials, to which the matrix solution was added. Each sample (1 μL) was mixed with matrix solution (3 μL) using vortex mixing followed by centrifugation (5 s). A total volume of 4 μL of the mixture was then added to wells on the target.

General procedure for amide bond formation on DNA

A UN5-containing strand (100 pmol, DNA oligomer modified with a 5-atom amine linker component) and a UC5-containing strand (100 pmol, DNA oligomer modified with a 5-atom carboxylic acid linker component) were dissolved in 100 μL of MOPS buffer (100 mM MOPS, 1M NaCl, pH 7.0). The solution was annealed overnight. A portion of 100 mM DMTMM reagent (100 μL) in MOPS buffer was added to the DNA mixture so that the final duplex solution became 500 nM and the final DMTMM concentration was 50 mM. The reaction was allowed to proceed for 48 hours in the dark.

General procedure for time-dependent amide formation on DNA

A UN5-containing strand (500 pmol) and a UC5-containing strand (500 pmol) were dissolved in 500 μL of MOPS buffer (100 mM MOPS, 1 M NaCl, pH 7.0) and annealed overnight. A portion (500 μL) of 100 mM DMTMM solution was added to start the reaction. An aliquot of the reaction mixture (200 μL) was taken at the designated time and then quenched by 300 μL of 1 M ammonium acetate solution. The quenched solutions were then desalted and analyzed by denaturing PAGE. The same procedure was performed at both 4 °C and 20 °C.

General procedure to prepare samples for thermal denaturing studies

Each complementary strand (2 nmol) was dissolved in 500 μL of sodium cacodylate buffer (40 mM sodium cacodylate, 100 mM NaCl). The solution containing DNA strands was pre-annealed. The melting curve was obtained at 260 nm using a Varian Cary 100 Bio UV-Vis Spectrophotometer. The temperature was changed at the rate of 0.2 °C/min at a data interval of 0.5 °C. Each sample was heated from 20.0 °C to 93.0 °C, then cooled back down to 20.0 °C. This temperature was maintained for 5 min at both 20.0 °C and 93.0 °C. Two such runs were repeated for each sample.

CD experiments

The CD spectra were recorded from 350 nm to 200 nm on an AVIV Model 202SF Circular Dichroism Spectropolarimeter at ambient temperature in 0.2 cm cuvettes. Two complementary strands (2.8 nmol each) were pre-annealed and then dissolved in buffer (10 mM sodium phosphate, 150 mM sodium chloride, and 0.1 mM EDTA) to a final volume of 700 μL. The duplex solution was filtered through a 0.2 μm syringe filter (Millex) before recording CD spectra. At least three scans were collected for each sample and a buffer baseline was subtracted from the average of these scans to yield the CD plots.

Denaturing polyacrylamide gel electrophoresis (PAGE) analysis and purification

The coupling products were analyzed and purified on 20% polyacrylamide gels. The running buffer contained 89 mM Tris–HCl, pH 8.0, 89 mM boric acid, and 2 mM EDTA. The tracking dye contained buffer, 50% glycerol, and a trace amount of Bromphenol Blue and Xylene Cyanol FF. The gel was stained with ethidium bromide.

For analysis, gels were imaged on a Kodak Image Station Gel Logic 200. Semi-quantitative estimation of yields was calculated by measuring the band intensity using Kodak Molecular Imaging software.

For purification, product bands were cut out and eluted in a solution containing 500 mM ammonium acetate, 10 mM magnesium acetate,and 1 mM EDTA. Isolated yields were around 50% to 60%, estimated by OD260.

General procedure for DNA modelling

The DNA duplex structures were prepared using coordinates derived from X-ray fiber diffraction data.35 An 18 base-pair double-stranded B-DNA molecule was introduced. Linker components and linkers were built using Maestro (Maestro, version 9.2, Schrödinger, LLC). For unlinked duplexes, conformations with the shortest distances between amine and carboxyl were found and measured. For linked duplexes, conformational searches were conducted using the Monte Carlo (MCMM) method as implemented in MacroModel (MacroModel, version 9.9, Schrödinger, LLC). All atoms and all rotatable single bonds of linkers were included in the conformational search. All base pairs were maintained stationary; modified and unmodified deoxyribose and adjacent phosphate diester linkages were allowed to move only when at linking sites or in close proximity to them. The energy minimizations were carried out using the Powell-Reeves conjugate gradient (PRCG) method and the AMBER* force field, as implemented in MacroModel. The conformational searches were done for an aqueous solution using a constant dielectric electrostatic treatment with default settings. To bend cross-linked product N-1 from the starting duplex, the angle between two adjacent bases sandwiching the linker was estimated from a 4 base pair fragment of the original starting duplex. Following this, a conformational search was performed with the bent form.

Synthesis of oligonucleotides

Oligonucleotide strands were prepared from synthetic or commercial phosphoramidite reagents on an Applied Biosystem Model 394 automated synthesizer using standard phosphoramidite techniques.31 Synthetic amidites were prepared as described previously,26, 30 except for UN16 which was prepared as shown in Scheme S1 in the supporting information.

Supplementary Material

ESI

Acknowledgments

We thank Liang Ding for contributions during early phases of this work.

Funding

We (JWC and NCS) are grateful for funding to the National Science Foundation (CTS-0608889) and the Office of Naval Research (N000140911118). This work has also been supported by grants to NCS from the National Institute of General Medical Sciences (GM-29544) the National Science Foundation (CCF-1117210), the Office of Naval Research (N00014-11-1-0729) the Army Research Office (W911NF-11-1-0024), grant DE-SC0007991 from the Department of Energy. This work was supported partially by the MRSEC Program of the National Science Foundation under Award Number DMR-0820341. We thank the National Science Foundation for instrument grants CHE-0234863, CHE-0116222, CHE-0958457, and DUE-0126958, and the National Institutes of Health for construction grant C06 RR-16572-01.

Footnotes

Electronic Supplementary Information (ESI) available: Synthesis of uridine phosphoramidite for UN16; Time course studies of N-1 to N-6 and N+1 to N+4 coupling reactions; MALDI-TOF data for coupled products of 32-atom linkers; Melting curves of uncoupled N-1 duplex, single modified duplex with UC and UN modified nucleotides. See DOI: 10.1039/b000000x/

Contributor Information

Nadrian C. Seeman, Email: ned.seeman@nyu.edu.

James W. Canary, Email: james.canary@nyu.edu.

Notes and references

  • 1.Watson JD, Crick FHC. Nature. 1953;171:737–738. doi: 10.1038/171737a0. [DOI] [PubMed] [Google Scholar]
  • 2.(a) Sigurdsson ST. Methods in Enzymology. Vol. 318. Academic Press; 2000. Site-specific sulfhydryl groups for study of RNA conformation via disulfide cross-linking; pp. 165–166. [DOI] [PubMed] [Google Scholar]; (b) Stage-Zimmermann TK, Uhlenbeck OC. Nat Struct Biol. 2001;8:863. doi: 10.1038/nsb1001-863. [DOI] [PubMed] [Google Scholar]
  • 3.Seeman NC. Nature. 2003;421:427–431. doi: 10.1038/nature01406. [DOI] [PubMed] [Google Scholar]
  • 4.Rajendran A, Endo M, Katsuda Y, Hidaka K, Sugiyama H. J Am Chem Soc. 2011;133:14488–14491. doi: 10.1021/ja204546h. [DOI] [PubMed] [Google Scholar]
  • 5.Noll DM, Mason TM, Miller PS. Chem Rev. 2006;106:277–301. doi: 10.1021/cr040478b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Muniandy PA, Liu J, Majumdar A, Liu ST, Seidman MM. Crit Rev Biochem Mol Biol. 2010;45:23–49. doi: 10.3109/10409230903501819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dutta S, Chowdhury G, Gates KS. J Am Chem Soc. 2007;129:1852–1853. doi: 10.1021/ja067294u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Edfeldt NB, Harwood EA, Sigurdsson ST, Hopkins PB, Reid BR. Nucleic Acids Res. 2004;32:2795–2801. doi: 10.1093/nar/gkh607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.A) Champeil E, Paz MM, Ladwa S, Clement CC, Zatorski A, Tomasz M. J Am Chem Soc. 2008;130:9556–9565. doi: 10.1021/ja802118p. [DOI] [PMC free article] [PubMed] [Google Scholar]; B) Huang H, Kim HY, Kozekov ID, Cho YJ, Wang H, Kozekova A, Harris TM, Rizzo CJ, Stone MP. J Am Chem Soc. 2009;131:8416–8424. doi: 10.1021/ja809543j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Freese E, Cashel M. Biochim Biophys Acta. 1964;91:67–77. doi: 10.1016/0926-6550(64)90171-9. [DOI] [PubMed] [Google Scholar]
  • 11.Santangelo TJ, Roberts JW. Formation of Long DNA Templates Containing Site-Specific Alkane–Disulfide DNA Interstrand Cross-Links for Use in Transcription Reactions. In: Adhya Sankar L, Garges Susan., editors. Methods in Enzymology. Vol. 371. Academic Press; 2003. pp. 120–132. [DOI] [PubMed] [Google Scholar]
  • 12.Endo M, Majima T. J Am Chem Soc. 2003;125:13654–13655. doi: 10.1021/ja036752l. [DOI] [PubMed] [Google Scholar]
  • 13.Alefelder S, Sigurdsson ST. Bioorg Med Chem. 2000;8:269–273. doi: 10.1016/s0968-0896(99)00280-1. [DOI] [PubMed] [Google Scholar]
  • 14.Op de Beeck M, Madder A. J Am Chem Soc. 2011;133:796–807. doi: 10.1021/ja1048169. [DOI] [PubMed] [Google Scholar]
  • 15.Sczepanski JT, Hiemstra CN, Greenberg MM. Bioorg Med Chem. 2011;19:5788–5793. doi: 10.1016/j.bmc.2011.08.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Xiong H, Seela F. J Org Chem. 2011;76:5584–5597. doi: 10.1021/jo2004988. [DOI] [PubMed] [Google Scholar]
  • 17.(a) El-Sagheer A, Brown T. Chem Soc Rev. 2010;39:1388–1405. doi: 10.1039/b901971p. [DOI] [PubMed] [Google Scholar]; (b) Kumar R, El-Sagheer A, Tumpane J, Lincoln P, Wilhelmsson LM, Brown T. J Am Chem Soc. 2007;129:6859–6864. doi: 10.1021/ja070273v. [DOI] [PubMed] [Google Scholar]
  • 18.Andersen CS, Knudsen MM, Chhabra R, Liu Y, Yan H, Gothelf KV. Bioconjug Chem. 2009;20:1538–1546. doi: 10.1021/bc900078c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wolfe SA, Verdine GL. J Am Chem Soc. 1993;115:12585–12586. [Google Scholar]
  • 20.Noll DM, Silva MW, Noronha AM, Wilds CJ, Colvin OM, Gamcsik MP, Miller PS. Biochemistry. 2005;44:6764–6775. doi: 10.1021/bi050014n. [DOI] [PubMed] [Google Scholar]
  • 21.Angelov T, Guainazzi A, Schärer OD. Org Lett. 2009;11:661–664. doi: 10.1021/ol802719a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wolfe SA, Ferentz AE, Grantcharova V, Churchill MEA, Verdine GL. Chem Biol. 1995;2:213–221. doi: 10.1016/1074-5521(95)90271-6. [DOI] [PubMed] [Google Scholar]
  • 23.Cimino GD, Gamper HB, Isaacs ST, Hearst JE. Annu Rev Biochem. 1985;54:1151. doi: 10.1146/annurev.bi.54.070185.005443. [DOI] [PubMed] [Google Scholar]
  • 24.Jamieson ER, Lippard SJ. Chem Rev. 1999;99:2467–2498. doi: 10.1021/cr980421n. [DOI] [PubMed] [Google Scholar]
  • 25.Seeman NC. Methods Mol Biol. 2005;303:427–431. doi: 10.1385/1-59259-901-X:143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zhu L, Lukeman PS, Canary JW, Seeman NC. J Am Chem Soc. 2003;125:10178–10179. doi: 10.1021/ja035186r. [DOI] [PubMed] [Google Scholar]
  • 27.Liu Y, Sha R, Wang R, Ding L, Canary JW, Seeman NC. Tetrahedron. 2008;64:8417–8422. doi: 10.1016/j.tet.2008.05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Liu Y, Wang R, Ding L, Sha R, Lukeman PS, Canary JW, Seeman NC. Chem Bio Chem. 2008;9:1641–1648. doi: 10.1002/cbic.200800032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Liu Y, Wang R, Ding L, Sha R, Seeman NC, Canary JW. Chem Sci. 2012;3:1930–1937. doi: 10.1039/C2SC20129A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Liu Y, Kuzuya A, Sha R, Guillaume J, Wang R, Canary JW, Seeman NC. J Am Chem Soc. 2008;130:10882–10883. doi: 10.1021/ja8041096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Caruthers MH, Barone AD, Beaucage SL, Dodds DR, Fisher EF, McBride LJ, Matteucci M, Stabinsky Z, Tang J. Chemical synthesis of deoxyoligonucleotides by the phosphoramidite method. In: Wu Ray, Grossman Lawrence., editors. Methods in Enzymology. Vol. 154. Academic Press; 1987. pp. 287–313. [DOI] [PubMed] [Google Scholar]
  • 32.Snyder TM, Tse BN, Liu DR. J Am Chem Soc. 2008;130:1392–1401. doi: 10.1021/ja076780u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Fridman AS, Galyuk EN, Vorobiev VI, Skvortsov AN, Lando DY. J Biomol Struct Dyn. 2008;26:175–186. doi: 10.1080/07391102.2008.10507233. [DOI] [PubMed] [Google Scholar]
  • 34.Wang X, Zhang X, Mao C, Seeman NC. Proc Nat Acad Sci. 2010;107:12547–12552. doi: 10.1073/pnas.1000105107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Arnott S, Hukins DWL, Dover SD. Biochem Biophys Res Commun. 1972;48:1392. doi: 10.1016/0006-291x(72)90867-4. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ESI

RESOURCES