Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2016 Feb 24;55(13):4348–4352. doi: 10.1002/anie.201510313

Orthogonal Protein Assembly on DNA Nanostructures Using Relaxases

Sandra Sagredo 1, Tobias Pirzer 2, Ali Aghebat Rafat 2, Marisa A Goetzfried 2, Gabriel Moncalian 1, Friedrich C Simmel 2,, Fernando de la Cruz 1,
PMCID: PMC5067690  PMID: 26915475

Abstract

DNA‐binding proteins are promising reagents for the sequence‐specific modification of DNA‐based nanostructures. Here, we investigate the utility of a series of relaxase proteins—TrwC, TraI, and MobA—for nanofunctionalization. Relaxases are involved in the conjugative transfer of plasmids between bacteria, and bind to their DNA target sites via a covalent phosphotyrosine linkage. We study the binding of the relaxases to two standard DNA origami structures—rodlike six‐helix bundles and flat rectangular origami sheets. We find highly orthogonal binding of the proteins with binding yields of 40–50 % per binding site, which is comparable to other functionalization methods. The yields differ for the two origami structures and also depend on the position of the binding sites. Due to their specificity for a single‐stranded DNA target, their orthogonality, and their binding properties, relaxases are a uniquely useful addition to the toolbox available for the modification of DNA nanostructures with proteins.

Keywords: bacterial conjugation, bioconjugation, DNA nanotechnology, DNA origami, relaxase


Recent advances in DNA nanotechnology have resulted in methodologies that enable the generation of molecular objects made from DNA with almost any user‐defined shape,1 and these structures also facilitate the arrangement of functional molecular components or nanoparticles (e.g., proteins) into three‐dimensional configurations with nanometer precision. One of the most popular methods is the DNA origami technique, which utilizes hundreds of short DNA oligomers, called “staples” to fold a long DNA scaffold into a desired shape.1a,1c

A variety of approaches were previously utilized for the functionalization of DNA nanostructures with proteins. An often used modification is streptavidin, which binds to biotin with a dissociation constant of K D=40 fm,2 constituting one of the strongest noncovalent bonds known in nature. In order to facilitate the binding of streptavidin to DNA nanostructures, biotinylated oligonucleotide staples are incorporated into the structures at the desired locations. The effective yield for streptavidin‐binding on origami structures is roughly 85 % per binding site.3 Naturally occurring DNA‐binding proteins typically show dissociation constants in the nanomolar range and thus could also be used for modification of DNA nanostructures. A good example are zinc finger proteins, which bind to DNA in the presence of Zn2+ and result in effective binding yields of 30–70 % for DNA nanostructures.4 Other proteins that have been used in DNA origami assemblies are DNA–antibody conjugates5 and engineered proteins containing His‐,6 Snap‐, and Halo‐tags.7 Other covalent conjugation techniques such as coupling by transglutaminase8 or methyltransferases,9 expressed protein ligation, and enzymatic ligation10 also provide site‐specific conjugation methods with controlled stoichiometry. Alternatively, chemical crosslinkers containing maleimide or N‐hydroxysuccinimide groups are extensively used for nonspecific covalent coupling (for a more complete overview see Ref. 11). The disadvantage of most of the aforementioned methods is that they are laborious and that the structure and thereby the activity of the proteins is compromised by the introduction of nonnative chemical tags.

In this work we present a novel strategy for DNA coupling based on relaxase proteins. Relaxases are involved in DNA processing for bacterial conjugation,12 the horizontal transfer of genomic material among bacteria. They are single‐strand (ss) DNA cleaving endonucleases, which naturally remain covalently bound to DNA after cleavage. Thus, relaxases offer a potentially superior alternative to existing methods for protein conjugation to DNA nanostructures, due to their sequence specificity, orthogonality and also their natural capacity to form covalent bonds with standard oligonucleotides. Relaxases belong to the HUH protein superfamily,13 and the three‐dimensional structure of some relaxases is known.14 Their overall shape resembles a left hand, where the β‐sheet “palm” holds the HUH motif (histidine–hydrophobic amino acid–histidine) responsible for metal coordination, and the α‐helix “thumb” contains the catalytic tyrosine. These conserved residues form the active center of each relaxase and allow cleavage of a phosphodiester bond located at a specific plasmid DNA sequence called the nic site. Each plasmid contains a particular relaxase, recognizing a different nic site, so the number of relaxase/nic site pairs is enormous. The cleavage reaction also occurs in vitro on ssDNA containing the nic region.15 The nic region contains an inverted repeat (IR) and an AT‐rich region downstream from nic. Upon binding ssDNA in vitro, the relaxase bends its substrate into a U shape and fits the nic sequence into its active center; here, the tyrosine generates a covalent 5′‐phosphotyrosine intermediate and a free 3′‐OH end (cf. Figure 1). In this state, cleavage and religation of substrate are in fact in a dynamic equilibrium.15b, 16 Each relaxase recognizes its own nic region with a K D between 1 nm and 400 nm, depending on the individual protein and the target sequence.

Figure 1.

Figure 1

Strategy for the coupling of relaxases onto DNA nanostructures. A) Extended staples contain the recognition sequence of a relaxase and several base pairs that allow the target to protrude from the nanostructure. Binding of the relaxase to its cognate site involves bending of the ssDNA to fit the nic site into the relaxase active center. Inset: the catalytic tyrosine carries out a transesterification reaction that links it covalently to the 5′‐phosphate between the scissile nucleotides. As a result, the relaxase becomes covalently bound to the extended staple. B) DNA nanostructures used in this approach to study the coupling of four relaxases.

In the following, we utilize the N‐terminal domain of relaxases TrwC from conjugative plasmid R388 (denoted as TrwCR388), TraI from R100 (TraIR100), MobA from R1162 (MobAR1162), and TraI from pKM101 (TraIpKM101). All relaxase domains are small monomeric proteins (20–30 kDa) with low K D 15a, 17 that can be engineered and fused to other proteins such as fluorescent proteins (Figures S6, S7, and S9). TraIR100 and TrwCR388 are able to cleave proficiently the nic region lacking the distal arm of the IR. The reported K d values of single‐stranded R100 and R388 nic regions are 2.4 nm 17a and 320 nm,17b respectively. It was shown that MobAR1162 effectively binds targets with the entire IR, but binding was compromised after deletion of the distal arm.15a The relaxase TraIpKM101 had not been characterized yet, but we expected a similar behavior as it has 51 % and 37.7 % sequence identity with TrwCR388 and TraIR100, respectively. Moreover, the pKM101 target is 64.7 % identical to the R100 target. Comparison of targets with and without a complete IR showed that in the pKM101 target a complete IR is also needed (cf. Figure S8).

DNA origami structures used in this work were long, rodlike six‐helix bundles (6HB) or flat, twist‐corrected rectangular origami sheets (tcRO), with dimensions of 430 nm×5 nm and 90 nm×60 nm, respectively. The binding sites with the target sequence were designed with a double‐stranded stem as a rigid linker in close proximity to the nanostructure followed by a single‐stranded region with the nic site (cf. Figures 1 A and S10 for further details). Along its length, each 6HB is equipped with specific binding positions for the aforementioned four relaxases. In order to increase the local concentration of binding sites, each binding position (see Figure 1 B and the Supporting Information) was equipped with five binding sites each for the specific relaxase. Due to the small size of the relaxase domains, we used fusion proteins of relaxases with fluorescent proteins mKATE or mCFP (45–62 kDa). These fused proteins resulted in increased visibility in TEM images. Each relaxase was incubated for 24 h with 6HB nanotubes containing all four binding positions. After incubation and purification, the number of occupied binding positions was determined from TEM images (Figure 2). The resulting binding yields per position were 93 % for TraIR100mmKATE, 84 % for TraIpKM101mCFP, 73 % for TrwCR388mCFP, and 34 % for MobAR1162mCFP.

Figure 2.

Figure 2

Orthogonal binding of relaxases to six‐helix bundles (6HB). A) TEM images of the relaxases TrwCR388mCFP, TraIR100mKATE, and MobAR1162mCFP bound to 6HB. Scale bar: 50 nm. B) Graphical representation of total binding yields and their orthogonality. For details on the statistical analysis see the Supporting Information. C) Recognition sequences of the relaxases used in this study. The nic site is depicted with ^ and the proximal arm of the inverted repeat is underlined.

Since the 6HB contained all four binding positions, quantitative values for the specificity of each relaxase to its target could be measured (see Figure 2). Proteins TraIR100mKATE, TrwCR388mCFP, and MobAR1162mCFP show highly selective binding abilities, with unspecific binding yields ranging from 0 % to 7.1 % (cf. Figure 2). Despite its high yield for its specific target, TraIpKM101mCFP shows significant nonspecific binding and therefore was not used simultaneously with the other relaxases in the following.

In further experiments we used the twist‐corrected rectangular origami sheet tcRO, for which we designed six binding positions with up to three redundant binding sites each (cf. Figure 3 A). We first compared the binding of the orthogonal relaxases TraIR100mKATE and TrwCR388mCFP onto the origami sheets. To test their binding yields, only position 1 for TraIR100mKATE and position 5 for TrwCR388mCFP, both equivalently at the rim of the tcRO next to the corners (cf. Figure 3), were used. Each protein was incubated for 24 h with the tcRO, each sheet containing two binding sites per position for each protein, and purified afterwards (see the Methods section in the Supporting Information for details). For visualization of the bound proteins we used atomic force microscopy (AFM) imaging (Figure 3). The binding yield of TraIR100mKATE was found to be 70 %, and 48 % for TrwCR388mCFP. Nonspecific binding to the unwanted position was 1 % for TraIR100mKATE and 3 % for TrwCR388mCFP.

Figure 3.

Figure 3

Position‐dependent binding of relaxases to the rectangular DNA sheet tcRO. A) Top: Illustration of the six binding positions with two or three binding sites each. Bottom: AFM image of the nanostructures decorated with TraIR100mKATE at position 1. B) Binding yields of TrwCR388mCFP and TraIR100mKATE to available sites. The relaxases bind preferably to their specifc target. C) Binding yield of TrwCR388 to tcRO equipped with two and three targets per position. Binding positions at the rim of the tcRO (1, 2, 5, and 6) show lower yields than in the center (3 and 4). For details on the statistical analysis see the Supporting Information.

We then checked for a potential position dependence of relaxase binding. To this end, the tcRO was equipped with all designed positions containing only R388 binding sites. From these experiments, the binding yield at the tcRO rim (positions 1, 2, 5, and 6 in Figure 3 A) was found to range between 40 % and 60 %, whereas it appeared to be significantly higher (between 60 % and 90 %) in the tcRO center (positions 3 and 4, cf. Figure 3 C). In general, binding yields varied considerably from experiment to experiment (±10 %). Surprisingly, within the range of this variation, we found no clear dependence of the binding yield on the number of binding sites per binding position. When only a single binding site for R388 was used, binding yields were as high as for multiple binding sites (between 40–50 %, see the Supporting Information).

Overall, the binding yields obtained by AFM were significantly lower than that expected based on the law of mass action (see the Supporting Information for details). For instance, in previous experiments with single‐stranded targets containing the nic site, the K D for the binding of TrwCR388 to its recognition sequence was determined to be 320 nm.17b This should translate into an equilibrium binding yield of 86 % for a single binding site and even 99.7 % for three binding sites under our incubation conditions, where TrwCR388 and tcRO concentrations were ca. 2 μm and 100 nm, respectively. As AFM imaging is performed with purified samples in buffer (i.e., without relaxases), however, re‐equilibration of the sample will result in unbinding of the proteins from the target and thus a reduced apparent binding yield. This is essentially unavoidable for relatively high K D values in the nanomolar range.

Since the covalent phosphotyrosine linkage of a relaxase to its nic site is in dynamic equilibrium with the unbound state, we also used phosphorothioate suicide oligonucleotides to shift the equilibrium towards the covalent bond (and thus achieve a much lower effective K D). In suicide oligonucleotides, the 3′ oxygen is replaced by sulfur at the scissile phosphodiester bond, preventing the religation reaction.18 Surprisingly, the binding yield did not improve with these oligonucleotides (see the Supporting Information for details). The strong coupling theoretically expected for a covalent bond is thus not reflected in the actual binding yields.

Our findings are in line with results for other protein‐modification approaches for DNA origami. Streptavidin binding to biotin showed only about 85 %3 binding yield (with a theoretical value of close to 100 %) and also for covalent coupling methods only 84–90 % experimental binding yields were obtained.3 Apart from the re‐equilibration issue mentioned above, binding yields on origami structures may be reduced by a variety of other factors. Some target sites may be inaccessible or simply absent (e.g., because of missing staples), and diffusion to the target may be affected by the geometry of the binding site or the overall shape and flexibility of the underlying origami structure. For instance, in our experiments with six‐helix bundles, the binding sites were accessible from all sides, and we typically observed higher binding yields than for the tcRO structures.

In summary, we have introduced DNA‐binding relaxase proteins as promising reagents for the sequence‐addressable modification of DNA nanostructures with proteins. Binding yields on DNA origami structures ranged between 40–50 % for a single binding site, and were thus comparable to those of other modification strategies. Depending on the origami geometry, the specific relaxase, and also the utilization of redundant binding sites in close proximity, in some cases apparent binding yields as high as 90 % could be achieved. Upon further optimization it should be possible to fully exploit the covalent phosphotyrosine linkage between the relaxases and the DNA target site, and thus achieve even higher binding yields. At least two of the four investigated relaxases (TraI from plasmid R100 and TrwC from R388) showed very good orthogonal binding performance. Relaxases can be easily engineered and fused to other proteins of choice. As many relaxases from different sources and with unique recognition sequences are available, they potentially can be developed into a whole new class of orthogonal, sequence‐selective protein linkers for DNA nanotechnology. Of particular interest could be their application for the biological generation of DNA–protein hybrid nanostructures and for the modification of DNA nanostructures in vivo.

Supporting information

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supplementary

Acknowledgements

We gratefully acknowledge financial support through the Volkswagen Foundation (grant 86 395‐1), the European Commission (FP7 grant no. 248919 (BACTOCOM)), the DFG (SFB 1032 TP A2 and Nanosystems Initiative Munich), and EMBOEMBO. We thank F. Praetorius for providing the M13 origami scaffold and A. Kuzyk for initial work on this project.

S. Sagredo, T. Pirzer, A. Aghebat Rafat, M. A. Goetzfried, G. Moncalian, F. C. Simmel, F. de la Cruz, Angew. Chem. Int. Ed. 2016, 55, 4348.

Contributor Information

Prof. Dr. Friedrich C. Simmel, Email: simmel@tum.de.

Prof. Dr. Fernando de la Cruz, Email: delacruz@unican.es

References

  • 1. 
  • 1a. Rothemund P. W. K., Nature 2006, 440, 297–302; [DOI] [PubMed] [Google Scholar]
  • 1b. Douglas S. M., Dietz H., Liedl T., Högberg B., Graf F., Shih W. M., Nature 2009, 459, 414–418; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 1c. Kick B., Praetorius F., Dietz H., Weuster-Botz D., Nano Lett. 2015, 15, 4672–4676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Green N. M., Methods Enzymol. 1990, 184, 51–67. [DOI] [PubMed] [Google Scholar]
  • 3. Voigt N. V., Tørring T., Rotaru A., Jacobsen M. F., Ravnsbæk J. B., Subramani R., Mamdouh W., Kjems J., Mokhir A., Besenbacher F., Gothelf K. V., Nat. Nanotechnol. 2010, 5, 200–203. [DOI] [PubMed] [Google Scholar]
  • 4. Nakata E., Liew F. F., Uwatoko C., Kiyonaka S., Mori Y., Katsuda Y., Endo M., Sugiyama H., Morii T., Angew. Chem. Int. Ed. 2012, 51, 2421–2424; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2012, 124, 2471–2474. [Google Scholar]
  • 5. 
  • 5a. Yamazaki T., Heddle J. G., Kuzuya A., Komiyama M., Nanoscale 2014, 6, 9122–9126; [DOI] [PubMed] [Google Scholar]
  • 5b. Shaw A., Benson E., Högberg B., ACS Nano 2015, 9, 4968–4975. [DOI] [PubMed] [Google Scholar]
  • 6. 
  • 6a. Goodman R. P., Erben C. M., Malo J., Ho W. M., McKee M. L., Kapanidis A. N., Turberfield A. J., ChemBioChem 2009, 10, 1551–1557; [DOI] [PubMed] [Google Scholar]
  • 6b. Shen W., Zhong H., Neff D., Norton M. L., J. Am. Chem. Soc. 2009, 131, 6660–6661. [DOI] [PubMed] [Google Scholar]
  • 7. Saccà B., Meyer R., Erkelenz M., Kiko K., Arndt A., Schroeder H., Rabe K. S., Niemeyer C. M., Angew. Chem. Int. Ed. 2010, 49, 9378–9383; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2010, 122, 9568–9573. [Google Scholar]
  • 8. Tominaga J., Kemori Y., Tanaka Y., Maruyama T., Kamiya N., Goto M., Chem. Commun. 2007, 401–403. [DOI] [PubMed] [Google Scholar]
  • 9. Metelev V. G., Kubareva E. A., Vorob'eva O. V., Romanenkov A. S., Oretskaya T. S., FEBS Lett. 2003, 538, 48–52. [DOI] [PubMed] [Google Scholar]
  • 10. 
  • 10a. Takeda S., Tsukiji S., Nagamune T., Bioorg. Med. Chem. Lett. 2004, 14, 2407–2410; [DOI] [PubMed] [Google Scholar]
  • 10b. Sørensen R. S., Okholm A. H., Schaffert D., Kodal A. L. B., Gothelf K. V., Kjems J., ACS Nano 2013, 7, 8098–8104. [DOI] [PubMed] [Google Scholar]
  • 11. 
  • 11a. Yang Y. R., Liu Y., Yan H., Bioconjugate Chem. 2015, 26, 1381–1395; [DOI] [PubMed] [Google Scholar]
  • 11b. Saccà B., Niemeyer C. M., Chem. Soc. Rev. 2011, 40, 5910–5921. [DOI] [PubMed] [Google Scholar]
  • 12. de la Cruz F., Frost L. S., Meyer R. J., Zechner E. L., FEMS Microbiol. Rev. 2010, 34, 18–40. [DOI] [PubMed] [Google Scholar]
  • 13. Chandler M., de la Cruz F., Dyda F., Hickman A. B., Moncalián G., Ton-Hoang B., Nat. Rev. Microbiol. 2013, 11, 525–538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. 
  • 14a. Guasch A., Lucas M., Moncalián G., Cabezas M., Pérez-Luque R., Gomis-Rüth F. X., de la Cruz F., Coll M., Nat. Struct. Biol. 2003, 10, 1002–1010; [DOI] [PubMed] [Google Scholar]
  • 14b. Datta S., Larkin C., Schildbach J. F., Structure 2003, 11, 1369–1379; [DOI] [PubMed] [Google Scholar]
  • 14c. Monzingo A. F., Ozburn A., Xia S., Meyer R. J., Robertus J. D., J. Mol. Biol. 2007, 366, 165–178; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14d. Edwards J. S., Betts L., Frazier M. L., Pollet R. M., Kwong S. M., Walton W. G., Ballentine W. K., Huang J. J., Habibi S., Del Campo M., Meier J. L., Dervan P. B., Firth N., Redinbo M. R., Proc. Natl. Acad. Sci. USA 2013, 110, 2804–2809; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14e. Nash R. P., Habibi S., Cheng Y., Lujan S. A., Redinbo M. R., Nucleic Acids Res. 2010, 38, 5929–5943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. 
  • 15a. Bhattacharjee M. K., Meyer R. J., Nucleic Acids Res. 1993, 21, 4563–4568; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15b. Llosa M., Grandoso G., de la Cruz F., J. Mol. Biol. 1995, 246, 54–62. [DOI] [PubMed] [Google Scholar]
  • 16. Pansegrau W., Lanka E., J. Biol. Chem. 1996, 271, 13068–13076. [DOI] [PubMed] [Google Scholar]
  • 17. 
  • 17a. Harley M. J., Schildbach J. F., Proc. Natl. Acad. Sci. USA 2003, 100, 11243–11248; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17b. Lucas M., Gonzalez-Perez B., Cabezas M., Moncalian G., Rivas G., de la Cruz F., J. Biol. Chem. 2010, 285, 8918–8926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. González-Pérez B., Lucas M., Cooke L. A., Vyle J. S., de la Cruz F., Moncalián G., EMBO J. 2007, 26, 3847–3857. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supplementary


Articles from Angewandte Chemie (International Ed. in English) are provided here courtesy of Wiley

RESOURCES