Abstract
Squaramate‐linked 2′‐deoxycytidine 5′‐O‐triphosphate was synthesized and found to be good substrate for KOD XL DNA polymerase in primer extension or PCR synthesis of modified DNA. The resulting squaramate‐linked DNA reacts with primary amines to form a stable diamide linkage. This reaction was used for bioconjugations of DNA with Cy5 and Lys‐containing peptides. Squaramate‐linked DNA formed covalent cross‐links with histone proteins. This reactive nucleotide has potential for other bioconjugations of nucleic acids with amines, peptides or proteins without need of any external reagent.
Keywords: bioconjugation, cross-linking reactions, DNA, DNA polymerase, proteins
Protein–DNA interactions are of crucial importance in DNA packaging, replication, transcription, epigenetic modifications, and repair.1 Transcription factors (TFs) are particularly important DNA‐binding proteins that regulate gene expression through sequence‐specific binding to promoter sequences. Among the approximately 1600 known human TFs, the detailed biological role and binding motifs are fully understood only for a small fraction.2 Although, there is a number of methods for studying of protein–DNA interactions and for identification of DNA‐binding proteins,3 there is still an urgent need of other alternative methods, in particular for weakly binding proteins. Covalent cross‐linking is one of the most promising methods for identification of DNA‐binding proteins, but covalent protein–DNA conjugates are also useful for other applications in chemical biology or biosensing.4
There are some general non‐specific cross‐linking methods based on photochemical generation of radicals (from 5‐halouracils)5 or carbenes (from diazirine‐linked nucleobases)6 in DNA which bind randomly to neighboring amino acids through C−H activations. More challenging but potentially very useful are reactions specific for one or several amino‐acid side‐chains, but so far a very limited number of them have been reported for DNA–protein cross‐linking. Thiol‐linked DNA can cross‐link with Cys‐containing proteins through disulfide formation.7 Vinylsulfonamide‐linked DNA was reported to cross‐link with Cys,8 whereas chloroacetamide cross‐linked with proteins through Cys or His.9 In both cases, proximity effect was crucial for efficient formation of the covalent cross‐link between modified DNA and protein. Most frequent were reports on cross‐linking of aldehyde‐linked DNA with Lys either through inefficient and reversible Schiff‐base formation10 or (more often) through irreversible reductive amination,11, 12 which, requires an additional stoichiometric reductant (for example, toxic NaBH3CN, which complicates any in cellulo or in vivo usage). Lys–DNA interactions are very frequent and important, in particular in histones. So far, no reactive nucleobase‐modification in DNA has been reported to form irreversible cross‐links with Lys without an external reagent.
Mono‐amides of squaric acid (squaramates) are often used for bioconjugations with Lys and other amines.13 Diamides (squaramides) have been used as a phosphate surrogate in nucleotide14 or oligonucleotide (ON) analogues.15 A chemically synthesized 2′‐sugar‐linked squaramate–RNA conjugate, prepared through reaction of 2′‐amino‐modified RNA with diethyl squarate, was reported to cross‐link to aminoacyl‐transferase FemXWv,16 as the only example of its use in nucleic‐acid conjugation. Within the framework of our program aimed at base‐functionalized nucleic acids for applications in chemical biology,17 we designed novel squaramate‐linked cytosine 2′‐deoxyribonucleoside triphosphate (dNTP) for the enzymatic synthesis of modified DNA and cross‐linking with proteins.
The synthesis of the desired modified nucleotides started with the preparation of 5‐(3‐aminopropynyl)‐2′‐deoxycytidine (1) by deacylation of known trifluoroacetylamide,18 see Scheme S1 in Supporting Information. The reaction of amine 1 with 2 equiv of diethyl squarate gave the squaramate‐linked nucleoside dCESQ in 80 % yield (Scheme 1). Standard Yoshikawa phosphorylation19 with POCl3 gave the monophosphate dCESQMP in 37 % yield, whereas the triphosphorylation20 with POCl3 followed by pyrophosphate and triethylammonum bicarbonate (TEAB) gave the triphosphate dCESQTP in 7 % yield. The monophosphate dCESQMP served as model compound for reactions with Lys and peptides (Scheme 1). The reaction of dCESQMP with Ac‐Lys or a Lys‐containing tripeptide proceeded at room temperature in borate buffer (pH 9) overnight to give the desired conjugates dCESQLysMP or dCESQ3 pept MP in 54 and 63 %, respectively. These nucleotide–peptide conjugates were isolated by HPLC in pure form and fully characterized by NMR spectroscopy and mass spectrometry (MS) to confirm the expected formation of the amide bond with Lys.
Then, we tested the squaramate‐linked dNTP (dCESQTP) as substrate for DNA‐polymerase‐catalyzed synthesis of modified DNA (Figure 1 a). First, we performed the primer extension (PEX) in the presence of KOD XL DNA polymerase with either 19‐, 20‐, 31‐ or 98‐mer template and a 13‐, 15‐ or 25‐mer primer (for the oligonucleotide sequences, see Tables S1 and S2 in the Supporting Information). In all cases (Figure 1 b and Supporting Information, Figure S1), we observed the formation of full‐length PEX products containing one, four or eighteen dCESQ modifications. PCR amplification with either 98‐bp or 235‐bp templates also proceeded well, giving a strong band corresponding to the modified amplicon (Supporting Information, Figure S2). This shows that the reactive dCESQTP nucleotide does not react with DNA polymerase (neither during extension nor when modified DNA is used as template) and is a good substrate and building block for the enzymatic synthesis of reactive, modified DNA probes.
Next, we tested the cross‐linking reactions of dCESQ‐linked DNA with amines and peptides (Figure 1 a,c). The reactions of 20‐bp DNA_CESQ were performed at room temperature in borate buffer (pH 9). The reaction with Sulfo‐Cy5‐NH2 (100 equiv) gave the desired fluorescently labeled DNA_CESQCy5 (Supporting Information, Figure S3). Analogous reactions of DNA_CESQ with Ac‐Lys and Lys‐containing tripeptide or decapeptide were conducted with large excess (approximately 2500 equiv) of the peptides, since no proximity effect was expected in these non‐DNA‐binding peptides. Under these conditions, the reactions proceeded with moderate efficiency and gave the desired cross‐linked conjugates DNA_CESQLys, DNA_CESQ3pept or DNA_CESQ10pept with 50, 43, and 20 % conversions, respectively (Figure 1 c). All conjugates were characterized and confirmed by MALDI MS (Supporting Information, Table S4).
Finally, the reactive DNA probe DNA_CESQ was tested in reactions with proteins. We used bovine serum albumin (BSA) as negative control of a protein containing 60 lysines that does not interact with DNA, the core‐domain of p5321 as a DNA‐binding protein containing lysines but not in the binding site, and a set of recombinant H2A, H2B, H3.1, and H4 histones, as examples of Lys‐rich proteins that strongly bind to DNA. The cross‐linking reactions were performed with only 2 equiv of the corresponding proteins. Since the histones are known to form dimers and oligomers, we assume that this ratio is probably effectively close to equimolar. To be closer to physiological conditions, we used phosphate (or TRIS or HEPES, see Figure S8 in the Supporting Information) buffers (pH 7.4).
At first, we performed a simple kinetic study of the cross‐linking reaction of DNA_CESQ with histone H3.1 to show that the reaction reaches the maximum conversion in 16–24 h (Supporting Information, Figure S5). Therefore, we used 36 h reactions in other cases to ensure sufficient conversions. Figure 2 shows the results of the cross‐linking experiments with proteins. To our delight, the reactions of DNA_CESQ with all four recombinant histones gave the covalent cross‐linked conjugates with lower mobility on a denaturing SDS‐PAGE gel (Figure 2 b). The conversions of these reactions calculated from the SDS‐PAGE were 31–34 % (Supporting Information, Table S5). The identity of the covalent DNA–protein conjugates with H2B, H3.1, and H4 histones was also confirmed by SDS‐PAGE with protein staining (Coomassie Blue, Supporting Information, Figure S7) and by HPLC‐MS analysis using electrospray ionization (Supporting Information, Figures S16–S18). Also, a longer 98‐bp PEX product containing 18 squaramate groups reacted with histone H3.1, though mixture of cross‐linked products was obtained (Supporting Information, Figure S10). On the other hand, DNA_CESQ did not cross‐link with BSA or p53 (Supporting Information, Figure S9) or with DNA polymerase during the PEX or PCR. These results show that the proximity effect (presence of lysine(s) close to the DNA‐binding site of protein) is crucial for efficient cross‐linking in the absence of large excess of the peptide or protein.
In conclusion, we designed and synthesized a novel squaramate‐linked dNTP (dCESQTP) and demonstrated that it was a very good substrate for KOD XL DNA polymerase in PEX or PCR synthesis of reactive DNA probes. The squaramate group reacts with amines to form a stable covalent diamide (squaramide) linkage. We have shown that the dCESQ‐modified DNA probes reacted with amino‐linked Cy5 to form fluorescently labeled DNA. Its reactions with Lys‐containing peptides proceeded only when a large excess of the peptide was present. On the other hand, in reactions with Lys‐containing DNA‐binding proteins, where the proximity effect helps, the reactions proceed with good conversions even in almost equimolar ratio. Compared to previously reported DNA–Lys conjugations based on reductive amination,11, 12 the squaramate modification and its transformation to a stable amide proceeds under physiological conditions (at pH 7.4–9) and does not require any external reagent (i.e. toxic NaBH3CN used in reductive aminations11, 12). Therefore, this reactive modification and the presented methodology has good potential in the post‐synthetic labeling of DNA,22 bioconjugations of DNA with peptides, proteins or other biomolecules,4 as well as in cross‐linking experiments to identify and study DNA‐binding proteins. Further research along these lines is under way in our lab.
Conflict of interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
This work was supported by the Czech Academy of Sciences (Praemium Academiae award to M. Ho.) and by the Czech Science Foundation (18‐03305S to I. I. and M. Ho.). The authors thank J. Konč (University of Cambridge) for advice with MS characterization of the conjugates.
I. Ivancová, R. Pohl, M. Hubálek, M. Hocek, Angew. Chem. Int. Ed. 2019, 58, 13345.
Contributor Information
Ivana Ivancová, http://www.uochb.cas.cz/hocekgroup.
Prof. Dr. Michal Hocek, Email: hocek@uochb.cas.cz.
References
- 1. Rohs R., Jin X., West S. M., Joshi R., Honig B., Mann R. S., Annu. Rev. Biochem. 2010, 79, 233–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Lambert S. A., Jolma A., Campitelli L. F., Das P. K., Yin Y., Albu M., Chen X., Taipale J., Hughes T. R., Weirauch M. T., Cell 2018, 172, 650–665. [DOI] [PubMed] [Google Scholar]
- 3. Furey T. S., Nat. Rev. Genet. 2012, 13, 840–852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.
- 4a. Trads J. B., Tørring T., Gothelf K. V., Acc. Chem. Res. 2017, 50, 1367–1374; [DOI] [PubMed] [Google Scholar]
- 4b. Tretyakova N. Y., Groehler A., Ji S., Acc. Chem. Res. 2015, 48, 1631–1644; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4c. Niemeyer C. M., Angew. Chem. Int. Ed. 2010, 49, 1200–1216; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2010, 122, 1220–1238; [Google Scholar]
- 4d. Vinkenborg J. L., Mayer G., Famulok M., Angew. Chem. Int. Ed. 2012, 51, 9176–9180; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2012, 124, 9311–9315; [Google Scholar]
- 4e. Rosen C. B., Kodal A. L. B., Nielsen J. S., Schaffert D. H., Scavenius C., Okholm A. H., Voigt N. V., Enghild J. J., Kjems J., Tørring T., Gothelf K. V., Nat. Chem. 2014, 6, 804–809; [DOI] [PubMed] [Google Scholar]
- 4f. Nielsen T. B., Thomsen R. P., Mortensen M. R., Kjems J., Nielsen P. F., Nielsen T. E., Kodal A. L. B., Clo E., Gothelf K. V., Angew. Chem. Int. Ed. 2019, 10.1002/anie.201903134; [DOI] [Google Scholar]; Angew. Chem. 2019, 10.1002/ange.201903134. [DOI] [Google Scholar]
- 5. Norris C. L., Meisenheimer P. L., Koch T. H., J. Am. Chem. Soc. 1996, 118, 5796–5803. [Google Scholar]
- 6.
- 6a. Winnacker M., Breeger S., Strasser R., Carell T., ChemBioChem 2009, 10, 109–118; [DOI] [PubMed] [Google Scholar]
- 6b. Lercher L., McGouran J. F., Kessler B. M., Schofield C. J., Davis B. G., Angew. Chem. Int. Ed. 2013, 52, 10553–10558; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2013, 125, 10747–10752. [Google Scholar]
- 7.
- 7a. Huang H., Chopra R., Verdine G. L., Harrison S. C., Science 1998, 282, 1669–1675; [DOI] [PubMed] [Google Scholar]
- 7b. Mishina Y., He C., J. Am. Chem. Soc. 2003, 125, 8730–8731. [DOI] [PubMed] [Google Scholar]
- 8. Dadová J., Orság P., Pohl R., Brázdová M., Fojta M., Hocek M., Angew. Chem. Int. Ed. 2013, 52, 10515–10518; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2013, 125, 10709–10712. [Google Scholar]
- 9. Olszewska A., Pohl R., Brázdová M., Fojta M., Hocek M., Bioconjugate Chem. 2016, 27, 2089–2094. [DOI] [PubMed] [Google Scholar]
- 10.
- 10a. Raiber E.-A., Portella G., Cuesta S. Martínez, Hardisty R., Murat P., Li Z., Iurlaro M., Dean W., Spindel J., Beraldi D., Liu Z., Dawson M. A., Reik, Balasubramanian W. S., Nat. Chem. 2018, 10, 1258–1266; [DOI] [PubMed] [Google Scholar]; Balasubramanian S., Nat. Chem. 2018, 10, 1258–1266;30349137 [Google Scholar]
- 10b. Li F., Zhang Y., Bai J., Greenberg M. M., Xi Z., Zhou C., J. Am. Chem. Soc. 2017, 139, 10617–10620; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10c. Ji S., Shao H., Han Q., Seiler C. L., Tretyakova N. Y., Angew. Chem. Int. Ed. 2017, 56, 14130–14134; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2017, 129, 14318–14322. [Google Scholar]
- 11.
- 11a. Ji S., Fu I., Naldiga S., Shao H., Basu A. K., Broyde S., Tretyakova N. Y., Nucleic Acids Res. 2018, 46, 6455–6469; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11b. Pande P., Ji S., Mukherjee S., Schärer O. D., Tretyakova N. Y., Basu A. K., Chem. Res. Toxicol. 2017, 30, 669–677; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11c. Wickramaratne S., Mukherjee S., Villalta P. W., Schärer O. D., Tretyakova N. Y., Bioconjugate Chem. 2013, 24, 1496–1506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.
- 12a. Raindlová V., Pohl R., Hocek M., Chem. Eur. J. 2012, 18, 4080–4087; [DOI] [PubMed] [Google Scholar]
- 12b. Krömer M., Bártová K., Raindlová V., Hocek M., Chem. Eur. J. 2018, 24, 11890–11894. [DOI] [PubMed] [Google Scholar]
- 13.
- 13a. Storer R. I., Aciro C., Jones L. H., Chem. Soc. Rev. 2011, 40, 2330–2346; [DOI] [PubMed] [Google Scholar]
- 13b. Wurm F. R., Klok H.-A., Chem. Soc. Rev. 2013, 42, 8220–8236; [DOI] [PubMed] [Google Scholar]
- 13c. Marchetti L. A., Kumawat L. K., Mao N., Stephens J. C., Elmes R. B. P., Chem 2019, 5, 1398–1485. [Google Scholar]
- 14.
- 14a. Niewiadomski S., Beebeejaun Z., Denton H., Smith T. K., Morris R. J., Wagner G. K., Org. Biomol. Chem. 2010, 8, 3488–3499; [DOI] [PubMed] [Google Scholar]
- 14b. Soukarieh F., Nowicki M. W., Bastide A., Pöyry T., Jones C., Dudek K., Patwardhan G., Meullenet F., Oldham N. J., Walkinshaw M. D., Willis A. E., Fischer P. M., Eur. J. Med. Chem. 2016, 124, 200–217; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14c. Zhang Y., Jumppanen M., Maksimainen M. M., Auno S., Awol Z., Ghemtio L., Venkannagari H., Lehtiö L., Yli-Kauhaluoma J., Xhaard H., Gennäs G. Boije Af, Bioorg. Med. Chem. 2018, 26, 1588–1597. [DOI] [PubMed] [Google Scholar]
- 15.
- 15a. Sato K., Seio K., Sekine M., J. Am. Chem. Soc. 2002, 124, 12715–12724; [DOI] [PubMed] [Google Scholar]
- 15b. Dürr E.-M., Doherty W., Lee S. Y., El-Sagheer A. H., Shivalingam A., McHugh P. J., Brown T., McGouran J. F., ChemistrySelect 2018, 3, 12824–12829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Fonvielle M., Sakkas N., Iannazzo L., Le Fournis C., Patin D., Mengin-Lecreulx D., El-Sagheer A., Braud E., Cardon S., Brown T., Arthur M., Etheve-Quelquejeu M., Angew. Chem. Int. Ed. 2016, 55, 13553–13557; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2016, 128, 13751–13755. [Google Scholar]
- 17.
- 17a. Hocek M., J. Org. Chem. 2014, 79, 9914–9921; [DOI] [PubMed] [Google Scholar]
- 17b. Hocek M., Acc. Chem. Res. 2019, 52, 1730–1737. [DOI] [PubMed] [Google Scholar]
- 18. Jäger S., Rasched G., Kornreich-Leshem H., Engeser M., Thum O., Famulok M., J. Am. Chem. Soc. 2005, 127, 15071–15082. [DOI] [PubMed] [Google Scholar]
- 19. Yoshikawa M., Kato T., Takenishi T., Tetrahedron Lett. 1967, 8, 5065–5068. [DOI] [PubMed] [Google Scholar]
- 20. Ludwig J., Acta Biochim. Biophys. Hung. 1981, 16, 131–134. [PubMed] [Google Scholar]
- 21.
- 21a. Brazdova M., Palecek J., Cherny D. I., Billova S., Fojta M., Pecinka P., Vojtesek B., Jovin T. M., Palecek E., Nucleic Acids Res. 2002, 30, 4966–4974; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21b. Fojta M., Pivonkova H., Brazdova M., Nemcova K., Palecek J., Vojtesek B., Eur. J. Biochem. 2004, 271, 3865–3876. [DOI] [PubMed] [Google Scholar]
- 22.Reviews:
- 22a. Singh Y., Murat P., Defrancq E., Chem. Soc. Rev. 2010, 39, 2054–2070; [DOI] [PubMed] [Google Scholar]
- 22b. Merkel M., Peewasan K., Arndt S., Ploschik D., Wagenknecht H.-A., ChemBioChem 2015, 16, 1541–1553. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.