Abstract
N 6-Adenosine methylation is the most abundant modification of mRNA. The three members of the YTH domain family proteins (YTHDF1–3) recognize in the cytoplasm the m6A-RNA modification. We screened a library of about 500,000 fragments (i.e., molecules with 11–20 non-hydrogen atoms) by docking into YTHDF2, which resulted in the identification of six active compounds among 47 tested in vitro (hit rate of 13%). The acquisition of 28 analogues of the docking hits provided an additional set of 10 active compounds (IC50 < 100 μM). Protein crystallography-guided optimization of a ligand-efficient fragment by the synthesis of 32 derivatives culminated in a series of YTHDF2 ligands, which show low-micromolar affinity measured by a fluorescence polarization (FP) assay and a homogeneous time-resolved fluorescence-based (HTRF) assay. The series is characterized by very favorable ligand efficiency (of about 0.3–0.4 kcal/mol per non-hydrogen atom). Compound 23 binds to YTHDF2 according to the FP and HTRF assays with a K d value of 1.3 μM and an IC50 value of 11 μM, respectively, and it is selective against all of the other YTH reader proteins. Several compounds of the series bind to the three YTHDF proteins with similar low-micromolar affinity, while they are less potent for YTHDC1 and YTHDC2. In contrast, compounds 17 and 30 bind also to YTHDC2, with K d of 6.3 and 4.9 μM, respectively. We also disclose six crystal structures of YTHDF2 in the complex with the fragments identified by docking.
Keywords: epitranscriptomics, m6A readers, docking, FRET binding assay, molecular dynamics, protein crystallography
Introduction
The YTH (YT521-B homology) domain-containing proteins are a family of RNA-binding proteins that specifically recognize N 6-methyladenosine (m6A), the most abundant internal modification in eukaryotic RNA. These proteins play critical roles in various biological processes, including mRNA metabolism, splicing, stability, and translation, influencing gene expression and cellular functions. − The YTH family consists of five members: YTHDF1 (from now on termed DF1), YTHDF2 (DF2), YTHDF3 (DF3), YTHDC1 (DC1), and YTHDC2 (DC2). DC1 is primarily nuclear, where it participates in mRNA splicing, processing, and export. In contrast, the DF proteins (DF1, DF2, and DF3) are mainly cytoplasmic and play essential roles in mRNA translation, stability, and degradation. Finally, DC2 is the latest discovered member of this protein family and is mostly cytoplasmic, even though it has been found to interact with nuclear components, suggesting a dual role in RNA processing and regulation. , Its functions appear to overlap with those of DF proteins, particularly in regulating RNA translation and stability. While other family members are broadly expressed across various cell types, DC2 is notably enriched in the testes, where it plays a critical role in germ cell development and maturation.
While it has been established that each YTH protein possesses unique functions, there is evidence indicating that they can compensate for one another under certain conditions, leading to functional redundancy. ,, Research has demonstrated that DF proteins can engage in context-dependent functional compensation. For instance, when one DF protein is knocked down, the others can partially compensate for the loss, maintaining the overall regulation of mRNA metabolism. , This phenomenon highlights the complexity of the YTH protein family, where the precise roles of individual members may vary depending on the cellular context, the specific mRNA targets, and the presence of other regulatory factors. ,,
Given their crucial role in gene expression regulation, it is not surprising that YTH proteins and m6A regulation are heavily implicated in various diseases, especially cancer. Our study focuses on DF2, which is involved in multiple types of cancer, including prostate cancer, MYC-driven breast cancer, and acute myeloid leukemia (AML). This makes DF2 a highly attractive target for drug discovery and is gaining more and more attention. Even though we focus mainly on DF2, the highly conserved m6A binding site of the DF proteins hinders the development of a DF2-selective ligand. Furthermore, the discussed compensatory effects make a pan-DF ligand desirable.
Only a few small-molecule ligands have been identified for the DF protein family. Among them are Ebselen, Tegaserod, and Salvianolic Acid C, previously known compounds repurposed from other targets. Reviews of known inhibitors of the YTH proteins can be found in refs and . In our earlier publication, we reported the first small-molecule binders of DF2; the X-ray crystal structure of one of them (compound 11, IC50 = 174 μM) is the starting point of this work. Subsequently, Wang et al. reported the discovery of DC-Y13–27, a DF2 inhibitor with an IC50 of 21.8 μM (measured using an AlphaScreen assay) and a K d of 38 μM (determined by microscale thermophoresis). The compound showed weaker activity on DF1 (IC50 = 165 μM in the AlphaScreen assay), but was not tested on DF3. A more recent study identified a series of functionalized pyrazoles as selective DF2 binders. The most potent compound, CK-75, exhibited an IC50 of 13.2 μM in an AlphaScreen assay and was found to be inactive against all other members of the YTH protein family. Notably, CK-75 induced cell cycle arrest and apoptosis in the K567 leukemia cell line, further supporting DF2 as a promising therapeutic target.
Here we present a new series of DF2 binders identified by docking, followed by structure–activity relationship (SAR)-by-catalog. We discovered new chemotypes that compete with m6A-RNA for binding to DF2. Medicinal chemistry optimization of a ligand-efficient scaffold resulted in a series of low-micromolar binders of DF2. Most compounds of the series show a preference for DF proteins against DC1 and DC2. Compound 23 binds only to DF2, being selective against all of the other YTH proteins.
Results and Discussion
Docking of a Library of Fragments
The chemical diversity represented by a library of N fragments (with up to 18–20 non-hydrogen atoms) is substantially larger than the diversity of a library of N molecules with more than 25–30 heavy atoms. As a consequence, fragment screening by docking is an efficient alternative to the screening of ultralarge libraries of compounds. Thus, we decided to start with a fragment docking campaign in DF2. A library of 500,000 fragments was docked using the program SEED. , The fragments were selected from the ZINC20 database according to the following rules: between 11 and 20 heavy atoms, at least one ring, and at least one sp3-hybridized carbon. The first two rules reflect the properties of the program SEED, which was developed for docking mainly rigid fragments. The third rule was selected as our first campaign had identified ligand fragments with methyl, ethyl, or cyclopropyl in the bottom of the aromatic pocket. For each of the extracted compounds, up to 20 conformers were generated using a distance geometry-based algorithm, and docked by SEED. Two structures of the m6A-RNA recognition domain of DF2 were used for docking (Figure ). The crystal structure in the complex with 6-cyclopropylpyrimidine-2,4-diol (compound 11 in ref PDB ID: 7R5W) and a snapshot obtained by molecular dynamics (MD) simulations started from the same crystal structure. The MD snapshot was selected by analysis of the time series of the volume of the recognition pocket. It has a more open recognition loop with a pocket volume of 600 Å3, which is significantly larger than the volume of 324 Å3 in the crystal structure 7R5W (Figure ).
1.

Two structures of the DF2 reader employed for docking. (A) X-ray crystal structure of DF2 in the complex with 6-cyclopropylpyrimidine-2,4-diol (compound 11 of ref PDB: 7R5W). (B) MD-simulation snapshot with a larger aperture of the m6A-recognition pocket (see Materials and Methods). The surface of DF2 is colored by the electrostatic potential (red, negative; blue, positive), and 6-cyclopropylpyrimidine-2,4-diol (carbon atoms in green) and the binding site residues are shown by sticks (carbon atoms in yellow).
The two protein structures were kept rigid during docking and evaluation of the binding energy. SEED calculates the binding energy by a force field with implicit treatment of the electrostatic effects of the solvent. The docked compounds were ranked according to two energy terms, namely, the total binding energy and the difference between the electrostatic contribution to the intermolecular interaction energy in the solvent and the solvation energy of the ligand. The top-scoring compounds were then selected if they showed the crucial hydrogen bond with the backbone carbonyl of C433, which is the acceptor of the N 6 of the natural ligand m6A, and another polar interaction within the binding site (see Materials and Methods). If an interesting compound was not commercially available, then a structurally similar analogue was chosen. Finally, 25 and 22 compounds were selected from the docking campaigns that made use of the crystal structure and MD snapshot, respectively (Table S1).
In Vitro Validation
A previously reported homogeneous time-resolved fluorescence (HTRF)-based assay was used to measure the binding affinity of the 47 ordered compounds (see Materials and Methods). For the nine compounds with residual signal at 1 mM smaller than 60% (with respect to dimethyl sulfoxide (DMSO) control), the IC50 value was determined by dose–response experiments (Table S1). Among these, the thiobarbiturate derivatives 1 and 2 were the strongest binders of DF2, with IC50 values of 19 and 170 μM, respectively (Table ). The thiobarbiturate derivative 1 shows a very favorable ligand efficiency (LE) of 0.50 kcal/(mol HAC) [HAC = heavy atoms count, i.e., number of non-hydrogen atoms] while the toluene group of compound 2 does not seem to contribute to binding.
1. m6-Adenine and 16 Ligands of DF2 Identified by Docking, Followed by SAR by Catalogue .
The NH group interacting as a hydrogen-bond donor with the backbone carbonyl of C433 (blue) and the group in the bottom of the tryptophan cage (red) are emphasized for the compounds with a crystal structure in the complex with DF2.
The residual signal at 1 mM compound concentration is measured using an HTRF-based binding assay as previously reported. The signal decreases (with respect to buffer-only measurement) when the fragment competes with the binding of the natural ligand, i.e., m6A-oligoRNA. Thus, the lower the signal, the higher the affinity of the fragment. The reported values are the average of two technical replicates.
We use the term IC50 for the concentration of the compound that reduces the signal by 50% with respect to buffer-only. YTH readers are not enzymes, but we still prefer the more frequently used inhibitory concentration (IC50) rather than effective concentration. The IC50 value for the DF2 reader domain was measured only for the fragments that, at a concentration of 1 mM, decrease the signal by more than 60%.
The ligand efficiency is calculated as and the values are reported in kcal/(mol HAC), HAC = heavy atom count.
Compounds 1–6 were identified by docking, while compounds 7–16 were selected by SAR by catalogue. * Interference or poor solubility observed at 1 mM. # Interference or poor solubility observed at higher concentrations, IC50 could not be determined.
The X-ray crystal structure was solved for compounds 3–6 at high resolution (Table and Figure B–E). The unique binding mode of compound 3 provided evidence that the ene-thiobarbiturate substructure of compounds 1–3 specifically and noncovalently binds to DF2 despite the potential Michael acceptor reactivity.
2.
Crystal structures of DF2/fragment complexes. (A–G) Binding modes of m6-adenine (compound 1 in ref , PDB: 7YWB), compounds 3 (PDB: 9QEM), 4 (PDB: 9QEL), 5 (PDB: 9QEO), 6 (PDB: 9QFI), 13 (PDB: 9QIU), and 15 (PDB: 9QFL), respectively. The conserved water molecule (red sphere) and the hydrogen bonds (yellow dashed lines) are also shown. (H) Structural overlap of panels (A)–(G). The carbon atoms of the ligands are colored green and those of the protein in cyan.
The six binders 1–6 (Table ) belong to four distinct chemotypes: thiobarbiturates (1–3), uracil (4), nicotinamide (5), and pyrazolopyrimidine (6). Despite their chemical diversity, these chemotypes feature an NH group that is involved as a hydrogen-bond donor with the backbone carbonyl of C433 (Figure B–E). Furthermore, they act as hydrogen-bond acceptor of the conserved water that forms hydrogen bonds with the side chains of W432 and D528. Notably, compounds 1–3 occupy the bottom of the aromatic cage with their sulfur atom, while compounds 4–6 have a methyl group as in the natural ligand. Compounds 1–4 and 6 act as hydrogen-bond acceptors for the backbone NH of Y418, and they also form a hydrogen bond with the side chain of D422. This interaction may offer selectivity against the nuclear reader DC1, which features the N367 hydrogen-bond donor NH2 in this position. In our first screening campaign, we described a series of uracil analogues; compound 4 is a new member of this series with a modest IC50 value of 250 μM. The amide nitrogen atom of compound 5 is involved in a hydrogen bond with the side chain of D528, but it does not form favorable interactions with D422 and the backbone NH of Y418. The 70° relative orientation and 4.8 Å distance between the ring centers indicate that the aromatic side chain of Y418 engages in an edge-to-face π-stacking interaction with 5. ,
SAR by Catalogue
From the information gained in the first screening, a second set of 28 compounds was ordered (Table S2). This set consisted of 14 top-ranking docked molecules with a chemotype similar to that of compounds 1–3, and other 14 molecules that are closely related to the discovered binders 1–6 but were not present in the library used for docking. A total of 16 of the ordered compounds belong to the thiobarbiturate chemotype, which was considered very promising from the previous results. Ten of the 28 compounds (ligands 7–16) showed an IC50 < 100 μM.
Among the thiobarbiturate derivatives 7–10, ligand 8 is the most potent (IC50 = 6 μM and LE = 0.36). The carboxylic acid substituent is likely to contribute to the binding affinity, as the solvent-exposed portion of the DF2 binding pocket is rich in positively charged residues (K416, K490, and R527; Figure H), which typically interact with the negatively charged phosphates of RNA, its natural substrate.
Other molecules closely related to the thiobarbiturates were identified as interesting binders: barbiturates 11 and 12, triazine 13, and condensed bicycles 14–16. The X-ray crystal structures of compounds 13 and 15 confirm their binding (Figure F,G), again showing the sulfur atom interacting within the lipophilic tryptophan cage.
Compound 15 can establish all of the interactions previously discussed: in addition to the interactions of m6-adenine, it can interact with both D422 and D528. (Note though that m6-adenine is likely to be protonated in the bound state as suggested by the distance of 2.6 Å between its N1 atom and the nearest O atom of the D422 side chain (PDB entry 7YWB).) Unfortunately, the multiple interactions of compound 15 do not translate into a very strong binding (IC50 = 86 μM). This might be a consequence of the major tautomer represented in Table being in equilibrium with other tautomeric forms, which are not ideal for efficient binding (∼67% of the desired tautomer, as calculated with the Chemaxon tautomers generator plugin https://plugins.calculators.cxn.io/tautomers/). Compounds 13 and 14 are characterized by very favorable LE values of 0.73 and 0.60, respectively. We decided to further explore the derivatization of compound 14 because of its favorable LE, the two possible vectors for substitution at the two carbon atoms of its thiophene ring, respectively, and the relatively accessible synthesis (see below). We could not determine the crystal structure of DF2 in complex with 14. The choice of compound 14 was supported by its pose predicted by docking, which overlapped with the binding mode of compound 15 in the high-resolution crystal structure with DF2 (PDB entry 9QFL).
Derivatization of Compound 14
The exploration of analogues of 14 has been pursued by synthesizing new derivatives (Tables and S4) and ordering commercially available variants (Table S3). The replacement of the exocyclic sulfur atom with oxygen was detrimental to the potency, from 18 to 290 μM (compound S61). Several 5- and 6-membered ring alternatives to the thiophene were explored (compounds 15, 16, S5, S62–S64), and only compound 16 resulted in an IC50 comparable to that of fragment 14 (30 vs 18 μM).
2. Expansion of Hit Fragment 14 .


For compounds with a single value, it refers to DF2. The dose–response curves and replicates are shown in the Figure S1.
The LE formula is shown in the caption of Table .
The K d values were determined with the Cheng–Prusoff equation using a K d value of 5 nM (DF2), 7 nM (DC2), and 5 nM (DC1) for the 5′-fluorescein-labeled m6A-DNA probe. The reported values are the averages of one (DC1) or two (DF2 and DC2) biological replicates, and each replicate is the average of four technical replicates. The dose–response curves are in Figures S2 (DF2), S3 (DC2), and S4 (DC1).
Compounds 17–31 are more active for DF2 than compound 14 in the HTRF and/or the FP assays.
The medicinal chemistry campaign focused on derivatizing the thiophene ring of 14 through the addition of R1 and/or R2 groups (Table and Table S4). A total of 32 molecules were synthesized (Table S4). Table shows the molecules with higher affinity than compound 14 as measured by the HTRF-based binding assay and/or FP. As mentioned above, we were not able to determine the crystal structure of 14 in complex with DF2. Thus we hypothesized a similar binding mode as in the crystal structure with compound 15 (Figure A). The putative binding poses will be further analyzed and discussed in the next section.
3.
Predicted binding modes of compounds 14, 17, 23, and 27 in DF2. (A) The pseudosymmetry of fragment 14 (carbon atoms in gray) results in two binding modes called here A (top) and B (bottom). The crystal structure of DF2 (cyan, PDB: 9QFL) in complex with fragment 15 (carbon atoms in green) is overlapped for comparison. (B–D) The most populated pose of compounds 17 (pose B), 23 (pose A), and 27 (pose A). Their alternative poses are shown in Figure S6. (E) Analysis of the MD simulations started from the two potential poses (pose A, left; pose B, right). The time series shows the median ligand root-mean-square deviation (RMSD) with respect to the first frame, with a colored band corresponding to one median absolute deviation around the median. (F) Principal component analysis (PCA) projections of the distances between atoms of the ligand and representative residues of the DF2 binding site. Data for MD snapshots saved every 1 ns are shown for the two poses (A, cross; B, diamond), and the centers of the two clusters are emphasized (yellow star).
Compound S74 (R2 = phenyl) and its derivatives (17–19, S75, and S76) were synthesized to try to obtain a favorable π–π stacking interaction with Y418, as seen in the X-ray crystal structure of the N-methyl-3-phenyl-1H-pyrazolo[4,3-d]pyrimidin-7-amine (compound 7 in ref , PDB: 7YXE). Among these compounds, 17 and 18 resulted in a 3-fold improved IC50 in the HTRF-based binding assay (Table ), possibly due to a hydrogen bond between the carbonyl group and the hydroxyl of Y418 or the −NH2 of N462. At R1, we started with small polar substituents to try to establish interactions with D528 and/or with the structural water molecule or to try to replace the water molecule (compounds 20, 21, S79, and S80). The ethyl ester of 21 at R1 enhanced the binding (IC50 = 6.3 μM), as also the bulkier benzyl (22, IC50 = 10 μM) and methylpyridine (23, IC50 = 11 μM) groups, both connected to the thiophene via an amide bond. We hypothesized that the −CH2– linker enables the aromatic ring to orient toward the solvent-exposed region of the pocket, which is enriched with positively charged residues that facilitate binding to the negatively charged RNA. Based on this, we explored modifications such as adding a carboxylic acid (24) and further increasing the flexibility by replacing the benzyl group with an alkyl chain (25 and S83). However, none of these changes led to an improvement in the potency.
We continued the exploration of R1 with substituted benzyl and phenyl rings directly connected to the thiophene (compounds 26–30 and S84–S87). Compounds 26 and 27 resulted in an improvement of 3- to 4-fold compared to fragment 14.
An FP competition assay was used to further validate the binding of the compounds to DF2. The main difference with respect to a previously published FP assay is the use of an oligo-DNA as competitor ligand (see Materials and Methods section). For most compounds, there is a factor of 2–5 difference between the IC50 values for DF2 measured by HTRF and FP (Table ). The largest discrepancy is a factor of 16 for compound 19 (32 and 2 μM by HTRF and FP, respectively; note that Table shows the K d value for FP, which is equal to the IC50/1.6 for DF2). These differences might originate from the varying conditions in the two assays, such as the competitor mRNA (5′-biotin-AAGAACCGG(m6A)CUAAGCU-3′) in HTRF and DNA (5′-FAM-AAGAACCGG(m6A)CTAAGCT-3′) in FP, the salt concentration (150 mM NaCl and 100 mM KF in HTRF, and 150 mM NaCl in FP), and the pH (7.5 in HTRF and 7.4 in FP). The FP-based assay was also employed to assess the selectivity against DC1 and DC2 (see Selectivity section).
Computational Analysis
We could not determine the crystal structure of the complex of DF2 with compound 14 or any of its derivatives by soaking the ligands into apo DF2 crystals or cocrystallization. Thus, we decided to run MD simulations to investigate the binding mode of 14 and its derivatives 17, 23, and 27 (Figure ). As already observed during the docking campaign, the symmetry of the thiourea substructure is congruent with two distinct poses (A and B), which are flipped by a rotation of 180° around the SC double bond. For each compound and pose, eight independent 0.2-μs MD simulations were performed for a cumulative sampling of 3.2 μs per compound, starting from the docked poses of compound 14, or the alignment of the derivatives 17, 23, and 27 to it.
The analysis of the MD trajectories was carried out by adapting the protocol described in ref (see Materials and Methods). Only the second half of each MD run, i.e., the trajectory segments from 100 to 200 ns, was used for the analysis to allow for sufficient ligand relaxation. From these segments, we extracted the simulation frames where the compound is bound, defined as having a distance lower than 5 Å between the exocyclic sulfur atom and the N atom of C433. Then we calculated a set of 10 distances between the heavy atoms of the compounds (all located in the scaffold, i.e., present in compound 14) and the binding pocket. Finally, principal component analysis (PCA) was used to project the multidimensional space into two dimensions. The two-dimensional data were then clustered using the Gaussian Mixture algorithm. The trajectory frame closest to the center of each cluster (centroid) was extracted and used as the reference pose (Figure F).
Figure A shows the two reference poses obtained for compound 14 compared to the crystal structure of compound 15. In both poses, the exocyclic sulfur atom is positioned within the lipophilic tryptophan cage. As mentioned above, compound 14 can adopt two distinct orientations, which are related by a 180-degree rotation. This flipping rearranges the hydrogen-bond interactions while preserving the same number of favorable polar contacts. In both poses, the two NH groups of the thiourea act as hydrogen-bond donors for the backbone carbonyl of C433 and the side chain of D422, respectively. Moreover, the carbonyl group and the thiophene sulfur atom are inverted between the two poses. In pose A, the carbonyl oxygen interacts with the backbone NH of Y418, as observed for the crystal structure with compound 15 (Figures G and A top). In pose B, it instead forms a hydrogen bond with the conserved water molecule (Figure A, bottom). The sulfur atom in the thiophene ring points toward the structural water in pose A and the backbone NH of Y418 in pose B. In both poses, it acts as a weak hydrogen-bond acceptor. − The root-mean-square deviation (RMSD) analysis of the MD runs started from the two poses suggests that pose A is slightly more stable (Figure E, top).
A similar analysis was performed for compounds 17, 23, and 27 to investigate the impact of bulky R1 and/or R2 substituents on the stability of the two poses. For compound 17, a smaller RMSD is observed for the B pose in comparison to A, and the opposite was observed for compound 23 (Figure E). A smaller difference emerges for compound 27, with pose A being slightly more stable.
We also calculated the population of each cluster (Figure F). As expected, the most populated cluster corresponds to the most stable pose based on RMSD analysis. Specifically, 55% of the bound frames of compound 14 belong to the cluster of pose A (Figure A, top); 61% of the bound frames of compound 17 to pose B (Figure B); 56% of compound 23 to pose A (Figure C); and 53% of compound 27 to pose A (Figure D).
Selectivity
The HTRF-based assay was employed to evaluate the selectivity of the series against other DF proteins. The compounds were tested at a concentration of 100 μM and most of them also inhibit DF1 and DF3 (Table S5).
A dose–response experiment on DF1 and DF3 was conducted for compounds 17, 18, 23, and 27 (Figure S5). Compounds 17, 18, and 27 are active on three DF proteins. Their IC50 values for DF1 are approximately twice as high as those for DF2, while for DF3, they are 4–12 times higher. In contrast, compound 23 is selective for DF2, with residual signals (at the highest employed concentration of 62.5 μM) of 51 and 87% for DF1 and DF3, respectively.
The binding affinity for DC1 could not be evaluated by HTRF because of interference of the compounds with the assay, which has a smaller assay window when performed with DC1 compared to the DF proteins. Thus, the FP assay was used to evaluate binding to DC1 and DC2. The affinity of the compounds for DC2 varied significantly depending on the substituents, with K d values ranging from 5 μM for compound 30 to over 400 μM for compounds 23, S74, S78, S79, S82, S83, and S89 (Tables and S4).
The parent scaffold, compound 14, exhibits approximately a 20-fold and 8-fold higher binding affinity for DF2 compared to DC2 and DC1, with K d values of 3.8, 88, and 32 μM, respectively, as measured by FP (Table ). Many derivatives maintain a strong preference for DF2. Notably, compound 23 shows high selectivity, being about 350 and 100 times more potent for DF2 than DC2 and DC1, respectively. MD simulations of compound 23 in complex with DC1 were performed following the same strategy described for DF2 (see the Computational Analysis section). The RMSD analysis (Figure S7) indicates a higher stability (lower RMSD) of both poses A and B in DF2 compared to DC1. These results are consistent with the selectivity of compound 23 for DF2 and against DC1.
In contrast, compounds with a substituted phenyl ring directly connected to the thiophene (R1 or R2) exhibited little to no selectivity against the DC proteins (e.g., ratio K d DC1/DF2 and DC2/DF2 of only 2–4 for ligands 29 and 30).
Compounds 17, 18, and 28–30 are the most potent DC binders of the series. They feature benzoic acid (at R2 in compound 18 and R1 in compound 30), benzoic ester (at R2 in compound 17 and R1 in compound 29), or para-methoxyphenyl (at R1 in compound 28). Their similar behavior indicates that the previously discussed two poses (A and B, see previous section) may also be populated in the DC1 and DC2 binding sites. These results suggest that the substituent, at either R1 or R2, likely occupies the same region of the binding pocket. To the best of our knowledge, the only previously identified DC2-binder in the literature is the pan-YTH binder “N-7”, with a reported IC50 of 30 μM (as measured by FP). Thus, compounds 17 and 30 are the most potent DC2 ligands as of today (K d values of 6.3 and 4.9 μM, respectively, measured by FP).
Chemistry
The general synthetic approach begins with the synthesis of the ethyl 2-aminothiophene-3-carboxylate derivatives (37a–g), with the corresponding R1 and/or R2 substituents. For compounds 37a–e this was achieved via a classic one-pot Gewald reaction (Scheme ). ,
1. Synthesis Route for Compounds 37a–e .
a Reagents and conditions: (a) for 37a–d: S8, morpholine, EtOH, 70 °C; for 37e: S8, morpholine, room temperature (rt).
For the preparation of compounds 37g–m, the substituted phenyl group was added via a Suzuki–Miyaura coupling from compound 39 (Scheme ), prepared by bromination and protection of compound 38. , Finally, the amino group was deprotected, affording ethyl 2-aminothiophene-3-carboxylate intermediates 35g–i.
2. Synthesis Route for Compounds 37f–g .
a Reagents and conditions: (a) (i) Boc2O, 4-dimethylaminopyridine (DMAP), dioxane, 0–80 °C; (ii) N-bromosuccinimide (NBS), AcOH/dichloromethane (DCM), −15 °C (b) boronic acid, K2PO3, Pd(PPh3)4, dimethylformamide (DMF)/H2O, 80 °C; (c) trifluoroacetic acid (TFA), DCM; (d) HCl 4 N in dioxane, MeOH.
Finally, the synthesized 2-aminothiophene-3-carboxylate intermediates (37a–g), and the commercially available ones (see Supporting Information (SI)), were reacted with benzoyl isothiocyanate 42 (Scheme ). The ring closure was then performed in basic conditions under reflux. The synthesis of some final compounds required additional transformations, including hydrolysis (18, 24, 25, 30, S80, and S87), reduction (27 and S79), and amide coupling (22–25, S81, and S83). Full experimental details are provided in the Supporting Information.
3. Synthesis Route for the Bicyclic Final Compounds .

a Reagents and conditions: (a) CH3CN, 45 °C; (b) EtONa, EtOH, reflux.
Conclusions
We employed a fragment-based approach for identifying ligand-efficient small molecules that occupy the m6A-RNA recognition pocket of the DF2 reader domain. A crystal structure of a previously disclosed ligand-efficient fragment binder of DF2 (PDB 7R5W) and a molecular dynamics snapshot were used for fragment docking. Each of the two docking campaigns yielded three active compounds and, thus, an overall hit rate of 13% (6/47). SAR by catalogue and the synthesis of 32 derivatives of the thioxo-dihydrothienopyridinone scaffold 14 resulted in a series of ligand-efficient, low-micromolar binders of DF2. Most members of the series are selective against DC1 and DC2, while they bind with low-micromolar affinity also to DF1 and DF3. In contrast, compound 23 displays distinct selectivity as it binds exclusively to DF2 (K d = 1.3 μM and IC50 = 11 μM measured by FP and HTRF, respectively). Using a similar fragment-based strategy, in our previous screening campaign for DF2 we identified 6-cyclopropyluracil as hit compound with an IC50 value of 170 μM. Compound 23 is more potent for DF2 than 6-cyclopropyluracil by a factor of 15, and is significantly more selective against DF1 and DF3. It is important to note that protein crystallography played a key role in both screening campaigns, as crystal structures were used for docking and molecular dynamics simulations. Moreover, in the present campaign the crystal structure of DF2 in the complex with compound 15 guided the derivatization of compound 14, which has a similar scaffold and binding mode.
A few compounds (e.g., 17 and 18) have a comparable affinity for the five YTH-containing proteins. Among them, compounds 17 and 30 are currently the most potent ligands of DC2 (K d values of 6.3 and 4.9 μM, respectively, measured by FP). We have also presented the crystal structures of DF2 in complex with six ligands, which represent six distinct chemotypes (compounds 3–6, 13, and 15). This structural data is useful for the development of a new series of ligands of the YTHDF m6A-RNA readers, and for the further training of machine learning models.
Materials and Methods
Fragment Docking and Ranking
We used force field-based docking to identify small-molecule binders of the m6A reader YTHDF2. The structure of the YTHDF2 domain used for docking is the one in the complex with the ligand 6-cyclopropyl-1H-pyrimidine-2,4-dione (PDB code: 7R5W). We prepared the protein structure for docking using CAMPARIv5. The SEED ,, docking program was used for rigid docking to the crystal structure itself and a snapshot obtained by MD simulations. These MD simulations were described in a previous study. After clustering the snapshots, a representative pose with a large aperture of the m6A binding site was selected (volume of 600 vs 324 Å3 in the crystal structure). The pocket volume was obtained by structural alignment of the crystal structure (PDB 7R5W) and the MD snapshot, and using the dpocket functionality of the fpocket tool with the ligand as reference. A library of 500,000 small molecules was considered for screening by SEED. The molecules were selected from the ZINC2020 database with the number of non-hydrogen atoms between 11 and 20, at least one ring and one sp3 carbon in the structure. For each of the extracted compounds, up to 20 conformers were generated using a distance geometry-based algorithm.
The two protein structures were kept rigid during docking and evaluation of the binding energy. The residue C433 was selected for posing the fragments by SEED, which is similar to a pharmacophoric constraint. For the evaluation of the SEED energy, the binding site consisted of all of the DF2 residues within 20 Å of C433 and all charged side chains of DF2. A structural water molecule was also considered as part of the binding site because it is consistently resolved in all of the crystal structures obtained and is involved as a hydrogen-bond acceptor and donor with the side chains of W432 and D528, respectively. , The partial charges and van der Waals parameters for the atoms in the protein and the small molecules were taken from the CHARMM36 all-atom force field − and the CHARMM general force field (CGenFF), respectively. Importantly, the CHARMM36 force field and CGenFF are fully consistent in their partial charges and van der Waals parameters. The evaluation of the binding energy in the program SEED consists of a force field-based energy function with a continuum dielectric approximation of desolvation penalties by the generalized Born model. The values of the dielectric constant were 2.0 and 78.5 for the regions of the space occupied by the solute and solvent, respectively. Fragment screening by SEED requires about 1 s per fragment. SEED is available as an open-source code from GitLab (https://gitlab.com/CaflischLab).
From both docking campaigns, the compounds were ranked according to two energy terms calculated by SEED, namely, the total binding energy (SEED total) and the difference between the electrostatic contribution to the intermolecular interaction energy in the solvent and the solvation energy of the ligand (Delec). The top-scoring compounds were then selected if they showed the crucial hydrogen bond with the backbone carbonyl of C433 and a favorable interaction with the backbone NH of Y418, the side chain of D422, and/or the conserved water molecule. Finally, 47 compounds were purchased on the basis of commercial availability and structural diversity: 25 selected from the docking performed on the crystal structure and 22 from that on the MD snapshot with a large aperture of the binding site.
HTRF Assay
GST-YTHDC1, GST-YTHDF1, GST-YTHDF2, and GST-YTHDF3 were purified as previously reported. The HTRF assay was assembled as detailed in ref with the only difference being that the starting concentration of the dose–response experiments used for the IC50 determination was variated dependently from the tested compound. The same protocol was applied to the four proteins. The competitive inhibition data of GST-YTHDF1 (single dose experiment at 100 μM compound concentration and dose–response curves), GST-YTHDF3 (single dose experiment at 100 μM compound concentration and dose–response curves), and GST-YTHDF2 (single dose experiment at 100 μM compound concentration) were normalized as described in ref to mitigate interference. The signal was measured as described in ref .
GST-YTHDC2 Production
The N-terminally GST-tagged YTH domain of YTHDC2 (residues 1285–1424, cloned into the pGEX-6P-1 vector) was overexpressed in Rosetta (DE3) cells grown overnight at 20 °C following induction with 0.4 mM isopropyl-β-d-thiogalactopyranoside (IPTG) at an OD600 of 0.8. The cells were harvested and resuspended in a lysis buffer containing 100 mM Tris–HCl (pH 8.0) and 500 mM NaCl. After cell lysis, the lysate was clarified by centrifugation at 48,000g for 1 h, and the soluble proteins were loaded onto a column packed with Glutathione Sepharose 4B (GE Healthcare), then eluted with 20 mM reduced glutathione in lysis buffer. Finally, a size-exclusion chromatography step (HiLoad 16/600 Superdex 200 pg column, GE Healthcare) was performed to further purify the protein in 20 mM Tris–HCl (pH 7.4) and 150 mM NaCl.
DNA-Fluorescence Polarization (FP) Assay
For fluorescence polarization (FP) experiments, a 5′-fluorescein-labeled m6A-DNA probe (5′-FAM-AAGAACCGG(m6A)CTAAGCT-3′) was synthesized by Microsynth AG.
The final concentration of the fluorescein-labeled DNA was kept constant at 3 nM. For YTHDC2 measurements, the protein concentration was set to 25 nM, while for YTHDF2 measurements, it was set to 10 nM, and for YTHDC1, it was set at 15 nM. The compound concentration was serially diluted to obtain the dose–response curve. The competition experiments were conducted in a final volume of 20 μL in a buffer containing 20 mM Tris–HCl (pH 7.4), 150 mM NaCl, and 0.01% bovine serum albumin (BSA), using a 384-well black flat-bottomed microplate (Corning 3575).
After incubating the mixture for 1 h, the anisotropy values were measured using a Tecan SPARK plate reader with 485/20 nm excitation and 535/25 nm emission polarization filters, suitable for fluorescein, at 25 °C. To obtain the equilibrium dissociation constant (K d), the IC50 values were first derived by fitting a dose–response curve to the data using nonlinear regression analysis in GraphPad Prism. These IC50 values were then converted to K d by using the Cheng–Prusoff equation:
IC50 is the observed value from FP assay, [L] is the probe concentration used, and K d is the binding affinity of probe for the target.
YTHDF2 Protein Crystallography and Soaking
The YTH domain of YTHDF2 was expressed, purified, crystallized, and soaked as described in ref . The X-ray diffraction experiment was performed on the X06DA beamline of Paul Scherrer Institute’s Swiss Light Source. The resulting data were analyzed as described in ref .
Molecular Dynamics Simulations and Clustering
We used molecular dynamics (MD) simulations to analyze the interactions between the YTHDF2 domain and compounds 14, 17, 23, and 27. Compound 14 (and its derivatives 17, 23 and 27) can, in principle, bind in two poses due to the symmetry of its thiourea ring. This was also observed in the docking results of compound 14. Therefore, we defined two poses, A and B, depending respectively on the orientation of the exocyclic sulfur atom pointing toward the tryptophan cage or outward. We used the ParaLig software to modify the two docked poses of compound 14 to obtain derivatives 17 and 27. The protein/ligand structures were then prepared using the software CAMPARIv5. Simulations were run using TIP3P (CHARMM) water model, with a 0.15 M concentration of Na+ and Cl– ions. We equilibrated the systems first using an NPT ensemble to reach 300 K and 1 bar under 10 kJ/(mol Å2) positional restraints. We then applied four successive 1 ns NVT equilibrations with weakening restraints of 10, 5, 2.5, and 1.25 kJ/(mol Å2), respectively. All simulations were done using the CHARMM36m force field with the July 2022 GROMACS port. Production MD simulations consisted of eight independent runs for each compound and pose and a sampling of 200 ns per run. Production simulations were performed at the Swiss Supercomputing Center (CSCS) with the support of grant s1272 using GROMACS 2021.5.
For clustering, we first subsampled the trajectories by selecting MD frames at every nanosecond of the simulation segments from 100 to 200 ns. In other words, we discarded the first half of each run for allowing the ligand to equilibrate in the binding pocket. The bound frames were selected according to the distance between the N atom of C433 and the thiourea sulfur atom and choosing frames with a distance lower than 5 Å. We applied PCA to a set of 10 protein–ligand distances to reduce the data to two dimensions. These 10 distances involved atoms of the thiourea ring and the residues C433 (four distances), D422 (two distances), and Y418, W432, W486, and W491 (one distance each). The Gaussian mixture algorithm with full covariance was employed for clustering the two-dimensional data in PC space. The MD snapshot closest to the center of the cluster (centroid) was used as a representative pose. The root-mean-square deviation (RMSD) of the ligand in the binding pocket was calculated for all of the trajectories. First, all MD snapshots were overlapped with the equilibrated starting structure using the Cα atoms of the protein. Then the coordinates of the heavy atoms of the ligand were employed for the calculation of the RMSD. All analyses were done using MDTraj and SciKit learn.
Supplementary Material
Acknowledgments
The authors thank Beat Blattmann and Görkem Kurtuldu at the Protein Crystallization Center of UZH for the assistance with the crystallization, and the beamline scientists at the Swiss Light Source at Paul Scherrer Institute for their help with the X-ray diffraction experiments. We thank Maria Paula Flores Espinoza and Thomas Frei for their technical assistance. This work was financially supported by the Swiss National Science Foundation to A.C. (Grant number 310030_212195), the Swiss Cancer Research Foundation to A.C. (Grant number KFS 5748-02-2023), the Swiss National Supercomputing Center (CSCS, project ID s1272 on Piz Daint), and the UZH Candoc Grant to A.I. (Grant number FK-24-031).
Glossary
Abbreviations
- YTHDF
YT521-B homology domain family
- m6A
N 6-methyladenosine
- YTHDC
YTH domain containing
- oligoRNA
oligoribonucleotide
- HTRF
homogeneous time-resolved fluorescence
- LE
ligand efficiency
- FP
fluorescence polarization
- PCA
principal component analysis
- RMSD
root-mean-square deviation
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsbiomedchemau.5c00099.
Supplementary figures, tables, materials, synthetic procedures, characterization data, 1H and 13C NMR spectra, HPLC traces for all final compounds, and X-ray data collection and refinement statistics for the 6 complex YTH-YTHDF2 crystal structures (PDF)
The authors declare no competing financial interest.
References
- Shi H., Wei J., He C.. Where, When, and How: Context-Dependent Functions of RNA Methylation Writers, Readers, and Erasers. Mol. Cell. 2019;74(4):640–650. doi: 10.1016/j.molcel.2019.04.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao J., Wei Y., Liang J., Wen J., Chen X., Zhang B., Chu L.. Insight into the Structure, Physiological Function, and Role in Cancer of M6A ReadersYTH Domain-Containing Proteins. Cell Death Discovery. 2022;8:137. doi: 10.1038/s41420-022-00947-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen L., Gao Y., Xu S., Yuan J., Wang M., Li T., Gong J.. N6-Methyladenosine Reader YTHDF Family in Biological Processes: Structures, Roles, and Mechanisms. Front. Immunol. 2023;14:1162607. doi: 10.3389/fimmu.2023.1162607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai X. Y., Shi L., Li Z., Yang H. Y., Wei J. F., Ding Q.. Main N6-Methyladenosine Readers: YTH Family Proteins in Cancers. Front. Oncol. 2021;11:635329. doi: 10.3389/fonc.2021.635329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiao W., Adhikari S., Dahal U., Chen Y. S., Hao Y. J., Sun B. F., Sun H. Y., Li A., Ping X. L., Lai W. Y., Wang X., Ma H. L., Huang C. M., Yang Y., Huang N., Jiang G. B., Wang H. L., Zhou Q., Wang X. J., Zhao Y. L., Yang Y. G.. Nuclear M6A Reader YTHDC1 Regulates MRNA Splicing. Mol. Cell. 2016;61(4):507–519. doi: 10.1016/j.molcel.2016.01.012. [DOI] [PubMed] [Google Scholar]
- Hsu P. J., Zhu Y., Ma H., Guo Y., Shi X., Liu Y., Qi M., Lu Z., Shi H., Wang J., Cheng Y., Luo G., Dai Q., Liu M., Guo X., Sha J., Shen B., He C.. Ythdc2 Is an N6-Methyladenosine Binding Protein That Regulates Mammalian Spermatogenesis. Cell Res. 2017;27(9):1115–1127. doi: 10.1038/cr.2017.99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kretschmer J., Rao H., Hackert P., Sloan K. E., Höbartner C., Bohnsack M. T.. The M6A Reader Protein YTHDC2 Interacts with the Small Ribosomal Subunit and the 5′–3′ Exoribonuclease XRN1. RNA. 2018;24(10):1339–1350. doi: 10.1261/rna.064238.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lasman L., Krupalnik V., Viukov S., Mor N., Aguilera-Castrejon A., Schneir D., Bayerl J., Mizrahi O., Peles S., Tawil S., Sathe S., Nachshon A., Shani T., Zerbib M., Kilimnik I., Aigner S., Shankar A., Mueller J. R., Schwartz S., Stern-Ginossar N., Yeo G. W., Geula S., Novershtern N., Hanna J. H.. Context-Dependent Functional Compensation between Ythdf M6A Reader Proteins. Genes Dev. 2020;34(19–20):1373–1391. doi: 10.1101/gad.340695.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li F., Zeng C., Liu J., Wang L., Yuan X., Yuan L., Xia X., Huang W.. The YTH Domain-Containing Protein Family: Emerging Players in Immunomodulation and Tumour Immunotherapy Targets. Clin. Transl. Med. 2024;14(8):e1784. doi: 10.1002/ctm2.1784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J., Meng S., Xu M., Wang S., He L., Xu X., Wang X., Xie L.. Downregulation of N6-Methyladenosine Binding YTHDF2 Protein Mediated by MiR-493–3p Suppresses Prostate Cancer by Elevating N6-Methyladenosine Levels. Oncotarget. 2018;9(3):3752–3764. doi: 10.18632/oncotarget.23365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Einstein J. M., Perelis M., Chaim I. A., Meena J. K., Nussbacher J. K., Tankka A. T., Yee B. A., Li H., Madrigal A. A., Neill N. J., Shankar A., Tyagi S., Westbrook T. F., Yeo G. W.. Inhibition of YTHDF2 Triggers Proteotoxic Cell Death in MYC-Driven Breast Cancer. Mol. Cell. 2021;81(15):3048.e9–3064.e9. doi: 10.1016/j.molcel.2021.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J.-y., Lu A.-q.. The Biological Function of M6A Reader YTHDF2 and Its Role in Human Disease. Cancer Cell Int. 2021;21:109. doi: 10.1186/S12935-021-01807-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nai F., Nachawati R., Zálešák F., Wang X., Li Y., Caflisch A.. Fragment Ligands of the M6A-RNA Reader YTHDF2. ACS Med. Chem. Lett. 2022;13(9):1500–1509. doi: 10.1021/acsmedchemlett.2c00303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Micaelli M., Dalle Vedove A., Cerofolini L., Vigna J., Sighel D., Zaccara S., Bonomo I., Poulentzas G., Rosatti E. F., Cazzanelli G., Alunno L., Belli R., Peroni D., Dassi E., Murakami S., Jaffrey S. R., Fragai M., Mancini I., Lolli G., Quattrone A., Provenzani A.. Small-Molecule Ebselen Binds to YTHDF Proteins Interfering with the Recognition of N6-Methyladenosine-Modified RNAs. ACS Pharmacol. Transl. Sci. 2022;5(10):872–891. doi: 10.1021/acsptsci.2c00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Madia V. N., Messore A., Saccoliti F., Tudino V., De Leo A., De Vita D., Bortolami M., Scipione L., Pindinello I., Costi R., Di Santo R.. Tegaserod for the Treatment of Irritable Bowel Syndrome. Anti-Inflammatory Anti-Allergy Agents Med. Chem. 2020;19(4):342–369. doi: 10.2174/1871523018666190911121306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zou Z., Wei J., Chen Y., Jin P., Luo C., Kang Y., Shi H., Yang F., Shi Z., Chen S., Zhou Y., Sepich-Poore C., Zhuang X., Zhou X., Jiang H., Wen Z., He C.. FMRP Phosphorylation Modulates Neuronal Translation through YTHDF1. Mol. Cell. 2023;83:4304.e8–4317.e8. doi: 10.1016/j.molcel.2023.10.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feng G., Wu Y., Hu Y., Shuai W., Yang X., Li Y., Ouyang L., Wang G.. Small Molecule Inhibitors Targeting M6A Regulators. J. Hematol. Oncol. 2024;17:30. doi: 10.1186/s13045-024-01546-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nai, F. Development of Chemical Probes for Epitranscriptomic Proteins; University of Zurich, Faculty of Science: Zurich, 2024; pp 5–9. [Google Scholar]
- Wang L., Dou X., Chen S., Yu X., Huang X., Zhang L., Chen Y., Wang J., Yang K., Bugno J., Pitroda S., Ding X., Piffko A., Si W., Chen C., Jiang H., Zhou B., Chmura S. J., Luo C., Liang H. L., He C., Weichselbaum R. R.. YTHDF2 Inhibition Potentiates Radiotherapy Antitumor Efficacy. Cancer Cell. 2023;41(7):1294.e8–1308.e8. doi: 10.1016/j.ccell.2023.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qiu X., Kemker C., Goebel G. L., Lampe P., Wallis N., Schiller D., Bigler K., Jiang M., Sievers S., Yeo G. W., Wu P.. Phenylpyrazoles as Inhibitors of the M6A RNA-Binding Protein YTHDF2. JACS Au. 2025;5(2):618–630. doi: 10.1021/jacsau.4c00754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolb P., Kipouros C. B., Huang D., Caflisch A.. Structure-Based Tailoring of Compound Libraries for High-Throughput Screening: Discovery of Novel EphB4 Kinase Inhibitors. Proteins: Struct., Funct., Bioinf. 2008;73(1):11–18. doi: 10.1002/prot.22028. [DOI] [PubMed] [Google Scholar]
- Marchand J. R., Caflisch A.. In Silico Fragment-Based Drug Design with SEED. Eur. J. Med. Chem. 2018;156:907–917. doi: 10.1016/j.ejmech.2018.07.042. [DOI] [PubMed] [Google Scholar]
- Majeux N., Scarsi M., Apostolakis J., Ehrhardt C., Caflisch A.. Exhaustive Docking of Molecular Fragments With Electrostatic Solvation. Proteins: Struct., Funct., Genet. 1999;37:88–105. doi: 10.1002/(SICI)1097-0134(19991001)37:1<88::AID-PROT9>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]
- Goossens K., Wroblowski B., Langini C., van Vlijmen H., Caflisch A., de Winter H.. Assessment of the Fragment Docking Program SEED. J. Chem. Inf. Model. 2020;60(10):4881–4893. doi: 10.1021/acs.jcim.0c00556. [DOI] [PubMed] [Google Scholar]
- Irwin J. J., Tang K. G., Young J., Dandarchuluun C., Wong B. R., Khurelbaatar M., Moroz Y. S., Mayfield J., Sayle R. A.. ZINC20 - A Free Ultralarge-Scale Chemical Database for Ligand Discovery. J. Chem. Inf. Model. 2020;60(12):6065–6073. doi: 10.1021/acs.jcim.0c00675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riniker S., Landrum G. A.. Better Informed Distance Geometry: Using What We Know to Improve Conformation Generation. J. Chem. Inf. Model. 2015;55(12):2562–2574. doi: 10.1021/acs.jcim.5b00654. [DOI] [PubMed] [Google Scholar]
- Li Y., Bedi R. K., Moroz-Omori E. V., Caflisch A.. Structural and Dynamic Insights into Redundant Function of YTHDF Proteins. J. Chem. Inf. Model. 2020;60(12):5932–5935. doi: 10.1021/acs.jcim.0c01029. [DOI] [PubMed] [Google Scholar]
- Wiedmer L., Eberle S. A., Bedi R. K., Śledź P., Caflisch A.. A Reader-Based Assay for M6A Writers and Erasers. Anal. Chem. 2019;91(4):3078–3084. doi: 10.1021/acs.analchem.8b05500. [DOI] [PubMed] [Google Scholar]
- Zálešák F., Nai F., Herok M., Bochenkova E., Bedi R. K., Li Y., Errani F., Caflisch A.. Structure-Based Design of a Potent and Selective YTHDC1 Ligand. J. Med. Chem. 2024;67(11):9516–9535. doi: 10.1021/acs.jmedchem.4c00599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sinnokrot M. O., Valeev E. F., Sherrill C. D.. Estimates of the Ab Initio Limit for π-π Interactions: The Benzene Dimer. J. Am. Chem. Soc. 2002;124(36):10887–10893. doi: 10.1021/ja025896h. [DOI] [PubMed] [Google Scholar]
- Zhao Y., Li J., Gu H., Wei D., Xu Y.-c., Fu W., Yu Z.. Conformational Preferences of π–π Stacking Between Ligand and Protein, Analysis Derived from Crystal Structure Data Geometric Preference of π–π Interaction. Interdiscip. Sci.: Comput. Life Sci. 2015;7(3):211–220. doi: 10.1007/s12539-015-0263-z. [DOI] [PubMed] [Google Scholar]
- Wang C. H., Zhou H.. Discovery of a New Inhibitor for YTH Domain-Containing M6A RNA Readers. RSC Chem. Biol. 2024;5(9):914–923. doi: 10.1039/D4CB00105B. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nai F., Flores Espinoza M. P., Invernizzi A., Vargas-Rosales P. A., Bobileva O., Herok M., Caflisch A.. Small-Molecule Inhibitors of the M7G-RNA Writer METTL1. ACS Bio Med Chem Au. 2024;4(2):100–110. doi: 10.1021/acsbiomedchemau.3c00030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Bergen L. A. H., Alonso M., Palló A., Nilsson L., De Proft F., Messens J.. Revisiting Sulfur H-Bonds in Proteins: The Example of Peroxiredoxin AhpE. Sci. Rep. 2016;6:30369. doi: 10.1038/srep30369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gregoret L. M., Rader S. D., Fletterick R. J., Cohen F. E.. Hydrogen Bonds Involving Sulfur Atoms in Proteins. Proteins: Struct., Funct., Bioinf. 1991;9(2):99–107. doi: 10.1002/prot.340090204. [DOI] [PubMed] [Google Scholar]
- Mundlapati V. R., Ghosh S., Bhattacherjee A., Tiwari P., Biswal H. S.. Critical Assessment of the Strength of Hydrogen Bonds between the Sulfur Atom of Methionine/Cysteine and Backbone Amides in Proteins. J. Phys. Chem. Lett. 2015;6(8):1385–1389. doi: 10.1021/acs.jpclett.5b00491. [DOI] [PubMed] [Google Scholar]
- Sabnis R. W., Rangnekar D. W., Sonawane N. D.. 2-Aminothiophenes by the Gewald Reaction. J. Heterocycl. Chem. 1999;36(2):333–345. doi: 10.1002/jhet.5570360203. [DOI] [Google Scholar]
- El-Mekabaty A.. Chemistry of 2-Amino-3-Carbethoxythiophene and Related Compounds. Synth. Commun. 2014;44(1):1–31. doi: 10.1080/00397911.2013.821618. [DOI] [Google Scholar]
- Dong Y., Navarathne D., Bolduc A., McGregor N., Skene W. G.. α,A′-N-Boc-Substituted Bi- and Terthiophenes: Fluorescent Precursors for Functional Materials. J. Org. Chem. 2012;77(12):5429–5433. doi: 10.1021/jo300687d. [DOI] [PubMed] [Google Scholar]
- Aurelio L., Figler H., Flynn B. L., Linden J., Scammells P. J.. 5-Substituted 2-Aminothiophenes as A1 Adenosine Receptor Allosteric Enhancers. Bioorg. Med. Chem. 2008;16(3):1319–1327. doi: 10.1016/j.bmc.2007.10.065. [DOI] [PubMed] [Google Scholar]
- Wang T., Zheng C. H., Liu S., Chen H. Z.. Synthesis and Biological Activity of a Series of New Thieno[2,3-d]Pyrimidines. Phosphorus, Sulfur Silicon Relat. Elem. 2010;185(7):1543–1549. doi: 10.1080/10426500903127565. [DOI] [Google Scholar]
- Vargas-Rosales P. A., Caflisch A.. The Physics-AI Dialogue in Drug Design. RSC Med. Chem. 2025;16(4):1499–1515. doi: 10.1039/D4MD00869C. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vitalis A., Pappu R. V.. Chapter 3 Methods for Monte Carlo Simulations of Biomacromolecules. Annu. Rep. Comput. Chem. 2009;5:49–76. doi: 10.1016/S1574-1400(09)00503-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Majeux N., Scarsi M., Caflisch A.. Efficient Electrostatic Solvation Model for Protein-Fragment Docking. Proteins: Struct., Funct., Genet. 2001;42:256–268. doi: 10.1002/1097-0134(20010201)42:2<256::AID-PROT130>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
- Le Guilloux V., Schmidtke P., Tuffery P.. Fpocket: An Open Source Platform for Ligand Pocket Detection. BMC Bioinf. 2009;10(1):168. doi: 10.1186/1471-2105-10-168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKerell A. D., Bashford D., Bellott M., Dunbrack R. L., Evanseck J. D., Field M. J., Fischer S., Gao J., Guo H., Ha S., Joseph-McCarthy D., Kuchnir L., Kuczera K., Lau F. T. K., Mattos C., Michnick S., Ngo T., Nguyen D. T., Prodhom B., Reiher W. E., Roux B., Schlenkrich M., Smith J. C., Stote R., Straub J., Watanabe M., Wiórkiewicz-Kuczera J., Yin D., Karplus M.. All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J. Phys. Chem. B. 1998;102(18):3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- MacKerell A. D., Feig M., Brooks C. L.. Improved Treatment of the Protein Backbone in Empirical Force Fields. J. Am. Chem. Soc. 2004;126(3):698–699. doi: 10.1021/ja036959e. [DOI] [PubMed] [Google Scholar]
- Best R. B., Zhu X., Shim J., Lopes P. E. M., Mittal J., Feig M., MacKerell A. D.. Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone φ, ψ and Side-Chain X1 and X2 Dihedral Angles. J. Chem. Theory Comput. 2012;8(9):3257–3273. doi: 10.1021/ct300400x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vanommeslaeghe K., Hatcher E., Acharya C., Kundu S., Zhong S., Shim J., Darian E., Guvench O., Lopes P., Vorobyov I., Mackerell A. D.. CHARMM General Force Field: A Force Field for Drug-like Molecules Compatible with the CHARMM All-Atom Additive Biological Force Fields. J. Comput. Chem. 2010;31(4):671–690. doi: 10.1002/jcc.21367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scarsi M., Apostolakis J., Caflisch A.. Continuum Electrostatic Energies of Macromolecules in Aqueous Solutions. J. Phys. Chem. A. 1997;101(43):8098–8106. doi: 10.1021/jp9714227. [DOI] [Google Scholar]
- Vitalis A., Parker D., Radler F., Vargas-Rosales P. A., Zhang Y., Marchand J.-R., Caflisch A.. Parallelized Tools for the Preparation and Curation of Large Libraries for Virtual Screening. J. Comput. Chem. 2025;46(7):e70074. doi: 10.1002/jcc.70074. [DOI] [PubMed] [Google Scholar]
- Boonstra S., Onck P. R., Van Der Giessen E.. CHARMM TIP3P Water Model Suppresses Peptide Folding by Solvating the Unfolded State. J. Phys. Chem. B. 2016;120(15):3692–3698. doi: 10.1021/acs.jpcb.6b01316. [DOI] [PubMed] [Google Scholar]
- Huang J., Rauscher S., Nawrocki G., Ran T., Feig M., De Groot B. L., Grubmüller H., MacKerell A. D.. CHARMM36m: An Improved Force Field for Folded and Intrinsically Disordered Proteins. Nat. Methods. 2017;14(1):71–73. doi: 10.1038/nmeth.4067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abraham M. J., Murtola T., Schulz R., Páll S., Smith J. C., Hess B., Lindah E.. GROMACS: High Performance Molecular Simulations through Multi-Level Parallelism from Laptops to Supercomputers. SoftwareX. 2015;1–2:19–25. doi: 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
- McGibbon R. T., Beauchamp K. A., Harrigan M. P., Klein C., Swails J. M., Hernández C. X., Schwantes C. R., Wang L. P., Lane T. J., Pande V. S.. MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J. 2015;109(8):1528–1532. doi: 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., Vanderplas J., Passos A., Cournapeau D., Brucher M., Perrot M., Duchesnay É.. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011;12(85):2825–2830. doi: 10.5555/1953048.2078195. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







