Abstract
The nuclear receptor Liver Receptor Homolog-1 (LRH-1, NR5A2) is a ligand-regulated transcription factor and validated drug target for several human diseases. LRH-1 activation is regulated by small molecule ligands, which bind to the ligand binding domain (LBD) within the full-length LRH-1. We recently identified 57 compounds that bind LRH-1, and unexpectedly found these compounds regulated either the isolated LBD, or the full-length LRH-1 in cells, with little overlap. Here, we correlated compound binding energy from a single rigid-body scoring function with full-length LRH-1 activity in cells. Although docking scores of the 57 hit compounds did not correlate with LRH-1 regulation in wet lab assays, a subset of the compounds had large differences in binding energy docked to the isolated LBD vs. full-length LRH-1, which we used to empirically derive a new metric of the docking scores we call "ΔΔG". Initial regressions, correlations and contingency analyses all suggest compounds with high ΔΔG values more frequently regulated LRH-1 in wet lab assays. We then docked all 57 compounds to 18 separate crystal structures of LRH-1 to obtain averaged ΔΔG values for each compound, which robustly and reproducibly associated with full-length LRH-1 activity in cells. Network analyses on the 18 crystal structures of LRH-1 suggest unique communication paths exist between the subsets of LRH-1 crystal structures that produced high vs. low ΔΔG values, identifying a structural relationship between ΔΔG and the position of Helix 6, a previously established regulatory helix important for LRH-1 regulation. Together, these data suggest rigid-body computational docking can be used to quickly calculate ΔΔG, which positively correlated with the ability of these 57 hit compounds to regulate full-length LRH-1 in cell-based assays. We propose ΔΔG as a novel computational tool that can be applied to LRH-1 drug screens to prioritize compounds for resource-intense secondary screening.
Keywords: Prioritize hit compounds, Nuclear receptor compound docking
Graphical Abstract
1. Introduction
Prioritizing hit compounds identified from a primary compound screen is important for drug development [1], [2], [3], [4], [5], [6] as follow-up secondary assays are resource-intensive, particularly cell-based assays of target protein function [7], [8]. Cell-based assays monitor compound activity in a physiologically relevant environment, increasing confidence in results [7], [9], [10], [11] while retaining compatibility with higher-throughput screening formats [12]. The choice of secondary assay must therefore strike a balance between testing fewer, high-priority compounds in a higher-confidence assay, or more compounds in a lower-confidence assay, since resources not unlimited [13]. Computational docking has been used to help prioritize compounds [14], which can improve the hit rate from secondary assays [2], [3], [4], [5], [6], [15]. However, little wet-lab data are available validating direct relationships between docking scores and functional regulation in cells [16], [17], [18], particularly for non-enzyme allosterically-regulated target proteins [19], [20]. Still, the speed, accessibility and low cost of computational docking provides an attractive way to prioritize compounds [21], [22]. Thus, there is a need for docking approaches that have been validated with wet-lab data, correlating docking scores with compound activity in cell-based assays [2], [3], [4], [23], [24] particularly for allosterically-regulated targets [5], [20], [22]. Here, we discovered and validated one such approach, retrospectively correlating published cell-based functional data [1] with new rigid-body docking of compounds to Liver Receptor Homolog-1 (LRH-1, NR5A2), an allosterically regulated nuclear receptor and drug target for several human diseases.
The nuclear receptor superfamily is a group of DNA-binding transcription factors regulated by hydrophobic ligands such as cholesterol-based steroids [25], fatty acids [26], heme-based metabolites [27] and phospholipids [28], [29]. LRH-1 (NR5A2) is a nuclear receptor expressed ubiquitously in humans [30], [31], but is particularly important for adult function in the liver [32], [33], [34], [35], [36], [37], pancreas [38], [39] and gut [40], [41], [42]. A wide variety of physiological studies have provided pre-clinical validation of LRH-1 as a drug target in diabetes and non-alcoholic fatty liver disease (NAFLD) [33], [36], pancreatic cancer [43], [44], [45] and ulcerative colitis [42], all diseases with unmet clinical need. Although there has been great recent progress in the development of LRH-1 regulatory compounds [42], [46], [47], [48)], [49], no FDA-approved drugs currently target LRH-1, so there remains need for novel approaches that can hasten LRH-1 drug development.
Nuclear receptor ligands bind to a canonical cleft in the ligand binding domain (LBD), which alters the conformation of the LBD, which permits interaction with a variety of transcriptional co-regulator proteins [50)], [51], [52]. This principle applies to LRH-1 and the highly homologous SF-1 nuclear receptors, which are both regulated through this canonical mechanism by several ligands, including natural phospholipids [37], [42], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62] and synthetic small molecules [37], [42], [46], [47], [48)], [49], [63)], [64]. These regulatory ligands all bind LRH-1 with 1:1 stoichiometry, as shown in many different crystal structures of human LRH-1 in the protein data bank (PDB). The allosteric mechanism of ligand-induced interaction with coregulator proteins is utilized in LRH-1 drug screening platforms such as AlphaScreen [65], in which compounds induce a measured interaction between LRH-1 and a small peptide that represents a transcriptional coregulator protein [66], [67], [68], such as TIF2 [69] or PGC1α [70]. These screens, however, can only identify compounds that regulate LRH-1 through the canonical coregulator-recruitment mechanism, and are not able to identify any compounds that bind and regulate LRH-1 through non-canonical mechanisms.
To address this, we executed an LRH-1 compound screen that identified 58 new compounds which simply bind directly to LRH-1 from the 2322 compound Spectrum Discovery library [1]. This FRET-based screen measured the ability of each compound to compete with a probe installed in the canonical ligand binding site of LRH-1, suggesting these compounds also bind LRH-1 at the canonical ligand binding site. The hit compounds were subjected to several secondary assays monitoring functional regulation of LRH-1, which showed 14 of the 58 compounds regulated either 1) the isolated LBD in a coregulator binding assay or 2) full-length LRH-1 in a luciferase reporter assay driven by the CYP8B1 promoter or 3) the CYP17A1 promoter, both established LRH-1 target promoters in cells. Surprisingly and against the standard dogma of nuclear receptor regulation, there was almost no overlap in these compounds. The compounds either regulated coregulator binding to the LBD, or the compounds regulated full-length LRH-1 assays in cells, with no overlap, even though these compounds were all identified based on their ability to directly bind pure, recombinant LRH-1 [1].
Here, we examined the interaction of these compounds with LRH-1 by applying PyRx computational rigid docking [71] to predict relative docked binding energies to a total of 19 structural models of LRH-1 (18 crystal structures of the isolated LBD and 1 integrative model of the full-length LRH-1 [72]). We then correlated those docking results retrospectively with published cell-based functional data from our previous study (439 total functional assay measurements) [1]. We found that docking to the full-length model of LRH-1 [72] predicts different binding positions for compounds active on full-length LRH-1, which were closer to the entrance of the ligand binding pocket and Helix 6, an important helix that helps mediate ligand activation of LRH-1 37,64,69,70,73–75. The scores of compounds docked to full-length LRH-1 or the isolated LBD did not correlate with LRH-1 regulation in wet lab assays, however several compounds had large differences in docking scores (∆G) to the full-length vs. isolated LBD of LRH-1, an arbitrary metric we call ∆∆G. We found ∆∆G positively associated with the ability of our set of 58 hit compounds to regulate full-length LRH-1 in cell-based assays. Network analyses suggest the position of Helix 6 as a structural element that may contribute to the relationship between ∆∆G and ligand-mediated LRH-1 regulation in cells, consistent with analyses by independent groups [37], [64], [69], [70], [73], [74], [75]. The data presented here suggest a role for Helix 6 in ligand-mediated regulation of LRH-1, similar to other studies which have applied comparative crystallography, MD simulations and network analyses [37], [64], [69], [70], [73], [74], [75]. The ∆∆G metric can help prioritize hit compounds for expensive secondary screening, which can hasten LRH-1 compound development. ∆∆G can be rapidly generated even for large hit lists, as minimal resources are required for rigid-body PyRx-based docking, highlighting practical utility.
2. Results
Compounds which functionally regulated the isolated LBD, did not regulate full-length LRH-1 in cells. We previously identified 58 compounds that bind directly to the ligand binding domain (LBD) of LRH-1 (Fig. 1A) using a FRET-based competition screen [1] (Fig. 1B). We then applied three independent high-throughput assays to test these 58 compounds for functional regulation of either the purified recombinant LBD in a coregulator binding screen (98 independent assay measurements) or full-length LRH-1 regulation in cells (341 independent assay measurements), for a total 439 independently measured events of compound-induced regulation of LRH-1 function [1] (Supplemental Spreadsheet 1). Regulation of the LBD was determined by fluorescence anisotropy (Fig. 1C), which measured compound-induced interaction between the isolated LBD of LRH-1 and a peptide representing a known transcriptional coregulator of LRH-1 [70]. Full-length LRH-1 regulation in cells was determined by measuring compound-induced LRH-1 activation of a luciferase reporter gene, driven by one of two well-established LRH-1 promoter DNA sequences (CYP8B1 or CYP17A1) in HEK293T cells (Fig. 1D). We showed of the 58 compounds identified to bind LRH-1, 14 compounds regulated LRH-1 function in at least one of these three assays [1] (Dunnett's corrected padj<0.05, Supplemental Spreadsheet 2). Here, we applied principal component analysis (Fig. 1E) of the equally scaled log2 fold-change data from all 439 wet lab assay measurements, suggesting distinct patterns of regulation for the isolated LBD vs. full-length LRH-1 assays, despite all compounds were shown to directly bind to purified LRH-1 [1]. Contingency analyses suggest compounds that significantly regulated the isolated LBD, less frequently regulated full-length LRH-1 in cells (Fig. 1F, 90 % vs. 7 %, p < 0.0001 by Fisher’s exact), and compounds that regulated full-length LRH-1 in cells less frequently regulated the isolated LBD (Fig. 1G, 68 % vs. 8 %, p = 0.0005 by Fisher’s exact). Together these data suggest compounds that regulated the isolated LBD did not regulate full-length LRH-1 in cells, despite that all the compounds were identified based on their ability to directly bind LRH-1 (Fig. 1B). We next used computational docking to determine if there were any differences in the docking scores (docked binding energies) of these compounds to the isolated LBD vs. full-length models of LRH-1.
Fig. 1.
Compounds identified to bind LRH-1 LBD from a previous wet-lab screen regulated either the ligand-binding domain (LBD) or full-length LRH-1 in cells, with little overlap.A. Previously published screening strategy highlighting B. primary screen that identified 58 compounds based on their ability to compete with a FRET probe installed at the canonical ligand binding site of the isolated ligand binding domain (LBD) of human LRH-1. C. Schematic of isolated LBD assay that measured compound-induced interaction between PGC1α coregulator peptide and the isolated LBD of LRH-1. D. Schematic of full-length LRH-1 luciferase reporters in HEK293T cells using either the CYP17A1 or CYP8B1 promoters. E. Biplot of principal component analysis of all 439 wet-lab assayed events (assays from panels C-D) induced by all 58 LRH-1 hit compounds, suggesting the isolated LBD assay vs. full-length LRH-1 assays do not cluster together. F. Contingency analysis of the frequency of LRH-1 regulation induced by all compounds that regulated the isolated LBD assay (5 compounds, 40 assays), showing LBD-regulating compounds less frequently regulated full-length LRH-1 in cells (90 % vs. 7 %, p < 0.0001 by Fisher’s exact test), regulation defined as padj< 0.05 Dunnett's corrected and log2 fold change at least ± 0.5. G. Contingency of LRH-1 regulation by the compounds that regulated full-length LRH-1 in cells (9 compounds, 53 assays), showing full-length regulating compounds less frequently regulated the isolated LBD assay (68 % vs. 8 %, p = 0.0005 by Fisher’s exact). These data suggest that even though all these compounds were identified based on their ability to bind the isolated LBD, compounds that regulated the isolated LBD did not regulate the full-length LRH-1 in cells as frequently, and vice versa.
Rigid-body compound docking to full-length LRH-1 vs. the isolated LBD yields divergent binding positions and binding energies. Our previous study docked all 2322 compounds to a crystal structure of the isolated ligand binding domain (LBD) of LRH-1 (PDB:6OQX), which showed no correlation between the docking scores and regulation of LRH-1 [1]. Here, we used the same PyRx rigid-body docking approach, but now examined docking to the full-length LRH-1 model (PDB_DEV: 00000035, Fig. 2A, Supplemental Spreadsheet 3). Importantly, the model of full-length LRH-1 is a solution-based, integrated structure that was extensively computationally optimized by Rosetta and several other approaches [72]. We therefore expected ligand docking to this model to result in lower binding energies when compared to a crystallographically restrained model of the isolated LBD. Indeed, we observed significantly lower docked binding energies for compounds docked to full-length LRH-1 vs. the isolated LBD (Fig. 2B). The docked positions of the compounds were also different (Fig. 2C), specifically the 5 compounds that regulated the isolated LBD (Fig. 1C) docked within the well-established, canonical ligand binding site, deep in the core of the LRH-1 protein (Fig. 2D). However, some of the 9 compounds that regulated full-length LRH-1 in cells (Fig. 1D) docked to the full-length LRH-1 at the entrance to the canonical ligand biding site, clustered around Helix 6 (Fig. 2E). Our previous work suggests all 9 of these compounds (Fig. 2E) bind directly to the isolated LRH-1 LBD, as well as regulate full-length LRH-1 function in cells [1]. Further, we determined saturable binding constants for two compounds that docked at the entrance of the ligand binding site near Helix 6, VU0243218 (IC50 =9.4 μM, 95 %CI 8.1–10.8 μM) and VU0656021 (IC50 =27.0 μM, 95 %CI 16.1–171.5 μM), again suggesting direct interactions of these compounds with the LRH-1 ligand binding domain (Fig. 2E). Full transcriptomes induced by 10 μM VU0243218 showed selective regulation of endogenous ChIP-seq target genes of LRH-1, and a specific chemical competitor of LRH-1 attenuated the gene expression response of endogenous LRH-1 target genes to VU0243218 [1]. This level of validation could not be applied to all hit compounds from the screen, however the data suggest these compounds indeed bind and regulate LRH-1, despite docking at the entrance to the ligand binding pocket. No crystal structures have been solved for any of these recently identified LRH-1 compounds, thus the precise binding mode for the compounds awaits those studies. These data suggest ligands that bind and regulate the isolated LBD vs. full-length LRH-1 in cells have different rigid-body docking to the isolated LBD vs. full-length models of human LRH-1. Since these are the first computational ligand docking studies to the full-length LRH-1 [72], we next sought to determine if docked binding energy to full-length LRH-1 correlated with the ability of compounds to regulate LRH-1.
Fig. 2.
The 58 hit compounds computationally docked to full-length LRH-1 and the isolated LBD differently. A. Schematic showing differences in isolated LBD vs. full length LRH-1 used for PyRx rigid docking. B. Plot of docking scores (relative binding energies, kcal/mol) of the 58 hit compounds docked to the isolated LBD (PDB:6OQX, teal) vs. full-length LRH-1 (PDB_DEV: 00000035, pink), showing lower binding energy of compounds to full-length LRH-1, * ** p = 0.0003 by two-tailed paired t-test. C. Close up of the different docked positions of all 58 hit compounds docked to the isolated LBD (PDB:6OQX, teal) vs. full-length LRH-1 (pink), with the protein removed for clarity. D. Docked positions in LRH-1 LBD (PDB:6OQX) of only the 5 hit compounds that regulated the isolated LBD, co-crystalized NCOA2 peptide (gold) is shown for orientation. E. Docked positions of only the 9 hit compounds that regulated the full-length LRH-1 in cells, DNA oligo shown in gray, Zinc atoms purple spheres, PGC1α peptide shown in gold. These data suggest compounds dock to the isolated LBD vs. full-length LRH-1 models slightly differently.
Compound ∆∆G associates with the ability of compounds to regulate full-length LRH-1 in cells. We previously established that the relative docking scores of compounds to the isolated ligand binding domain of LRH-1 (PDB:6OQX) did not correlate with regulation of full-length LRH-1 in cells [1]. However, given the different docking scores we observed (Fig. 2B), we asked if docked binding energy to full-length LRH-1 associated with the ability of compounds to regulate full-length LRH-1 in cells. Since the hit compounds both activated and inhibited LRH-1 in the wet lab assays, the data contain both negative (inhibitory) and positive (activating) log2 fold change values for each compound, compared to DMSO controls (Supplemental Spreadsheet 1). Thus, we converted these continuous variables to discrete variables by applying log2 fold change cutoffs to define discrete “regulated events” vs. “not regulated events” and treated each wet lab measurement of LRH-1 function (each well in the high throughput assays) as an independent event. We analyzed the continuous data (Fig. 6) and present these discrete analyses first. Plotting the binding energy of each compound (∆G) docked to either the isolated LBD of LRH-1 (PDB:6OQX, Fig. 3A) or full-length LRH-1 (Fig. 3B) as a function of the discrete number of wet-lab events regulated by each compound, and using a very inclusive cutoff for regulation ( ± 0.25 log2fold change vs. DMSO control) failed to show a correlation by Spearman rank, or a non-zero slope of a linear regression by F-test, for either the isolated LBD (p = 0.48) or full-length LRH-1 (p = 0.52). These data suggest the relative docked binding energy of compounds to full-length LRH-1 did not correlate with the ability of that compound to regulate LRH-1 in the assays examined here. However, we noted some compounds had large differences in binding energy to the isolated LBD vs. full-length LRH-1, and those compounds appeared to more frequently regulate full-length LRH-1 in the wet lab assays. Thus, we empirically derived the heuristic ∆∆G metric, which is simply the absolute value of the PyRx docking score (relative docked binding energy for each compound) to the full-length LRH-1 (∆GFL), less the absolute value of the ∆G for each compound binding to the isolated LBD of LRH-1 (∆GLBD, Fig. 3C). We plotted ∆∆G against the identical wet-lab functional data, which showed a slight upward trend to the regression when the 6OQX crystal structure was used to calculate ∆∆G6OQX, however the slope was not significantly non-zero (Fig. 3D). We tested this further by docking to a different crystal structure of the LBD (PDB:1YOK) to derive the ∆∆G metric, which produced a significant Spearman rank correlation (r = 0.291, * p = 0.014) and a slightly non-zero slope to a linear regression (* p = 0.043 by F-test). We excluded one compound from these analyses (VU0656093), as it was the only compound to produce positive (unfavorable) binding energies docked to PDB:1YOK (+15.3 kcal/mol), full-length LRH-1 (+26.6 kcal/mol), and PDB:6OQX (+44.1 kcal/mol), more than 11-fold higher than the next highest binding energy compound (Supplemental Spreadsheet 3). Thus, this compound was excluded and only 57 compounds were analyzed for the remainder of this study. These initial data suggested a weak, but perhaps non-zero association might exist between ∆∆G1YOK full-length LRH-1 regulation in cell-based assays. Since any computational approach that could improve cell-based secondary screens would have great practical value in larger LRH-1 drug screens in both large pharmaceutical companies and within academia, we followed up on this initial observation.
Fig. 6.
Compounds with high ∆∆G values more frequently regulate of full-length LRH-1 in cells.A. Contingency analysis of the frequency of full-length LRH-1 regulation in cells induced by compounds in the top 10th percentile (Top 10 %) of all 57 ∆∆G values vs. all other compounds (Remainder), examining only the 341 full-length LRH-1 assayed events in cells (isolated LBD assay results were excluded), percentages indicate the percentage of regulated (pink) events within that group of compounds, at indicated L2FC cutoffs ( ± 1.0, ± 0.5, or ± 0.25 L2FC). These data suggest high ∆∆G values associated with a compound’s ability to regulate full-length LRH-1 in cells, also supported by further analyses in supplemental data. B. The same contingency analysis as in A, but replacing ∆∆G with the simple binding energy to full-length LRH-1, comparing compounds in the top 10th percentile of lowest docked binding energies (top 10 % or the best 10 % binding energies) to full-length LRH-1 vs. all other compounds (Remainder), suggesting compounds with lower binding energies did not more frequently regulate full-length LRH-1 in cells, at all L2FC cutoffs tested, also supported by further analyses in supplemental data. These data suggest compounds with higher ∆∆G values more frequently regulated full-length LRH-1 in cells.
Fig. 3.
Relative binding energy (∆G) of compounds docked to either the ligand binding domain or full-length LRH-1 did not associate with LRH-1 regulation in cells.A. Plot of docked binding energy of 57 compounds to the isolated ligand binding domain (LBD, PDB:6OQX) vs. the number of wet lab assay events regulated by each compound (full-length LRH-1 regulation of the CYP17A1 and CYP7B2 luciferase promoters), with each of the wells in the high-throughput assays treated as an individual event (replicates not averaged) and regulation defined as meeting ± 0.25 log2 fold change cutoff vs. DMSO control. Solid line indicates linear regression for all points, shaded area is 95 % confidence of regression, slope of the regression and F-test p value for non-zero slope indicated, with no correlation by Spearman rank correlation. B. Identical as A, but plotting docked binding energy of compounds to full-length LRH-1 (PDB_DEV: 00000035). C. ∆∆G is an arbitrary metric based on the difference in docked binding energy of a compound to the full-length LRH-1 (∆GFL) vs. the isolated LBD (∆GLBD). D. Plot of ∆∆G calculated using PDB:6OQX (∆∆G6OQX) vs. the number of wet lab assay events regulated by each compound (full-length LRH-1 regulated events from the CYP17A1 and CYP8B1 luciferase assays). E. Identical as D, but ∆∆G calculated using docking scores to LRH-1 structure 1YOK (∆∆G1YOK), showing slightly non-zero slope and significant one-tailed Spearman correlation, calculated in Prism. These data suggest ∆∆G values of compounds could correlate with wet lab activity on LRH-1.
Averaged ∆∆G across 18 crystal structures of LRH-1 associates with the ability of a compound to regulate LRH-1 in all wet-lab assays. We docked the 57 hit compounds to 18 crystal structures of the human LRH-1 ligand binding domain (see methods for the list of all 18 structures from the protein data bank, PDB) and calculated ∆∆G values for all 57 compounds to each of the 18 structures of LRH-1, which generated a matrix of 1026 ∆∆G values (Supplemental Sheet 4). We averaged the 18 ∆∆G values for each compound (one ∆∆G value for each of the 18 crystal structures) and plotted this “averaged ∆∆G” (referred to as ∆∆G for the remainder of this manuscript) for each compound as a function of the number of wet-lab events regulated by that compound. Spearman correlations were statistically significant, and slopes of linear regressions were non-zero at ± 0.25 L2FC (Fig. 4A), ± 0.5 L2FC (Fig. 4B) or ± 1.0 L2FC (Fig. 4C) cutoffs, again suggesting an association between ∆∆G and the ability of a compound to regulate LRH-1 function in wet-lab assays. Contingency analyses further suggested compounds with higher ∆∆G values were significantly more likely to functionally regulate LRH-1 (Fig. 5A-F). Specifically, compounds with a ∆∆G value higher than 1.0 kcal/mol regulated LRH-1 in 23 % of all wet lab measured events, whereas compounds with ∆∆G below 1.0 kcal/mol regulated LRH-1 in only 9 % of all wet lab events (Fig. 5A, p = 0.0001 by Fisher's exact test). Compounds in the highest 10th percentile of ∆∆G values regulated LRH-1 in 30 % of wet lab events, while all remaining compounds regulated LRH-1 in only 11 % of wet lab events (Fig. 5B, p = 0.0014). Compounds in the highest quartile of ∆∆G values regulated LRH-1 in 24 % of wet lab events, all remaining compounds regulated LRH-1 in only 9 % of wet lab events (Fig. 5C, p = 0.0002). The highest 3 quartiles of ∆∆G values also selectively regulated LRH-1 in 16 % of wet lab events, while all remaining compounds regulated LRH-1 in only 4 % of wet lab events (Fig. 5B, p = 0.0014). This association was significant if a more rigorous cutoff of ± 1.0 log2 fold change was used (Fig. 5E-F). Together these data suggest that among these 57 hit compounds, compounds with higher values of ∆∆G more frequently associated with regulation of LRH-1 in the wet lab.
Fig. 4.
Linear regression of ∆∆G averaged across all 18 crystal structures of LRH-1 plotted vs. the number of compound-regulated LRH-1 assay events has a non-zero slope.A. 57 of 58 hit compounds (one compound excluded due to high positive binding energies docked to several LRH-1 models, see methods) were docked to 18 crystal structures of the human LRH-1 LBD and ∆∆G values calculated, producing a matrix of 1102 ∆∆G values, provided in supplemental data. The average of the 18 ∆∆G values (one value for each of the 18 crystal structures) for each of the 57 compounds were plotted as a function of the number of all LRH-1 wet-lab regulated events induced by that compound, with regulation discretely defined as at least ± 0.25 Log2 fold change from DMSO control, B. ± 0.50 Log2FC or C. ± 1.0 Log2FC. For all plots solid line indicates linear regression for all points, shaded area is 95 % confidence interval of regression, slope and p value from an F-test for a non-zero value of the slope, r and p values for one-tailed Spearman correlation indicated. Spearman correlations were one-tailed as positive values of initial linear regressions suggested a positive correlation, making a negative correlation unlikely. These data suggest compounds with high averaged ∆∆G might associate with LRH-1 activity.
Fig. 5.
Compounds with high ∆∆G values more frequently regulated LRH-1 in wet lab assays. Contingency analyses of the frequency of LRH-1 regulation induced by all 57 compounds in all 439 wet lab assayed events (y-axis), with each well in the high-throughput assays treated as an individual event. Since compounds both inhibited and activated LRH-1, regulation in each well was defined as meeting the indicated Log2 fold change cutoff (L2FC) compared to DMSO control. Pink section of bars is the number of events that met the L2FC cutoff (percentages indicate percentage of regulated events), white section of bars is the number of events that did not meet L2FC cutoff. A. Contingency analysis of the frequency of LRH-1 regulation in all 439 assays, comparing compounds with ∆∆G values lower than 1.0 (∆∆G<1.0) vs. higher than 1.0 (∆∆G>1.0), showing compounds with higher ∆∆G values more frequently regulated LRH-1 (16 % vs. 7 %) by Fisher’s exact test (p = 0.0001). B. Contingency comparing compounds with ∆∆G values in the top 10th percentile (Top 10 %) of all 57 ∆∆G values vs. all other compounds (Remainder), showing compounds with higher ∆∆G values more frequently regulated LRH-1 (30 % vs. 11 %, p = 0.0014 by Fisher’s exact). C. Contingency showing compounds in the highest quartile (Top 25 %) of ∆∆G values more frequently regulated LRH-1 (p = 0.0265 by Fisher’s exact). D. Contingency showing compounds in the highest 3 quartiles (Top 75 %) of ∆∆G values more frequently regulated LRH-1 (p = 0.0004). E. Contingency showing compounds in the Top 25 % of ∆∆G values more frequently regulated LRH-1 at a more exclusive L2FC cutoff ( ± 1.0 L2FC, p = 0.0012). F. Contingency showing compounds in the Top 75 % of ∆∆G values more frequently regulated LRH-1 at a more exclusive ± 1.0 L2FC cutoff (p = 0.0007). These data suggest ∆∆G positively associated with the ability of compounds to regulate LRH-1 in the wet lab assays we measured.
Higher ∆∆G compounds associated with full-length LRH-1 regulation in cells. We next asked if ∆∆G for each compound would associate with regulation of full-length LRH-1 in cells. Indeed, plotting ∆∆G for each compound as a function of only the full-length LRH-1 cell-based assay data showed a significantly non-zero slope to the linear regression at all cutoffs tested (Fig S2A). When ∆∆G was plotted as a function of only the isolated LBD assays the slope of the linear regression failed to be significantly non-zero at all cutoffs tested and was not significantly correlated by Spearman (Fig S2B). Contingency analyses of the full-length LRH-1 assays (Fig. 6) suggests compounds in the highest 10th percentile of ∆∆G values regulated LRH-1 in 25 % of full-length LRH-1 cell-based assay events, all remaining compounds regulated full-length LRH-1 in only 3 % of events (Fig. 6A, p < 0.0001). The increased frequency of full-length LRH-1 regulation in cells held if more inclusive cutoffs of ± 0.5 L2FC (p = 0.0001) or ± 0.25 L2FC (p < 0.0007) were used (Fig. 6A). Compounds in the top quartile of ∆∆G values using the more exclusive ± 1.0 L2FC cutoff again showed higher ∆∆G values associated with more frequent regulation of full-length LRH-1 in cells (Fig S3A, p = 0.0002), although more inclusive L2FC cutoffs of ± 0.5 and ± 0.25 did not reach significance (Fig S3A). Compounds in the top 3 quartiles of ∆∆G values also more frequently regulated full-length LRH-1 in cells, regardless of L2FC cutoff applied (Fig S3B). We then tested if simply the docking score (relative docked binding energy) to full-length LRH-1 (∆GFL) would associate with a compound’s ability to regulate full-length LRH-1 in cells. Contingency analyses using the same full-length LRH-1 cell-based assay data found no association between compound binding energy to full-length LRH-1 (∆GFL) and the ability of the 57 compounds to regulate LRH-1 in cells (Fig. 6B), regardless of L2FC cutoff (Fig S4A) or how the compounds were grouped (Fig S4B). We also confirmed the 57 compounds had no preference to regulate either the isolated LBD or full-length LRH-1 assays in cells when the compounds were considered together, e.g., the 57 compounds together regulated 6 % of isolated LBD assays and 6 % of all full-length LRH-1 assays in cells (Fig S5). Together, these data suggest the group of compounds with higher ∆∆G values more frequently regulated full-length LRH-1 in cells.
Applying continuous data suggests ∆∆G associates with the ability of compounds to regulate full-length LRH-1 in cells. Our analyses up to this point converted the positive and negative log2 fold change continuous data to discrete variables that could be counted, for simplicity. We now plotted the continuous values of ∆∆G for each compound as a function of the absolute value of the log2 fold changes vs. DMSO control from each of the three wet-lab assays, absolute values were used to account for the equally scaled activation (positive L2FC) and repression (negative L2FC) by various compounds in the wet lab assays. Spearman rank correlation was significant for the isolated LBD coregulator peptide anisotropy assay (r = 0.373, ** p = 0.0042) while the slope of linear regression failed to be non-zero by F-test (p = 0.073, Fig. 7A). All assays on full-length LRH-1 suggested a significant relationship in both the CYP17A1-luciferase data (F-test, ** p = 0.0072, Spearman rank r = 0.252, * p = 0.291, Fig. 7B) and the CYP8B1-luciferase data (F-test * p = 0.018, Spearman r = 0.224, * p = 0.043, Fig. 7C). Analyzing all full-length LRH-1 assays in cells showed a positive, non-zero slope to the linear regression (F-test *** p = 0.0005) and a significant Spearman correlation (r = 0.399, ** p = 0.0010, Fig. 7D). The positive correlation is being driven by the top 10 of 58 compounds with the highest ∆∆G values, of course during a compound screen these “outlier” compounds with highest activity that are of most interest. To determine if PyRx docking would generate reproducible docking scores (∆G), as well as ∆∆G values that reproducibly correlate with the cell-based assays, we executed duplicate PyRx docking runs to all 19 structures (Run 2) and directly compared these two independent runs (Fig. 8). The duplicate runs produced indistinguishable binding scores (Fig. 8A), ∆∆G values (Fig. 8B) and correlations with the continuous wet lab data for all cell-based assays (Fig. 8C-E). These data suggest the PyRx docking scores and ∆∆G reproducibly correlated with regulation of full-length LRH-1 in cell-based assays, between two independent docking runs. We next began to explore these crystal structures more deeply to identify the structural feature being sensed by rigid body docking.
Fig. 7.
Continuous values of full-length LRH-1 assays positively associate with ∆∆G. Previous analyses converted assay data from the 57 compounds to discrete values since the compounds induced both positive (activation) and negative (repression) log2 fold changes to LRH-1 regulation. Here we plotted ∆∆G values as a function of the continuous absolute values of the log2 fold change (compared to DMSO control) induced by each compound in each indicated assay: A. the co-regulator peptide binding assay using the isolated LBD, B. the full-length LRH-1 assay in HEK cells using CYP17A1-luciferase reporter or C. the full-length LRH-1 assay in HEK cells using the CYP8B1-luciferase reporter or D. all full-length LRH-1 luciferase assays in HEK cells combined. For all plots, solid line indicates linear regression, shaded area is 95 % confidence interval, slope and p value indicated from F-test for non-zero value of the slope, and the one-tailed Spearman r and p values also indicated, calculated in Prism. These data suggest a positive association exists between ∆∆G of a compound and the ability of that compound to regulate LRH-1.
Fig. 8.
Duplicate PyRx docking runs produced reproducible docking scores (∆G), ∆∆G values and correlations with LRH-1 regulation.A. Comparison of PyRx docking scores (∆G values) from duplicate runs (Run 1 vs. Run 2) were compared, showing no significant difference by paired t-tests of each of the 57 compounds, each point represents the averaged ∆G from 18 crystal structures of LRH-1. B. Comparison of ∆∆G values from duplicate PyRx runs (Run 1 vs. Run 2), again showing no difference in ∆∆G by paired t-tests, suggesting duplicate PyRx runs produced reproducible ∆∆G values, each point represents averaged ∆∆G across 18 crystal structures of LRH-1 for each compound. C. ∆∆G values from Run 1 and Run 2 plotted as a function of the absolute value of the log2 fold change (compared to DMSO control) of LRH-1 regulation induced by each compound in the CYP17A1 cell-based activity assay, showing significant correlation by non-zero linear regression slope and by one-tailed Spearman correlation. D. Same as in C. but ∆∆G plotted as a function of CYP8B1 cell-based assays, again showing a significant correlation. E. Same as in C, but ∆∆G plotted as a function of both cell-based assays combined. Spearman was one-tailed, all statistics were calculated in Prism. These data suggest duplicate PyRx docking runs of the 57 compounds to 18 crystal structures of LRH-1 produced reproducible ∆G and ∆∆G values, which correlated with LRH-1 activity in cell-based assays.
High ∆∆G values were generated from small-molecule bound LRH-1 LBD crystal structures. Previous work from other groups has established that ligands can induce changes in LRH-1 crystal structures. These ligand-induced differences in co-crystal structures might drive the association of ∆∆G with full-length LRH-1 regulation, therefore the nature of the co-crystalized ligands may indicate how ∆∆G correlates with full-length LRH-1 activity. LRH-1 binds phospholipids, we compared 9 structures in the protein data bank (PDB) of LRH-1 co-crystalized with phospholipids (1YOK, 1YUC, 1ZDU, 3TX7, 4DOR, 4DOS, 4ONI, 4PLE and 4RWV) to another 9 LRH-1 structures co-crystalized with synthetic small molecules (3PLZ, 4PLD, 5UNJ, 5L11, 5SYZ, 6OR1, 6VC2, 6OQX and 6OQY). The ∆∆G values produced by these 18 crystal structures varied for each compound, so we averaged the ∆∆G values for each crystal structure across all 57 compounds, to produce an average ∆∆G value for each crystal structure (∆∆GXSTAL), permitting comparison of crystal structures, rather than comparing compounds. We immediately noted the 9 phospholipid-bound crystal structures all had lower ∆∆GXSTAL values than the 9 small-molecule bound crystal structures (Fig. 9A), also confirmed in pairwise comparison of the ∆∆GXSTAL values across the 18 structures (Fig. 9B), consistent with the small-molecule co-crystalized structures producing smaller ligand binding pockets [64]. We next tested if these differences were the source of the association between compound ∆∆G values and full-length LRH-1 regulation by generating a new term ∆∆G* (Fig. 9C), which is the averaged ∆G from the 9 small molecule-bound structures, less the averaged ∆G from the 9 phospholipid-bound structures. Although the magnitude of ∆∆G* values were higher than ∆∆G values (Fig S6A-B, p < 0.0001), the value of ∆∆G* did not associate with regulation of full-length LRH-1 in cells, regardless of how the data were analyzed (Fig. 9D, Fig S6C). We found ∆∆G values calculated only from the small molecule-bound crystal structures correlated slightly better with LRH-1 regulation in cells than ∆∆G values calculated from only the phospholipid-bound structures of LRH-1 (Fig S7A-C). These data suggest crystal structures that produced the highest ∆∆G values were co-crystalized with synthetic small molecules, while phospholipid-bound structures produced lower ∆∆G values. To identify differences in these two classes of crystal structures, we next applied network analyses.
Fig. 9.
Crystal structures that produced the highest ∆∆G values (∆∆GXSTAL) were bound to small molecules. Since the value of ∆∆G generated from small molecule-bound structures appeared higher than ∆∆G generated from phospholipid-bound structures, we generated a new value called ∆∆GXSTAL which is the average of all 57 ∆∆G values (one ∆∆G value for each of 57 compounds), for each of the 18 crystal structures of LRH-1. A. ∆∆GXSTAL for small molecule-bound structures was significantly higher than ∆∆GXSTAL for phospholipid-bound structures (t-test, p < 0.0001, each point represents ∆∆GXSTAL for one of 18 crystal structures). B. Pairwise comparison of all 18 ∆∆GXSTAL values listed by PDB code, suggesting small molecule vs. phospholipid-bound structures generated different ∆∆GXSTAL values. C. To test if higher ∆∆G values from small molecule-bound structures is the source of the association between ∆∆G and full-length LRH-1 regulation in cells, we determined ∆∆G* , defined as the difference between the average ∆G for each compound bound to the 9 small molecule-bound structures (∆GSmall Molecule) less the average ∆G bound to the 9 phospholipid-bound structures (∆GPhospholipid). D. Contingency showing compounds in the highest 10th percentile of ∆∆G* values did not more frequently regulate LRH-1, regardless of L2FC cutoff by Fisher’s exact test, also supported by further analyses in supplemental data, p values calculated in Prism. These data suggest ∆∆G might reflect a structural aspect of small-molecule bound crystal structures of LRH-1.
High ∆∆GXSTAL structures have unique network properties. Several previous studies have used networking approaches to better understand how activating LRH-1 ligands are allosterically paired with transcriptional coregulators [69], [70]. To identify any properties of the communication networks within the crystal structures that produced the highest ∆∆GXSTAL values, we performed network analyses on the same 18 structures of LRH-1. We assigned an eigenvector centrality value to each amino acid in each of the 18 structures using RING (3.0) [76] (Supplemental Spreadsheet 5). Eigenvector centrality reflects the connectedness of an amino acid with other amino acids in the 3D structure [77]. We then assigned averaged eigenvector centrality across all amino acids in each secondary structural element of the LRH-1 LBD (12 alpha helices, 6 connecting loops and 2 beta sheets, see methods for amino acid numbering, Supplemental Spreadsheet 6), and applied principal component analysis to analyze this multi-dimensional data. Variance in the top principal component (PC1) formed two clear clusters: 1) low ∆∆GXSTAL structures bound by phospholipids and 2) high ∆∆GXSTAL structures bound by synthetic small molecules (Fig. 10A). We then examined edges between nodes in the network (Supplemental Spreadsheet 7) that were unique to each of these two groups of crystal structures (Supplemental Spreadsheet 8), which showed clear differences between the low vs. high ∆∆GXSTAL structures (Fig. 10B). Superposition of the 18 crystal structures shows that in the nine highest ∆∆GXSTAL structures (Fig. 10C, red structures), Helix 6 is in a position that constricts the entrance to the ligand binding pocket (Fig. 10D), also suggested by the position of Helix 3 closer to Helix 6 (Fig. 10E). In the nine lowest ∆∆GXSTAL structures Helix 6 is in the opposite position, opened at the entrance to the ligand binding pocket (Fig. 10C-E). Previous network studies based on molecular dynamics simulations of the ligand binding domain have also suggested the position of Helix 6 as an important determinant of LRH-1 conformations that associate with ligand regulation [70]. The data presented here now suggest the position of Helix 6 correlates with ∆∆G values, and ∆∆G correlates with full-length LRH-1 activity in cell-based assays. The data here suggest ∆∆G has utility in prioritizing hit compounds from screening campaigns for follow-up screening in full-length LRH-1 cell-based assays.
Fig. 10.
The position of Helix 6 associates with higher ∆∆GXSTALvalues. The ∆∆GXSTAL is the averaged ∆∆G value of 57 compounds for each crystal structure (indicated in this figure simply as ∆∆G). A. Principal Component Analysis of eigenvector centrality values assigned to each secondary structural element (12 Helices, 3 loops and 1 beta strand) in all 18 crystal structures of LRH-1. PCA shows clustering of structures with low ∆∆GXSTAL values co-crystalized with phospholipids vs. structures with high ∆∆GXSTAL values co-crystalized with small molecules, see methods for network analysis details, ∆∆GXSTAL values indicated. B. Edges between network nodes that were unique to the low ∆∆GXSTAL structures (blue, left) or the high ∆∆GXSTAL structures (red, right) were mapped onto PDB:6OQX. Note that among many differences, one difference in the high ∆∆GXSTAL structures is the presence of more edges between Helix 6 and Helix 3 at the entrance to the ligand binding pocket, indicated with dashed circle and arrows. C. Superposition of all 9 low ∆∆GXSTAL structures bound by phospholipids (blue) and all 9 high ∆∆GXSTAL structures bound by small molecules (red), less ordered loop regions were removed for clarity. Examination of the superposition showed the largest change in position was associated with Helix 6. D. Closeup of boxed region in panel C., showing change in Helix 6 position in the low ∆∆GXSTALvs. high ∆∆GXSTAL structures. E. End-on alternative view of Helix 6, showing the position of Helix 6 favoring closer proximity to Helix 3 in the high ∆∆GXSTAL structures. These data suggest that positions of Helix 6 which close the entrance to the ligand binding pocket associate with higher values of ∆∆G.
3. Discussion
Prioritizing hit compounds from a large screen is often a necessary experimental constraint [1], [2], [3], [4], [5], [6] as secondary assays in living cells can be very resource-intensive [7], [8], [13)]. Therefore, in terms of drug screening, the few “outlier” compounds with the most biological activity are of the highest biomedical interest. Identifying these high activity “outlier” compounds could be arguing is the purest goal of drug screening. Here, we present a computational heuristic (∆∆G) with utility for prioritizing LRH-1 compounds for secondary assays in cells. We validated the ∆∆G metric retrospectively by using a total of 439 orthogonal functional assays. We also present data suggesting the more direct metric of docked binding energy to LRH-1 did not correlate with the ability of compounds to regulate full length LRH-1 in cells. This study identifies and validates a link between in silico docking score and LRH-1 function in cell-based high throughput functional assays. Binding constants determined in the wet lab have been linked to LRH-1 regulatory activity for several, but not all individual compounds [42], [49], however these approaches are resource-intensive. We propose ∆∆G could be applied to prioritize hits from large primary compound screens targeting LRH-1.
Although it is difficult for the computational data presented here to definitively establish how ∆∆G associates with LRH-1 regulation, we found three lines of evidence suggesting a connection with the position of Helix 6 at the entrance to the LRH-1 ligand binding pocket. First, the docked positions of high ∆∆G compounds clustered around the entrance to the ligand binding pocket at Helix 6 (Fig. 2), suggesting changes to the position of Helix 6 might be responsible for altered compound docking and higher ∆∆G values. Second, our network analyses suggested the crystal structures that generated the highest ∆∆GXSTAL values had more edges connecting Helix 6 to Helix 3. Third, the entrance to the ligand binding pocket in high ∆∆GXSTAL structures was in a more closed position. Here we must mention that comparing crystal structures from non-identical space groups must be interpreted with caution. Perhaps most importantly, convincing work from the Ortlund lab at Emory and now the Okafor lab at Penn State [75] have used extensive MD simulations, network analyses and direct comparisons of different ligand-bound structures of LRH-1 to independently suggest an important role for Helix 6 in ligand-regulated activation of LRH-1 [37], [64], [69], [70], [73], [74], [75], linking the mobility of residues at the entrance to the ligand binding pocket near Helix 6 to synthetic small molecule-induced LRH-1 activity [74]. The volume of the ligand-binding pocket is smaller in structures bound to synthetic small molecules vs. phospholipids [64], consistent with the docking studies presented here, as well as hydrogen-deuterium exchange data [37], [64], [70], [74], MD simulations [64], [69], [70], [73], [74], [75] and other network analyses [69], [74]. Together, these studies support a role for Helix 6 is an important element in translating ligand binding events from the binding pocket to the transcriptional coregulator [69], [70]. Our analyses of cell-based data are consistent with those findings, however it will be important to apply structural biology to determine how high vs. low ∆∆G compounds regulate the dynamics of Helix 6 in future analyses.
The structures generating the highest ∆∆G values were co-crystalized with synthetic small molecules, while the phospholipid-bound structures produced the lowest ∆∆G values. The full-length LRH-1 structure is a computational model based on a phospholipid-bound LBD crystal structure (PDB:1YOK). We therefore hypothesized it might be this phospholipid-bound starting point in the full-length modeling that produced high ∆∆G values. Calculating ∆∆G* using the average binding energies of each compound to the phospholipid-bound vs. synthetic small molecule-bound LBD crystal structures produced ∆∆G* values that did not associate with any regulation of full-length LRH-1 in cells. However, since ΔΔG values calculated from all 18 structures together and the small-molecule structures only gave similar correlations (Fig S7A-B), we would suggest future researchers use all 18 structures to calculate ΔΔG when prioritizing compounds from their screens. Should docking resources become limiting, we would suggest eliminating the phospholipid-bound structures, as these ΔΔG values were not as robustly correlated with cellular LRH-1 activity (Fig S7C). It therefore remains unclear what aspects of rigid docking to the full-length LRH-1 results in the association between ∆∆G and full-length LRH-1 regulation in cells, but we will speculate on two potential hypotheses. The first is rather trivial, simply that the modeled position of Helix 6 in full-length LRH-1 is somewhat "between" the position of Helix 6 in the high vs. low ∆∆GXSTAL structures, and that this position imparts the ability of the full-length model to uniquely “sense” or “block” interactions between compounds and LRH-1. The second is highly speculative but worth mentioning. The published model of full-length LRH-1 does not position Helix 6 in the interface between the LBD and the DNA-binding domain (DBD). However, while developing the full-length LRH-1 model [72], a lower-energy model was produced by Rosetta, in which Helix 6 was directly in the LBD-DBD interface, as were residues at the entrance to the ligand binding pocket, referred to as “Model 2”. Although Model 2 was supported by chemical crosslinking experiments, Model 2 was tested using mutants of LRH-1 (I415Q and S418D) which did not affect full-length LRH-1 luciferase activity [72], suggesting Model 2 was not functionally relevant, at least in the assays used to test the model. Still, it remains formally possible that in other cellular contexts that remain untested, that Helix 6 could exist in the interface between the LBD and DBD in particular conformational states of full-length LRH-1. Accordingly, the high ∆∆G compounds might simply bind LRH-1 at the entrance to the ligand binding pocket near Helix 6 (Fig. 2E), directly in the LBD-DBD interface predicted by Model 2. Supporting this hypothesis, compounds that operate in this way would be expected to bind the isolated LBD (Fig. 1B) [1] and regulate full-length LRH-1 (Fig. 1G) but would not be expected to regulate the isolated LBD (Fig. 1F). A total of 9 compounds of the 57 examined here meet those criteria (Supplemental Spreadsheet 2) and will be of great interest to determine co-crystal structures. More wet-lab study will be needed to establish the role of Helix 6 in full-length LRH-1 structural regulation. It will also be interesting to determine if ∆∆G might translate to other nuclear receptors, particularly Steroidogenic Factor-1 (SF-1, NR5A1), a close homolog of LRH-1. There is no experimentally validated structure of full-length SF-1, and only one crystal structure of SF-1 bound to a synthetic small molecule (PDB:8DAF) [78]. Since small molecule-bound structures of LRH-1 generated higher ∆∆G values gave more power to the LRH-1 analyses, we would hypothesize that the lack of SF-1 structures bound to small molecules may statistically weaken ∆∆G analyses of SF-1.
One limitation of this study is the small size of the compound library (only 57 compounds from a 2322 compound library), as it remains possible that in larger (or smaller) screens the ∆∆G metric would not associate with compound activity on full-length LRH-1 in cells. The 57 hit compounds were identified to directly bind LRH-1 from the Discovery Spectrum Library, which is a library of 2322 synthetic compounds and natural products. Lipinski parameters for these 57 compounds are provided (Supplemental Spreadsheet 1), the 57 compounds had an average mass of 340 ± 189 at. mass units; an average of 3.1 ± 2.6 hydrogen bond donors; an average of 5.5 ± 4.0 hydrogen bond acceptors; average molar refractivity of 66.1 ± 42.5. The small size of the primary screen permitted application of several independent, orthogonal secondary assays, a clear strength of this study. Executing more compound screens using different library sizes with follow-up secondary assays in cells would be a very resource-intensive task and outside the scope here. The secondary screens in our previous paper were limited to only the hit compounds identified, we did not perform secondary screening on all 2322 compounds, as resources are not unlimited. That our analyses found correlations in the data suggests similar associations might hold in larger screens, which can improve ongoing LRH-1 compound screening and development efforts. Another limitation of this study is that the correlation observed may not hold if a different docking program were used. Further, the compounds studied here only induced effect sizes in biological wet lab assays that were relatively small, those effects all significantly differed from DMSO controls, suggesting small but significant modulatory roles for this set of compounds. That ΔΔG associated with the biological activity of this set of marginal regulators of LRH-1 further highlights the potential power of the ΔΔG metric. However, we acknowledge that it remains possible that ΔΔG may not associate with the activity of “better” LRH-1 compounds, which induce larger effect sizes in biological assays. Regardless of how ∆∆G associates with cell-based LRH-1 activity, the utility of ∆∆G in prioritizing hits from wet lab drug screens remains.
This study introduces a new metric called ∆∆G, derived from the rigid docking scores of compounds docked to 19 different structural models of LRH-1, which positively correlates with the ability of 57 compounds to regulate full-length LRH-1 in cell-based assays. The docking scores themselves did not have any association with full-length LRH-1 activity in cells. Network analyses suggest a closed position of Helix 6 at the mouth of the ligand binding pocket associates with the highest ∆∆GXSTAL structures, also observed in other computational studies of LRH-1 [70]. This observation provides a potential explanation as to how ∆∆G associates with compound activity on LRH-1, which awaits further testing using wet lab structural biology. This study provides a new computational tool that can aid in the prioritization of hit compounds for follow-up secondary screens in LRH-1 drug development efforts, while further supporting an important role for Helix 6 in ligand-regulation of full-length LRH-1.
4. Methods
4.1. Materials
All new data in this manuscript is computational, all wet lab assay data were previously published [1], with the most relevant data provided as Supplemental Spreadsheets with this manuscript for convenience. Rigid body docking was executed with PyRx (Version 0.8) using Autodock Vina [79], run on a single Dell Precision 5820 machine running Ubuntu Linux 20.04 on an Intel Xeon 8-core, 3.9 GHz processor. Structures were visualized and figures generated using Schrodinger PyMOL (Version 2.4.2) and UCSF ChimeraX (Version 1.6.1), protein structure networks were generated using RING (Version 3.0) mapped onto structures by UCSF ChimeraX, eigenvector centrality was calculated using python library NetworkX (Version 3.1) and data were processed in Microsoft Excel (Version 16.74).
4.2. Statistical analyses
Principal component analyses used equally scaled data for all variables analyzed (log2 fold change compared to DMSO control) and parallel analysis with 1000 simulations and selected the 2 components shown in biplots in Fig. 1 and Figure 11 were calculated in GraphPad Prism (Version 10.0.0), the matrices for the PCA in Fig. 1E is provided as Supplemental Spreadsheet 1, and for Figure 11A the PCA matrix is provided in Supplemental Spreadsheet 6. Fisher’s exact tests of contingency were two-sided for all analyses also calculated in Prism. Spearman rank correlation r values were approximated, all Spearman p values were one-tailed as the slope of initial linear regressions were positive making negative correlations unlikely. Linear regressions, slopes and F-tests for non-zero slopes were also calculated in Prism, regressions were performed without constraint on x-axis intersection. Pairwise comparison of ∆∆GXSTAL values used a Euclidean distance matrix calculated by Heatmapper.ca, paired t-test was two-tailed and calculated in Prism, all analysis files are available upon request. The statistical tests are listed in figure legend, “ns” represents any p value that is not significant (tests null) for indicated test, “ *p” represents any p-value less than 0.05, “ * *p” represents any p-value less than 0.01, “ * **p” represents any p-value less than 0.005, “ * ** *p” represents any p-value less 0.0001. All analysis files are available on request.
4.3. Exclusion of VU0656093
Although 58 compounds were identified in the previously published compound screen, only 57 compounds were analyzed for the majority of this study. We excluded one compound (VU0656093) based on the PyRx docking to human LRH-1 ligand binding domain crystal structure PDB:6OQX from our previously published data [1], as PyRx docking resulted in an extraordinarily high positive binding energy (+44.9 kcal/mol) to 6OQX. In this study, VU0656093 was also the only compound to produce positive values for the docked binding energy to the LBD structure 1YOK (+15.3 kcal/mol) and the full-length LRH-1 model (+26.6 kcal/mol). PyRx relative docked binding energies for VU0656093 are included in Supplemental Spreadsheet 3 associated with this manuscript for completeness.
4.4. Overview of published wet lab data
The 58 hit compounds were identified in a previous publication to directly bind the isolated LBD of LRH-1, using a FRET-based high-throughput screen [1]. Briefly, the 2322 compound Discovery Spectrum Collection library was used in that FRET screen, the 58 compounds were identified to decrease FRET between a fluorophore-labeled phospholipid probe installed in LRH-1 at the canonical ligand binding site and the fluorophore-labeled LRH-1 LBD protein, as FRET donor and acceptor respectively. The design of the screen suggests the identified hit compounds compete with the phospholipid probe for binding to the canonical ligand binding site in LRH-1, however no structures of the 58 hit compounds co-crystalized with LRH-1 have demonstrated this unequivocally. Binding constants to LRH-1 were not determined for most hit compounds from the screen, but a handful of binding constants were determined to validate the screen, including two high ∆∆G compounds relevant to the current study, VU0243218 (IC50 =9.4 μM [95 %CI 8.1–10.8 μM] for binding to the isolated LRH-1 LBD) and VU0656021 (IC50 =27.0 μM [95 %CI 16.1–171.5 μM] for binding to the isolated LRH-1 LBD). Both these compounds are high ∆∆G compounds that had fully saturable binding curves that could only be fit to a one-site binding model, strongly suggesting a direct, stoichiometric interaction of at these two compounds with the LRH-1 ligand binding domain, despite docking outside the canonical ligand binding pocket of LRH-1 (PDB:6OQX). The 58 hit compounds were subjected to secondary functional screens: 1) 10 μM compound-induced interaction between recombinantly expressed and purified LRH-1 LBD and a fluorophore-labeled peptide (representing the transcriptional coactivator PGC1α) by fluorescence anisotropy. 2) 10 μM compound-induced luciferase expression of a CYP8B1-promoter driven reporter in HEK293T cells expressing full-length human LRH-1 by transient co-transfection. Importantly, control luciferase reactions in the absence of co-transfected LRH-1 were used for normalization, so only the compound-regulated luciferase signal dependent upon LRH-1 co-transfection was analyzed. 3) 10 μM compound-induced luciferase expression of a CYP17A1-promoter driven reporter in HEK293T cells expressing full-length human LRH-1 by transient co-transfection, using the same control methods as above. Results from the previously published assays are available associated with this manuscript in Supplemental Spreadsheet 1, for complete technical details on all the methods and data see DOI: 10.1021/acschembio.2c00805.
4.5. PyRx computational docking
PyRx [79] rigid body computational docking was used, the 58 hit compounds were first docked to 6OQX PDB structure of human LRH-1 ligand-binding domain (LBD) [1], then docked to the integrated structural model of full-length human LRH-1 (PDB_DEV: 00000035), then PDB 1YOK as another crystal structure of human LRH-1 LBD, followed by the remaining 16 crystal structures analyzed in this study for a total of 18 human LRH-1 LBD crystal structures used for PyRx docking, listed here (PDB: 6OQX, 6VC2, 6OR1, 6OQY, 5UNJ, 5SYZ, 5L11, 4RWV, 4PLE, 4PLD, 4ONI, 4DOR, 4DOS, 3TX7, 3PLZ, 1ZDU, 1YUC, 1YOK), plus the integrated structural model of full-length LRH-1 (PDB_DEV: 00000035). The full-length LRH-1 structure is based on in-solution biophysical restraints applied to Rosetta-based computational docking, in which the ligand binding domain was computationally optimized, which was then validated using genetics, biochemistry and solution structural analyses [72] but is not a crystal structure. For PyRx ligand docking, all proteins were prepared for docking by removing all co-crystallized ligands, ions, and water molecules. The 2D ligand structures in SDF format were converted to 3D using OpenBabel [80] and energy-minimized using a universal force field with 200 steps, saved in PDBQT format, all ligands are available in Supplemental_Zip_File_1.zip available on the publisher’s website or by contact the corresponding author. Size and position of the grid box, search space, and scoring function were set in PyRx [71], the docking grid box size was X:25 Å, Y:25.2 Å, Z:25 Å centered on the canonical ligand binding site in human LRH-1. PyRx generated nine docked poses for each ligand with a corresponding docking score (relative docked binding energy, kcal/mol), pose associated with the lowest energy docking score was used for analyses. Output files were saved in PDBQT format, all docked poses were visualized using molecular graphics software PyMOL [81] or UCSF ChimeraX [82]. Docking scores are provided in Supplemental Spreadsheet 3, PDB files for all docked poses are available upon request and are included in the Supplemental_Zip_File_1.zip.
4.6. Protein structure networks
Connectivity within sets of LRH-1 crystal structures were evaluated using protein structure networks (PSN) generated using RING 3.0 [76]. Identical parameters were selected for all 18 structures (PDB: 6OQX, 6VC2, 6OR1, 6OQY, 5UNJ, 5SYZ, 5L11, 4RWV, 4PLE, 4PLD, 4ONI, 4DOR, 4DOS, 3TX7, 3PLZ, 1ZDU, 1YUC, 1YOK), using a relaxed model that returns one edge between two amino acid nodes. Protein structure networks in RING were set to include the closest nodes using relaxed distance thresholds, a single edge, with water molecules excluded and distance parameters set to 5.5 Å for hydrogen bonds, 5 Å for ionic interactions, 0.8 Å for Van der Waals, 7 Å for π-π stacking, 7 Å for π-cation interactions and 3 Å for disulfide bonds. Protein structure networks were generated using edge files by creating an adjacency list from Nodes a and b, each representing the two structures being pairwise compared (a,b), and the edges mapped between the C-alpha atom coordinates from each PDB file. The network edges were imposed on protein structures for visualization by mapping pseudo bond structures in UCSF ChimeraX [82], the NetworkX Python package was used to generate the graph objects [76], [82].
4.7. Eigenvector centrality and principal components analysis
Protein structure networks generated by RING 3.0 [76] were used in eigenvector centrality (EC) analysis using python library NetworkX to yield an EC value for each amino acid node in the network for 18 LRH-1 crystal structures from the PDB (PDB: 6OQX, 6VC2, 6OR1, 6OQY, 5UNJ, 5SYZ, 5L11, 4RWV, 4PLE, 4PLD, 4ONI, 4DOR, 4DOS, 3TX7, 3PLZ, 1ZDU, 1YUC, 1YOK), the same 18 crystal structures used throughout this study. EC values are reported in Supplemental Spreadsheet 5. EC values at amino acid nodes were averaged within the standard secondary structural elements of the LRH-1 LBD, each element assigned a label according to the standard 12-helix ligand binding domain as Helix 1(300−310), Helix 2(314−330), Loop 2(331−339), Helix 3(340−362), Helix 4(365−369), Helix 5(370−397), Loop 4(398−401), Helix 6(413−418), Helix 7(421−442), Helix 8(444−457), Loop 8(458−465), Helix 9(466−489), Loop 9(490−493), Helix 10(494−501), Helix 11(502−523), Loop 10(524−529), and Helix 12(530−538). For each of these 17 secondary structural elements, the EC values of all amino acid nodes in the element were averaged, averaged EC values for each secondary structural element in LRH-1 were used as 17 features for principal component analyses. PCA used 1000 simulations, executed in GraphPad Prism (Version 10.0.0). Eigenvector centrality (EC) is a metric of how strongly the centrality score of a node in a network is influenced by its connections to other central nodes. The largest eigenvalue of the adjacency matrix scales the magnitude of the associated eigenvector and satisfies the equation Av= λv, where A is the network adjacency matrix and λ is the largest eigenvalue of the adjacency matrix. This implies that for a network of size j, the eigenvector value for a given vertex vi is such that vi = λ-1 ∑jAij vj. In other words, the eigenvector value vi is the sum of the eigenvector values of its first-degree neighbors scaled by the largest eigenvalue.
4.8. Identifying unique network edges
To identify edges unique to each cluster in the principal component analysis, all unique edges for structures with high ∆∆G values were grouped together in Excel (PDB: 3PLX, 4IS8, 4PLD, 5I11, 5SYZ, 5UNJ, 6OQX, 6OQY, 6OR1, 6VC2) and all unique edges for structures with low ∆∆G values grouped together (PDB: 1YOK, 1YUC, 1ZDU, 3TX7, 4DOR, 4DOS, 4ONI, 4PLE, 4RWV). Common edges between the two groups were identified and removed in Excel, edges outside the intersection for each cluster were mapped onto PDB:6OQX by mapping pseudo bond structures using UCSF ChimeraX [82].
Funding
Funding was provided by GM138873 to R.D.B. and training grant support on HG008341 to Z.H.
CRediT authorship contribution statement
Raymond D. Blind: Writing – review & editing, Writing – original draft, Visualization, Validation, Supervision, Resources, Project administration, Investigation, Funding acquisition, Formal analysis, Data curation, Conceptualization. David Foutch: Writing – review & editing, Investigation, Formal analysis. Zeinab Haratipour: Writing – review & editing, Validation, Investigation, Formal analysis.
Declaration of Competing Interest
None.
Acknowledgements
We thank all members of the Blind Lab for helpful discussions but particularly Ana Chang-Gonzalez, James C. Poland, Alexis N. Campbell, Wong Jae Choi and Pratima Chapagain.
Footnotes
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.csbj.2024.07.021.
Appendix A. Supplementary material
Supplementary material
.
Supplementary material
.
Data Availability
All results associated with this manuscript are provided as supplemental spreadsheets, all other files are available upon request.
References
- 1.Malabanan M.M., Chapagain P., Haratipour Z., Choi W.J., Orun A.R., Blind R.D. New high-throughput screen discovers novel ligands of full-length nuclear receptor LRH-1. ACS Chem Biol. 2023 doi: 10.1021/ACSCHEMBIO.2C00805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Di Marco G., Vallese F., Jourde B., Bergsdorf C., Sturlese M., De Mario A., et al. A high-throughput screening identifies MICU1 targeting compounds. Cell Rep. 2020;30(7):2321–2331. doi: 10.1016/J.CELREP.2020.01.081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Durairaj G., Demir Ö., Lim B., Baronio R., Tifrea D., Hall L.V., et al. Discovery of compounds that reactivate P53 mutants in vitro and in vivo. Cell Chem Biol. 2022;29(9):1381–1395. doi: 10.1016/J.CHEMBIOL.2022.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Han H., Jain A.D., Truica M.I., Izquierdo-Ferrer J., Anker J.F., Lysy B., et al. Small-molecule MYC inhibitors suppress tumor growth and enhance immunotherapy. Cancer Cell. 2019;36(5):483–497. doi: 10.1016/J.CCELL.2019.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wei T.T., Chandy M., Nishiga M., Zhang A., Kumar K.K., Thomas D., et al. Cannabinoid receptor 1 antagonist genistein attenuates marijuana-induced vascular inflammation. Cell. 2022;185(10):1676–1693. doi: 10.1016/J.CELL.2022.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Jin Z., Du X., Xu Y., Deng Y., Liu M., Zhao Y., et al. Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors. Nature. 2020;582(7811):289–293. doi: 10.1038/S41586-020-2223-Y. [DOI] [PubMed] [Google Scholar]
- 7.Eder J., Sedrani R., Wiesmann C. The discovery of first-in-class drugs: origins and evolution. Nat Rev Drug Discov. 2014;13(8):577–587. doi: 10.1038/NRD4336. [DOI] [PubMed] [Google Scholar]
- 8.David E., Tramontin T., Zemmel R., Pharmaceutical R. &D: the road to positive returns. Nat Rev Drug Discov. 2009;8(8):609–610. doi: 10.1038/NRD2948. [DOI] [PubMed] [Google Scholar]
- 9.Fen W., Sicen W., Xilan G. A review for cell-based screening methods in drug discovery. Biophys Rep. 2021;7(6):504–516. doi: 10.52601/BPR.2021.210042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hughes J.P., Rees S.S., Kalindjian S.B., Philpott K.L. Principles of early drug discovery. Br J Pharm. 2011;162(6):1239–1249. doi: 10.1111/J.1476-5381.2010.01127.X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Michelini E., Cevenini L., Mezzanotte L., Coppa A., Roda A. Cell-based assays: fuelling drug discovery. Anal Bioanal Chem. 2010;398(1):227–238. doi: 10.1007/S00216-010-3933-Z. [DOI] [PubMed] [Google Scholar]
- 12.Blay V., Tolani B., Ho S.P., Arkin M.R. High-throughput screening: today’s biochemical and cell-based approaches. Drug Discov Today. 2020;25(10):1807–1821. doi: 10.1016/J.DRUDIS.2020.07.024. [DOI] [PubMed] [Google Scholar]
- 13).Van Vleet T.R., Liguori M.J., Lynch J.J., Rao M., Warder S. Screening strategies and methods for better off-target liability prediction and identification of small-molecule pharmaceuticals. SLAS Discov. 2019;24(1):1–24. doi: 10.1177/2472555218799713. [DOI] [PubMed] [Google Scholar]
- 14.Sadybekov A.V., Katritch V. Computational Approaches Streamlining Drug Discovery. Nature. 2023;616(7958):673–685. doi: 10.1038/S41586-023-05905-Z. [DOI] [PubMed] [Google Scholar]
- 15.de Souza Neto L.R., Moreira-Filho J.T., Neves B.J., Maidana R.L.B.R., Guimarães A.C.R., Furnham N., et al. In silico strategies to support fragment-to-lead optimization in drug discovery. Front Chem. 2020;8 doi: 10.3389/FCHEM.2020.00093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kar S., Roy K. How far can virtual screening take us in drug discovery? Expert Opin Drug Discov. 2013;8(3):245–261. doi: 10.1517/17460441.2013.761204. [DOI] [PubMed] [Google Scholar]
- 17.Slater O., Kontoyianni M. The compromise of virtual screening and its impact on drug discovery. Expert Opin Drug Discov. 2019;14(7):619–637. doi: 10.1080/17460441.2019.1604677. [DOI] [PubMed] [Google Scholar]
- 18.Schneider G. Virtual screening: an endless staircase? Nat Rev Drug Discov. 2010;9(4):273–276. doi: 10.1038/NRD3139. [DOI] [PubMed] [Google Scholar]
- 19.Weiss D.R., Karpiak J., Huang X.P., Sassano M.F., Lyu J., Roth B.L., et al. Selectivity challenges in docking screens for GPCR targets and antitargets. J Med Chem. 2018;61(15):6830–6845. doi: 10.1021/ACS.JMEDCHEM.8B00718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kampen S., Rodríguez D., Jørgensen M., Kruszyk-Kujawa M., Huang X., Collins M., et al. Structure-based discovery of negative allosteric modulators of the metabotropic glutamate receptor 5. ACS Chem Biol. 2022;17(10):2744–2752. doi: 10.1021/ACSCHEMBIO.2C00234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Forli S., Huey R., Pique M.E., Sanner M.F., Goodsell D.S., Olson A.J. Computational protein-ligand docking and virtual drug screening with the AutoDock suite. Nat Protoc. 2016;11(5):905–919. doi: 10.1038/NPROT.2016.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kaplan A.L., Confair D.N., Kim K., Barros-Álvarez X., Rodriguiz R.M., Yang Y., et al. Bespoke library docking for 5-HT2A receptor agonists with antidepressant activity. Nature. 2022;610(7932):582–591. doi: 10.1038/S41586-022-05258-Z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Rohban M.H., Fuller A.M., Tan C., Goldstein J.T., Syangtan D., Gutnick A., et al. Virtual screening for small-molecule pathway regulators by image-profile matching. Cell Syst. 2022;13(9):724–736. doi: 10.1016/J.CELS.2022.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pinzi L., Rastelli G. Molecular docking: shifting paradigms in drug discovery. Int J Mol Sci. 2019;20(18) doi: 10.3390/IJMS20184331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bell P.A., Munck A. Steroid binding and stabilization of glucocorticoid-receptor proteins from rat thymus. Biochem J. 1972;126(3) doi: 10.1042/bj1260011pa. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Chakravarthy M.V., Lodhi I.J., Yin L., Malapaka R.R.V., Xu H.E., Turk J., et al. Identification of a physiologically relevant endogenous ligand for PPARa in liver. Cell. 2009;138(3):476–488. doi: 10.1016/j.cell.2009.05.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Raghuram S., Stayrook K.R., Huang P., Rogers P.M., Nosie A.K., McClure D.B., et al. Identification of heme as the ligand for the orphan nuclear receptors REV-ERBα and REV-ERBβ. Nat Struct Mol Biol. 2007;14(12):1207–1213. doi: 10.1038/nsmb1344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Musille P.M., Kohn J.A., Ortlund E.A. Phospholipid--driven gene regulation. FEBS Lett. 2013;587(8):1238–1246. doi: 10.1016/j.febslet.2013.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Crowder M.K.M.K., Seacrist C.D.C.D., Blind R.D.R.D. Phospholipid regulation of the nuclear receptor superfamily. 2017;Vol. 63:6–14. doi: 10.1016/j.jbior.2016.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Faial T. NR5A2 activates the zygotic genome. Nat Genet. 2023;55(1):1. doi: 10.1038/S41588-022-01289-4. [DOI] [PubMed] [Google Scholar]
- 31.Gassler J., Kobayashi W., Gáspár I., Ruangroengkulrith S., Mohanan A., Hernández L.G., et al. Zygotic genome activation by the totipotency pioneer factor Nr5a2. Science. 2022;378(6626):1305–1315. doi: 10.1126/SCIENCE.ABN7478. [DOI] [PubMed] [Google Scholar]
- 32.Mamrosh J.L., Lee J.M., Wagner M., Stambrook P.J., Whitby R.J., Sifers R.N., et al. Nuclear receptor LRH-1/NR5A2 is required and targetable for liver endoplasmic reticulum stress resolution. Elife. 2014;3:2014. doi: 10.7554/ELIFE.01694.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Klatt K.C., Petviashvili E.J., Moore D.D. LRH-1 induces hepatoprotective nonessential amino acids in response to acute liver injury. J Clin Invest. 2023;133(7) doi: 10.1172/JCI168805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Xu P., Oosterveer M.H., Stein S., Demagny H., Ryu D., Moullan N., et al. LRH-1-dependent programming of mitochondrial glutamine processing drives liver cancer. Genes Dev. 2016;30(11):1255–1260. doi: 10.1101/gad.277483.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Oosterveer M.H., Mataki C., Yamamoto H., Harach T., Moullan N., van Dijk T.H., et al. LRH-1-dependent glucose sensing determines intermediary metabolism in liver. J Clin Invest. 2012;122(8):2817–2826. doi: 10.1172/JCI62368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lee J.M., Lee Y.K., Mamrosh J.L., Busby S.A., Griffin P.R., Pathak M.C., et al. A nuclear-receptor-dependent phosphatidylcholine pathway with antidiabetic effects. Nature. 2011;474(7352):506–510. doi: 10.1038/nature10111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Musille P.M., Pathak M.C., Lauer J.L., Hudson W.H., Griffin P.R., Ortlund E.A. Antidiabetic phospholipid-nuclear receptor complex reveals the mechanism for phospholipid-driven gene regulation. Nat Struct Mol Biol. 2012;19(5):532–537. doi: 10.1038/nsmb.2279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cobo I., Martinelli P., Flández M., Bakiri L., Zhang M., Carrillo-de-Santa-Pau E., et al. Transcriptional regulation by NR5A2 links differentiation and inflammation in the pancreas. Nature. 2018;554(7693):533–537. doi: 10.1038/nature25751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Holmstrom S.R., Deering T., Swift G.H., Poelwijk F.J., Mangelsdorf D.J., Kliewer S.A., et al. LRH-1 and PTF1-L coregulate an exocrine pancreas-specific transcriptional network for digestive function. Genes Dev. 2011;25(16):1674–1679. doi: 10.1101/GAD.16860911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.de Lange K.M., Moutsianas L., Lee J.C., Lamb C.A., Luo Y., Kennedy N.A., et al. Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet. 2017;49(2):256–261. doi: 10.1038/ng.3760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Fernandez-Marcos P.J., Auwerx J., Schoonjans K. Emerging actions of the nuclear receptor LRH-1 in the gut. Biochim Biophys Acta. 2011;1812(8):947–955. doi: 10.1016/J.BBADIS.2010.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Mays S.G., D’Agostino E.H., Flynn A.R., Huang X., Wang G., Liu X., et al. A phospholipid mimetic targeting LRH-1 ameliorates colitis. Cell Chem Biol. 2022 doi: 10.1016/j.chembiol.2022.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Benod C., Vinogradova M.V., Jouravel N., Kim G.E., Fletterick R.J., Sablin E.P. Nuclear receptor liver receptor homologue 1 NR5A2 (LRH-1) regulates pancreatic cancer cell growth and proliferation. Proc Natl Acad Sci USA. 2011;108(41):16927–16931. doi: 10.1073/pnas.1112047108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Petersen G.M., Amundadottir L., Fuchs C.S., Kraft P., Stolzenberg-Solomon R.Z., Jacobs K.B., et al. A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nat Genet. 2010;42(3):224–228. doi: 10.1038/ng.522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Lin Q., Aihara A., Chung W., Li Y., Huang Z., Chen X., et al. LRH1 as a driving factor in pancreatic cancer growth. Cancer Lett. 2014;345(1):85–90. doi: 10.1016/j.canlet.2013.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Flynn A.R., Mays S.G., Ortlund E.A., Jui N.T. Development of hybrid phospholipid mimics as effective agonists for liver receptor homologue-1. ACS Med Chem Lett. 2018;9(10):1051–1056. doi: 10.1021/acsmedchemlett.8b00361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Mays S.G., Stec J., Liu X., D’Agostino E.H., Whitby R.J., Ortlund E.A. Enantiomer-specific activities of an LRH-1 and SF-1 dual agonist. Sci Rep. 2020;10(1) doi: 10.1038/S41598-020-79251-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48).Cornelison J.L., Cato M.L., Johnson A.M., D’Agostino E.H., Melchers D., Patel A.B., et al. Development of a new class of liver receptor homolog-1 (LRH-1) agonists by photoredox conjugate addition. Bioorg Med Chem Lett. 2020;30(16) doi: 10.1016/j.bmcl.2020.127293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Mays S.G., Flynn A.R., Cornelison J.L., Okafor C.D., Wang H., Wang G., et al. Development of the first low nanomolar liver receptor homolog-1 agonist through structure-guided design. J Med Chem. 2019 doi: 10.1021/acs.jmedchem.9b00753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50).Wagner R.L., Apriletti J.W., Mc Grath M.E., West B.L., Baxter J.D., Fletterick R.J. A structural role for hormone in the thyroid hormone receptor. Nature. 1995;378(6558):690–697. doi: 10.1038/378690a0. [DOI] [PubMed] [Google Scholar]
- 51.Evans R.M. The nuclear receptor superfamily: a rosetta stone for physiology. Mol Endocrinol. 2005;19(6):1429–1438. doi: 10.1210/me.2005-0046. [DOI] [PubMed] [Google Scholar]
- 52.Evans R.M., Mangelsdorf D.J. Nuclear receptors, RXR, and the big bang. Cell. 2014;157(1):255–266. doi: 10.1016/j.cell.2014.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Sablin E.P., Blind R.D., Krylova I.N., Ingraham J.G., Cai F., Williams J.D., et al. Structure of SF-1 bound by different phospholipids: evidence for regulatory ligands. Mol Endocrinol. 2009;23(1) doi: 10.1210/me.2007-0508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Krylova I.N., Sablin E.P., Moore J., Xu R.X., Waitt G.M., MacKay J.A., et al. Structural analyses reveal phosphatidyl inositols as ligands for the NR5A orphan receptors SF-1 and LRH-1. Cell. 2005;120(3):343–355. doi: 10.1016/j.cell.2005.01.024. [DOI] [PubMed] [Google Scholar]
- 55.Blind R.D., Suzawa M., Ingraham H.A. Direct modification and activation of a nuclear receptor - PIP2 complex by the inositol lipid kinase IPMK. Sci Signal. 2012;5(229) doi: 10.1126/scisignal.2003111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Blind R.D., Sablin E.P., Kuchenbecker K.M., Chiu H.J., Deacon A.M., Das D., et al. The signaling phospholipid PIP3 creates a new interaction surface on the nuclear receptor NR5A1 (SF-1) Proc Natl Acad Sci USA. 2014;111(42):15054–15059. doi: 10.1073/pnas.1416740111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Bryant J.M., Malabanan M.M., Vanderloop B.H., Nichols C.M., Haratipour Z., Poon K.T., et al. The acyl chains of phosphoinositide PIP3 alter the structure and function of nuclear receptor steroidogenic factor-1 (NR5A1) J Lipid Res. 2021 doi: 10.1016/j.jlr.2021.100081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Mullaney B.C.C., Blind R.D.D., Lemieux G.A.A., Perez C.L.L., Elle I.C.C., Faergeman N.J.J., et al. Regulation of C. elegans fat uptake and storage by Acyl-CoA synthase-3 is dependent on NR5A family nuclear hormone receptor Nhr-25. Cell Metab. 2010;12(4):398–410. doi: 10.1016/j.cmet.2010.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Blind R.D.D., Pineda-Torra I., Xu Y., Xu H.E.E., Garabedian M.J.J. Ligand structural motifs can decouple glucocorticoid receptor transcriptional activation from target promoter occupancy. Biochem Biophys Res Commun. 2012;420(4):839–844. doi: 10.1016/j.bbrc.2012.03.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sablin E.P., Blind R.D., Uthayaruban R., Chiu H.-J., Deacon A.M., Das D., et al. Structure of liver receptor homolog-1 (NR5A2) with PIP 3 hormone bound in the ligand binding pocket. J Struct Biol. 2015;192(3):342–348. doi: 10.1016/j.jsb.2015.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Musille P.M., Pathak M., Lauer J.L., Griffin P.R., Ortlund E.A. Divergent sequence tunes ligand sensitivity in phospholipid-regulated hormone receptors. J Biol Chem. 2013;288(28):20702–20712. doi: 10.1074/jbc.M113.472837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ortlund E.A., Lee Y., Solomon I.H., Hager J.M., Safi R., Choi Y., et al. Modulation of human nuclear receptor LRH-1 activity by phospholipids and SHP. Nat Struct Mol Biol. 2005;12(4):357–363. doi: 10.1038/nsmb910. [DOI] [PubMed] [Google Scholar]
- 63).Cornelison J.L., Cato M.L., Johnson A.M., D’Agostino E.H., Melchers D., Patel A.B., et al. Development of a new class of liver receptor homolog-1 (LRH-1) agonists by photoredox conjugate addition. Bioorg Med Chem Lett. 2020;30(16) doi: 10.1016/j.bmcl.2020.127293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Mays S.G., Okafor C.D., Whitby R.J., Goswami D., Stec J., Flynn A.R., et al. Crystal structures of the nuclear receptor, liver receptor homolog 1, bound to synthetic agonists. J Biol Chem. 2016;291(49):25281–25291. doi: 10.1074/jbc.M116.753541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Rouleau N., Turcotte S., Mondou M.H., Roby P., Bosse R. Development of a versatile platform for nuclear receptor screening using AlphaScreen. J Biomol Screen. 2003;8(2):191–197. doi: 10.1177/1087057103252605. [DOI] [PubMed] [Google Scholar]
- 66.Ishigami-Yuasa M., Kagechika H. Chemical screening of nuclear receptor modulators. Int J Mol Sci. 2020;21(15):1–19. doi: 10.3390/ijms21155512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lea W.A., Simeonov A. Fluorescence polarization assays in small molecule screening. Expert Opin Drug Discov. 2011;6(1):17–32. doi: 10.1517/17460441.2011.537322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Whitby R.J., Dixon S., Maloney P.R., Delerive P., Goodwin B.J., Parks D.J., et al. Identification of small molecule agonists of the orphan nuclear receptors liver receptor homolog-1 and steroidogenic factor-1. J Med Chem. 2006;49(23):6652–6655. doi: 10.1021/jm060990k. [DOI] [PubMed] [Google Scholar]
- 69.Musille P.M., Kossmann B.R., Kohn J.A., Ivanov I., Ortlund E.A. Unexpected allosteric network contributes to LRH-1 coregulator selectivity. J Biol Chem. 2015 doi: 10.1074/jbc.M115.662874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Mays S.G., Okafor C.D., Tuntland M.L., Whitby R.J., Dharmarajan V., Stec J., et al. Structure and dynamics of the liver receptor homolog 1-PGC1α complex. Mol Pharm. 2017;92(1):1–11. doi: 10.1124/mol.117.108514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Dallakyan S., Olson A.J. Small-molecule library screening by docking with PyRx. Methods Mol Biol. 2015;1263:243–250. doi: 10.1007/978-1-4939-2269-7_19. [DOI] [PubMed] [Google Scholar]
- 72.Seacrist C.D., Kuenze G., Hoffmann R.M., Moeller B.E., Burke J.E., Meiler J., et al. Integrated structural modeling of full-length LRH-1 reveals inter-domain interactions contribute to receptor structure and function. Structure. 2020;28(7) doi: 10.1016/j.str.2020.04.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Cato M.L., D’Agostino E.H., Spurlin R.M., Flynn A.R., Cornelison J.L., Johnson A.M., et al. Comparison of activity, structure, and dynamics of SF-1 and LRH-1 complexed with small molecule modulators. J Biol Chem. 2023 doi: 10.1016/J.JBC.2023.104921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Cato M.L., Cornelison J.L., Spurlin R.M., Courouble V.V., Patel A.B., Flynn A.R., et al. Differential modulation of nuclear receptor LRH-1 through targeting buried and surface regions of the binding pocket. J Med Chem. 2022;65(9):6888–6902. doi: 10.1021/ACS.JMEDCHEM.2C00235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Mays S.G., Hercules D., Ortlund E.A., Okafor C.D. The nuclear receptor LRH-1 discriminates between ligands using distinct allosteric signaling circuits. Protein Sci. 2023;32(10) doi: 10.1002/PRO.4754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Clementel D., Del Conte A., Monzon A.M., Camagni G.F., Minervini G., Piovesan D., et al. RING 3.0: fast generation of probabilistic residue interaction networks from structural ensembles. Nucleic Acids Res. 2022;50(W1):W651–W656. doi: 10.1093/NAR/GKAC365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Piovesan D., Minervini G., Tosatto S.C.E. The RING 2.0 web server for high quality residue interaction networks. Nucleic Acids Res. 2016;44(W1):W367–W374. doi: 10.1093/NAR/GKW315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Cato M.L., D’Agostino E.H., Spurlin R.M., Flynn A.R., Cornelison J.L., Johnson A.M., et al. Comparison of activity, structure, and dynamics of SF-1 and LRH-1 complexed with small molecule modulators. J Biol Chem. 2023;299(8) doi: 10.1016/J.JBC.2023.104921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Dallakyan S., Olson A.J. Small-molecule library screening by docking with PyRx. Methods Mol Biol. 2015;1263:243–250. doi: 10.1007/978-1-4939-2269-7_19. [DOI] [PubMed] [Google Scholar]
- 80).O’Boyle N.M., Banck M., James C.A., Morley C., Vandermeersch T., Hutchison G.R. Open Babel: an open chemical toolbox. J Chemin- 2011;3(1):1–14. doi: 10.1186/1758-2946-3-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Yuan S., Chan H.C.S., Hu Z. Using PyMOL as a platform for computational drug design. Wiley Inter Rev Comput Mol Sci. 2017;7(2) [Google Scholar]
- 82.Goddard T.D., Huang C.C., Meng E.C., Pettersen E.F., Couch G.S., Morris J.H., et al. UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Sci. 2018;27(1):14–25. doi: 10.1002/PRO.3235. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material
Supplementary material
Data Availability Statement
All results associated with this manuscript are provided as supplemental spreadsheets, all other files are available upon request.











