Abstract
Skp1 is an adapter that links F-box proteins to cullin-1 in the Skp1/cullin-1/F-box (SCF) protein family of E3 ubiquitin ligases that targets specific proteins for polyubiquitination and subsequent protein degradation. Skp1 from the amoebozoan Dictyostelium forms a stable homodimer in vitro with a Kd of 2.5 µM as determined by sedimentation velocity studies, yet is monomeric in crystal complexes with F-box proteins. To investigate the molecular basis for the difference, we determined the solution NMR structure of a doubly truncated Skp1 homodimer (Skp1ΔΔ). The solution structure of the Skp1ΔΔ dimer reveals a 2-fold symmetry with an interface that buries ~750 Å2 of predominantly hydrophobic surface. The dimer interface overlaps with subsite-1 of the F-box interaction area, explaining why only the Skp1 monomer binds F-box proteins (FBPs). To confirm the model, Rosetta was used to predict amino acid substitutions that might disrupt the dimer interface, and the F97E substitution was chosen to potentially minimize interference with F-box interactions. A nearly full-length version of Skp1 with this substitution (Skp1ΔF97E) behaved as a stable monomer at concentrations up to 500 µM and actively bound a model FBP, mammalian Fbs1, which suggests that the dimeric state is not required for Skp1 to carry out a basic biochemical function. Finally, Skp1ΔF97E is expected to serve as a monomer model for high-resolution NMR studies previously hindered by dimerization.
INTRODUCTION
The Skp1/cullin-1/F-box (SCF) protein family of E3 ubiquitin ligases is an important mediator of protein turnover in yeast/fungi, higher plants, and animals, owing to the role of polyubiquitination in serving as a signal for recognition and degradation in the 26S-proteasome. Evidence supports the importance of the SCF complex in the protist kingdom as well,1 where a novel posttranslational modification has been discovered in Skp1 orthologs from groups as diverse as amoebozoa (Dictyostelium discoideum), apicomplexans (Toxoplasma gondii), and oomycetes (Pythium ultimum).2 Protist Skp1 is subject, in the presence of sufficient O2 and α-ketoglutarate, to hydroxylation of a Pro-residue that lies on the backside of subsite-2 of the F-box binding domain of Skp1. Once Skp1 is hydroxylated, the hydroxyproline (Hyp) residue is recognized and glycosylated by a series of glycosyltransferases, resulting in the assembly of a canonical pentasaccharide. Mutational studies show that both hydroxylation and full glycosylation are required for optimal O2-sensing in Dictyostelium and Toxoplasma.1,3 Biophysical and computational studies have generated a model by which the relatively organized structure of the pentasaccharide organizes the surrounding intrinsically disordered region of Dictyostelium Skp1 in such a way as to be more conducive to binding the F-box domain of FBPs,4 and recent studies indicate that this model is also relevant to Skp1 from Toxoplasma.3 Confirmatory biophysical studies using NMR are hampered by the dimeric state of glycosylated Skp1 (GGFGGn-Skp1) because of its relatively large size, 324 amino acids.
Recombinant guinea pig Skp1, whose sequence is identical across mammals, was previously reported to dimerize with a Kd of 1.1 µM.5 This value is significantly below the estimated concentration of Skp1 (a.k.a. OCP2) in the inner ear tissues (2 mM) where Skp1/OCP-2 was initially characterized, suggesting that dimerization might influence Skp1 activity in cells. Studies of Dictyostelium Skp1, where small angle X-ray scattering, gel filtration, and NMR studies confirmed its dimeric status at higher concentrations, indicate that glycosylation modestly inhibits dimerization.6 We sought to investigate the significance of Skp1 dimerization by mapping its dimer interface. The structure of Skp1 from mammals, yeast and higher plants is known when it is bound to F-box proteins.7,8 In contrast to the extensive sequence variations of F-box domains, the sequence and structure of Skp1 in these complexes is highly conserved. However, free Skp1 has defied structural characterization, potentially because of intrinsically disordered regions (including its C-terminal region that contributes to F-box domain recognition) that interfere with the formation of crystals for X-ray crystallography. At the same time, the Skp1 homodimer is too large for solution NMR studies without resorting to 2H-isotope labeling. We have found that Dictyostelium Skp1A remains a dimer in the absence of both an internal disordered region that was originally removed to allow crystallization with F-box proteins,7 and the predominantly disordered region that comprises the C-terminal subsite-2 of the F-box binding region. The doubly truncated Skp1ΔΔ dimer variant (2 × 118 aa) was sufficiently small to pursue high-resolution solution NMR structure determination using uniform 15N- and 13C-isotope labeling. We found that the homodimer interface overlaps with the F-box binding interface and confirmed the finding by Rosetta-guided mutagenesis, and discuss the implications for Skp1 function and future studies on its posttranslational regulation.
MATERIALS AND METHODS
Expression Plasmids.
The E. coli expression plasmid pET19b-Skp1AΔΔ was derived from pET19b-Skp1A9 by site-directed mutagenesis, in which primers designed to bridge the deleted sequence as described in Figure S1C were used in a PCR reaction with Q5® High-Fidelity DNA Polymerase (New England Biolabs) to amplify the modified vector. After treatment with DpnI to destroy the original vector, the sample was cloned into E. coli strain BL21-Gold(DE3).
For improved recovery and purification of Skp1, the Dictyostelium Skp1A coding sequence was codon optimized for expression in E. coli, and appended with an N-terminal His6-tag which, when excised by treatment with TEV protease, yielded the native sequence with an N-terminus of SMSL-, compared to the N-terminal SL- that occurs natively after removal of the start Met.10 The cDNA (Figure S1A) was synthesized and provided in pUC57 by GenScript, excised using NcoI and BamHI, and ligated into the NcoI and BamHI restriction sites of pET19b, yielding pET19b-His6DdSkp1A-optim. A second cDNA in which 12 internal amino acids (SPQGDDKKDEKR) were replaced with GGSG (Figure S1B) was synthesized and similarly ligated into pET19B yielding pET19b-His6DdSkp1AΔLoop-optim. This plasmid was modified to generate an F97E point mutation by site-directed mutagenesis, in which the indicated primers were used in a PCR reaction to amplify the modified vector as above.
Expression and Purification of Skp1 constructs.
Skp1 and Skp1ΔΔ, which each lacked an affinity tag to ensure native-like behavior, were purified from E. coli to near homogeneity under non-denaturing conditions (DEAE, phenyl, Q and S200 Superdex columns) as described previously.6 Sample purity and integrity was assessed by SDS-PAGE and Coomassie blue staining to be >90%.
E. coli cells expressing His6Skp1Δ or His6Skp1ΔF97E were incubated at 37 ºC in 2 × 1 L of Terrific Broth medium in the presence of 100 µg/ml ampicillin. At an OD600 of 0.6, protein expression was induced by addition of 125 µM isopropyl 1-thio-β-D-galactopyranoside (IPTG) at 20 ºC. After 12-16 h, bacteria were collected by centrifugation at 5000 × g for 10 min and resuspended in 50 mM Na+/K+ phosphate (pH 7.8), 300 mM NaCl, 5 µg/ml aprotinin, 5 µg/ml leupeptin at 4 ºC. Cells were lysed using a probe sonicator (model 500, Thermo Fisher Scientific) for a total sonication time of 5 min. The lysate was centrifuged at 25,000 × g for 45 min at 4 ºC, and the supernatant was immediately applied to a 1.5-ml column of Co2+ Talon resin (Clontech) pre-equilibrated at 4 ºC in the buffer described above. The column was washed successively with the same buffer supplemented with either 1 M NaCl, 10% glycerol, or 5 mM imidazole. Protein was eluted with buffer containing 300 mM imidazole, and dialyzed against 50 mM Tris-HCl (pH 8.0), 300 mM NaCl, 1 mM EDTA, 2 mM β-mercaptoethanol. The sample was incubated overnight at 22 °C with His6TEV protease to cleave the His6-tag from Skp1, and the sample was re-applied to the Talon resin. The flow-through was concentrated to 1.5 ml using a spin concentrator (Amicon) with a 3 kDa molecular weight cut-off. The concentrated sample was further purified over a Superdex 200 Hi-load 16/60 gel filtration column (GE Healthcare) pre-equilibrated with 20 mM potassium phosphate (pH 7.4), 50 mM KCl, and 0.5 mM TCEP. The sample was estimated to be >95% pure by SDS-PAGE and staining with Coomassie blue.
Stable isotope labeled Skp1ΔΔ and His6Skp1ΔF97E were prepared by expression in E. coli in the presence of isotope enriched minimal media as previously described.9 His6Skp1ΔF97E was uniformly enriched with 15N, and Skp1ΔΔ was labeled with 15N and 13C. The final Skp1ΔΔ NMR sample (105 µl in a 3-mm Shigemi tube) contained a 1:1 mixture of 15N,13C-Skp1ΔΔ and natural abundance Skp1ΔΔ at ~1.0 mM concentration in 20 mM MES-NaOH (pH 6.0), 50 mM NaCl, 5 mM dithiothreitol, 0.05% NaN3. 15N-labeled Skp1ΔF97E NMR samples (300 µl in 5-mm Shigemi tubes) were prepared at concentrations of 100 µM and 500 µM in the same buffer. All NMR samples contained 10% D2O for spectrometer lock.
Analytical Ultracentrifugation.
Protein was quantified based on molar absorptivity calculated from the protein sequence using ProtParam.11 Samples were loaded into 12-mm double-sector Epon centerpieces equipped with quartz windows and equilibrated for 2 h at 20 °C in an An60 Ti rotor. Sedimentation velocity data were collected using an Optima XLA analytical ultracentrifuge (Beckman Coulter) at 50,000 rpm at 20 °C. Data were recorded with absorbance optics at 280 nm, 230 nm or 215 nm in radial step sizes of 0.003 cm. SEDNTERP12 was used to model the partial specific volume as well as the density and viscosity of the buffer. SEDFIT13 was used to analyze sedimentation data. All data were modeled as continuous c(s) distributions and were fit using baseline, meniscus, frictional coefficient, and systematic time-invariant and radial-invariant noise. Predicted sedimentation coefficient (s) values for Skp1 monomer and dimer states were calculated using HYDROPRO14 with a homology model generated on the ROBETTA server.15 Data fit and c(s) plots were generated using GUSSI.16 Weight-averaged S values (Sw) at each concentration were determined by integrating c(s) distributions. Constructed Sw isotherms were fitted with a A+A⇌AA self-association model using SEDPHAT to calculate the dissociation constant.17
NMR Spectroscopy and Structure Determination of Skp1ΔΔ.
NMR spectra for Skp1ΔΔ were acquired at 35 °C using a Bruker AVANCE NEO 800 MHz spectrometer equipped with a 5-mm cryogenic TCI 1H{13C,15N} probe, and an Agilent VNMRS 600 MHz spectrometer equipped with a 3-mm cryogenic 1H{13C,15N} probe. NMR spectra for 100 µM and 500 µM Skp1ΔF97E samples were acquired using a Bruker AVANCE NEO 900 MHz spectrometer equipped with a 5-mm cryogenic TXO 13C,15N{1H} probe, and the 600 MHz spectrometer equipped with a 5-mm cryogenic 1H{13C,15N} probe. The acquired NMR spectra are summarized in Table S1. NOE mixing times were 70 ms for 13C/15N-edited [1H,1H]-NOESY and 120 ms for 13C/15N-filtered 13C/15N-edited [1H,1H]-NOESY experiments. Fourier transform was performed with TopSpin (Bruker BioSpin) for Bruker NMR data, and NMRPipe18 for Varian NMR data. 1H chemical shifts were referenced relative to 4,4-dimethyl-4-silapentane-1-sulfonic acid (DSS), and 13C and 15N chemical shifts were referenced indirectly via gyromagnetic ratios. 2D and 3D NMR spectra were analyzed using CARA.19
Relaxation delays were 0.1, 0.2, 0.3, 0.4, 0.7, 1.0, 1.5, and 2.0 seconds in 1D proton-detected 15N T1 experiments, and 10, 30, 50, 70, 90, 110, 130, 150 and 170 ms in 1D proton-detected 15N T2 experiments acquired for the 500 µM Skp1ΔF97E sample. 1D 15N T1/T2 relaxation spectra were processed and analyzed with VnmrJ v4.2 (Agilent Inc). Macro “tc” (wiki.nesg.org) was used to integrate the regions between 1H chemical shifts of 8.8 and 9.9 ppm, determine average 15N T1 and T2 relaxation times via exponential fitting, and calculate global rotational correlation time τc.
Sequence-specific backbone and side-chain resonance assignments for Skp1ΔΔ were derived using CARA based on existing resonance assignments of full-length Skp1.9 Structure calculation of the Skp1ΔΔ homodimer was performed using CYANA20 based on 1H-1H upper distance constraints derived from 13C/15N-edited [1H,1H]-NOESY, as well as backbone φ and ψ and side-chain χ1 dihedral angle restraints from TALOS-N.21 Automated NOESY peak assignment was performed initially with CYANA, with 22 manually assigned intermolecular 5 Å 1H-1H upper distance constraints (Table S2) applied after cycle 1 of simulated annealing. These intermolecular 1H-1H upper distance constraints were derived from selected strong peaks in a 13C/15N-filtered 13C/15N-edited [1H,1H]-NOESY spectrum. Only those peaks that could be unambiguously assigned and could not be explained by intramolecular contacts were chosen. After several rounds of iterative refinement of NOE peak assignments and calibration of distance constraints, the final structure calculation was performed with CYANA. Stereospecific resonance assignment of methylene proton spins and methyl groups of Leu and Val residues were obtained with the GLOMSA module of CYANA. Out of 100 calculated conformers, 20 conformers with the lowest target function values were selected for subsequent refinement in explicit water bath using the program CNS22 with upper distance constraints relaxed by 5%. The structure statistics are summarized in Table 1.
Table 1.
Completeness of resonance assignmentsa [%] | |
Backbone/Side-chain | 100.0/100.0 |
Conformation-restricting distance constraintsb | |
Intra-residue [i = j] | 696 |
Sequential [|i - j| = 1] | 1256 |
Medium range [1 < |i - j| < 5] | 1360 |
Long range [| i - j | ≥ 5] | 1470 |
Total | 4782 |
Intermolecular NOE constraints (included in above) | 182 |
Dihedral angle constraints | 406 |
NOE constraints per restrained residue (of those, long range) | 22.6 (6.4) |
CYANA target function [Å2] | 2.33 |
Average number of distance constraint violations per conformer | |
0.1 - 0.2 Å | 9.25 |
0.2 – 0.5 Å | 2.2 |
>0.5 Å | 0 |
Average number of dihedral angle constraint violations per conformer | |
>10° | 0.1 |
Average RMSD from mean coordinates [Å] | |
backbone atomsc (all) | 0.8 (1.2) |
heavy atomsc (all) | 1.1 (1.4) |
Global quality scoresc (raw / Z-score) | |
PROCHECK2 G-factor (phi-psi) | 0.16/0.94 |
PROCHECK2 G-factor (all) | 0.14/0.83 |
Molprobity3 clash score | 2.55/1.09 |
ProsaII4 | 0.74/0.37 |
Molprobity3 Ramachandran summary [%] | |
Most favored regions | 98.7 |
Additionally allowed regions | 1.3 |
Disallowed regions | 0.0 |
Commonly observed protein NMR resonances. Excludes amino group of N-terminal Ser, side-chain amino groups of Lys, side-chain guanidinium groups of Arg, carboxyl groups of Asp and Glu, thiol and hydroxyl 1H of Cys, Ser, Thr and Tyr, and non-protonated aromatic 13C.
Calculated with Protein Structure Validation Software (PSVS 1.5; http://psvs.nesg.org/)
Ordered residue ranges: 3-34, 38-44, 46-64, 74-115
Ref. 33;
Ref. 34;
Ref. 35;
Ref. 36
Predicted rotational correlation time τc was calculated for Skp1ΔΔ and Skp1ΔF97E using HYDRONMR,23 assuming water viscosity as 0.0072 cP at 35 °C. The lowest energy conformer of Skp1ΔΔ NMR ensemble was used, and only the first chain was used to calculate τc of a hypothetical Skp1ΔΔ. Homology model of Skp1ΔF97E monomer for τc calculation was generated with SWISS-MODEL based on X-ray structure of human Skp1 (PDB ID 3L2O) as a template. A hypothetical dimer model of Skp1ΔF97E was then produced by structural alignment of individual subunits to those of Skp1ΔΔ dimer in Chimera (http://www.rbvi.ucsf.edu/chimera),24 followed by adjustment of backbone dihedral angles to eliminate clashes in the C-terminal region and energy minimization.
Analysis of the Dimer Interface.
Computational investigation of the dimer interface was performed using the conformer with the lowest CYANA target function of the initial NMR structures of Skp1ΔΔ. To prevent inaccurate predictions due to small clashes in the structure, the protein was prepared using the standard Rosetta optimization protocol, “FastRelax”.25,26 Briefly, five cycles of rotamer packing and minimization were carried out, ramping up the repulsive weight in the scoring function within each cycle. After three rounds of symmetrical FastRelax with atom-atom pair distance constraints, the quality of the generated models was validated with the Molprobity web service.27 The lowest scoring structure based on the Rosetta energy score and the Molprobity score was selected for mutational analysis.
To identify mutations that could disrupt the Skp1 dimerization, an all amino acids-scanning mutagenesis was carried out across the homodimer interface in silico. The “flex ddG” protocol implemented in RosettaScripts28,29 was used to model and predict the effect of the mutations on the binding free energy of the complex. All parameters in the protocol were set up according to the default values described previously.28 Overall, the flex ddG method takes advantage of the Rosetta backrub approach30 to sample side-chain and backbone conformational changes around the mutated position. Once the backrub ensembles are generated, the structures are optimized by sidechain repacking and torsion minimization. The interface ΔΔG score corresponds to the average difference in binding free energy between the mutant structure and the wild-type complex. Stabilizing mutations are defined as those with interface ΔΔG scores < −1.0 Rosetta energy units (REU), while destabilizing mutations are assigned to interface ΔΔG scores > 1.0.28
To determine which dimer-destabilizing substitutions do not perturb the stability of the individual monomers, the change in the total free energy of the monomer was estimated for all mutations. This analysis was performed with the current state-of-the-art Rosetta ΔΔG protocol, “cartesian_ddg”.31 As a preparation step, the wild-type monomer was relaxed in Cartesian space, using backbone and sidechain coordinates constraints. Since the input structure was previously refined with FastRelax, the restrained Cartesian-refinement was intended to adjust the (x,y,z) position of each atom without large deviations from the optimized structure. In this regard, coordinate constraints corresponding to harmonic restraints (Supplementary information- Scripts) were used to penalize large movements32. The model with the lowest Rosetta score was then used as input for the cartesian_ddg protocol. Within the cartesian_ddg application, the protein was relaxed again in the Cartesian space, allowing movement of only the backbone and sidechains around the mutated position.31 All parameters in the cartesian_ddg protocol were configured as previously described.31 The total ΔΔG score was finally considered as the difference in the total ΔΔG between the mutant and the wild-type monomer, multiplied by an energy scaling factor of 1.0/2.94. As above, stabilizing mutations correspond to total ΔΔG scores < -1.0, and destabilizing mutations refer to total ΔΔG scores > 1.0.
All Rosetta commands for this report were run with the same Rosetta static executable (RosettaCommons/main.git2019-03-07, version 4ab48a76160c888257155619edb9817845bd8a67). The protocols previously described can be found at Supplementary information- Scripts.
Analytical Gel Filtration.
Skp1Δ and Skp1ΔF97E with or without Fbs1 at a limiting concentration were subjected to Superdex 200 PC 3.2/30 gel filtration analysis using a Pharmacia SMARTSystem HPLC as previously described.6
RESULTS
Characterization of the Skp1 Dimer.
Sedimentation velocity experiments were conducted on Dictyostelium Skp1A (Skp1) that was recombinantly expressed without a peptide tag in E. coli. Over a concentration range of 0.5 – 45 µM , the samples yielded peaks at 1.8 S and 2.7 S (Figure 1A), values which are slightly less than the predicted S-values for monomer and dimer forms, 1.9 S and 2.8 S. The homology model used for predicting S-values assumed that the C-terminal region of Skp1 is organized as α-helices as occurs in complexes with F-box proteins. However, the C-terminal region of free Skp1 is predominantly disordered based on previous NMR studies,9 which is expected to cause Skp1 to sediment more slowly and would explain the slight discrepancy between the observed and predicted S-values. The separate peaks in this non-equilibrium analysis indicate that interconversion between monomer and dimer states is slow relative to the time scale of sedimentation. A Skp1 dimer binding isotherm constructed using weighted S-values from across the concentration range yielded a dissociation constant for the Skp1 dimer of 2.5 µM (Figure 1B).
Skp1 has resisted crystallization and the large size of the full-length homodimer (324 amino acids) inhibited structure determination by NMR. To initiate mapping of the dimer interface, we examined a truncated Skp1 variant, Skp1ΔΔ, which lacks the mainly disordered C-terminal F-box binding domain and an internal disordered loop that is frequently removed for Skp1/FBP crystallization (Figure 2A). We demonstrated that Skp1ΔΔ (118 × 2 amino acids) still forms a stable homodimer based on sedimentation velocity studies, which yielded an S-value of 1.9, in agreement with the 1.9 S-value predicted by HYDROPRO (Figure S2). Also, a 2D [15N,1H] HSQC spectrum of Skp1ΔΔ correlated well with 2D [15N,1H] TROSY of full-length Skp1 (data not shown), indicating that truncations in Skp1ΔΔ do not perturb the overall structure.
Solution NMR Structure of Skp1ΔΔ.
Using a suite of standard NMR experiments (Table S1), we obtained complete sequence-specific assignments of backbone and side-chain 1H, 15N and 13C resonances of Skp1ΔΔ (Table 1; Figure S3). These resonance assignments allowed us to pursue structure calculations based on 1H-1H distance constraints derived from 3D 15N/13C-edited [1H,1H] NOESY spectrum. To ensure proper modeling of the subunit interaction, we applied weak intermolecular distance 1H-1H constraints (Table S2) derived from NOE peaks identified in a separate 13C/15N-filtered and 13C-edited [1H,1H] NOESY spectrum recorded with a sample of mixed U-15N,13C-labeled and natural abundance Skp1ΔΔ. Representative fragments of NOESY spectra are shown in Figure S4. Full attenuation of intra-chain NOE cross-peaks was not achieved in the 13C/15N-filtered spectrum (right strip in Figure S4) because of incomplete isotope incorporation (~85%). However, comparison with the corresponding strip from the NOESY spectrum without isotope filtering (strip on the left) allowed us to distinguish between peaks of comparable intensity (inter-chain) and peaks with significantly reduced intensity in the filtered set (intra-chain).
We obtained a high-quality solution NMR structure of Skp1ΔΔ (Table 1; Figure 2). A ribbon diagram of the best scoring structure of the top 20 conformers (Figure 2B) is shown in Figure 2C. The structure features a semi-parallel orientation of subunits with respect to their N- and C-termini, in contrast to the previously hypothesized dimerization model from SAXS analysis.6 The dimer interface is organized as a four-helix bundle, with symmetrical packing contributions from residues of helices 5 & 6 of each chain (Figure S5) as identified using PISA.37 The interface buries ~743 Å2 of predominantly hydrophobic surface whose participating amino acids are labeled in Figure 2D. Superimposition of the corresponding Cα atoms with human Skp1 from a crystal structure in complex with the human FBP βTRCP revealed an RMSD of 1.1 Å (Figure S6), indicating that the structure of amino acids 1-125 of free Skp1 changes little when complexed as a monomer with FBPs. Furthermore, the homodimer interface involves the previously described subsite-1 of the binding site for F-box domains.7 The physical overlap (compare Figures 2E, F) was quantified by modeling the F-box domain of human FBXW7 from a crystal structure with human Skp1, with Skp1ΔΔ. This revealed a ~650 Å2 overlap of the homodimer and heterodimer interfaces, in the region of helices 5 and 6, and explains why only the Skp1 monomer is found in complexes with FBPs.
Conservation of the Dimer Interface.
The two α-helices contributing to the homodimer interface, extending from L96 to I123, are highly conserved throughout phylogeny. Furthermore, each dimer contact residue except K117 is almost perfectly conserved from stramenopiles to humans (Figure S7), and the dimer interface region is immediately surrounded by Gly or Pro residues and length variations (indels), indicating that this region represents a functional unit under selective pressure to remain intact. This is consistent with the finding that human Skp1 also dimerizes in this concentration range.5
Computationally-Guided Selection of a Skp1 Monomer Mutant.
To test the dimer structure model, we searched for point mutations of interface amino acids that might destabilize the dimer. An initial alanine-scanning mutagenesis calculation suggested Phe97 Leu101, and Ile123 as potential destabilizing positions based on a weakening of the binding free energy of the complex (interface ΔΔG score > 2) (Figures 3A, 3B). All 20 amino acids were then substituted at each position to predict the effect of different side chains. Indeed, most substitutions at these 3 positions were destabilizing, and 85% of the variations at position 97 yielded interface ΔΔG scores greater than 1.5. The mutations F97G, F97D and F97E showed the highest dimer-destabilizing effect, with interface ΔΔG scores of 5.41, 5.23 and 4.87, respectively (Figure 3C).
To determine whether the dimer-destabilizing mutations might affect monomer stability, the total monomer ΔΔG was estimated for each substitution. Remarkably, most of the mutations at positions 97, 101, and 123 yielded total ΔΔG scores between 1 and -1 (Figure S8). In particular, the highly dimer destabilizing F97E substitution is predicted to have a neutral effect on monomer stability with its ΔΔG score at 0.15.
Based on the above analysis, the conserved Phe97 was changed to Glu in the nearly full length Skp1Δ isoform. Glu was chosen over Asp to allow extension of the carboxyl group to provide better accessibility to solvent rather than clashing when binding an FBP. As predicted by the dimer interface model, Skp1ΔF97E eluted as a monomer based on gel filtration and was predominantly a monomer by AUC at 100 µM (Figure 4A). A 2D [1H,15N]-HSQC spectrum recorded for a 15N-labeled Skp1ΔF97E sample exhibited peak dispersion consistent with a well-folded protein (Figure 4B). The peak pattern was comparable to that of wild-type Skp1 considering that there are a number of differences including an N-terminal SM-extension, replacement of 12 amino acids with 4 different amino acids in the internal loop, and truncation after amino acid 125 (not shown). Based on average 15N T1 and T2 relaxation times measured for all 1H amide resonances between 8.8 and 9.9 ppm in a 500 µM sample of Skp1ΔF97E, the effective rotational correlation time (τc) was calculated at 9.4 ± 0.5 ns. This is close to the 10.9 ns value predicted for the monomer (Table 2), and contrasts with the previously reported 19.9 ± 2.2 ns value of the full-length protein.4 Thus, Skp1ΔF97E remains predominantly monomeric even at high concentrations typical for solution NMR.
Table 2.
Skp1ΔF97E is Binding Competent with a Model FBP.
Phe97 is conserved as a Phe or Tyr in known Skp1s (Figure S7). Analysis of Skp1 in crystal structures of complexes with 3 different FBPs (Tir1, βTRCP1, Fbs1) shows that, compared to the homodimer, Phe (or Tyr) resides in a different rotamer state with solvent exposure (not shown). This suggests that the F97E replacement will, though removing a favorable hydrophobic contact, not directly disrupt the FBP interaction. To test the functionality of Skp1ΔF97E, we used a Superdex200 gel filtration column to examine the elution profile of Skp1 in the absence and presence of a heterologous FBP, guinea pig Fbs1. As previously described6 and replicated in Figure 4C, a mixture of Fbs1 and Skp1 eluted prior to the elution positions of either protein alone. As shown in Figure 4D, Skp1ΔF97E exhibited similar behavior. In addition, Skp1ΔF97E clearly eluted later than native Skp1, consistent with its monomeric character as described by AUC and its NMR-derived rotational correlation time. Thus Skp1(F97E) retained its FBP binding function, though a possible reduction in affinity is not excluded by this analysis.
DISCUSSION
Our findings confirm that dimerization is a highly conserved property of Skp1, based on similar dissociation constant values from organisms as phylogenetically distant as Dictyostelium and humans. Their measured Kd values range from 1.1 to 2.5 µM under in vitro conditions, though actual affinities in the cell may vary. These values are similar to the predicted total monomer Skp1 concentration in a mammalian cell line, ~2 µM.38 The significance of dimerization is suggested by the exceptionally high degree of conservation of the contact residues (labeled D in Figure S7), and its ability to mask the hydrophobic character of this solvent exposed surface.
The Skp1 dimer interface occupies ~740 Å2 of predominantly hydrophobic surface, a substantial area that overlaps subsite-1 of its FBP binding site (Figure 2E). In human Skp1, there is a lack of NMR resonance assignments in the corresponding interface region (residues 99-130 in human vs. residues 93-124 in Dictyostelium Skp1, Figure S7)39, potentially due to effects of unanticipated dimerization of human Skp1. The homodimerization model was supported by the predicted effect of an amino acid substitution within the interface, F97E, to destabilize the interaction (Figure 4A) without unfolding the protein (Figure 4B). Indeed, monomeric Skp1 maintained its ability to bind a model FBP, Fbs1 (Figure 4C, D), and was also competent to be enzymatically hydroxylated and fully glycosylated in vitro (unpublished data). A further contribution to the dimer interface from beyond the truncation site at residue 125 seems unlikely, because previous studies indicated substantial disorder for this region in the free dimer.9
Based on the new structure, the dimer interface also contributes to subsite-1 of the F-box binding region of Skp1, which explains why Skp1 is a monomer in complexes with FBPs.8 Interestingly, 8 of the 13 alleles of Skp1 known to affect its function in budding and fission yeast have point mutations located on this region,40–44 raising the question of whether these mutations affect dimerization, FBP binding, or both. Since the Skp1/FBP interaction is very strong (Kd for binding to the FBP Fbs1/OCP1 is ca. two orders of magnitude smaller than the homodimerization Kd)45, and quantitative mass spectrometry indicates that Skp1 is not in great excess over FBPs in cells,38 the average concentration of Skp1 does not appear to be high enough to generate a substantial homodimer pool. However, if higher local concentrations occur in the cell, homodimerization might occur to protect Skp1 from interacting promiscuously with other macromolecules. Our ability to selectively perturb dimerization relative to Fbs1 binding using the F97E mutation might allow an investigation of this question in vivo; however, we cannot exclude the possibility that interaction with FBPs is quantitatively affected.
Chemical shift index analysis of assigned residues of free human39 and Dictyostelium9 Skp1s indicated that the overall secondary structure elements of the dimers were similar to one another and to human Skp1 in complexes with FBPs, except for the C-terminal F-box subsite-2 region (residues 126-162) which was predominantly disordered in free Skp1’s. The current study of residues 1-125 extends to show that free Skp1 (dimer) assumes essentially the same structure as for human Skp1 bound to the FBP βTRCP, with an RMSD for the corresponding Cα atoms of 1.1 Å (Figure S6). Thus interactions with proteins including Cul18 and Sgt1,46 whose crystallographically defined binding interfaces lie within this region but N-terminal to the dimer interface (ca. residues 1-90), are likely to be unaffected by the dimer status of Skp1.
Owing to the semi-parallel arrangement of the monomers, the two C-termini of the Skp1ΔΔ dimer are close enough to one another that the missing C-terminal regions have the potential to influence one another in the native protein. The availability of a stable monomeric form of Skp1 will now allow a direct NMR analysis of full length Skp1 and the consequences of its glycosylation, which has been postulated to influence the organization of F-box binding subsite-24,47 – the C-terminal region that was truncated to enable the dimer structure reported here.
CONCLUSIONS
Protein homo-dimerization is a common phenomenon with highly varied roles,48 and was originally described for Skp1 over two decades ago.5 Understanding the significance of Skp1 dimerization has been impeded by its recalcitrance to crystallization, likely owing to its intrinsically disordered regions, and the limitations of NMR to study larger protein structures. Here we assigned a Kd of 2.5 µM for native Skp1 from Dictyostelium, a protist that regulates its Skp1 via O2-dependent glycosylation. By preparing a truncated version that remains dimeric, we solved its solution structure using NMR of isotopically labeled samples. The structure revealed that the dimer interface is contributed by a pair of α-helices from each subunit and buries 740 Å2 of predominantly hydrophobic surface, with the monomer units assuming a semi-parallel orientation around a 2-fold rotational axis of symmetry. The portion of Skp1 remaining after truncation adopts a structure equivalent to that to the monomer unit found in crystal structures of complexes with F-box proteins. The dimer interface substantially overlaps with subsite-1 of the binding interface with the F-box domain of F-box proteins, but is not expected to compete with the other main binding function of Skp1 with cullin-1. The interface was verified by finding that a Rosetta-predicted point mutation interferes with homodimerization of full-length Skp1 but not with binding a model F-box protein. The high degree of conservation of the dimerization patch throughout phylogeny suggests a potential role in buffering Skp1 function in cells, and the availability of a monomeric isoform of Skp1 will enable new structural studies on the mechanism of its control by glycosylation.
Supplementary Material
ACKNOWLEDGMENT
The authors thank John Glushka for NMR technical support.
Funding Sources
This research was supported by NIH R01-GM037539, NIH R01-AI140245, NIH R01-GM033225, NIH T32-GM107004, and a collaboration facilitated by the Resource for Integrated Glycotechnology, NIH P41-GM103390. UCSF Chimera, which was utilized for some molecular graphics analyses, was developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311.
ABBREVIATIONS
- AUC
analytical ultracentrifugation
- FBP
F-box protein
- HNOE
heteronuclear NOE
- HSQC
heteronuclear single quantum coherence spectroscopy
- NOESY
nuclear Overhauser enhancement spectroscopy
- SCF
Skp1/cullin-1/F-box protein family of E3 ubiquitin ligases
- Skp1Δ
Skp1Δinternal loop
- Skp1ΔΔ
Skp1Δinternal loop/ΔC-terminus
- TEV
tobacco etch virus
Footnotes
The authors declare no competing financial interests.
ACCESSION CODES
Dictyostelium discoideum Skp1A sequence: UniProtKB/Swiss-Prot: P52285.1
Skp1ΔΔ dimer NMR structure statistics: PDB ID: 6V88, BMRB ID:30696 (embargoed)
REFERENCES
- 1.West CM,; Blader IJ, (2015) Oxygen Sensing by Protozoans: How They Catch Their Breath. Curr. Opin. Microbiol 26, 41–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.van der Wel H,; Gas-Pascual E,; West CM, (2019) Skp1 Isoforms are Differentially Modified by a Dual Function Prolyl 4-Hydroxylase/N-Acetylglucosaminyltransferase in a Plant Pathogen. Glycobiology 29, 705–714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Mandalasi M,; Kim HW,; Thieker D,; Sheikh MO,; Gas-Pascual E,; Rahman K,; Zhao P,; Daniel N,; van der Wel H,; Ichikawa TH,; Glushka JN,; Wells L,; Woods RJ,; Wood ZA,; West CM, (2020) A Glycogenin Homolog Controls Toxoplasma gondii Growth Via Glycosylation of an E3 Ubiquitin Ligase. bioRxiv. https://www.biorxiv.org/content/10.1101/764241v2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sheikh MO,; Thieker D,; Chalmers G,; Schafer CM,; Ishihara M,; Azadi P,; Woods RJ,; Glushka JN,; Bendiak B,; Prestegard JH,; West CM, (2017) O2 Sensing–Associated Glycosylation Exposes the F-Box–Combining Site of the Dictyostelium Skp1 Subunit in E3 Ubiquitin Ligases. J. Biol. Chem 292, 18897–18915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Henzl MT,; Thalmann I,; Thalmann R, (1998) OCP2 Exists as a Dimer in the Organ of Corti. Hear. Res 126, 37–46. [DOI] [PubMed] [Google Scholar]
- 6.Sheikh MO,; Schafer CM,; Powell JT,; Rodgers KK,; Mooers BHM,; West CM, (2014) Glycosylation of Skp1 Affects its Conformation and Promotes Binding to a Model F-box Protein. Biochemistry 53, 1657–1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schulman BA,; Carrano AC,; Jeffrey PD,; Bowen Z,; Kinnucan ERE,; Finnin MS,; Elledge SJ,; Harper JW,; Pagano M,; Pavietich NP, (2000) Insights into SCF Ubiquitin Ligases from the Structure of the Skp1-Skp2 Complex. Nature 408, 381–386. [DOI] [PubMed] [Google Scholar]
- 8.Zimmerman ES,; Schulman BA,; Zheng N, (2010) Structural Assembly of Cullin-RING Ubiquitin Ligase Complexes. Curr. Opin. Struct. Biol 20, 714–721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Xu X,; Eletsky A,; Sheikh MO,; Prestegard JH,; West CM, (2018) Glycosylation Promotes the Random Coil to Helix Transition in a Region of a Protist Skp1 Associated with F-box Binding. Biochemistry 57, 511–515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.West CM,; Kozarov E,; Teng-umnuay P, (1997) The Cytosolic Glycoprotein FP21 of Dictyostelium discoideum is Encoded by Two Genes Resulting in a Polymorphism at a Single Amino Acid Position. Gene 200, 1–10. [DOI] [PubMed] [Google Scholar]
- 11.Gasteiger E,; Hoogland C,; Gattiker A,; Duvaud S,; Wilkins MR,; Appel RD,; Bairoch A, (2005) Protein Identification and Analysis Tools on the Expasy Server In: Walker JM, (ed): The Proteomics Protocols Handbook, Humana Press; pp. 571–607. [Google Scholar]
- 12.Laue TM,; Shah BD,; Ridgeway TM,; Pelletier SL, (1992) Computer-Aided Interpretation of Analytical Sedimentation Data for Proteins In: Harding SE, Rowe AJ, Horton J, editors. Analytical Ultracentrifugation in Biochemistry and Polymer Science. Cambridge: The Royal Society of Chemistry; pp. 90–125. [Google Scholar]
- 13.Schuck P, (2000) Size Distribution Analysis of Macromolecules by Sedimentation Velocity Ultracentrifugation and Lamm Equation Modeling. Biophys. J 78, 1606–1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ortega A,; Amorós D,; Garcia de la Torre J, (2011) Prediction of Hydrodynamic and Other Solution Properties of Rigid Proteins from Atomic- and Residue-Level Models. Biophys. J 101, 892–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kim DE,; Chivian D,; Baker D, (2004) Protein Structure Prediction and Analysis Using the Robetta Server. Nucleic Acid Res. 32(Web Server issue), W526–531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Brautigam CA, (2015) Calculations and Publication-Quality Illustrations for Analytical Ultracentrifugation Data. Meth. Enzymol 562,109–133. [DOI] [PubMed] [Google Scholar]
- 17.Schuck P, (2003) On the Analysis of Protein Self-Association by Sedimentation Velocity Analytical Ultracentrifugation. Anal. Biochem 320, 104–124. [DOI] [PubMed] [Google Scholar]
- 18.Delaglio F,; Grzesiek S,; Vuistere GW,; Zhu G,; Pfeifer J,; Bax A, (1995) NMRPipe: A Multidimensional Spectral Processing System Based on UNIX Pipes. J. Biomol. NMR 6, 277–293. [DOI] [PubMed] [Google Scholar]
- 19.Keller R, (2004) The Computer Aided Resonance Assignment Tutorial, CANTINA Verlag, Goldau.
- 20.Güntert P,; Buchner L, (2015) Combined Automated NOE Assignment and Structure Calculation with CYANA. J. Biomol. NMR 62, 453–471. [DOI] [PubMed] [Google Scholar]
- 21.Shen Y,; Bax A, (2013) Protein Backbone and Sidechain Torsion Angles Predicted from NMR Chemical Shifts Using Artificial Neural Networks. J. Biomol. NMR 56, 227–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Brunger AT,; Adams PD,; Clore GM,; Gros P,; Grosse-Kunstleve RW,; Jiang JS,; Kuszewski J,; Nilges N,; Pannu NS,; Read RJ,; Rice LM,; Simonson T,; Warren GL, (1998) Crystallography & NMR system (CNS), A New Software Suite for Macromolecular Structure Determination. Acta Cryst. 54, 905–921. [DOI] [PubMed] [Google Scholar]
- 23.García de la Torre J,; Huertas ML,.; Carrasco B, (2000) HYDRONMR: Prediction of NMR Relaxation of Globular Proteins from Atomic-Level Structures and Hydrodynamic Calculations. J. Magnet. Resonance 147, 138–146. [DOI] [PubMed] [Google Scholar]
- 24.Pettersen EF,; Goddard TD,; Huang CC,; Couch GS,; Greenblatt DM,; Meng EC,; Ferrin TE, (2004) UCSF Chimera—A Visualization System for Exploratory Research and Analysis. J. Comput. Chem 25, 1605–1612. [DOI] [PubMed] [Google Scholar]
- 25.Tyka MD,; Keedy DA,; Andre I,; Dimaio F,; Song Y,; Richardson DC,; Richardson JS,; Baker D, (2011) Alternate States of Proteins Revealed by Detailed Energy Landscape Mapping. J. Mol. Biol 405, 607–618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Das R,; Baker D, (2008) Macromolecular Modeling with Rosetta. Annu. Rev. Biochem 77, 363–382. [DOI] [PubMed] [Google Scholar]
- 27.Chen VB,; Arendall WB 3rd,; Headd JJ,; Keedy DA,; Immormino RM,; Kapral GJ,; Murray LW,; Richardson JS,; Richardson DC, (2010) MolProbity: All-Atom Structure Validation for Macromolecular Crystallography. Acta Crystallogr. Sec D Biol. Crystallogr 66, 12–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Barlow KA,; Ó Conchuir S,; Thompson S,; Suresh P,; Lucas JE,; Heinonen M,; Kortemme T, (2018) Flex ddG: Rosetta Ensemble-Based Estimation of Changes in Protein-Protein Binding Affinity upon Mutation. J. Phys. Chem. B 122, 5389–5399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fleishman SJ,; Leaver-Fay A,; Corn JE,; Strauch EM,; Khare SD,; Koga N,; Ashworth J,; Murphy P,; Richter F,; Lemmon G,; Meiler J,; Baker D, (2011) RosettaScripts: a Scripting Language Interface to the Rosetta Macromolecular Modeling Suite. PLoS One 6, e20161 (10 pages). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Smith CA,; Kortemme T, (2008) Backrub-Like Backbone Simulation Recapitulates Natural Protein Conformational Variability and Improves Mutant Side-Chain Prediction. J. Mol. Biol 380, 742–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Park H,; Bradley P,; Greisen P,; Liu Y,; Mulligan VK,; Kim DE,; Baker D,; DiMaio F, (2016) Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. J. Chem. Theory Comput 12, 6201–6212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nivon LG,; Moretti R,; Baker D, (2013) A Pareto-Optimal Refinement Method for Protein Design Scaffolds. PLoS One 8, e59004 (5 pages). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bhattacharya A,; Tejero R,; Montelione GT, (2007) Evaluating Protein Structures Determined by Structural Genomics Consortia. Proteins 50, 778–795. [DOI] [PubMed] [Google Scholar]
- 34.Laskowski RA,; Rullmannn JA,; MacArthur MW,; Kaptein R,; Thornton JM, (1996) AQUA and PROCHECK-NMR: Programs for Checking the Quality of Protein Structures Solved by NMR. J. Biomol. NMR 8, 477–486. [DOI] [PubMed] [Google Scholar]
- 35.Lovell SC,; Davis IW,; Arendall WB III,; de Bakker PIW,; Word JM,; Prisant MG,; Richardson JS,; Richardson DC, (2003) Structure Validation by C-alpha Geometry: phi,psi and Cbeta deviation. Proteins 50, 437–450. [DOI] [PubMed] [Google Scholar]
- 36.Sippl MJ, (1993) Recognition of Errors in Three-Dimensional Structures of Proteins. Proteins 17, 355–362. [DOI] [PubMed] [Google Scholar]
- 37.Krissinel E, (2015) Stock-Based Detection of Protein Oligomeric States in jsPISA. Nuc. Ac. Res 43, W314–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Reitsma JM,; Liu,; Reichermeier KM,; Moradian A,; Sweredoski MJ,; Hess S,; Deshaies RJ, (2017) Composition and Regulation of the Cellular Repertoire of SCF Ubiquitin Ligases. Cell 171, 1326–1339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kachariy NN,; Dantu SC,; Kumar A, (2016) Backbone and Side Chain Assignments of Human Cell Cycle Regulatory Protein S-Phase Kinase-Associated Protein 1. Biomol. NMR Assign 10, 351–355. [DOI] [PubMed] [Google Scholar]
- 40.Bai C,; Sen P,; Hofmann K,; Ma L,; Goebl M,; Harper JW,; Elledge SJ, (1996) SKP1 Connects Cell Cycle Regulators to the Ubiquitin Proteolysis Machinery Through a Novel Motif, the F-box. Cell 86, 263–274. [DOI] [PubMed] [Google Scholar]
- 41.Connelly C,; Hieter P, (1996) Budding Yeast SKP1 Encodes an Evolutionarily Conserved Kinetochore Protein Required for Cell Cycle Progression. Cell 86, 275–285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Beltrao P,; Albanèse V,; Kenner LR,; Swaney DL,; Burlingame A,; Villén J,; Lim WA,; Fraser JS,; Frydman J,; Krogan NJ, (2012) Systematic Functional Prioritization of Protein Posttranslational Modifications. Cell 150, 413–425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lehmann A,; Katayama S,; Harrison C,; Dhut S,; Kitamura K,; McDonald N,; Toda T, (2004) Molecular Interactions of Fission Yeast Skp1 and its Role in the DNA Damage Checkpoint. Genes Cells 9, 367–382. [DOI] [PubMed] [Google Scholar]
- 44.Leber V,; Nans A,; Singleton MR, (2018) Structural Basis for Assembly of the CBF3 Kinetochore Complex. EMBO J. 37, 269–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Tan A,; Tanner JJ,; Henzl MT, (2008) Energetics of OCP1-OCP2 Complex Formation. Biophys. Chem 134, 64–71. [DOI] [PubMed] [Google Scholar]
- 46.Willhoft O,; Kerr R,; Patel D,; Zhang W,; Al-Jassar C,; Daviter T,; Millson SH,; Thalassinos K,; Vaughan CK, (2017) The Crystal Structure of the Sgt1-Skp1 Complex: the Link between Hsp90 and Both SCF E3 Ubiquitin Ligases and Kinetochores. Sci. Rep 7, 41626 (13 pages). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.West CM,; Kim HW, (2019) Nucleocytoplasmic O-Glycosylation in Protists. Curr. Opin. Struct. Biol 56, 204–212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Marianayagam NJ,; Sunde M,; Matthews JM, (2004) The Power of Two: Protein Dimerization in Biology. Tr. Biochem. Sci 29, 618–625. [DOI] [PubMed] [Google Scholar]
- 49.West CM,; van der Wel H,; Gaucher EA, (2002) Complex Glycosylation of Skp1 in Dictyostelium: Implications for the Modification of Other Eukaryotic Cytoplasmic and Nuclear Proteins. Glycobiology 12, 17R–27R. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.