Proteomic methods have enabled the discovery of 100,000s of protein phosphorylation sites1, however we lack methods to systematically annotate their function. Phosphorylation has numerous biological functions that biochemically involve changes in protein structure and interactions. These biochemical changes can be detected by measuring the difference in thermal stability between the protein and the phosphoprotein. Building on recent work, we present a proteomic method to infer phosphosite functionality by reliably measuring such differences.
Recently, Huang et al.2 developed the Hotspot Thermal Profiling (HTP) method to identify phosphosites that alter protein thermal stability, reporting 719 out of 2,883 (25%) measured human phosphoisoforms with significant effects. The reported melting temperatures (Tm) for phosphoisoforms and their corresponding proteins showed low correlation (R2 = 0.18) (Fig. 1a), implying that phosphorylation largely functions by structurally reshaping the proteome. However, the poor Tm reproducibility between replicates (mean pairwise R2 = 0.42 for proteins and 0.21 for phosphoisoforms, Fig. 1b) suggests that the large effects observed may be due to technical variation, given that critical steps of the method are conducted separately for phosphopeptides and proteins and for the different temperature sample points (Supplementary Discussion).
Figure 1. Most phosphosites have little effect on protein stability.

a, Scatter plot and Pearson correlation between Tm of phosphopeptide isoforms (n=10) and Tm of the corresponding protein (n=12). Mean Tm values were obtained from Huang et al.. Significant phosphopeptide isoforms in blue. b, Boxplot depicting pairwise Pearson correlations between biological and technical replicates for HTP’s Tm values. The line represents the median, the box designates the interquartile range (IQR), and the whiskers define 1.5*IQR from the box ends. c, Volcano plots showing differences in protein thermal stability (Tm) between phosphopeptide isoforms (n=10, left panel) or unmodified peptides observed in the phosphopeptide enriched sample (n=10, right panel) and their corresponding protein (n=11) (two-sided Student’s t-test, significant hits at p-value <0.05 in blue), from Huang et al. data reanalysis. d, Dali workflow depicting SILAC labeling of yeast, the gradient temperature treatment of the protein extract, the inclusion of a 30°C control, the quantification of soluble protein by mass spectrometry, and the calculation of relative stability (Rs). e, Scatter plot and Pearson correlation as in (a) using Dali’s Rs data. f, Boxplot as in (b) with Pearson correlations between biological replicates for Dali’s Rs values. g, Volcano plots as in (c) with x-axis showing Rs values. Here n=6, using a two-sided Welch’s t-test, p-values were Benjamini-Hochberg corrected and significant hits at q-value < 0.05. Significant hits in blue, significant hits found in proteins with known cleavage or splicing events are in orange.
The HTP workflow consists of phosphopeptide enrichment followed by separate isotopic labeling and mass spectrometric analysis to derive Tm values for phosphoisoforms and proteins, respectively. To assess technical variation between the two samples, we used unmodified peptides identified in the phosphopeptide samples, which are expected to yield the same Tm as the protein (Supplementary Discussion). Disconcertingly, our re-analysis revealed significant stability effects on 1,223 out of 3,074 (40%) of the co-enriched unmodified peptides, a larger percentage than phosphoisoforms (29%, 774 out of 2,656 in our reanalysis), calling into question the reported hits (Fig. 1c, Dataset S1). Additionally, the Tm correlation of these unmodified peptides measured in the phosphopeptide samples with their protein was similarly low (R2 = 0.18) to the Tm correlation between phosphoisoforms and their proteins (Extended Data Fig. 1a). This suggests that the separate enrichment and independent labeling prior to mass spectrometric analysis of peptide and phosphopeptide samples could have introduced substantial technical error precluding the comparison between phosphoisoform and protein, and perhaps that the reported hits arise from a lack of stringency in the applied statistical analysis. Indeed, we show that reimplementing the t-test considering unequal variances for proteins and phosphoisoforms, consolidating technical (mass spectrometry reanalysis) replicates, and applying multiple testing correction dramatically decreases the number of significant hits (Extended Data Fig. 2, Supplementary Discussion).
To minimize technical noise derived from sample preparation, peptide samples should be labeled and mixed prior to phosphopeptide enrichment (Supplementary Discussion and accompanying manuscript3). Because scaling-up isobaric chemical labeling increases reagent costs substantially, we have developed an alternative approach to identify phosphosites that alter protein thermal stability, that we call Dali (Fig. 1d). Dali applies the Proteome Integral Stability Alteration (PISA) method4, a simplified version of thermal proteome profiling5, in which the soluble protein from the different temperature points are combined to provide an estimation of the area under the protein melting curve. To reliably compare phosphoisoforms to proteins, we normalize each measurement to a 30°C treated proteome reference that is labeled with heavy lysine, obtaining a relative stability (Rs) measurement for phosphoisoforms and proteins. This 30°C reference is mixed in with the temperature gradient treated samples prior to protein digestion, and it is present during phosphopeptide enrichment and mass spectrometry (MS) measurement of peptides and phosphopeptides.
We applied Dali to the S. cerevisiae proteome and found that the stability of phosphoisoforms correlates well with the stability of their respective proteins (R2 = 0.82 for mean Rs comparisons) (Fig. 1e), suggesting that most phosphosites do not alter protein stability, as also observed in human cells3. We obtained reproducible Rs measurements for proteins (average R2 = 0.77) and phosphoisoforms (average R2 = 0.66) (Fig. 1f). As expected, the stability of unmodified peptides present in the phosphopeptide-enriched samples also correlated well with their proteins (R2 = 0.92 for mean Rs comparisons), indicating that Rs measurements in the phosphopeptide samples and protein samples can be reliably compared (Extended Data Fig. 1b). Finally, our analysis yielded 68 phosphopeptide isoforms out of 1,978 (3%) with significantly different thermal stability than the unmodified protein (Fig. 1g, Dataset S2). We detected a few unmodified peptides with significant differences in stability, but this set constituted a much smaller fraction than the fraction of significant phosphoisoforms (Dataset S3). Most of these peptides (6 out of 8) were cases where the protein is known to be post-translationally processed via cleavage (e.g. RPS316) or splicing (e.g. VMA17), resulting in proteins and/or proteoforms of different thermal stability as our method reliably measured (Extended Data Fig. 3).
Among phosphosites that decreased protein thermal stability, we identified four sites located at protein interfaces (Ser56 on PUP2, Ser59 on ARO8, Ser79 on TPI1 and Ser201 on GAPDH) (Fig. 2a, Extended Data Fig. 4) that may act by disrupting protein-protein interactions. For example, PUP2 is the alpha 5 subunit of the 20S proteasome, and Ser56 is a known Cdc28 substrate8 located at the protein interaction interface with PRE6, the 20S proteasome alpha 4 subunit (Fig. 2a). The stability measured for the phosphopeptide spanning Ser56 is significantly lower than the stability of PUP2, which is similar to other proteins in the 20S proteasome, suggesting Ser56 phosphorylation may dissociate PUP2 from the 20S proteasome. We identified stabilizing phosphosites that may play a role in the protein translation process. For example, we found that phosphorylation at Ser38 on ribosomal protein RPL12/uL11 significantly increased protein stability (Fig. 2b). This phosphosite is an evolutionarily-conserved Cdc28 substrate8 that is regulated during the cell cycle9 and has been reported to be depleted in polysomes and influence mitotic translation10. Considering RPL12 location at the ribosome P-stalk and the proximity of residue Ser38 to elongation factor 2, we hypothesize that this phosphosite may modulate the interaction with EF-2 to aid ribosomal translocation during protein synthesis, and the change in conformation and binding may stabilize RPL12. We also identified a stabilizing phosphorylation on NEW1 at Thr1191 (delta Rs = 1.23) (Fig. 2c). A T1191A mutant has growth defects11 suggesting that phosphorylation is important for NEW1 function (Supplementary Discussion). Additionally, we identified phosphosites that may modulate the kinetics of key glycolytic enzymes (Thr331 on PGK1 destabilizing, Ser149 on GAPDH stabilizing) (Extended Data Fig. 5, Supplementary Discussion).
Figure 2. Examples of phosphosites that alter protein thermal stability.

a, Boxplot of Rs values and distributions for the phosphopeptide containing PUP2 Ser56, PUP2 protein, and all the proteins in the 20S proteasome. Shown at the right is the structure of the 20S proteasome with PUP2 in blue, Ser56 in red, and PRE6 in grey. b, Rs values for RPL12 and RPL12 S38 phosphopeptide isoform. c, Rs values for NEW1 and phosphopeptide isoform containing NEW1 T1191. Boxplots show results from 6 biological replicates, the line represents the median, the box designates the interquartile range (IQR), and the whiskers define 1.5*IQR from the box ends.
In this communication, we have detailed some technical issues in the HTP method and suggested changes to the experimental workflow and statistics to improve its reproducibility and reliability. We have also outlined a novel proteomic method that enables robust thermal stability comparison between proteins and phosphorylated proteoforms. Our method identified 3% phosphosites in the S. cerevisiae proteome that significantly changed protein melting behavior. Additional experiments will be needed to precisely characterize the function of these phosphosites. One potential limitation of our method is that the sensitivity to detect changes in thermal stability is lower for proteins with extreme melting temperature, which can be circumvented by performing the experiment using different temperature gradients. Our method can be extended to other model organisms and cell culture systems, as well as to other post-translational modifications, expanding the proteomic toolkit to functionally annotate dynamic protein modifications at scale.
ONLINE METHODS
Yeast strains
All yeast experiments were conducted on the Saccharomyces cerevisiae haploid strain BY4741 (MATa his3Δ1 leu2Δ0 met15Δ0 ura3Δ0).
S. cerevisiae growth, stable isotope labeling, and cell harvest
Two overnight yeast cultures were grown at 30°C in synthetic complete media (SCM) containing 6.7g/L yeast nitrogen base, 2g/L of synthetic complete mix minus lysine, 2% glucose, and supplemented with either regular lysine (light culture) or 2H4-lysine (heavy culture) at 0.872 mM final concentration. These cultures were used to seed three 50mL cultures of each light and heavy at OD600 0.15, which were grown at 30°C and 45mL were harvested at OD600 ~ 1 by centrifugation at 7,000 × g for 10min. Yeast pellets were washed by resuspension in 1.5mL ice-cold sterile water and centrifugation in 2mL screw cap tubes at 21,000 × g for 10min, and then snap-frozen in liquid nitrogen and stored at −80°C.
Cell lysis and protein extract temperature treatment
Frozen yeast cell pellets were resuspended in 700μL of non-denaturing lysis buffer (50mM HEPES pH 7.0, 75mM NaCl) containing 0.5X protease inhibitors (Pierce) and phosphatase inhibitors (50mM β-glycerophosphate, 10mM sodium pyrophosphate, 50mM of NaF, 1mM sodium orthovanadate) on ice. Cells were lysed by bead beating with 0.5mm zirconia/silica beads for 4 cycles of 60sec of mechanical agitation followed by 90sec rest on ice. Lysates were clarified by sequential centrifugation, first at 1,200 × g for 1min to remove the beads and then at 21,000 × g for 10min at 4°C to remove cell debris. To bring all protein extracts to the same concentration, extract volumes were adjusted to 1 OD600 unit from a 45mL culture in 1mL.
Each cell extract was aliquoted into 2 strips of 8 PCR tubes each (1×8 for the temperature gradient and 1×8 for the 30°C) dispensing 50μL of protein extract per tube. All samples were initially equilibrated to 30°C for 5 min. Temperature gradient samples were subjected to 45.6°C, 46.8°C, 48.3°C, 50°C, 52°C, 53.6°C, 54.9°C, and 57°C, one tube to each temperature, for 5min. In parallel, controls were subjected to an additional 30°C temperature treatment for 5min. All samples were cooled down to room temperature for 10min. For each replicate, temperature gradient samples were all pooled into one tube and 30°C controls were pooled into a separate tube prior to centrifugation at 21,000 × g for 30min at 4°C. The soluble protein fractions for the temperature gradient and 30°C controls were combined 2:1, three replicates with the temperature gradient labeled heavy and the 30°C controls labeled light, and three additional replicates with the labels swapped. We generated additional controls where heavy and light 30°C controls were combined to assess potential differences in protein expression due to the different labeling. Protein concentration was measured by the BCA assay.
Protein reduction, alkylation, LysC digestion, and desalting
Samples were diluted 2-fold with a buffer containing 8M urea, 50mM HEPES pH 8.9, 75mM NaCl, 1mM sodium orthovanadate, 50mM β-glycerophosphate, 10mM sodium pyrophosphate, 50mM NaF. Protein samples were subjected to reduction with 7.5mM dithiothreitol (DTT) for 30min at 55°C and alkylation with iodoacetamide (22.5mM) for 30min at room temperature in the dark with agitation. The alkylation reaction was quenched with an additional 7.5mM DTT at room temperature for 30min with agitation. The pH was adjusted to 8.5 with 1M Tris pH 8.9. Lysyl endopeptidase (LysC; Wako Chemicals) was added at a 1:100 enzyme to protein ratio and protein samples were incubated overnight with agitation at room temperature. LysC digestion was quenched by addition of trifluoroacetic acid (TFA) to a final concentration of 1% and pH ~2–3 and the digests were stored at −80°C.
Peptide samples were desalted by solid-phase extraction over 50mg Sep-Pak tC18 cartridges (Waters). Packing material was washed with 1mL methanol, 3 × 1mL 100% acetonitrile, 1mL 70% acetonitrile, 0.25% acetic acid, 1mL 40% acetonitrile, 0.5% acetic acid, and equilibrated with 3 × 1mL 0.1% TFA. Peptides were then loaded by gravity twice, washed with 3 × 1mL 0.1% TFA and 1mL 0.5% acetic acid. Peptides were eluted with 600μL of 40% acetonitrile, 0.5% acetic acid and 400μL 70% acetonitrile, 0.25% acetic acid, and aliquoted as follows: 40μg for high-pH reversed-phase fractionation, 200μg for Fe3+-IMAC phosphopeptide enrichment, and 10μg for preliminary LC-MS/MS analysis to assess sample quality. All samples were dried by vacuum centrifugation and stored at −80°C.
High-pH reversed-phase fractionation
Peptides were fractionated by high-pH reversed-phase fractionation on a 200μL pipette tip packed with 4 layers of SDB-XC material (Empore, 3M). The material was washed with 50μL methanol, 50μL 80% acetonitrile, 20mM ammonium formate, and 3 X 50μL 20mM ammonium formate. Peptides (40μg) were solubilized in 40μL of 5% acetonitrile, 20 mM ammonium formate, loaded onto the SDB-XC tip, and the flow-through was collected in a mass spectrometer vial (fraction 1). Peptide fractions 2–5 were obtained by stepwise elution with 40μL of 20mM ammonium formate in 10 %, 15%, 20%, and 80% acetonitrile and collection in mass spectrometry vials. Peptide fractions were dried by vacuum centrifugation, solubilized in 3% acetonitrile, 4% formic acid, and ~1μg of each fraction was analyzed by LC-MS/MS.
Fe3+-NTA IMAC phosphopeptide enrichment
Phosphopeptide enrichment was conducted by immobilized iron cation affinity chromatography in batch mode and automated in a 96-well format on a KingFisher magnetic particle processor as we described12. For each sample, ~200μg peptides were solubilized in 70 μL 0.1% TFA, 80 % acetonitrile and incubated with 80 μL of a 5% slurry of magnetic Fe-NTA beads (Cube Biotech) in the same solvent for 30min. Beads were washed three times with 150μL 0.1% TFA, 80% acetonitrile and phosphopeptides were eluted with 50μL 50% acetonitrile, 0.37M ammonium hydroxide. Eluates were acidified with 30μL 10% formic acid, 75% acetonitrile and filtered over two-layer C18 extraction disks (Empore, 3M) packed in 200μL pipette tip, which had been previously conditioned with 50μL100% methanol, 50μL 100% acetonitrile and 50μL 70% acetonitrile, 0.25% acetic acid. Filtered peptides were collected in a mass spectrometer vial and the peptides in the extraction disk were further eluted with 50μL 70% acetonitrile, 0.25% acetic acid and collected into the same mass spectrometry vial. Phosphopeptide-enriched samples were dried by vacuum centrifugation, solubilized in 3% acetonitrile, 4% formic acid, and one third of each sample was analyzed by LC-MS/MS.
Liquid chromatography coupled to tandem mass spectrometry
Peptide samples were analyzed by nLC-MS/MS on a nanoAcquity UPLC (Waters) coupled to an Orbitrap Fusion Lumos Tribrid mass spectrometer (Thermo Fisher) with Xcalibur (4.1.50) and Thermo Foundation 3.1 SP4 (3.1.190.0). Samples were loaded on a 100μm × 3-cm trap column packed with 3μm C18 beads (Dr. Maisch), separated on a 100μm × 30-cm capillary analytical column, packed with 1.9μm C18 beads (Dr. Maisch) and set at 50°C, using a 90-min reversed-phase gradient of acetonitrile in 0.125% formic acid, and online analyzed by mass spectrometry using data-dependent acquisition. Each cycle consisted of 3 sec where one full MS1 scan was acquired on the orbitrap at 120,000 resolution from 300 to 1575 m/z using an AGC of 7e5 and maximum injection time of 50ms followed by MS/MS dependent scans on most intense precursor m/z ions (only considering z = 2 to 5) until exhausting the 3 sec cycle time, using 1.6 m/z isolation window, HCD fragmentation at 28 normalized collision energy, and acquired at 15,000 resolution on the orbitrap with an AGC of 5e4 (peptide samples) and AGC of 1e5 (phosphopeptide samples) with a maximum injection time of 22ms. Dynamic exclusion was enabled to exclude fragmented precursors from repeated MS/MS selection for 30sec. To increase coverage, phosphopeptide samples were injected twice, and the data from the two technical replicates were combined.
Database searching, peptide quantification, phosphosite localization, and Rs calculation
MS data files for proteome samples were analyzed with MaxQuant13 (v.1.6.7.0) to obtain peptide identifications and quantifications, using the following parameters: protein sequence database S.cerevisiae downloaded from SGD in July 2014, LysC enzyme specificity (cleavage C-terminal to K), maximum of 2 missed cleavages, mass tolerance of 20ppm for MS1 and 20ppm for MS2, fixed modification of carbamidomethyl on cysteines, variable modifications of oxidation on methionines and acetylation on protein N-termini. Lysine residues were only allowed to be all light or all 2H4-Lys within the same peptide. Phosphoproteome samples were processed in MaxQuant similarly as above, with additional variable modification of phosphorylation on serine, threonine, and tyrosine residues. All searches were combined for MaxQuant filtering set to 1% FDR at the level of peptide spectral matches and protein.
Quantification values for heavy and light peptide features were extracted from the evidence.txt file. Quantification values for features corresponding to the same peptide sequence (e.g. same peptide identified at multiple charge states or fractions) were summed up. Phosphopeptide quantification features were aggregated to the phosphopeptide isoform level by summing features corresponding to the same peptide sequence (e.g. same peptide identified at multiple charge states or replicate injections) as well as overlapping peptide sequences sharing the same combination of modifications. For each phosphopeptide isoform we required the maximum localization probability to be greater than 75% for at least one site.
Rs values were calculated as log2 ratios of the quantification value for the temperature gradient treated divided by the respective quantification for the 30°C control. Peptide Rs distributions were median normalized to 0, and the same correction value derived for each replicate was applied to normalize the corresponding phosphopeptide isoform Rs distributions. To filter out poor quality quantifications, peptides and phosphopeptide isoforms with the 5% highest Rs standard deviation across replicates were excluded from the analysis. Protein Rs values were calculated as the median of peptide Rs for that protein, requiring a minimum of two peptides per protein, and each peptide observed in at least two replicates.
To identify phosphopeptide isoforms that have different Rs than their unmodified protein counterpart, we performed a t-test comparing phosphopeptide isoform Rs values (n=6) compared to protein Rs values (n=6) and assuming unequal variance. Phosphopeptide isoforms and protein counterparts were required to be observed in at least three replicates. P-values were corrected for multiple hypothesis testing using the Benjamini-Hochberg method14. All data analysis was conducted using R (v3.6.1) and RStudio (v1.2.1335), and data figures were generated in R and Adobe Illustrator CS5 (v15.0.0).
Structure visualization and bioinformatics
Protein complex annotations were extracted from the CYC2008 resource15. Protein structure coordinates were downloaded from PDB and visualized and manipulated with PyMOL16. For PUP2 interface analysis, we extracted 20S proteasome protein structure from PDB 1RYP17. Protein interface structures for ARO8, TPI1, and GAPDH were extracted from PDB (4JE518, 1NEY19, 3PYM20 respectively). To assess the stabilizing effect of S149 phosphorylation at the catalytic site of GAPDH, we aligned crystal structures of GAPDH with bound G3P (1NQO21) and inorganic phosphates (1GYP22) to a NAD-bound yeast GAPDH structure (3PYM20). Data on sequence conservation, protein interfaces, and predicted stability effects of mutations (ΔΔGpred) were obtained from the mutfunc resource23.
Reanalysis of the Huang et al. data
Supplementary data from Huang et al.2 was used to calculate the correlation between Tm for phosphopeptides and proteins and learn about their statistical parameters. For data reanalysis, all MS files from the study were downloaded from MassIVE data repository (dataset identifier: MSV000083786), converted to open format mzXML files with ReAdW (2015.1.0), and database searched with Comet24 (v.2015.02 rev2) to obtain peptide and phosphopeptide identifications, with the exception of “Bulk_6_2” which failed to convert. Database search parameters were: human protein sequence database from UniProt (UP000005640), mass tolerance of 50 ppm for precursor m/z and 0.2 Da for fragment ions, trypsin enzyme specificity (cleavage Ct to K, R, except for KP, RP), maximum of 2 missed cleavages, fixed modification of carbamidomethyl on cysteines and TMT10 (+229.1629) on lysines and peptide N-termini, and variable modifications of oxidation on methionines and acetylation on protein N-termini. Phosphorylation samples included variable modification of phosphorylation at serine, threonine, and tyrosine. Search results were filtered to a PSM 1% FDR with Percolator25. Phosphosite localization was conducted with an in-house C++ implementation of Ascore26 and sites with Ascore > 13 were considered confidently localized (P < 0.05). TMT10 reporter ion intensities were extracted from MS/MS scans using in-house TMT quantification software (IsobaricQuant 1.0).
We attempted to replicate the analysis by Huang et al. by following the method description provided in their manuscript. Biological and technical replicates were treated equally. For each replicate, TMT reporter ion intensities for all peptide spectral matches from the proteome files were summed to the protein level, and TMT reporter ion intensities for phosphopeptide spectral matches were summed to the phosphopeptide isoform level. In addition, we used the same strategy to aggregate to peptide-level the TMT signals for PSMs mapping to the same unmodified peptide observed in the phosphopeptide-enriched samples. We implemented the TPP package in R to fit melting curves for proteins, phosphopeptide isoforms, and unmodified peptides in the phosphorylation-enriched sample. To recapitulate the reported results, we had to conduct the fitting for all samples together (Supplementary Discussion). Melting curves were filtered for fitting R2 > 0.8. T-tests were conducted by comparing Tm values for phosphopeptide isoforms or unmodified peptides observed in the phosphopeptide-enriched samples to the unmodified protein Tm values, assuming equal variances and without multiple hypothesis correction (as implemented by Huang et al.). Of note, our reanalysis revealed that one of the phosphoproteome technical injections for biological replicate 5 was instead a repeated MS analysis of biological replicate 4.
DATA AVAILABILITY
The mass spectrometry proteomics data generated for this manuscript have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD016750.
CODE AVAILABILITY
All code to reproduce analysis and data figures is available at https://gitlab.com/villenlab/dali_phospho_thermalstability.
Extended Data
Extended Data Figure 1. Reproducibility and robustness of Dali compared to HTP.

a, Scatter plot and Pearson correlation between the mean Tm for unmodified peptides observed in the phosphopeptide enriched samples (n=10) and the mean Tm for their corresponding proteins (n=11). Results from the Huang et al. data reanalysis conducted by us. b, Scatter plot and Pearson correlation as in (a) with Rs values obtained from the Dali method (n=6).
Extended Data Figure 2. Phosphosites that significantly alter protein thermal stability using two different statistical settings.

Volcano plots showing ΔTm for mean phosphopeptide isoform to mean protein counterpart in the x-axis, and the two-sided Student’s t-test probability in the y-axis. a, Huang et al. implementation shows a p-value because multiple hypothesis correction was not applied. Significant phosphopeptide isoforms (blue) are defined by p-value < 0.05. b, Our proposed analysis consolidates data from MS reanalysis prior to statistical testing using a two-sided Welch’s t-test, which is performed assuming unequal variances between phosphopeptide isoform and proteins. Benjamini-Hochberg adjustment was used to correct p-values for multiple hypothesis testing. Significant phosphopeptide isoforms (blue) are defined by q-value < 0.05.
Extended Data Figure 3. Examples of significant hits on proteins that undergo post-translational splicing or cleavage.

a, Rs values for observed VMA1 unmodified peptides identified in phosphopeptide-enriched samples and proteome samples displayed across the length of VMA1. Spliced products from amino acid 2–283 and 738–1031 are joined to generate the V-type proton ATPase catalytic subunit A proteoform, extinguishing the 284–737 segment. Peptides derived from the proteome samples are colored in gray and significant unmodified peptides found in the phosphopeptide-enriched sample are in red. b, Similar plot to (a) for RPS31, which is cleaved to generate ubiquitin (1–76 amino acid segment) and 40S ribosomal protein S31 (77–152 amino acid segment) proteins.
Extended Data Figure 4. Examples of phosphosites that alter protein thermal stability and are located at protein interfaces.

Rs boxplots for a, ARO8 S59, b, TPI1 S79, and c, GAPDH S201 phosphopeptide isoforms and their protein counterparts. All boxplots show results from n=6 biological replicates, and the line represents the median, the box designates the interquartile range (IQR), and the whiskers define 1.5*IQR from the box ends. ARO8 S59, TPI1 S79, and GAPDH S201 reside at dimerization interfaces as shown in the structures to the right (PDB accession: 4JE5, 1NEY, and 3PYM, respectively). Phosphomimetic mutations ARO8 S59E and TPI1 S79E are predicted to disrupt protein interfaces (ΔΔGpred = 3.78 and ΔΔGpred =8.04 respectively). Additionally, TPI1 S79E mutation is predicted to alter protein conformational stability (ΔΔGpred = 2.39). ΔΔGpred > 2 is predicted to be destabilizing.
Extended Data Figure 5. Examples of phosphosites that alter protein thermal stability on glycolytic enzymes.

a, Rs values for PGK1 and all measured PGK1 phosphopeptide isoforms, with significantly destabilizing phosphosite S331 shown in red. ΔΔGpred for all glutamic acid phosphomimetic substitutions were obtained from mutfunc, with ΔΔGpred > 2 considered likely destabilizing. b, GAPDH S149 phosphopeptide is shared across all GAPDH paralogs (TDH1, TDH2, and TDH3). Boxplot shows Rs values and distributions for peptides unique to one isoform (TDH1, TDH2, TDH3), peptides shared among all GAPDH isoforms (all), all peptides for TDH3, and the S149 phosphopeptide isoform. Bottom panel shows localization of S149 on the GAPDH structure near the binding site of the enzyme substrate. Boxplots show results from 6 biological replicates, the line represents the median, the box designates the interquartile range (IQR), and the whiskers define 1.5*IQR from the box ends.
Supplementary Material
ACKNOWLEDGEMENTS
We thank members of the Villén lab for scientific discussions, in particular Bianca Ruiz, Mario Leutert, and Alex Hogrebe. We thank Ariadna Llovet Soto and Jimmy Eng for software developments on the data analysis pipeline. I.R.S. and K.N.H were supported by NIH training grant T32HG000035. A.S.V. was supported by NIH training grant T32LM012419. Most of this work was supported by NIH grant R35GM119536 to J.V. The Villén lab is additionally supported by NIH grants R01AG056359, R01NS098329, and RM1 HG010461, Human Frontiers Science Program grant RGP0034/2018, a Research Program grant from the W.M. Keck Foundation, and the University of Washington Proteome Resource UWPR95794.
Footnotes
COMPETING INTERESTS
The authors declare no competing interests.
REFERENCES
- 1.Hornbeck PV et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res 43, D512–D520 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Huang JX et al. High throughput discovery of functional protein modifications by Hotspot Thermal Profiling. Nat. Methods 16, 894–901 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Potel C, Kurzawa N, Beecher I, Mateus A & Savitski MM Impact of phosphorylation on thermal stability of proteins. bioRxiv (2020) doi: 10.1101/2020.01.14.903849. [DOI] [PubMed] [Google Scholar]
- 4.Gaetani M et al. Proteome Integral Solubility Alteration: A High-Throughput Proteomics Assay for Target Deconvolution. J. Proteome Res 18, 4027–4037 (2019). [DOI] [PubMed] [Google Scholar]
- 5.Savitski MM et al. Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 346, 1255784 (2014). [DOI] [PubMed] [Google Scholar]
- 6.Finley D, Bartel B & Varshavsky A The tails of ubiquitin precursors are ribosomal proteins whose fusion to ubiquitin facilitates ribosome biogenesis. Nature 338, 394–401 (1989). [DOI] [PubMed] [Google Scholar]
- 7.Kane PM et al. Protein splicing converts the yeast TFP1 gene product to the 69-kD subunit of the vacuolar H(+)-adenosine triphosphatase. Science 250, 651–657 (1990). [DOI] [PubMed] [Google Scholar]
- 8.Holt LJ et al. Global Analysis of Cdk1 Substrate Phosphorylation Sites Provides Insights into Evolution. Science 325, 1682–1686 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Dephoure N et al. A quantitative atlas of mitotic phosphorylation. Proc. Natl. Acad. Sci 105, 10762–10767 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Imami K et al. Phosphorylation of the Ribosomal Protein RPL12/uL11 Affects Translation during Mitosis. Mol. Cell 72, 84–98.e9 (2018). [DOI] [PubMed] [Google Scholar]
- 11.Viéitez C et al. Towards a systematic map of the functional role of protein phosphorylation. bioRxiv (2019) doi: 10.1101/872770. [DOI] [Google Scholar]
- 12.Leutert M, Rodriguez-Mias RA, Fukuda NK & Villén J R2-P2 rapid-robotic phosphoproteomics enables multidimensional cell signaling studies. Mol. Syst. Biol 15, e9021 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cox J & Mann M MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol 26, 1367–1372 (2008). [DOI] [PubMed] [Google Scholar]
- 14.Benjamini Y & Hochberg Y Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B Methodol 57, 289–300 (1995). [Google Scholar]
- 15.Pu S, Wong J, Turner B, Cho E & Wodak SJ Up-to-date catalogues of yeast protein complexes. Nucleic Acids Res 37, 825–831 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.The PyMOL Molecular Graphics System, Version 1.2r3pre, Schrödinger, LLC. [Google Scholar]
- 17.Groll M et al. Structure of 20S proteasome from yeast at 2.4Å resolution. Nature 386, 463–471 (1997). [DOI] [PubMed] [Google Scholar]
- 18.Bulfer SL, Brunzelle JS & Trievel RC Crystal structure of Saccharomyces cerevisiae Aro8, a putative α-aminoadipate aminotransferase. Protein Sci 22, 1417–1424 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jogl G, Rozovsky S, McDermott AE & Tong L Optimal alignment for enzymatic proton transfer: Structure of the Michaelis complex of triosephosphate isomerase at 1.2-Å resolution. Proc. Natl. Acad. Sci 100, 50–55 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Garcia-Saez I, Kozielski F, Job D & Boscheron C Structure of GAPDH 3 from S. cerevisiae at 2.0 Å resolution (2010) doi: 10.2210/pdb3PYM/pdb. [DOI] [Google Scholar]
- 21.Didierjean C et al. Crystal Structure of Two Ternary Complexes of Phosphorylating Glyceraldehyde-3-phosphate Dehydrogenase from Bacillus stearothermophilus with NAD and d-Glyceraldehyde 3-Phosphate. J. Biol. Chem 278, 12968–12976 (2003). [DOI] [PubMed] [Google Scholar]
- 22.Kim H, Feil IK, Verlinde CLMJ, Petra PH & Hol WGJ Crystal Structure of Glycosomal Glyceraldehyde-3-phosphate Dehydrogenase from Leishmania mexicana: Implications for Structure-Based Drug Design and a New Position for the Inorganic Phosphate Binding Site. Biochemistry 34, 14975–14986 (1995). [DOI] [PubMed] [Google Scholar]
- 23.Wagih O et al. A resource of variant effect predictions of single nucleotide variants in model organisms. Mol. Syst. Biol 14, e8430 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Eng JK, Jahan TA & Hoopmann MR Comet: an open-source MS/MS sequence database search tool. Proteomics 13, 22–24 (2013). [DOI] [PubMed] [Google Scholar]
- 25.Käll L, Canterbury JD, Weston J, Noble WS & MacCoss MJ Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923–925 (2007). [DOI] [PubMed] [Google Scholar]
- 26.Beausoleil SA, Villén J, Gerber SA, Rush J & Gygi SP A probability-based approach for high-throughput protein phosphorylation analysis and site localization. Nat. Biotechnol 24, 1285–1292 (2006). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mass spectrometry proteomics data generated for this manuscript have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD016750.
