Nonparametric Analysis of Thermal Proteome Profiles Reveals Novel Drug-binding Proteins

Dorothee Childs; Karsten Bach; Holger Franken; Simon Anders; Nils Kurzawa; Marcus Bantscheff; Mikhail M Savitski; Wolfgang Huber

doi:10.1074/mcp.TIR119.001481

. 2019 Oct 3;18(12):2506–2515. doi: 10.1074/mcp.TIR119.001481

Nonparametric Analysis of Thermal Proteome Profiles Reveals Novel Drug-binding Proteins^*

Dorothee Childs ^‡,^‖,^§§, Karsten Bach ^§,^¶,^§§, Holger Franken ^‖,^§§, Simon Anders ^**, Nils Kurzawa ^‡, Marcus Bantscheff ^‖, Mikhail M Savitski ^‡, Wolfgang Huber ^‡,^‡‡

PMCID: PMC6885700 PMID: 31582558

A method for nonparametric analysis of response curves from thermal proteome profiling (TPP) experiments is proposed. The approach achieves high proteome coverage and sensitivity while controlling the false discovery-rate. Free open source software for NPARC is provided.

Keywords: Drug targets, algorithms, biostatistics, tandem mass spectrometry, mathematical modeling, functional data analysis, thermal proteome profiling

Graphical Abstract

graphic file with name zjw0121960490006.jpg

Highlights

Method for the analysis of response curves from thermal proteome profiling (TPP).
NPARC uses nonparametric statistics and provides false discovery-rate (FDR) control.
Increased proteome coverage and sensitivity to identify drug-binding proteins.

Abstract

Detecting the targets of drugs and other molecules in intact cellular contexts is a major objective in drug discovery and in biology more broadly. Thermal proteome profiling (TPP) pursues this aim at proteome-wide scale by inferring target engagement from its effects on temperature-dependent protein denaturation. However, a key challenge of TPP is the statistical analysis of the measured melting curves with controlled false discovery rates at high proteome coverage and detection power. We present nonparametric analysis of response curves (NPARC), a statistical method for TPP based on functional data analysis and nonlinear regression. We evaluate NPARC on five independent TPP data sets and observe that it is able to detect subtle changes in any region of the melting curves, reliably detects the known targets, and outperforms a melting point-centric, single-parameter fitting approach in terms of specificity and sensitivity. NPARC can be combined with established analysis of variance (ANOVA) statistics and enables flexible, factorial experimental designs and replication levels. An open source software implementation of NPARC is provided.

Determining the cellular interaction partners of drugs and other small molecules remains a key challenge (1, 2, 3, 4). In drug research, better assays to detect targets (and off-targets) would provide valuable information on drugs' mechanisms of action, reveal potential reasons for side effects, and elucidate avenues for drug repurposing. More broadly, in cell biology basic research, the dynamical landscape of binding partners of metabolites, messengers or chemical probes contains much uncharted territory. Thermal proteome profiling (TPP)¹ addresses these needs by screening for protein targets of drugs or small molecules in living cells on a proteome-wide scale (5, 6). TPP combines multiplexed quantitative mass spectrometry with the cellular thermal shift assay (CETSA) (7), which identifies binding events from shifts in protein thermostability (see supplemental Fig. S1 for a detailed explanation). A typical TPP experiment generates temperature dependent abundance measurements for a large part of the cellular proteome. Drug binding proteins can then be inferred by comparing the melting curves of proteins between samples treated with drug and vehicle (negative control without drug).

Applications of TPP successfully identified previously unknown protein-ligand interactions (5), protein complexes (8) and downstream effects of drugs in signaling networks (6, 9, 10, 11) in human cells. Recently it has also been extended to study drug resistance in bacteria (12) and targets of antimalarial drugs in plasmodium (13). There is urgent interest in further advancing its component technologies, including experimental and computational aspects, in order to maximize its biological discovery potential (14, 15, 16, 17, 18, 19).

The central computational task in TPP data analysis is the comparison of the temperature dependent abundance measurements—which can be visualized as melting curves— for each protein with and without (or with various concentrations of) drug. The aim is to detect changes in thermostability from statistically significant changes in the melting curves.

A naïve approach is to summarize each curve into a single parameter, such as the melting point (T_m), which is defined as the temperature of half-maximum relative abundance (horizontal line in Fig. 1A). Its value is estimated by fitting a parametric model separately for the control and treatment conditions and comparing the estimates. Statistical significance is assessed using replicates and hypothesis testing, such as a t- or z-test. Although the approach has delivered valid and important results (5, 6, 20, 21), we will see in the following that it tends to lead to needlessly high rates of false negatives. There are three main reasons for that: first, drug-induced effects on thermostability do not always imply significant shifts in T_m (Fig. 1B–1C). Second, the true T_m of a protein can lie outside of the measured temperature range, which impairs its estimation (Fig. 1D). Both scenarios can result in important targets being missed in the analysis (Fig. 1E). The third reason is a more subtle statistical one: hypothesis tests using only the point estimates of T_m do not consider goodness-of-fit of the parametric model or the confidence range of the estimates. Thus, important information is ignored, which statistically leads to loss of power.

Fig. 1. — **TPP data analysis challenges.** *A–D*, Examples for protein melting curves with and without drug (see color keys). In each case, ten temperatures were assayed, and two experimental replicates were made per condition, indicated by circle and triangle symbols. Fits of the sigmoid model (Eq. (3)) to both replicates jointly are shown by smooth lines. A, For serine/threonine protein kinase 4 (STK4), the binding of staurosporine is reflected by a marked shift between the curves. The fitted values for the melting points (T_m) are shown. B, For Bruton's tyrosine kinase (BTK), there is a small but reproducible shift between the curves. C, Protein kinase C beta (PRKCB) is destabilized by staurosporine; the effect occurs mainly at lower temperatures. D, NAD(P)H quinone dehydrogenase 2 (NQO2) is strongly stabilized by staurosporine. Although in each case, the effects of drug binding are clearly reproducible between replicates, the T_m-based approach of (6) only detects (A) and misses (*B–D*). In the case of (B) and (C), the fitted T_m are too similar, so that the statistical test does not assess the difference as significant. In the case of (D), no reasonable estimate for T_m in the staurosporine treated condition can be obtained, as it would lie outside the measured temperature range, and the protein is discarded from the analysis. In contrast, NPARC, the method proposed in this article, detects all four cases. E, The fraction of proteins in each of the data sets of Table I that is missed by the T_m-based approach because of failure to estimate T_m or to meet the goodness-of-fit criterion (Table III).

Here, we propose an alternative approach that compares whole curves instead of summary parameters and does not rely on T_m estimation. The method, nonparametric analysis of response curves (NPARC), is based on a branch of statistical data analysis that works on continuous functions rather than individual numbers, termed functional data analysis (22). It considers the measured melting curves as samples from an underlying stochastic process with a smooth mean function—which can be modeled parametrically or nonparametrically (23)—and constructs its hypothesis tests directly on these samples. NPARC's F-statistic uses a more flexible model that makes fewer assumptions on the data than T_m-estimation, is computationally more stable, and it directly uses the information from replicates. Consequently, reliable estimates of the null distribution of this statistic can be obtained, it shows higher sensitivity for small but reproducible effects, and failures because of model misspecification or outliers are reduced. This increases proteome coverage, which can make the difference between missing or detecting an important drug target.

We demonstrate NPARC on the five published data sets introduced in Table I. We also compare its results to those of the T_m-based method used by (6). Three of the experiments used the cancer drugs panobinostat or dasatinib in different concentrations, one investigated the effects of the high-affinity, ATP-competitive pan-kinase inhibitor staurosporine, one the cellular metabolite ATP. Although the cancer drugs interact with limited sets of proteins, the two other compounds are promiscuous binders and affect the thermostability of a large fraction of the cellular proteome.

Table I. Datasets and sample sizes.

Data set	Treatment	Concentration	Buffer	Cell line	Intact cells or lysate	Proteins	Reference
ATP data	MgATP	2 μm	PBS	K562	Lysate	4177	[9]
Dasatinib 0.5 μm data	Dasatinib	0.5 μm	PBS	K562	Intact Cells	4625	[5]
Dasatinib 5 μm data	Dasatinib	5 μm	PBS	K562	Intact Cells	4154	[5]
Panobinostat data	Panobinostat	1 μm	PBS	K562	Intact Cells	3649	[6]
Staurosporine data	Staurosporine	20 μm	PBS	K562	Lysate	4505	[5]

Open in a new tab

EXPERIMENTAL PROCEDURES

Data Sets and Preprocessing

Five TPP data sets (Table I) were obtained from the supplements of the respective publications. Each data set contained relative abundance measurements per protein and temperature which had been scaled to the value measured at 37 °C (the lowest of the ten temperatures assayed) and subjected to the global normalization procedure described by Savitski et al. (5). Only proteins quantified with at least one unique peptide in each of two replicates of the vehicle and compound treated conditions were included in the analysis; the resulting proteome coverages are listed in Table I.

Curation of Lists of Expected Targets

Lists of expected protein targets for the pan-kinase inhibitor staurosporine and ATP were obtained from Gene Ontology Consortium annotations via the Bioconductor annotation packages AnnotationDbi (version 1.36.2), org.Hs.eg.db (version 3.4.0) and GO.db (version 3.4.0). Terms and numbers of annotated proteins are shown in Table II.

Table II. Expected targets per dataset.

Data set	Gene Ontology term	Number of proteins in data set with this term
ATP	ATP-binding	558
Staurosporine	protein kinase activity	187

Open in a new tab

Mathematical Model

NPARC is based on fitting two competing models to the data, a null model and an alternative model. The null model states that the relative protein abundance at temperature t (given in °C) is explained by a single smooth function μ₀(t) irrespective of the treatment condition (Fig. 2A). The alternative model posits two condition-specific functions: μ_T(t) for the treatment condition and μ_V(t) for the vehicle condition (Fig. 2B). Deviations between observed data and fitted model are referred to as residuals, and the sum of squared residuals (RSS) serves as an indicator of each model's goodness-of-fit. We then compute

R S S_{0} = \sum_{t, i, c} {(x_{t, i, c} - μ_{0} (t))}^{2},

(1)

R S S_{1} = \sum_{t, i, c} {(x_{t, i, c} - μ_{c} (t))}^{2},

(2)

where x_t,i,c is the measured value at temperature t for experimental replicate i and condition c ∈ {V,T}, and the summations extend over all temperatures, replicates, and conditions.

Fig. 2. — **Principles of NPARC, illustrated for protein STK4 under staurosporine treatment.** A, Fit of the null model, *i.e.* no treatment effect (black line). The goodness-of-fit is quantified by *RSS*₀, the sum of squared residuals (dashed lines). As in Fig. 1, the triangle and circle symbols indicate the experimental replicates. B, Fit of the alternative model, with separate curves for the treated (orange) and the vehicle condition (gray). Because of the higher flexibility of the model, the sum of squared residuals *RSS*₁ is always less than or equal to *RSS*₀. C, The question whether the improvement in the goodness-of-fit, *i.e.* the difference *RSS*₀ − *RSS*₁, is strong enough to reject the null hypothesis can be addressed with the variant of the F-test described in the main text. Each point in the plot corresponds to a different protein. The highlighted example STK4 has a large F-statistic and a small p value.

Choice of the Mean Function

The mean functions μ₀(t), μ_T(t) and μ_V(t) are each chosen from the same space of smooth functions f : ℝ₊ → [0,1] spanned by the three parameters a, b ∈ ℝ, f_∞ ∈ [0,1] and the prescription

f (t) = \frac{1 - f_{\infty}}{1 + e^{- (\frac{a}{t} - b)}} + f_{\infty} .

(3)

The shape of these functions is sigmoid, and the functional form (Eq. 3) can be motivated by simplifying protein thermodynamics considerations (5). The mean functions and the RSS values are computed separately from the data for each protein. In order not to overburden the notation, we omit the protein indices.

Hypothesis Test Statistic

To discriminate between null and alternative models, we compute the F-statistic

F = \frac{d_{2}}{d_{1}} \frac{R S S_{0} - R S S_{1}}{R S S_{1}},

(4)

with d₂/d₁ > 0 defined as below. F quantifies the relative reduction in residuals from null to alternative model. Although F is by definition always positive, it will be small for proteins not affected by the treatment, whereas a high value of F indicates a reproducible change in thermostability.

Null Distribution

To compute a p value from a value of the F-statistic (Eq. 4), we need its null distribution, i.e. its statistical distribution if the data generating process is described by a common mean function μ₀(t). If the residuals were independent and identically normal distributed, this distribution would be given by an analytical formula, namely that of the F(d₁, d₂)-distribution with parameters d₁, d₂ > 0, and these parameters—sometimes called degrees of freedom—would be explicitly given from the number of measurements and number of model parameters that go into the computation of RSS₀ and RSS₁. In practice, this is not the case, because the residuals are heteroscedastic (i.e. have different variances at different temperatures) and correlated. However, the family of F(d₁, d₂)-distributions is quite flexible, and we can approximate the distribution of the F-statistic (Eq. 4) on data occurring in practice with an F(d₁, d₂)-distribution with different “effective degrees of freedom” d₁, d₂. To this end, we separately approximate the numerator and denominator of F as

\frac{R S S_{0} - R S S_{1}}{d_{1}} \overset{H_{0}}{~} \frac{σ_{0}^{2}}{d_{1}} \cdot χ^{2} (d_{1}),

(5)

\frac{R S S_{1}}{d_{2}} \overset{H_{0}}{~} \frac{σ_{0}^{2}}{d_{2}} \cdot χ^{2} (d_{2}),

(6)

and use the fact that the ratio of two χ²-distributed random variables χ²(d₁), χ²(d₂) has an F(d₁, d₂)-distribution (24). The scale parameter σ²₀ and the effective degrees of freedom d₁ and d₂ are estimated from the empirical distributions—across proteins—of RSS₀ and RSS₁. Thus, we assume that σ²₀, d₁, d₂ are the same for all proteins. We estimate σ²₀ from the moments of RSS₀ − RSS₁ as

σ_{0}^{2} = \frac{1}{2} \frac{var (R S S_{0} - R S S_{1})}{mean (R S S_{0} - R S S_{1})}

(7)

where mean and variance are computed across proteins on the observed values of RSS₀ − RSS₁ (see Supplementary Methods for details). Then, d₁ and d₂ are obtained by numerical optimization of the likelihoods for models (Eq. 5) and (Eq. 6) using the fitdistr function of the R package MASS (25).

p Values

For each protein, a p value is computed from its F-statistic and the cumulative F-distribution with parameters d₁, d₂ as described above. The multiset of p values across all proteins is corrected for multiple testing with the method of Benjamini and Hochberg (26). The outcome of such an analysis is exemplarily shown in Fig. 2C.

RESULTS

Application to Panobinostat

We assessed the ability of NPARC to detect drug targets on a data set on panobinostat (Table I). Panobinostat is a broad-spectrum histone deacetylase (HDAC) inhibitor known to interact with HDAC1, HDAC2, HDAC6, HDAC8, HDAC10, and tetratricopeptide repeat protein 38 (TTC38) (6).

Out of 3649 proteins reproducibly quantified across both biological replicates in both treatment conditions, NPARC yielded 16 proteins with Benjamini-Hochberg adjusted p values ≤ 0.01. They contained the expected HDAC targets (Fig. 3A–3E) as well as TTC38, the histone proteins H2AFV or H2AFZ (the two variants could not be distinguished by mass spectrometry), and zinc finger FYVE domain-containing protein 28 (ZFYVE28) (Fig. 3F–3H). These proteins were previously identified as direct or indirect targets of panobinostat (6, 11). In addition, eight more proteins were detected for which no direct or indirect interactions with panobinostat have been described (supplemental Fig. S3). They reached statistical significance because they either showed effect sizes comparable to known panobinostat targets, or more subtle but highly reproducible changes in a similar strength to those already described for dasatinib target BTK (Fig. 1B). We reanalyzed the more recent 2D-TPP data set of short-term (15 min) panobinostat-treatment of HepG2 cells (11) for these proteins. All of them were identified and quantified at sufficient peptide coverage, but none of them showed stabilization. We thus conclude that the additionally found proteins are likely not direct binders of panobinostat, but rather indirect effects, like altered protein-protein interactions or post-translational modifications. The longer (5 h) incubation time of the assay used to generate the panobinostat data set in Table I makes it more sensitive to such effects.

Fig. 3. — **Direct and indirect targets of the HDAC inhibitor panobinostat detected by NPARC (FDR ≤ 0.01 according to the method of Benjamini and Hochberg (26)).** *A–E*, Data and curve fits for five HDACs that show significant shifts in their thermostability. HDAC1 and HDAC2 are not detected by the T_m-based approach of (6), because the higher variance between the replicates of the panobinostat-treated condition leads to them being eliminated by the filter heuristics of that method (Table III). In contrast, NPARC naturally takes the variance into account in the computation of the F-statistic and does not require such filtering steps. *F–H*, Data and curve fits for known non-HDAC targets.

Beyond Two-group Comparisons

Because NPARC is based on analysis of variance (ANOVA), it admits experimental designs in which the covariate has multiple levels. An example is the data set for the BCR-ABL inhibitor dasatinib, which comprises measurements on cells treated at two different concentrations as well as untreated cells. NPARC successfully identified known targets of dasatinib (supplemental Fig. S4).

Replicate Agreement and Model Fit Diagnostics

Application of the T_m -based approach by (6) to the panobinostat data failed to detect HDAC1 and HDAC2. This was because the data for these proteins had relatively high variance in the drug-treated condition, as is visible in Fig. 3A–3B. This led to their exclusion according to one of the data quality filter criteria of that method (Table III), namely the criterion that asks for sufficiently high coefficients of determination (R²). In contrast, a better and statistically sound trade-off between variability and effect size is an integral part of NPARC and does not require an ad hoc filter criterion.

Table III. A priori filters applied in the original TPP analysis workflow [6] to select proteins for hypothesis testing.

Rule number	Rule
1	Both fitted curves for the vehicle and compound treated condition have a coefficient of determination R² > 0.8, where R²: = 1 − $\frac{\sum_{t} {(x_{t} - μ_{c} (t))}^{2}}{\sum_{t} {(x_{t} - \bar{x})}^{2}}$ , with μ_c(t) being the model prediction for condition c at temperature t, x̄ being the mean of all measurements for the protein within a particular condition and replicate, and the summation extending over all temperatures.
2	The two curves fitted to the two replicates of the vehicle conditions have a plateau f_∞ < 0.3.
3	In each biological replicate, the steepest slope of the melting curve in the vehicle and treatment condition needs to be < −0.06 °C⁻¹.

Open in a new tab

To further assess the price of the various filter criteria of the T_m-based approach by (6), we tabulated the numbers of proteins affected by them in each of the five data sets. These proteins would, in principle, not be detectable by that method, no matter how strong the effect. Their numbers amounted to 14–25% of the total numbers of proteins for which melting points could be determined in both replicates (Fig. 1E and Table IV) and to 21–32% of all proteins irrespective of melting point availability (supplemental Fig. S2). In contrast, the F-test of NPARC could be applied to all proteins irrespective of these or similar criteria, a fact which contributed to the increased protein coverage and sensitivity of NPARC.

Table IV. Coverage of proteins applicable for hypothesis testing by the original TPP analysis workflow [6].

Data set	T_m outside measured range	T_m available but curves not passing a priori filters	T_m available and curves passing a priori filters
ATP data	220	1004	2953
Dasatinib 0.5 μm data	689	768	3168
Dasatinib 5 μm data	667	507	2980
Panobinostat data	320	461	2868
Staurosporine data	621	631	3253

Open in a new tab

Effects Beyond Those on the Melting Point

Many of the proteins detected by NPARC displayed reproducible changes in curve shape, whereas their T_m-shifts were small, and not considered significant by the T_m-based approach (Fig. 4). An example is the effect of staurosporine on protein kinase C beta (PRKCB), shown in Fig. 1C. PRKCB is part of the PKC family, whose members were the first reported staurosporine targets (27, 4) and also exhibit similar characteristics (supplemental Fig. S5).

Fig. 4. — **NPARC is sensitive to small but reproducible T_m-shifts.** The plot compares the effect size measure used by NPARC, namely *RSS*₀ − *RSS*₁ (y axis), to the T_m-difference (x axis) for those proteins in the staurosporine data set for which T_m estimates could be obtained. Proteins with Benjamini-Hochberg adjusted p values ≤ 0.01 are marked in red if they were exclusively found by NPARC, and in green if they were also detected by the T_m-based approach of (6). NPARC detects targets with small T_m-differences if the measurements are reproducible between replicates.

Further examples include the effects of staurosporine on RanGTP binding tRNA export receptor exportin-T (XPOT) and two members of the p38 MAPK signaling pathway: Mitogen-activated protein kinase 14 (MAPK14) and MAP kinase-activated protein kinase 2 (MAPKAPK2) (Fig. 4); and the effect of dasatinib on Bruton tyrosine kinase (BTK), an important drug target in B-cell leukemia (Fig. 1B).

Missing Melting Point Estimates

For highly thermostable proteins, the T_m in one or more of the treatment conditions can be outside of the tested temperature range of a TPP experiment (Fig. 1E). One example is NAD(P)H quinone dehydrogenase 2 (NQO2), a cytosolic flavoprotein and a common off-target of kinase inhibitors (28, 29, 30). In concordance with previous CETSA studies that found NQO2 to be highly stable (31), we observed denaturation only beginning at 67 °C (Fig. 1D). Staurosporine treatment further stabilized NQO2 to an extent that it showed no sign of melting in the tested temperature range. The T_m -based approach by (6) will discard such proteins, in order to avoid potential problems from extrapolation of the fit beyond the measured temperature range. In contrast, the functional data analysis approach of NPARC can detect changes in any part of the melting curves, without reference to a single point such as T_m.

Sensitivity and Specificity

So far, we have described increased sensitivity of NPARC, i.e. its ability to detect more true targets. However, this is only useful if at the same time specificity is maintained, i.e. if false positive detection remains under control. To compare these performance characteristics between NPARC and the T_m-based approach, we computed pseudo receiver operator characteristic (ROC) curves for each of these methods on the staurosporine data and the ATP data, using as pseudo ground truth lists of expected targets from Gene Ontology annotation (Table II). Here, the term pseudo refers to the fact that these target lists, and hence the ROC curves, are only approximations of the truth; however, the relative ranking of two methods in such a pseudo-ROC comparison is likely to be faithful even in the presence of such approximation error (32).

Fig. 5 shows the results of NPARC on both data sets, as well as those of the z-test of the T_m-based approach by (6) applied to the individual replicates (displayed as continuous lines parameterized by the z-cutoff), and those of the full procedure of (6) (shown by isolated points, because of its single, fixed cutoff). On the staurosporine data, the full procedure of (6) performs close to NPARC. For the individual z-tests, as well as overall on the ATP data, NPARC shows superior performance. Given that the decision rule set of (6) (listed in Table V) and its cutoff parameters were developed and tuned partly on the staurosporine data, these results indicate that NPARC has fewer “fudge parameters” and is likely to be superior in applications to new data sets.

Table V. Decision ruleset applied in the original TPP analysis workflow [6] to combine z-test p values across replicates in an experimental design with two biological replicates.

Rule number	Rule
1	The Benjamini-Hochberg adjusted z-test p values fulfill predefined thresholds in each replicate.
2	Both T_m differences are either positive or negative in the two biological replicates.
3	The smallest absolute difference between treatment and vehicle T_m is greater than the absolute T_m difference between the two vehicle experiments.

Open in a new tab

DISCUSSION

Thermal proteome profiling offers the possibility to comprehensively characterize ligand-protein interactions on a proteome-wide scale in living cells. However, the method poses the analytical challenge of how to identify statistically significant shifts in thermostability among thousands of measurements.

To address this challenge, we introduced a functional data analysis approach to test for treatment effects by comparing competing models by their goodness-of-fit. This enables detection of treatment effects even if a (de-)stabilization of a protein is not captured by a single summary parameter like the T_m. The presented method is based on a sound statistical foundation and does not rely on hard-to-choose cutoff or tuning parameters. We showed that our method compares favorably to previous approaches with respect to sensitivity and specificity for several exemplary data sets, including ones with specific and ones with promiscuous binders.

The approach fits into the framework of analysis of variance (ANOVA) or linear models and can thus be extended to experimental designs more complex than treatment-control comparisons, such as multiple levels (e.g. drug concentrations) per covariate, multiple covariates and interactions.

The suggested framework is flexible regarding the mean function used to represent the melting behavior and can be adapted to the particular biological process of interest. To represent nonlinear relationships, approaches include locally linear regression (33), spline regression (34, 35) and nonlinear parametric regression. Here, we chose the latter as it incorporates a priori knowledge about the data and thus has favorable estimation efficiency. For example, sigmoid curves have horizontal asymptotes at both sides of the temperature range. In contrast, splines and local regression tend to overfit data near the boundaries of the observation range.

In a cellular environment we occasionally observe nonsigmoid melting curves for subsets of proteins. One possible reason is the presence of protein subpopulations each with distinct melting curves (16). For example, the formation of protein complexes, the binding to other molecules, or the localization in cellular compartments can lead to deviations from the idealized sigmoid melting curve expected from the same protein in purified form. Our model currently does not account for such systematic and reproducible shape deviations. This could be adapted in future work by adding a low-parametric systematic modification to the sigmoid mean function.

We have considered CETSA experimental designs, where the temperature is the major experimental variable and drug concentration is either zero or a chosen value. It appears relatively straightforward to extend NPARC to the isothermal dose response (ITDR) design (6, 7) where temperature is held constant and the drug concentration is varied across a range of values. A further extension of interest would be to 2D-TPP (11) where both factors are changed.

We employ the same “average” null distribution for all proteins, which we obtain by estimating its parameters (d₁, d₂, σ₀) from the distributions of residuals across all proteins. It is conceivable that determining null distributions in a protein dependent manner, for instance by stratification, could increase the overall power of the method.

The here presented approach is likely to increase the accuracy of profiling protein-ligand interactions in living cells. We provide an open source R package NPARC, and all computations reported in the figures and tables of this article can be reproduced by running the Rmarkdown script provided in reference (36).

Additional Files

The following Figs. and Tables can be found in the Supplementary Material

Additional file 1 — Supplementary Methods

Detailed description of the fitting procedures for the scaling parameter, melting points, and mean functions of the model.

Additional file 2 — Table S1

Spreadsheet containing the results of the NPARC approach and of the T_m-based approach for all data sets listed in Table I.

Additional file 3 — Supplemental Figs. S1–S5

PDF containing Supplemental Figs. S1–S5

Additional file 4 — Supplemental Fig. S6

All proteins detected by the NPARC approach with Benjamini-Hochberg adjusted F-test p values ≤ 0.01 in the staurosporine data.

Additional file 5 — Supplemental Fig. S7

All proteins detected by the NPARC approach with Benjamini-Hochberg adjusted F-test p values ≤ 0.01 in the ATP data.

DATA AVAILABILITY

Datasets used in this work were downloaded as spreadsheets from journal websites. Table 1 summarizes all datasets used in this work and lists all respective references. The TPP-TR experiments based on staurosporine-, panobinostat-, ATP-, and dasatinib treatments are included in the supplemental materials of references (5, 6, 9). All results generated from this data are provided in the supplemental material attached to this work. A vignette describing the workflow and relevant code to reproduce the results is available for download (36). An open-source R package is available from https://github.com/Huber-group-EMBL/NPARC, and provision via Bioconductor is intended.

Supplementary Material

supplemental Figs. S6 and S7

TIR119.001481_index.html^{(1.6KB, html)}

Supplementary Methods

144658_1_supp_388240_px6jjs.pdf^{(169.9KB, pdf)}

Table S1

144658_1_supp_388212_px67md.xlsx^{(5.3MB, xlsx)}

Supplementary Figures S1-S5

144658_1_supp_388213_px6cr4.pdf^{(270.2KB, pdf)}

Supplementary Figure S6

144658_1_supp_388214_px6cr4.pdf^{(145.4KB, pdf)}

Supplementary Figure S7

144658_1_supp_388215_px6cr4.pdf^{(131.9KB, pdf)}

Acknowledgments

We thank Sindhuja Sridharan (EMBL Heidelberg) for help with figure preparation. We thank EMBL and GlaxoSmithKline for supporting the work.

Footnotes

* K.B. is supported by a Cambridge Cancer Centre studentship. S.A. is funded by the Deutsche Forschungsgemeinschaft, SFB 1036. W.H. acknowledges funding from the European Commission's H2020 Programme, Collaborative research project SOUND (Grant Agreement no 633974). One or more authors has an actual or perceived conflict of interest with the contents of this article. H.F., M.S., D.C., and M.B. are employees and/or shareholders of GlaxoSmithKline. N.K. is supported by a fellowship of the European Molecular Biology Laboratory International PhD Program.

This article contains supplemental material.

¹ The abbreviations used are:

TPP: Thermal proteome profiling
CETSA: Cellular thermal shift assay
FDR: False discovery rate
H₀: Null hypothesis
NPARC: Nonparametric analysis of response curves
ROC: Receiver operating characteristic
RSS: Residual sum of squares
T _m: Melting point.

REFERENCES

1. Comess, K. M., McLoughlin, S. M., Oyer, J. A., Richardson, P. L., Stöckmann, H., Vasudevan, A., Warder, S. E. (2018) Emerging approaches for the identification of protein targets of small molecules - a practitioners' perspective. J. Med. Chem. 61, 8504–8535 [DOI] [PubMed] [Google Scholar]
2. Simon, G. M., Niphakis, M. J., Cravatt, B. F. (2013) Determining target engagement in living systems. Nat. Chem. Biol. 9, 200–205 [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Bunnage, M. E., Chekler, E. L. P., Jones, L. H. (2013) Target validation using chemical probes. Nat. Chem. Biol. 9, 195–199 [DOI] [PubMed] [Google Scholar]
4. Arrowsmith, C. H., Audia, J. E., Austin, C., Baell, J., Bennett, J., Blagg, J., Bountra, C., Brennan, P. E., Brown, P. J., Bunnage, M. E., Buser-Doepner, C., Campbell, R. M., Carter, A. J., Cohen, P., Copeland, R. A., Cravatt, B., Dahlin, J. L., Dhanak, D., Edwards, A. M., Frederiksen, M., Frye, S. V., Gray, N., Grimshaw, C. E., Hepworth, D., Howe, T., Huber, K. V. M., Jin, J., Knapp, S., Kotz, J. D., Kruger, R. G., Lowe, D., Mader, M. M., Marsden, B., Mueller-Fahrnow, A., Müller, S., O'Hagan, R.C., Overington, J.P., Owen, D.R., Rosenberg, S.H., Ross, R., Roth, B., Schapira, M., Schreiber, S.L., Shoichet, B., Sundström, M., Superti-Furga, G., Taunton, J., Toledo-Sherman, L., Walpole, C., Walters, M.A., Willson, T.M., Workman, P., Young, R.N., Zuercher, W.J. (2015) The promise and peril of chemical probes. Nat. Chem. Biol. 11, 536–541 [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Savitski, M. M., Reinhard, F. B. M., Franken, H., Werner, T., Savitski, M. F., Eberhard, D., Martinez Molina, D., Jafari, R., Dovega, R. B., Klaeger, S., Kuster, B., Nordlund, P., Bantscheff, M., Drewes, G. (2014) Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 346, 1255784. [DOI] [PubMed] [Google Scholar]
6. Franken, H., Mathieson, T., Childs, D., Sweetman, G. M., Werner, T., Tögel, I., Doce, C., Gade, S., Bantscheff, M., Drewes, G., Reinhard, F. B. M., Huber, W., Savitski, M. M. (2015) Thermal proteome profiling for unbiased identification of direct and indirect drug targets using multiplexed quantitative mass spectrometry. Nat. Protocols 10, 1567–1593 [DOI] [PubMed] [Google Scholar]
7. Martinez Molina, D., Jafari, R., Ignatushchenko, M., Seki, T., Larsson, E. A., Dan, C., Sreekumar, L., Cao, Y., Nordlund, P. (2013) Monitoring drug target engagement in cells and tissues using the cellular thermal shift assay. Science 341, 84–87 [DOI] [PubMed] [Google Scholar]
8. Tan, C. S. H., Go, K. D., Bisteau, X., Dai, L., Yong, C. H., Prabhu, N., Ozturk, M. B., Lim, Y. T., Sreekumar, L., Lengqvist, J., Tergaonkar, V., Kaldis, P., Sobota, R. M., Nordlund, P. (2018) Thermal proximity coaggregation for system-wide profiling of protein complex dynamics in cells. Science 359, 1170–1177 [DOI] [PubMed] [Google Scholar]
9. Reinhard, F. B. M., Eberhard, D., Werner, T., Franken, H., Childs, D., Doce, C., Savitski, M. F., Huber, W., Bantscheff, M., Savitski, M. M., Drewes, G. (2015) Thermal proteome profiling monitors ligand interactions with cellular membrane proteins. Nat. Methods 12, 1129–1131 [DOI] [PubMed] [Google Scholar]
10. Huber, K. V. M., Olek, K. M., Müller, A. C., Tan, C. S. H., Bennett, K. L., Colinge, J., Superti-Furga, G. (2015) Proteome-wide drug and metabolite interaction mapping by thermal-stability profiling. Nat. Methods 12, 1055–1057 [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Becher, I., Werner, T., Doce, C., Zaal, E. A.Tögel, I., Khan, C.A., Rüger, A., Mülbaier, M., Salzer, E., Berkers, C.R., Fitzpatrick, P.F., Bantscheff, M., Savitski, M. M. (2016) Thermal profiling reveals phenylalanine hydroxylase as an off-target of panobinostat. Nat. Chem. Biol. 12, 908–910 [DOI] [PubMed] [Google Scholar]
12. Mateus, A., Bobonis, J., Kurzawa, N., Stein, F., Helm, D., Hevler, J., Typas, A., Savitski, M. M. (2018) Thermal proteome profiling in bacteria: probing protein state in vivo. Mol. Syst. Biol. 14, 8242. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Dziekan, J. M., Yu, H., Chen, D., Dai, L., Wirjanata, G., Larsson, A., Prabhu, N., Sobota, R. M., Bozdech, Z., Nordlund, P. (2019) Identifying purine nucleoside phosphorylase as the target of quinine using cellular thermal shift assay. Sci. Translational Med. 11, 3174. [DOI] [PubMed] [Google Scholar]
14. Türkowsky, D., Lohmann, P., Mühlenbrink, M., Schubert, T., Adrian, L., Goris, T., Jehmlich, N., von Bergen, M. (2019) Thermal proteome profiling allows quantitative assessment of interactions between tetrachloroethene reductive dehalogenase and trichloroethene. J. Proteomics 192, 10–17 [DOI] [PubMed] [Google Scholar]
15. Dai, L., Zhao, T., Bisteau, X., Sun, W., Prabhu, N., Lim, Y. T., Sobota, R. M., Kaldis, P., Nordlund, P. (2018) Modulation of protein-interaction states through the cell cycle. Cell 173, 1481–1494 [DOI] [PubMed] [Google Scholar]
16. Becher, I., Andrés-Pons, A., Romanov, N., Stein, F., Schramm, M., Baudin, F., Helm, D., Kurzawa, N., Mateus, A., Mackmull, M.-T., Typas, A., Müller, C. W., Bork, P., Beck, M., Savitski, M. M. (2018) Pervasive protein thermal stability variation during the cell cycle. Cell 173, 1495–1507 [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Azimi, A., Caramuta, S., Seashore-Ludlow, B., Boström, J., Robinson, J. L., Edfors, F., Tuominen, R., Kemper, K., Krijgsman, O., Peeper, D. S., Nielsen, J., Hansson, J., Egyhazi Brage, S., Altun, M., Uhlen, M., Maddalo, G. (2018) Targeting CDK2 overcomes melanoma resistance against BRAF and Hsp90 inhibitors. Mol. Systems Biol. 14, 7858. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Miettinen, T. P., Peltier, J., Härtlova, A., Gierliński, M., Jansen, V. M., Trost, M., Björklund, M. (2018) Thermal proteome profiling of breast cancer cells reveals proteasomal activation by CDK4/6 inhibitor palbociclib. EMBO J. 37, 98359. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Savitski, M. M., Zinn, N., Fälth-Savitski, M., Pöckel, D., Gade, S., Becher, I., Mülbaier, M., Wagner, A. J., Strohmer, K., Werner, T., Melchert, S., Petretich, M., Rutkowska, A., Vappiani, J., Franken, H., Steidel, M., Sweetman, G. M., Gilan, O., Lam, E. Y. N., Dawson, M. A., Prinjha, R. K., Grandi, P., Bergamini, G., Bantscheff, M. (2018) Multiplexed proteome dynamics profiling reveals mechanisms controlling protein homeostasis. Cell 173, 260–274 [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Childs, D., Kurzawa, N., Franken, H., Doce, C., Savitski, M., Huber, W (2019) TPP: Analyze Thermal Proteome Profiling (TPP) Experiments. R package version 3.12.0 [Google Scholar]
21. Volkening, J. (2017) Mstherm: Analyze MS/MS Protein Melting Data. R package version 0.4.7. https://CRAN.R-project.org/package=mstherm
22. Ramsey, J. O., Silverman, B. W. (2005) Functional Data Analysis 2nd ed., Springer; New York. [Google Scholar]
23. Wang, J. L., Chiou, J. M., Müller, H. G. (2016) Functional data analysis. Ann. Rev. Statistics Appl. 3, 257–295 [Google Scholar]
24. DeGroot, M. H., Schervish, M. J. (2012) Probability and Statistics 4th edn, Pearson Education; Boston. [Google Scholar]
25. Venables, W. N., Ripley, B. D. (2002) Modern Applied Statistics with S, 4th ed. Springer, New York. [Google Scholar]
26. Benjamini, Y., Hochberg, Y. (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Statistical Soc. 57, 289–300 [Google Scholar]
27. Ward, N. E., O'Brian, C.A. (1992) Kinetic analysis of protein kinase C inhibition by staurosporine: evidence that inhibition entails inhibitor binding at a conserved region of the catalytic domain but not competition with substrates. Mol. Pharmacol. 41, 387–392 [PubMed] [Google Scholar]
28. Leung, K. K. K., Shilton, B. H. (2015) Quinone reductase 2 is an adventitious target of protein kinase CK2 inhibitors TBBz (TBI) and DMAT. Biochemistry 54, 47–59 [DOI] [PubMed] [Google Scholar]
29. Winger, J. A., Hantschel, O., Superti-Furga, G., Kuriyan, J. (2009) The structure of the leukemia drug imatinib bound to human quinone reductase 2 (NQO2). BMC Structural Biol. 9, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Bantscheff, M., Eberhard, D., Abraham, Y., Bastuck, S., Boesche, M., Hobson, S., Mathieson, T., Perrin, J., Raida, M., Rau, C., Reader, V., Sweetman, G. M. A., Bauer, A., Bouwmeester, T., Hopf, C., Kruse, U., Neubauer, G., Ramsden, N. G., Rick, J., Kuster, B., Drewes, G. (2007) Quantitative chemical proteomics reveals mechanisms of action of clinical ABL kinase inhibitors. Nat. Biotechnol. 25, 1035–1044 [DOI] [PubMed] [Google Scholar]
31. Miettinen, T. P., Björklund, M. (2014) NQO2 is a reactive oxygen species generating off-target for acetaminophen. Mol. Pharmaceutics 11, 4395–4404 [DOI] [PMC free article] [PubMed] [Google Scholar]
32. Bourgon, R. W. (2006) Chromatin immunoprecipitation and high-density tiling microarrays: a generative model, methods for analysis, and methodology assessment in the absence of a “gold standard”. PhD thesis, Citeseer. [Google Scholar]
33. Loader, C. (1999) Local Regression and Likelihood. Springer; New York. [Google Scholar]
34. Storey, J. D., Xiao, W., Leek, J. T., Tompkins, R. G., Davis, R. W. (2005) Significance analysis of time course microarray experiments. Proc. Natl. Acad. Sci. USA 102, 12837–12842 [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Berk, M., Ebbels, T., Montana, G. (2011) A statistical framework for biomarker discovery in metabolomic time course data. Bioinformatics 27, 1979–1985 [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Childs, D., Kurzawa, N. (2019) Nonparametric analysis of thermal proteome profiles: Workflow. https://git.embl.de/childs/TPP-data-analysis [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplemental Figs. S6 and S7

TIR119.001481_index.html^{(1.6KB, html)}

Supplementary Methods

144658_1_supp_388240_px6jjs.pdf^{(169.9KB, pdf)}

Table S1

144658_1_supp_388212_px67md.xlsx^{(5.3MB, xlsx)}

Supplementary Figures S1-S5

144658_1_supp_388213_px6cr4.pdf^{(270.2KB, pdf)}

Supplementary Figure S6

144658_1_supp_388214_px6cr4.pdf^{(145.4KB, pdf)}

Supplementary Figure S7

144658_1_supp_388215_px6cr4.pdf^{(131.9KB, pdf)}

Data Availability Statement

[B1] 1. Comess, K. M., McLoughlin, S. M., Oyer, J. A., Richardson, P. L., Stöckmann, H., Vasudevan, A., Warder, S. E. (2018) Emerging approaches for the identification of protein targets of small molecules - a practitioners' perspective. J. Med. Chem. 61, 8504–8535 [DOI] [PubMed] [Google Scholar]

[B2] 2. Simon, G. M., Niphakis, M. J., Cravatt, B. F. (2013) Determining target engagement in living systems. Nat. Chem. Biol. 9, 200–205 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3. Bunnage, M. E., Chekler, E. L. P., Jones, L. H. (2013) Target validation using chemical probes. Nat. Chem. Biol. 9, 195–199 [DOI] [PubMed] [Google Scholar]

[B4] 4. Arrowsmith, C. H., Audia, J. E., Austin, C., Baell, J., Bennett, J., Blagg, J., Bountra, C., Brennan, P. E., Brown, P. J., Bunnage, M. E., Buser-Doepner, C., Campbell, R. M., Carter, A. J., Cohen, P., Copeland, R. A., Cravatt, B., Dahlin, J. L., Dhanak, D., Edwards, A. M., Frederiksen, M., Frye, S. V., Gray, N., Grimshaw, C. E., Hepworth, D., Howe, T., Huber, K. V. M., Jin, J., Knapp, S., Kotz, J. D., Kruger, R. G., Lowe, D., Mader, M. M., Marsden, B., Mueller-Fahrnow, A., Müller, S., O'Hagan, R.C., Overington, J.P., Owen, D.R., Rosenberg, S.H., Ross, R., Roth, B., Schapira, M., Schreiber, S.L., Shoichet, B., Sundström, M., Superti-Furga, G., Taunton, J., Toledo-Sherman, L., Walpole, C., Walters, M.A., Willson, T.M., Workman, P., Young, R.N., Zuercher, W.J. (2015) The promise and peril of chemical probes. Nat. Chem. Biol. 11, 536–541 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5. Savitski, M. M., Reinhard, F. B. M., Franken, H., Werner, T., Savitski, M. F., Eberhard, D., Martinez Molina, D., Jafari, R., Dovega, R. B., Klaeger, S., Kuster, B., Nordlund, P., Bantscheff, M., Drewes, G. (2014) Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 346, 1255784. [DOI] [PubMed] [Google Scholar]

[B6] 6. Franken, H., Mathieson, T., Childs, D., Sweetman, G. M., Werner, T., Tögel, I., Doce, C., Gade, S., Bantscheff, M., Drewes, G., Reinhard, F. B. M., Huber, W., Savitski, M. M. (2015) Thermal proteome profiling for unbiased identification of direct and indirect drug targets using multiplexed quantitative mass spectrometry. Nat. Protocols 10, 1567–1593 [DOI] [PubMed] [Google Scholar]

[B7] 7. Martinez Molina, D., Jafari, R., Ignatushchenko, M., Seki, T., Larsson, E. A., Dan, C., Sreekumar, L., Cao, Y., Nordlund, P. (2013) Monitoring drug target engagement in cells and tissues using the cellular thermal shift assay. Science 341, 84–87 [DOI] [PubMed] [Google Scholar]

[B8] 8. Tan, C. S. H., Go, K. D., Bisteau, X., Dai, L., Yong, C. H., Prabhu, N., Ozturk, M. B., Lim, Y. T., Sreekumar, L., Lengqvist, J., Tergaonkar, V., Kaldis, P., Sobota, R. M., Nordlund, P. (2018) Thermal proximity coaggregation for system-wide profiling of protein complex dynamics in cells. Science 359, 1170–1177 [DOI] [PubMed] [Google Scholar]

[B9] 9. Reinhard, F. B. M., Eberhard, D., Werner, T., Franken, H., Childs, D., Doce, C., Savitski, M. F., Huber, W., Bantscheff, M., Savitski, M. M., Drewes, G. (2015) Thermal proteome profiling monitors ligand interactions with cellular membrane proteins. Nat. Methods 12, 1129–1131 [DOI] [PubMed] [Google Scholar]

[B10] 10. Huber, K. V. M., Olek, K. M., Müller, A. C., Tan, C. S. H., Bennett, K. L., Colinge, J., Superti-Furga, G. (2015) Proteome-wide drug and metabolite interaction mapping by thermal-stability profiling. Nat. Methods 12, 1055–1057 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11. Becher, I., Werner, T., Doce, C., Zaal, E. A.Tögel, I., Khan, C.A., Rüger, A., Mülbaier, M., Salzer, E., Berkers, C.R., Fitzpatrick, P.F., Bantscheff, M., Savitski, M. M. (2016) Thermal profiling reveals phenylalanine hydroxylase as an off-target of panobinostat. Nat. Chem. Biol. 12, 908–910 [DOI] [PubMed] [Google Scholar]

[B12] 12. Mateus, A., Bobonis, J., Kurzawa, N., Stein, F., Helm, D., Hevler, J., Typas, A., Savitski, M. M. (2018) Thermal proteome profiling in bacteria: probing protein state in vivo. Mol. Syst. Biol. 14, 8242. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] 13. Dziekan, J. M., Yu, H., Chen, D., Dai, L., Wirjanata, G., Larsson, A., Prabhu, N., Sobota, R. M., Bozdech, Z., Nordlund, P. (2019) Identifying purine nucleoside phosphorylase as the target of quinine using cellular thermal shift assay. Sci. Translational Med. 11, 3174. [DOI] [PubMed] [Google Scholar]

[B14] 14. Türkowsky, D., Lohmann, P., Mühlenbrink, M., Schubert, T., Adrian, L., Goris, T., Jehmlich, N., von Bergen, M. (2019) Thermal proteome profiling allows quantitative assessment of interactions between tetrachloroethene reductive dehalogenase and trichloroethene. J. Proteomics 192, 10–17 [DOI] [PubMed] [Google Scholar]

[B15] 15. Dai, L., Zhao, T., Bisteau, X., Sun, W., Prabhu, N., Lim, Y. T., Sobota, R. M., Kaldis, P., Nordlund, P. (2018) Modulation of protein-interaction states through the cell cycle. Cell 173, 1481–1494 [DOI] [PubMed] [Google Scholar]

[B16] 16. Becher, I., Andrés-Pons, A., Romanov, N., Stein, F., Schramm, M., Baudin, F., Helm, D., Kurzawa, N., Mateus, A., Mackmull, M.-T., Typas, A., Müller, C. W., Bork, P., Beck, M., Savitski, M. M. (2018) Pervasive protein thermal stability variation during the cell cycle. Cell 173, 1495–1507 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] 17. Azimi, A., Caramuta, S., Seashore-Ludlow, B., Boström, J., Robinson, J. L., Edfors, F., Tuominen, R., Kemper, K., Krijgsman, O., Peeper, D. S., Nielsen, J., Hansson, J., Egyhazi Brage, S., Altun, M., Uhlen, M., Maddalo, G. (2018) Targeting CDK2 overcomes melanoma resistance against BRAF and Hsp90 inhibitors. Mol. Systems Biol. 14, 7858. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18. Miettinen, T. P., Peltier, J., Härtlova, A., Gierliński, M., Jansen, V. M., Trost, M., Björklund, M. (2018) Thermal proteome profiling of breast cancer cells reveals proteasomal activation by CDK4/6 inhibitor palbociclib. EMBO J. 37, 98359. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19] 19. Savitski, M. M., Zinn, N., Fälth-Savitski, M., Pöckel, D., Gade, S., Becher, I., Mülbaier, M., Wagner, A. J., Strohmer, K., Werner, T., Melchert, S., Petretich, M., Rutkowska, A., Vappiani, J., Franken, H., Steidel, M., Sweetman, G. M., Gilan, O., Lam, E. Y. N., Dawson, M. A., Prinjha, R. K., Grandi, P., Bergamini, G., Bantscheff, M. (2018) Multiplexed proteome dynamics profiling reveals mechanisms controlling protein homeostasis. Cell 173, 260–274 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B20] 20. Childs, D., Kurzawa, N., Franken, H., Doce, C., Savitski, M., Huber, W (2019) TPP: Analyze Thermal Proteome Profiling (TPP) Experiments. R package version 3.12.0 [Google Scholar]

[B21] 21. Volkening, J. (2017) Mstherm: Analyze MS/MS Protein Melting Data. R package version 0.4.7. https://CRAN.R-project.org/package=mstherm

[B22] 22. Ramsey, J. O., Silverman, B. W. (2005) Functional Data Analysis 2nd ed., Springer; New York. [Google Scholar]

[B23] 23. Wang, J. L., Chiou, J. M., Müller, H. G. (2016) Functional data analysis. Ann. Rev. Statistics Appl. 3, 257–295 [Google Scholar]

[B24] 24. DeGroot, M. H., Schervish, M. J. (2012) Probability and Statistics 4th edn, Pearson Education; Boston. [Google Scholar]

[B25] 25. Venables, W. N., Ripley, B. D. (2002) Modern Applied Statistics with S, 4th ed. Springer, New York. [Google Scholar]

[B26] 26. Benjamini, Y., Hochberg, Y. (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Statistical Soc. 57, 289–300 [Google Scholar]

[B27] 27. Ward, N. E., O'Brian, C.A. (1992) Kinetic analysis of protein kinase C inhibition by staurosporine: evidence that inhibition entails inhibitor binding at a conserved region of the catalytic domain but not competition with substrates. Mol. Pharmacol. 41, 387–392 [PubMed] [Google Scholar]

[B28] 28. Leung, K. K. K., Shilton, B. H. (2015) Quinone reductase 2 is an adventitious target of protein kinase CK2 inhibitors TBBz (TBI) and DMAT. Biochemistry 54, 47–59 [DOI] [PubMed] [Google Scholar]

[B29] 29. Winger, J. A., Hantschel, O., Superti-Furga, G., Kuriyan, J. (2009) The structure of the leukemia drug imatinib bound to human quinone reductase 2 (NQO2). BMC Structural Biol. 9, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B30] 30. Bantscheff, M., Eberhard, D., Abraham, Y., Bastuck, S., Boesche, M., Hobson, S., Mathieson, T., Perrin, J., Raida, M., Rau, C., Reader, V., Sweetman, G. M. A., Bauer, A., Bouwmeester, T., Hopf, C., Kruse, U., Neubauer, G., Ramsden, N. G., Rick, J., Kuster, B., Drewes, G. (2007) Quantitative chemical proteomics reveals mechanisms of action of clinical ABL kinase inhibitors. Nat. Biotechnol. 25, 1035–1044 [DOI] [PubMed] [Google Scholar]

[B31] 31. Miettinen, T. P., Björklund, M. (2014) NQO2 is a reactive oxygen species generating off-target for acetaminophen. Mol. Pharmaceutics 11, 4395–4404 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B32] 32. Bourgon, R. W. (2006) Chromatin immunoprecipitation and high-density tiling microarrays: a generative model, methods for analysis, and methodology assessment in the absence of a “gold standard”. PhD thesis, Citeseer. [Google Scholar]

[B33] 33. Loader, C. (1999) Local Regression and Likelihood. Springer; New York. [Google Scholar]

[B34] 34. Storey, J. D., Xiao, W., Leek, J. T., Tompkins, R. G., Davis, R. W. (2005) Significance analysis of time course microarray experiments. Proc. Natl. Acad. Sci. USA 102, 12837–12842 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B35] 35. Berk, M., Ebbels, T., Montana, G. (2011) A statistical framework for biomarker discovery in metabolomic time course data. Bioinformatics 27, 1979–1985 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] 36. Childs, D., Kurzawa, N. (2019) Nonparametric analysis of thermal proteome profiles: Workflow. https://git.embl.de/childs/TPP-data-analysis [DOI] [PMC free article] [PubMed]

PERMALINK

Nonparametric Analysis of Thermal Proteome Profiles Reveals Novel Drug-binding Proteins*

Dorothee Childs

Karsten Bach

Holger Franken

Simon Anders

Nils Kurzawa

Marcus Bantscheff

Mikhail M Savitski

Wolfgang Huber

Graphical Abstract

Abstract

Fig. 1.

Table I. Datasets and sample sizes.

EXPERIMENTAL PROCEDURES

Data Sets and Preprocessing

Curation of Lists of Expected Targets

Table II. Expected targets per dataset.

Mathematical Model

Fig. 2.

Choice of the Mean Function

Hypothesis Test Statistic

Null Distribution

p Values

RESULTS

Application to Panobinostat

Fig. 3.

Beyond Two-group Comparisons

Replicate Agreement and Model Fit Diagnostics

Table III. A priori filters applied in the original TPP analysis workflow [6] to select proteins for hypothesis testing.

Table IV. Coverage of proteins applicable for hypothesis testing by the original TPP analysis workflow [6].

Effects Beyond Those on the Melting Point

Fig. 4.

Missing Melting Point Estimates

Sensitivity and Specificity

Fig. 5.

Table V. Decision ruleset applied in the original TPP analysis workflow [6] to combine z-test p values across replicates in an experimental design with two biological replicates.

DISCUSSION

Additional Files

DATA AVAILABILITY

Supplementary Material

Acknowledgments

Footnotes

REFERENCES

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Nonparametric Analysis of Thermal Proteome Profiles Reveals Novel Drug-binding Proteins^*