Skip to main content
eLife logoLink to eLife
. 2020 Aug 6;9:e57264. doi: 10.7554/eLife.57264

How to measure and evaluate binding affinities

Inga Jarmoskaite 1,†,, Ishraq AlSadhan 1,, Pavanapuresan P Vaidyanathan 1,†,, Daniel Herschlag 1,2,3,
Editors: Sebastian Deindl4, John Kuriyan5
PMCID: PMC7452723  PMID: 32758356

Abstract

Quantitative measurements of biomolecule associations are central to biological understanding and are needed to build and test predictive and mechanistic models. Given the advances in high-throughput technologies and the projected increase in the availability of binding data, we found it especially timely to evaluate the current standards for performing and reporting binding measurements. A review of 100 studies revealed that in most cases essential controls for establishing the appropriate incubation time and concentration regime were not documented, making it impossible to determine measurement reliability. Moreover, several reported affinities could be concluded to be incorrect, thereby impacting biological interpretations. Given these challenges, we provide a framework for a broad range of researchers to evaluate, teach about, perform, and clearly document high-quality equilibrium binding measurements. We apply this framework and explain underlying fundamental concepts through experimental examples with the RNA-binding protein Puf4.

Research organism: None

Introduction

Molecular associations lie at the heart of biology. Their thermodynamics provides information critical for deriving a fundamental understanding of molecular functions. In a broader biological context, these associations are linked and interconnected in complex networks that allow sensitive and precise developmental programs and responses to environmental cues, and that are altered in disease states. The outputs of pathways and networks are determined by the quantitative interplay of their many constituent molecules and interactions. Thus, equilibrium constants for association between network components are needed to define, model, predict, and ultimately precisely manipulate biology.

A limitation of traditional biochemical measurements is their low throughput, especially in relation to the large number of cellular interactions. Excitingly, several strategies have recently emerged to obtain high-throughput, quantitative information for intermolecular associations (e.g. Buenrostro et al., 2014; Tome et al., 2014; Lambert et al., 2014; Nutiu et al., 2011; Maerkl and Quake, 2007; Adams et al., 2016; Jain et al., 2017). Given these potentially transformative advances, it is especially timely to assess the accuracy of equilibrium binding measurements. We wanted to know whether current practices are sufficient to ensure reliable and accurate measurements, and whether the reliability of these measurements can be readily ascertained from the information provided in published work.

Our survey of 100 literature binding measurements, presented below, uncovered recurring problems with a large majority of studies. Fortunately, there are straightforward procedures, laid out here, that can be followed to ensure that published binding measurements are reliable. The principles underlying these procedures have been discussed and we build on these previous reports (Pollard, 2010; Hulme and Trevethick, 2010; Sanders, 2010). We focus on a minimal set of critical actionable steps and controls that biologists of any background should be able to implement in their binding measurements. We apply these procedures with experimental examples and also demonstrate the pitfalls of omitting essential controls. To further streamline application of these standard procedures, we provide a convenient checklist that can organize and guide experiments and can be used as an aid in summarizing and presenting results for publication.

Results

Assessing the current state of binding measurements

We evaluated published binding measurements using RNA-protein interactions as an illustrative example. We surveyed 100 studies that reported equilibrium dissociation constants (KD values) and scored them based on two key criteria for reliable binding measurements: sufficient time to equilibration and proper concentration regime (Figure 1).

Figure 1. Assessment of published KD values for RNA-binding proteins.

We analyzed 100 papers reporting KD or ‘apparent KD’ values of RNA/protein interactions. Measurements were evaluated based on two criteria: demonstrating equilibration (horizontal axis) and controlling for titration (vertical axis). Detailed criteria are described in Materials and methods, and the source data are provided in Supplementary file 1. The right column includes predominantly studies that used ITC and SPR, techniques that inherently record binding progress over time (24/30 in this column). The fraction of studies that varied time to demonstrate equilibration in non-ITC/SPR experiments is considerably smaller (6 of the 76 papers that did not exclusively use ITC or SPR, or <10%).

Figure 1.

Figure 1—figure supplement 1. Survey of incubation times for published equilibrium dissociation constants.

Figure 1—figure supplement 1.

(A) Percentages of publications that did or did not report and vary the incubation time. The light gray portion of the first column indicates the studies using SPR and ITC, techniques in which time is varied by default. (B) Incubation times in papers that reported a single time.
Figure 1—figure supplement 2. Survey of titration controls in published binding studies.

Figure 1—figure supplement 2.

(A) Percentages of publications that did (blue) or did not (red) control for titration effects. The first category includes studies that systematically varied the limiting component concentration to rule out titration. Studies that reported using an appropriate concentration regime or analysis methods to minimize the effects of titration (second and third column, respectively) were considered titration controlled; nevertheless, we emphasize the importance of performing and reporting the control experiments described herein, instead of relying on concentrations alone (see section 'Avoid the titration regime'). The ‘Other’ category (n = 7) includes a study that reported KD values as upper limits, recognizing possible titration (n = 1), and studies that only used SPR (n = 6), where the concentration of the immobilized species is difficult to estimate, but mass transport is typically controlled for or accounted for during analysis, as indicated in most surveyed studies. (B) Breakdown of studies that did not report controlling for titration. The first three columns denote studies that assumed negligible concentration of the limiting component in their analysis; however, the reported concentrations and KD values were inconsistent with this assumption, with the ratio of the lowest measured KD value to the limiting component concentration indicated. The ‘Not reported/Other’ category includes studies that did not report the limiting component concentration (n = 4), or used the quadratic equation in a titration regime (limiting component concentration in >1000-fold excess over the KD), incompatible with reliable KD determination (n = 1; see below).

First, we asked if equilibration was demonstrated. By definition, an equilibrium state is invariant with time. So, determining a binding equilibrium constant requires showing that there is no change in the amount of bound complex over time. Of the 100 studies surveyed, 70 did not report varying time for reported equilibrium measurements (Figure 1; Supplementary file 1). Of the 30 studies that did vary time, 24 exclusively used techniques with built-in monitoring of progress over time (isothermal titration calorimetry (ITC) and surface plasmon resonance [SPR]). Of the remaining 76 studies—those using approaches such as native gel shifts, nitrocellulose filter binding, and fluorescence anisotropy—less than 10% reported varying time (Figure 1, Figure 1—figure supplement 1).

We know from individual discussions that some researchers carry out these controls, as we advocate below, but do not report them. Unfortunately, the published record then cannot distinguish between these studies and others that have not demonstrated equilibration.

A second critical control entails demonstrating that the KD is not affected by titration, as artifacts can arise when the concentration of the constant limiting component is too high relative to the dissociation constant (KD). Similar to varying time to establish equilibration, systematically varying the concentration of the limiting component provides a definitive control for effects of titration. In our survey, only 5% of studies reported performing this or equivalent control (Figure 1—figure supplement 2). Nevertheless, most authors appeared to be aware of the need to avoid titration, as the majority of studies (~70%) reported using appropriately low concentrations of the limiting component or employed advanced analysis methods. We consider these examples as reasonably titration-controlled for the purpose of the survey, but emphasize the importance of empirical controls in the sections below. Importantly, this leaves, at a minimum, one-fourth of studies at risk for titration (Figure 1, Figure 1—figure supplement 2).

To what extent do these limitations affect the reported equilibrium binding constants in practice? As an example, for Puf4 binding (see below), not controlling for the factors above gave apparent KD values that were up to seven-fold higher than the actual KD values. A more extreme literature example is discussed in the next section, with discrepancies reaching 1000-fold, and other examples have been previously noted (Hulme and Trevethick, 2010; Strohkendl et al., 2018). There is a tendency to be less careful about controls in pursuit of relative affinities (specificity) rather than absolute affinity. However, failing to account for the factors noted above can also underestimate specificity by orders of magnitude (see Figure 4—figure supplement 1 and Figure 5—figure supplement 4 below).

These observations highlight an urgent need to revisit the criteria for reliable binding measurements. There is a parallel need to render these criteria accessible to a broad range of biologists, regardless of background or training, in the form of clear and readily actionable guidelines. To meet these needs, we provide simple, concrete strategies so that any practitioner can carry out reliable binding measurements, clearly communicate their results, and evaluate results from others.

Fortunately, the key requirements for binding measurements can be broken down into a small number of steps. We present two required steps for equilibrium binding measurements—varying the incubation time (see section 'Vary incubation time to test for equilibration') and controlling for titration (see section 'Avoid the titration regime'), and we illustrate these steps for the example of RNA binding to the Saccharomyces cerevisiae Puf4 protein (Gerber et al., 2004; Miller et al., 2008). We also present additional steps that can be taken to further increase confidence in KD values and to obtain kinetic information about the binding event under investigation (see sections 'Test KD by an independent approach' and 'Determine the fraction of active protein'). Finally, we describe strategies to address cases where no binding is initially detected and explain why it is often premature to conclude an absence of binding (see section 'The case of no observed binding').

Practical considerations

In principle, one would like to have well-behaved and perfectly controlled measurements in all cases, but biology and biochemistry can be messy. There are many times, working with extracts and partially purified systems where protein concentrations cannot be accurately determined, where proteases and nucleases may limit achievable equilibration times, and where there may be additional interacting components. Regardless of these potential complications, the simple steps indicated below can establish the robustness of measured affinities and can diagnose and help overcome issues like loss of activity over time. Moreover, these controls (and quantitative measurements more generally) can help uncover new features and regulatory mechanisms, based on deviations from ‘ideal’ behavior of simple binding equilibria.

Vary incubation time to test for equilibration

The most basic test for whether a binding reaction has reached equilibrium is that the fraction of complex formed between two molecules does not change over time. Nevertheless, the majority of papers we surveyed that present binding measurements and report apparent affinities or equilibrium dissociation constants do not report that time has been varied (Figure 1). We first describe two related concepts that will help readers develop an intuition for the time scales of binding processes and we then apply these concepts to Puf4 binding.

Half-life

Binding and other simple kinetic processes, in general, follow exponential curves (Figure 2). The key property of an exponential curve is that it has a constant half-life (t1/2)—that is, the time it takes for the reaction to proceed from 0% to 50% complete, 50% to 75% complete, 75% to 87.5% complete, etc. is the same (Figure 2). After three half-lives, an exponential process is almost 90% complete (3t1/2 = 87.5%; Figure 2), which is close enough to equilibration for most applications. Below we adopt the more common standard of taking reactions to five half-lives, or 96.6% completion; this more conservative standard is safer given that there are multiple sources of potential error in practice.

Figure 2. Exponential kinetics used to estimate the time needed for binding equilibration.

Figure 2.

Arrows indicate reaction half-life t1/2. Fraction bound is defined by the equation 1 - e-t × ln2/t1/2 = 1 - e-t × kequil.

Equilibration rate constant

The equilibration rate constant is effectively the inverse of the binding half-life (kequil = ln2t1/21t1/2) and, importantly, is concentration-dependent. For the binding equilibrium shown in Figure 3, under conditions where one binding partner (here, the protein, P) is in large excess over the other (RNA), the rate equation for approach to equilibrium, kequil, is described as: 

kequil=kon[P]+koff (1)

kon is the association rate constant, [P] is the concentration of protein, or the binding partner in excess, and koff is the dissociation rate constant (Pollard, 2010). According to Equation 1, equilibration is the slowest at the lowest protein concentrations. For this reason, equilibration times need to be established from the low end of the concentration range. In practice, it is useful to consider the limiting case with the protein concentration approaching zero ([P] ~ 0), such that Equation 1 simplifies to Equation 2 (Hulme and Trevethick, 2010):

kequil,limit=koff (2)
Figure 3. Model for one-step, non-cooperative, 1:1 binding between two molecules.

Figure 3.

Protein (P) binding to an RNA (R) molecule is shown for illustrative purposes.

Thus, the more long-lived the complex (i.e. the lower its dissociation rate constant), the longer the incubation time required to reach equilibrium.

What is the range of equilibration times for typical biomolecular interactions? While koff measurements (and, consequently, kequil) are less common in literature than KD measurements, equilibration times can be readily estimated (Sanders, 2010). Given that KD = koffkon (Figure 3) and assuming that the binding of molecules occurs as fast as diffusional collisions (kon = 108 M−1s−1), we can calculate that an interaction with a KD value of 1 pM would require a 10 hr incubation to reach equilibrium, whereas a 1 µM KD interaction would only require 40 ms (Table 1). Notably, binding rate constants for processes involving macromolecules are often smaller than the diffusion driven limit of ~108 M−1s−1, for example when additional conformational rearrangements are required for stabilizing binding after two molecules collide (Karbstein and Herschlag, 2003; Peluso et al., 2000; Wu et al., 2002). As a result, equilibration can take much longer. Thus, equilibration times for two interactions with the same KD value can vary by orders magnitude, and some reactions in the biologically relevant affinity range can require equilibration times of 10s of hr or even longer in vitro (Table 1; Hulme and Trevethick, 2010; Sanders, 2010). These long times underscore that biology has developed mechanisms to circumvent or utilize such slow processes—for example, rapid association may be facilitated by high intracellular concentrations of binding partners, and cellular factors such as molecular chaperones, helicases, chromatin remodelers, or translation can speed up binding and dissociation.

Table 1. Equilibration times (tequil) for different affinities and association rate constants.
K D kon, M−1 s−1 tequil*
sec hr
1 µM 108 0.04
106 4
103 1
1 nM 108 40
106 1
103 1000
1 pM 108 10
106 1000
103 1,000,000

*tequil was calculated as five half-lives: tequil = 5t1/2 = 5 × 0.693/kequil, where kequil = koff = K× kon (Equation 2 and Figure 3).

Implications of insufficient equilibration

Despite the realistic possibility of long equilibration times for biological association events, nearly 90% of the reported incubation times were 1 hr or less (Figure 1—figure supplement 1B). As a concrete example, several ‘equilibrium’ dissociation constants reported for CRISPR nucleases, which are well known for tight RNA and/or DNA binding, were determined from incubations of 1 hr or less (e.g. Semenova et al., 2011; Westra et al., 2012; Westra et al., 2013; Sternberg et al., 2014; O'Connell et al., 2014; Wright et al., 2015; Ma et al., 2015; Jiang et al., 2015; Sternberg et al., 2015; Beloglazova et al., 2015; Rutkauskas et al., 2015; Abudayyeh et al., 2016; Supplementary file 2). But when target dissociation of these proteins was measured over time, it took many hours (Strohkendl et al., 2018; Richardson et al., 2016; Boyle et al., 2017; Raper et al., 2018), suggesting that equilibration takes much longer than an hour and that the reported KD values based on these short incubation times underestimate the true binding strength. In one striking example, kinetic measurements revealed an equilibration time of >100 hr for the Cas12a complex and an equilibrium constant that was 1000-fold lower than previously reported for the same enzyme at similar conditions after much shorter incubation time (Strohkendl et al., 2018). Insufficient incubation times for tight binders may have also led to underestimation of specificity, a topic of central concern for CRISPR targeting (and for much of biology). Figure 4—figure supplement 1 illustrates how target affinities that differ by two orders of magnitude may appear identical if the incubation time is too short.

An example in which extending the incubation time changed the mechanistic interpretation comes from studies of the signal recognition particle (SRP). Originally, the observation that 4.5S RNA enhanced the assembly of the signal recognition particle (SRP) and SRP receptor led to a proposed mechanism in which the 4.5S RNA stabilized the complex. Subsequently, binding studies extended to longer times revealed that the 4.5S RNA accelerated the otherwise slow SRP/receptor binding and dissociation without affecting the binding affinity (Peluso et al., 2000). Exploring the time dependence of the assembly process changed the mechanistic conclusions: 4.5S RNA could be shown to play a catalytic, rather than stabilizing role in SRP/receptor assembly.

Figure 4—figure supplement 1 illustrates how incubation times that are very far from equilibrium can lead to systematic deviations of the data from the fit to an equilibrium binding equation. While a poor fit is not sufficient to diagnose insufficient equilibration (and, conversely, a good fit does not prove complete equilibration), an inability to fit the data well to a simple binding model provides an important indicator that additional controls are required. Only after simple controls for equilibration and titration (see below) have been performed, should more complex binding models, such as cooperativity, be considered, unless such models are independently supported. Indeed, among the studies in our literature survey omitting one or both key controls, several included poorly fit binding curves. Importantly, graphs of fits of the data to a clearly defined equilibrium binding model should be published along with the KD values when possible, and the quality of the fit over the entire concentration range should always be carefully assessed. In summary, the incubation time must be varied to ensure equilibration, ideally across a range of at least 10-fold. Below we illustrate this control, and the need for it, with experimental results for Puf4 binding to its consensus RNA.

Time dependence of Puf4 binding at 25°C and 0°C

To establish the equilibration time for Puf4 binding to its cognate RNA sequence, Puf4 was mixed, over a series of concentrations, with a trace amount of labeled RNA (in this case, 32P-labeled; 0.002–0.016 nM) and incubated for a specified time (t1) (Figure 4A). The fraction of bound RNA was subsequently determined by non-denaturing gel electrophoresis (see Materials and methods).

Figure 4. Establishing equilibration in affinity measurements.

(A) Mixing scheme. RNA*: labeled RNA (here—5´-terminally labeled with 32P). In addition to varying equilibration time t1 (main text), the time and conditions between adding the loading buffer and loading (t2) are controlled (see Appendix 2—note 2). (B, C) Concentration dependence of Puf4 binding at 25°C (B) and at 0°C (C) after different incubation times. Data were collected at protein concentrations greater than or equal to the concentration of labeled RNA (0.002–0.016 nM, indicating the lower and upper limit of labeled RNA concentration; see section 'Avoid the titration regime' and Appendix 2—note 4).

Figure 4.

Figure 4—figure supplement 1. Insufficient equilibration times can lead to incorrect determination of relative affinities.

Figure 4—figure supplement 1.

(A) Binding parameters for protein (P) interactions with two ligands, L1 and L2. The dissociation rate constant (koff) for L1 is 100-fold lower than for L2, such that L1 requires much longer to equilibrate than L2 (Equation 2). (B) Simulated binding data for L1 and L2 with varying incubation times (t1). The binding to each ligand is measured individually with trace amounts of L1 (blue) or L2 (red). Solid lines are fits to an equilibrium binding equation (Equation 4b), with dashed lines indicating the protein concentration at which half of the ligand is bound. Because equilibration of L1 binding is not complete until t1 = 10 hr (while L2 equilibration only takes ~5 min), the observed relative affinity (KDapp(rel) = KD,2app/KD,1app) is time-dependent and underestimates the true specificity if the incubation time is shorter than ~10 hr. Arrows and numbers indicate KDapp(rel) values at each time point. Note the systematic deviations of the simulated data from the fit curve in cases where equilibrium has not been reached. The presence of such deviations in experimental data indicates the need for additional controls to establish equilibration and rule out titration.

At 25°C, we observed the same amount of binding with incubations of t1 = 30 min, 1.5 hr, and 4.5 hr at each protein concentration, providing strong evidence for equilibration even at the shortest time (Figure 4B). Consequently, we can proceed to the next key control at this condition, using an incubation time of ≥30 min.

We also present Puf4 binding results at 0°C as these data provide an example of slow equilibration and because many binding studies report incubations on ice to stabilize binding. Indeed, the results at 0°C were very different than those at 25°C. As shown in Figure 4C, Puf4 bound different amounts of RNA in the 30 min, 1.5 hr, and longer incubations. Not until the incubation was extended to 4.5 hr did the extent of binding level off at the lowest Puf4 concentrations—that is, the amount bound was the same after 4.5 and 24 hr. Consequently, equilibration of Puf4–RNA binding on ice requires at least 4.5 hr, and incubation for only 30 min would give an apparent KD value that is seven times higher than after a 24 hr incubation. Moreover, binding at 0°C was so tight that we were only able to obtain part of the binding curve while maintaining the protein concentrations in excess of labeled RNA (Figure 4C). The importance of this excess to obtain reliable KD values is described in the next section. In the 0°C case and more generally, it is important to re-assess the equilibration time after establishing that binding is in an appropriate concentration regime, as we demonstrate in later sections. Similarly, changes in conditions, such as salt concentration, temperature or pH, can affect both the affinity and the equilibration time and therefore should be accompanied by confirming that equilibration has occurred.

Avoid the titration regime

The most common approach to measuring affinity is to vary the concentration of one component, while keeping the concentration of the other binding partner constant. However, this experimental design is not always sufficient, as there are two limiting regimes, determined by the concentration of the constant component; only one of these concentration regimes allows the KD to be reliably determined, while the other does not.

In the first, ‘binding’ regime, the concentration of the constant (‘trace’) component, R, is well below the dissociation constant ([R]total << KD for the example in Figure 3). In this case, the concentration of the variable component (P in Figure 3) that gives half binding is equal to the KD (Figure 5A). In the other, ‘titration’ regime, the concentration of the constant component is much greater than the KD ([R]total >> KD) so that essentially all added P is depleted from solution due to binding to R, until there is no more free R left to bind. In this case, the concentration of P that gives half binding does not equal or even approximate the KD. Rather, at high excess of R over the KD, the concentration of P that gives half binding is simply half of the concentration of (active) R molecules—a value that can differ from the sought-after KD by orders of magnitude (Figure 5B; Figure 5—figure supplement 1).

Figure 5. Two concentration regimes.

(A) Binding curve for the model in Figure 3 in the ‘binding’ regime—that is, the trace binding partner concentration ([R]total) is much lower than KD and much lower than [P]total (Equation 4b). Here, the KD is simply the protein concentration at which half of the RNA is bound (K1/2, here corresponding to 1 nM). The same simulated binding curve is shown in linear (top) and log (bottom) plots, as both are useful and common in the literature. (B) Binding curve in the ‘titration’ regime, simulated for an interaction with a KD value of 0.01 nM and an [R]total of 2 nM. Although the K1/2 value in this example is identical to the example in Part A, here it does not equal KD, instead exceeding the real KD value by 100-fold.

Figure 5.

Figure 5—figure supplement 1. The effects of RNA (ligand) concentration on observed binding.

Figure 5—figure supplement 1.

(A) Circles indicate simulated data for an interaction with a KD = 10 pM in the presence of RNA concentrations ranging from 100-fold below to 100-fold above the KD. Curves indicate fits of the simulated data to a hyperbolic equation (Equation 4b). For RNA concentrations ≤10-fold below the KD, the data are well explained by a hyperbolic fit, and the protein concentration at which half-saturation occurs (K1/2; indicated with dashed lines for the 0.1 pM RNA curve) is consistent with the KD. Higher RNA concentrations lead to increasing deviations from a hyperbolic fit and have increasing K1/2 values as the RNA concentration increases. (B). The relationship between the observed K1/2 enhancement over the true KD (‘K1/2/KD’) and the total RNA concentration relative to KD (‘[R]total/KD’). K1/2 values were derived from the simulated data in part A using Equation 4b.
Figure 5—figure supplement 2. Fit to the quadratic binding equation becomes less sensitive to differences in KD when the RNA concentration is in large excess over the KD.

Figure 5—figure supplement 2.

Simulated binding curves for RNA/protein interactions of varying affinities are shown in the presence of 1 nM labeled RNA. In this example, KD = 1 pM (1000-fold lower than [R]total) would be essentially impossible to distinguish from KD = 0.1 pM (10,000-fold lower than [R]total) and from even lower KD values because of the nearly identical binding curves. To accurately measure KD = 10 pM (100-fold lower than [RNA]) it would be critical to have a large number of data points in the narrow protein concentration range that distinguishes this curve from weaker and especially from stronger binders (inset).
Figure 5—figure supplement 3. Application of the hyperbolic (Equation 4b) and quadratic (Equation 5) binding equations to simulated binding data with increasing noise levels.

Figure 5—figure supplement 3.

All binding curves are for an RNA-protein interaction with a KD of 0.1 nM, measured in the presence of different RNA concentrations (0.001–100 nM) and with increasing levels of random noise in the fraction bound (standard deviation of 0.01–0.2). Ten datasets were simulated per condition and noise level and were individually fit to Equation 4b (leftmost column) or Equation 5 (the remaining columns) to determine the KD. The binding curves are shown as black lines, and the overlaid white circles indicate the expected fractions bound if the data were not affected by noise, with error bars indicating the standard deviation. The fit KD values for each of the 10 simulated datasets are shown below each set of binding curves, and the error bars indicate the 95% confidence intervals (CIs) of the KD. Gray bars indicate that the KD could not be determined from a quadratic fit. CIs that extend beyond the axis limits indicate that the lower limit of the KD was not defined. Note that with increasing noise and increasing RNA concentration the KD values derived from the quadratic fits become increasingly poorly constrained, particularly the lower CIs. By contrast, using the binding regime and Equation 4b to fit the data (leftmost column) consistently yields well-defined KD values, even with substantial noise.
Figure 5—figure supplement 4. Effects of trace binding partner concentration on apparent relative affinities.

Figure 5—figure supplement 4.

(A) Affinities of protein P for ligands L1 and L2. (B) Simulated equilibrium binding curves. Binding to each ligand is measured individually with different concentrations of labeled ligand (L1* or L2*). Solid lines are fits to Eq. 4b, with dashed lines indicating the protein concentration at which half of the ligand is bound (corresponding to KD in Equation 4b). Arrows and numbers indicate apparent KD(rel) values at each concentration of L (KDapp(rel) = KD,2app/KD,1app; with KD,1app and KD,2app derived using Equation 4b). There is a pronounced dependence of apparent relative affinity on ligand concentration if [L] is not much lower than the KD for the most tightly bound ligand among the ligands being compared. If sufficiently low ligand concentrations are not accessible, Equation 5 should be used and results may be less reliable (see section 'Avoid the titration regime' of main text).
Figure 5—figure supplement 5. Concentration regimes that do not (A) and do (B) affect the determination of equilibrium binding constants.

Figure 5—figure supplement 5.

(A) Labeled RNA concentration is much lower than KD ([R*]total << KD; binding regime). (B) Labeled RNA concentration is greater than KD ([R*]total > KD; intermediate regime). In parts (A) and (B), concentrations are indicated schematically by the number of RNA (R*, red), protein (P, light blue) molecules and RNA-protein complexes (P●R*) shown. In each case, protein concentration is varied (6, 18, 54, 400 arbitrary units), and KD equals 18 (in the same units). The total RNA concentration is 4 (A) and 36 (B). (C) Protein concentration dependence of binding in each of the above regimes. In the binding regime (green, [R*]total << KD from part A), the protein concentration at which half of the RNA is bound corresponds to the KD. In contrast, in the intermediate regime (purple, [R*]total > KD from part B), a greater protein concentration is required to achieve half-saturation (40 vs. 18 arbitrary units). The discrepancy would further increase with higher RNA concentrations, as shown in Figure 5—figure supplement 1. We can understand the origin of this discrepancy as follows. In part (A), the RNA concentration (red) is below the KD value and below the protein concentration (blue), such that the free concentration of the protein is essentially unchanged after RNA binding at both saturating (complete binding of RNA) and sub-saturating protein concentrations. Changing the RNA concentration in this regime would not change the fraction of RNA bound at a given total protein concentration, as long as the [R*]total << KD condition remains met. On the contrary, in part (B), the RNA concentration exceeds the dissociation constant (KD) and is high enough that a large fraction of the total protein is bound by RNA. Thus, the free protein concentration, which determines the extent of binding according to Equation 4a, is depleted and can no longer be approximated by the total protein concentration in Equation 4b to obtain an accurate KD value. On the molecular scale, the lowered free protein results in less binding. Consequently, for a given KD, more protein is required to achieve half-saturation at higher RNA concentration than with a trace concentration of RNA. Intuitively, at a concentration of RNA that is greater than KD there simply isn’t enough protein to occupy half the RNA when the total protein concentration is equal to KD.

A potentially useful intermediate regime exists between the two extremes, with limiting component concentrations similar to or in modest excess over the KD. The KD can be determined in this regime by using an appropriate binding equation, although with potential pitfalls (see below).

Distinguishing between concentration regimes

The challenge is that distinguishing between the regimes requires the knowledge of the KD, and consequently it is impossible to know a priori which regime holds. A useful rule of thumb for avoiding the titration regime is to always maintain the concentration of the excess binding partner significantly above that of the trace limiting partner. The reason for this can be gleaned from the equation that describes the fraction of bound RNA for the simple binding scheme of Figure 3:

Fraction bound=[P]free[P]free+KD (4a)

Here [P]free is the unbound protein concentration and KD is simply the free protein concentration at which half of the RNA is bound. But while Equation 4a holds universally, in practice we only know the total concentration of P, [P]total—how much we added to the solution—not the free concentration ([P]free). Therefore, we want to operate under simplifying conditions where [P]free ≈ [P]total so that we can substitute [P]total into Equation 4a to give Equation 4b:

Fraction bound=[P]total[P]total+KD (4b)

The condition [P]free ≈ [P]total holds true if P is in large excess of RNA across the entire experiment, meaning that only a small fraction of total protein is used up by binding to RNA. Most importantly, this condition must hold for the protein concentration that gives half-saturation to determine the KD; hence the requirement for the binding regime that the concentration of the limiting component must be <<KD. Nevertheless, simply maintaining an excess of protein over the limiting component may not always be sufficient to maintain a binding regime, given the uncertainty often surrounding concentration measurements and even greater uncertainty surrounding active concentrations.

In principle, a more complex quadratic binding equation provides an alternative to working under the [P]free ≈ [P]total assumption, as it explicitly accounts for bound protein:

Fraction bound=([R]total+ [P]total + KD) ([R]total+ [P]total + KD)2 4 × [R]total × [P]total2×[R]total (5)

Indeed, several techniques (most notably ITC) commonly operate outside the binding regime and rely on Equation 5 (or equivalent formulations) for data fitting. Importantly, the quadratic equation is only applicable to the intermediate and binding regimes, but not the titration regime. The reason for this is that at very high concentrations relative to the KD, the contribution of KD in determining the fraction bound (Equation 5) becomes negligible, and as a result a meaningful KD value cannot be extracted from the fit to the binding data. Simulated data in Figure 5—figure supplements 2 and 3 illustrate this limitation. Consequently, even when using Equation 5, the concentration of the limiting component should be kept to a minimum to avoid the titration regime.

Where does the intermediate regime end and titration begin? The answer depends on the technique and the quality of the data. For ITC measurements, which provide highly precise information for each added binding aliquot, up to 1000-fold excess of the limiting species over the measured KD can be acceptable (Velázquez-Campoy et al., 2004). However, in most other cases, this limit is much lower. Simulations in Figure 5—figure supplement 3 suggest that up to ~10-fold excess consistently allows for reasonably well-defined KD values in the presence of typical binding data, and up to 100-fold excess can be useful for data with minimal noise. In contrast, performing the experiments in the binding regime (fit with Equation 4b) yields well-defined KD values even with substantial noise in the data (Figure 5—figure supplement 3).

Implications of the titration regime

Of the 100 literature studies we surveyed, most (65%) determined KD values under the assumption of the binding regime, by using Equation 4b or equivalent analysis. Nevertheless, the required condition that the limiting species concentration be <<KD was not always supported. One-third of the studies using Equation 4b (n = 21) reported KD values that were comparable to (<10-fold excess) the concentration of the trace component, including nine studies in which the reported KD was indistinguishable from (within ~2-fold) or even below the stated trace component concentration, consistent with an intermediate or even titration regime (Figure 1—figure supplement 2).

The implication in all these cases is that the reported KD values may underestimate the real affinities. Unfortunately, it is difficult to determine the extent of this underestimation post-factum without further experimental controls. To understand why, recall from the example in Figure 5B that in the titration regime the midpoint of the binding curve only reflects ~half the concentration of the limiting species, which sets a lower limit to the apparent KD derived from Equation 4b, even if the real KD is much lower. Conversely, if the midpoint of the binding curve (and the reported KD in the above cases) is approximately the same as the limiting component concentration (allowing for some uncertainty in the concentration), the real KD could be anything below this value, from several-fold to many orders of magnitude less. As with insufficient incubation, systematic deviations of the data from the fit to Equation 4b can be a clear indicator that the apparent KD is limited by titration, but a good fit should not be considered sufficient to prove the binding regime, as experimental uncertainties and other causes can mask deviations.

High-affinity interactions are most susceptible to titration, a corollary of the simple fact that for very low KD values it becomes increasingly difficult to maintain concentrations much lower than KD while still allowing for detection. Since CRISPR nucleases represent some of the most widely studied high-affinity binders, we surveyed a sample of studies to determine the concentration regime under which the reported KD values were measured (Supplementary file 2). Of the 15 studies, the majority (13, or 90%) assumed the binding regime in their analysis, indicated by the use of Equation 4b or equivalent. However, only two of these studies (15%) reported using labeled DNA or RNA concentrations considerably below the apparent KD, and in five cases the lowest reported KD was essentially identical to the labeled RNA or DNA concentration (within ~2-fold), consistent with possible titration.

Importantly, because relative affinities are typically based on the tightest binders, titration effects on the ‘wild-type’ substrate measurements can distort all specificity (relative affinity) values that are based on it. Figure 5—figure supplement 4 illustrates an example, in which two substrates with a 100-fold difference in affinity appear to have identical or near-identical affinities when titration is not controlled for.

Given the impossibility of designing experiments for the binding regime a priori, without knowing the affinity, it is important to rule out titration empirically. Thus, analogously to varying time to establish equilibration, we strongly recommend systematically varying the concentration of the limiting species to establish the binding regime (or, with use of Equation 5, the intermediate regime). The hallmark of a valid KD is that it is not affected by varying the concentration of the limiting component, whereas a titration regime would result in concentration-dependent apparent KD values. At a minimum, this control should always be performed when the measured KD value is comparable to the concentration of the limiting component (Equation 4b), or when Equation 5 yields poorly defined apparent KD values or values much lower than the limiting concentration. Below we demonstrate the titration control for Puf4 affinity measurements.

RNA concentration dependence of Puf4 binding at 25°C and 0°C

We systematically varied the labeled RNA concentration in Puf4 binding experiments at 25°C and 0°C, to illustrate the binding and intermediate regimes, respectively. Figure 5—figure supplement 5 provides a schematic description of the two regimes to help build the reader’s intuition.

At 25°C, the Puf4 binding curves were identical across a nine-fold range of RNA concentrations (Figure 6A,B), and the data were well described by Equation 4b. From the constancy of the binding curves in Figure 6B, we can conclude that the binding regime holds for Puf4 at 25°C, and thus that the observed KD value of 120 pM obtained from Equation 4b represents a true equilibrium constant. As expected for the binding regime, the measured KD is higher than the RNA concentrations (120 pM vs 2–18 pM).

Figure 6. Varying the concentration of the 'trace' binding partner.

Figure 6.

(A) Mixing scheme, as in Figure 4A but now with a series of labeled RNA concentrations. (B) Puf4 binding to different concentrations of 32P-labeled RNA at 25°C. For simplicity, only the lower limits of RNA concentration are indicated; the corresponding upper limits were 15–140 pM RNA (see Materials and methods and Appendix 2—note 4). Incubation time t1 was 0.5 hr, as established in Figure 4B. (C) Puf4 binding to different concentrations of 32P-labeled RNA at 0°C. Lower limits of labeled RNA concentration are indicated. Incubation time t1 was 40 hr. Note that these data are not fit well by Equation 4b, which assumes [R*]total << KD (solid lines). Quadratic fits, which do not assume negligible RNA concentration, are shown in dashed lines (Equation 5). (D) Effect of RNA concentration on apparent KD (KDapp) at 0°C. Red symbols indicate KDapp values from a hyperbolic fit (Equation 4b and solid lines in C) and grey symbols indicate KDapp values from fits to the quadratic equation (Equation 5). The error bars denote 95% confidence intervals, as determined by fitting the data to the indicated equation in Prism 8.

The situation is different at 0°C (Figure 6C). Here, varying the labeled RNA concentration revealed divergent binding curves and a pronounced dependence of apparent affinity (determined by fitting the data to Equation 4b) on the concentration of RNA, the constant component (Figure 6C,D). Moreover, the fits of the data to Equation 4b (solid lines in Figure 6C), which assumes [P]free ≈ [P]total, were poor, increasingly so for higher RNA concentrations. These data are indicative of protein depletion due to binding to labeled RNA. The apparent KD values vary by five-fold across the 30-fold range of RNA concentrations used (Figure 6D, red circles), and even greater discrepancies would arise at higher RNA concentrations (Figure 5—figure supplement 1). Consequently, only an upper limit of the real affinity can be extracted from these data (KD ≤ 2.3 pM, based on the fit value at the lowest RNA concentration used).

To address the limitation in our 0°C data we could, in principle, lower the concentration of labeled RNA even further, until the labeled RNA concentration is <<KD and until an RNA concentration-independent regime is established. But this is difficult when binding is very tight, as a limit is set by the sensitivity of the technique used. In our case, at ~1 pM 32P-labeled RNA we are already near the limit of reliable detection. If the concentration of the trace component cannot be lowered further, a more sensitive approach can sometimes be found. Kinetic approaches are particularly suitable for tight binders (see Appendix 1), or one can report an upper limit of the KD. In some cases, increasing the salt concentration or other changes to the solution or binding partners can be used to weaken binding to make it easier to obtain affinities at higher concentrations of the labeled species; this approach can be especially valuable if one is primarily interested in the relative affinities of multiple ligands (Altschuler et al., 2013).

As noted earlier, the quadratic binding equation enables KD determination for binding reactions in the intermediate regime. The quadratic equation provides a good fit to the 0°C data (Figure 6C, dashed lines) and yields uniform and well-defined KD values of ~1.9 pM across the different RNA concentrations, consistent with an intermediate (rather than titration) regime (Figure 6D, grey circles). The same KD value was obtained from kinetic experiments, providing independent support for and confidence in this determination (Appendix 1).

In summary, we want to use the binding regime whenever possible, as it allows for the most straightforward and reliable KD measurements. It is necessary to avoid the titration regime and caution is required in the intermediate regime. In practice, varying the concentration of both components is an essential control for ruling out titration, ruling out other potential artifacts, and ensuring the measurement of valid dissociation constants.

Re-evaluating the equilibration time at 0°C

In the previous section, we mentioned the need for re-evaluating the equilibration time for Puf4 binding at 0°C after a binding regime was established. In principle, after determining sufficiently low RNA concentration for the binding regime, one could vary the incubation time again, as done in Figure 4. In our case, we used the shortcut defined in Equation 2 and instead determined the upper limit of the equilibration time by measuring the koff at 0°C (Appendix 1; see also Appendix 2—note 1 for precautions when applying this shortcut). These measurements revealed an equilibration time of 30 hr (five half-lives), far above the typical incubation times of 1 hr or less (Figure 1—figure supplement 1).

Dependence of binding affinity on conditions

The 100-fold difference in Puf4 affinity between 0°C and 25°C underscores the important point that the equilibrium dissociation constant is only a constant value at a given set of conditions, and that the affinity can change dramatically when the conditions (temperature, salt, pH) are changed. This dependence on conditions should always be considered when comparing literature values or when applying in vitro results to biology.

Test KD by an independent approach

Even when no challenges are encountered, as in the case of Puf4 binding at 25°C, it is a good idea to determine the KD by a second approach to ensure that the measurement is not biased by experimental artifacts or idiosyncrasies of a particular technique. This is especially important when using a secondary readout (vs. a direct approach) such as native gel shift or nitrocellulose filter binding, where major loss (or gain) of bound complex can potentially occur between the equilibration and detection steps (see below and Appendix 2—note 2).

Of course, there are many approaches to carrying out equilibrium binding measurements one can choose from (e.g. Velázquez-Campoy et al., 2004; Wong and Lohman, 1993; Eftink, 1997; McDonnell, 2001). Here, we used a kinetic approach for independent KD determination for Puf4 at 25°C and 0°C, as described in Appendix 1. Kinetic measurements provide an information-rich alternative and complement to the equilibrium measurements and are often simple to carry out provided they fall within a measurable time range (Pollard, 2010; Hulme and Trevethick, 2010; Sanders, 2010; Pollard and De La Cruz, 2013). In case of Puf4, the affinities determined by kinetic measurements were within two-fold of those from equilibrium determinations, strongly supporting their accuracy.

Determine the fraction of active protein

The amount of bound ligand is determined not by the total protein concentration but by the concentration of total active protein. If 90% of the protein is damaged due to misfolding, aggregation, degradation or, for example, inactivated by phosphorylation at the binding interface, then the observed affinity will be that for only 10% of the total protein present—and will be ten-fold higher than the actual KD value. Moreover, if the binding-competent protein concentration is much lower than the total and therefore much closer to the limiting component concentration than expected, the binding regime may not be maintained, leading to even greater discrepancies between the real and observed KD. As a common cause of non-active or less active protein is aggregation, determining the monodispersity of the protein following purification is advisable (Altschuler et al., 2013).

In addition, we recommend, when possible, a titration experiment to determine the fraction of binding-competent protein (Altschuler et al., 2013). Here, a concentration of ligand that is much greater than the measured KD is intentionally used and the protein concentration is varied by approximately an order of magnitude above and below the ligand concentration. To ensure accurate ligand concentration and to prevent excessive signal (if labeled ligand is used), the trace labeled ligand should be mixed with a large excess of identical unlabeled molecule at a known concentration. Assuming that the stoichiometry of the bound complex is known and that the ligand is 100% active, the breakpoint in fraction bound versus the ratio of protein to ligand indicates the amount of active protein (Figure 7). For example, for a 1:1 complex, a breakpoint at a protein:RNA ratio of 2.0 suggests that half of the protein is active. In Figure 7, the ratio of 1.3 suggests that the Puf4 preparation is 75% active (0.75 = 1/1.3). Consequently, the apparent KD values determined in the previous sections should be multiplied by the active protein fraction (which ranged from 0.75 to 0.90 for Puf4) to determine the final KD value. In an alternative approach, the titration data could be fit to a quadratic equation, with a coefficient used to represent the active protein fraction (Figure 7—figure supplement 1).

Figure 7. Measuring the fraction of active protein by titration.

The fraction of active protein is derived from the breakpoint, that is, the intersection of linear fits to the low and high-Puf4 concentration data. See Figure 7—figure supplement 1 for an alternative strategy using Equation 5.

Figure 7.

Figure 7—figure supplement 1. Determination of the fraction of active protein from a quadratic fit.

Figure 7—figure supplement 1.

Fits of titration data at 100 nM (A) and 10 nM (B) RNA to the quadratic equation are shown. The quadratic equilibrium-binding equation (Equation 5) was modified to include a term for the active protein fraction.
Fraction bound =A × ([R]total+ F× [P]total + KD) ([R]total+ F × [P]total + KD)2 4 × [R]total × F × [P]total2 × [R]total+O (8)
A and O correspond to the amplitude and Y axis offset, respectively; F is the fraction of active protein; [R]total was constrained to the known RNA concentration (10 or 100 nM); here, the KD value was constrained to the known affinity (Table 2). The last constraint is optional, as the KD value contributes minimally to the fit at these high RNA concentrations and because the exact KD value may not yet be known at the time of measuring the active protein fraction. The fit fractions of active protein (F) are almost identical to those determined from linear fits of the same data in Figure 7 (~0.75).

A limitation of the titration experiment is that it assumes the constant component to be 100% active, which may not always be the case, especially in the case of protein-protein interactions. Therefore, one should ensure, to the extent possible, maximum purity of both binding components. Importantly, one should always make clear whether experiments were carried out to determine ‘fraction active’.

The case of no observed binding

Researchers often conclude that there is ‘no binding’—that ‘X does not bind to Y’. Typically, the underlying experimental observation is an absence of observed binding up to a certain protein (or ligand) concentration. Therefore, one should report a lower limit for the dissociation constant (KD), rather than draw an absolute conclusion of ‘no binding’. But even an accurate lower limit often requires additional experiments, because the absence of observed binding—say in a gel shift, filter binding, or pull-down experiment—can arise either because there is no significant binding or because the complex does not withstand the assay conditions (Pollard, 2010). While this objection may seem like a technicality, there are many instances where known binders do not give a gel shift or filter binding.

Immuno-precipitation and pull-down assays are pervasive in current biological investigations and are often interpreted in terms of ‘binding’ or ‘no binding’. But the reality of the interpretation of these experiments—and the reality of molecular interactions—is more nuanced (Pollard, 2010). A ligand with the same affinity, slightly lower affinity, or even higher affinity than another ligand with demonstrated binding can incorrectly be concluded to ‘not bind’.

Consider, for example, an RNA pull-down with an RNA binding protein with KD = 10−9 M and kon = 108 M−1 s−1; this gives koff = 0.1 s−1 or a half-life for dissociation of ~10 s. If the washing steps following a pull-down take 30 s, only ~10% of the complex is expected to remain. If the affinity is 10-fold weaker (koff = 1 s−1), then no detectable complex is likely to remain after 30 s of washing (10−13 of the starting amount). Further, if another RNA ligand binds with the same affinity, but 10-fold slower (and thus also dissociating 10-fold slower; koff = 0.01 s−1, half-life of ~100 s), most (~75%) of the complex will remain after the 30 s washing steps despite an identical KD to the first ligand. In addition, the limited dynamic range of visual readouts of gels that are often used to evaluate pull-down experiments increases the danger of misinterpretation or overinterpretation of these experiments.

Overall, observing binding in pull-downs and related experiments is a complex function of the experimental components and conditions. This doesn’t at all mean these experiments should not be done—they often provide critical clues and insights into biology. But, for these and all experiments, we need to keep in mind the nature of the assay, and thus what can and cannot be concluded from the experiment.

Whether binding is absent or not detected can be tested by using approaches that directly report on the equilibrium between bound and unbound components in solution (e.g. ITC, fluorescence anisotropy, and other fluorescence-based techniques), as opposed to indirect approaches like native gel shift and pull-downs that are based on physically separating bound and unbound components, so that unstable complexes may fall apart prior to the detection step. Nevertheless, direct approaches also have limitations. For example, fluorescence intensity or FRET (Förster resonance energy transfer) is limited at high concentrations by inner filter effects, and ITC will miss binding events when the release (or uptake) of heat upon binding is too small (i.e. the binding enthalpy is too small).

A simple way to test whether binding occurs when there is no binding signal is to carry out a competition experiment. If the ligand is bound but not detected in an approach such as native gel shift or filter binding, it will still lessen binding of another ligand for which there is an established signal. The amount lessened depends quantitatively on the KD values and concentrations of each ligand, given sufficient time for equilibration. A competition experiment to obtain the KD value for a weak RNA substrate of Puf4 is shown in Appendix 3, along with the binding scheme and equation to determine the KD value.

Competition binding measurements can also have a practical benefit; after an initial KD is determined for a labeled substrate, KD values for additional substrates can be determined by competition without labeling each substrate (Hulme and Trevethick, 2010; Sanders, 2010; Ryder et al., 2008).

Discussion

Given the increasingly multi-disciplinary nature of research, scientists are increasingly venturing into disciplines outside their expertise. Our goal is to support these valuable efforts by enabling both experts and non-experts in thermodynamics to get the most out of their binding experiments, and to help them evaluate work by others, published or under review for publication.

While the number of steps described to obtain reliable equilibrium data may initially seem daunting, the accompanying experimental illustrations and guides can transform an opaque process into one that is readily understandable and can be carried out in a straightforward, stepwise fashion by researchers from varied backgrounds.

We found it useful to develop and use an Equilibrium Binding Checklist to organize our approach and findings. We provide a template of such a checklist, along with completed examples in Appendix 4 (Appendix 4—figure 1, 2, 3). We expect that many readers will find these valuable.

There has been much discussion about problems with reproducibility and rigor in the scientific literature (Landis et al., 2012; Plant et al., 2014; Nature, 2013; Nosek and Errington, 2017; Koroshetz et al., 2020). Historically, a powerful means to ensure reliability of published data has been to develop community standards. Reporting guidelines have been successfully adopted by journals in a variety of fields, including structural biology (Berman et al., 2000), enzymology (http://www.beilstein-institut.de/en/projects/strenda/guidelines), organic synthesis (e.g. http://pubs.acs.org/page/joceah/submission/ccc.html), and many others, and new standards, guidelines and databases are continually being devised (see https://fairsharing.org/ for a curated list). We encourage journals to adopt analogous standards for reporting binding measurements. Contingent on implementation of such standards, we ultimately envision a well-curated and well-documented quantitative database that is routinely used to build and test models for individual molecular interactions and for cellular and molecular networks.

Materials and methods

Survey of published equilibrium binding measurements

We surveyed 100 papers, including 66 papers from the list of quantitative RNA/protein studies assembled by the Liu lab (Yang et al., 2013) and 34 additional studies reporting KD and apparent KD values for RNA/protein interactions (Supplementary file 1). To confirm that our survey was not biased, we also scored 20 publications from a single PubMed search for ‘RNA protein binding dissociation constant’, after confirming that they reported KD values for RNA/protein binding. Four of the 20 papers also appeared in the above list. The fractions of papers controlling for equilibration and/or titration were similar to those in the main survey (Figure 1): 30% of the 20 papers controlled both for equilibration and titration, 15% controlled for neither, 50% only controlled for titration and 5% only controlled for equilibration.

Equilibration was evaluated as follows. If a study reported systematically varying the incubation time, it was counted as controlled for equilibration. If dissociation kinetics were measured in addition to performing equilibrium measurements (n = 3), the study was scored as equilibration-controlled, but only if the reported incubation time was at least three half-lives based on the reported koff, and only if the kinetic and equilibrium experiments were performed at the same conditions (n = 1). Studies exclusively using approaches that intrinsically monitor the binding progress (ITC, SPR, biolayer interferometry [BLI]) also were counted as equilibration controlled. However, if several approaches were used in a given study to determine affinities for distinct binding interactions and/or conditions, and if for at least one approach time was not varied, the study was scored as not equilibration controlled. Some exceptions where equilibration can be reasonably assumed are noted in Supplementary file 1.

To generate Figure 1—figure supplement 1, we used the incubation times reported for non-equilibration controlled binding experiments. If a narrow range of times (e.g. 15–20 min, 45–60 min; n = 2) was indicated, this was not counted as systematically varying time and the longer time was used for Figure 1—figure supplement 1. If only a lower limit of the incubation time was reported (e.g. ‘at least 30 min’; n = 1), this lower limit was used for Figure 1—figure supplement 1. If two sequential incubations were performed at different temperatures (e.g. ‘10 min at room temperature and 10 min at 4°C’, n = 4), the total incubation time was used for the purposes of the survey. However, since affinity is condition-specific, only equilibration at a constant temperature can yield meaningful KD values, and two-temperature incubations should be avoided.

To evaluate if titration was controlled for, first, we confirmed if the concentration of the limiting species was systematically varied to determine effects on KD (n = 5); these studies were counted as titration controlled. If a study reported a range of concentrations of the limiting species, without stating that the effects on KD were assessed, we did not count this as a titration control, as in practice such a range typically only indicates optimization of radioactive/fluorescent signal to account for radioactive decay and/or varying labeling efficiencies. For the remaining studies, we asked if Equation 4b (which assumes the binding regime) or Equation 5 (which also allows for the intermediate regime) was used to fit the data. If no equation was indicated, or if the midpoint of the binding curve/gel signal was used to determine the KD, or if linear transformation was used in lieu of the hyperbolic fit, we counted the study as using Equation 4b. For studies using Equation 4b, we asked if the lowest apparent KD value was in at least 10-fold excess over the limiting component concentration, in which case we counted the study as titration controlled. If a range of limiting component concentrations was reported, we used the lowest value. If only the amount (not concentration) of the limiting species was reported, the concentration was calculated based on the provided volume or, if not indicated, based on a 10 µL reaction volume; nevertheless, binding equilibria depend on concentrations, not amounts, and concentrations, in units of ‘M’, should always be indicated. If Equation 5 was used (incl. all ITC measurements), we counted the study as titration controlled, unless the reported KD was more than 1000-fold below the limiting species concentration (corresponding to a cutoff typically used in ITC [Velázquez-Campoy et al., 2004]). For simplicity, we assumed that all SPR/BLI measurements (where the concentration of the immobilized species is difficult to estimate and not reported) were titration controlled; nevertheless, we emphasize the importance of explicitly reporting controls for mass transport in SPR measurements (Myszka, 1999). If multiple approaches were used, but at least in one approach titration was not controlled for according to the above criteria, the study was scored as not titration controlled, unless the affected values were corroborated by a titration-controlled approach in the same study.

If no details on the incubation time and/or the concentration of the limiting reagent were provided, but instead a previous study was cited (‘as described’, n = 4), the information for the above evaluation was obtained from the cited study. This included two cases in which the authors had performed rigorous equilibration and titration controls in their previous referenced work.

Puf4 purification

The RNA-binding domain (residues 537–888) of S. cerevisiae Puf4 was cloned into a custom pET28a-based expression vector in frame with an N-terminal 6X His-tag and a C-terminal SNAP tag (New England Biolabs, Ipswich, MA). The construct was transformed into E. coli protein expression strain BL21 (DE3) and protein expression was induced at an OD600 of 0.6 with 1 mM IPTG at 20°C for ~20 hr. Induced cells were harvested by centrifugation at 4500 × g for 20 min. Cell pellets were re-suspended in Buffer A (20 mM HEPES-sodium (HEPES-Na)), pH 7.4, 500 mM potassium acetate (KOAc), 5% glycerol, 0.2% Tween-20, 10 mM imidazole, 2 mM dithiothreitol (DTT), 1 mM phenylmethylsulfonyl fluoride (PMSF) and cOmplete, Mini, protease inhibitor cocktail (Roche Diagnostics GmbH, Mannheim, Germany) and lysed four times using an Emulsiflex (Avestin, Inc, Ottawa, ON, Canada). The lysate was clarified by centrifugation at 20,000 × g for 20 min, nucleic acids were precipitated with polyethylene imine (0.21% final concentration) at 4°C for 30 min with constant stirring and pelleted by centrifugation at 20,000 × g for 20 min. The supernatant was loaded on a Nickel-chelating HisTrap HP column (GE Healthcare, Pittsburgh, PA). Bound protein was washed extensively over a shallow 10–25 mM imidazole gradient and eluted over a linear 25–500 mM gradient of imidazole. Peak Puf4 protein fractions were pooled and desalted into Buffer B (20 mM HEPES-Na, pH 7.4, 50 mM KOAc, 5% glycerol, 0.1% Tween-20, 2 mM DTT) using a desalting column. The His-tag was cleaved by overnight incubation with His-tagged TEV protease at 4°C, and the protein was purified on a HisTrap HP column. The flow-through was desalted into Buffer B and loaded on a HiTrap Q HP column (GE Healthcare) and washed extensively with Buffer B to remove any bound RNA. Protein was eluted over a linear gradient of potassium acetate from 50 to 1000 mM. Protein fractions were pooled and desalted into Buffer C (20 mM HEPES-Na, pH 7.4, 100 mM KOAc, 5% glycerol, 0.1% Tween-20 and 2 mM DTT), concentrated and diluted two-fold with Buffer C containing 80% glycerol for final storage at −20°C. UV absorbance spectra indicated that the protein was free from significant RNA contamination (<1 RNA base per protein).

RNA 5´-end labeling

Puf4_HO RNA (AUGUGUAUAUUAGU; Integrated DNA Technologies (IDT), Coralville, IA; 5 µM) was labeled with equimolar [γ-32P] ATP (Perkin Elmer, Inc, Boston, MA) using T4 polynucleotide kinase (Thermo Fisher Scientific, Vilnius, Lithuania) and purified by non-denaturing gel electrophoresis (20% acrylamide). The RNA was eluted into TE buffer (10 mM Tris-HCl, pH 8.0; 1 mM EDTA) at 4°C overnight, and the lower limit of eluted RNA concentration, assuming no unlabeled RNA, was determined by scintillation counting and calibration against the specific activity of the [γ-32P] ATP stock used for labeling. The upper limit of RNA concentration was calculated from total RNA input and the elution buffer volume, assuming a 100% yield.

Equilibrium binding measurements

All reactions were performed in a binding buffer containing 20 mM HEPES-sodium or HEPES-potassium buffer, pH 7.4, 2 mM magnesium chloride (MgCl2), 100 mM KOAc, 2 mM DTT, 0.2% Tween 20, 5% glycerol, 0.1 mg/ml BSA, at 25 or 0°C, as indicated. The protein and labeled RNA dilutions were prepared in binding buffer at two-times the indicated concentration and were kept on ice until the binding reactions were initiated by mixing 10 µL of protein with 10 µL of labeled RNA. The pipette tips used for mixing and aliquoting the 0°C reactions were kept on ice. The labeled RNA concentrations and incubation times are indicated in the individual figure legends. Following the incubation, 7.5 µL aliquots were moved to 5 µL of ice-cold loading buffer containing 6.25% Ficoll PM 400 (Sigma-Aldrich, Saint Louis, MO), 0.075% bromophenol blue (BPB), and 2.5 µM unlabeled Puf4_HO RNA. The unlabeled RNA in the loading buffer prevented additional association to the labeled RNA from occurring during sample loading (Appendix 2—note 2). Control experiments indicated negligible re-equilibration in loading buffer (t1/2 ≥ 3 hr in three independent measurements), consistent with the slow dissociation rate constant measured in binding buffer at 0°C (Appendix 1). All samples were loaded on the gel within 20 min from mixing with the loading buffer. Non-denaturing acrylamide gels (20%) were pre-run for at least 1 hr at 42 V/cm constant voltage, 4–6°C with 0.5x TBE buffer (50 mM Tris, 42 mM boric acid, 0.5 mM EDTA•Na2, pH 8.5–8.6 final) using a circulating cooling system. Aliquots (7.5 µL) were carefully loaded on continuously running gels and separated for 45–90 min. (Extreme caution must be exercised at this step; see, e.g. https://ehs.stanford.edu/reference/electrophoresis-safety for electrical safety hazards.) The gels were dried and exposed to phosphorimager screens, scanned with a Typhoon 9400 Imager and quantified with TotalLab Quant software (TotalLab, Newcastle-Upon-Tyne, UK). Fitting was performed with KaleidaGraph 4.1 (Synergy Software, Reading, PA; RRID:SCR_014980).

The KD values in Table 2 indicate the average and standard error from five independent equilibrium experiments (25°C). For 0°C measurements, KD(hyperbolic) indicates the upper limit determined using Equation 4b at the lowest RNA concentration (Figure 6C,D); KD(quadratic) indicates the average and standard error of KD values determined with Equation 5 at the four RNA concentrations shown in Figure 6C,D.

Table 2. Summary of equilibrium and kinetic measurements of Puf4 affinity.

Equilibrium* Kinetic
Temperature,°C KD(hyperbolic), pM KD(quadratic), pM kon, M−1s−1* koff, s−1 KD (=koff/kon), pM
0 ≤1.7 1.39 ± 0.09 (2.85 ± 0.14)×107 (2.92 ± 0.17)×10−5 1.02 ± 0.08
25 120 ± 30 120 ± 30 (1.04 ± 0.14)×108 0.014 ± 0.003 130 ± 30

*The values have been normalized by active protein fraction (75–90%). KD(hyperbolic) and KD(quadratic) refer to values derived from fits to Equation 4b and Equation 5, respectively. Errors are defined in Materials and methods.

Kinetic measurements

Measurements of koff (Appendix 1) were performed by incubating the indicated concentrations of Puf4 with trace concentration of labeled Puf4_HO RNA for 10 min at 25°C or 0°C in the binding buffer described in Equilibrium binding measurements. Labeled RNA concentrations were 0.04–0.5 nM, corresponding to the lower and upper limits, as defined in RNA 5´-end labeling. Dissociation was initiated by transferring the binding reaction to 2.5x volume of unlabeled chase in binding buffer. The chase RNA concentrations in the final reaction were 250 nM and 1000 nM. At various times, 7.5 µL aliquots were moved to 5 µL of ice-cold loading buffer containing 6.25% Ficoll PM 400% and 0.075% BPB, and 7.5 µL aliquots were loaded on a pre-run, continuously running 20% non-denaturing gel at 4–6°C. All pipette tip boxes and solutions used for the 0°C reactions were kept on ice. The chase solution for the 25°C reaction was pre-warmed in a 25°C water bath for 10 min before initiating the dissociation reaction. All time courses were fit to single exponentials using KaleidaGraph 4.1.

The effectiveness of unlabeled Puf4_HO RNA chase was tested by pre-incubating 10 nM Puf4 with 100–1000 nM unlabeled RNA (final concentrations) for 12 min at 25°C before adding trace amount of labeled Puf4_HO RNA (0.04–0.4 nM). The fractions of bound labeled RNA ranged from 0.01 (1000 nM) to 0.1 (100 nM), compared to 0.95 fraction bound in the absence of chase, confirming the effectiveness of the chase.

The koff values reported in Table 2 indicate the average and standard error from two replicate experiments (25°C) or the average and standard error across different concentrations in a single experiment (0°C).

Values of kon were determined by mixing 40 µL each of trace labeled RNA solution (0.004–0.05 nM) and varying dilutions of Puf4. At varying times, 7.5 µL aliquots were transferred to 5 µL of ice-cold loading buffer containing 6.25% Ficoll PM 400, 0.075% BPB, and 2.5 µM unlabeled Puf4_HO RNA and loaded on a 20% gel as above. The protein and RNA solutions were pre-incubated at the reaction temperature (0°C or 25°C) before mixing, and ice-cold tips were used for the 0°C reactions. To control for titration by labeled RNA at the low protein concentrations used, at 0°C, the equilibration rate constants were also measured at three-fold higher labeled RNA concentration, giving consistent rate constants within 1.1–1.3-fold (Appendix 1).

The kon values reported in Table 2 are the slopes and standard errors of linear fits to observed rate constants from two replicate experiments (25°C) or a single experiment (0°C). The kon values were corrected for the active protein fraction.

Measuring the fraction of active protein by titration

Unlabeled Puf4_HO RNA (10 or 100 nM) was incubated for 30 min with varying Puf4 concentrations in the presence of trace labeled Puf4_HO RNA (0.06–0.4 nM); the labeled and unlabeled RNA was pre-mixed before adding Puf4. The fraction bound RNA was determined as described in Equilibrium binding measurements.

Competition measurements

Trace labeled Puf4_HO RNA (0.02–0.19 nM) was equilibrated with 0.4 nM or 1.2 nM Puf4 and diluted two-fold into solutions containing varying concentrations of unlabeled competitor RNA (CGUAUAUUA; IDT). The reactions were incubated at 25°C for the indicated time, followed by transfer of 7.5 µL aliquots to 5 µL ice-cold loading buffer (6.25% Ficoll PM 400, 0.075% BPB, and 2.5 µM unlabeled Puf4_HO RNA). The samples were loaded immediately on a continuously running native acrylamide gel (4–5°C). The curves were fit to Equation 9, as described in Appendix 3.

Simulations

The simulated data in Figure 5 were generated by using Equation 4b (panel A) and Equation 5 (panel B) to calculate the fraction of bound RNA at each total protein concentration. In Figure 5—figure supplements 1, 2, 4 and 5, Equation 5 was used to calculate fractions bound at each protein and ligand concentration. In Figure 4—figure supplement 1, Equation 4b was used to determine the fraction of ligand bound at each protein concentration at equilibrium, assuming [P] = [P]total. This equilibrium value was then used as an amplitude (A) term in the single-exponential equation shown in Figure 2 to determine the fraction of bound ligand at each time point t: Fraction bound(t) = A×(1  et × kequil) = Fraction bound(equilibrium)×(1  et × (kon[P] + koff)).

The simulated data in Figure 5—figure supplement 3 were generated as follows. First, Equation 5 was used to calculate the expected fraction of bound RNA at equilibrium for each [R]total and [P]total indicated in the figure. Two-fold serial dilution of protein was chosen as representative of a typical equilibrium binding experiment. In the case of 0.001 nM Rtotal, Equation 4b was used instead to calculate the expected fraction bound, as this condition satisfies the [P]free = [P]total assumption. Random noise in fraction bound was then generated around each predicted data point by sampling from a normal distribution with the indicated standard deviation, using the scipy and random packages in Python. Ten binding series were generated this way for each condition and each noise level. These datasets were then individually fit to Equation 5 (or Equation 4b in the case of 0.001 nM Rtotal) in Prism 8 (GraphPad Software, LLC, San Diego, CA; RRID:SCR_002798), with the equations modified to include amplitude (A) and y axis offset (O) terms:

Fraction bound=A× ([R]total+ [P]total + KD) ([R]total+ [P]total + KD)2 4 × [R]total × [P]total2×[R]total+O (6)
Fraction bound= A×[P]total[P]total+KD+O (7)

To facilitate fitting to Equation 6, [R]total was constrained to the known value, and the KD was constrained to positive values only, with the real affinity (0.1 nM) used as an initial estimate.

Acknowledgements

We thank Geeta Narlikar, Enrique De La Cruz, and members of the Herschlag lab for discussions and comments. We are also grateful to Hashim Al-Hashimi, Tom Cech, Katrin Karbstein, Olke Uhlenbeck, Chris Walsh, and Deborah Wuttke for critical feedback and suggestions. This work was funded by a grant from the US National Institutes of Health to DH (R01 GM132899).

Appendix 1

Kinetic approach to affinity determination

An equilibrium dissociation constant is the ratio of dissociation and binding rate constants (KD = koffkon), and thus can be determined by directly measuring these rate constants. Because koff is concentration-independent, it is the easiest and most robust parameter to measure. Appendix 1—figure 1 describes the steps for this measurement. After forming the complex between protein and a trace concentration of labeled RNA, a large excess of unlabeled RNA is added to the reaction. The role of the unlabeled 'chase' RNA is to bind any dissociated protein before it can re-bind the labeled RNA. Thus, the chase RNA must be in large excess of the protein concentration and must be a tight binder. The probability of rebinding can be further reduced by diluting the reaction mixture. At specified time points (t2; Appendix 1—figure 1A), the amount of remaining complex can be determined by native gel electrophoresis or another approach. Although in principle koff can be determined from a single binding reaction, as in any experiment, reliability is best established with several controls (see Appendix 2—note 6).

Appendix 1—figures 1B, C show dissociation of RNA from Puf4 at 25°C and 0°C, respectively. As expected for simple dissociation with an effective chase, the curves are well fit by a single exponential curve with endpoints that approach zero and the rate constant is independent of protein and chase concentrations. A critical control is to test that the koff is not affected by the chase concentration. This is because in some contexts of multi-step dissociation processes, the chase can facilitate dissociation (e.g. Hadizadeh et al., 2016).

Appendix 1—figure 1. Kinetics of Puf4 dissociation.

Appendix 1—figure 1.

(A) Mixing scheme for measuring the dissociation rate constant. After equilibration of a saturating or near-saturating concentration of Puf4 protein with a trace concentration of labeled RNA (t1), a large excess of unlabeled RNA is added, with concomitant dilution of the binding reaction to prevent rebinding after dissociation. (B–C) Time dependence of Puf4 dissociation from its consensus RNA at 25°C (C; koff = (0.014 ± 0.003) s−1) and at 0°C (D; koff = (2.92 ± 0.17) × 10−5 s−1).

To measure the association rate constant one can use a different type of chase experiment that we refer to as a ‘kon chase’ (Appendix 1—figure 2A; Hertel et al., 1994). Here the time that the protein and labeled RNA are incubated together is varied (t1) and the amount bound after each time t1 is determined by native gel shift or another assay. To ensure that the amount bound accurately reflects what has occurred during t1 and not subsequently, a chase is added to prevent, or quench, additional binding, analogous to the koff experiment above (Appendix 1—figure 1A). The time t2 is kept constant, removing potential variability from dissociation subsequent to the binding reaction during t1 (see Appendix 2—note 2).

Appendix 1—figure 2. Kinetics of Puf4/RNA association.

Appendix 1—figure 2.

(A) Mixing scheme for measuring association rate constants. (B, C) Time dependence of Puf4 association to its consensus RNA at 25°C (B) and 0°C (C). (D, E) Determination of kon from the slope of the Puf4 concentration dependence of equilibration rate constants in parts B and C, respectively (circles). The koff values from Appendix 1—figure 1 are also shown (diamonds) to illustrate the correspondence between the y-intercept and koff (Equation 1). Panels D and E show results from two and one independent experiments, respectively (error bars in E correspond to averages from measurements at two different labeled RNA concentrations).

The observed association rate constant is expected to vary with protein concentration—that is, it is first order in protein (Figure 3)—so it is important to carry out these measurements across a wide range of protein concentrations. Appendix 1—figures 2B, C show the data obtained at 25°C and 0°C, respectively. Each individual time course is well fit by an exponential, and Appendix 1—figures 2D, E plot the rate constants obtained from these time courses versus Puf4 concentration, giving the expected linear dependencies, the slopes of which correspond to kon (Appendix 1—figure 2D).

The plot in Appendix 1—figure 2D also shows a clear, non-zero intercept. While not intuitive, this intercept arises because the ‘kon’ experiment actually measures the rate constant to reach equilibrium, kequil, where kequil equals kon[P] + koff (Equation 1) so that the slope gives kon and the intercept gives koff (Appendix 1—figure 2D). There is good agreement between the intercepts and the independently measured koff values in our experiments (Appendix 1—figures 2D, E, diamonds). It is generally preferable to compare directly-obtained koff values to these intercepts, rather than relying on the intercept for koff determination, as this allows independent tests of data consistency and accuracy.

The KD values obtained in the equilibrium and kinetics experiments agree within two-fold, which is reasonable experimental agreement in our experience (Table 2). Such agreement strongly supports (although does not prove) that both methods are giving correct binding constants.

Appendix 2

Additional considerations

Note 1: koff as shortcut for establishing equilibration times

Measuring koff provides a fast and dependable way to determine the equilibration time needed for simple two-state binding reactions (Figure 3 and Equation 2). However, we still recommend monitoring the time course of complex formation in the presence of ligand, in case binding is more complex than a single step, for example involving an additional slow conformational step (e.g. LeCuyer and Crothers, 1994; Smith et al., 2009; Bevilacqua et al., 1992; Mueller-Planitz and Herschlag, 2008; James and Tawfik, 2005; Pisareva et al., 2006). If there is a slow step preceding binding, the rate of equilibration may become limited by this slow step. For example, the formation of long-lived stable alternative structures is well known for RNA (e.g. Uhlenbeck, 1995; Herschlag, 1995). Such alternative states can lead to rapid equilibrium binding for a sub-population and then slow binding as the misfolded, alternative state re-equilibrates to give partial binding or non-exponential kinetics. The following are diagnostics for these and related issues:

  1. Association and/or dissociation kinetics do not follow a single exponential. Such more complex kinetics indicate the presence of additional species that must be identified.

  2. Association kinetics are not first order in protein; that is, the binding rate constant is independent of protein concentration instead of the linear dependence seen in Appendix 1—figure 2D, E and predicted by Equation 1. This behavior indicates additional species in the binding process.

  3. Equilibration rate constant is dependent on protein concentration, but binding does not go to completion even at saturating concentrations. E.g. if only half of a ligand is bound at saturation, that may indicate that 50% of the ligand is trapped in a long-lived conformation or covalently heterogeneous (e.g. protein ligands that are partially or heterogeneously covalently modified). Alternatively, incomplete binding can mean that the ligand dissociates during disruptive sample processing steps (see Note 2). Further tests, such as extending the incubation time, pre-incubating an RNA ligand at increased temperatures, or analysis of the labeled ligand by HPLC, gel, sedimentation, or other methods, can determine if incomplete binding is caused by a slow conformational step or by covalent heterogeneity.

If these situations do not apply, koff is sufficient to determine the required equilibration time.

Note 2: Controls for changes during sample processing in ‘indirect’ binding measurements, and approaches to prevent these changes

Techniques such as native gel shift, nitrocellulose filter binding, and any pull-down-based approaches (Lambert et al., 2014; Wong and Lohman, 1993; Ryder et al., 2008; Campbell et al., 2012) involve sample processing steps between the binding incubation and detection of bound complex, and are thus ‘indirect.’ Changes in conditions during sample handling and analysis can perturb the amount of complex from that present at the end of the initial incubation. It is therefore important to control for and ideally prevent such changes, and additional confirmatory experiments, such as the kinetic experiments described in the main text, are necessary to obtain high-confidence KD values.

In the case of native gel shift experiments, following the incubation (time t1 in Figure 4A), the sample is transferred to a gel loading buffer, containing a high-density additive such as Ficoll or glycerol to facilitate gel loading, and typically a dye, and is loaded onto a non-denaturing gel. These steps are accompanied by changes in concentration, solution conditions, and often temperature, all of which have the potential to induce dissociation or additional binding.

Some general strategies to minimize such perturbations include:

  1. Minimizing any changes in conditions between the incubation (t1) and detection (t2) steps. E.g. the loading buffer can be omitted altogether by including sufficient glycerol in the reaction itself and the gel can be run at the same temperature as the binding reaction (Hellman and Fried, 2007). However, this will not always be feasible, e.g. due to rapid dissociation of the complex during room-temperature gel electrophoresis.

  2. Empirically assessing the effects of any changes in conditions on binding and the time scales on which these effects occur. Certain changes will have negligible and consistent effects on binding and will not affect the quantification if samples are handled quickly and consistently.

  3. Varying the original incubation conditions while maintaining the same gel loading and gel running conditions is a worthwhile control to establish that the observed fraction bound does reflect at least some property from the original incubation conditions.

For our Puf4 binding assays, we were able to prevent changes during the native gel shift assay by utilizing certain favorable properties of Puf4/RNA binding. Below we describe the specific steps we took, as some of the Puf4 strategies can be adapted to other systems with similar properties.

  1. Preventing complex dissociation:

    • During initial exploration, we found that Puf4 dissociates from its consensus RNA extremely slowly at 0°C (Appendix 1). Thus, by keeping our loading buffer on ice and running the gels at 4–5°C we were able to effectively ‘quench’ complex dissociation, with only negligible dissociation (t1/2 ≥ 3 hr) occurring during the short time (seconds–minutes) the samples spent in loading buffer or the gel running buffer.

    • While dissociation of weaker Puf4 ligands was non-negligible even at low temperatures (data not shown), loading these samples quickly (within seconds) and with a consistent loading time ensured that dissociation only affected the amplitude and not the shape of the equilibrium binding curve (and thus the KD determined from it). This was confirmed by competition measurements.

  2. Preventing additional complex formation:

    • In contrast to the slow dissociation, Puf4 association rate constant remains high even at 0°C (Appendix 1). Thus, additional binding can occur during sample loading, and the amount of additional binding will vary with the concentrations of the binding partners and the time prior to loading and gel entry.

    • To avoid the above complexities, we included a large, saturating excess of unlabeled RNA in the loading buffer (here we used the same oligonucleotide as the labeled RNA; more generally—a tight binder that binds at least as tightly as the labeled RNA should be used). This is equivalent to the kon chase used in the kinetics measurements (Appendix 1) and ensures that the additional binding that occurs before entering the gel is to the unlabeled RNA—such that the fraction of bound labeled RNA still accurately reflects the fraction bound during the original incubation.

    • If applying an analogous chase approach, it is important to keep in mind that the unlabeled RNA concentration in the loading buffer must be at least 10-fold higher than the protein concentration used. Otherwise, a substantial fraction of labeled RNA can still bind during t2, in a manner dependent on protein concentration. Using a chase RNA that is bound more tightly than the ligand being tested (e.g. wild-type RNA sequence vs. a mutant) allows the use of lower excess.

Additional measures to minimize changes during binding measurements by native gel shift:

  1. To maintain consistent loading time for all samples, and to minimize the time in the gel running buffer, the samples should be (carefully!) loaded on a continuously running gel. (DANGER: As there is sufficient current to cause injury or death, extreme caution is required in this step. Always be sure there are no leaks, never touch the gel while running, even with gloves on, and maintain a safe distance as current can arc; see, e.g. https://ehs.stanford.edu/reference/electrophoresis-safety for safety information.)

  2. The ratio of the sample volume to the area of the bottom of the well should be kept as low as possible. This ratio can be optimized by loading different sample volumes and varying the comb size and the gel thickness.

  3. The percentage of acrylamide is another variable that should be adjusted if excessive dissociation on the gel occurs (indicated by smearing), with higher acrylamide percentages recommended to increase complex stability in the gel (see also Altschuler et al., 2013).

  4. Using high-density compounds such as Ficoll or glycerol in the loading buffer facilitates rapid gel entry by concentrating the sample at the bottom of the well.

  5. It is advisable to vary the above and other factors (including voltage and temperature) to determine if they influence the results. Factors following sample incubation should not affect the results.

Note 3: Number of time points for establishing equilibration time

Although in principle using two well-separated equilibration times is sufficient, using three or more times is preferable. Using only two incubation times has the potential to give a misleading result—if binding continues to increase while, for example, protein is denaturing, these factors can cancel each other out to give apparently constant binding. It is also critical to use times that span a considerable range, preferably approaching or exceeding 10-fold; here the concern is that if the time interval is narrow, the (inevitable) measurement error can make it difficult to distinguish if measurements are or are not time-independent.

Note 4: Uncertainty in the concentration of labeled trace binding partner

Accurately quantifying trace concentrations of labeled ligands can be a challenge, for instance, when working with radioactively labeled oligonucleotides. The concentration of labeled material can be estimated from specific activity of the isotope used for labeling. However, if labeling is incomplete and the purification procedure used after labeling does not fully separate labeled from unlabeled material, more oligonucleotide will be present than accounted for by specific activity. To be conservative, the upper limit of labeled oligonucleotide concentration should always be calculated based on the total oligonucleotide input in the labeling reaction. The concentration determined by specific activity provides a lower limit.

If not all protein is active or if the protein concentration is inaccurate, one may have false confidence of being in the excess-protein regime described by Equation 4b. Thus, although keeping the trace component at least 10-fold below the dissociation constant is a useful benchmark, we recommend always varying the trace ligand concentration (section 'Avoid the titration regime'). When possible, the concentration of active protein should be determined by titration against a known concentration of the ligand (see section 'Determine the fraction of active protein').

Note 5: Additional factors that can affect binding-competent concentrations of protein and ligand

Depending on the system and technique, other factors can lead to concentration-dependent variation in the observed KD values. Performing the simple controls in the Equilibrium Binding Checklist (Appendix 4) will typically detect these problems and help assess the robustness of measured KD values. We provide a non-exhaustive list below that will be helpful in devising appropriate additional controls, but experimenters should consult references for technique-specific information and advice.

  1. Proteins can denature over time, and can also be subject to time-dependent proteolysis in extracts or partially purified systems.

    • Apparent weakening of binding with increased incubation time will help detect this behavior (for reactions where protein is the excess binding partner).

    • Optimizing solution conditions (e.g. lowering temperature, including glycerol, varying pH, etc.) and using protease inhibitors may help extend accessible times.

    • koff measurements can help identify the shortest feasible time to be used for equilibrium incubation to limit damage (in cases where damage occurs during unnecessarily long incubations).

  2. Proteins can aggregate/form higher-order complexes at very high concentrations.

    • Active protein concentration (section 'Determine the fraction of active protein') should be assessed at several protein concentrations, including one at the high-end of the concentration range used in equilibrium experiments (see also discussion in Altschuler et al., 2013).

    • If concentration-dependent changes in stoichiometry of the complex are detected (which can be detected with some approaches like gel electrophoresis, certain fluorescence-based methods), models beyond the simple model in Figure 3 should be devised and tested.

  3. Single-stranded oligonucleotides can form intermolecular base-pairs when used at high concentrations (e.g. in competition experiments) or during storage.

    • Nearest-neighbor base-pairing predictions can be used to estimate if base-pairing may be an issue at the temperature, salt conditions, and oligonucleotide concentrations used. If interactions between oligonucleotides are suspected, one can weaken base-pairing by using higher temperature or lowering salt concentrations in the binding experiments. If such changes are not possible or sufficient, the binding model should be modified to incorporate the oligonucleotide interactions.

    • For nucleic acid constructs with extensive complementarity that can form long-lived intermolecular interactions during storage, a dilute solution should be heated before use in binding experiments.

  4. Nucleic acids can be covalently damaged by nucleases and other factors.

    • Care should be taken to remove all potential nuclease contamination by using sterile, high-purity water and reagents, sterile supplies and surface decontaminants. In our experience, some of the most robust RNase contamination has come from contaminated lots of commercial RNase inhibitors; thus, we recommend testing these products before relying on their efficacy.

    • UV exposure should be limited or avoided during nucleic acid purification to avoid covalent damage (Kladwang et al., 2012; Greenfeld et al., 2011).

  5. Both proteins and nucleic acids can stick to tubes, lowering the concentrations accessible for binding.

    • Varying the type of reaction tube, and including small amounts of detergent and bovine serum albumin (BSA) can be used to assess and prevent sticking. (NOTE: Some BSA and other protein preparations contain nuclease contaminants.)

    • Varying the concentration of the labeled trace partner and measuring the dissociation constant by more than one approach (equilibrium vs. kinetics, or different techniques) can control for loss of material due to sticking.

  6. Long-lived misfolded RNA concentrations can reduce binding-competent concentration during short incubation times (see Note 1).

Note 6: Controls and considerations for dissociation rate constant measurements

We recommend the following steps to ensure accurate koff measurements.

  1. Establish that the chase is effective. Mixing the chase (in large excess over the protein concentration) with labeled ligand before addition of protein to form the complex should lead to no detectable protein binding to labeled ligand. If this is not the case, a higher chase concentration and/or a higher-affinity chase ligand is needed.

  2. Establish independence of koff from the chase ligand concentration. Multiple chase ligand concentrations should be used, preferably spanning at least an order of magnitude. It is expected that koff will be constant, but variation can indicate an experimental artifact (such as a chase component affecting koff) or, more interestingly, an ability of one ligand to facilitate dissociation of another. Here, dissociation by dilution becomes an important control—i.e. the bound complex formed with protein concentration near the KD is diluted by varying amounts until full or near-full dissociation is observed. The dissociation rate constant should be consistent across different dilution factors that give full dissociation. Incomplete dissociation (with the dissociation curve plateauing substantially above zero) most simply suggests insufficient chase (and/or dilution), which is usually readily resolved by increasing the chase concentration. Less commonly, incomplete dissociation can indicate heterogeneity of the bound complex, with a slowly dissociating sub-population remaining bound on the time scale of the experiment. In this case, increasing the chase concentration will not lead to complete dissociation, and the origins of complex heterogeneity should be investigated. Indeed, the more slowly dissociating fraction is more likely to represent a functional form, as it is more tightly associated.

  3. Establish independence of protein concentration. While the starting fraction bound may vary depending on how far above the KD the protein concentration is, the dissociation rate constant should always be the same. Changes in koff with protein concentration can indicate a contaminant in the protein solution or, more interestingly, the formation of a protein multimer that increases or decreases the RNA dissociation rate. Thus, koff measurements at several protein concentrations (ideally three or more spanning at least an order of magnitude), in addition to serving as controls, can help discover new complexes and pathways.

Appendix 3

Competition measurements

Considerations for competition measurements

  1. Protein concentrations ~2–5 times above the KD for the labeled ligand (KD* in Appendix 3—figure 1A) should typically be used. It is important to be near or above the KD* to have sufficient signal, but at the same time not too far above the KD* so that the protein concentration is sufficiently below the KD of unlabeled competitor (KD,comp) for most of the competitor to remain unbound (analogous to the ‘trace’ condition recommended for direct binding measurements; see section 'Avoid the titration regime'). Under these conditions ([P]total << KD,comp), the quadratic solution to the 'Lin and Riggs’ equation can be used to determine KD,comp (Equation 9, Appendix 3—figure 1C; Lin and Riggs, 1972). Protein concentrations below KD* can be used as controls for competitive binding (analogous to varying the labeled ligand concentration to rule out titration in direct binding measurements).

  2. If the unlabeled competitor binds with a KD very similar to, or lower than that of the labeled ligand, the condition [P]total << KD,comp for Equation 9 will not be satisfied. In such case, the cubic equation, which accounts for the depletion of the competitor, can be used (Wang, 1995). For the high-affinity competitors, it may be more reliable to measure binding directly (rather than by competition); alternatively, a higher-affinity labeled species could be used.

  3. The competitive binding equation includes labeled ligand concentration (Appendix 3—figure 1C). In a typical experiment where labeled ligand is used in trace ([R*]total << KD* in Appendix 3—figure 1C) this value is negligible and does not contribute substantially to the KD,comp. Nevertheless, it is still important to establish whether the labeled ligand concentration affects KD,comp in a given experiment by fitting the data with lower and upper limits of labeled ligand concentration used in the equation (see Appendix 2—note 4). In case of the Puf4 experiment, using the upper and lower limits of labeled RNA concentration resulted in similar KD,comp values (230 nM and 204 nM, respectively), but using the upper limit led to a poorly fit amplitude due to expected slight protein depletion. This observation, along with others, suggests that the real labeled RNA concentration was more closely approximated by the lower limit (based on 32P quantification) than the upper limit (based on total RNA used for R* preparation), and that negligible co-purified unlabeled RNA was present in our R* preparation (Appendix 2—note 4).

Appendix 3—figure 1. Measuring binding affinity by competition.

Appendix 3—figure 1.

(A) Competitive binding reaction scheme. R*: labeled RNA ligand; Rcomp: unlabeled competitor RNA; KD*: protein affinity for R*; KD,comp: protein affinity for Rcomp. (B) Mixing scheme for a competition measurement. (C) Competition between the U1C point mutant of the Puf4 consensus (Rcomp = CGUAUAUUA) and the labeled consensus RNA (R*= 32P-AUGUGUAUAUUAGU). The data were fit to the following equation (Lin and Riggs, 1972; Weeks and Crothers, 1992):
Fraction bound =A ×12[R]total[KD+ KDKD,comp[Rcomp]total+[P]total+ [R]total (KD+ KDKD,comp[Rcomp]total+[P]total+ [R]total)2  4[R]total[P]total]+O (9)

A indicates the maximum amplitude, constrained to the fit amplitude of the R* binding curve that is measured in parallel by a direct binding experiment (A = 0.89 for Puf4 binding to R*). O is the y axis offset (background). [R*]total was constrained to the lower limit of the labeled RNA concentration. KD* was constrained to Puf4 affinity for the labeled RNA, as determined by direct measurement in the same experiment (0.105 nM, after accounting for active protein fraction of 75%). [P]total was 0.45 nM, after accounting for active protein fraction. The fit KD,comp value was 204 nM. Incubation times of 10, 30, and 110 min gave consistent KD,comp values (190–210 nM), as did lowering the protein concentration by three-fold (180 nM). Equation 9 is applicable only for KD,comp >> KD*. For other cases see Wang, 1995.

Appendix 4

Equilibrium binding checklist

Appendix 4—figure 1. Equilibrium binding checklist template.

Appendix 4—figure 1.

Appendix 4—figure 2. Example of a completed equilibrium binding checklist based on Puf4/RNA binding at 25°C.

Appendix 4—figure 2.

Appendix 4—figure 3. Example of a completed equilibrium binding checklist based on Puf4/RNA binding at 0°C.

Appendix 4—figure 3.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Inga Jarmoskaite, Email: ijarmosk@stanford.edu.

Daniel Herschlag, Email: herschla@stanford.edu.

Sebastian Deindl, Uppsala University, Sweden.

John Kuriyan, University of California, Berkeley, United States.

Funding Information

This paper was supported by the following grant:

  • National Institutes of Health R01 GM132899 to Daniel Herschlag.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Formal analysis, Validation, Investigation, Visualization, Writing - review and editing.

Conceptualization, Resources, Data curation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Literature survey of 100 RNA/protein binding studies.
elife-57264-supp1.xlsx (62.4KB, xlsx)
Supplementary file 2. Literature survey of CRISPR nuclease binding studies, representative of high-affinity interactions.
elife-57264-supp2.xlsx (12.8KB, xlsx)
Transparent reporting form

Data availability

No datasets were generated in this work. The figures include all data, or, where most appropriate for clarity, representative data from a single experiment for every type of experiment performed.

References

  1. Abudayyeh OO, Gootenberg JS, Konermann S, Joung J, Slaymaker IM, Cox DB, Shmakov S, Makarova KS, Semenova E, Minakhin L, Severinov K, Regev A, Lander ES, Koonin EV, Zhang F. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science. 2016;353:aaf5573. doi: 10.1126/science.aaf5573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Adams RM, Mora T, Walczak AM, Kinney JB. Measuring the sequence-affinity landscape of antibodies with massively parallel titration curves. eLife. 2016;5:e23156. doi: 10.7554/eLife.23156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Altschuler SE, Lewis KA, Wuttke DS. Practical strategies for the evaluation of high-affinity protein/nucleic acid interactions. Journal of Nucleic Acids Investigation. 2013;4:3–28. doi: 10.4081/jnai.2013.4646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Beloglazova N, Kuznedelov K, Flick R, Datsenko KA, Brown G, Popovic A, Lemak S, Semenova E, Severinov K, Yakunin AF. CRISPR RNA binding and DNA target recognition by purified cascade complexes from Escherichia coli. Nucleic Acids Research. 2015;43:530–543. doi: 10.1093/nar/gku1285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The protein data bank. Nucleic Acids Research. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bevilacqua PC, Kierzek R, Johnson KA, Turner DH. Dynamics of ribozyme binding of substrate revealed by fluorescence-detected stopped-flow methods. Science. 1992;258:1355–1358. doi: 10.1126/science.1455230. [DOI] [PubMed] [Google Scholar]
  7. Boyle EA, Andreasson JOL, Chircus LM, Sternberg SH, Wu MJ, Guegler CK, Doudna JA, Greenleaf WJ. High-throughput biochemical profiling reveals sequence determinants of dCas9 off-target binding and unbinding. PNAS. 2017;114:5461–5466. doi: 10.1073/pnas.1700557114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Buenrostro JD, Araya CL, Chircus LM, Layton CJ, Chang HY, Snyder MP, Greenleaf WJ. Quantitative analysis of RNA-protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes. Nature Biotechnology. 2014;32:562–568. doi: 10.1038/nbt.2880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Campbell ZT, Bhimsaria D, Valley CT, Rodriguez-Martinez JA, Menichelli E, Williamson JR, Ansari AZ, Wickens M. Cooperativity in RNA-protein interactions: global analysis of RNA binding specificity. Cell Reports. 2012;1:570–581. doi: 10.1016/j.celrep.2012.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Eftink MR. Fluorescence methods for studying equilibrium macromolecule-ligand interactions. Methods in Enzymology. 1997;278:221–257. doi: 10.1016/s0076-6879(97)78013-3. [DOI] [PubMed] [Google Scholar]
  11. Gerber AP, Herschlag D, Brown PO. Extensive association of functionally and cytotopically related mRNAs with puf family RNA-binding proteins in yeast. PLOS Biology. 2004;2:E79. doi: 10.1371/journal.pbio.0020079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Greenfeld M, Solomatin SV, Herschlag D. Removal of covalent heterogeneity reveals simple folding behavior for P4-P6 RNA. Journal of Biological Chemistry. 2011;286:19872–19879. doi: 10.1074/jbc.M111.235465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hadizadeh N, Johnson RC, Marko JF. Facilitated dissociation of a nucleoid protein from the bacterial chromosome. Journal of Bacteriology. 2016;198:1735–1742. doi: 10.1128/JB.00225-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Hellman LM, Fried MG. Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions. Nature Protocols. 2007;2:1849–1861. doi: 10.1038/nprot.2007.249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Herschlag D. RNA chaperones and the RNA folding problem. Journal of Biological Chemistry. 1995;270:20871–20874. doi: 10.1074/jbc.270.36.20871. [DOI] [PubMed] [Google Scholar]
  16. Hertel KJ, Herschlag D, Uhlenbeck OC. A kinetic and thermodynamic framework for the hammerhead ribozyme reaction. Biochemistry. 1994;33:3374–3385. doi: 10.1021/bi00177a031. [DOI] [PubMed] [Google Scholar]
  17. Hulme EC, Trevethick MA. Ligand binding assays at equilibrium: validation and interpretation. British Journal of Pharmacology. 2010;161:1219–1237. doi: 10.1111/j.1476-5381.2009.00604.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Jain N, Lin HC, Morgan CE, Harris ME, Tolbert BS. Rules of RNA specificity of hnRNP A1 revealed by global and quantitative analysis of its affinity distribution. PNAS. 2017;114:2206–2211. doi: 10.1073/pnas.1616371114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. James LC, Tawfik DS. Structure and kinetics of a transient antibody binding intermediate reveal a kinetic discrimination mechanism in antigen recognition. PNAS. 2005;102:12730–12735. doi: 10.1073/pnas.0500909102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jiang F, Zhou K, Ma L, Gressel S, Doudna JA. A Cas9-guide RNA complex preorganized for target DNA recognition. Science. 2015;348:1477–1481. doi: 10.1126/science.aab1452. [DOI] [PubMed] [Google Scholar]
  21. Karbstein K, Herschlag D. Extraordinarily slow binding of guanosine to the Tetrahymena group I ribozyme: implications for RNA preorganization and function. PNAS. 2003;100:2300–2305. doi: 10.1073/pnas.252749799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Kladwang W, Hum J, Das R. Ultraviolet shadowing of RNA can cause significant chemical damage in seconds. Scientific Reports. 2012;2:517. doi: 10.1038/srep00517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Koroshetz WJ, Behrman S, Brame CJ, Branchaw JL, Brown EN, Clark EA, Dockterman D, Elm JJ, Gay PL, Green KM, Hsi S, Kaplitt MG, Kolber BJ, Kolodkin AL, Lipscombe D, MacLeod MR, McKinney CC, Munafò MR, Oakley B, Olimpo JT, Percie du Sert N, Raman IM, Riley C, Shelton AL, Uzzo SM, Crawford DC, Silberberg SD. Framework for advancing rigorous research. eLife. 2020;9:e55915. doi: 10.7554/eLife.55915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Lambert N, Robertson A, Jangi M, McGeary S, Sharp PA, Burge CB. RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. Molecular Cell. 2014;54:887–900. doi: 10.1016/j.molcel.2014.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Landis SC, Amara SG, Asadullah K, Austin CP, Blumenstein R, Bradley EW, Crystal RG, Darnell RB, Ferrante RJ, Fillit H, Finkelstein R, Fisher M, Gendelman HE, Golub RM, Goudreau JL, Gross RA, Gubitz AK, Hesterlee SE, Howells DW, Huguenard J, Kelner K, Koroshetz W, Krainc D, Lazic SE, Levine MS, Macleod MR, McCall JM, Moxley RT, Narasimhan K, Noble LJ, Perrin S, Porter JD, Steward O, Unger E, Utz U, Silberberg SD. A call for transparent reporting to optimize the predictive value of preclinical research. Nature. 2012;490:187–191. doi: 10.1038/nature11556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. LeCuyer KA, Crothers DM. Kinetics of an RNA conformational switch. PNAS. 1994;91:3373–3377. doi: 10.1073/pnas.91.8.3373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lin SY, Riggs AD. Lac repressor binding to non-operator DNA: detailed studies and a comparison of eequilibrium and rate competition methods. Journal of Molecular Biology. 1972;72:671–690. doi: 10.1016/0022-2836(72)90184-2. [DOI] [PubMed] [Google Scholar]
  28. Ma E, Harrington LB, O'Connell MR, Zhou K, Doudna JA. Single-Stranded DNA cleavage by divergent CRISPR-Cas9 enzymes. Molecular Cell. 2015;60:398–407. doi: 10.1016/j.molcel.2015.10.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Maerkl SJ, Quake SR. A systems approach to measuring the binding energy landscapes of transcription factors. Science. 2007;315:233–237. doi: 10.1126/science.1131007. [DOI] [PubMed] [Google Scholar]
  30. McDonnell JM. Surface plasmon resonance: towards an understanding of the mechanisms of biological molecular recognition. Current Opinion in Chemical Biology. 2001;5:572–577. doi: 10.1016/S1367-5931(00)00251-9. [DOI] [PubMed] [Google Scholar]
  31. Miller MT, Higgin JJ, Hall TM. Basis of altered RNA-binding specificity by PUF proteins revealed by crystal structures of yeast Puf4p. Nature Structural & Molecular Biology. 2008;15:397–402. doi: 10.1038/nsmb.1390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mueller-Planitz F, Herschlag D. Coupling between ATP binding and DNA cleavage by DNA topoisomerase II: a unifying kinetic and structural mechanism. The Journal of Biological Chemistry. 2008;283:17463–17476. doi: 10.1074/jbc.M710014200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Myszka DG. Improving biosensor analysis. Journal of Molecular Recognition. 1999;12:279–284. doi: 10.1002/(SICI)1099-1352(199909/10)12:5&#x0003c;279::AID-JMR473&#x0003e;3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
  34. Nature Announcement: Reducing our irreproducibility. Nature. 2013;496:398. doi: 10.1038/496398a. [DOI] [Google Scholar]
  35. Nosek BA, Errington TM. Making sense of replications. eLife. 2017;6:e23383. doi: 10.7554/eLife.23383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nutiu R, Friedman RC, Luo S, Khrebtukova I, Silva D, Li R, Zhang L, Schroth GP, Burge CB. Direct measurement of DNA affinity landscapes on a high-throughput sequencing instrument. Nature Biotechnology. 2011;29:659–664. doi: 10.1038/nbt.1882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. O'Connell MR, Oakes BL, Sternberg SH, East-Seletsky A, Kaplan M, Doudna JA. Programmable RNA recognition and cleavage by CRISPR/Cas9. Nature. 2014;516:263–266. doi: 10.1038/nature13769. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Peluso P, Herschlag D, Nock S, Freymann DM, Johnson AE, Walter P. Role of 4.5S RNA in assembly of the bacterial signal recognition particle with its receptor. Science. 2000;288:1640–1643. doi: 10.1126/science.288.5471.1640. [DOI] [PubMed] [Google Scholar]
  39. Pisareva VP, Pisarev AV, Hellen CU, Rodnina MV, Pestova TV. Kinetic analysis of interaction of eukaryotic release factor 3 with guanine nucleotides. Journal of Biological Chemistry. 2006;281:40224–40235. doi: 10.1074/jbc.M607461200. [DOI] [PubMed] [Google Scholar]
  40. Plant AL, Locascio LE, May WE, Gallagher PD. Improved reproducibility by assuring confidence in measurements in biomedical research. Nature Methods. 2014;11:895–898. doi: 10.1038/nmeth.3076. [DOI] [PubMed] [Google Scholar]
  41. Pollard TD. A guide to simple and informative binding assays. Molecular Biology of the Cell. 2010;21:4061–4067. doi: 10.1091/mbc.e10-08-0683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Pollard TD, De La Cruz EM. Take advantage of time in your experiments: a guide to simple, informative kinetics assays. Molecular Biology of the Cell. 2013;24:1103–1110. doi: 10.1091/mbc.e13-01-0030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Raper AT, Stephenson AA, Suo Z. Functional insights revealed by the kinetic mechanism of CRISPR/Cas9. Journal of the American Chemical Society. 2018;140:2971–2984. doi: 10.1021/jacs.7b13047. [DOI] [PubMed] [Google Scholar]
  44. Richardson CD, Ray GJ, DeWitt MA, Curie GL, Corn JE. Enhancing homology-directed genome editing by catalytically active and inactive CRISPR-Cas9 using asymmetric donor DNA. Nature Biotechnology. 2016;34:339–344. doi: 10.1038/nbt.3481. [DOI] [PubMed] [Google Scholar]
  45. Rutkauskas M, Sinkunas T, Songailiene I, Tikhomirova MS, Siksnys V, Seidel R. Directional R-Loop formation by the CRISPR-Cas surveillance complex cascade provides efficient Off-Target site rejection. Cell Reports. 2015;10:1534–1543. doi: 10.1016/j.celrep.2015.01.067. [DOI] [PubMed] [Google Scholar]
  46. Ryder SP, Recht MI, Williamson JR. Quantitative analysis of protein-RNA interactions by gel mobility shift. Methods in Molecular Biology. 2008;488:99–115. doi: 10.1007/978-1-60327-475-3_7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sanders CR. Biomolecular Ligand-Receptor Binding Studies: Theory, Practice, and Analysis. Nashville: Vanderbilt University; 2010. [Google Scholar]
  48. Semenova E, Jore MM, Datsenko KA, Semenova A, Westra ER, Wanner B, van der Oost J, Brouns SJ, Severinov K. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. PNAS. 2011;108:10098–10103. doi: 10.1073/pnas.1104144108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Smith KD, Lipchock SV, Ames TD, Wang J, Breaker RR, Strobel SA. Structural basis of ligand binding by a c-di-GMP riboswitch. Nature Structural & Molecular Biology. 2009;16:1218–1223. doi: 10.1038/nsmb.1702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Sternberg SH, Redding S, Jinek M, Greene EC, Doudna JA. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature. 2014;507:62–67. doi: 10.1038/nature13011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Sternberg SH, LaFrance B, Kaplan M, Doudna JA. Conformational control of DNA target cleavage by CRISPR-Cas9. Nature. 2015;527:110–113. doi: 10.1038/nature15544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Strohkendl I, Saifuddin FA, Rybarski JR, Finkelstein IJ, Russell R. Kinetic basis for DNA target specificity of CRISPR-Cas12a. Molecular Cell. 2018;71:816–824. doi: 10.1016/j.molcel.2018.06.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Tome JM, Ozer A, Pagano JM, Gheba D, Schroth GP, Lis JT. Comprehensive analysis of RNA-protein interactions by high-throughput sequencing-RNA affinity profiling. Nature Methods. 2014;11:683–688. doi: 10.1038/nmeth.2970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Uhlenbeck OC. Keeping RNA happy. RNA. 1995;1:4–6. [PMC free article] [PubMed] [Google Scholar]
  55. Velázquez-Campoy A, Ohtaka H, Nezami A, Muzammil S, Freire E. Isothermal titration calorimetry. Current Protocols in Cell Biology. 2004;23:17.8.1–17.8.24. doi: 10.1002/0471143030.cb1708s23. [DOI] [PubMed] [Google Scholar]
  56. Wang ZX. An exact mathematical expression for describing competitive binding of two different ligands to a protein molecule. FEBS Letters. 1995;360:111–114. doi: 10.1016/0014-5793(95)00062-E. [DOI] [PubMed] [Google Scholar]
  57. Weeks KM, Crothers DM. RNA binding assays for Tat-derived peptides: implications for specificity. Biochemistry. 1992;31:10281–10287. doi: 10.1021/bi00157a015. [DOI] [PubMed] [Google Scholar]
  58. Westra ER, van Erp PB, Künne T, Wong SP, Staals RH, Seegers CL, Bollen S, Jore MM, Semenova E, Severinov K, de Vos WM, Dame RT, de Vries R, Brouns SJ, van der Oost J. CRISPR immunity relies on the consecutive binding and degradation of negatively supercoiled invader DNA by cascade and Cas3. Molecular Cell. 2012;46:595–605. doi: 10.1016/j.molcel.2012.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Westra ER, Semenova E, Datsenko KA, Jackson RN, Wiedenheft B, Severinov K, Brouns SJ. Type I-E CRISPR-cas systems discriminate target from non-target DNA through base pairing-independent PAM recognition. PLOS Genetics. 2013;9:e1003742. doi: 10.1371/journal.pgen.1003742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wong I, Lohman TM. A double-filter method for nitrocellulose-filter binding: application to protein-nucleic acid interactions. PNAS. 1993;90:5428–5432. doi: 10.1073/pnas.90.12.5428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wright AV, Sternberg SH, Taylor DW, Staahl BT, Bardales JA, Kornfeld JE, Doudna JA. Rational design of a split-Cas9 enzyme complex. PNAS. 2015;112:2984–2989. doi: 10.1073/pnas.1501698112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Wu LC, Tuot DS, Lyons DS, Garcia KC, Davis MM. Two-step binding mechanism for T-cell receptor recognition of peptide MHC. Nature. 2002;418:552–556. doi: 10.1038/nature00920. [DOI] [PubMed] [Google Scholar]
  63. Yang X, Li H, Huang Y, Liu S. The dataset for protein-RNA binding affinity. Protein Science. 2013;22:1808–1811. doi: 10.1002/pro.2383. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Sebastian Deindl1

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

Given the ubiquitous nature of binding measurements in the literature, including newly emerging high-throughput approaches, this manuscript addresses an important and timely topic. This manuscript is particularly compelling in providing an easy-to-follow set of practical guidelines exemplified with relevant binding data. The authors' approach to this important topic is highly pedagogical and should be a must-read for anyone with the ambition to quantitatively characterize binding equilibria.

Decision letter after peer review:

Thank you for submitting your article "How to measure and evaluate binding affinities" for consideration by eLife. Your article has been favorably reviewed by two peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by John Kuriyan as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

Summary:

In this manuscript, a review of 100 studies reporting on binding measurements is presented, allowing the authors to identify and illustrate a number of pitfalls and issues that often adversely affect the reliability and meaningful biological interpretation of binding equilibrium measurements. Using example binding measurements to illustrate the most relevant points, the authors provide a straightforward, practical set of guidelines in terms of a step-by-step “checklist” that can be followed to ensure acquisition of high-quality data best suited to quantitatively describe simple binding equilibria. Given the ubiquitous nature of binding measurements in the literature, including newly emerging high-throughput approaches, this manuscript addresses an important and timely topic. While there may be at least partial overlap with previously published literature on this topic, this manuscript is particularly compelling in providing an easy-to-follow set of practical guidelines exemplified with relevant binding data. The manuscript is well written and accompanied with a number of high-quality and clear illustrations. As such, the authors' approach to this important topic is highly pedagogical and should be a must-read for anyone with the ambition to quantitatively characterize binding equilibria.

Revisions:

The authors should address the following points to further improve clarity of the manuscript:

1) While the need for the equilibration time control is clear, the requirement for changing the concentration of the second species to probe whether ligand depletion could affect Kd measurements seems to be less universal. If a Kd value, as obtained via binding experiment, is substantially larger than the concentration of the labeled species, it would not be strictly necessary to test for potential ligand depletion. It would be important to take this notion into account in the literature survey so as to indicate those cases where a titration regime could indeed be plausible. It would be useful to provide examples from the literature of Kd values that were underestimated due to ligand depletion. The authors should consider emphasizing the specific conditions where ligand depletion might be overlooked (e.g. high-affinity interactions requiring the use of particularly low concentrations of the labeled binding partner, where concentration uncertainty could play a significant role, and a very low active concentration of the protein). That said, varying the equilibration time is an extremely useful control that should be recommended, but perhaps the authors could be more specific as described above. The authors should also consider emphasizing that the inability to obtain a good fit with a hyperbolic function should be considered a serious warning sign that could indicate insufficient equilibration or ligand depletion. The requirement to rigorously report binding curves and fits could be an important part of a binding data reporting standard. In particular for indirect methods such as EMSAs, binding curves and fits are often omitted.

2) Cpf1 is discussed as an example where affinity is substantially underestimated. However, this particular example appears more complicated and likely requires factors other than insufficient incubation time to be considered: First, in one of the studies reporting a 1000-fold lower affinity, a koff of 1/(several seconds) was directly measured using a smFRET assay; Second, the experimental conditions that are known to affect binding were different in all three studies being compared, including temperature, buffer composition (specifically, divalent ions), as well as RNA and DNA sequences; Moreover, in some cases, the Cpf1 proteins were from different species (Strohkendl et al., 2018, and the study reporting the lowest affinity).

Indeed, this illustrates another common mistake when reporting or using binding affinities: treating them as constant values rather than functions of many variables, and ignoring experimental conditions and other important details when comparing the values. It would be important (and educational) to emphasize this in the manuscript, see also minor points for more details. This all being said, Cpf1 still makes a good case for the authors' main point regarding the need to prove that binding is at equilibrium, since the lack of this proof in the study reporting the lowest affinity creates a lot of confusion.

3) In Appendix 3, Weeks and Crothers, 1992 is cited for a precise competitive binding equation for the case of Kd,comp close to total concentration of P, but the solution to the quadratic equation in Weeks and Crothers does not represent a general equation for competitive binding. Instead, the same approximation as in Lin and Riggs (Kd,comp>>total concentration of P) is assumed and the equation is solved for theta to obtain a binding curve rather than a single point for theta=0.5. This approach should still be considered superior when compared to determining Kd,comp from a single data point since it takes all the other data of the curve into account. However, this approach still cannot be used in case of comparable affinities for competitor and labeled ligand. Instead, a general competitive binding curve should be used that represents a correct solution to the cubic equation (see, for example, PMID: 7875313).

eLife. 2020 Aug 6;9:e57264. doi: 10.7554/eLife.57264.sa2

Author response


Revisions:

The authors should address the following points to further improve clarity of the manuscript:

1) While the need for the equilibration time control is clear, the requirement for changing the concentration of the second species to probe whether ligand depletion could affect Kd measurements seems to be less universal. If a Kd value, as obtained via binding experiment, is substantially larger than the concentration of the labeled species, it would not be strictly necessary to test for potential ligand depletion. It would be important to take this notion into account in the literature survey so as to indicate those cases where a titration regime could indeed be plausible.

We updated our literature survey to distinguish between studies in which titration is and is not plausible, and have included additional analysis of literature data based on the concentration of the limiting species relative to the reported KD values (see revised Figure 1 and newly added Figure 1—figure supplement 2, as well as accompanying text changes). While most authors appear to be aware of the need to avoid titration, approx. one fifth of all studies used the hyperbolic equation at concentrations that violate the [P]total ≈ [P]free condition. This included several studies in which the reported KD was equal to or even lower than the stated limiting species concentration, putting these studies at high risk for inaccuracies that accompany KD values determined in the titration regime. In these and other cases varying the concentration of the limiting species would provide a definitive control for titration (and for other concentration-related artifacts). Further, while it becomes less likely that there are titration artifacts when reported KD values are much greater than the concentration of the second species, one typically does not know for certain if all of the added material is active or if other possible artifacts, such as contamination with another species, are present. Finally, we emphasize that our primary purpose is to make future measurements reliable, so we have tried to make the case that changes are needed in how measurements are typically or often made, without calling out individual studies.

It would be useful to provide examples from the literature of Kd values that were underestimated due to ligand depletion. The authors should consider emphasizing the specific conditions where ligand depletion might be overlooked (e.g., high-affinity interactions requiring the use of particularly low concentrations of the labeled binding partner, where concentration uncertainty could play a significant role, and a very low active concentration of the protein).

We have added a new “Implications of the titration regime” section, analogous to the “Implications of insufficient equilibration” section, where we address the above points.

That said, varying the equilibration time is an extremely useful control that should be recommended, but perhaps the authors could be more specific as described above. The authors should also consider emphasizing that the inability to obtain a good fit with a hyperbolic function should be considered a serious warning sign that could indicate insufficient equilibration or ligand depletion. The requirement to rigorously report binding curves and fits could be an important part of a binding data reporting standard. In particular for indirect methods such as EMSAs, binding curves and fits are often omitted.

We agree that poor fits present an important red flag and now discuss this point in both the equilibration and titration sections. We also amended the checklist (Appendix 4) to include questions about systematic deviations from the fit and whether binding curves are displayed. Nevertheless, it is important not to rely on the quality of fits alone (given potential noise, often sparse data points, subtle and difficult to recognize systematic deviations, or real differences from simple 1:1 binding model). In addition, it is not uncommon for authors to vary the Hill coefficient to compensate for poor fits or to show gel images and derived K1/2 values without plots or fits of the data. We chose not to elaborate on these practices in order to focus on guidelines for correct measurements rather than on criticisms of unreliable ones.

2) Cpf1 is discussed as an example where affinity is substantially underestimated. However, this particular example appears more complicated and likely requires factors other than insufficient incubation time to be considered: First, in one of the studies reporting a 1000-fold lower affinity, a koff of 1/(several seconds) was directly measured using a smFRET assay;

We have carefully reviewed the paper in question and have consulted with an expert in Cpf1 kinetics. While we recognize the potential for additional complexities, the data strongly support a dominant role of insufficient equilibration in underestimating the affinity. To respond to the first point, the kinetics data in PMID: 29735714 that show rapid dissociation on the scale of seconds refer to targets with 9 mismatches (Figure 3E) or 16 mismatches (Figure S3E). For the fully complementary target, Figure 3A indicates virtually no dissociation after >1 h, consistent with extremely slow equilibration. Figure 3A also suggests slow equilibration (>1 h) for targets with up to 7 mismatches.

Second, the experimental conditions that are known to affect binding were different in all three studies being compared, including temperature, buffer composition (specifically, divalent ions), as well as RNA and DNA sequences; Moreover, in some cases, the Cpf1 proteins were from different species (Strohkendl et al., 2018 and the study reporting the lowest affinity).

We recognize that affinity can vary dramatically across conditions and now address this general point in the manuscript. Nevertheless, in this particular case the conditions used to study Acidaminococcus sp Cspf1 were near-identical between PMID: 29735714 and PMID: 30078724, despite a 1000-fold difference between reported KD values. We now state “for the same enzyme at similar conditions” in the text and we include details of the conditions below. Moreover, the consistency between the kinetic data in both studies (see the previous point) argues against major acceleration of equilibration in response to the small differences in conditions.

We now only mention the 1000-fold difference between the affinities reported for the same Cpf1 ortholog (AsCpf1) and do not mention the 100,000-fold difference from a study that investigated a different Cpf1 ortholog.

Indeed, this illustrates another common mistake when reporting or using binding affinities: treating them as constant values rather than functions of many variables, and ignoring experimental conditions and other important details when comparing the values. It would be important (and educational) to emphasize this in the manuscript, see also minor points for more details.

We completely agree that the dependence of affinity on specific conditions is an important point and now discuss it in the manuscript.

This all being said, Cpf1 still makes a good case for the authors' main point regarding the need to prove that binding is at equilibrium, since the lack of this proof in the study reporting the lowest affinity creates a lot of confusion.

We agree.

3) In Appendix 3, Weeks and Crothers, 1992 is cited for a precise competitive binding equation for the case of Kd,comp close to total concentration of P, but the solution to the quadratic equation in Weeks and Crothers does not represent a general equation for competitive binding. Instead, the same approximation as in Lin and Riggs (Kd,comp>>total concentration of P) is assumed and the equation is solved for theta to obtain a binding curve rather than a single point for theta=0.5. This approach should still be considered superior when compared to determining Kd,comp from a single data point since it takes all the other data of the curve into account. However, this approach still cannot be used in case of comparable affinities for competitor and labeled ligand. Instead, a general competitive binding curve should be used that represents a correct solution to the cubic equation (see, for example, PMID: 7875313).

We are grateful to the reviewers for pointing us to the general formulation of the competitive binding equation. We have updated Appendix 3 with the new equation and we have added the reference to the cubic equation.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Literature survey of 100 RNA/protein binding studies.
    elife-57264-supp1.xlsx (62.4KB, xlsx)
    Supplementary file 2. Literature survey of CRISPR nuclease binding studies, representative of high-affinity interactions.
    elife-57264-supp2.xlsx (12.8KB, xlsx)
    Transparent reporting form

    Data Availability Statement

    No datasets were generated in this work. The figures include all data, or, where most appropriate for clarity, representative data from a single experiment for every type of experiment performed.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES