Abstract
Cytoskeletal motor proteins are essential to the function of a wide range of intracellular mechano-systems. The biophysical characterization of their movement along their filamentous tracks is therefore of large importance. Toward this end, single-molecule, in vitro stepping-motility assays are commonly used to determine motor velocity and run length. However, comparing results from such experiments has proved difficult due to influences from variations in the experimental conditions and the data analysis methods. Here, we investigate the movement of fluorescently labeled, processive, dimeric motor proteins and propose a unified algorithm to correct the measurements for finite filament length as well as photobleaching. Particular emphasis is put on estimating the statistical errors associated with the proposed evaluation method, as knowledge of these values is crucial when comparing measurements from different experiments. Testing our approach with simulated and experimental data from GFP-labeled kinesin-1 motors stepping along immobilized microtubules, we show 1) that velocity distributions should be fitted by a t location-scale probability density function rather than by a normal distribution; 2) that the impossibility to measure events shorter than the image acquisition time needs to be taken into account; 3) that the interaction time and run length of the motors can be estimated independent of the filament length distribution; and 4) that the dimeric nature of the motors needs to be considered when correcting for photobleaching. Moreover, our analysis reveals that controlling the temperature during the experiments with a precision below 1 K is of importance. We believe our method will not only improve the evaluation of experimental data, but also allow for better statistical comparisons between different populations of motor proteins (e.g., with distinct mutations or linked to different cargos) and filaments (e.g., in distinct nucleotide states or with different posttranslational modifications). Therefore, we include a detailed workflow for image processing and analysis (including MATLAB code), serving as a tutorial for the estimation of motility parameters in stepping-motility assays.
Introduction
Cytoskeletal motor proteins are essential for long-range intracellular transport (1), the malfunction of which can cause a number of pathologies including neurodegenerative diseases (2). The precise characterization of motor proteins with regard to their intrinsic function and the investigation of factors that influence their behavior thus constitutes an important part of medical and biophysical research. To study motor proteins in minimal in vitro systems, fluorescence imaging of single motors (stepping along their filamentous tracks) remains at the forefront of biophysical tools (3, 4, 5, 6). However, quantitative estimation of the crucial motility parameters, namely velocity, interaction time, and run length, proves to be challenging owing to fundamental limitations in the experimental design. Because the processive run of a motor may be prematurely terminated by the end of a filament or the motor may be rendered invisible due to photobleaching of the attached fluorescent marker, so-called censored events will be part of any experimental data (7). As these censored events are prone to significantly bias the results, reliable correction methods are needed. Although corrections for finite filament length and photobleaching have been investigated individually in the past (8, 9, 10, 11, 12), the field still lacks a unified methodology. Here, we suggest an approach that addresses the above-mentioned challenges. Besides evaluating experimental data obtained from the motility of single, GFP-labeled kinesin-1 motors, we perform numerical simulations using a priori known parameters to show how statistical analysis allows for characterizing the certainty of a given measurement. Knowledge of the latter is of crucial importance when data from different measurements are to be compared. Along with our findings, we emphasize the importance of a precise temperature control during the measurements and describe a detailed workflow that facilitates the analysis of experimental data.
Materials and Methods
Motor proteins and filaments
Histidine-tagged, truncated (1–430 amino acids) rat kinesin-1 labeled with eGFP (rKin430-eGFP) was expressed and purified as previously described (13). Porcine tubulin was purified from porcine brain (Vorwerk Podemus, Dresden, Germany) using established protocols (14). Microtubules were grown for 2 h at 37°C from a 80 μL BRB80 (80 mM PIPES; Sigma-Aldrich, St. Louis, MO), pH 6.9 adjusted with KOH (Merck, Kenilworth, NJ), 1 mM EGTA (Sigma-Aldrich), and 1 mM MgCl2 (Merck) solution supplemented by 2 μM tubulin (75% unlabeled and 25% rhodamine-labeled TAMRA; Thermo Fisher Scientific, Waltham, MA), 1 mM GMP-CPP (Jena Bioscience, Jena, Germany), and 1 mM MgCl2. A quantity of 80 μL of the microtubule solution was centrifuged in a Beckman Airfuge (A95 rotor; Beckman Coulter, Brea, CA) at 10,0000 × g for 5 min. The pellet was resuspended in a volume of 200 μL BRB80T (BRB80 supplemented by 1 mM taxol; Sigma-Aldrich). The solution was kept at room temperature overnight and the 200 μL microtubule solution was centrifuged and resuspended (in 200 μL BRB80T) again before the experiment.
Single molecule stepping assay
The employed stepping assays using total internal reflection fluorescence microscopy have been extensively described by Korten et al. (15). Briefly, we performed the experiments in flow channels (16), self-built from two glass coverslips (22 × 22 mm2 and 18 × 18 mm2; Corning, Corning, NY) that were cleaned in piranha solution (H2O2/H2SO2, 3:5; both purchased from Sigma-Aldrich), silanized with 0.05% dichlorodimethylsilane in trichloroethylene (Sigma-Aldrich) and glued together by heated pieces of Parafilm M (Pechiney Plastic Packaging, Chicago, IL). The flow sequence was as follows: 1) the flow channel was filled with a solution of TetraSpeck microspheres (diameter 100 nm; Thermo Fisher Scientific) diluted 200-fold in BRB80 that were used for drift and color correction (17). 2) After 2 min, the solution was exchanged with a BRB80 solution containing 77.5 μg/mL anti-β-tubulin antibodies (SAP4G5; Sigma-Aldrich). 3) After 5 min, the surface was blocked with a solution with 1% Pluronic F-127 (Sigma-Aldrich) in BRB80 for 15 min. 4) Microtubules diluted 10-fold to prevent microtubule intersections were incubated for 5 min to bind to the tubulin antibodies. 5) The microtubule solution was finally replaced by the motility solution—BRB80 containing 10 μM taxol, 0.04 mM glucose (Sigma-Aldrich), 0.2 mg/mL glucose oxidase (SERVA Electrophoresis, Heidelberg, Germany), 0.02 mg/mL catalase (Sigma-Aldrich), 10 mM DTT (Fermentas/Thermo Fisher Scientific), 0.1 mg/mL casein (Sigma-Aldrich), and 10 mM Mg-ATP (Sigma-Aldrich)—supplemented by 4 μg/mL rKin430-eGFP. For bleaching time estimation, undiluted sparsely labeled microtubules were used (10× less rhodamine labeling) and ATP was replaced with 10 mM AMP-PNP (Sigma-Aldrich) in all solutions.
Optical imaging
Fluorescence imaging was performed using an inverted fluorescence microscope (Observer Z1; Zeiss, Jena, Germany) with a 100× oil immersion objective (APOCHROMAT; numerical aperture 1.46; Zeiss) and an additional 1.33× magnifying Optovar (custom-build). The final pixel size was 117 nm. Microtubules were observed by epifluorescence using a Lumen 200 metal arc lamp (Prior Scientific Instruments, Fulbourn, United Kingdom) with a TRITC filter set (excitation 534/30, emission 593/40, dc R561; Chroma Technology, Rockingham, VT). rKin430-eGFP motor proteins were observed in total internal reflection fluorescence mode by using a PhoxX 488 nm Laser (Omicron-Laserage, Rodgau-Dudenhofen, Germany) with a GFP filter set (excitation 470/40, emission 525/50, dc 495; Zeiss) filter set. Image acquisition was performed at 100 ms exposure time in streaming mode by an electron-multiplied charge-coupled device camera (iXon Ultra DU-897U; Andor, Belfast, Northern Ireland) in conjunction with a MetaMorph imaging system (Universal Imaging, Downingtown, PA). The temperature was measured directly in the flow channel with a small temperature sensor (IT-23; Physitemp Instruments, Clifton, NJ) connected to a multipurpose thermometer (BAT-10; Physitemp Instruments). Temperature control was implemented using a custom-made hollow brass ring (MPI-CBG Mechanical Workshop, Dresden, Germany) around the objective connected to a water bath with combined cooling and heating unit (F25-MC Refrigerated/Heating Circulator; JULABO, Seelbach, Germany). Although the electric components of the microscope setup usually increase the room temperature as well as the temperature of the microscope body, our temperature control kept the temperature in the flow channel stable within 0.5 K over hours. See the Supporting Material for further experimental considerations.
Single molecule analysis
Single kinesin-1 molecules and microtubules were tracked using FIESTA (18). Molecules that showed any pauses or stalling were disregarded. After drift and color offset correction, the molecule position was projected on the microtubule centerline (see the Supporting Material for detailed instructions on tracking with FIESTA). The resulting distance along the centerline as well as the detachment position was utilized for further evaluation in different MATLAB scripts (The MathWorks, Natick, MA). For velocity simulations, a Monte Carlo simulation of a Poisson stepper was used, and to create exponential distributions, the MATLAB function “exprnd” was employed. Renormalization of the probability density function (PDF) was done with the Symbolic Math Toolbox of MATLAB and integration to obtain the cumulative distribution function (CDF) with the functions “int” and “matlabFunction”. The evaluation of the cumulative distribution function utilized “ecdf”, which also includes the optional Kaplan–Meier estimator; for least-square fitting, “fit” was used; and for maximum likelihood estimation, “mle” was employed. Bootstrapping was done with “parfor”, included in the Parallel Computing Toolbox of MATLAB. The MATLAB code for the simulations as well as for the evaluation can be found within a compressed file in the Supporting Material (see description there).
Bootstrapping method
The statistical error of an evaluation method with a limited number of measurements can be estimated using a bootstrapping method (4). Briefly, from the data set, individual measurements are randomly selected with replacement. Here, the complete data set is always available when picking the measurement, which means that any measurement can also be selected more than once. Now, the new randomly selected data set is analyzed using the desired evaluation method. The procedure is repeated for sufficient number of repetitions (e.g., n = 100) with randomly selected data sets. The resulting bootstrapping distribution can be described by a normal distribution with the mean denoting the actual result and its SD describing the statistical error. This statistical error is only the result from random sampling and describes the error that is to be expected when repeating the experiment.
Results
Single motor protein stepping assay
Motor proteins moving along their filaments can be described theoretically as Poisson steppers (Fig. 1). A simplified model of the motor protein kinesin-1 stepping along microtubules (MTs) is shown in Fig. 1 A. Whereas the attachment rate kon = ·[Kinesin] is influenced by the motor protein concentration [Kinesin] in solution, the detachment rate koff only depends on the motor-filament interaction. This interaction is described by 1) the motor velocity v, which can be used to derive the stepping-rate kstep = v/d with d denoting the step size; 2) the interaction time τ, which can be used to derive the detachment rate koff = 1/τ; and 3) the run length R, which can be used to link the detachment rate koff to the mechano-chemical cycle of the motor protein (e.g., R = v ⋅ τ). The first challenge in the determination of these parameters can be seen in the experimental kymograph in Fig. 1 B. There, clear linear motion can be observed for some motors (e.g., red box), but it is unclear if short interactions are actual movement or unspecific interactions (e.g., blue box). In our analysis, we therefore require a motor to be visible for five or more consecutive imaging frames and to move over a distance longer than the size of two pixels without pausing. Although these experimenter-defined thresholds appear arbitrary, we will show that their choice does not affect the results.
In the following, we will describe the procedures to estimate velocity, interaction time, and run length. Estimating the mean velocity will be rather straightforward, but we will show that it is more challenging (but equally important) to estimate the associated statistical error. As motor velocity is a good reporter of the environmental conditions (foremost temperature but also ionic strength and pH), knowledge about both mean velocity and error is of importance when potentially combining data from different fields of view or different experiments. Without taking precautions, the environmental parameters can vary significantly over the course of an experiment and unwanted influences can be minimized by pooling data from measurements with nondiffering velocities only. For reducing the systematic error in estimating the interaction time and run length, data analysis needs to address additional experimental challenges such as limited filament length and photobleaching. In addition to addressing the challenges that arise in single motor protein stepping assays, we provide extensive Supporting Material that includes the workflow for image processing in FIESTA, the MATLAB scripts used for the analysis, and many other experimental considerations (e.g., temperature control). Along with the proposed workflow for estimation of the motility parameters (see Workflow for Evaluating Velocity and Workflow for Evaluating Interaction Time and Run Length), this work can be used as a tutorial for investigating processive motor proteins. To facilitate the analysis, we incorporated the estimation of the motility parameters (including the proposed corrections) into FIESTA (version 1.6 or later).
Evaluation of the velocity
The velocity of individual processive motors is determined by tracking their positions over time, X(t) and Y(t). These positions are projected on the centerline of the filament and the distance D(t) the motor moved along the filament is calculated. Fitting D(t) with a linear function D(t) = v·t + c yields the velocity v. A typical experimental velocity distribution of N = 543 kinesin-1 motors is shown in Fig. 2 A. Although upon first sight the distribution resembles a normal distribution, hypothesis testing reveals that the data cannot be described by a normal distribution (p < 0.001, Kolmogorov-Smirnov test).
To investigate the reason behind the deviation from the normal distribution, we created a Monte Carlo simulation with 10,000 traces of motor proteins stochastically stepping with a rate of kstep = 100 s−1 and step size dstep= 8 nm (koff = 0, total time per trace 20 s). We looked at the number of steps Nsteps taken by each of these simulated motor proteins at specific time points (e.g., t1 = 1 s, t2 = 3 s, etc.). At each time point, Nsteps is described by a Poisson distribution that can be approximated with normal distributions (Nsteps > 10; Fig. S1 A). Because the velocity is described by v = Nsteps· dstep /T, the mean velocities are the same at each time point, but the widths of the normal distributions vary (Fig. S1 B).
If the detachment rate is changed to koff = 0.5 s−1, each motor has a different interaction time, and thereby the velocity distribution of all motors is a mix of normal distributions with the same mean values but different widths. In general, motors with shorter interaction times have a higher variance in the velocity distribution than motors that interact longer (under the same imaging conditions, Fig. S1 B). Consequently, the observed velocity distribution is not a normal distribution.
To further adjust the simulation to more realistic experimental conditions, spatial averaging over the positions D(t) during the acquisition time of individual imaging frames (e.g., 100 ms accounting for a finite frame rate of f = 10 s−1) was performed and a positional error (due to tracking uncertainty) was incorporated by adding normally distributed noise (σ = 100 nm). Due to the finite acquisition time, no exact information about the attachment and detachment time can be extracted. Consequently, the positional averaging during acquisition of these frames would bias the estimation toward slower velocities (see the Supporting Material). Therefore, the first and last tracked frames have to be excluded in the linear regression of D(t).
After obtaining a single velocity for each motor, a mean velocity v for one experiment can be obtained by estimating the characteristic parameters of a PDF with maximum likelihood estimation (MLE). As PDF, we used a t location-scale (TLS) distribution (19) that includes a shape parameter ν in addition to the location μ and scale σ parameters, which are conventionally used to describe a normal distribution. The TLS PDF that MATLAB uses in MLE is described by the following equation:
(1) |
As can been seen in Fig. 2 A, the TLS PDF fits the velocity distribution better than the normal distribution because the shape parameter ν accounts for heavier tails (see also Fig. S2 A). To compare the statistical errors when using these PDFs, we picked out a random set of N motor proteins from our simulation (with replacement, see bootstrapping in Materials and Methods). We calculated the velocity for each motor, and estimated the mean velocity for that random set using both normal and TLS PDFs. The procedure was repeated n = 100 times and the deviation from the true velocity = 0.8 μm/s (kstep = 100 s−1 and dstep = 8 nm) was calculated:
(2) |
The relative deviation is plotted as function of the number of measurements in Fig. 2 B. Although there is only a small difference between using a normal or TLS PDF when evaluating the Monte Carlo simulation, this difference increases when evaluating time-averaged traces with additional positional error (corresponding to experimental data). It turns out that, to reach a relative error Δv/v of 2% (with Δv = , confidence interval 95%), only 200 measurements are necessary when using a TLS PDF compared to 750 measurements when using a normal PDF. Here, the difference between TLS PDF and normal PDF depends on the average number of data points in each trace available (defined by time resolution and interaction time) for each trace as well as on the positional error. On average, using either distribution yields the same mean velocity, but the TLS PDF yields a smaller relative error because it fits the velocity distribution significantly better (19) (see the Supporting Material for a more extensive investigation of the velocity evaluation).
In addition to measuring the velocity precisely, it is also essential to limit any external influence on the motility parameters, most importantly the temperature. Conventionally, velocity measurements on motor proteins are performed at “room temperature” and the actual temperature is not specified precisely. However, temperature variations can lead to small but significant velocity differences within one experiment and only by actively stabilizing the temperature in the flow channel. Using a temperature control system (within ±0.5 K; see the Supporting Material), we could remove the temperature influence on the motor stepping (Fig. 2 C). In contrast, not accounting for temperature differences when comparing data sets (e.g., from different days or labs) can lead to a grave misinterpretation of data. Even a 1 K temperature increase can influence the velocity by >5% (Fig. 2 D). The temperature effect becomes even more prominent for measurements at room temperature (e.g., data points at 24.2–25.7°C) with ∼13% increase in velocity over 1.5 K. Thus, measuring and reporting both temperature and velocity is of utmost importance when comparing the motility parameters of motor proteins. In turn, the motor velocity can be used as a control parameter to assure a constant temperature. In fact, together with our temperature control system, this strategy enabled us to pool experimental data sets from different image sequences and fields of view into a combined data set for further analysis.
Workflow for evaluating velocity
-
1)
Extract the distance versus time information for each motor protein (see the Supporting Materials and Methods for instructions on how to track and analyze single fluorescent motor proteins using FIESTA (17)).
-
2)
Fit distance versus time trace with linear regression leaving out first and last frame of the trace.
-
3)
Use MLE to fit a TLS pdf to the velocity distribution.
To calculate the error of the mean velocity ν with the TLS pdf, either use the 95% confidence interval (estimated with MLE) or bootstrapping (see Materials and Methods). The MATLAB code can be found within a compressed file (Data S1).
Evaluation of the interaction time and run length
Interaction time and run length of motor proteins are theoretically exponentially distributed (20) and our Monte Carlo simulations indeed show exponential distributions for both parameters. This can be attributed to a stochastic detachment with rate koff where the interaction time is τ = koff−1 and the run length is R = v · koff−1. To test different methods to evaluate censored data, we simulated exponential distributions and evaluated them using three different methods previously described in the literature: 1) least-squares fitting of the probability density function (LSF-PDF) where the data is binned in a histogram, and the locations and heights of the bins are fitted by an exponential PDF (20); 2) least-squares fitting of the cumulative distribution function (LSF-CDF) where the data is used to create a cumulative probability distribution that is fitted with the CDF (4); and 3) MLE, where the data is used directly to estimate the parameters of the exponential distribution. For complete exponential distributions (where all events are measurable), the MLE method yields the most precise results, because MLE can be solved analytically. In contrast, least-squares fitting involves numerical parameter optimization that is terminated when a certain tolerance is reached. The tolerance slightly decreases the precision with which the parameters are estimated. However, the experimental limitations prevent us from measuring complete exponential distributions. First, it is impossible to include motility events with τ shorter than the time resolution (in our case, 100 ms). Second, events with short interaction times might be easily discarded as noise during the evaluation procedure. Due to these missing short events, the measured exponential distributions are not complete and the evaluation method has to be adjusted: 1) using LSF-PDF, the first bin is underrepresented and needs to be disregarded when fitting the PDF (Fig. 3 A); 2) using LSF-CDF, a cutoff parameter x0 has to be introduced in the CDF (4) to account for the missing measurements (Fig. 3 B), which cutoff parameter can be set as a constant (LSF-CDF(static) (4)) or as a free fit parameter (LSF-CDF(free) (5)); and 3) using MLE, a cutoff parameter can be employed to renormalize the PDF (21). For a truly single exponential PDF, the corrected parameter can then be estimated by subtracting the cutoff x0 from all measurements and evaluating the data with MLE .
In the following, we characterize the evaluation of exponential distributions with LSF-PDF, LSF-CDF, and MLE by creating a random data set of an exponential distribution with = 2 (we disregard the difference between run length and interaction time for now because the following applies to any exponentially distributed data). Analogous to the velocity in the previous section, we analyze the deviation of the estimated mean (from n = 100 independent data sets) from the true in dependence upon the number of measurements N:
(3) |
Fig. 4 A shows the results when using the complete exponential distribution. The relative deviation depends only on the number of measurements due to the statistical nature of the exponential function. As expected, all methods show a clear behavior, but the LSF-PDF yields a higher statistical error due to “coarse graining” by binning. Here, the data is binned but the distribution of the measurements within the bins is skewed (because an exponential distribution is continuously decreasing). Therefore, the data points for fitting the PDF differ slightly from the expected exponential distribution, which leads to the higher statistical error.
If the simulation is adjusted to represent realistic experimental results (Monte Carlo simulation with spatial averaging, additional positional error, and missing short events), two methods do not show the expected behavior (see Fig. 4 B). Both LSF-CDF(static) and MLE can yield a systematic error because the cutoff is fixed and chosen rather arbitrarily. In experiments, however, the exact cutoff could be hidden within the limited time resolution or the limited tracking accuracy. For the example given in Fig. 4 B, the shortest interaction time was 0.5 s (at least five frames with a time resolution of 100 ms), but because the interaction can only be measured for full frames, a number of motors with interaction times between ∼0.45 and 0.5 s will also be also present in the distribution. Therefore, the correct cutoff should actually be smaller than 0.5 s (more discussion on choosing the cutoff in the Supporting Material). The LSF-PDF method is not influenced by the missing short events as long as all bins that are affected by the missing events are disregarded, but again yields a higher statistical error due to coarse-graining. LSF-CDF(free) avoids choosing a fixed cutoff x0 (by adding it as an additional fit parameter) and yields precise results for the modified distribution. Admittedly, employing an additional fit parameter comes at the price that it may distort the results when evaluating more complex distributions (e.g., double exponential). Note that the statistical error cannot be smaller than , even if the confidence interval of the fitting is smaller. This fitting error from least-squares-fitting occurs in addition to the statistical error that results from the random sampling of the exponential distribution. Hence, it is more important to measure a large number of data points rather than improving the precision of the individual measurements. The code for fitting a particular experimental data set can be found within a compressed file in the Supporting Material, along with an extensive discussion on the creation of the CDF, the least-squares-fitting, and the cutoff issue. In the following, we only use LSF-CDF(free) to evaluate exponential distributions. We note that MLE would work similarly (including correction for censored events). For comparison, we included both evaluation methods in the MATLAB code and also compared their statistical errors using simulations (see the Supporting Material).
Correction for finite filament length
Because each filament has a finite length, some motor proteins are destined to run into the end of their track and detach (Fig. 5 A). These events influence the measurement of run length (or interaction time) and introduce a dependence on the filament length (8). Therefore, identical motor proteins moving along longer filaments would have a higher observed run length (or interaction time) than motors stepping on short filaments. Here, we present a correction for these so-called “end-events” by using the Kaplan–Meier estimator (22) in LSF-CDF(free) to adjust the cumulative probability distribution for these censored events. Because it is possible to image the filaments and track single motor proteins, the detachment positions along the filament can be determined. Any event where a motor protein detaches near the end of a filament (within one pixel), is then scored as end-event (the exact procedure to calculate the adjusted cumulative probability distribution can be found in the Supporting Material). To verify the proposed method, we simulated events of motor proteins landing on filaments with a random length distribution (assuming a Schulz distribution (23), see equation in the Supporting Material) and assessed which motors reach the filament end. These traces are included in the analysis as censored events.
Fig. 5 B compares the run lengths from our simulated data with and without correction. Note that neglecting the length correction leads to a systematic error that influences the measurement because a certain number of motor proteins will always reach the end even for long filaments. The method was also verified experimentally when we tracked 5208 kinesin-1 motor proteins stepping along MTs (same data as in Fig. 2 C) in eight different fields of view (temperature 23.5−24°C). Fig. 5 C shows the run lengths grouped according to the length of their MTs (combined data set was separated in groups of N ≈ 1000). A dependence of the observed run length on the MT length is seen in the uncorrected data, whereas the correction using the Kaplan–Meier estimator yields the same run length for all groups as well as for the combined distribution. The errors can be estimated using bootstrapping, where measurements are randomly selected (with replacement, see Materials and Methods), classified for end-events and then evaluated using LSF-CDF(free) with the Kaplan–Meier estimator. After sufficient repetitions (n = 100), the resulting bootstrapping distribution yields the mean run length and the statistical error after correction ΔR = 2σR. Therefore, any influence of the Kaplan–Meier estimator on the statistical error can be accounted for. An alternative method to verify length correction can be found in the Supporting Material.
Correction for photobleaching
Another experimental limitation is the statistical nature of photobleaching after a fluorophore has emitted a certain number of photons. Photobleaching influences the measurement of the observed interaction time (or run length) and introduces a dependence on the bleaching rate kbleach (see Fig. 6 A). Even though the lifetime of the fluorophores can be increased by adding antifade solutions (15), the effect of photobleaching cannot be eliminated fully in the experiments. Here, we describe the bleaching probability as a combination of one- and two-fluorophore bleaching (see Eq. 4), because even though dimeric, GFP-labeled motor proteins are always tagged with two fluorophores, not all of these fluorophores are active (see the Supporting Material):
(4) |
(5) |
In Eq. 4, the parameter ρ denotes the fraction of motors with only one active fluorophore. Because combining the evaluation of detachment and photobleaching is not trivial (Eq. 5) and the corresponding addition of more parameters into the LSF-CDF leads to unstable solutions, we introduce a different approach by assigning a bleaching probability to each individual motor protein according to its interaction time. Afterwards, the data is analyzed with a single exponential using LSF-CDF(free) several times (n = 100) and in each iteration different events will be randomly scored as bleaching events in agreement with their bleaching probability. Combined with the end-events, these censored events are corrected for by using the Kaplan–Meier estimator. This way, the bleaching correction is averaged over many iterations of the run length or interaction time estimation. Additionally, the data can be resampled in every iteration to combine photobleaching correction with the bootstrapping method that now not only yields a corrected measurement, but also the statistical error of the result.
To verify the proposed correction, we extended our simulations to include photobleaching with a mixture of one- and two-fluorophore bleaching, ρ = 0.5. Fig. 6 B shows the dependence of the observed and corrected interaction times on the bleaching time. Here, only when correcting for both finite filament length and photobleaching, the expected interaction of = 2 s was estimated. We tested the photobleaching correction experimentally by imaging single kinein-1 motor proteins in stepping assays at different laser excitation intensities. Here, for each intensity, the bleaching rate kbleach as well as ratio ρ was measured in a second flow channel (on the same coverslip) by immobilizing GFP-labeled kinesin-1 on MTs with AMP-PNP (a nonhydrolysable analog of ATP). The evaluation of the photobleaching using FIESTA is described in the Supporting Material. Results in Fig. 6 C validate the correction for photobleaching, because the dependence of the interaction time on the laser intensity is removed and the estimation of a combined interaction time is possible.
Workflow for evaluating interaction time and run length
-
1)
Extract the interaction time and run length information for each motor protein (see the Supporting Materials and Methods for instructions on how to track and analyze single fluorescent motor proteins using FIESTA).
-
2)
Score end-events according to their detachment position in relation to the filament end.
-
3)
Estimate bleaching probability from the separate channel with immobilized motors (see the Supporting Materials and Methods for photobleaching measurement with FIESTA).
-
4)
Assign bleaching-events, create cumulative probability distribution (censored end- and bleaching-events) with the Kaplan-Meier-Estimator and use LSF-CDF (free) to estimate interaction time and length.
-
5)
Bootstrapping: repeat step 4 with different randomly selected traces. Reassign bleaching events randomly in each iteration according to the bleaching probability.
The MATLAB code for evaluation of velocity, bleaching time, interaction time, and run length can be found within a compressed file (Data S1; description in the Supporting Materials and Methods). Note that censored events also include events where the motor proteins move in or out of the field of view, as well as traces that start or end in the first or last imaging frame, respectively.
Discussion
We presented a method to investigate the motility parameters of single, fluorescently labeled motor proteins in stepping assays, including experimental enhancements, software for tracking and analysis, and precise evaluation of the measurements. For verification, simulations were performed to check whether the methods yield the true results and different methods for evaluation and correction were compared with respect to their systematic and statistical error. The best methods were then used to characterize the temperature dependence of the velocity as well as the influence of the filament length and photobleaching on interaction time and run length experimentally. Furthermore, correction methods are proposed to minimize the influence of the experimental setup on the obtained results. These corrections are shown to work in simulations as well as with experimental data and show the advantage to previously used methods for the evaluation of data from single fluorescent motor proteins.
First, we found that a measured velocity distribution from single motor proteins cannot be described as a normal distribution. When assuming a simple Poisson–Stepper model the velocity of motors with longer interaction times can be estimated more precisely then for short events, which leads to deviations from the normal distribution due to heavier tails. Still, the normal distribution is commonly used when evaluating velocity distributions (3, 4, 5, 6), even though Norris et al. (6) already show clear deviations (Fig. S2). Here, we propose using a TLS distribution for estimation of the mean velocity because the TLS PDF fits better to the simulated as well as the experimentally measured velocity distributions. Compared to the normal distribution the TLS method yields a smaller statistical error. Because it is possible to precisely measure the velocity (Δv/v < 0.01 with N > 1000) this motility parameter can be used as a control parameter to verify that the temperature indeed was stable even without the information of the additional temperature sensor in the flow channel.
Second, we compared different methods to evaluate exponential distributions and we found that using the LSF-CDF yields the best results. Here, the introduction of a cutoff x0 as a free fit parameter is sufficient to account for missing short events. We note that the MLE method can also be used if the PDF is renormalized to account for the missing short events (21), but the a priori knowledge of the cutoff parameter can essentially introduce a systematic error and bias the results. Nonetheless, because MLE can also account for censored events, the LSF-CDF(free) method can easily be replaced by MLE. (Note: The MATLAB code in the Supporting Material includes both methods.) We compared both methods using simulations (see the Supporting Material) and found that MLE has a slightly smaller statistical error (mainly due to one less fitting parameter). However, when evaluating noisy data (simulated or experimental) qualitatively the fit of the PDF to the distribution was strongly influenced by the choice of the cutoff parameter. While the actual method to be used is a matter of choice we recommend using LSF-CDF(free) or extending MLE to allow for estimation of the best cutoff parameter (e.g., by machine learning).
Third, we propose specific corrections for finite filament length and photobleaching using the Kaplan–Meier estimator with the LSF-CDF(free) method. Because we can track both the fluorescently labeled motors and the filaments, it is possible to determine end-events, which censor some specific events in a data set. By using the Kaplan–Meier estimator we can adjust the CDF for these end-events and thereby correct for the finite filament length. The advantage over previously published methods (8), which use a correction term that includes the average filament length of the experiment, is that the underlying length distribution of the filaments does not influence the evaluation. Therefore, it does not matter if motor proteins are stepping on one particular filament or on a random set of filaments. In addition, variations in filament length between different fields of view do not affect the results. Correction for photobleaching is not as trivial as the filament length correction. Previously, corrections for photobleaching assumed the simple relation kobserved = koff + kbleach (9, 24), with kobserved describing the observed detachment rate being the superposition of the real detachment rate koff and the bleaching rate kbleach. There, by either estimating the bleaching rate (9) or extrapolating kobserved using different laser intensities (24), a corrected interaction time could be calculated. Unfortunately, these methods fail when the molecules of interest are labeled with more than one fluorophore as is the case with dimeric motor proteins where a GFP is expressed on each monomeric motor unit. Most importantly, usually not all fluorophores are active and a mixture of motor proteins with either one or two active fluorophores is observed (12). Therefore, we propose to measure the bleaching time as well as the ratio of one to two fluorophore bleaching to calculate the bleaching probability for a specific experiment. Over many iterations, we can now randomly assign bleaching-events according to the interaction time of the motor proteins. By including them in the censored events, our correction for both, photobleaching and finite filament length, is achieved. Consequently, our simulations show slight deviations from the expected true values only for bleaching times in the range of—or shorter than—the interaction time.
How well does our proposed method perform when 1) the interaction time of the motor is significantly longer than the bleaching time of the fluorophores, 2) the run lengths of the motors are significantly larger than the filament lengths, and 3) the motors exhibit frequent switching between movement and stalling? With regard to the limitations originating from photobleaching, adjustment of the imaging conditions (e.g., exposure time and light intensity, such that the bleaching rates are lower than the motor detachment rates) are effective means to reduce the systematic error. Furthermore, novel photostable fluorescent proteins as well as functionalized fluorophores (e.g., GFP-boosters based on nanobodies) will reduce the influence of photobleaching. In any case, comparing the motility parameters obtained under different imaging conditions can directly show if photobleaching influenced the results. With regard to investigating highly processive motor proteins, which reach the filament ends in most of the cases, the only solution is to use longer MTs. Tweaking the protocols for MT polymerization (e.g., by extending the growths times in conjunction with lowering the tubulin concentration) allows for the generation of MTs with lengths above 50 μm, sufficient to reliably determine the run lengths in any cellular context. Moreover, discarding all motility events from filaments with lengths below a threshold is possible, because our proposed correction algorithms do not require a certain MT length distribution. In general, the more motor traces are censored the less accurate the precision will be. With respect to super-processive motility (such as Kinesin-3 (25) and Kinesin-8 (26)) we note that the run length is not a suitable motility parameter and rather other measures (such as the motor’s end-reach probability in dependence of the landing position) should be applied. With regard to motors that exhibit frequent stalling, our corrections for finite filament length and photobleaching do stay valid (Kaplan–Meier estimator can still be used), but the motors cannot be described by simple Poisson steppers anymore. Therefore, the underlying models will have to be adjusted accordingly, e.g., by using double-exponential functions for interaction time and run length.
We conclude, to precisely characterize the stepping of motor proteins on their filaments, the following steps are necessary:
-
1.
The temperature should be stable throughout the experiments to combine and evaluate many traces at the same condition in one data set. Because even small changes in temperature influence the motility parameters significantly, it is essential to measure the assay temperature precisely. Here, measuring and specifying the room temperature is not sufficient as the actual temperature in the flow channel can be up to 3 K higher due to microscope-internal heat sources, such as electrical components and light sources. Hence, any results given for velocity, run length, and interaction time should include the temperature in the flow channel within ±1 K. Furthermore, to investigate different motor or filament populations, we recommend incorporating them in the same flow channel or at least on the same coverslip to minimize any temperature differences in the experiments. A detailed description of the temperature control can be found in the Supporting Material, which is easy to implement using any mechanical workshop (SolidWork files are available on request).
-
2.
Corrections for finite filament length and photobleaching need to be included, because otherwise both interaction time and run length are underestimated (e.g., see Zimmermann et al. (27)). Here, the systematic errors can be on the same order as the statistical error and therefore careful consideration of the evaluation method is essential. For that reason, we provide an extensive description of the evaluation method, including the MATLAB code, to efficiently measure a sufficient number of motor proteins (to reduce the statistical error) and to address limitations in the design of the experimental assay (to remove the systematic error).
-
3.
The motility parameters velocity, interaction time, and run length should always be estimated when comparing different data sets. Motor proteins could have the same run length, but different velocities (kstep) and interaction times (koff), so only comparing one motility parameter might result in the misconception that even if the experiment yields the same result for different motors, the underlying motility mechanism could still be different.
In summary, the described evaluation should allow for a better statistical comparison of motor proteins influenced by external factors, e.g., ionic strength, ATP concentration, nucleotide state of the filaments, or posttranslational modifications of the filaments. Furthermore, comparison of different motor proteins as well as motor populations, e.g., structural differences or binding of regulatory proteins, will then become possible. We believe the methodology developed in our work will provide a reliable framework for the evaluation of a wide range of experiments with single fluorescently labeled motor proteins.
Author Contributions
F.R. and S.D. conceived and designed the experiments. F.R. and L.K. performed the experiments and analyzed the data. F.R. contributed reagents/materials/analysis tools. F.R. and S.D. wrote the article.
Acknowledgments
We thank Friedrich Schwarz for help with the design and the mechanical workshop at the Max Planck Institute for Molecular Cell Biology and Genetics (MPI-CBG) for manufacturing the cooling/heating ring for the objective; Corina Bräuer for technical support; Aniruddha Mitra, Jens Ehrig, Rahul Grover, and Georg Krainer for comments on the manuscript; and the whole Diez Lab for fruitful discussions.
We acknowledge financial support from the European Research Council (Starting Grant 242933), the Deutsche Forschungsgemeinschaft (DFG) through the Center for Advancing Electronics Dresden (CFAED), and the Technische Universität Dresden.
Editor: E. Michael Ostap.
Footnotes
Supporting Materials and Methods, eight figures, three tables, and one data file are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(17)31036-6.
Supporting Material
References
- 1.Vale R.D. The molecular motor toolbox for intracellular transport. Cell. 2003;112:467–480. doi: 10.1016/s0092-8674(03)00111-9. [DOI] [PubMed] [Google Scholar]
- 2.Morfini G., Schmidt N., Kins S. Conventional kinesin: biochemical heterogeneity and functional implications in health and disease. Brain Res. Bull. 2016;126:347–353. doi: 10.1016/j.brainresbull.2016.06.009. [DOI] [PubMed] [Google Scholar]
- 3.Vale R.D., Funatsu T., Yanagida T. Direct observation of single kinesin molecules moving along microtubules. Nature. 1996;380:451–453. doi: 10.1038/380451a0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Thorn K.S., Ubersax J.A., Vale R.D. Engineering the processive run length of the kinesin motor. J. Cell Biol. 2000;151:1093–1100. doi: 10.1083/jcb.151.5.1093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Walter W.J., Beránek V., Diez S. Tubulin acetylation alone does not affect kinesin-1 velocity and run length in vitro. PLoS One. 2012;7:e42218. doi: 10.1371/journal.pone.0042218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Norris S.R., Núñez M.F., Verhey K.J. Influence of fluorescent tag on the motility properties of kinesin-1 in single-molecule assays. Biophys. J. 2015;108:1133–1143. doi: 10.1016/j.bpj.2015.01.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gijbels I. Censored data. WIREs Comp. Stat. 2010;2:178–188. [Google Scholar]
- 8.Thompson A.R., Hoeprich G.J., Berger C.L. Single-molecule motility: statistical analysis and the effects of track length on quantification of processive motion. Biophys. J. 2013;104:2651–2661. doi: 10.1016/j.bpj.2013.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Pierce D.W., Vale R.D. Assaying processive movement of kinesin by fluorescence microscopy. Methods Enzymol. 1998;298:154–171. doi: 10.1016/s0076-6879(98)98016-8. [DOI] [PubMed] [Google Scholar]
- 10.Kawaguchi K., Ishiwata S. Temperature dependence of force, velocity, and processivity of single kinesin molecules. Biochem. Biophys. Res. Commun. 2000;272:895–899. doi: 10.1006/bbrc.2000.2856. [DOI] [PubMed] [Google Scholar]
- 11.Böhm K.J., Stracke R., Unger E. Effect of temperature on kinesin-driven microtubule gliding and kinesin ATPase activity. FEBS Lett. 2000;466:59–62. doi: 10.1016/s0014-5793(99)01757-3. [DOI] [PubMed] [Google Scholar]
- 12.Lakämper S., Kallipolitou A., Meyhöfer E. Single fungal kinesin motor molecules move processively along microtubules. Biophys. J. 2003;84:1833–1843. doi: 10.1016/S0006-3495(03)74991-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rogers K.R., Weiss S., Cross R. KIF1D is a fast non-processive kinesin that demonstrates novel K-loop-dependent mechanochemistry. EMBO J. 2001;20:5101–5113. doi: 10.1093/emboj/20.18.5101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Castoldi M., Popov A.V. Purification of brain tubulin through two cycles of polymerization-depolymerization in a high-molarity buffer. Protein Expr. Purif. 2003;32:83–88. doi: 10.1016/S1046-5928(03)00218-3. [DOI] [PubMed] [Google Scholar]
- 15.Korten T., Nitzsche B., Diez S. Fluorescence imaging of single kinesin motors on immobilized microtubules. Methods Mol. Biol. 2011;783:121–137. doi: 10.1007/978-1-61779-282-3_7. [DOI] [PubMed] [Google Scholar]
- 16.Gell C., Bormuth V., Howard J. Microtubule dynamics reconstituted in vitro and imaged by single-molecule fluorescence microscopy. Methods Cell Biol. 2010;95:221–245. doi: 10.1016/S0091-679X(10)95013-9. [DOI] [PubMed] [Google Scholar]
- 17.Leduc C., Ruhnow F., Diez S. Detection of fractional steps in cargo movement by the collective operation of kinesin-1 motors. Proc. Natl. Acad. Sci. USA. 2007;104:10847–10852. doi: 10.1073/pnas.0701864104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ruhnow F., Zwicker D., Diez S. Tracking single particles and elongated filaments with nanometer precision. Biophys. J. 2011;100:2820–2828. doi: 10.1016/j.bpj.2011.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hughes J., Shastry S., Fricks J. Estimating velocity for processive motor proteins with random detachment. J. Agric. Biol. Environ. Stat. 2013;18:204–217. doi: 10.1007/s13253-013-0131-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Block S.M., Goldstein L.S.B., Schnapp B.J. Bead movement by single kinesin molecules studied with optical tweezers. Nature. 1990;348:348–352. doi: 10.1038/348348a0. [DOI] [PubMed] [Google Scholar]
- 21.Woody M.S., Lewis J.H., Ostap E.M. MEMLET: an easy-to-use tool for data fitting and model comparison using maximum-likelihood estimation. Biophys. J. 2016;111:273–282. doi: 10.1016/j.bpj.2016.06.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kaplan E.L., Meier P. Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 1958;53:457–481. [Google Scholar]
- 23.Jeune-Smith Y., Hess H. Engineering the length distribution of microtubules polymerized in vitro. Soft Matter. 2010;6:1778–1784. [Google Scholar]
- 24.Mashanov G.I., Tacon D., Molloy J.E. Visualizing single molecules inside living cells using total internal reflection fluorescence microscopy. Methods. 2003;29:142–152. doi: 10.1016/s1046-2023(02)00305-5. [DOI] [PubMed] [Google Scholar]
- 25.Soppina V., Norris S.R., Verhey K.J. Dimerization of mammalian kinesin-3 motors results in superprocessive motion. Proc. Natl. Acad. Sci. USA. 2014;111:5562–5567. doi: 10.1073/pnas.1400759111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Varga V., Helenius J., Howard J. Yeast kinesin-8 depolymerizes microtubules in a length-dependent manner. Nat. Cell Biol. 2006;8:957–962. doi: 10.1038/ncb1462. [DOI] [PubMed] [Google Scholar]
- 27.Zimmermann D., Santos A., Rock R.S. Actin age orchestrates myosin-5 and myosin-6 run lengths. Curr. Biol. 2015;25:2057–2062. doi: 10.1016/j.cub.2015.06.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.