Skip to main content
BioMedical Engineering OnLine logoLink to BioMedical Engineering OnLine
. 2007 Jun 28;6:24. doi: 10.1186/1475-925X-6-24

A finite element model for protein transport in vivo

Kouroush Sadegh Zadeh 1,2,, Howard C Elman 1, Hubert J Montas 2, Adel Shirmohammadi 2
PMCID: PMC1940256  PMID: 17598901

Abstract

Background

Biological mass transport processes determine the behavior and function of cells, regulate interactions between synthetic agents and recipient targets, and are key elements in the design and use of biosensors. Accurately predicting the outcomes of such processes is crucial to both enhancing our understanding of how these systems function, enabling the design of effective strategies to control their function, and verifying that engineered solutions perform according to plan.

Methods

A Galerkin-based finite element model was developed and implemented to solve a system of two coupled partial differential equations governing biomolecule transport and reaction in live cells. The simulator was coupled, in the framework of an inverse modeling strategy, with an optimization algorithm and an experimental time series, obtained by the Fluorescence Recovery after Photobleaching (FRAP) technique, to estimate biomolecule mass transport and reaction rate parameters. In the inverse algorithm, an adaptive method was implemented to calculate sensitivity matrix. A multi-criteria termination rule was developed to stop the inverse code at the solution. The applicability of the model was illustrated by simulating the mobility and binding of GFP-tagged glucocorticoid receptor in the nucleoplasm of mouse adenocarcinoma.

Results

The numerical simulator shows excellent agreement with the analytic solutions and experimental FRAP data. Detailed residual analysis indicates that residuals have zero mean and constant variance and are normally distributed and uncorrelated. Therefore, the necessary and sufficient criteria for least square parameter optimization, which was used in this study, were met.

Conclusion

The developed strategy is an efficient approach to extract as much physiochemical information from the FRAP protocol as possible. Well-posedness analysis of the inverse problem, however, indicates that the FRAP protocol provides insufficient information for unique simultaneous estimation of diffusion coefficient and binding rate parameters. Care should be exercised in drawing inferences, from FRAP data, regarding concentrations of free and bound proteins, average binding and diffusion times, and protein mobility unless they are confirmed by long-range Markov Chain-Monte Carlo (MCMC) methods and experimental observations.

Background

Transport of mass, energy, and momentum has a crucial role in many branches of science and engineering. In biological systems, transport phenomena are central to the biological processes that take place in different parts of organisms. They determine the behavior and function of cells, tissues, and organs, and regulate interactions between synthetic agents (e.g. drugs) and recipient targets. These phenomena are crucial elements in the design and use of biosensors, high density cell culture, filtration units for kidney dialysis, heart-lung bypass machines, and membrane oxygenators in human medical care and ion selective electrodes, pH-meters, electrical conductivity meters, and time domain reflectometery used in biosystem analysis. Transport processes are critical in the removal of toxins from the blood, remediation of impaired water bodies (sources of waterborne diseases), and bioremediation of contaminated landscape [1,2].

Fluorescence Recovery after Photobleaching (FRAP) is one of the most widely used experimental protocols to study biological transport processes such as diffusion and reaction [3-30]. FRAP is a straightforward technique used to monitor the movement of fluorescence molecules. These molecules can absorb light of one wavelength (blue for instance) and emit light of another (e.g. green). However, if exposed to repeated cycles of excitation-emission, they lose their ability to emit fluorescence. This phenomenon is called "photobleaching" or "photochemical bleaching" [24]. In this technique a small region of living cell containing Green Fluorescent Protein (GFP)-tagged protein is exposed to a brief but intense laser beam, produced by a laser scanning confocal microscope, to irreversibly inactivate fluorescence emission in that region. Before exposure to the light, the living cell is in equilibrium with a uniform population of fluorescence [25]. Photobleaching creates two different populations of fluorescence molecules, which are spatially separated at the beginning of the experiment. Unbleached molecules from the undisturbed area move toward the bleached region and the rate of fluorescence recovery is measured as a function of time. The result is a noisy graph known as a recovery curve. However, because of the noise, the original graph by itself is not suitable for quantitative study of the dynamics of living cells. The FRAP community generally uses a data processed normalized average fluorescence recovery curve that has less noise. By analyzing the recovery curve, one can quantify how many fluorescent photons return to the bleached area in comparison to the amount of light that was there before photobleaching. This is known as percent recovery. The other question that can be addressed is that of how fast the fluorophores move toward the bleached area. This is a measure of the free molecular diffusion coefficient of the bio-macromolecule under study.

One of the first attempts to estimate bio-macromolecule mass transport and binding rate parameters using in vivo information was carried out by Kaufmann and Jain [13]. Sprague et al. [25] developed a diffusion-reaction model to simulate FRAP experiment but the solution is in Laplace space and requires numerical inversion to return to real time. The model presented recently by Lele et al. [26] properly respects cell boundaries but is in the form of a Fourier-Bessel series and can suffer from Gibbs phenomena. Carrero et al. (2003, 2004) presented an excellent review on the effects of boundary conditions, influence of the membrane, and the location of the photobleaching on the estimation of diffusion coefficients for diffusing biomolecules in a bounded domain [24,28]. They showed that overestimations or underestimations can result from ignoring this influence [24]. Beaudouin et al. [29] used a diffusion-reaction models to study mobility of five chromatin-interacting proteins inside living cells. They found that transient interactions are common for chromatin proteins. Individual proteins locally sample chromatin for binding sites rather than diffusing globally followed by binding at random nuclear positions. They concluded that complementary procedures are needed to measure transient biochemical interactions in living cells.

Although experimental methods are representative of the biological system, they are expensive, tedious and time-consuming. An alternative approach is to use mathematical modeling. In this regard, several sophisticated mathematical models have been developed to predict and simulate the fate and transport of drugs and bio-macro-molecules in biological systems. However, the use of these models is not an easy task since they contain numerous parameters that need to be determined before the model(s) can be used for the considered situation. The success of model predictions depends largely on the proper representation of relevant processes, uncertainty in model parameters [31,32], and parameter identification which is a critical step in the modeling process. Difficulties in model calibration and parameter identification are quite common in modeling mass transport problems in biological systems [1,30].

The main objective of this paper is to develop a mass-lumped Galerkin-based finite element model (FEM) to solve a system of two partial differential equations governing protein transport and binding in living cells and couple it with the Osborne-Moré [33,34] extended version of the Levenberg-Marquardt [35,36] algorithm and an experimental data set, obtained by the Fluorescence Recovery after Photobleaching (FRAP) technique, to quantify bio-macromolecule diffusion coefficient and binding rate parameters. The applicability of the developed FEM-based inverse modeling strategy is illustrated by simulating the mobility and binding of GFP-tagged glucocorticoid receptor in the nucleoplasm of mouse adenocarcinoma.

Methods

Direct problem

A one site-mobile-immobile model was selected as the direct (forward) problem to describe bio-macromolecular diffusion inside living cell in cylindrical coordinate system [1,30]:

Ft(ρ,t)=DF2Fρ2(ρ,t)+DF1ρFρ(ρ,t)KaF(ρ,t)+KdC(ρ,t)Ct(ρ,t)=Ka*F(ρ,t)KdC(ρ,t) (1)

Since a circular bleach spot was used to bombard the cell and to track the fluorescently tagged biomolecules inside bleach spot during the time course of a FRAP experiment, system (1) was written in cylindrical coordinate system.

Furthermore, the binding reaction was modeled by primary rate kinetic or single binding site model [1,13,25,26,30]:

F+SKdKaC (2)

subject to the following initial and Neumann boundary conditions:

F(ρ,0)={0,Feq,0<ρww<ρRC(ρ,0)={0,Ceq,0<ρww<ρRFρ|ρ=0,R=0Cρ|ρ=0,R=0

where F is the concentration of free biomolecule, S is the concentration of vacant binding sites, C is the concentration of the bound complex(C = FS), DF is the molecular diffusion coefficient (L2T-1) of free biomolecule, Ka is the free biomolecule-vacant binding site association rate coefficient (T-1), Kd is dissociation rate coefficient (T-1), Ka = KaS is the pseudo-association rate coefficient, ρ is radial coordinate (L) in the cylindrical coordinate system, w is the radius of the bleached area, R is the length of the spatial domain, and Feq and Ceq are equilibrium concentration of F and C [25,30]:

Feq=KdKa+Kd,Ceq=KaKa+Kd

The initial condition implies that photochemical bleaching inactivates the fluorescence tag on the biomolecules in the bleached area but does not change the concentrations of free and bound biomolecules or vacant binding sites. The boundary conditions imply that the diffusive biomolecule flux is zero at the center of the bleach spot and at the cell or nucleus membrane during time course of a FRAP experiment [30].

The forward problem was solved by the finite element method. The weak formulation of the dependent variables F and C were approximated using piecewise linear approximating functions [1,31,37-39]:

F'(ρ,t)=j=1NFj(t)φj(ρ)C'(ρ,t)=j=1NCj(t)φj(ρ) (3)

in which N is total number of nodes in the spatial domain, ϕj(ρ) are the selected linear basis functions, and Fj(t) and Cj(t) are the associated time-dependent unknown coefficients that represent the solution of Equation (1) at nodes within the domain.

Substitution of expressions (3) into Equation (1) will not satisfy the partial differential equation and hence will produce a residual. The goal of the finite element approximation is to minimize this error. This can be accomplished by introducing the weight function, ϕi(ρ), and setting the integral of the weighted residuals to zero. In the Galerkin method, which was used in this study, the weighting functions are chosen to be identical to the basis function [1,31]:

Ω[F't(ρ,t)DF2F'ρ2(ρ,t)DF1ρF'ρ(ρ,t)+KaF'(ρ,t)KdC'(ρ,t)]φi(ρ)dΩ=0Ω[C't(ρ,t)KaF'(ρ,t)+KdC'(ρ,t)]φi(ρ)dΩ=0 (4)

where Ω is the study domain. Applying Green's first identity [37,39] to equation (4) yields:

eΩeF't(ρ,t)φi(ρ)dΩ+DFeΩeF'ρ(ρ,t)φi(ρ)ρdΩDFeΩeρF'r(ρ,t)φi(ρ)dΩ+KaeΩeF'(ρ,t)φi(ρ)dΩKdeΩeC'(ρ,t)φi(ρ)dΩ=qfneΩeC't(ρ,t)φi(ρ)dΩKaeΩeF'(ρ,t)φi(ρ)dΩ+KdeΩeC'(ρ,t)φi(ρ)dΩ=qcn (5)

in which Ωe is the domain of element and qfn and qcn are fluxes of the free and bound bio-macromolecules across the boundary out of the element, respectively.

The time derivatives in equation (5) were approximated using an Euler time-marching algorithm (backward finite difference scheme). Inserting equation (3) into equation (5) and integrating over the elements produces a system of time-dependent ordinary differential equations which can be formulated in matrix form:

[A]{F}n+1{F}nΔt+[B]{F}n+1+[E]{F}n+1[H]{C}n+1={Qf}n+1[A]{C}n+1{C}nΔt[E]{F}n+1+[H]{C}n+1={Qc}n+1 (6)

where:

[A1]=eΩeφiφjdΩ[B]=eDfΩeφiρφjρdΩeDfρeΩeφiρφjdΩ[E]=eKaΩeφiφjdΩ[H]=eKdΩeφiφjdΩ{Qf}={0,0,.........,0,qfn}{Qc}={0,0,.........,0,qcn}

in which ρe is the radial position of the centroid of element e.

Parameter optimization

The inverse problem was formulated as a nonlinear optimization problem in which model parameters p = [Df, Ka, Kd] were optimized by minimizing a penalty function representing the discrepancy between the observed and predicted average fluorescence intensity recovery time series inside the bleach spot [1]. If the measurement errors asymptotically follow a multivariate normal distribution with zero mean and covariance matrix,V, the likelihood function can be formulated as [40]:

L(p)=(2π)N/2det[V]1/2exp[12(UU(p))TV1(UU(p))] (7)

where L(β) is the likelihood function, N is the number of observations, p is the vector of the parameters being optimized, U* is a vector and/or matrix of observations, and U is a corresponding vector and/or matrix of model predictions as a function of the parameters being optimized. This vector is obtained by solving the forward problem. The maximum likelihood estimator consists of those values of the unknown parameters that maximize the magnitude of the same likelihood function [40]. Since logarithm is a monotonically increasing function of its argument (the value of p that maximizes L(p) also maximizes ln L(p)), and because ln L(p) is simpler and much easier to use than L(p) itself, therefore ln L(p) is usually used in optimization:

lnL(p)=N2ln(2π)12det[V]12(UU(p))TV1(UU(p))] (8)

The maximum of the likelihood function must satisfy the set of equations ln L(p)/∂p = 0. When the error covariance matrix is known, maximization of equation (8) is equivalent to the minimization of the following weighted least square problem [1]:

ϕ(p) = [(U* - U(p))TV-1(U* - U(p))] (9)

where ϕ(p) is the objective or penalty function. If there is information about the values and distributions of parameters, it can be incorporated in the objective function as well [1]:

φ(p)=[(UU(p))TV1(UU(p))]+[(pp^)TVp1(pp^)] (10)

in which p* is parameter vector containing a priori information, p^ is the corresponding predicted parameter vector, and Vp is the covariance matrix for the parameter vector. This kind of optimization is known as Bayesian estimation. The second term in equation (10), which is sometimes called the plausibility criterion [1] insures that the optimized values of the parameters remain in some feasible region around p*. Matrices V and Vp, which are sometimes called weighting matrices, provide information about the measurement accuracy as well as any possible correlation between measurement errors and between parameters.

A limitation of equation (10) is that the error covariance matrix is generally not known. A common approach to overcoming this problem is to make some a priori assumptions about the structure of the error covariance matrix. In the absence of any additional information regarding the accuracy of input data, the simplest and most recommended way is to assume that observation errors are uncorrelated which implies setting V equal to the identity matrix and Vp to zero. In this case the optimization problem collapses to the well known ordinary least squares formulation [41,42]:

φ(p)=12r(p)Tr(p) (11)

where r is the residual (differences between the observed and predicted state variable) column vector.

Minimization of equation (11) was carried out iteratively by first starting with an initial guess of parameter vector, {p(k)} and updating it at each iteration until the termination criteria were met [1,31,32]:

p(k+1) = p(k) + α(k)Δp(k) (12)

where α(k) is a scalar step length and Δp(k) is the direction of the search or step direction [41].

Using QR decomposition [43] the linear least square problem below, which is the Osborne-Moré extended version of the Levenberg-Marquardt algorithm, was solved to obtain the search direction in each iteration:

min(r(pk)0)+(J(p)k(λkDk)12)Δpk2 (13)

where λ is a positive scalar known as Marquardt's parameter or Lagrange multiplier [36], J is the Jacobian or sensitivity matrix, and D is a positive definite scaling matrix that ensures the descent property of the algorithm even if the initial guess is not "smart". For non-zero values of λ, the Hessian approximation is always a positive definite matrix, which ensures the descent property of the algorithm [41].

A combination of "one-sided" and "two-sided" finite difference methods [30-32] was used to calculate the partial derivatives of the state variable with respect to model parameters and to construct the Jacobian matrix. The "One-sided" finite difference method estimates the partial derivatives of the state variable with respect to model parameters by solving the forward problem (Eq. 1) p+1 times (p is number of parameters to be estimated). On the other hand, the "two-sided" finite difference method estimates the partial derivatives of the state variable with respect to model parameters by solving the forward problem (Eq. 1) 2p+1 times. At the early stages of the optimization, where the search is far from the solution, the "one-sided" finite difference scheme, which is computationally cheap but not as accurate as the "two-sided" approach, was used. As the optimization proceeds in the descent direction, the algorithm switches to a more accurate but computationally expensive approach in which the partial derivatives of the state variable with respect to the model parameters are calculated using a two-sided finite difference scheme. The switch was made when ϕ(p) ≤ 1 × 10-2. A detailed description of the procedure to update the Jacobian matrix is presented in [1,30].

In order to update λ at each iteration, the optimization starts with an initial parameter vector and a large λ(λ = 1). As long as the objective function decreases in each iteration, the value of λ is reduced. Otherwise, it is increased. The approach avoids calculation of λ and step length in each iteration and is therefore computationally cheap. A detailed description of the code for updating λ is given in [31].

Finally, the following stopping rule was used to end the search [1,31,32]:

if (φ(p)|p=p^δ1&Δφ(p)φ(p)δ2&φ(p)δ3)

    Stop

else

    Continue Optimization Loop

end

where φ(p)|p=p^ is the gradient of the penalty function at solution, Δφ(p)φ(p) is relative changes in the magnitude of the parameters in two consecutive iterations, and δ1, δ2, and δ3 are user defined small values.

The accuracy of the optimization was assessed by goodness-of-fit analysis. The Root Mean Squared Error (RMSE) and Coefficient of Determination (R2) were calculated for every set of optimized parameters [44,45]:

RMSE = (rTr/(N - p))1/2 (14)
R2=[I^iIiI^iI^i]2[I^2(I^i)2][I2(Ii)2] (15)

where Ii and I^i are the observed and predicted total normalized average fluorescence intensity, F + C, inside bleached area during time course of FRAP experiment, respectively and N is the number of observations on FRAP time series.

Experimental Studies

A FRAP experiment was conducted on the mouse adenocarcinoma cell line 3617 at the Laboratory of Receptor Biology and Gene Expression, National Cancer Institute-National Institutes of Health, Bethesda, MD (McNally, personal communication). This data set consists of 43 fluorescent recovery values gathered in the course of a 20-second FRAP experiment and post processed to reduce noise. The developed inverse modeling strategy was then used to quantify mass transport and binding rate parameters of GFP-tagged glucocorticoid receptor.

Results and Discussion

Model validation

Before being incorporated into the framework of the developed inverse modeling strategy, the numerical model was first validated against the exact solution of Sadegh Zadeh and Montas (unpublished) and the semi-analytic solution of [25]. The results are depicted in Figures 1a, 1b, and 1c (for the exact solution of Sadegh Zadeh and Montas) and in Figures 1d (for the solution of [25]). To validate our model with the solution of [25], the average of the normalized fluorescence intensity within the bleach spot during the time course of the FRAP experiment, I(t), was calculated by [1,30]:

Figure 1.

Figure 1

Spatial and temporal distributions of GFP-GR inside bleach spot after photo-chemical bleaching during time course of a FRAP experiment. The Figure shows comparison of the analytic solution (solid lines) and numerical model (dots) at times of 0, 0.01, 0.02, 0.05, 0.1, 0.2, 0.5, 1, and 2 seconds for free GFP-GR (a), bound complex (b), and total GFP-GR (c). Validation of the numerical model (dots) with the analytic solution (solid lines) of [25] is depicted in (d). The graph presents average normalized fluorescent intensity, obtained by equation (16), inside the bleach spot.

I(t)=2w20wρ[F(ρ,t)+C(ρ,t)]dρ (16)

While Figure 1a shows spatial and temporal distributions of free GFP-GR, F(ρ, Df, Ka, Kd, t), inside and outside bleach spot after photochemical bleaching during time course of a FRAP experiment, Figure 1b presents distributions of bound protein, C(ρ, Df, Ka, Kd, t), at times of 0, 0.01, 0.02, 0.05, 0.1, 0.2, 0.5, 1, and 2 seconds. Both Figures indicates that there is excellent agreement between the analytic solution and numerical simulator. Spatial and temporal distributions of total fluorescently tagged GFP-GR, [F + C](ρ, Df, Ka, Kd, t), inside and outside the bleached area are given in Figure 1c, which presents excellent agreement between the analytic and numerical solutions. The same time range was used to perform the comparison. As Figure 1d indicates, there is excellent agreement between the two solutions in simulating the average normalized fluorescence intensity within the bleach spot during time course of the FRAP experiment.

Model calibration

The developed methodology was then used to estimate diffusion coefficient and binding rate parameters of GFP-tagged glucocorticoid receptor (GFP-GR). The results are given in Table 1 and Figure 2 (the experimental FRAP time series data are from McNally, personal communication). The Root Mean Squared Error (RMSE) and Coefficient of Determination (R2) were calculated, using equations (14) and (15), for every set of optimized parameters and presented in the last two columns of Table 1. The values for diffusion coefficient, binding rate parameters, and corresponding indices estimated by [25] are given as the first run in Table 1 and Figure 2 for sake of comparison. Table 1 and Figure 2 indicate that many combinations of the three parameters can essentially produce the same error level (RMSE) and yields equally excellent fits. The values obtained by [25] represent only one of the possible solutions. In other words, the inverse problem is not well-posed and has no unique solution. Therefore, one may conclude that the Fluorescence Recovery after Photobleaching technique, though useful in studying the dynamics of biological systems, provide insufficient information to uniquely estimate free diffusion coefficient and binding rate parameters of biomolecule simultaneously.

Table 1.

The results of parameter estimation for GFP-GR using an experimental time series obtained by the Fluorescence Recovery after Photobleaching (FRAP) technique.

run Df (μ m2s-1) Ka (s-1) Kd (s-1) F eq C eq tb (ms) td (ms) RMSE R2
1* 9.20 500 86.4 0.1473 0.8527 11.60 2 0.0255 0.9886
2 1.9049 1.0549 3.7657 0.7812 0.2188 265 0.948 0.0259 0.9897
3 1.2319 0.1113 12.0951 0.9909 0.0091 83 8985 0.0245 0.9903
4 3.4980 10.2501 8.3917 0.4502 0.5498 119 97.6 0.0275 0.9882
5 22.8472 369.7719 24.1606 0.0613 0.9387 41 2.7 0.0251 0.9898
6 81.9332 4785 75.4057 0.0155 0.9845 13 0.2 0.0233 0.9912
7 1.2160 0.7076 5057 0.9999 0.0001 0.2 1413 0.0245 0.9903
8 1.8034 0.5172 2.4778 0.8273 0.1727 403 1933 0.0259 0.9896
9 4.6451 35.3378 15.8084 0.3091 0.6909 63 28.3 0.0257 0.9895
10 4.4471 1.4531 2.0004 0.5792 0.4208 500 688 0.0321 0.9855
11 1.2014 0.00003 20.3473 1.0000 0.0000 49 3.3 × 107 0.0246 0.9901
12 20.3662 928 62 0.0626 0.9374 16 1078 0.0233 0.9912
13 7.4662 101 23.3243 0.1876 0.8124 43 9901 0.0247 0.9904
14 1.2210 7.5124 1423 0.9947 0.0053 0.7 1333113 0.0245 0.9903
15 19.5330 160 13.9388 0.0801 0.9199 72 6250 0.0290 0.9865
16 1.2350 32.2818 1590 0.9801 0.0199 0.6 30977 0.0245 0.9902
17 11.0675 300.4231 40.4005 0.1185 0.8815 25 3329 0.0235 0.9910
18 9.1683 1524 237.1330 0.1346 0.8654 4 656 0.0239 0.9908
19 8.3273 145.5289 28.1600 0.1621 0.8379 35 6871 0.0241 0.9906
20 1.2275 23.0060 3478 0.9934 0.0066 0.3 43467 00245 0.9908
21 4.9839 653.6816 214.1402 0.2468 0.7532 5 1530 0.0239 0.9908
22 94.2711 1857.1133 27.6413 0.0147 0.9853 36 538.5 00247 0.9901
23 1.2584 21.1318 590 0.9654 0.0346 2 47322 0.0245 0.9903
24 4.5057 63.7910 26.8675 0.2964 0.7036 37 15676 0.0239 0.9908

* The parameter values in the first row were obtained by Sprague et al. (2004). The experimental FRAP data are from J. McNally (personal communication).

Figure 2.

Figure 2

Predicted and experimental normalized average fluorescent intensity recovery curves for GFP-GR (dots: Observed, solid lines: Simulated) obtained by equation (16). The experimental data are from J. McNally (personal communication).

To illustrate the non-uniqueness of the inversion, we plotted the optimized parameter values in three-dimensional parameter hyper-space (Fig. 3). The plot visibly indicates that the different combinations of model parameters can lead to same penalty (objective) function or error levels. The plot also suggests a potential linear form of correlation between bio-macromolecular diffusion coefficient and pseudo-association rate parameter. The first set of optimized parameters, obtained by [25], shows that 86% of the GFP-GR is bound to DNA, nuclear matrix or unknown binding sites and only 14% is free. Our analysis, however, suggests that using FRAP, one cannot conclude how much of the bio-macromolecule under study is free and how much is bound. As Table 1 shows, the concentration of free GFP-GR ranges from zero to 100 per cent. The same is true for the concentration of the bound complex. For example, referring to the results obtained in run 11, one may conclude that 100 per cent of the GFP-GR is free, while the results of run 6 show that all of it is bound. Note that both parameter sets produce excellent fits with the same RMSE and coefficient of determination (see Figure 2).

Figure 3.

Figure 3

Three-dimensional parameter hyper-space.

Analysis of Table 1 indicates that the average binding time per vacant site, calculated by tb = 1/Kd [25], ranges between 0.3 and 500 mili-seconds. Similarly, the average time for diffusion of GFP-GR from one binding site to the next, obtained by td = 1/Ka[46], varies over several orders of magnitude. The broad range of average binding time and average diffusion time for GFP-GR indicates that it is doubtful that one can infer an average time for macro-molecule diffusion and binding inside living cells from FRAP results alone.

This analysis explains the conflicting parameter values that may have been reported in literature for biomolecules using the FRAP protocol. Based on our findings, using experimental FRAP time series and coupling it with curve-fitting procedures can lead to misleading conclusions regarding binding reaction, slow or rapid mobility of biomolecules, and concentrations of free macromolecule, vacant binding sites and bound complex inside living cells and tissues.

In this study, the choice of a numerical approach rather than an analytic solution and a finite element approach rather than the finite difference scheme was made so that the parameter estimation could be readily extended to arbitrary initial and boundary conditions, complex domain geometry, and especially so that it could be extended to a ternary system of coupled nonlinear partial differential equations governing transport of free bio-macromolecule, bound complex, and vacant binding site where all three transport entities are moving species and it is impossible to obtain an exact solution for the systems of equations.

So far most of the FRAP studies have assumed an infinite domain to specify boundary conditions and to solve the system of partial differential equations governing biomolecule transport in-vivo [[13,25,29,30] among many others]. This assumption is unrealistic in the context of living cells. In this study, we address and improve this shortcoming by specifying a finite domain and by formulating Neumann boundary conditions on the cell membrane.

Residual analysis

The inverse methodology used in this study is based upon the following assumptions: 1) residuals have a mean of zero, 2) residuals have constant variance, 3) residuals are uncorrelated, and 4) residuals are normally distributed. When these assumptions are met, the parameter optimization estimates poses optimal statistical properties [40-42]. When these conditions are not met the parameter optimization method may no longer produce optimal parameter estimates. Residuals, or errors in parameter optimization, are defined as the difference between the observed and simulated state variable(s). An analysis of the residuals is a useful and key technique to study possible trends, oscillations, and correlation of errors. It is also important in validating the assumptions on which the inverse modeling strategy rests.

To analyze the residuals, they were plotted against average normalized fluorescence intensity, I¯(t), within the bleach spot during the time course of the FRAP experiment. Since the residuals are time and/or space series, their possible correlation was thoroughly analyzed. Different statistical measures such as error frequency analysis, normal probability plot, and hypotheses tests were explored to make decision about the residuals. The Student's t-test was used to test if the residuals have a mean of zero. Bartlett's test [47] was applied to determine if the residuals have constant variance. To test the normality of the residuals the Chi-square and Kolmogorov-Smirnov one sample tests were employed. Finally, the t-statistic [48] was used to test if the residuals are correlated.

The basic assumption in the hypothesis test on the residuals' mean is that the data come from a normally distributed population with unknown variance. In this study, the following null and alternative hypotheses were formulated:

H0:μ=μ0HA:μμ0 (17)

To perform the test the following critical t-statistic (t) was used:

t=x¯μ0s/N (18)

in which x¯, s, and N are the mean, standard deviation, and size of the sample (errors), respectively. μ0 is the mean of the population which is zero.

For -tα/2 <t <tα/2 the null hypothesis (mean is zero) cannot be rejected at the significance level of α. The rejection regions t < -tα/2 or tα/2 <t indicate that the null hypothesis can be rejected at the level of significance α.

The mean and standard deviation of the residuals were -0.0029 and 0.0234 with sample size n = 43. The t-statistic was calculated as:

t=0.002900.0234/43=0.8127

For 42 degrees of freedom, the tabled t-values for different levels of significance are given in Table 2. The calculated t-statistic was then compared with the tabled t-values at different levels of significance and the results summarized in Table 2. As the Table indicates the null hypothesis (mean of the residuals is zero) can not be rejected even at 20 per cent level of significance. The possibility of committing error type one is extremely slim.

Table 2.

The results of hypothesis test on the residuals' mean in FRAP model.

α 0.01 0.05 0.1 0.2
t-value 2.8120 2.0175 1.6820 1.3020
Decision Accept H0 Accept H0 Accept H0 Accept H0

The Bartlett's test statistic was used to verify for equality of variances across sub-groups of a sample against the alternative that variances are not constant. Equal variance across samples is called homogeneity of variances and is usually used in several statistical tests such as analysis of variance and nonlinear optimization which assumes that the errors have constant variance [47]:

T=(Nk)lnsp2i=1k(Ni1)lnsi21+13(k1)[i=1k1Ni11Nk] (19)

where si2 is the variance of the subgroup, Ni is the sample size of the subgroup, k is the number of subgroups, and sp2 is the pooled variance. This variance is a weighted average of the variances:

sp2=i=1k(Ni1)lnsi2/(Nk) (20)

The rejection region is those values of T > χ(α,k1)2 in which χ(α,k1)2 is the upper critical value of the chi-square distribution with k - 1 degree of freedom at the level of significance α.

The following null and alternative hypotheses were formulated:

H0:σ12=σ22HA:σ12σ22 (21)

To verify if the residuals have constant variance they were divided into different sections. One of the possible solutions in Table 1 was chosen and the residual plot versus laser beam recovery (Fig. 3) was divided into three regions. The variance in each region was calculated and compared with each other using the Bartlett test. The residuals were divided into three groups as:

S12

= S2(r(1:5)) = 9.2651 × 10-4

S22

= S2(r(6:24)) = 9.3655 × 10-4

S32

= S2(r(25:33)) = 9.3126 × 10-5

The pooled weighted variance was found to be Sp2 = 7.1030 × 10-4. The Bartlett's statistic was calculated as T = 9.5454 which is less than the upper critical value of the χ2 for two degrees of freedom (k = 3) at one per cent level of significance (χ(0.01,2)2 = 10.60). It is, however, more than the tabled value for five per cent level of significance. At one percent level of significance the null hypothesis (the residuals have constant variance) cannot be rejected. Based on this test and analysis of residual plot versus laser beam recovery, it is concluded that the residuals have equal variance.

The following null and alternative hypotheses were used to test possible correlation among the residuals:

H0:ρ=0HA:ρ0 (22)

where ρ is the correlation coefficient in the population. For n > 2 these hypotheses can be tested using the following t-statistic [48]:

t=rs1rs2N2 (23)

in which rs is the correlation coefficient in the sample.

The null hypothesis (correlation coefficient is zero) is rejected when the absolute value of the t-statistic is greater than the critical t-value (t < -tα/2 or tα/2 <t) at the level of significance α.

The residuals were first divided into two sub-groups:

r1 = r(1 : end - 1)
r2 = r(2 : end)

The correlation coefficient (rs = 0.1931) was then calculated and was used to obtain the critical t-statistics (with sample size n = 42):

t=0.193110.0373422=1.2447

These critical t-statistics were then compared with the tabled t-values at different levels of significances and the results presented in Table 3. As the Table indicates the null hypothesis (residuals are uncorrelated) can not be rejected even at 20 per cent level of significance. The possibility of having error type one is almost zero and, therefore, we didn't perform autocorrelation/serial correlation analysis.

Table 3.

The results of hypothesis test on the correlation of residuals in FRAP model.

α 0.01 0.05 0.1 0.2
t-value 2.7040 2.0210 1.6820 1.3030
Decision Accept H0 Accept H0 Accept H0 Accept H0

One of the assumptions of the least squares theory, which was applied in this study, is the normality of the residuals. In other words, it is assumed that the errors are normally distributed. To analyze the normality of the errors, two qualitative and two quantitative methods were used: 1) Error frequency analysis and normal probability plots, and 2) Hypothesis tests on the normality of the residuals using the chi-square goodness of fit test, which is based on the differences between the observed (oi) and expected (ei) error frequencies will be used [49]:

χ2=i=1k(oiei)2ei (24)

where k is the number of intervals or cells.

Error frequency analysis was first performed by constructing residuals' histogram. The histogram is presented in Figure 4. The Figure visibly shows that the errors are normally distributed. This was confirmed by the analysis of the normal probability plot (Fig. 5) and the chi-square hypothesis test on the normality of the random variable.

Figure 4.

Figure 4

Residuals versus normalized average fluorescent intensity in FRAP experiment using one-site-mobile-immobile model.

Figure 5.

Figure 5

Histograms of residuals for normalized average fluorescent intensity using one-site-mobile-immobile model.

Residual frequency analysis and normal probability plots, though useful in figuring out the underlying probability distribution function, are only qualitative means to study possible normal distribution of random variable. To verify normality of the errors, the chi-square test was used. To perform the test, first the residuals were grouped into different cells (histograms). The number of residuals in every cell were counted which is ei. Then using the upper limit of the cells, the mean, standard deviation, and the cumulative normal distribution, the expected frequencies were calculated. The cells were merged when the observed error frequencies were less than 5. Then the χ2 index was calculated and compared with χ(1α,v)2[50]. This information was used in the hypothesis test. The null and alternative hypotheses were formulated as:

H0:r=N(μ,σ)HA:rN(μ,σ) (25)

or;

H0 : r = N(0.0012,0.0224)
HA : r N(0.0012,0.0224)

where μ and σ are the mean and the standard deviation of the residuals in population.

Since the calculated χ2 (3.6190) is less than the tabled value (χ(0.80,2)2 = 4.61), the null hypothesis (the residuals are normally distributed) cannot be rejected even at 20 per cent level of significance implying that the residuals are normally distributed.

The chi-square test is a powerful test when the sample size is large. However, combining cells when the expected error frequencies are less than five loses information and hence decreases the power of the test. Furthermore, for very small samples this test is not applicable [49]. To overcome these limitations, the Kolmogorov-Smirnov one sample test is usually used since it treats each observation separately and does not lose information through merging of categories. This test is more powerful than the chi-square test when sample size is not large.

The Kolmogorov-Smirnov one sample test was used to verify if the residuals are normally distributed. Results (not shown here) indicate that the null hypothesis (the errors are normally distributed) cannot be rejected even at 20 per cent level of significance implying that the residuals are strongly normally distributed.

In conclusion, detailed residual analysis indicates that: 1) residuals have zero mean, 2) residuals have constant variance, 3) residuals are normally distributed, and 4) residuals are uncorrelated. Therefore, the necessary and sufficient criteria for least square parameter optimization, which was used in this study, were met.

Conclusion

A Galerkin-based finite element model was developed and applied to solve a system of two coupled partial differential equations governing GFP-GR transport and reaction in living cells. A finite domain was used to formulate boundary conditions on the cell membrane. The simulator was coupled with the Levenberg-Marquardt algorithm and a FRAP time series to estimate protein mass transport and reaction rate parameters. The developed strategy presents excellent agreement with the experimental data and the developed strategy is an efficient approach to extract as much physiochemical information from the FRAP protocol as possible. Uniqueness analysis of the inverse problem indicates that the FRAP protocol provides insufficient information for unique quantification of diffusion coefficient and binding rate parameters.

Competing interests

The author(s) declare that they have no competing interests.

Figure 6.

Figure 6

Normal probability plot for FRAP experiment using one-site-mobile-immobile model.

Contributor Information

Kouroush Sadegh Zadeh, Email: kouroush@cs.umd.edu.

Howard C Elman, Email: elman@cs.umd.edu.

Hubert J Montas, Email: montas@umd.edu.

Adel Shirmohammadi, Email: ashirmo@umd.edu.

Acknowledgements

This study was supported in part by the U.S. Department of Energy under Grant No. DOEG0204ER25619. The authors would like to thank Dr. J. McNally head of the Receptor Biology and Gene Expression Laboratory at the National Cancer Institute, Bethesda, MD for providing the experimental data.

References

  1. Sadegh Zadeh K. PhD thesis. University of Maryland, Fischell Department of Bioengineering; 2006. Multi-scale inverse modeling in biological mass transport processes. [Google Scholar]
  2. Truskey GA, Yuan F, Katz DF. Transport Phenomena in Biological Systems. Prentice Hall, NJ. 2004.
  3. Yahara I, Edelman GM. Restriction of the mobility of lymphocyte immunoglobulin receptors by concanavalin A. Proc Natl Acad Sci USA. 1972;69(3):608–612. doi: 10.1073/pnas.69.3.608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Edidin M. Rotational and translational diffusion in membranes. Ann Rev Biophys Bioeng. 1974;3:179–201. doi: 10.1146/annurev.bb.03.060174.001143. [DOI] [PubMed] [Google Scholar]
  5. Poo MM, Cone RA. Lateral diffusion of rhodopsin in the photoreceptor membrane. Nature. 1974;247:438–441. doi: 10.1038/247438a0. [DOI] [PubMed] [Google Scholar]
  6. Liebman PA, Entine G. Lateral diffusion of visual pigment in photoreceptor disk membrane. Science. 1974;185:457–459. doi: 10.1126/science.185.4149.457. [DOI] [PubMed] [Google Scholar]
  7. Edelman GM. In: Cold Spring Harbor Conferences on Cell Proliferation. Clarkson B, Baserga R, editor. Vol. 1. 1974. Surface alterations and mitogenesis in lymphocytes. In Control of Proliferation in Animal Cells; pp. 357–379. [Google Scholar]
  8. Bretscher MS, Rafe MC. Mammalian plasma membrane. Nature. 1975;258:43. doi: 10.1038/258043a0. [DOI] [PubMed] [Google Scholar]
  9. Ediden M, Zagyansky Y, Lardner TJ. Measurement of membrane protein lateral diffusion in single cells. Science. 1976;191:466–468. doi: 10.1126/science.1246629. [DOI] [PubMed] [Google Scholar]
  10. Axelrod D, Koppel DE, Schlessinger J, Elson E, Webb WW. Mobility measurement by fluorescence photobleaching recovery kinetics. Biophys J. 1976;16:1055–1069. doi: 10.1016/S0006-3495(76)85755-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Schlessinger J, Kopple DE, Axelrod D, Jacobson K, Webb WW, Elson E. Lateral transport on cell membranes: mobility of cancanavalin A receptors on myoblasts. Proc Natl Acad Sci USA. 1976;73:2409–2413. doi: 10.1073/pnas.73.7.2409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Thompson NL, Burghardt TP, Axelrod D. Measuring surface dynamics of macromolecules by total internal reflection fluorescence with photobleaching recovery or correlation spectroscopy. Biophys J. 1981;33:435–454. doi: 10.1016/S0006-3495(81)84905-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Kaufman E, Jain RK. Quantification of transport and binding parameters using fluorescence recovery after photobleaching. Potential for in vivo applications. Biophys J. 1990;58:873–885. doi: 10.1016/S0006-3495(90)82432-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Tsay TT, Jacobsen KA. Spatial Fourier analysis of video photobleaching measurements. Principles and optimization. Biophys J. 1991;60:360–368. doi: 10.1016/S0006-3495(91)82061-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Tardy Y, McGrath JL, Hartwig JH, Dewey CF. Interpreting photoactivated fluorescence microscopy measurements of steady-state actin dynamics. Biophys J. 1995;69:1674–1682. doi: 10.1016/S0006-3495(95)80085-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Berk DA, Yuan F, Leunig M, Jain RK. Direct in vivo measurement of targeted binding in human tumor xenograft. Proc Natl Acad Sci USA. 1997;94(5):1785–1790. doi: 10.1073/pnas.94.5.1785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Meyvis TK, De Smedt SC, Van Oostveldt P, Demeester J. Fluorescence recovery after photobleaching: a versatile tool for mobility and interaction measurements in pharmaceutical research. Pharm Res. 1999;16:1153–1162. doi: 10.1023/A:1011924909138. [DOI] [PubMed] [Google Scholar]
  18. McNally JG, Muller WG, Walker D, Wolford R, Hager GL. The glucocorticoid receptor: rapid exchange with regulatory sites in living cells. Science. 2000;287:1262–1265. doi: 10.1126/science.287.5456.1262. [DOI] [PubMed] [Google Scholar]
  19. Phair RD, Misteli T. High mobility of proteins in the mammaliancell nucleus. Nature. 2000;404:604–609. doi: 10.1038/35007077. [DOI] [PubMed] [Google Scholar]
  20. Bulinski JC, Odde DJ, Howell BJ, Salmon ED, Waterman-Storer CM. Rapid dynamics of the microtubule binding of ensconsin in vivo. J Cell Sci. 2001;114:3885–3897. doi: 10.1242/jcs.114.21.3885. [DOI] [PubMed] [Google Scholar]
  21. Dunder M, Hoffmann-Rohrer U, Hu Q, Grummt I, Rothblum LI, Phair RD, Misteli T. A kinetic framework for a mammalian RNA polymerase in vivo. Science. 2002;298:1623–1626. doi: 10.1126/science.1076164. [DOI] [PubMed] [Google Scholar]
  22. Kimura H, Sugaya K, Cook PR. The transcription cycle of RNA polymerase II in living cells. J Cell Biol. 2002;159:777–782. doi: 10.1083/jcb.200206019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Coscoy S, Waharte F, Gautreau A, Martin M, Louvard D, Mangeat P, Arpin M, Amblard F. Molecular analysis of microscopic ezrin dynamics by two photon FRAP. Proc Natl Acad Sci USA. 2002;99:12813–12818. doi: 10.1073/pnas.192084599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Carrero G, McDonald D, Crawford E, de Vries G, Hendzel MJ. Using FRAP and mathematical modeling to determine the in vivo kinetics of nuclear proteins. Methods. 2003;29:14–28. doi: 10.1016/S1046-2023(02)00288-8. [DOI] [PubMed] [Google Scholar]
  25. Sprague B, Pego RL, Stavreva DA, McNally JG. Analysis of binding reactions by fluorescence recovery after photobleaching. Biophys J. 2004;86:3473–3495. doi: 10.1529/biophysj.103.026765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lele T, Oh P, Nickerson JA, Ingber DE. An improved mathematicalapproach for determination of molecular kinetics in living cells with FRAP. Mech Chem Biosyst. 2004;1(3):181–190. [PubMed] [Google Scholar]
  27. Farla P, Hersmus R, Geverts B, Mari PO, Nigg AL, Dubbink HJ, Trapman J, Houtsmuller AB. The androgen receptor ligand-binding stabilizes DNA binding in living cells. J Struct Biol. 2004;147:50–61. doi: 10.1016/j.jsb.2004.01.002. [DOI] [PubMed] [Google Scholar]
  28. Carrero G, Crawford E, Hendzel MJ, deVries G. Characterizing fluorescence recovery curves for nuclear proteins undergoing binding events. Bull Math Biol. 2004;66:1515–1545. doi: 10.1016/j.bulm.2004.02.005. [DOI] [PubMed] [Google Scholar]
  29. Beaudouin J, Mora-Bermudez F, Klee T, Daigle N, Ellenberg J. Dissecting the contribution of diffusion and interaction to the mobility of nuclear proteins. Biophys J. 2006;90:1878–1894. doi: 10.1529/biophysj.105.071241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Sadegh Zadeh K, Montas HJ, Shirmohammadi A. Identification of biomolecule mass transport and binding rate parameters in living cells by inverse modeling. Theor Biol Med Model. 2006;3:36. doi: 10.1186/1742-4682-3-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Sadegh Zadeh K, Montas HJ, Shirmohammadi A, Elman HC. Quantification of hydraulic parameters in fluid flow through heterogeneous variably saturated porous media with realistic initial condition. J Comput Appl Math. 2006. in review.
  32. Sadegh Zadeh K, Montas HJ, Shirmohammadi A. Multi-objective optimization in variably saturated fluid flow. J Comput Appl Math. 2006. (in review)
  33. Osborne MR. Nonlinear least squares-the Levenberg-Marquardt revisited. J Aus Math Soc. 1976;Ser. B19:343–357. [Google Scholar]
  34. Moré JJ. In: Numerical analysis, Lecture Notes in Mathematics 630, Springer Verlag. Watson GA, editor. 1977. The Levenberg-Marquardt algorithm: Implementation and theory; pp. 105–116. [Google Scholar]
  35. Levenberg K. A method for the solution of a certain non-linear problems in least squares. Q Appl Math. 1944;2:164–168. [Google Scholar]
  36. Marquardt DW. An algorithm for least squares estimation of nonlinear parameters. SIAM J Appl Math. 1963;11:431–441. doi: 10.1137/0111030. [DOI] [Google Scholar]
  37. Zienkiewicz OC. Finite Element Methods in Structural and Continuum Mechanics. London, McGraw-Hill. 1967.
  38. Pinder GF, Gray WG. Finite Element Simulation in Surface and Subsurface Hydrology. Academic Press, New York; 1977. [Google Scholar]
  39. Huyakorn PS, Pinder GF. Computational Methods in Subsurface Flow. San Diego. Academic Press; 1983. [Google Scholar]
  40. Bard Y. Nonlinear Parameter Estimation. Academic Press, New York; 1974. [Google Scholar]
  41. Seber GAF, Wild CJ. Nonlinear Regression. John Wiley and sons. New York; 2004. [Google Scholar]
  42. Beck JV, Arnold KJ. Parameter Estimation in Science and Engineering. John Wiley, New York; 1977. [Google Scholar]
  43. Golub GH, van Loan CF. Matrix Computations. Johns Hopkins University Press. Baltimore; 1983. [Google Scholar]
  44. Daniel C, Wood FS. Fitting Equations to Data. New York: Wiley-Interscience; 1971. [Google Scholar]
  45. Sadegh Zadeh K, Shirmohammadi A, Elman HC, Montas HJ, Johnson AT. Proceedings of the 23th Southern Biomedical Engineering Conference. April 13–15. Catholic University of America, Washington, DC; 2007. Quantification of reaction-diffusion parameters for nuclear proteins. [Google Scholar]
  46. Berg OG. Effective diffusion rate through a polymer network: influence of nonspecific binding and intersegment transfer. Biopolymers. 1986;25:811–821. doi: 10.1002/bip.360250506. [DOI] [PubMed] [Google Scholar]
  47. Snedecor GW, Cochran WG. Statistical Methods. Eighth. Iowa State University Press; 1989. [Google Scholar]
  48. Mc Cuen RH. Statistical Methods for Engineers. Prentice Hall. Upper Saddle River. NJ; 1985. [Google Scholar]
  49. Mc Cuen RH. Modeling hydrologic change: statistical methods. Lewis Publishers. 2003.
  50. Abramovitch M, Stegun IA. Handbook of mathematical functions. Fourth Printing. Applied Math. Ser. 55, U. S. Government Printing Office, Washington DC. 1965.

Articles from BioMedical Engineering OnLine are provided here courtesy of BMC

RESOURCES